Desmond Chua

Vancouver, BC, Canada

Data Analyst Analytics Engineer AI Business Analyst
Desmond Chua

I'm a data analyst and analytics engineer based in Vancouver, BC. Originally from Singapore, I came to Canada to build a career doing what I genuinely enjoy — turning messy data into decisions that actually matter. I love sitting with a complex dataset, asking the right questions, and building something — a dashboard, a pipeline, an analysis — that helps someone make a better call. I'm drawn to healthcare, retail, and insurance because the stakes feel real. Interested in working together or having a chat? Feel free to reach out.

Analyzing & Engineering

SQL Python dbt Snowflake Databricks Git / GitHub Statistics Excel

I love getting into the weeds of a messy dataset and making sense of it — whether that's a complex SQL query, a Python script, or a dbt model that finally makes the data trustworthy. Good analysis starts long before the dashboard.

Visualizing

Power BI Tableau Streamlit Looker Studio

I genuinely believe a well-built dashboard can change how a team makes decisions. I enjoy translating numbers into visuals that non-technical stakeholders can actually act on — not just look at and nod.

AI & Business

Prompt Engineering LLM Integration AI / ML Requirements Stakeholder Comms

I'm convinced the analysts who thrive are the ones who ask the right business question before touching any data. I use AI as a multiplier across my workflow — from framing problems to communicating results more clearly.

Featured projects
World Life Expectancy infographic

World Life Expectancy — What Drives a Long Life?

An end-to-end MySQL case study on 15 years of country-level health data — 193 countries, 2007–2022. Act 1 cleans the raw 2,941-row import with ROW_NUMBER() de-duplication, self-join back-fills, and honest NULL handling; Act 2 runs the analysis that surfaces a 5-year global rise in life expectancy against a stubborn 12-year development gap and a 9-year wealth gap. Built to production shape — a staging → clean → serve (Bronze → Silver → Gold) flow that maps directly onto the Azure Data Factory → Microsoft Fabric → Power BI stack.

MySQL Window Functions Data Cleaning Medallion Architecture EDA

Maven Movies SQL + Power BI

End-to-end SQL due diligence on a DVD rental business. Complex queries → Power BI executive dashboard with DAX measures and drill-through reporting.

MySQL Power BI DAX
Maven Movies screenshot
CedarCare screenshot

CedarCare — BC Health Platform

AI-powered personal health management platform with a symptom assessment chatbot, facility finder, and analytics dashboard for the BC healthcare ecosystem.

Python Streamlit OpenAI API SQL

LAX Flights Excel Analysis

Data analysis and forecasting on 4,985 LAX flights. COUNTIFS, nested IF logic, assumption-driven forecast model, and airline performance dashboard.

Excel Power Query Forecasting
LAX Flights screenshot
Healthcare NLP screenshot

Healthcare NLP Analysis

NLP pipeline on 100 r/healthIT posts — LSA extractive + GPT-4o-mini abstractive summarization, importance scoring, and sentiment distribution analysis.

Python OpenAI API NLTK BeautifulSoup

Stock Price Analysis + Streamlit

S&P 500 — CSV vs Parquet benchmarking, Pandas vs Polars, ML price prediction (R²=0.9997) with a live interactive Streamlit dashboard.

Python Streamlit scikit-learn Plotly
Stock Price screenshot
Automation
Claims lifecycle automation diagram

Automating Claimant Communications

A Power Automate build that replaces manual claimant emailing for an insurance claims team. Seven cloud flows read an Excel claims table, route the right message for each of six statuses, attach settlement letters, and write status back to the source so no claimant is contacted twice. The case study works each problem two ways — loop-and-condition vs. filter-at-source, Switch vs. per-status queries — to show the production trade-offs, and surfaces the silent 256-row pagination limit that breaks naïve flows at scale.

Power Automate Excel Online Office 365 Outlook OneDrive OData
Upcoming projects
RetailAirbyte + dbtPower BI

RetailIQ — Customer Analytics Platform

End-to-end retail analytics pipeline. ELT with Airbyte → dbt data models → Power BI customer segmentation and sales performance dashboard.

Airbyte dbt Power BI SQL
In Progress

FinSight — A/B Testing & Churn Intelligence

Statistical A/B testing framework and churn prediction model for a fintech use case. Scikit-learn + Streamlit + Power BI reporting layer.

Python scikit-learn Streamlit Power BI
In Progress
Banking / FintechA/B TestingML
InsuranceBigQuerydbt

InsurFlow — Modern Insurance Data Pipeline

Cloud-native insurance analytics pipeline. Airbyte → BigQuery → dbt models → Looker Studio covering loss ratio, combined ratio, and claims KPIs.

Airbyte BigQuery dbt Looker Studio
In Progress

PolarMart Canada — Snowflake BA Project

Business analyst capstone on a Canadian retail dataset. Snowflake data warehouse → business requirements → KPI framework → executive reporting.

Snowflake SQL Power BI BA Framework
Planned
SnowflakeRetail BACanada
Practice
SQL ChallengesCase StudiesInterview Prep

SQL & Analytics Practice

Ongoing practice log — SQL challenges, business case studies, and interview prep across multiple platforms. Updated regularly as I work through problems.

SQL Python Case Studies
Certifications

Earned credentials and certifications, with more in progress.

Microsoft Certified: Azure Data Fundamentals (DP-900)

Microsoft

Earned