Business Intelligence · Consulting Portfolio

Airline Passenger Experience Intelligence

A multi-tool BI case study analyzing consumer complaints, baggage mishandling, denied boardings, and on-time performance across 9 major U.S. carriers from 2022–2024 — using Power BI, Tableau, and Python to surface what the numbers actually mean for passengers and regulators.

Key Decision Required

Airlines exceed industry complaint threshold

▲ Warranting regulatory review

Detected Anomalies

Statistically significant spikes (≥2σ)

▲ 2 classified critical (≥4σ)

Data Coverage

3 Yrs

108 monthly observations, 9 carriers

↓ DOT ATCR · BTS Transtats

Best Performer 2024

Delta

Score: 78.8 — Despite CrowdStrike outage

▼ Complaints fell 10.7% YoY

Tool 1 — Power BI

Interactive Dashboard — Composite Passenger Experience Score

Decision-centric layout built around a single question: Which airlines warrant regulatory attention this quarter? Features a composite score model, AI-generated narrative briefing, and a What-If scenario slider.

Composite KPI Score Smart Narratives (AI Briefing) Key Influencers (XAI) What-If Scenario Slider Q&A Natural Language Query Semantic Color System Mobile-Responsive Layout

Airline Passenger Experience Power BI Dashboard

📊 Power BI Desktop — Interactive version available upon request POWER BI

🧮

Composite Passenger Experience Score

Blends on-time performance (35%), complaints (30%), baggage rate (25%), and denied boardings (10%) into a single 0–100 index.

Risk Scoring

🤖

AI Executive Briefing

Power BI Smart Narratives auto-generates a plain-language summary of the most significant findings each time the data or filters change.

AI-Generated

🔍

Key Influencers (Explainable AI)

Power BI's XAI visual reveals which factors statistically drive complaint rates — showing analysts exactly why scores change, not just that they did.

Explainable AI

🎚️

What-If Scenario Modeling

Interactive slider models how a carrier's composite score changes if complaint rates improve or deteriorate by up to ±50% — driving forward-looking decisions.

Scenario Modeling

💬

Natural Language Query

Embedded Q&A visual lets non-technical stakeholders ask questions like "Which airline has the highest baggage rate?" and receive immediate visual answers.

NLQ Interface

🎨

Semantic Color System

Color encodes velocity of change — not just good vs. bad. A carrier improving 6 points year-over-year shows dark green, while a 1-point improvement shows light green.

Cognitive Load

Tool 2 — Tableau Public

Benchmarking Visualizations — Industry Comparison & Score Trends

Published to Tableau Public for live, embeddable interaction. A benchmarking bar chart compares each airline's passenger score against the industry average. A trend line chart tracks score movement across 2022–2024, making year-over-year shifts immediately readable.

Benchmarking Bar Chart Industry Average Reference Line Trend Line Chart (2022–2024) Per-Airline Color Encoding Multi-Sheet Workbook Interactive Tableau Public Embed

Benchmarking Chart Logic

The benchmarking bar chart compares each airline's composite passenger score against the industry average reference line. Color-coded bars make above- and below-average performers immediately visible without requiring the viewer to read a single number.

Key Insight: Hawaiian and Delta consistently outperform the industry average, while Spirit and Frontier fall significantly below — a gap that single-metric reporting often obscures.

Trend Lines — Who Moved?

The trend line chart tracks each airline's passenger score across 2022, 2023, and 2024. Rising and falling lines immediately reveal which carriers improved or declined over the three-year period.

Notable movement: Southwest's score dropped sharply following its December 2022 operational meltdown, then showed partial recovery in 2023–2024. Frontier showed significant volatility driven by complaint surges in 2023.

Tool 3 — Python · Streamlit · Plotly

Anomaly Detection Engine — Autonomous Data Scout

A live Streamlit application that scans 36 monthly time series for statistically significant events using z-score detection — simulating an agentic BI system that proactively surfaces anomalies rather than waiting for a user to look.

Z-Score Anomaly Detection Adjustable Sensitivity (σ) Real-Time Alert Cards Multi-Carrier Time Series Contextual Event Notes Plotly Interactive Charts

Flagged Events — Auto-Detected at ≥2σ

Airline	Period	Metric	Value	σ Score	Context
Southwest Airlines	Dec 2022	Complaints/100K	18.4	5.3σ 🔴	Holiday meltdown — 16,700 flights cancelled
Delta Air Lines	Jul 2024	Complaints/100K	5.8	4.0σ 🔴	CrowdStrike IT outage — longest recovery of any U.S. carrier
Frontier Airlines	Jul 2023	Complaints/100K	38.4	2.3σ 🟡	Peak of complaint surge during rapid capacity expansion
United Airlines	Jul 2022	Complaints/100K	6.8	2.5σ 🟡	Summer 2022 travel surge — staffing shortfalls post-pandemic
Delta Air Lines	Jul 2024	Baggage Rate	0.82	4.1σ 🟡	CrowdStrike outage disrupted baggage handling systems simultaneously
American Airlines	Jul 2022	Complaints/100K	9.8	2.3σ 🟢	Post-pandemic demand surge; returned to baseline by Sep 2022

Key Findings — Decision-Centric

What the Data Tells Regulators and Carriers

These findings are structured around decisions, not just descriptions — each insight is paired with its implication for regulatory oversight or carrier strategy.

🔴 Regulatory Concern

Frontier Airlines exceeded the industry complaint average by 5–10× in every year studied. Despite a partial recovery in 2024, its complaint rate remains 22.1 per 100,000 passengers — more than 3× the industry average of 6.9.

Implication: Warrants formal compliance review. The 2023 peak (33.0/100K) coincided with rapid capacity expansion without proportional service investment.

🔴 Infrastructure Risk

The CrowdStrike July 2024 outage revealed Delta's outsized operational dependency on a single IT vendor. Despite ranking first in annual on-time performance, Delta's recovery was the slowest among U.S. carriers — a resilience gap masked by aggregate metrics.

Implication: Annual KPIs obscure event-driven fragility. Monthly anomaly detection would have escalated this within days, not after the full-year report.

🟡 Watch — Service Gap

American Airlines improved on-time performance from 76.1% to 77.8% across 2022–2024, yet its baggage rate worsened from 0.81 to 0.90 — the only major carrier to move in the wrong direction on baggage. High operational performance is not translating to passenger experience.

Implication: On-time metrics alone produce a misleading picture of American's service quality trajectory.

🟡 Watch — Recovery Speed

Southwest's December 2022 meltdown generated a 5.3σ complaint spike — the most extreme event in the dataset. But complaint rates returned to near-baseline within 3 months, and 2024 on-time performance reached an all-time observed high (86.9% in Nov 2024).

Implication: Southwest's recovery velocity suggests operational fixes were substantive, not cosmetic — a meaningful signal for carrier credibility assessments.

🟢 Best Practice

Alaska Airlines consistently maintained the lowest complaint rates among the Big 6 carriers (2.1–2.4/100K), despite a slight on-time dip in 2024 from the Hawaiian Airlines integration. Alaska demonstrates that low complaint rates are achievable even during major operational transitions.

Implication: Alaska's service model warrants benchmarking study — its complaint rate is 70% below the industry average while operating a growing network.

🟢 Industry Trend

The industry composite Passenger Experience Score improved from 52.1 (2022) to 56.4 (2024) — a meaningful but uneven recovery from pandemic-era service degradation. However, complaint volumes remain 270% above pre-pandemic baselines, meaning absolute performance is still far from historical norms.

Implication: Trend direction is positive, but regulators should calibrate expectations against 2019 baselines, not 2022 pandemic-era lows.

Methodology & Transparency

How This Dashboard Was Built

Transparency about data sources, methodology, and limitations is a core principle of trustworthy BI. The following notes are surfaced deliberately — not buried in footnotes.

Step 1 — Collect

Data Acquisition

Annual metrics sourced from DOT Air Travel Consumer Reports (ATCR) and BTS Transtats. Monthly complaint and baggage data compiled from 36 monthly ATCR releases covering Jan 2022–Dec 2024.

Step 2 — Score

Composite Score Model

Metrics normalized to 0–100 scale and weighted: On-time (35%), Complaints (30%), Baggage (25%), Denied Boardings (10%). Weights reflect regulatory prioritization in DOT consumer protection framework.

Step 3 — Detect

Anomaly Detection

Z-score method applied independently to each carrier's monthly time series. Events flagged at ≥2σ (notable), ≥2.5σ (warning), or ≥3σ (critical) from the carrier's own baseline.

Step 4 — Validate

Cross-Reference

All flagged anomalies cross-referenced against known industry events (CrowdStrike, Southwest meltdown, pandemic recovery timeline) to distinguish systemic from one-time events.

Data Confidence & Transparency Indicators

DOT changed complaint reporting methodology in June 2023, moving to "consumer submissions" (complaints + inquiries). 2023 H2 data reflects this change and may overstate complaints by ~9% vs. prior methodology.
Baggage rate reporting changed from per-1,000 to per-100 enplaned bags in January 2022. All data in this dashboard uses the post-2022 per-100 methodology for consistency.
Denied boarding (oversales) data is reported quarterly, not monthly — monthly figures in the anomaly detection component use quarterly averages distributed evenly.
Spirit Airlines filed for Chapter 11 bankruptcy (second filing) in August 2025 and is currently restructuring, having reduced flights by ~40%. The carrier plans to emerge as a smaller airline by late spring/summer 2026. 2024 data in this dashboard reflects pre-restructuring operations and should be interpreted accordingly.
On-time data from Cirium 2024 rankings and BTS Transtats. Cirium and BTS use slightly different carrier groupings — network carriers (e.g., "United Airlines Network") may include regional partners.
This analysis uses publicly available data. Figures are sourced from official government publications and should be verified against primary sources for regulatory or legal use.

Tool Stack

📊

Microsoft Power BI

Core interactive dashboard with composite scoring, AI narratives, Key Influencers XAI visual, What-If modeling, NLQ Q&A, and semantic color system.

Enterprise Standard

📈

Tableau Public

Benchmarking visualizations — quadrant scatter with animated Pages shelf (2022→2024 drift) and bump chart for ranking changes. Published for live public access.

Industry Leader

🐍

Python · Streamlit · Plotly

Autonomous anomaly detection engine. Z-score analysis across 180 monthly observations, real-time alert cards, adjustable sensitivity, and contextual event notes.

Emerging Standard

🌐

HTML · CSS

Decision-centric portfolio wrapper with responsive layout, dark-mode design, embedded tools from all three platforms, and accessible data transparency notes.

Full-Stack BI

Technical Orchestration

How This Platform Was Architected

A multi-tool intelligence stack built through an agentic AI workflow — demonstrating the same rapid implementation capability offered to clients.

Layer 1

API Orchestration & Data Pipeline

Orchestrated DOT Air Travel Consumer Reports and BTS Transtats datasets into a normalized analytical pipeline. Deterministic complaint-rate thresholds and composite scoring logic apply the same precision required to govern compliance data across national aviation infrastructure.

Layer 2

Agentic Anomaly Detection

Z-score detection engine architected to autonomously surface statistically significant events across 36 monthly time series — simulating the proactive intelligence layer of an agentic BI system. Adjustable sigma thresholds with deterministic guardrails prevent false-positive escalation.

Layer 3

Multi-Platform BI Stack

Orchestrated across Power BI (composite scoring, AI narratives, XAI), Tableau Public (benchmarking, trend visualization), and Python/Plotly (anomaly detection engine) — demonstrating cross-platform intelligence architecture at production scale.

Layer 4

Cloud-Native Deployment

Production-grade static deployment via Netlify global CDN. Built through an agentic AI workflow — architecture decisions, data modeling, visualization logic, and deployment all orchestrated using Claude as the implementation engine. Engineered for scale, not for demos.

Governance

System Transparency & Audit-Readiness

All data sourced from official DOT/BTS publications with explicit methodology notes surfaced in the dashboard. Anomaly detection thresholds are deterministic and configurable — no black-box scoring. Every flagged event is cross-referenced against known industry events, ensuring explainable outputs rather than opaque alerts.

Deployment Stack

Power BI Desktop — Composite scoring model, Smart Narratives (AI briefing), Key Influencers (XAI), What-If scenario modeling, NLQ Q&A interface
Tableau Public — Benchmarking bar charts, trend line visualization, published for live public access via embedded iframe
Python · Plotly · Netlify — Anomaly detection engine deployed as production-grade static application on Netlify CDN; re-architected from Streamlit to eliminate cold-start delays
HTML · CSS · Netlify CI/CD — Portfolio wrapper deployed via GitHub → Netlify pipeline; zero-dependency, instant load, global edge delivery