End-to-end ETL pipeline that ingests 700+ Reddit posts per day from 14 subreddits using Kafka and Apache Airflow (6 DAGs), with multi-method anomaly detection (z-score, volume spike, TF-IDF + K-Means). Built for a Tesla data engineering interview.
Nov 2025