Real-time Analytics Platform
Kafka & Spark streaming pipeline processing 1M events/sec with fault-tolerant event ingestion.
I architect low-latency distributed systems and large-scale ETL pipelines, building the infrastructure that powers real-time analytics for global enterprises.
Architected a multi-region data mesh serving 50+ internal teams. Reduced cloud infrastructure costs by 30%.
Built and owned the company's core data platform, enabling self-serve analytics for 20+ business teams.
Developed ETL pipelines and data models supporting product analytics for 5M+ daily active users.
Airflow-orchestrated Snowflake pipelines that reduced nightly batch runtime by 45%.
Built Kafka-to-Snowflake CDC templates for domain teams to self-serve compliant datasets.