Principal Data Engineer

Rajeev Jasti

I architect low-latency distributed systems and large-scale ETL pipelines, and build the infrastructure that powers real-time analytics for global enterprises.


$ cat current_focus.md

Real-time lakehouse observability rollout
Self-serve pipeline templates for domain teams

$ stack --active

Kafka Spark Airflow Snowflake Terraform Kubernetes

$ tail -n 3 recent_wins.log

[OK] P95 latency down 40%
[OK] Cloud spend down 30%
[OK] Throughput stable at 1M+ events/sec

10+ Years Experience
50+ Projects Delivered
1M+ Events/Second
30% Cost Reduction
40% Latency Improvement
99.9% Uptime

Technical Skills

Data Processing

Apache Spark, Kafka, Apache Airflow, Apache Flink, dbt

Cloud & Infrastructure

AWS, Kubernetes, Terraform, Docker, GCP

Data Warehouse

Snowflake, Amazon Redshift, BigQuery, Delta Lake

Languages

Python, SQL, Scala, Bash

Databases

PostgreSQL, MongoDB, Redis, Cassandra

Observability

Prometheus, Grafana, Datadog, OpenTelemetry

Experience

Company names are intentionally omitted for confidentiality; scope and outcomes reflect production work.

2022–Present • Remote

Principal Data Engineer • Global Enterprise

Architected a multi-region data mesh serving 50+ internal teams. Reduced cloud infrastructure costs by 30%.

Designed and deployed Kafka-to-Snowflake CDC pipelines handling 1M+ events/sec
Led migration from monolithic ETL to modular, domain-owned data products
Reduced P95 query latency by 40% through Spark optimization and partition tuning
Mentored a team of 6 engineers across 3 time zones

Kafka, Spark, Snowflake, Kubernetes, Terraform

2019–2022 • Hybrid

Senior Data Engineer • Data Platform Company

Built and owned the company's core data platform, enabling self-serve analytics for 20+ business teams.

Delivered Airflow-orchestrated ETL pipelines processing 500GB+ nightly
Migrated legacy Oracle DWH to Snowflake, cutting storage costs by 40%
Implemented a data quality framework that reduced production incidents by 60%

Airflow, Snowflake, Python, dbt, AWS

2016–2019 • On-site

Data Engineer • Product Analytics Team

Developed ETL pipelines and data models supporting product analytics for 5M+ daily active users.

Built real-time dashboards using Spark Streaming and Kafka
Designed dimensional data models in PostgreSQL and Redshift
Automated pipeline monitoring, reducing MTTR from hours to minutes

Spark, Kafka, PostgreSQL, Redshift, Python

Projects

Selected platform work focused on reliability, observability, and reusable delivery patterns.

Real-time Analytics Platform

Kafka and Spark streaming pipeline processing 1M+ events per second with fault-tolerant event ingestion.

Kafka, Spark, Kubernetes

Cloud ETL Modernization

Airflow-orchestrated Snowflake pipelines that reduced nightly batch runtime by 45%.

Airflow, Snowflake, dbt

CDC Data Mesh Enablement

Built Kafka-to-Snowflake CDC templates for domain teams to self-serve compliant datasets.

Kafka, Snowflake, CDC

Interests & Hobbies

What I do when I'm not building data pipelines.

Soccer

Sunday League Striker

I play competitively on weekends, always looking to improve my striking and tactical awareness on the pitch.

Sauna

The Daily Detox

Nothing beats a 20-minute sauna session after a long day of engineering. It is my reset button for recovery and deep thinking.

Workout

Strength & Conditioning

Lifting heavy and staying mobile. Consistency in the gym translates directly to discipline and focus at the keyboard.

Foodie

Culinary Explorer

Always on the hunt for the best local eats, from hidden street food gems to trying my hand at cooking new cuisines at home.


Let's talk.

Have a project in mind or want to connect? Send me a message, and I'll get back to you soon.

Email: inboxtorj@gmail.com
