Building scalable data pipelines that transform
raw data into intelligence.
Data Engineer with 3+ years at Tekion Corp, designing and operating production-grade data systems that process 500GB+ daily. I thrive in fast-paced startup environments where I get to build from scratch — from raw ingestion layers to the analytics endpoints that serve 100+ users.
My work sits at the intersection of distributed systems, cloud infrastructure, and data modelling. I believe every great data product starts with a well-engineered pipeline.
Batch & streaming ingestion with Airflow, AWS Glue & PySpark — 99.9% uptime
AWS-native platforms with S3, Redshift, Lambda — 30% cost reduction
Star-schema modelling in Snowflake & Redshift powering BI for 100+ users
End-to-end data engineering — from raw sources to production-ready analytics
REST APIs, Databases, S3/GCS, Kafka Streams
Orchestrated batch & real-time data ingestion
Distributed compute for transformation & enrichment
Structured storage optimized for analytical queries
Dashboards, APIs & ML-ready feature stores
The stack I use to build production-grade data systems
Enterprise data systems and open-source experiments
Designed and built the core analytical tables powering business-critical dashboards — modelled raw data from multiple sources into clean, reliable datasets using PySpark & dbt, orchestrated with Airflow and stored in Redshift. These tables drive daily reporting and decision-making across the organization.
20+ API endpoints built with AWS Chalice serving the internal Data Platform UI for 100+ daily users. Queries internal PostgreSQL databases with efficient caching techniques, enforces role-based access control, and adds multi-region support in the UI for pipeline monitoring across geographies.
Full-featured social media backend with RESTful APIs, user authentication, and real-time features.
IoT-based agricultural platform with data analytics for crop monitoring and yield optimization.
Full-stack Flask application for tracking personal finances with MongoDB backend and RESTful API architecture.
Flask-based web scraping app for extracting LeetCode problem data — demonstrating data extraction and API skills.
GPT-powered intelligent video analysis tool enabling users to query videos and find clips via natural language.
GPA: 8.70
Percentage: 70%
Open to data engineering roles, collaborations, and interesting conversations about data infrastructure.