Available for opportunities

Nikhar Mahendra Singh

Building scalable data pipelines that transform
raw data into intelligence.

Python, PySpark, Airflow, Snowflake, AWS, dbt, Redshift
3+ Years Exp
50+ Pipelines
500GB Daily Data
20+ APIs Built

I build the infrastructure that powers data-driven decisions.

Data Engineer with 3+ years at Tekion Corp, designing and operating production-grade data systems that process 500GB+ daily. I thrive in fast-paced startup environments where I get to build from scratch — from raw ingestion layers to the analytics endpoints that serve 100+ users.

My work sits at the intersection of distributed systems, cloud infrastructure, and data modelling. I believe every great data product starts with a well-engineered pipeline.

⚙️

ETL / ELT Pipelines

Batch & streaming ingestion with Airflow, AWS Glue & PySpark — 99.9% uptime

☁️

Cloud Infrastructure

AWS-native platforms with S3, Redshift, Lambda — 30% cost reduction

🏛️

Data Warehouse Design

Star-schema modelling in Snowflake & Redshift powering BI for 100+ users

Nikhar Mahendra Singh

Data Engineer @ Tekion Corp 📍 Bengaluru, India
Domain: Automotive SaaS, Retail
Education: B.Tech, KIIT University
Languages: English, Hindi

How Data Flows Through the Pipeline

End-to-end data engineering — from raw sources to production-ready analytics

🗂️

Data Sources

REST APIs, Databases, S3/GCS, Kafka Streams

REST APIs, PostgreSQL, S3, Kafka
📥

Ingestion

Orchestrated batch & real-time data ingestion

Airflow, AWS Glue, Lambda, Fivetran

Processing

Distributed compute for transformation & enrichment

PySpark, Spark, dbt, Databricks
🗄️

Warehousing

Structured storage optimized for analytical queries

Redshift, Snowflake, PostgreSQL, Delta Lake
📊

Analytics / ML

Dashboards, APIs & ML-ready feature stores

Power BI, FastAPI, Superset, Tableau
etl_pipeline.py — data_engineer@tekion
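
The flow above (sources, ingestion, processing, warehousing, analytics) can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not the production code: the function names, the `dealer_id` and `amount` fields, and the sample records are all hypothetical, and in-memory lists stand in for the real sources and warehouse.

```python
from collections import defaultdict


def extract():
    """Stand-in for reading from a source (REST API, database, S3, Kafka).

    The records below are made-up sample data for illustration only.
    """
    return [
        {"dealer_id": "D1", "amount": "120.50"},
        {"dealer_id": "D2", "amount": "80.00"},
        {"dealer_id": "D1", "amount": None},  # incomplete record
        {"dealer_id": "D1", "amount": "30.00"},
    ]


def transform(rows):
    """Drop rows with a missing amount and cast the rest to float."""
    return [
        {"dealer_id": row["dealer_id"], "amount": float(row["amount"])}
        for row in rows
        if row["amount"] is not None
    ]


def load(rows):
    """Aggregate totals per dealer; a dict stands in for a warehouse table."""
    totals = defaultdict(float)
    for row in rows:
        totals[row["dealer_id"]] += row["amount"]
    return dict(totals)


def run_pipeline():
    """Chain the three stages in order: extract, transform, load."""
    return load(transform(extract()))


if __name__ == "__main__":
    print(run_pipeline())
```

In a real deployment each stage would typically be a separate orchestrated task (for example an Airflow task calling a PySpark job), so that a failed stage can be retried without rerunning the ones before it.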

Tools & Technologies

The stack I use to build production-grade data systems

Languages

Python, SQL, PySpark, Bash, JavaScript

Orchestration

Apache Airflow, AWS Glue, Lambda, Prefect

Processing

Apache Spark, PySpark, dbt, Databricks, Pandas

Storage

AWS Redshift, Snowflake, PostgreSQL, MongoDB, Redis, Delta Lake

Cloud & DevOps

AWS (S3, EC2, IAM), Docker, Git, CI/CD, Terraform

APIs & Serving

FastAPI, Flask, REST APIs, Power BI, Tableau, Superset

Featured Projects

Enterprise data systems and open-source experiments

🏢 Enterprise · Production

Analytics Data Warehouse

Designed and built the core analytical tables powering business-critical dashboards — modelled raw data from multiple sources into clean, reliable datasets using PySpark & dbt, orchestrated with Airflow and stored in Redshift. These tables drive daily reporting and decision-making across the organization.

PySpark, Airflow, Redshift, S3, dbt
🏢 Enterprise · Production

Data Platform API Layer

20+ API endpoints built with AWS Chalice, serving the internal Data Platform UI for 100+ daily users. The layer queries internal PostgreSQL databases with caching to cut repeated reads, enforces role-based access control, and supports multi-region pipeline monitoring in the UI across geographies.

AWS Chalice, PostgreSQL, Caching, RBAC, REST API

Twitton — Twitter Clone API

Full-featured social media backend with RESTful APIs, user authentication, and real-time features.

Flask, REST API, SQLite, JWT Auth

Digital Farming Solution

IoT-based agricultural platform with data analytics for crop monitoring and yield optimization.

Python, IoT, Data Analytics, ML, Django

💰 MoneyTracker

Full-stack Flask application for tracking personal finances with MongoDB backend and RESTful API architecture.

Flask, MongoDB, REST API, Python

🔍 LeetCode Scraper

Flask-based web scraping app for extracting LeetCode problem data — demonstrating data extraction and API skills.

Python, Flask, BeautifulSoup, REST API

🎥 ScreenAssist — AI Video Query

GPT-powered intelligent video analysis tool enabling users to query videos and find clips via natural language.

Python, GPT/OpenAI, NLP, AI/ML

Work Experience

Data Analyst - 2 (Data Engineer)

Tekion Corp
Apr 2024 — Present
  • Built and maintained 20+ RESTful APIs supporting the Data Platform web UI, serving 100+ daily users
  • Designed scalable ETL pipelines processing 500GB+ daily data from multiple sources with 99.9% reliability
  • Implemented data quality frameworks reducing data anomalies by 75% using Great Expectations
  • Optimized AWS infrastructure (S3, Glue, Lambda) reducing costs by 30% while improving performance
  • Automated data workflows using Apache Airflow, reducing manual intervention by 80%
PySpark, Airflow, AWS, FastAPI, Redshift

Data Analyst - 1

Tekion Corp
Jun 2022 — Apr 2024
  • Analyzed 10M+ records of campaign, retail, and sales data to identify trends and generate actionable insights
  • Collaborated with Data Scientists to improve ML models, achieving a 15% accuracy improvement through feature engineering
  • Designed and automated the Dealership Exit Process, reducing manual work by 50%
  • Built 15+ Power BI dashboards providing real-time insights to stakeholders across 5 departments
  • Performed root cause analysis on data quality issues, improving data accuracy by 40%
Python, SQL, Power BI, Snowflake

Programmer Analyst Trainee

Cognizant
Jan 2022 — Apr 2022
  • Built an end-to-end data warehouse for a Customer Information System processing 1M+ customer records
  • Developed 10+ ETL pipelines using Informatica, streamlining data warehouse operations
  • Completed intensive training in Python, SQL, Unix, Informatica, Tableau, and IBM Cognos
SQL, Informatica, Python, Tableau

Bachelor of Technology

KIIT Deemed To Be University 2018 — 2022

GPA: 8.70

Senior Secondary (XII)

Vishwa Bandhu Academy 2018

Percentage: 70%

Let's Connect

Open to data engineering roles, collaborations, and interesting conversations about data infrastructure.