SGSumit Gupta
Remote, On-site · Full-time, Part-time, Contract
Kolkata, India
Results-driven Senior Data Engineer with 4+ years of experience at LTIMindtree, specializing in the Microsoft Azure data ecosystem. Expert in designing and delivering scalable ETL pipelines, cloud migrations, and Medallion architecture implementations.
Technology
Tools and technology
Azure Data Factory
Azure Synapse Analytics
Azure Databricks
ADLS Gen2
Azure Purview
Azure DevOps
PySpark
Python
SQL
T-SQL
Scala (basic)
Shell Scripting
ETL/ELT Pipelines
Medallion Architecture
Delta Lake
Data Warehousing
Data Governance
Data Lineage
CI/CD Pipelines
Terraform
ARM Templates
pytest
Great Expectations
Git
Databricks
Power BI (data layer)
Agile/Scrum
JIRA
Confluence
Full-time, Part-time, Contract
Experience
Recent roles and impact
Senior Data Engineer
LTIMindtree
2021-07
- Architected and maintained high-performance ETL frameworks using PySpark and SQL Server, achieving 35% reduction in pipeline execution time through advanced performance tuning and query optimization.
- Led end-to-end migration of legacy on-premise systems to Azure Cloud (ADF + Synapse + ADLS Gen2), ensuring 99.9% data availability and cutting infrastructure costs by ~25% for mission-critical analytics workloads.
- Designed and implemented Medallion Architecture (Bronze/Silver/Gold layers) on Azure Databricks, enabling clean, reliable data flows that reduced data quality incidents by 40% and accelerated BI reporting by 2x.
- Spearheaded engineering excellence by introducing automated testing frameworks (pytest + Great Expectations) and Infrastructure as Code (Terraform/ARM templates), reducing deployment errors by 60% and enabling repeatable, version-controlled infrastructure provisioning.
- Optimized Spark workloads by implementing adaptive query execution, partitioning strategies, and caching mechanisms, resulting in 45% improvement in job throughput across 10+ production pipelines processing 500GB+ daily.
- Established CI/CD pipelines using Azure DevOps, automating build, test, and deployment workflows that reduced release cycle time from 2 weeks to 3 days.
Education & Certifications
Formal education and credentials
Padmanava College of Engineering
Bachelor of Technology in Computer Science Engineering
2017 — 2021
Databricks Certified Associate Data Engineer
Databricks
2023
Generative AI for Data Professionals
Microsoft / Databricks
2024
Azure Data Factory & Pipelines
Microsoft Azure
2022