Hi, I'm

Data Engineer

SQL Server
PySpark
AWS
Airflow DAGs

Case Studies

Re-architecting
1.1B+ Row Tables

The Scale: At Mastercard, a 1.1B+ row SQL Server transaction table totaling 2.16 TB needed drastic optimization.

The ROI: I re-architected it using monthly date-based partitioning and columnstore compression across 290 partitions. Storage fell from 2.16 TB to 38.5 GB, a reduction of about 98%, and queries for reporting and time-series analytics ran significantly faster.
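To make the idea concrete, here is a minimal Python sketch of how monthly partition boundaries can be generated and how the storage figure works out. It is an illustration only, not the production SQL Server code: the function names and the example start date are hypothetical, and 2.16 TB is taken as 2160 GB.

```python
from datetime import date


def monthly_boundaries(start: date, months: int) -> list[date]:
    """Return the first day of `months` consecutive months, beginning
    with the month that contains `start`."""
    boundaries = []
    year, month = start.year, start.month
    for _ in range(months):
        boundaries.append(date(year, month, 1))
        month += 1
        if month > 12:  # roll over into the next year
            year, month = year + 1, 1
    return boundaries


def reduction_percent(before_gb: float, after_gb: float) -> float:
    """Percentage of storage saved going from before_gb to after_gb."""
    return (1 - after_gb / before_gb) * 100


# Hypothetical start date, chosen only to show the year rollover.
example_bounds = monthly_boundaries(date(2024, 11, 15), 3)

# Assumption: 2.16 TB is treated as 2160 GB (decimal units).
saved = round(reduction_percent(2160.0, 38.5), 1)  # 98.2
```

Boundaries like these would typically feed the value list of a partition function, so each month of transactions lands in its own partition.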

Accelerating
Ingestion Pipelines

The Scale: Hadoop-to-SQL Server bulk loads were a daily bottleneck, taking 5.6 hours per run.

The ROI: I built a PySpark and XML-driven ingestion framework that uses partition-log-based incremental processing, cutting load times to 1.2 hours, a reduction of roughly 79%. Orchestrated through SSIS, daily loads became faster, more reliable, and easier to operate.
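The core of partition-log-based incremental processing can be sketched in a few lines of plain Python. This is a simplified illustration under assumptions, not the actual PySpark framework: the partition names, the integer "version" stamps, and the `load` callback are all hypothetical.

```python
from typing import Callable


def pending_partitions(source: dict[str, int], log: dict[str, int]) -> list[str]:
    """Partitions that are new, or whose version differs from the one
    recorded in the log after the last successful load."""
    return sorted(p for p, version in source.items() if log.get(p) != version)


def run_incremental(
    source: dict[str, int],
    log: dict[str, int],
    load: Callable[[str], None],
) -> list[str]:
    """Load only the pending partitions and record each one in the log
    after its load succeeds, so a failed run can resume where it stopped."""
    loaded = []
    for partition in pending_partitions(source, log):
        load(partition)                    # hypothetical bulk-load step
        log[partition] = source[partition]
        loaded.append(partition)
    return loaded


# Hypothetical state: one partition changed, one is new, one is untouched.
source_state = {"2024-01": 1, "2024-02": 2, "2024-03": 1}
partition_log = {"2024-01": 1, "2024-02": 1}
loaded_now = run_incremental(source_state, partition_log, load=lambda p: None)
```

Because unchanged partitions are skipped, the daily run touches only new or modified data instead of reloading everything.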

Building Multidimensional
Analytics Cubes

The Scale: Business teams needed instant aggregation across 44 GB/day of raw incoming data.

The ROI: I operationalized daily Auth and Debit cubes that combine Hadoop, SQL Server, SSAS, and PySpark-built facts. The best load time was a record 38 minutes for 34 GB of Auth data, enabling fast, scalable payments analytics.

Delivering Executive BI
on a 2 TB Live Cube

The Scale: Business leaders needed numbers they could trust when monitoring payments performance at enterprise scale.

The ROI: I developed executive Power BI dashboards that run live on a 2 TB enterprise SSAS cube. They track core time-series KPIs (MoM, YoY, and growth trends) for top issues, giving stakeholders immediate visibility into performance at scale.
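For readers unfamiliar with the KPIs named above, this small Python sketch shows how month-over-month (MoM) and year-over-year (YoY) growth can be computed from a monthly series. It is illustrative only: the dashboards themselves are built on SSAS and Power BI, and the function names and sample values here are hypothetical.

```python
from typing import Optional


def shift_month(key: str, months_back: int) -> str:
    """Move a 'YYYY-MM' key back by the given number of months."""
    year, month = map(int, key.split("-"))
    index = year * 12 + (month - 1) - months_back
    return f"{index // 12:04d}-{index % 12 + 1:02d}"


def pct_change(current: float, previous: Optional[float]) -> Optional[float]:
    """Percentage change, or None when there is no usable baseline."""
    if previous is None or previous == 0:
        return None
    return (current - previous) / previous * 100


def growth_kpis(monthly: dict[str, float]) -> dict[str, dict[str, Optional[float]]]:
    """MoM and YoY growth for every month in the series."""
    return {
        month: {
            "mom": pct_change(value, monthly.get(shift_month(month, 1))),
            "yoy": pct_change(value, monthly.get(shift_month(month, 12))),
        }
        for month, value in monthly.items()
    }


# Hypothetical monthly totals, not real payments data.
kpis = growth_kpis({"2023-03": 100.0, "2024-02": 120.0, "2024-03": 150.0})
```

Months without a prior-month or prior-year value return `None` rather than a misleading zero.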

Designing Access
Governance at Scale

The Scale: Operating globally means enforcing and verifying access across complex Partner-Acquirer-Merchant hierarchies over 65M rows.

The ROI: I modeled and built a User Access Management dashboard enabling scalable region- and gateway-based access governance. This simplified enterprise access control and substantially improved reporting visibility across the business.
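As a rough illustration of region- and gateway-based access filtering, the Python sketch below keeps only the hierarchy rows a user has been granted. The data model, field names, and sample values are assumptions made for the example and do not reflect the actual schema.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class HierarchyRow:
    """One Partner-Acquirer-Merchant row (hypothetical shape)."""
    partner: str
    acquirer: str
    merchant: str
    region: str
    gateway: str


def visible_rows(
    rows: list[HierarchyRow], grants: set[tuple[str, str]]
) -> list[HierarchyRow]:
    """Keep only rows whose (region, gateway) pair is in the user's grants."""
    return [row for row in rows if (row.region, row.gateway) in grants]


# Hypothetical sample data.
rows = [
    HierarchyRow("P1", "A1", "M1", "APAC", "legacy"),
    HierarchyRow("P1", "A2", "M2", "APAC", "modern"),
    HierarchyRow("P2", "A3", "M3", "EMEA", "modern"),
]
allowed = visible_rows(rows, grants={("APAC", "modern"), ("EMEA", "modern")})
```

Granting access by (region, gateway) pair rather than per merchant keeps the number of rules small even as the hierarchy grows.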

Tracking Migration &
Operational Readiness

The Scale: Managing client movements between legacy and modern login gateways requires pinpoint tracking over entire regions.

The ROI: I designed a Power BI migration dashboard mapping regions, emails, and pending statuses. It gave business teams a real-time view of operational readiness, helping them clear bottlenecks and smooth the deployment lifecycle.

Owning Comprehensive
Power BI Operations

The Scale: BI infrastructure requires end-to-end administration to prevent reporting fragmentation and data leaks.

The ROI: I managed Power BI access governance end to end, including Active Directory configuration, Stage/Prod environment roles, refresh automation, and strict report onboarding policies, so that the company’s BI ecosystem runs safely and efficiently.


Experience

Oct 2024 – Present

Data Engineer

Mastercard (via Mthree/fulcrum) • Pune, India

  • Re-architected a 1.1B+ row (2.16 TB) SQL Server transaction table using monthly date-based partitioning and columnstore compression.
  • Reduced Hadoop-to-SQL Server bulk load time from 5.6 hours to 1.2 hours.
  • Built and operationalized daily multidimensional Auth and Debit cubes ingesting up to 34 GB/day.

Feb 2024 – Apr 2024

Software Engineer Intern

Wiley Edge • Bangalore, India

  • Built Frozen Fantasia, a full-stack web app using React and Spring Boot.

Apr 2023 – Mar 2024

Technical Trainer

Hope Foundation • Mumbai, India

  • Delivered Python and Life Skills training for 1,000+ rural girls.

Jul 2019 – Mar 2023

Corporate Technical Trainer

Bitstek Consulting • Hyderabad, India

  • Delivered training with strong expertise in Python, SQL, and Power BI.

Apr 2023 – Jun 2023

R&D ML Engineer Intern

Celestial Institute of Technology

  • Designed a ball-bearing fault detection system using Computer Vision & YOLOv5.

Jun 2017 – Jul 2017

Network Engineer Intern

Central Railway

  • Studied optical fiber networks and interlocking systems.

Education

🎓

Bachelor of Electronics & Telecomm

University of Mumbai

📜

Diploma in Electronics & Video Eng.

MSBTE • Chembur, Mumbai

Skills & Expertise

⚙️
Data Engineering

SQL • Python • Hadoop • Hive • PySpark / Spark • SQL Server • SSIS • SSAS

📊
Data Analytics

Tableau • Power BI • Data Visualization • AWS Cloud

💻
Engineering Excellence

Java • Spring • Spring Boot • React • HTML • CSS • Git

Projects

WANDERLUST

Feb 2023

Travel application with interactive maps, points of interest, accommodation listings, and a voice assistant.

Python • Tkinter • Folium

Smart Attendance

Apr 2019

CV-based attendance system utilizing Viola-Jones and LBPH algorithms on Raspberry Pi.

OpenCV • Python • Raspberry Pi

Password Door Lock

Apr 2015

Password-protected door-lock security system built on the AT89S52 microcontroller.

AT89S52 • Hardware

About Me

A Data Engineer focused on building scalable data platforms, optimizing large-scale transaction systems, and delivering analytics solutions that support critical business decisions. My core philosophy is that data infrastructure should directly drive company growth and eliminate operational bottlenecks.

My work combines data modeling, resilient pipeline engineering, query performance tuning, and BI governance to build reliable data ecosystems. I help organizations turn fragmented data into measurable ROI, cut storage costs, and give stakeholders the real-time visibility they need to operate faster and smarter.

Contact Me

✉️
Email
shatabdicofficial@protonmail.com
Typically replies within 24 hours
in
LinkedIn
Connect professionally
Learn more about me
🐙
GitHub
View projects & repositories
Code, pipelines & system design
🤗
Hugging Face
Models, datasets & experiments
NLP, LLMs & ML deployments
DB
Databricks
Lakehouse & data engineering work
Spark, pipelines & analytics infra