Senior Data Engineer · GCP · Spark · Kafka

Kiran Sai
Kumar Kanta

7+ years designing and scaling distributed data platforms
across telecom and banking domains.

10M+ Records / Day
90% Pipeline Speedup
1700+ Tables Automated
01 — Who I Am

Building pipelines that
move the world's data.

I'm a Senior Data Engineer with 7+ years of experience architecting distributed data platforms across telecom and banking. Specialized in high-volume batch and streaming systems using GCP, Spark, Kafka, and modern cloud technologies. I excel at designing scalable data infrastructure that handles millions of daily transactions while maintaining data governance and compliance standards.

7+ Years Engineering
3 Cloud Certifications
5M+ Customers Served
3 Major Domains
02 — Experience

Where I've
built things.

Scroll through each role ↓

01
01 · Current

Infosys

Consultant — Data Engineer
Client: Bank of America
Jul 2025 — Present
1700+ Tables Automated
  • Designed and enhanced large-scale mortgage data ingestion pipelines processing business-critical financial datasets for one of the world's largest banks.
  • Built an AI-driven SQL generation and validation framework, automating ingestion for 1700+ tables with zero manual intervention.
  • Led migration analysis for Hive to Ozone across distributed storage infrastructure.
  • Ensured compliance with data governance, auditability and security standards.
HiveApache OzoneSQLAI AutomationData GovernanceBanking
02
02 · 4 Years

Accenture

Senior Data Engineer
Client: Virgin Media O2 UK
Jul 2021 — Jul 2025
90% Pipeline Speedup
  • Architected and optimized high-throughput data pipelines on Google Cloud Platform processing 10M+ daily records for telecom operations.
  • Engineered Apache Spark clusters delivering 90% pipeline speedup, reducing daily batch window from 12hrs to 1.2hrs.
  • Designed Kafka-based real-time event streaming infrastructure handling 50K+ events/second with sub-second latency.
  • Implemented comprehensive data quality framework using DBT and custom validation rules with automated alerting.
GCPBigQuerySparkKafkaDBTTelecom
03
03 · 2.5 Years

Tata Consultancy Services

Systems Engineer — Data
Multiple Fortune 500 Clients
Jan 2019 — Jul 2021
5M+ Customers Supported
  • Designed ETL/ELT pipelines using Hadoop ecosystem processing multi-petabyte datasets across financial and retail domains.
  • Built IBM MDM solutions ensuring single source of truth for customer data across enterprise systems.
  • Implemented GDPR-compliant data retention and purge policies managing sensitive personally identifiable information securely.
  • Mentored junior engineers on data engineering best practices, architecture patterns, and cloud migrations.
HadoopHiveIBM MDMETLGDPRSQL
GCPBigQueryDataflowSparkKafkaPythonScalaSQLHadoopHiveApache NiFiDBTData ModelingGDPRCloud RunIAMGit GCPBigQueryDataflowSparkKafkaPythonScalaSQLHadoopHiveApache NiFiDBTData ModelingGDPRCloud RunIAMGit

Professional Certifications

Industry-recognized credentials across cloud & data engineering

03 — Education

Academic Background

2017 — 2019
Master of Technology
Indian Institute of Technology (IIT) Bombay
Specialized in Data Science & Engineering. Thesis on distributed ML systems and large-scale data processing.
2013 — 2017
Bachelor of Technology
Jawaharlal Nehru Technological University
Computer Science & Engineering with strong foundations in algorithms, systems design, and architecture.
Open to opportunities & collaboration

Let's build
something great.

Email
LinkedIn
Location
Remote / Available Globally