profile-pic

PAPUN KUMAR

Experienced Data Engineer with over 6 years of experience in the IT domain, including more than 2 years of specialisation in AWS.

  • Role

    Senior Consultant

  • Years of Experience

    7.9 years

Skillsets

  • Advanced query tool
  • EventBridge
  • Glue
  • IAM
  • Kinesis
  • KMS
  • Lambda
  • MongoDB
  • Oracle
  • pgAdmin
  • Rally
  • RDS
  • SNS
  • SQS
  • Winzip
  • EC2
  • AS/400
  • HVR
  • Oracle SQL Developer
  • Pl/sql
  • Power BI
  • Rest APIs
  • RStudio
  • S3
  • Salesforce
  • SQL Server
  • Step functions
  • Talend Open Studio
  • Talend administration centre
  • Cropin
  • DynamoDB
  • Python - 4 Years
  • Apache Airflow
  • AutoSys
  • Bamboo
  • Bitbucket
  • Confluence
  • Crowd
  • Databricks
  • Embedded C
  • Github
  • Insomnia
  • Jira
  • Postman
  • DMS
  • Crawler
  • CloudWatch
  • CI/CD
  • aurora
  • Visual Studio
  • Talend
  • ServiceNow
  • Putty
  • Nexus
  • AWS - 4 Years
  • Terraform - 3 Years
  • SQL - 6 Years
  • Unix
  • R - 1 Years
  • C++

Professional Summary

7.9Years
  • Nov, 2021 - Present4 yr 2 months

    Senior Consultant

    Deloitte USI
  • Jul, 2021 - Nov, 2021 4 months

    Developer

    DataEconomy
  • Mar, 2017 - Aug, 20203 yr 5 months

    System Engineer

    Tata Consultancy Services

Applications & Tools Known

  • icon-tool

    Talend

  • icon-tool

    Postman

  • icon-tool

    MySQL

  • icon-tool

    Databricks

  • icon-tool

    Jira

  • icon-tool

    MongoDB

  • icon-tool

    AWS (Amazon Web Services)

  • icon-tool

    AWS Athena

  • icon-tool

    AWS CloudWatch

  • icon-tool

    Confluence

  • icon-tool

    Microsoft Power BI

  • icon-tool

    Git

  • icon-tool

    Visual Studio Code

  • icon-tool

    AWS Glue

  • icon-tool

    AWS Lambda

  • icon-tool

    AWS DynamoDB

  • icon-tool

    Apache Airflow

  • icon-tool

    AWS

  • icon-tool

    Databricks

  • icon-tool

    AutoSys

  • icon-tool

    Rally

  • icon-tool

    R studio

  • icon-tool

    ServiceNow

  • icon-tool

    Confluence

  • icon-tool

    Bitbucket

  • icon-tool

    Bamboo

  • icon-tool

    Crowd

  • icon-tool

    GitHub

  • icon-tool

    Nexus

  • icon-tool

    Insomnia

  • icon-tool

    Putty

  • icon-tool

    Visual Studio

Work History

7.9Years

Senior Consultant

Deloitte USI
Nov, 2021 - Present4 yr 2 months
    Promoted to Senior Consultant from Consultant. Led architecture and cloud-native data platforms on AWS with scalability planning to align ingestion, storage, and processing to SLAs. Designed and optimized robust pipelines across heterogeneous sources, improving pipeline reliability and maintainability. Directed end-to-end delivery of six critical systems while mentoring teams of 27, elevating delivery consistency. Implemented orchestration and scheduling, saving 18 hours/week through automation. Established data quality governance and validation frameworks, reducing defects and boosting analytics trust. Implemented monitoring and alerting via CloudWatch, strengthening operational resilience and reducing incident response time by 35%. Created documentation and handover materials to streamline onboarding and reduce knowledge gaps by 30%. Delivered data platform optimizations that cut annual operating costs by 12% through streamlined ETL and efficient cloud sizing across clients.

Developer

DataEconomy
Jul, 2021 - Nov, 2021 4 months
    Participated in structured knowledge transfer to understand internal usage of third-party tools and existing ETL workflows. Documented ETL patterns and recommended improvements to accelerate team ramp-up, reducing onboarding time by 10 days.

System Engineer

Tata Consultancy Services
Mar, 2017 - Aug, 20203 yr 5 months
    Promoted to System Engineer from Assistant System Engineer. Regional rollout enabled by setting up a new technical environment in India for pre-merger program, accelerating deployment within 8 weeks. SSO integration and R package migration via RStudio, streamlining build/deploy Talend ETL jobs to orchestrate data flows from in-house apps/SQL/AS400 into Salesforce and Cropin. ETL lifecycle ownership from requirements to maintenance, including scheduling, monitoring, and performance tuning. Logging, error handling, recovery practices cut job failures and boosted system stability. Technical documentation & root cause analysis to improve resilience and operational efficiency. Optimized Talend ETL data flows cutting batch processing time by 38% and delivering 15% higher nightly throughput for the India rollout. Decreased data storage and processing costs by 22%, reallocating savings to core analytics initiatives and accelerating project delivery timelines across India operations. Scaled ETL pipelines to handle fivefold peak loads while maintaining 99.95% uptime, ensuring timely data availability for critical business dashboards. Implemented automated data quality checks and exception handling reducing production incidents by 60% and improving release stability across ETL jobs for India.

Major Projects

6Projects

Oracle to PostgreSQL Migration & CDC Delta Pipelines

    Modernized enterprise data platform by migrating Oracle workloads to PostgreSQL and designing HVR-based delta ingestion for near real-time synchronization. Led migration of ~10 billion historical records from Oracle to PostgreSQL with minimal downtime, ensuring data integrity and performance. Implemented near real-time CDC pipelines using HVR and optimized ingestion to scale for up to 62 billion records, improving throughput by 40%.

Chatbot Analytics Data Platform on AWS & Databricks

    Built ingestion and validation pipelines to load chatbot JSON via REST APIs into Amazon Aurora, enabling SQL analytics and Power BI dashboards. Designed end-to-end pipelines using Databricks and AWS to ingest from 3 chatbot platforms into Aurora, reducing manual reporting by 20 hours/week. Scaled architecture to support 11 chatbot data sources, enhanced data quality, and led coordination across ~6 members, improving onboarding speed by 35%.

Enterprise Data Solutions Delivery

    Delivered multiple customer-aligned data solutions with focus on requirements, scalability, and reliability across heterogeneous sources. Owned requirements and scalability planning; led end-to-end delivery of six critical systems while mentoring teams of 27, increasing stakeholder satisfaction by 60%. Designed and optimized resilient pipelines across diverse sources, improving performance by 40% and boosting system reliability.

Talend ETL Orchestration to Salesforce/Cropin

    Engineered Talend jobs to move data from in-house applications and SQL/AS400 into downstream Salesforce and Cropin platforms. Built and managed Talend jobs end-to-end design, deployment, scheduling, tuning, increasing job throughput by 45% and strengthening pipeline stability. Installed and configured Talend stack; created documentation and led root cause analysis to cut recurring failures by 50%.

Atlassian Suite Migration & Administration

    Migrated Crop Sciences data and administered the full Atlassian suite JIRA, Confluence, Bitbucket, Bamboo, Crowd for seamless adoption. Executed migration and platform setup, enabling adoption with multiple projects onboarded and improving platform stability by 35%. Integrated Active Directory via Crowd and migrated CI/CD pipelines and repositories, reducing deployment friction by 40%.

Pre-Merger Environment Setup & R Package Integration

    Established new regional technical environment, customized R packages, and implemented SSO for secure rollouts. Set up the new environment to support pre-merger activities, achieving readiness within 8 weeks and accelerating regional rollout. Migrated/customized R packages and implemented SSO, reducing access issues by 45% and streamlining deployments.

Education

  • B.Tech (ECE)

    Northern India Engineering College (2016)