PAPUN KUMAR

Data Engineer with over 6 years of experience in the IT domain, including more than 2 years specialising in AWS.

  • Role

    Senior Consultant

  • Years of Experience

    7.9 years

Skillsets

  • MSSQL
  • EventBridge
  • Glue
  • IAM
  • Kinesis
  • KMS
  • Lambda
  • MongoDB
  • Oracle
  • pgAdmin
  • Rally
  • RDS
  • SNS
  • SQS
  • WinZip
  • EC2
  • NoSQL
  • PL/SQL
  • Step Functions
  • Advanced Query Tool
  • Apache Airflow
  • Athena
  • Azure DevOps Server
  • Datadog
  • MSSQL Server
  • Oracle SQL Developer
  • RStudio
  • Talend Administration Center
  • Talend Open Studio
  • DynamoDB
  • Python - 4 Years
  • AutoSys
  • Bamboo
  • Bitbucket
  • Confluence
  • Crowd
  • Databricks
  • Embedded C
  • GitHub
  • Insomnia
  • Jira
  • Nexus
  • Postman
  • PuTTY
  • ServiceNow
  • Visual Studio
  • Aurora
  • CI/CD
  • CloudWatch
  • Crawler
  • DMS
  • AWS - 4 Years
  • Terraform - 3 Years
  • SQL - 6 Years
  • Unix
  • R - 1 Year
  • C++

Professional Summary

7.9 Years
  • Jan, 2021 - Present 5 yr 2 months

    Senior Consultant

    Deloitte USI
  • Jul, 2021 - Nov, 2021 4 months

    Developer

    Dataeconomy
  • Jan, 2018 - Dec, 2020 2 yr 11 months

    System Engineer

    Tata Consultancy Services
  • Jan, 2017 - Dec, 2017 11 months

    Assistant System Engineer

    Tata Consultancy Services

Applications & Tools Known

  • Talend
  • Postman
  • MySQL
  • Databricks
  • Jira
  • MongoDB
  • AWS (Amazon Web Services)
  • AWS Athena
  • AWS CloudWatch
  • Confluence
  • Microsoft Power BI
  • Git
  • Visual Studio Code
  • AWS Glue
  • AWS Lambda
  • AWS DynamoDB
  • Apache Airflow
  • AutoSys
  • Rally
  • RStudio
  • ServiceNow
  • Bitbucket
  • Bamboo
  • Crowd
  • GitHub
  • Nexus
  • Insomnia
  • PuTTY
  • Visual Studio

Work History

7.9 Years

Senior Consultant

Deloitte USI
Jan, 2021 - Present 5 yr 2 months
    PROJECT 1 (ORACLE TO POSTGRESQL MIGRATION & CDC DELTA PIPELINES) 2025-PRESENT
      • Led the successful migration of ~10 billion historical records from Oracle to PostgreSQL, ensuring data integrity, performance, and minimal downtime.
      • Designed and implemented near-real-time delta ingestion pipelines using HVR to capture and synchronise source system changes efficiently.
      • Collaborated closely with the client's enterprise architect to enhance and optimise the data ingestion framework, scaling the architecture to support future growth of up to 62 billion records.

    PROJECT 2 (CHATBOT ANALYTICS DATA PLATFORM ON AWS & DATABRICKS) 2025
      • Designed and built end-to-end data pipelines using Databricks, AWS services, and REST APIs (Postman) to ingest JSON data from three chatbot platforms into Amazon Aurora, enabling SQL-based analytics and executive-level Power BI dashboards for Directors and CTOs.
      • Enhanced and scaled the data ingestion and validation framework to support 11 chatbot data sources, improving data quality, extensibility, and onboarding efficiency through a newly developed architecture.
      • Collaborated with two internal team members while leading and coordinating two external teams (~6 members total), driving delivery of the new development plan through effective cross-team communication and technical guidance.

    PROJECT 3 (ENTERPRISE DATA SOLUTIONS DELIVERY) 2021-2024
      • Owned requirements gathering, analysis, and scalability planning to deliver data solutions aligned with customer needs.
      • Designed, developed, and optimised robust data pipelines across multiple source systems, ensuring performance and reliability.
      • Led the successful end-to-end delivery of six critical systems while managing and mentoring teams of 27 members.

Developer

Dataeconomy
Jul, 2021 - Nov, 2021 4 months
    Acquired in-depth knowledge of internal usage of third-party tools and existing ETL processes through structured knowledge transfer sessions.

System Engineer

Tata Consultancy Services
Jan, 2018 - Dec, 2020 2 yr 11 months
    PROJECT 4 (TALEND ETL ORCHESTRATION TO SALESFORCE/CROPIN) 2018-2020
      • Designed, developed, and managed Talend ETL jobs to orchestrate data flows from multiple in-house applications and SQL/AS400 databases to target platforms including Salesforce and Cropin.
      • Owned ETL analysis, job design, customization, deployment, scheduling, monitoring, performance tuning, and ongoing maintenance, ensuring reliable and optimized data pipelines.
      • Installed and configured Talend and related applications, produced comprehensive technical documentation, and led root cause analysis for job and system failures to improve stability and resilience.

    PROJECT 5 (ATLASSIAN SUITE MIGRATION & ADMINISTRATION) 2017-2018
      • Led migration of Crop Sciences data into the Atlassian ecosystem and administered the full Atlassian suite (JIRA, Confluence, Bitbucket, Bamboo, Crowd), ensuring seamless adoption and platform stability.
      • Set up and customized Atlassian tools by migrating existing projects, designing JIRA workflows and dashboards, managing user and project configurations, integrating Active Directory via Crowd, and migrating CI/CD pipelines and code repositories.

Assistant System Engineer

Tata Consultancy Services
Jan, 2017 - Dec, 2017 11 months
    PROJECT 6 (PRE-MERGER ENVIRONMENT SETUP & R PACKAGE INTEGRATION) 2017
      • Contributed to a pre-merger program by setting up a new technical environment in India, supporting platform readiness and regional rollout.
      • Customized and migrated R packages using RStudio, integrated Single Sign-On (SSO) authentication, and executed build and deployment activities to ensure secure and seamless system integration.

Major Projects

6 Projects

Oracle to PostgreSQL Migration & CDC Delta Pipelines

    Modernized enterprise data platform by migrating Oracle workloads to PostgreSQL and designing HVR-based delta ingestion for near real-time synchronization. Led migration of ~10 billion historical records from Oracle to PostgreSQL with minimal downtime, ensuring data integrity and performance. Implemented near real-time CDC pipelines using HVR and optimized ingestion to scale for up to 62 billion records, improving throughput by 35%.

Chatbot Analytics Data Platform on AWS & Databricks

    Built ingestion and validation pipelines to load chatbot JSON via REST APIs into Amazon Aurora, enabling SQL analytics and Power BI dashboards. Designed end-to-end pipelines using Databricks and AWS to ingest from 3 chatbot platforms into Aurora, reducing manual reporting by 18 hours/week. Scaled architecture to support 11 chatbot data sources, enhanced data quality, and led coordination across ~6 members, improving onboarding speed by 35%.

Enterprise Data Solutions Delivery

    Delivered multiple customer-aligned data solutions with focus on requirements, scalability, and reliability across heterogeneous sources. Owned requirements and scalability planning; led end-to-end delivery of six critical systems while mentoring teams of 27, increasing stakeholder satisfaction by 70%. Designed and optimized resilient pipelines across diverse sources, improving performance by 35% and boosting system reliability.

Talend ETL Orchestration to Salesforce/Cropin

    Engineered Talend jobs to move data from in-house applications and SQL/AS400 into downstream Salesforce and Cropin platforms. Built and managed Talend jobs end to end (design, deployment, scheduling, tuning), increasing job throughput by 60% and strengthening pipeline stability. Installed and configured the Talend stack, created documentation, and led root cause analysis to cut recurring failures by 50%.

Atlassian Suite Migration & Administration

    Migrated Crop Sciences data and administered the full Atlassian suite (JIRA, Confluence, Bitbucket, Bamboo, Crowd) for seamless adoption. Executed migration and platform setup, enabling adoption with 6 projects onboarded and improving platform stability by 60%. Integrated Active Directory via Crowd and migrated CI/CD pipelines and repositories, reducing deployment friction by 40%.

Pre-Merger Environment Setup & R Package Integration

    Established a new regional technical environment, customized R packages, and implemented SSO for secure rollouts. Set up the new environment to support pre-merger activities, achieving readiness within 8 weeks and accelerating regional rollout. Migrated and customized R packages and implemented SSO, reducing access issues by 40% and streamlining deployments.

Education

  • B.Tech (ECE)

    Northern India Engineering College (2016)