Taroon Kumar Ray

Strategic Data Engineer versed in distilling and analyzing large data sets. Develops and delivers presentations detailing data findings. Articulate and collaborative with expertise in algorithm design and data collection.
  • Role

    Senior Data Engineer II

  • Years of Experience

    13 years

Skillsets

  • Redshift
  • Spark SQL
  • S3
  • Query Tuning
  • Partitioning
  • Lakehouse
  • Lake Formation
  • Incremental processing
  • Data Vault
  • Data Modeling
  • Data Governance
  • Data Architecture
  • Cost Optimization
  • CI/CD
  • Spark Streaming
  • Python - 7 Years
  • Lambda
  • Iceberg
  • Glue
  • EMR
  • DMS
  • AWS
  • Airflow
  • Kafka
  • Dagster
  • Athena
  • PySpark - 4 Years
  • dbt - 4 Years
  • Snowflake - 2 Years
  • SQL - 12 Years

Professional Summary

13 Years
  • Feb, 2025 - Present 1 yr 3 months

    Senior Data Engineer II

    Storable India
  • Sep, 2024 - Feb, 2025 5 months

    Senior Data Engineer

    Highspot
  • Jun, 2021 - Sep, 2024 3 yr 3 months

    Senior Data Engineer

    Creditsafe Technology
  • Oct, 2020 - May, 2021 7 months

    Senior Data Warehouse Developer

    Vitech Systems Group
  • May, 2018 - Oct, 2020 2 yr 5 months

    Senior System Analyst

    Gap Inc.
  • Mar, 2015 - Apr, 2018 3 yr 1 month

    Senior Software Engineer

    UnitedHealth Group
  • Feb, 2013 - Mar, 2015 2 yr 1 month

    Software Engineer Data

    Accenture

Applications & Tools Known

  • Apache Airflow
  • AWS Glue
  • Redshift
  • Athena
  • Amazon Web Services
  • Spark SQL
  • SQL
  • Lambda
  • Django
  • Flask
  • Kafka
  • Apache Iceberg
  • Python
  • dbt
  • Azure DevOps
  • Git
  • Snowflake
  • Fivetran
  • Dagster
  • EMR
  • Oracle
  • PostgreSQL
  • Aurora
  • SQL Server
  • SAS
  • Hive

Work History

13 Years

Senior Data Engineer II

Storable India
Feb, 2025 - Present 1 yr 3 months
    • Defined and implemented enterprise-grade data architecture for a near real-time analytics platform using AWS DMS, Kafka, and Spark Streaming.
    • Designed end-to-end lakehouse architecture (Bronze, Silver, Gold) using Iceberg on S3, enabling schema evolution and ACID guarantees.
    • Reduced data latency by 70% and compute cost by 65% through architecture optimization and EMR Serverless tuning.
    • Architected One Big Table (OBT) strategy for self-service BI (Looker), improving business adoption by 50%.
    • Implemented data governance and fine-grained access control using AWS Lake Formation.
    • Evaluated Athena vs Pinot for the serving layer and drove architectural decisions based on latency vs cost tradeoffs.

Senior Data Engineer

Highspot
Sep, 2024 - Feb, 2025 5 months
    • Designed scalable data models and warehouse architecture in Snowflake to support analytics workloads.
    • Led schema evolution strategy for 20+ pipelines, ensuring zero downtime and backward compatibility.
    • Built data transformation architecture using dbt, leveraging modular design and incremental models for efficient processing.
    • Implemented event-driven data workflows using AWS services and Python-based orchestration to enable reliable data processing.

Senior Data Engineer

Creditsafe Technology
Jun, 2021 - Sep, 2024 3 yr 3 months
    • Owned data platform architecture and scalability roadmap for an AWS-based analytics platform.
    • Designed and implemented Data Vault 2.0 models to ensure enterprise-grade data consistency and auditability.
    • Reduced manual effort by 90% through architecture-led automation using custom Airflow plugins.
    • Led migration of legacy systems to cloud using AWS Glue and PySpark, ensuring reliability and performance.

Senior Data Warehouse Developer

Vitech Systems Group
Oct, 2020 - May, 2021 7 months
    • Led data migration architecture from Oracle to AWS Aurora (PostgreSQL), ensuring minimal downtime and resolving performance bottlenecks during transition.
    • Redesigned and optimized 30+ stored procedures and ETL workflows, improving query performance and supporting scalable analytical workloads.
    • Developed data models for complex reporting use cases, enabling efficient querying and downstream analytics.
    • Identified and resolved performance and data consistency issues, ensuring reliability of business-critical systems.
    • Collaborated with cross-functional teams to deliver enhancements and stabilize production workloads.

Senior System Analyst

Gap Inc.
May, 2018 - Oct, 2020 2 yr 5 months
    • Designed and developed analytics data models and marts using dbt, achieving 99% data accuracy through comprehensive testing and validation frameworks.
    • Built end-to-end data pipelines using AWS Glue and PySpark, consistently meeting 100% SLA requirements for business-critical reporting.
    • Engineered scalable data ingestion and transformation pipelines for retail systems, enabling enhanced BI and reporting capabilities.
    • Developed custom data extractors and API integrations (Airflow + REST/ORDS) to onboard external vendor data (e.g., payment and logistics systems).
    • Streamlined order processing workflows by integrating OMS and order booking systems, improving operational efficiency and data consistency.
    • Ensured data reliability and pipeline stability through monitoring, debugging, and continuous enhancements in production environments.

Senior Software Engineer

UnitedHealth Group
Mar, 2015 - Apr, 2018 3 yr 1 month
    • Designed and implemented custom monitoring and automation solutions extending beyond Guardium's native capabilities, improving operational efficiency.
    • Analyzed large-scale Guardium audit data (Oracle DB) to generate actionable insights for both operational and business stakeholders.
    • Optimized data processing workflows using Python (multi-threading) and SQL, improving performance across Oracle and SQL Server environments.
    • Developed automated alerting systems for STAP agent failures, enabling proactive issue detection and reducing downtime.
    • Maintained and governed whitelist data controls, ensuring compliance and minimizing false positives in security monitoring.
    • Collaborated with cross-functional teams to enhance data-driven monitoring and operational reliability frameworks.

Software Engineer Data

Accenture
Feb, 2013 - Mar, 2015 2 yr 1 month
    • Optimized performance of 40+ analytical ETL pipelines, improving processing efficiency and reducing execution time across enterprise data workflows.
    • Analyzed and resolved performance bottlenecks in data pipelines, enhancing scalability and reliability.
    • Developed and maintained data transformation logic to support reporting and analytics use cases.
    • Collaborated with cross-functional teams to deliver data-driven solutions aligned with business requirements.
    • Built foundational expertise in ETL optimization, SQL tuning, and data processing frameworks.

Achievements

  • Developed scalable and maintainable data ingestion frameworks, resulting in streamlined data integration processes.
  • Migrated legacy systems to modern cloud platforms using the Data Vault modeling technique, ensuring seamless business continuity.
  • Used AWS Glue and PySpark to process supplier files, a significant step towards a 100% cloud architectural footprint.
  • Built custom Airflow plugins that reduced manual effort by 90%.
  • Implemented incremental and snapshot modeling techniques with dbt to build data marts for business needs.
  • Received the "You Rock" and "The Gem" awards at Gap Inc. for outstanding work.

Major Projects

2 Projects

AWS Aurora Migration

Oct, 2020 - Jun, 2021 8 months
    Participated in the migration of data from Oracle to AWS Aurora.

OMS and OB Implementation

May, 2018 - Oct, 2020 2 yr 5 months
    Implemented OMS (Order Management System) and OB (Order Broker) for the Intermix brand.

Education

  • Bachelor of Science in Information Technology

    Biju Patnaik University of Technology