Taroon Kumar Ray

Strategic Data Engineer versed in distilling and analyzing large data sets. Develops and delivers presentations detailing data findings. Articulate and collaborative with expertise in algorithm design and data collection.
  • Role

    Senior Data Engineer II

  • Years of Experience

    12.10 years

Skillsets

  • Oracle
  • Data Warehousing
  • DMS
  • EMR
  • Glue
  • Hive
  • Iceberg
  • Lake Formation
  • Lambda
  • Looker
  • Normalization
  • AWS
  • PostgreSQL
  • Redshift
  • SAS
  • Shell Scripting
  • SNS
  • Spark
  • Spark Streaming
  • SQS
  • Unix
  • Data Vault 2.0
  • SQL - 12 Years
  • Data acquisitions
  • Snowflake - 2 Years
  • dbt - 4 Years
  • Azure DevOps
  • Git
  • PySpark - 4 Years
  • Python - 7 Years
  • Athena
  • Dagster
  • Dapr
  • Django
  • Fivetran
  • Flask
  • Kafka
  • Medallion Architecture
  • Airflow

Professional Summary

12.10 Years
  • Feb, 2025 - Present 11 months

    Senior Data Engineer II

    Storable India
  • Sep, 2024 - Feb, 2025 5 months

    Senior Data Engineer

    Highspot
  • Jun, 2021 - Sep, 2024 3 yr 3 months

    Data Engineering Lead

    Creditsafe Technology
  • Oct, 2020 - Jun, 2021 8 months

    Senior Data Warehouse Developer

    Vitech Asia
  • May, 2018 - Oct, 2020 2 yr 5 months

    Senior Systems Analyst Data

    Gap Inc
  • Mar, 2015 - Apr, 2018 3 yr 1 month

    Senior Software Engineer

    UnitedHealth Group
  • Feb, 2013 - Mar, 2015 2 yr 1 month

    Software Engineer

    Accenture

Applications & Tools Known

  • Apache Airflow
  • AWS Glue
  • Redshift
  • Athena
  • Amazon Web Services
  • Spark SQL
  • SQL
  • Lambda
  • Django
  • Flask
  • Kafka
  • Apache Iceberg
  • Python
  • dbt
  • Azure DevOps
  • Git
  • Snowflake
  • Fivetran
  • Dagster
  • EMR
  • Oracle
  • PostgreSQL
  • Aurora
  • SQL Server
  • SAS
  • Hive

Work History

Senior Data Engineer II

Storable India
Feb, 2025 - Present 11 months
    Achieved 70% faster data delivery by revamping data architecture for near real-time processing using AWS DMS. Optimized job efficiency in Bronze and Silver layers utilizing EMR Serverless and Spark Streaming. Unified data platform by implementing Iceberg tables and leveraging Athena for business transformation. Automated orchestration of Glue jobs through Airflow and developed a Dapr Consumer Client to trigger jobs via Kafka topics. Enhanced customer experience by integrating Looker insights into products and implemented fine-grained access control via AWS Lake Formation. Collaborated with cross-functional teams to gather requirements and deliver customized data solutions.

Senior Data Engineer

Highspot
Sep, 2024 - Feb, 2025 5 months
    Optimized ETL tasks using Snowflake features, including Streams and Materialized Views. Developed housekeeping solutions in Python to roll data into S3, utilizing AWS SNS and SQS for messaging. Automated error notifications by implementing Slack alerts via Dagster. Extracted data from diverse sources like Jira and Amplitude using Fivetran. Ensured data integrity by executing Dagster validation jobs and creating transformation models with dbt. Applied Zero Copy Cloning in Snowflake for effective internal database testing. Enhanced performance by optimizing server queries to reduce system load.

Data Engineering Lead

Creditsafe Technology
Jun, 2021 - Sep, 2024 3 yr 3 months
    Reduced manual effort by 90% by creating custom Airflow plugins and introducing new automation features. Led strategy for scalability and accuracy of data pipelines while managing the design of new data models. Advanced the move toward a 100% cloud architecture by onboarding legacy supplier data into the cloud using AWS Glue and PySpark. Supported business continuity by migrating legacy systems to modern cloud platforms with Data Vault 2.0 modeling. Maintained quality standards through rigorous code reviews and used AWS Lambda for metadata registration. Resolved customer issues with knowledgeable service to promote high satisfaction levels.

Senior Data Warehouse Developer

Vitech Asia
Oct, 2020 - Jun, 2021 8 months
    Managed data migration from Oracle to AWS Aurora and mitigated performance bottlenecks during the transition to PostgreSQL. Optimized ETL processes and developed data models to support complex reporting and analytical needs. Resolved business functionality bugs and implemented new requirements and enhancements.

Senior Systems Analyst Data

Gap Inc
May, 2018 - Oct, 2020 2 yr 5 months
    Achieved 99% data accuracy by constructing models and marts using dbt with rigorous testing protocols. Met 100% of SLA requirements for business reports by building end-to-end pipelines with Glue and PySpark. Enhanced BI functionality by designing engineering pipelines for Retail databases. Built custom extractors for Retail databases (orchestrated via Airflow) and REST APIs with ORDS to integrate vendors like Cybersource and UPS. Streamlined order processing by implementing the OMS (Order Management System) and OB (Order Broker) systems for the Intermix Brand.

Senior Software Engineer

UnitedHealth Group
Mar, 2015 - Apr, 2018 3 yr 1 month
    Engineered user monitoring and automated solutions to streamline manual tasks beyond Guardium's native capabilities. Analyzed Guardium data in Oracle DB to provide actionable insights for both operational and business teams. Optimized efficiency using Python multi-threading and SQL across Oracle and SQL Server databases. Created alert systems for STAP restarts to notify Operations Teams of potential issues. Collaborated with Project Management to maintain whitelist tables for user exceptions.

Software Engineer

Accenture
Feb, 2013 - Mar, 2015 2 yr 1 month
    Managed ETL performance by monitoring weekly/monthly jobs and resolving issues using PROC SQL and PROC SORT. Developed Linux/UNIX scripts to meet evolving client needs and data processing tasks using Oracle and SAS.

Achievements

  • Developed scalable and maintainable data ingestion frameworks, resulting in streamlined data integration processes.
  • Migrated legacy systems to modern cloud platforms using Data Vault 2.0 modeling, ensuring seamless business continuity.
  • Leveraged the power of AWS Glue and PySpark to process the supplier files, taking a significant step towards a 100% cloud architectural footprint.
  • Built custom Airflow Plugins to reduce manual effort by 90%.
  • Implemented incremental and snapshot modeling techniques with dbt to build data marts for business needs.
  • Received the "You Rock" and "The Gem" awards at Gap Inc. for outstanding work.

Major Projects

2 Projects

AWS Aurora Migration

Oct, 2020 - Jun, 2021 8 months
    Participated in migration of data from Oracle to AWS Aurora.

OMS and OB Implementation

May, 2018 - Oct, 2020 2 yr 5 months
    Implemented OMS (Order Management System) and OB (Order Broker) for Intermix Brand.

Education

  • Bachelor of Science: Information Technology

    Biju Patnaik University of Technology (2011)