profile-pic

Taroon Kumar Ray

Strategic Data Engineer versed in distilling and analyzing large data sets. Develops and delivers presentations detailing data findings. Articulate and collaborative with expertise in algorithm design and data collection.
  • Role

    Senior Data Engineer II

  • Years of Experience

    12.10 years

Skillsets

  • Athena
  • Emr hive
  • Spark SQL
  • Serverless
  • Medallion Architecture
  • Kafka
  • Flask
  • Fivetran
  • Django
  • Dapr
  • Dagster
  • AWS Redshift
  • AWS Lambda
  • AWS Glue
  • AWS EMR
  • AWS DMS
  • Python - 7 Years
  • Amazon Web Services
  • Normalization techniques
  • Data Vault 2.0
  • Data acquisitions
  • PySpark - 4 Years
  • Apache Airflow - 4 Years
  • Data Vault 2.0
  • Git
  • Azure DevOps
  • dbt - 4 Years
  • Snowflake - 2 Years
  • Normalization techniques
  • Data acquisitions
  • SQL - 12 Years

Professional Summary

12.10Years
  • Feb, 2025 - Present 10 months

    Senior Data Engineer II

    Storable India
  • Sep, 2024 - Feb, 2025 5 months

    Senior Data Engineer

    Highspot
  • Jun, 2021 - Sep, 20243 yr 3 months

    Data Engineering Lead

    Creditsafe Technology
  • Mar, 2015 - Apr, 20183 yr 1 month

    Senior Software Engineer

    UnitedHealth Group
  • May, 2018 - Oct, 20202 yr 5 months

    Senior Systems Analyst

    Gap Inc
  • Oct, 2020 - Jun, 2021 8 months

    Senior Database Developer

    Vitech Asia
  • Feb, 2013 - Mar, 20152 yr 1 month

    Software Engineer

    Accenture

Applications & Tools Known

  • icon-tool

    Apache Airflow

  • icon-tool

    AWS Glue

  • icon-tool

    Redshift

  • icon-tool

    Athena

  • icon-tool

    Amazon Web Services

  • icon-tool

    Spark SQL

  • icon-tool

    SQL

  • icon-tool

    Glue

  • icon-tool

    Lambda

  • icon-tool

    Django

  • icon-tool

    Flask

  • icon-tool

    Kafka

  • icon-tool

    Apache Iceberg

  • icon-tool

    Python

  • icon-tool

    dbt

  • icon-tool

    Azure DevOps

  • icon-tool

    Git

  • icon-tool

    Snowflake

  • icon-tool

    Fivetran

  • icon-tool

    Dagster

  • icon-tool

    EMR

  • icon-tool

    Azure DevOps

  • icon-tool

    Oracle

  • icon-tool

    PostgreSQL

  • icon-tool

    Aurora

  • icon-tool

    SQL Server

  • icon-tool

    SAS

  • icon-tool

    SQL

  • icon-tool

    Hive

  • icon-tool

    Azure DevOps

  • icon-tool

    SQL

  • icon-tool

    Kafka

Work History

12.10Years

Senior Data Engineer II

Storable India
Feb, 2025 - Present 10 months
    Revamping the data architectural footprint, moving away from Snapshot based processing to near real-time processing using AWS DMS to ensure faster data delivery to the downstream systems. Leveraging EMR Serverless and Spark Streaming for faster jobs and ready Bronze and Silver layers. Using Iceberg tables to build a unified data platform, and Athena to build data marts with necessary business transformations. Glue jobs orchestrated via Airflow for timely data delivery. Built Dapr Consumer Client to consume Application Messages from Kafka topic and trigger Airflow jobs. Collaborated with other teams to deliver solutions. Leveraging Looker to build reports integrated into the product for customers.

Senior Data Engineer

Highspot
Sep, 2024 - Feb, 2025 5 months
    Snowflake is the data warehouse. Leveraged Snowstream, Snowpipe, Snowtask, Dynamic Tables, and Materialized Views for ETL tasks. Built SnowProc using Python for HouseKeeping, rolling data into S3, using AWS SNS, SQS, and Dagster job for Slack notifications. Used Fivetran to pull data from multiple sources. Dagster jobs perform tests and monitor Snowflake tables for data correctness. Slack integration with Dagster for alerts. Used DBT for data transformation models. Optimized queries to improve performance and reduce server load. Used Snowflake's Zero Copy Cloning for internal testing.

Data Engineering Lead

Creditsafe Technology
Jun, 2021 - Sep, 20243 yr 3 months
    Developed and implemented data engineering strategies to improve scalability, performance, and accuracy of data pipelines. Managed design, development, and implementation of new data models. Promoted customer satisfaction by resolving problems. Built airflow plugins and enhanced automated solutions, reducing manual effort by 90%. Reviewed code for quality standards. Used AWS Lambda for file metadata registration. Onboarded Legacy Supplier data to Cloud using AWS Glue and PySpark. Migrated legacy systems to cloud platforms with DataVault modeling. Built business marts using dbt with necessary transformations. Learned Apache Iceberg and Kafka for real-time data processing. Used Airflow for pipeline orchestration. Consumed Supplier APIs using Python REST. Scraped data from HTML pages. Created documentation and training materials for knowledge transfer.

Senior Database Developer

Vitech Asia
Oct, 2020 - Jun, 2021 8 months
    Participated in migration of data from Oracle to AWS Aurora. Mitigated performance bottlenecks post migration from Oracle PL/SQL to PostgreSQL PL/pgSQL. Resolved business bugs, new requirements, and enhancements.

Senior Systems Analyst

Gap Inc
May, 2018 - Oct, 20202 yr 5 months
    Implemented OMS (Order Management System) and OB (Order Broker) for Intermix Brand. Developed REST APIs using ORDS (Oracle Rest Data Services) for web service integrations with vendors such as Cybersource, Clutch, DEG, UPS.

Senior Software Engineer

UnitedHealth Group
Mar, 2015 - Apr, 20183 yr 1 month
    Built automated solutions for manual tasks and Guardium limitations. Developed solution to monitor STAP restarts and alert Operations Team. Analyzed Guardium data for business and team insights. Built solution to monitor and whitelist users in Guardium. Worked on SQL, Python, Oracle, SQL Server, Guardium, and Python multithreading.

Software Engineer

Accenture
Feb, 2013 - Mar, 20152 yr 1 month
    Used PROC SQL, PROC SORT to resolve issues. Monitored weekly and monthly ETL jobs. Created and modified Linux scripts as per project and client requirements. Worked on SQL, Oracle, SAS, and UNIX Shell Scripting.

Achievements

  • Developed scalable and maintainable data ingestion frameworks, resulting in streamlined data integration processes.
  • Migrated legacy systems to modern cloud platforms leveraging DataVault modeling technique, ensuring seamless business continuity.
  • Leveraged the power of AWS Glue and PySpark to process the supplier files, taking a significant step towards a 100% cloud architectural footprint.
  • Built custom Airflow Plugins to reduce manual effort by 90%.
  • Implemented incremental and snapshot modeling techniques with dbt to build data marts for business needs.
  • "You Rock" award at Gap Inc.
  • "The Gem" award at Gap Inc.
  • Was awarded "You Rock" and "The Gem" award for the outstanding work done.
  • You Rock
  • The Gem

Major Projects

2Projects

AWS Aurora Migration

Oct, 2020 - Jun, 2021 8 months
    Participated in migration of data from Oracle to AWS Aurora.

OMS and OB Implementation

May, 2018 - Oct, 20202 yr 5 months
    Implemented OMS (Order Management System) and OB (Order Broker) for Intermix Brand.

Education

  • Bachelor of Science in Information Technology

    Biju Patnaik University of Technology