profile-pic

Hritwij Shrivastava

Enthusiastic and skilled Data Engineer with a strong foundation in Python and scalable data processing. Experienced in building end-to-end ETL pipelines, optimizing data workflows on Databricks, and developing efficient solutions for data transformation and integration. Proficient in collaborating with cross-functional teams to deliver high-quality, reliable data platforms that support analytics and business needs.
  • Role

    Python Engineer

  • Years of Experience

    4 years

  • Professional Portfolio

    View here

Skillsets

  • PySpark
  • Time Series Analysis
  • PostgreSQL
  • MongoDB
  • Machine Learning
  • Hypothesis testing
  • Data Visualization
  • Sqlite3
  • SQL
  • Snowflake
  • Python
  • AWS
  • PowerBI
  • MySQL
  • Microsoft Azure
  • MapReduce
  • GitLab
  • Github
  • ETL
  • Databricks
  • Data science pipeline
  • AWS

Professional Summary

4Years
  • Dec, 2022 - Present3 yr 2 months

    Software Engineer (Python Developer)

    Infinite Computer Solutions
  • Aug, 2021 - Dec, 20221 yr 4 months

    Assistant System Engineer (Python Developer)

    Tata Consultancy Services

Applications & Tools Known

  • icon-tool

    Snowflake

  • icon-tool

    PowerBI

Work History

4Years

Software Engineer (Python Developer)

Infinite Computer Solutions
Dec, 2022 - Present3 yr 2 months
    Led healthcare data processing initiatives on Databricks using PySpark and Python for scalable ETL frameworks. Designed and scaled data pipelines, implemented Medallion architecture, collaborated with testing teams, and enhanced data ingestion efficiency.

Assistant System Engineer (Python Developer)

Tata Consultancy Services
Aug, 2021 - Dec, 20221 yr 4 months
    Developed Python workflows for unstructured data ingestion into Snowflake. Automated data pipelines, implemented Snowflake-specific data models, conducted validation queries, and provided team training.

Major Projects

3Projects

Anticipating COVID Patient Patterns through Daily In-depth Tweet Analysis

    Gathered social media posts related to COVID, performed sentiment analysis using NLP techniques, classified sentiments, visualized trends, and proposed actionable insights.

Enhancing Tabular Data Classification Accuracy with GPT-2-Driven Feature Generation

    Developed synthetic features from tabular data using GPT-2. Fine-tuned GPT-2, created augmented datasets, integrated MLP classifier, and improved classification accuracy.

Transforming Time Series Forecasting with TS-Mixer Architecture

    Implemented TS-Mixer architecture to model temporal dependencies for forecasting. Preprocessed data, fine-tuned the model, integrated predictions into production pipeline, and improved forecasting metrics.

Education

  • Bachelor of Technology in Electronics and Instrumentation

    MAKAUT, Techno India, Salt Lake, Kolkata (2020)

Certifications

  • Aws certified machine learning – specialty (issued by aws)

  • Mitx micromasters (via edx)

  • Aws certified machine learning specialty

  • Mitx micromasters