
UTKARSH TIWARI

Results-driven software engineer with expertise in Python, AI/ML, and cloud-native technologies. Passionate about problem-solving, system optimization, and driving innovation through technology.
  • Role

    Data and ML Engineer

  • Years of Experience

    5.42 years

Skillsets

  • GCS
  • S3
  • SQL
  • Unit-test
  • CentOS
  • EFK
  • ELK
  • Linux
  • MongoDB
  • Ubuntu
  • Unix
  • Apache PySpark
  • Bitbucket
  • Cloud VPN
  • GCP
  • Redis
  • GKE
  • Google ADK
  • Jenkins
  • LangGraph
  • LSTM
  • Os
  • Scikit-learn
  • TensorFlow
  • VPC
  • HuggingFace
  • LangChain
  • PEFT
  • Transformers
  • GitHub
  • Angular
  • Athena
  • AWS
  • Bash
  • CircleCI
  • Docker
  • EC2
  • ECR
  • Elasticsearch
  • FastAPI
  • Flask
  • Git
  • Airflow
  • GitLab
  • GitLab CI
  • Glue
  • Grafana
  • Kubernetes
  • Lambda
  • macOS
  • MLflow
  • MySQL
  • Prometheus
  • pytest
  • Python - 4.0 Years

Professional Summary

5.42 Years
  • May, 2025 - Present 1 yr

    Data & ML Engineer

    streamingo.ai
  • Jan, 2025 - May, 2025 4 months

    Member of Technical Staff - 2

    VoerEir AB
  • Aug, 2023 - Jan, 2025 1 yr 5 months

    Member of Technical Staff - 1

    VoerEir AB
  • Jan, 2022 - Aug, 2023 1 yr 7 months

    SDE

    Greendeck
  • Jun, 2022 - Aug, 2023 1 yr 2 months

    Software Engineer 1

    Quantive

Applications & Tools Known

  • Airflow
  • MLflow
  • Docker
  • Kubernetes
  • AWS
  • Azure
  • EC2
  • ECR
  • Lambda
  • Glue
  • Athena
  • S3
  • Grafana
  • Prometheus
  • CircleCI
  • Git
  • GitHub
  • GitLab

Work History

5.42 Years

Data & ML Engineer

streamingo.ai
May, 2025 - Present 1 yr
    • Built LangChain-based pipelines for document retrieval and context-aware question answering, integrating HuggingFace Transformers for embedding and generation tasks.
    • Fine-tuned open-source LLMs (Mistral, LLaMA) using PEFT with LoRA and QLoRA, enabling efficient on-premise inference while significantly reducing GPU memory requirements.
    • Built end-to-end MLOps pipelines for multiple models, handling training, inference, and deployment in production.
    • Improved model accuracy through spatial and temporal analytics, identifying performance gaps and implementing post-processing fixes.
    • Created an Agentic AI platform with Google ADK for Streamingo Atlas, a multi-agent FAQ system that automates documentation search and user support.
    • Developed internal Python packages to standardize model training, inference, and pipeline management, speeding up team development.
    • Restructured the inference workflow to run projects in parallel, cutting daily processing time by 5 hours.
    • Tuned CUDA settings for YOLO and MTO models, improving inference speed and GPU efficiency.
    • Set up CI/CD pipelines with Bitbucket and Jenkins to automate builds, tests, and deployments.
    • Led the migration from a service-based setup to a cloud-native architecture using Helm and Kubernetes, improving scalability.
    • Consolidated all GPU machines into a single Kubernetes cluster, removing manual GPU management and improving resource usage.
    • Managed GCP infrastructure (GKE, GCS, networking), optimizing cloud operations and reducing annual costs by 10-12%.

Member of Technical Staff - 2

VoerEir AB
Jan, 2025 - May, 2025 4 months

Member of Technical Staff - 1

VoerEir AB
Aug, 2023 - Jan, 2025 1 yr 5 months
    • Led AI/ML initiatives from design through deployment, shaping product direction and scalability.
    • Built AI/ML features that improved analytics and workflow efficiency, helping customers process data faster and unlock 60% more market opportunities.
    • Designed LSTM-based predictive models for AI-driven insights and better forecasting.
    • Created a RAG-based system as the foundation for AI-powered product features.
    • Debugged performance issues in a core product, boosting cloud performance by 20%.

Software Engineer 1

Quantive
Jun, 2022 - Aug, 2023 1 yr 2 months

SDE

Greendeck
Jan, 2022 - Aug, 2023 1 yr 7 months
    • Built a custom web scraping library that outperformed Scrapy in speed and efficiency.
    • Created an automation tool using LLMs and RAG that reduced manual work by 30%.
    • Managed a data pipeline handling 3+ million product documents daily in a Master-Slave setup, ensuring reliable low-latency transfers.
    • Refactored legacy code with OOP principles and design patterns, cutting code duplication by 25% and making the system easier to maintain.
    • Optimized data workflows with Airflow for better task scheduling and parallel processing.

Achievements

  • Hackathon winner, 2023 and 2024
  • Onboarded a client with a $450k deal, led the back-end integration, and transferred 25 million product documents to our data warehouse in a single week while ensuring seamless system integration, optimized data flow, and reliable functionality.
  • Active member of coding clubs, organizing workshops on programming, AI, and software development.

Education

  • Bachelor of Technology (B.Tech), Computer Science and Information Technology

    Shri Vaishnav Institute of Information and Technology (2022)

Certifications

  • CKA: Certified Kubernetes Administrator

  • Data Science Practitioners Course - IBM