- Built LangChain-based pipelines for document retrieval and context-aware question answering, integrating HuggingFace Transformers for embedding and generation tasks (retrieval sketch below).
- Fine-tuned open-source LLMs (Mistral, LLaMA) using PEFT with LoRA and QLoRA, enabling efficient on-premise inference while significantly reducing GPU memory requirements (fine-tuning sketch below).
- Built end-to-end MLOps pipelines for multiple models, covering training, inference, and production deployment.
- Improved model accuracy through spatial and temporal analytics, identifying performance gaps and implementing post-processing fixes.
- Created an Agentic AI platform with Google ADK for Streamingo Atlas, a multi-agent FAQ system that automates documentation search and user support (agent sketch below).
- Developed internal Python packages to standardize model training, inference, and pipeline management, speeding up team development (interface sketch below).
- Restructured the inference workflow to run projects in parallel, cutting daily processing time by 5 hours (parallelism sketch below).
- Tuned CUDA settings for the YOLO and MTO models, improving inference speed and GPU efficiency (CUDA sketch below).
- Set up CI/CD pipelines with Bitbucket and Jenkins to automate builds, tests, and deployments.
- Led the migration from a service-based setup to a cloud-native architecture using Helm and Kubernetes, improving scalability.
- Consolidated all GPU machines into a single Kubernetes cluster, removing manual GPU management and improving resource utilization.
- Managed GCP infrastructure (GKE, GCS, networking), optimizing cloud operations and reducing annual costs by 10-12%.
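A minimal sketch of the retrieval and question-answering flow from the first bullet, assuming the langchain, langchain-community, langchain-huggingface, and faiss-cpu packages; the model names, document chunks, and query are illustrative placeholders, not the production configuration.

```python
from langchain_community.vectorstores import FAISS
from langchain_huggingface import HuggingFaceEmbeddings, HuggingFacePipeline
from langchain.chains import RetrievalQA

# Placeholder chunks; a real pipeline would produce these with a loader and splitter.
docs = [
    "Passwords can be reset from the account settings page.",
    "Exports are available in CSV and JSON formats.",
]

# HuggingFace sentence-transformer embeddings feeding a FAISS index.
embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2")
vectorstore = FAISS.from_texts(docs, embeddings)

# A local HuggingFace generation model wrapped as a LangChain LLM.
llm = HuggingFacePipeline.from_model_id(
    model_id="mistralai/Mistral-7B-Instruct-v0.2",
    task="text-generation",
    pipeline_kwargs={"max_new_tokens": 256},
)

# Retrieval-augmented QA: the top-k chunks are injected into the prompt as context.
qa = RetrievalQA.from_chain_type(
    llm=llm,
    retriever=vectorstore.as_retriever(search_kwargs={"k": 2}),
)
print(qa.invoke({"query": "How do I reset my password?"})["result"])
```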
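A minimal QLoRA-style sketch for the fine-tuning bullet, assuming the transformers, peft, and bitsandbytes packages; the base model, LoRA hyperparameters, and target modules are illustrative, not the exact values used.

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# 4-bit NF4 quantization: the frozen base weights stay quantized,
# which is what drives the large GPU memory savings.
bnb = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Mistral-7B-v0.1",
    quantization_config=bnb,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)

# Low-rank adapters on the attention projections; only these are trained.
lora = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora)
model.print_trainable_parameters()  # typically well under 1% of total weights
```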
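A minimal sketch of an ADK agent in the spirit of the FAQ system, assuming the google-adk package; the search_docs tool, instruction text, and model name are hypothetical stand-ins for the actual Streamingo Atlas agents.

```python
from google.adk.agents import Agent

def search_docs(query: str) -> dict:
    """Hypothetical tool: look up the query in the documentation index."""
    # A real implementation would query a search backend or vector store.
    return {"status": "success", "results": ["Relevant documentation snippet."]}

# ADK wraps plain Python functions as tools the agent can call.
faq_agent = Agent(
    name="faq_agent",
    model="gemini-2.0-flash",
    instruction=(
        "Answer user questions about the product. "
        "Use the search_docs tool before answering."
    ),
    tools=[search_docs],
)
```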
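A sketch of the kind of shared contract such an internal package might expose so every project wires up training and inference the same way; the class and method names are hypothetical, not the actual internal API.

```python
from abc import ABC, abstractmethod

class ModelPipeline(ABC):
    """Common interface each project's pipeline implements (hypothetical)."""

    @abstractmethod
    def train(self, dataset_path: str) -> None:
        """Run the project-specific training loop."""

    @abstractmethod
    def predict(self, batch: list) -> list:
        """Run inference on a batch and return predictions."""

class DetectionPipeline(ModelPipeline):
    # One concrete project; orchestration code only sees ModelPipeline.
    def train(self, dataset_path: str) -> None:
        ...

    def predict(self, batch: list) -> list:
        return []
```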
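A minimal sketch of the parallel restructuring, assuming per-project inference jobs that previously ran back to back; run_project and the project names are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed

def run_project(name: str) -> str:
    """Hypothetical: launch one project's inference batch and block until done."""
    ...
    return name

projects = ["project_a", "project_b", "project_c"]

# Submitting all projects at once lets slow jobs overlap instead of serializing.
with ThreadPoolExecutor(max_workers=len(projects)) as pool:
    futures = {pool.submit(run_project, p): p for p in projects}
    for fut in as_completed(futures):
        print(f"{futures[fut]} finished")
```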
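A sketch of typical PyTorch-side CUDA knobs for this kind of inference tuning, assuming torch, torchvision, and a CUDA-capable GPU; the resnet18 model is a stand-in, and the exact settings applied to the YOLO and MTO models may differ.

```python
import torch
import torchvision

# Autotune convolution kernels when input shapes are fixed across batches.
torch.backends.cudnn.benchmark = True
# Allow TF32 math on Ampere and newer GPUs for faster matmuls/convs.
torch.backends.cuda.matmul.allow_tf32 = True
torch.backends.cudnn.allow_tf32 = True

# Stand-in for a detection model; FP16 halves memory and speeds up inference.
model = torchvision.models.resnet18().cuda().half().eval()
images = torch.randn(8, 3, 224, 224, device="cuda", dtype=torch.half)

with torch.inference_mode():  # disables autograd bookkeeping entirely
    out = model(images)
```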