Krishnatheja Vanka

Machine Learning Engineer with a strong focus on computer vision, generative AI, and deep learning. I specialize in building and deploying end-to-end ML solutions—from data curation and model training to real-world deployment using PyTorch, AWS, and modern MLOps tools. Currently at Lytx, I work on visual models for challenging driving conditions and fatigue detection. Previously built intelligent vision systems in manufacturing and healthcare domains. Active open-source contributor (PyTorch Lightning, TorchMetrics) and community mentor in AI/ML spaces.

Role
Applied Scientist II
Years of Experience
5.1 years
Professional Portfolio
View here

Skillsets

C++
vision-language models
Torchmetrics
TensorRT
PyTorch Lightning
OpenCV
ONNX
MLFlow
Labelbox
Git
Generative AI
foundation models
Flask
FastAPI
Computer Vision
Artificial Intelligence
Albumentations
Vision Transformers
Transformers
Semi-Supervised Learning
self-supervised learning
PyTorch
Python
object detection
image segmentation
Image Classification
Docker
Distributed Training
Bash
AWS

Professional Summary

5.1Years

May, 2023 - Present3 yr 2 months
Expert & Reviewer
Packt
Oct, 2025 - Jan, 2026 3 months
Applied Scientist II
SFL
Dec, 2023 - Jul, 20251 yr 7 months
Machine Learning Engineer II
Lytx, Inc.
Feb, 2018 - Aug, 2018 6 months
Intern
Toprankers
Aug, 2019 - Jul, 20211 yr 11 months
Data Scientist
WayCool Foods
Jul, 2021 - Jul, 20221 yr
Data Scientist
Mareana

Applications & Tools Known

PyTorch
Scikit-Learn
OpenCV
Flask
FastAPI
PostgreSQL
Git

Work History

5.1Years

Expert & Reviewer

Packt

May, 2023 - Present3 yr 2 months

Help the discord community by solving queries related to Machine Learning and Deep Learning. Interact with fellow experts and share knowledge with each other while conveying the same to the members of the community.

Applied Scientist II

SFL

Oct, 2025 - Jan, 2026 3 months

Owned the end-to-end training lifecycle, including processing raw sports videos, generating and validating annotations using AWS Ground Truth, and fine-tuning and evaluating YOLO-v9, RT-DETR, and Grounding DINO to improve detection accuracy under diverse camera angles, occlusions, and lighting conditions. Applied and benchmarked multi-object tracking algorithms using BoxMot to maintain consistent player identities in highly occluded and crowded scenes, analyzing trade-offs between ID stability and computational cost. Designed and deployed scalable ML inference pipelines on AWS using ECS for service orchestration, Amazon ECR for containerized model artifacts, SageMaker for managed, asynchronous model serving, and MLflow for experiment tracking and model lifecycle management.

Machine Learning Engineer II

Lytx, Inc.

Dec, 2023 - Jul, 20251 yr 7 months

Conducted a comprehensive literature review, developed and trained a model from scratch, tested and deployed weather classification models, achieving >98% precision and recall, enhancing visual condition recognition for edge-based safety systems. Applied research-driven active learning methods (e.g., Influence Selection) to reduce data labeling effort by 40%, while maintaining >90% across all key metrics (accuracy, precision, recall, F1). Leveraged distillation and representation learning models such as DINOv2 and CLIP, integrated with ScaNN and One-vs-All retrieval, to automate dataset curation for weakly labeled and unlabeled data. Led end-to-end research and development of a driver fatigue detection module using a MobileNetV2 + GRU architecture; achieved 94%+ precision/recall for real-time yawn detection across diverse lighting and camera angles. Architected distributed training infrastructure across multi-node, multi-GPU clusters to scale training on 10B+ image datasets, enabling the development of proprietary vision models. Accelerated model development cycle by 15x by converting legacy Keras/TensorFlow pipelines into modular PyTorch Lightning framework, significantly improving reproducibility and maintainability. Routinely worked with AWS infrastructure (S3, EC2, SageMaker, EKS, ECR) to support scalable training, deployment, and monitoring of production models.

Data Scientist

Mareana

Jul, 2021 - Jul, 20221 yr

Developed a pytorch based framework with configurable training pipelines for object detection, segmentation, and vision transformers within Mareana's Manufacturing Intelligence Hub, reducing manual setup time by 70%. Researched and trained a transformer-based model to convert chemical equation images into corresponding formulas. Applied self-supervised learning techniques (DINO, Segment Anything) to reduce annotation cost by 40%, accelerating dataset preparation for downstream tasks. Optimized model inference latency by 3x by converting TorchScript models to TensorRT via ONNX, enabling real-time performance in production environments. Developed and maintained RESTful APIs using Flask and FastAPI for seamless integration of ML models into web-based applications and internal tools. Mentored junior ML engineers, guiding them through complete ML pipeline development including data acquisition, annotation (Labelbox), model prototyping, evaluation, and deployment best practices.

Data Scientist

WayCool Foods

Aug, 2019 - Jul, 20211 yr 11 months

Developed image-processing prototypes that secured funding during Lightbox investor meetings, showcasing real-world applications of computer vision in agri-supply chains. Designed and deployed de-identification models to censor license plates and faces, enabling the rollout of AI-powered distribution centers while maintaining compliance with data privacy standards. Implemented YOLOv4-based object counting system to track vegetable quantities on conveyor belts, improving inventory accuracy by >85%. Built and managed end-to-end computer vision pipelines using Labelbox for annotation and MLFlow for model tracking and reproducibility. Applied quantization and pruning techniques to reduce deep learning model sizes by up to 60%, enabling low-latency inference on edge hardware.

Intern

Toprankers

Feb, 2018 - Aug, 2018 6 months

Major Projects

3Projects

Cephalometric X-Ray

Built a deep learning pipeline to parse cephalometric X-ray images and extract anatomical landmarks using CNNs and YOLOv5 for high-confidence ROI extraction. Consolidated 19 separate models into a unified network, significantly improving inference efficiency and achieving landmark localization with MSE < 0.1. Reproduced and implemented a research paper on medical image localization, adapting it to real-world dental imaging datasets for high-precision analysis.

Stable diffusion In-painting

Developed an ensemble image inpainting model combining CLIP, Variational AutoEncoder (VAE), and U-Net, resulting in high-quality, artifact-free restoration of product images. Implemented textual inversion techniques using RunwayML training scripts to generate context-aware backgrounds, enhancing visual coherence and aesthetics. Achieved visually consistent, outlier-free outputs suitable for e-commerce and catalog use cases.

Robot Collision Detection

Developed an autonomous exploration script to identify objects and navigable paths within a simulated robotics environment, leveraging custom scenario data provided by the instructor. Trained a CNN-based collision avoidance model under the mentorship of Prof. Heni Ben Amor, enabling safe path planning and real-time obstacle detection in constrained environments.

Education

Master of Computer Science
Arizona State University (2022)
Bachelor of Engineering in Computer Science & Engineering (CSE)
Visvesvaraya Technological University (2018)

Certifications

Deeplearning.ai tensorflow developer
Machine learning by stanford university

Krishnatheja Vanka

Applied Scientist II

5.1 years

View here

Skillsets

Professional Summary

Applications & Tools Known

Work History

Expert & Reviewer

Applied Scientist II

Machine Learning Engineer II

Data Scientist

Data Scientist

Intern

Major Projects

Cephalometric X-Ray

Stable diffusion In-painting

Robot Collision Detection

Education

Master of Computer Science

Bachelor of Engineering in Computer Science & Engineering (CSE)

Certifications

Deeplearning.ai tensorflow developer

Machine learning by stanford university