profile-pic

Krishnatheja Vanka

Machine Learning Engineer with a strong focus on computer vision, generative AI, and deep learning. I specialize in building and deploying end-to-end ML solutions—from data curation and model training to real-world deployment using PyTorch, AWS, and modern MLOps tools. Currently at Lytx, I work on visual models for challenging driving conditions and fatigue detection. Previously built intelligent vision systems in manufacturing and healthcare domains. Active open-source contributor (PyTorch Lightning, TorchMetrics) and community mentor in AI/ML spaces.

  • Role

    Applied Scientist II

  • Years of Experience

    8.17 years

  • Professional Portfolio

    View here

Skillsets

  • C++
  • vision-language models
  • Torchmetrics
  • TensorRT
  • PyTorch Lightning
  • OpenCV
  • ONNX
  • MLFlow
  • Labelbox
  • Git
  • Generative AI
  • foundation models
  • Flask
  • FastAPI
  • Computer Vision
  • Artificial Intelligence
  • Albumentations
  • Vision Transformers
  • Transformers
  • Semi-Supervised Learning
  • self-supervised learning
  • PyTorch
  • Python
  • object detection
  • image segmentation
  • Image Classification
  • Docker
  • Distributed Training
  • Bash
  • AWS

Professional Summary

8.17Years
  • May, 2023 - Present2 yr 11 months

    Expert & Reviewer

    Packt
  • Oct, 2025 - Jan, 2026 3 months

    Applied Scientist II

    SFL
  • Dec, 2023 - Jul, 20251 yr 7 months

    Machine Learning Engineer II

    Lytx, Inc.
  • Feb, 2018 - Aug, 2018 6 months

    Intern

    Toprankers
  • Aug, 2019 - Jul, 20211 yr 11 months

    Data Scientist

    WayCool Foods
  • Jul, 2021 - Jul, 20221 yr

    Data Scientist

    Mareana

Applications & Tools Known

  • icon-tool

    PyTorch

  • icon-tool

    Scikit-Learn

  • icon-tool

    OpenCV

  • icon-tool

    Flask

  • icon-tool

    FastAPI

  • icon-tool

    PostgreSQL

  • icon-tool

    Git

Work History

8.17Years

Expert & Reviewer

Packt
May, 2023 - Present2 yr 11 months
    Help the discord community by solving queries related to Machine Learning and Deep Learning. Interact with fellow experts and share knowledge with each other while conveying the same to the members of the community.

Applied Scientist II

SFL
Oct, 2025 - Jan, 2026 3 months
    Owned the end-to-end training lifecycle, including processing raw sports videos, generating and validating annotations using AWS Ground Truth, and fine-tuning and evaluating YOLO-v9, RT-DETR, and Grounding DINO to improve detection accuracy under diverse camera angles, occlusions, and lighting conditions. Applied and benchmarked multi-object tracking algorithms using BoxMot to maintain consistent player identities in highly occluded and crowded scenes, analyzing trade-offs between ID stability and computational cost. Designed and deployed scalable ML inference pipelines on AWS using ECS for service orchestration, Amazon ECR for containerized model artifacts, SageMaker for managed, asynchronous model serving, and MLflow for experiment tracking and model lifecycle management.

Machine Learning Engineer II

Lytx, Inc.
Dec, 2023 - Jul, 20251 yr 7 months
    Conducted a comprehensive literature review, developed and trained a model from scratch, tested and deployed weather classification models, achieving >98% precision and recall, enhancing visual condition recognition for edge-based safety systems. Applied research-driven active learning methods (e.g., Influence Selection) to reduce data labeling effort by 40%, while maintaining >90% across all key metrics (accuracy, precision, recall, F1). Leveraged distillation and representation learning models such as DINOv2 and CLIP, integrated with ScaNN and One-vs-All retrieval, to automate dataset curation for weakly labeled and unlabeled data. Led end-to-end research and development of a driver fatigue detection module using a MobileNetV2 + GRU architecture; achieved 94%+ precision/recall for real-time yawn detection across diverse lighting and camera angles. Architected distributed training infrastructure across multi-node, multi-GPU clusters to scale training on 10B+ image datasets, enabling the development of proprietary vision models. Accelerated model development cycle by 15x by converting legacy Keras/TensorFlow pipelines into modular PyTorch Lightning framework, significantly improving reproducibility and maintainability. Routinely worked with AWS infrastructure (S3, EC2, SageMaker, EKS, ECR) to support scalable training, deployment, and monitoring of production models.

Data Scientist

Mareana
Jul, 2021 - Jul, 20221 yr
    Developed a pytorch based framework with configurable training pipelines for object detection, segmentation, and vision transformers within Mareana's Manufacturing Intelligence Hub, reducing manual setup time by 70%. Researched and trained a transformer-based model to convert chemical equation images into corresponding formulas. Applied self-supervised learning techniques (DINO, Segment Anything) to reduce annotation cost by 40%, accelerating dataset preparation for downstream tasks. Optimized model inference latency by 3x by converting TorchScript models to TensorRT via ONNX, enabling real-time performance in production environments. Developed and maintained RESTful APIs using Flask and FastAPI for seamless integration of ML models into web-based applications and internal tools. Mentored junior ML engineers, guiding them through complete ML pipeline development including data acquisition, annotation (Labelbox), model prototyping, evaluation, and deployment best practices.

Data Scientist

WayCool Foods
Aug, 2019 - Jul, 20211 yr 11 months
    Developed image-processing prototypes that secured funding during Lightbox investor meetings, showcasing real-world applications of computer vision in agri-supply chains. Designed and deployed de-identification models to censor license plates and faces, enabling the rollout of AI-powered distribution centers while maintaining compliance with data privacy standards. Implemented YOLOv4-based object counting system to track vegetable quantities on conveyor belts, improving inventory accuracy by >85%. Built and managed end-to-end computer vision pipelines using Labelbox for annotation and MLFlow for model tracking and reproducibility. Applied quantization and pruning techniques to reduce deep learning model sizes by up to 60%, enabling low-latency inference on edge hardware.

Intern

Toprankers
Feb, 2018 - Aug, 2018 6 months

Major Projects

3Projects

Cephalometric X-Ray

    Built a deep learning pipeline to parse cephalometric X-ray images and extract anatomical landmarks using CNNs and YOLOv5 for high-confidence ROI extraction. Consolidated 19 separate models into a unified network, significantly improving inference efficiency and achieving landmark localization with MSE < 0.1. Reproduced and implemented a research paper on medical image localization, adapting it to real-world dental imaging datasets for high-precision analysis.

Stable diffusion In-painting

    Developed an ensemble image inpainting model combining CLIP, Variational AutoEncoder (VAE), and U-Net, resulting in high-quality, artifact-free restoration of product images. Implemented textual inversion techniques using RunwayML training scripts to generate context-aware backgrounds, enhancing visual coherence and aesthetics. Achieved visually consistent, outlier-free outputs suitable for e-commerce and catalog use cases.

Robot Collision Detection

    Developed an autonomous exploration script to identify objects and navigable paths within a simulated robotics environment, leveraging custom scenario data provided by the instructor. Trained a CNN-based collision avoidance model under the mentorship of Prof. Heni Ben Amor, enabling safe path planning and real-time obstacle detection in constrained environments.

Education

  • Master of Computer Science

    Arizona State University (2022)
  • Bachelor of Engineering in Computer Science & Engineering (CSE)

    Visvesvaraya Technological University (2018)

Certifications

  • Deeplearning.ai tensorflow developer

  • Machine learning by stanford university