Shubham Songire

Data Science professional with extensive experience in generative AI, computer vision, machine learning, deep learning, NLP, and Robot Operating System (ROS). I have a proven track record in developing end-to-end AI projects.
  • Role

    Machine Learning Engineer

  • Years of Experience

    6.17 years

Skillsets

  • LangGraph
  • Flask
  • Kubernetes
  • LangChain
  • MLflow
  • OpenAI
  • Postman
  • Pinecone
  • HuggingFace
  • FastAPI
  • OpenCV
  • OpenSearch
  • RAG
  • spaCy
  • Azure DevOps
  • CI/CD
  • Power BI
  • VS Code
  • Python
  • BigQuery
  • GCP
  • GitHub
  • Keras
  • MongoDB
  • MySQL
  • NumPy
  • pandas
  • AWS
  • PyTorch
  • Scikit-learn
  • SQL
  • Tableau
  • TensorFlow
  • Azure
  • Docker

Professional Summary

6.17 Years
  • Jan, 2026 - Present 3 months

    Senior Software Engineer, AI/ML

    Rakuten Symphony
  • Aug, 2023 - Dec, 2025 2 yr 4 months

    Machine Learning Engineer

    Eli Lilly and Company
  • Apr, 2023 - Jun, 2023 2 months

    Machine Learning Intern

    IDfy
  • Oct, 2020 - Dec, 2020 2 months

    Campus Ambassador

    INDIAN ROBOTICS COMMUNITY
  • Jul, 2022 - Mar, 2023 8 months

    Data Scientist

    Freelancer.com
  • Aug, 2022 - Mar, 2023 7 months

    Research And Development Intern

    DRDO, Ministry of Defence, Govt. of India
  • Jul, 2020 - Jul, 2022 2 yr

    AI Engineer Intern

    Mass Technologies

Work History

6.17 Years

Senior Software Engineer, AI/ML

Rakuten Symphony
Jan, 2026 - Present 3 months

Machine Learning Engineer

Eli Lilly and Company
Aug, 2023 - Dec, 2025 2 yr 4 months
    • Architected and deployed an end-to-end enterprise RAG system for 10,000+ complex clinical study documents using LangChain, OpenAI GPT-4o, and OpenSearch on AWS, enabling contextual Q&A, document comparison, and reference-backed summarization to accelerate internal research workflows (evaluated using RAGAs).
    • Reduced clinical documentation time by 60% by developing an LLM-powered Validation Plan Generator (Azure OpenAI Claude 3.5), cutting PSR (Periodic Safety Report) generation time from 11 weeks to 4 weeks.
    • Developed a time series forecasting model to project Verzenio sales trends and integrated it with a live dashboard used by leadership for strategic planning and market analysis.
    • Built and deployed end-to-end machine learning models on Microsoft Fabric, using Dataflows Gen2 and Lakehouse for centralized data ingestion and transformation, and Synapse Data Science Notebooks for feature engineering and model training with Python, pandas, scikit-learn, and PyTorch.
    • Used MLflow for experiment tracking, model versioning, and registration, and deployed models as Fabric endpoints to serve REST APIs and integrate with Power BI dashboards for real-time analytics.
    • Implemented automated retraining pipelines using Fabric Data Pipelines to keep models continuously updated and maintain accuracy in production.
    • Won Lilly's Best Project Award for a team project and received the Best Individual Achiever Award for Q1 2024.

Machine Learning Intern

IDfy
Apr, 2023 - Jun, 2023 2 months
    Developed a computer vision solution using the YOLO architecture with optimized thresholds to improve Face Quality Assessment (FQA) accuracy in low-light conditions. Achieved an 8% improvement in FQA effectiveness in low-light environments, which in turn improved the accuracy of the spoof detection algorithm's guardrails.

Research And Development Intern

DRDO, Ministry of Defence, Govt. of India
Aug, 2022 - Mar, 2023 7 months
    Developed an AI-driven border surveillance and autonomous target tracking system using YOLOv7, DeepSORT, and LiDAR-based SLAM, enabling real-time human detection, tracking, and coordinate-based navigation in defense zones. Integrated camera and LiDAR modules on an NVIDIA Orin board and synchronized tracking data with an autonomous navigation vehicle through ROS2 (NAV2) for real-time pursuit and control-room monitoring. Tech Stack: Python, YOLOv7, DeepSORT, LiDAR, ROS2 (NAV2), GraspNet, NVIDIA Orin, OpenCV.

Data Scientist

Freelancer.com
Jul, 2022 - Mar, 2023 8 months

Campus Ambassador

INDIAN ROBOTICS COMMUNITY
Oct, 2020 - Dec, 2020 2 months
    Participated in and promoted activities conducted by IRC at our college.

AI Engineer Intern

Mass Technologies
Jul, 2020 - Jul, 2022 2 yr
    Worked for 2 years on data science projects, developing multiple AI solutions across healthcare, education, and research domains, including tumor detection from clinical images using CNNs and sentiment analysis using BERT. Built graph interpretation models to extract data from charts (bar, pie, histogram) and generate natural language summaries using computer vision and NLP techniques. Implemented image-to-text generation pipelines combining ResNet for visual feature extraction and NLP models for sentence formation, delivering insights for academic and industry use cases.

Major Projects

2 Projects

Clinchat Clinical AI Assistant

    Architected an enterprise-grade RAG system for contextual Q&A over complex, unstructured clinical trial data. Engineered a scalable NLP pipeline leveraging LangChain, OpenAI embeddings, FAISS, and OpenSearch, with advanced parsing of textual and tabular data for high-precision semantic retrieval.

LiDAR & Computer Vision-Based Autonomous Target Tracking System

    Worked on hardware-software integration to run an autonomous navigation vehicle in an open environment using LiDAR point clouds and SLAM, detecting the target and tracking it with algorithms such as YOLO and DeepSORT. GraspNet and ROS2 are used to pick up objects as instructed from the control room.

Education

  • Master of Technology, AI & ML - Distance Learning Program

    Birla Institute of Technology and Science (BITS), Pilani
  • Bachelor of Engineering, Computer Engineering

    Bharati Vidyapeeth's College of Engineering (2023)