profile-pic

Antara Raman Sahay

I’m a Generative AI engineer focused on building production-grade LLM and multi-agent systems. My work spans designing RAG pipelines, optimizing agent orchestration, and deploying scalable AI services using modern MLOps and LLMOps practices.

I’ve developed solutions such as automated code review systems, customer-facing multi-agent QA platforms, and NLP-driven analytics dashboards that translate natural language into real-time insights. I enjoy solving complex problems at the intersection of AI systems design, model optimization, and real-world product deployment.

  • Role

    Gen AI Engineer

  • Years of Experience

    16.92 years

Skillsets

  • Jenkins
  • Parlant
  • vLLM
  • Unsloth
  • Transformers
  • TensorFlow
  • Streamlit
  • SQL
  • Scikit-learn
  • PyTorch
  • Python
  • Pocketbase
  • OpenCV
  • MLFlow
  • LangChain
  • Accelerate
  • HuggingFace
  • Hatchet
  • Google Cloud Platform
  • Git
  • GCP
  • Flask
  • FastAPI
  • Docker
  • crewAI
  • Bitbucket
  • Bash
  • Azure
  • Ag2

Professional Summary

16.92Years
  • Nov, 2020 - Present5 yr 5 months

    Drafter-International Summit

    Invertis University
  • Aug, 2025 - Dec, 2025 4 months

    Gen AI Engineer

    TestZeus
  • Nov, 2024 - Aug, 2025 9 months

    Data Science Mentor

    The Skillians
  • Sep, 2022 - Jul, 2023 10 months

    President

    I-Tech (The Technical Club)
  • Aug, 2023 - Nov, 2023 3 months

    AI Researcher

    The Mtrench
  • Mar, 2024 - Aug, 20251 yr 5 months

    Software Engineer Trainee

    H&P
  • Sep, 2021 - Aug, 2022 11 months

    Joint Secretary

    I-Tech (The Technical Club)
  • Event Host

Work History

16.92Years

Drafter-International Summit

Invertis University
Nov, 2020 - Present5 yr 5 months
    For the 2 day International Summit-Economy and Leadership, I was appointed as a drafter for the distinguished speakers at the summit.

Gen AI Engineer

TestZeus
Aug, 2025 - Dec, 2025 4 months
    Optimized multi-agent system for software test case execution, reducing execution time by 40% via caching-based replay and mode-switching strategies. Architected a customer-facing multi-agent QA chatbot suite, enabling production-ready test suite generation with adaptive memory, multi-modal ingestion, and ARQ-based guardrails.

Data Science Mentor

The Skillians
Nov, 2024 - Aug, 2025 9 months
    Delivered live interactive sessions on core and advanced Data Science topics including Python, SQL, Machine Learning, Deep Learning (NLP, Computer Vision, Time Series), and MLOps.

Software Engineer Trainee

H&P
Mar, 2024 - Aug, 20251 yr 5 months
    Developed an AI-powered code review system, reducing manual review and release delays. Built NLP-powered dashboard for drilling data analysis and AI-powered summaries. Structured and optimized OCR pipeline for extracting structured data from complex PDFs using hybrid OCR and layout-aware models. Architected custom Graph RAG pipeline and fine-tuned Small Language Model for enterprise data retrieval. Engineered Jenkins Shared Library PoC for ML and LLM pipelines.

AI Researcher

The Mtrench
Aug, 2023 - Nov, 2023 3 months
    Researched, conceptualized, and deployed advanced LLM architectures for business challenges. Processed and visualized large unstructured datasets, enhancing decision-making efficiency by 25%. Delivered scalable AI/ML solutions, reducing processing time by 20% and improving model accuracy.

President

I-Tech (The Technical Club)
Sep, 2022 - Jul, 2023 10 months

Joint Secretary

I-Tech (The Technical Club)
Sep, 2021 - Aug, 2022 11 months

Event Host

    For my college fest(Invertia-2019) I was part of the Comparing Committee, where I was help responsible for 2 major event as a Host

Major Projects

1Projects

Video Analysis Agent

    Implemented a multimodal video analysis agent PoC to verify autonomous UI test executions by comparing planning logs, execution videos, and final test reports. Used Qwen2-VL-7B for OCR and UI captioning, and LLM-based step matching for validation. Generated structured deviation reports using Azure GPT-4o, enabling automated detection of missed steps and reducing manual review overhead.

Education

  • B.Tech in Computer Science

    Invertis University (2023)