profile-pic

Debanirmalya Patra

I’m an AI + backend engineer with ~3 years of production experience building LLM-powered systems at scale, across a govtech platform serving 100,000+ facilities and a high-volume voice AI startup.

My work sits at the systems layer of AI: retrieval pipelines, LLM tool orchestration, multimodal fallbacks, and concurrency-safe real-time infrastructure. I focus on building systems that remain reliable under noise, latency, and ambiguity not just ideal conditions.

At the National Health Authority, I built a RAG system over 5,000+ parliamentary records using a hybrid BM25 + FAISS retrieval stack, query decomposition, and a self-correction loop reducing research time by 40% and improving retrieval precision by 80%. I also contributed to a national-scale platform tracking 100,000+ healthcare facilities, which cleared a full security audit.

At SquadStack, I built a multimodal recovery pipeline (Voice → WhatsApp) that rehydrated LLM context from webhook-captured input reducing low-confidence failures by 40%. I also designed a self-registering integration framework (decorator-based registry), cutting partner onboarding time from 2–3 days to under 4 hours.

I evaluate systems using RAGAs, instrument for latency and failure modes, and design with idempotency and observability in mind.

  • Role

    Forward Deployed Engineer

  • Years of Experience

    3 years

  • Professional Portfolio

    View here

Skillsets

  • LangGraph
  • Spring Boot
  • GitHub Actions
  • Transformers
  • SQL
  • Redux
  • Redis
  • react
  • Python
  • PostgreSQL
  • Next.js
  • LLMs
  • Linux
  • AWS
  • LangChain
  • JavaScript
  • Java
  • HuggingFace
  • Git
  • Flask
  • FastAPI
  • FAISS
  • Django
  • Cypress
  • Chroma
  • C++

Professional Summary

3Years
  • Feb, 2026 - Apr, 2026 2 months

    Forward Deployed Engineer

    SquadStack.ai
  • Jul, 2023 - Jan, 20262 yr 6 months

    Software Engineer

    National Health

Work History

3Years

Forward Deployed Engineer

SquadStack.ai
Feb, 2026 - Apr, 2026 2 months
    Architected a stateless Live Call Transfer routing system using a decorator-based handler registry, enabling self-registering integrations with zero core-logic changes; reduced onboarding time from 23 days to 1-2 hours. Built a multi-modal recovery pipeline (Voice WhatsApp) as an LLM tool call, capturing typed user input via webhooks and rehydrating LLM context; reduced low-confidence failures by 40%. Proposed and implemented a batch transcription pipeline over offline call recordings to recover missed entities from noisy audio, improving data accuracy by 20%+ in downstream verification workflows. Improved STT accuracy by 30%+ using Deepgram Keyterm Prompting (Nova-3), injecting domain-specific vocabulary to increase keyword recall and reduce transcription errors in downstream LLM workflows. Enforced concurrency safety using Redis locks and idempotency controls, reducing duplicate executions by 70%.

Software Engineer

National Health
Jul, 2023 - Jan, 20262 yr 6 months
    Agentic AI Parliamentary Q&A System: Engineered an end-to-end RAG system using LangChain to query a corpus of 5000+ parliamentary records, reducing manual research time for policy analysts by over 40%. Implemented a multi-step retrieval pipeline leveraging query deconstruction, boosting factual accuracy by 30%. Designed a hybrid multi-tool framework using BM25 and FAISS, that dynamically selects from 3+ retrieval methods (filtering) and incorporates a self-correction loop, enhancing retrieval precision by 80%. Deployed using FastAPI, Next.js and AWS, delivering citation-linked answers with 70% faster research turnaround. Improved performance by 25% through rigorous evaluation of faithfulness and answer relevance via RAGAs. ABDM Microsite: Spearheaded the development of a high-performance portal using Django, Next.js and Redux streamlining registration tracking for more than 100k facilities and professionals and obtained Web Application Security Audit clearance. Designed robust RESTful APIs for comprehensive analytics and reporting features, driving 30% increase in adoption, optimized monetary rewards disbursement, and eliminating over 100 man-hours in oversight efforts. Boosted page load performance by 90% through frontend code-splitting with Next.js SSR, while designing role based access control (RBAC) for 4 user levels and conditionally rendering tailored interfaces. Integrated automated Cypress E2E testing into CI/CD pipeline, achieving 80% pre-production defect detection.

Major Projects

1Projects

PayFlow Platform

    Architected a full-stack multi-role payment management system reducing approval turnaround time by 40%. Built scalable state management using Zustand with a type-safe Supabase client, improving maintainability. Automated scheduled and recurring payments using serverless Edge Functions in Deno invoking PostgreSQL RPCs, reducing manual workload by 15+ hours per week while ensuring reliable, atomic transaction processing. Enforced enterprise-grade RBAC with protected routes and middleware validation, across 3 distinct user roles. Created a financial reporting pipeline with Excel export integration, streamlining audit process by 60%.

Education

  • Civil Engineering

    Indian Institute of Technology (BHU) Varanasi (2023)
  • Class XII

    Jawahar Navodaya Rangareddy, Hyderabad (2018)