Debanirmalya Patra

I’m an AI + backend engineer with ~3 years of production experience building LLM-powered systems at scale, across a govtech platform serving 100,000+ facilities and a high-volume voice AI startup.

My work sits at the systems layer of AI: retrieval pipelines, LLM tool orchestration, multimodal fallbacks, and concurrency-safe real-time infrastructure. I focus on building systems that remain reliable under noise, latency, and ambiguity, not just ideal conditions.

At the National Health Authority, I built a RAG system over 5,000+ parliamentary records using a hybrid BM25 + FAISS retrieval stack, query decomposition, and a self-correction loop, reducing research time by 40% and improving retrieval precision by 80%. I also contributed to a national-scale platform tracking 100,000+ healthcare facilities, which cleared a full security audit.

At SquadStack, I built a multimodal recovery pipeline (Voice → WhatsApp) that rehydrated LLM context from webhook-captured input, reducing low-confidence failures by 40%. I also designed a self-registering integration framework (decorator-based registry), cutting partner onboarding time from 2–3 days to under 4 hours.

I evaluate systems using RAGAs, instrument for latency and failure modes, and design with idempotency and observability in mind.

Role
Forward Deployed Engineer
Years of Experience
3 years
Professional Portfolio
View here

Skillsets

LangGraph
Spring Boot
GitHub Actions
Transformers
SQL
Redux
Redis
React
Python
PostgreSQL
Next.js
LLMs
Linux
AWS
LangChain
JavaScript
Java
HuggingFace
Git
Flask
FastAPI
FAISS
Django
Cypress
Chroma
C++

Professional Summary

3 Years

Feb, 2026 - Apr, 2026 2 months
Forward Deployed Engineer
SquadStack.ai
Jul, 2023 - Jan, 2026 2 yr 6 months
Software Engineer
National Health

Work History

3 Years

Forward Deployed Engineer

SquadStack.ai

Feb, 2026 - Apr, 2026 2 months

Architected a stateless Live Call Transfer routing system using a decorator-based handler registry, enabling self-registering integrations with zero core-logic changes; reduced onboarding time from 2-3 days to 1-2 hours.
Built a multimodal recovery pipeline (Voice → WhatsApp) as an LLM tool call, capturing typed user input via webhooks and rehydrating LLM context; reduced low-confidence failures by 40%.
Proposed and implemented a batch transcription pipeline over offline call recordings to recover missed entities from noisy audio, improving data accuracy by 20%+ in downstream verification workflows.
Improved STT accuracy by 30%+ using Deepgram Keyterm Prompting (Nova-3), injecting domain-specific vocabulary to increase keyword recall and reduce transcription errors in downstream LLM workflows.
Enforced concurrency safety using Redis locks and idempotency controls, reducing duplicate executions by 70%.
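As an illustration of the decorator-based handler registry pattern mentioned above, here is a minimal Python sketch. It is not the production code: the names `HANDLERS`, `register`, `route_transfer`, the partner keys, and the payload fields are all hypothetical, chosen only to show how a new integration can register itself without changes to the routing function.

```python
from typing import Callable, Dict

# Hypothetical registry: integration name -> handler function.
HANDLERS: Dict[str, Callable[[dict], dict]] = {}


def register(name: str):
    """Return a decorator that stores the handler under `name` when it is defined."""
    def decorator(func: Callable[[dict], dict]) -> Callable[[dict], dict]:
        if name in HANDLERS:
            raise ValueError(f"handler already registered for {name!r}")
        HANDLERS[name] = func
        return func
    return decorator


# Each integration registers itself; route_transfer below never changes.
@register("partner_a")
def transfer_partner_a(call: dict) -> dict:
    return {"route": "sip", "target": call["agent_id"]}


@register("partner_b")
def transfer_partner_b(call: dict) -> dict:
    return {"route": "pstn", "target": call["phone"]}


def route_transfer(partner: str, call: dict) -> dict:
    """Stateless dispatch: look up the handler by name and call it."""
    try:
        handler = HANDLERS[partner]
    except KeyError:
        raise LookupError(f"no handler registered for {partner!r}") from None
    return handler(call)
```

Under these assumptions, onboarding a partner means adding one decorated function in its own module; the dispatch code stays untouched.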

Software Engineer

National Health

Jul, 2023 - Jan, 2026 2 yr 6 months

Agentic AI Parliamentary Q&A System: Engineered an end-to-end RAG system using LangChain to query a corpus of 5,000+ parliamentary records, reducing manual research time for policy analysts by over 40%.
Implemented a multi-step retrieval pipeline leveraging query decomposition, boosting factual accuracy by 30%.
Designed a hybrid multi-tool framework using BM25 and FAISS that dynamically selects from 3+ retrieval methods (filtering) and incorporates a self-correction loop, enhancing retrieval precision by 80%.
Deployed using FastAPI, Next.js and AWS, delivering citation-linked answers with 70% faster research turnaround.
Improved performance by 25% through rigorous evaluation of faithfulness and answer relevance via RAGAs.

ABDM Microsite: Spearheaded the development of a high-performance portal using Django, Next.js and Redux, streamlining registration tracking for more than 100k facilities and professionals, and obtained Web Application Security Audit clearance.
Designed robust RESTful APIs for comprehensive analytics and reporting features, driving a 30% increase in adoption, optimizing monetary rewards disbursement, and eliminating over 100 man-hours of oversight effort.
Boosted page load performance by 90% through frontend code-splitting with Next.js SSR, and designed role-based access control (RBAC) for 4 user levels with conditionally rendered, tailored interfaces.
Integrated automated Cypress E2E testing into the CI/CD pipeline, achieving 80% pre-production defect detection.
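The hybrid BM25 and FAISS retrieval described for the Parliamentary Q&A system can be illustrated with a small sketch. The profile does not say how the two result lists were combined, so this example assumes reciprocal rank fusion, a common merging technique, purely for illustration. The function name, the document IDs, and the constant `k = 60` are hypothetical, and the two input lists stand in for real BM25 and FAISS results.

```python
from collections import defaultdict
from typing import Dict, List


def reciprocal_rank_fusion(rankings: List[List[str]], k: int = 60) -> List[str]:
    """Merge several ranked lists of document IDs into a single ranking.

    Each document scores 1 / (k + rank) for every list it appears in,
    so documents ranked well by more than one retriever rise to the top.
    """
    scores: Dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest score first; ties broken by document ID for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Stand-ins for a keyword (BM25) result list and a vector (FAISS) result list.
bm25_hits = ["q12", "q07", "q33"]
faiss_hits = ["q07", "q41", "q12"]

fused = reciprocal_rank_fusion([bm25_hits, faiss_hits])
print(fused)  # ['q07', 'q12', 'q41', 'q33']
```

Documents found by both retrievers (`q07`, `q12`) outrank those found by only one, which is the general idea behind combining lexical and vector search.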

Major Projects

1 Project

PayFlow Platform

Architected a full-stack multi-role payment management system, reducing approval turnaround time by 40%.
Built scalable state management using Zustand with a type-safe Supabase client, improving maintainability.
Automated scheduled and recurring payments using serverless Edge Functions in Deno invoking PostgreSQL RPCs, reducing manual workload by 15+ hours per week while ensuring reliable, atomic transaction processing.
Enforced enterprise-grade RBAC with protected routes and middleware validation across 3 distinct user roles.
Created a financial reporting pipeline with Excel export integration, streamlining the audit process by 60%.

Education

Civil Engineering
Indian Institute of Technology (BHU) Varanasi (2023)
Class XII
Jawahar Navodaya Rangareddy, Hyderabad (2018)
