
I’m an AI + backend engineer with ~3 years of production experience building LLM-powered systems at scale, across a govtech platform serving 100,000+ facilities and a high-volume voice AI startup.
My work sits at the systems layer of AI: retrieval pipelines, LLM tool orchestration, multimodal fallbacks, and concurrency-safe real-time infrastructure. I focus on building systems that remain reliable under noise, latency, and ambiguity not just ideal conditions.
At the National Health Authority, I built a RAG system over 5,000+ parliamentary records using a hybrid BM25 + FAISS retrieval stack, query decomposition, and a self-correction loop reducing research time by 40% and improving retrieval precision by 80%. I also contributed to a national-scale platform tracking 100,000+ healthcare facilities, which cleared a full security audit.
At SquadStack, I built a multimodal recovery pipeline (Voice → WhatsApp) that rehydrated LLM context from webhook-captured input reducing low-confidence failures by 40%. I also designed a self-registering integration framework (decorator-based registry), cutting partner onboarding time from 2–3 days to under 4 hours.
I evaluate systems using RAGAs, instrument for latency and failure modes, and design with idempotency and observability in mind.
Forward Deployed Engineer
SquadStack.aiSoftware Engineer
National Health