Research Scientist
Saarthi.AIAug, 2021 - Jan, 20253 yr 5 months
Led end-to-end TTS research across Tacotron, FastSpeech, and HiFi-GAN, covered single-speaker, multi-speaker, and multilingual settings across 11 Indian languages at 5M calls/day. Built and deployed streaming ASR systems (DeepSpeech, Whisper, Kaldi), developed full NLU pipeline from data creation to cloud deployment on Azure and AWS. Distilled a large recommendation model into a compact on-device model deployed in production on Android inside a keyboard product for real-time content recommendation. Led a cross-functional team of engineers, linguists, and CUX designers across the full research-to-deployment lifecycle.