profile-pic

Saket Kumar

My passion for developing solutions stemsfrom my dedication to optimizing userexperiences and enhancing . Through my work on diverseprojects, such as creating a central LLMinteraction service with load balancing andbuilding high-performance backend systemsfor various applications, I aim to push theboundaries of innovation.
  • Role

    Lead AI Engineer

  • Years of Experience

    11.58 years

Skillsets

  • CDC
  • VM
  • webhook integration
  • Multi-environment orchestration
  • CI/CD
  • Cloud task queues
  • GCP
  • context management
  • GuardRails
  • LLM safety
  • Mem0
  • Policy-driven moderation
  • Memori
  • Long-term memory design
  • Session-level prompt orchestration
  • serverless computing
  • Debezium
  • Episodic memory
  • Federated search
  • Go
  • GraphRAG
  • Langfuse
  • LangGraph
  • LlamaIndex
  • Model context protocol
  • Multi-Agent Systems
  • Websockets
  • Self-healing ai
  • Structural drift mitigation
  • Federated memory
  • Data Modeling
  • FastAPI
  • Firebase
  • LangChain
  • Message Queues
  • OpenAI
  • Postgres
  • Python
  • Redis
  • SQL
  • API Design
  • API Design
  • Celery
  • Containerization
  • Celery
  • Django
  • Data Modeling
  • Distributed Systems
  • Docker
  • Grafana
  • llm prompt engineering
  • Ollama
  • llm prompt engineering
  • nginx
  • Qdrant
  • Observability
  • Ollama
  • Prometheus
  • Qdrant
  • Retrieval-Augmented Generation

Professional Summary

11.58Years
  • Apr, 2025 - Present1 yr

    LLM Engineer

    Opkey
  • Jul, 2022 - Apr, 20252 yr 9 months

    Lead SDE

    Proshort
  • Apr, 2020 - Jul, 20222 yr 3 months

    Senior Software Developer

    NSEIT LIMITED
  • Jul, 2014 - Jun, 20183 yr 11 months

    Independent Consultant

    Several
  • Aug, 2018 - Apr, 20201 yr 8 months

    Software Development Engineer

    eClerx

Work History

11.58Years

LLM Engineer

Opkey
Apr, 2025 - Present1 yr
    Reduced manual effort by 75%, architected a distributed real-time ERP workflow automation platform with event-driven architecture; designed scalable WebSocket/REST APIs with FastAPI, Apache Kafka & Redis; Langfuse for observability, and then wrote the whole thing in GO utilizing channels and Goroutines achieving 5x latency improvement. Designed & implemented scalable, high-performance Search Engine with ElasticSearch and LLM, enabling natural language retrieval across 100k records. Delivered robust API design (FastAPI) and a top-k most frequent results, all with sub-second response times & easy extensibility. Developed a central RAG system using FastAPI, MySQL, Qdrant, and Nomic embeddings, implementing Strategy/Factory patterns for modular architecture, with Docker containerization and Kubernetes deployment, enabling scalable semantic search and easy addition of new models/parsers. Architected a robust, policy-driven prompt guardrailing framework for multi-agent systems, combining regex, heuristic, and LLM-based detectors with per-agent configurations to vet inputs/outputs and enforce enterprise-grade safety and compliance. Implemented long-term agent memory using Mem0, Memori, and custom session-level base prompts/chat threading, delivering safer, more context-rich interactions, reducing hallucinations, and materially improving answer quality over time.

Lead SDE

Proshort
Jul, 2022 - Apr, 20252 yr 9 months
    Architected central LLM interaction service supporting multiple language models with intelligent load balancing, ensuring 99.9% service availability. Pioneered specialized prompt repository enabling dynamic prompt selection across services, improving code maintenance efficiency by 4x. Engineered shorts-creation and video-summarizing module, accelerating production time by 5x while achieving industry-leading 60% publishable rate. Co-developed text-to-video engine generating multilingual videos at scale from raw text, serving 100k+ unique users and enterprise clients. Built proshorts web & mobile application from ground up, supporting 10k+ DAU with Firebase-powered authentication. Orchestrated end-to-end Payment Module using Stripe, managing complex subscription lifecycles and transaction flows involving 10k+ users & 100s of transactions per week. Unified Payment Module across diverse platforms (proshort+, L&D platform, video consumption platform) through centralized user identity management. Delivered data-intensive CRM integration module synchronizing deals, contacts, companies, and emails via event-driven architecture using queues, Pub-Sub patterns, strategic caching, and automated schedulers.

Senior Software Developer

NSEIT LIMITED
Apr, 2020 - Jul, 20222 yr 3 months
    AI Proctoring for remote examinations with OpenCV/Deep Learning. Worked on Anomaly Detection on NSEs financial data with DBSCAN and Greenplum in order-trade data. Text Extraction for eKYC on PDF/image with success on over 75% docs with AXIS bank as client. Led development of two websites on Django as Team Lead for naviinsurance.com directly consulting with client, "exceeding their expectations" on both yearly appraisals.

Software Development Engineer

eClerx
Aug, 2018 - Apr, 20201 yr 8 months
    Sales Forecast with 86% accuracy using RNN. Developed/Enhanced Flow based chatbots on RASA and Chatterbot, deployed with Flask. Chatbot enhancements using BERT.

Independent Consultant

Several
Jul, 2014 - Jun, 20183 yr 11 months
    Improved customer engagement by 40%+ for columbiawineco.com introducing Wine Ratings. Added Food Recommendation (NLP) based on grapes in wine. Developed NLP-driven summarization algorithm using Python and NLTK to condense customer reviews into key insights, featured on the website; increased click-through rate on product pages by 25%. Predicted Manpower requirement with 91% accuracy.

Major Projects

1Projects

A Zero-Day Resistant Malware Detection Method for Securing Cloud Using SVM and Sandboxing Techniques

    Three step pipeline for zero-day malware detection with ML and networking scripts. Compared the results to the benchmarks from various anti-virus software. Achieved over 80% accuracy with 35% less data.

Education

  • M.Tech., Computer Engineering

    NIT, Kurukshetra (2018)
  • B.Tech., Computer Science and Engineering

    BCET, Durgapur (2014)