profile-pic

Sarthak Khandelwal

With 4 years of hands-on industry software development experience and a Masters degree in Computer Science and Engineering, I bring a passion for Cloud Computing, Machine Learning, and Full Stack Development to contribute effectively to a dynamic team environment.
  • Role

    Sr Software & FastAPI Engineer

  • Years of Experience

    4 years

Skillsets

  • Fastmcp
  • FastAPI
  • GitLab
  • Golang
  • Poetry
  • ABAC
  • Authentik
  • AWS S3
  • Aws serverless framework
  • ClickHouse
  • CSS
  • Websockets
  • HTML
  • Java
  • JavaScript
  • LangGraph
  • MongoDB
  • Postgres
  • Qdrant
  • Rbac
  • Redis
  • Vuejs
  • Jenkins
  • Flask - 3.0 Years
  • AWS - 3.0 Years
  • SQL - 2.0 Years
  • LangChain - 2.0 Years
  • Agile
  • Apache Kafka
  • Bitbucket
  • CI/CD
  • Docker
  • Go
  • Python - 4.0 Years
  • LLM
  • Microservice
  • NLP
  • PyTorch
  • react
  • TensorFlow
  • Twilio
  • Vector DB
  • Web Development

Professional Summary

4Years
  • Aug, 2025 - Present 7 months

    Senior Software Engineer

    Couture AI
  • Jan, 2025 - Aug, 2025 7 months

    Software Engineer

    Echelon IT
  • Feb, 2024 - Dec, 2024 10 months

    Research Engineer

    University at Buffalo
  • Jan, 2020 - Jun, 20211 yr 5 months

    Software Developer

    Iotron Technologies
  • Aug, 2021 - Jan, 2022 5 months

    Software Developer Engineer, Intern

    HighRadius
  • Jan, 2022 - Jan, 20231 yr

    Software Developer

    Bajaj Finserv

Applications & Tools Known

  • icon-tool

    Postgres

  • icon-tool

    MySQL

  • icon-tool

    MongoDB

  • icon-tool

    AWS S3

  • icon-tool

    Flask

  • icon-tool

    Kubernetes

  • icon-tool

    Docker

  • icon-tool

    FastAPI

  • icon-tool

    Django

Work History

4Years

Senior Software Engineer

Couture AI
Aug, 2025 - Present 7 months
    Product Embeddings Platform: Trend Management Console (Python, FastAPI, Qdrant, LangGraph, FastMCP) Redesigned data schema and migrated from PostgreSQL to Qdrant, reducing trend retrieval latency from 5s to 200ms. Built a Chat Agent using LangGraph to manage and visualize product trends through natural language commands. Implemented a Model Context Protocol (MCP) server using FastMCP to standardize agent tool usage and context retrieval. Retail Planning Platform: Analytics & Forecasting (Python, FastAPI, ClickHouse, Authentik) Migrated analytics database from PostgreSQL to ClickHouse, reducing dashboard query latency from 15s to sub-500ms. Implemented RBAC and ABAC authorization flows using Authentik to secure access to sensitive sales forecast data. Developed high-performance FastAPI endpoints to serve real-time visualization data for the retail planning dashboard. Search Backend: High-throughput eCommerce Search (Python, FastAPI, Qdrant, Redis, Kubernetes, Docker) Designed and implemented a search backend microservice for eCommerce using FastAPI and Python to serve relevance-ranked results. Integrated Qdrant for vector search and Redis for caching to achieve low-latency retrieval. Containerized with Docker and deployed on Kubernetes; sustained 1000 QPS with 200ms average response time.

Software Engineer

Echelon IT
Jan, 2025 - Aug, 2025 7 months
    Smart Assist: AI-Powered Call Support (Python, FastAPI, ECS, ALB, Docker, GitLab, OpenAI, Twilio, WebSockets, PostgreSQL) Built an AI-Powered call support system with OTP verification and human fallback, handling 100+ concurrent sessions with sub-500ms latency. Integrated FastAPI, Twilio Voice, WebSockets, and OpenAI streaming for real-time transcription and intent recognition, backed by PostgreSQL. Deployed a scalable WebSocket service on AWS ECS behind ALB, containerized with Docker and automated via GitLab CI/CD.

Research Engineer

University at Buffalo
Feb, 2024 - Dec, 2024 10 months
    3D Reconstruction Pipeline for Heritage Site Modeling (Python, PyTorch, AWS EC2, Docker, GitLab) Developed a scalable 3D reconstruction pipeline using Gaussian Splatting and PyTorch on AWS EC2, automating workflows via GitLab/Docker to process 1000+ images and reduce costs by 50%.

Software Developer

Bajaj Finserv
Jan, 2022 - Jan, 20231 yr
    LOKAL (Location Based Dealer Finder) (Python, SQL, MongoDB, Flask, Docker, Kubernetes, Jenkins) Migrated spatial querying from in-memory KDTree to MongoDB/SQL geospatial indexing, increasing service responsiveness by 20%. Developed a geolocation-based Flask microservice to optimize dealer availability searches within specified distances. Streamlined CI/CD with Jenkins and deployed containerized services using Docker and Kubernetes for scalable orchestration. Retrieval-Augmented Generation (RAG) for Design Documents (Python, Docker, VectorDB, AWS S3, AWS Lambda) Implemented a LangChain-based RAG tool with LLMs for role-based access control and retrieval of 1000+ documents. Reduced employee document search time by up to 50% with an LLM-powered chatbot for fast, accurate retrieval. Built an ETL pipeline microservice to ingest, process, store, and manage 1000+ documents using Python, S3, Lambda, and Pinecone. Deployed the system using Terraform, Docker, Kubernetes, and Jenkins with CI/CD pipelines for scalable and reliable operations.

Software Developer Engineer, Intern

HighRadius
Aug, 2021 - Jan, 2022 5 months
    Migrated data and transformed queries from SQL to Snowflake.

Software Developer

Iotron Technologies
Jan, 2020 - Jun, 20211 yr 5 months
    Built responsive Art Gallery website with React/JavaScript, integrated Razorpay payment gateway and ShipRocket shipping service.

Major Projects

5Projects

Smart Assist: AI-Powered Call Support

Jan, 2025 - Present1 yr 2 months
    Built AI-powered call support system with real-time transcription, intent recognition, and AI voice response.

3D Reconstruction Pipeline for Heritage Site Modeling

Feb, 2024 - Dec, 2024 10 months
    Developed cloud-scaled 3D reconstruction workflows for heritage site modeling, reducing costs and improving scalability.

LOKAL (Location Based Dealer Finder)

Jan, 2022 - Jan, 20231 yr
    Optimized geospatial microservices for dealer availability and responsive backend architecture.

Retrieval-Augmented Generation (RAG) for Design Documents

Jan, 2022 - Jan, 20231 yr
    Implemented LangChain-based RAG tool for efficient design document search and retrieval.

Speaker Identification

Jan, 2022 - Jan, 20231 yr
    Decreased transcript processing time using TensorFlow/Keras-based speaker identification with CI/CD automation.

Education

  • Masters in Computer Science and Engineering

    University at Buffalo SUNY
  • Bachelor of Technology, Computer Science and Engineering

    SRM Institute of Science and Technology

Certifications

  • Python bootcamp

  • Full stack web development (html|css|javascript)

  • Cloud computing (nptel)

  • Deep learning specialization