
Nihad Hassan

Machine Learning Engineer with nearly three years of hands-on experience developing and deploying machine learning models, with strong proficiency in data analysis, algorithm design, and model deployment.
  • Role

    MLOps Engineer I

  • Years of Experience

    2.9 years

Skillsets

  • Datadog
  • Transformers
  • TensorRT
  • System Design
  • spaCy
  • NLTK
  • MLflow
  • Kubernetes
  • Hugging Face
  • Helm
  • Grafana
  • Golang
  • GitLab CI
  • GCP
  • FasterTransformer
  • FastAPI
  • NumPy
  • Data Structures
  • ArgoCD
  • Algorithms
  • Git - 2 Years
  • Databricks
  • Scikit-learn
  • Bash
  • Python - 2 Years
  • PyTorch
  • Keras
  • pandas
  • Docker
  • Flask

Professional Summary

2.9 Years
  • Jan, 2025 - Present (1 yr 2 months)

    MLOps Engineer I

    QuillBot
  • Jan, 2023 - Jan, 2025 (2 yr)

    Software Engineer (AI/ML)

    Techversant InfoTech

Applications & Tools Known

  • PyTorch
  • Scikit-learn
  • NLTK
  • Keras
  • Hugging Face
  • OpenAI
  • pandas
  • NumPy
  • Matplotlib
  • Git
  • Docker
  • Databricks
  • Google Colab
  • Azure Functions

Work History

2.9 Years

MLOps Engineer I

QuillBot
Jan, 2025 - Present (1 yr 2 months)

  • Maintained a central ML gateway (FastAPI) that routes inference traffic across multiple NLP and multimodal services using RabbitMQ and Kubernetes, supporting millions of requests per day across high-throughput, multi-language workloads.
  • Led end-to-end deployment of NLP models (classifiers, paraphrasers, grammar checkers, translators) across multiple GCP clusters and environments, enabling safe rollouts and rollbacks using ArgoCD and Datadog.
  • Built an automated CI/CD pipeline using GitLab CI to promote fine-tuned Vertex AI models from the model registry to production, integrating LiteLLM-based cost observability to track and optimize LLM inference spend, increasing developer efficiency by 70%.
  • Designed resilient LLM inference pipelines with Langfuse-driven prompt versioning and automated cross-vendor fallback, ensuring graceful failover during rate-limit and availability incidents and reducing user-facing failures during traffic spikes.
  • Led complex embedder-discriminator deployments to support personalization research, collaborating with data, backend, and research teams to safely productionize experimental models and accelerate research-to-production timelines.
  • Implemented batch processing for AI content detection workloads, improving throughput and reducing the number of Kubernetes pods required, which lowered infrastructure costs.

Software Engineer (AI/ML)

Techversant InfoTech
Jan, 2023 - Jan, 2025 (2 yr)

  • Trained and deployed a cascade model combining a semantic lexicon and an LSTM for Brickcode prediction on 1.3 million rows of data, served as a serverless Azure Function App (HTTP trigger) and built with Azure ML Studio. Reduced inference time by 75% while improving prediction accuracy and deployment efficiency.
  • Developed a nutrient attribute extraction solution using Azure Cognitive OCR, LangChain, and Azure OpenAI to extract, normalize, and rephrase nutrition information from images, using GPT-4 prompt engineering and regex for structured data processing, and deployed it as a serverless Azure Function App.
  • Designed and deployed an image validation and enhancement pipeline as an Azure Function App that processes user-provided image URLs through validation checks, enhancement, and lossless compression, then serves the optimized images via Azure Blob Storage URLs.
  • Developed a PDF to DOCX converter using PaddleOCR, optimized to run on standard servers instead of GPU-based servers with minimal difference in inference time, reducing costs by 80% and improving document management and accessibility.

Major Projects

2 Projects

DeepIris Recognition

    Implemented a deep learning solution for iris recognition based on a journal paper. Fine-tuned a ResNet50 model with ImageNet weights on the IITD Iris Dataset. Built the project to run CPU inference using ONNX for efficient processing on non-GPU servers.

Signboard Detection And Recognition

    Developed an ML solution to detect and recognize store nameboards in a shopping mall. Fine-tuned a YOLOv8 model with ImageNet weights on a custom Roboflow dataset. Runs CPU inference using ONNX and extracts text using PaddleOCR.

Education

  • B.Tech in Mechanical Engineering

    Amal Jyothi College Of Engineering (2022)

Certifications

  • AWS Certified AI Practitioner

Interests

  • Watching Movies