profile-pic

Nihad Hassan

Experienced Machine Learning Engineer with one year of hands-on expertise in developing and implementing cutting-edge machine learning models, demonstrating strong proficiency in data analysis, algorithm design, and model deployment.
  • Role

    MLOps Engineer I

  • Years of Experience

    5.25 years

Skillsets

  • Datadog
  • Transformers
  • TensorRT
  • System Design
  • spaCy
  • NLTK
  • MLFlow
  • Kubernetes
  • HuggingFace
  • Helm
  • Grafana
  • Golang
  • gitlab ci
  • GCP
  • FasterTransformer
  • FastAPI
  • NumPy
  • Data Structures
  • ArgoCD
  • Algorithms
  • Python
  • Git - 2 Years
  • Databricks
  • Scikit-learn
  • Bash
  • Python - 2 Years
  • PyTorch
  • Keras
  • Git
  • pandas
  • Docker
  • Flask

Professional Summary

5.25Years
  • Apr, 2022 - Present4 yr 1 month

    Embedded Software Engineer

    Valeo
  • MLOps Engineer I

    QuillBot

Applications & Tools Known

  • icon-tool

    PyTorch

  • icon-tool

    Scikit-learn

  • icon-tool

    NLTK

  • icon-tool

    Keras

  • icon-tool

    Hugging Face

  • icon-tool

    OpenAI

  • icon-tool

    pandas

  • icon-tool

    NumPy

  • icon-tool

    Matplotlib

  • icon-tool

    Git

  • icon-tool

    Docker

  • icon-tool

    Databricks

  • icon-tool

    Google Colab

  • icon-tool

    Azure Functions

Work History

5.25Years

Embedded Software Engineer

Valeo
Apr, 2022 - Present4 yr 1 month

MLOps Engineer I

QuillBot
    Maintained a central ML gateway (FastAPI) routing inference traffic across multiple NLP and multimodal services using RabbitMQ and Kubernetes, supporting millions of requests per day across high-throughput, multi-language workloads. Led end-to-end deployment of NLP models (classifiers, paraphrasers, grammar checkers, translators) across multiple GCP clusters and environments, enabling safe rollouts and rollbacks using ArgoCD and Datadog. Built an automated CI/CD pipeline using GitLab CI to promote fine-tuned Vertex AI models from model registry to production, integrating LiteLLM-based cost observability to track and optimize LLM inference spend, increasing developer efficiency by 70%. Designed resilient LLM inference pipelines with Langfuse-driven prompt versioning and automated cross-vendor fallback, ensuring graceful failover during rate-limit and availability incidents and reducing user-facing failures during traffic spikes. Led complex embedder-discriminator deployments to support personalization research, collaborating with data, backend and research teams to safely productionize experimental models and accelerate research-to-production timelines. Implemented batch processing for AI content detection workloads, improving throughput and reducing required Kubernetes pods, resulting in meaningful infrastructure cost savings.

Major Projects

2Projects

DeepIris Recognition

    Implemented a deep learning model solution based on a journal paper for iris recognition. Fine-tuned a ResNet50 model with ImageNet weights on the IITD Iris Dataset. Developed the project to run CPU inference using ONNX, ensuring efficient processing on non-GPU servers.

Signboard Detection And Recognition

    Developed an ML solution to detect and recognize store nameboards in a shopping mall. Fine-tuned YOLO V8 model with ImageNet weights trained on Roboflow custom dataset. Runs CPU inference using ONNX and text extraction using PaddleOCR.

Education

  • B.Tech in Mechanical Engineering

    Amal Jyothi College Of Engineering (2022)

Certifications

  • Aws certified ai practitioner

Interests

  • Watching Movies