profile-pic
Vetted Talent

Satyam Mishra

Vetted Talent
AI Engineer specializing in LLM applications, RAG systems, and generative AI with expertise in Python and modern AI frameworks. Proven track record in developing conversational AI agents, recommendation systems, and computer vision solutions that drive business value across HR and media industries.
  • Role

    AI Engineer

  • Years of Experience

    5 years

Skillsets

  • LLM Fine-tuning
  • Weights & Biases
  • Weaviate
  • rag
  • Pinecone
  • LlamaIndex
  • Langraph
  • Kubernetes
  • FAISS
  • Chroma
  • Recommender systems
  • PyTorch
  • Prompt Engineering
  • OpenAI API
  • Python - 2 Years
  • LangChain
  • Hugging Face
  • FastAPI
  • TensorFlow
  • Streamlit
  • Computer Vision
  • Docker
  • Python
  • NLP
  • SQL - 2 Years
  • SQL - 2 Years
  • Python - 2 Years

Vetted For

6Skills
  • Roles & Skills
  • Results
  • Details
  • icon-skill_image
    Jr. AI/ML Developer (Onsite)AI Screening
  • 34%
    icon-arrow-down
  • Skills assessed :Deep Learning, PyTorch, machine_learning, Problem Solving Attitude, Python, R Python
  • Score: 31/90

Professional Summary

5Years
  • Jul, 2024 - Present1 yr 11 months

    Product Engineer - AI / ML

    HROne
  • Aug, 2023 - Jul, 2024 11 months

    Data Scientist

    Orion eSolutions
  • Jun, 2022 - Aug, 20231 yr 2 months

    Data Scientist

    Redcliffe Labs
  • Apr, 2021 - Jun, 20221 yr 2 months

    Machine Learning Engineer

    Amarujala.com

Applications & Tools Known

  • icon-tool

    Git

  • icon-tool

    Docker

  • icon-tool

    AWS

  • icon-tool

    Metabase

  • icon-tool

    Django

  • icon-tool

    Streamlit

  • icon-tool

    GCP Bigquery

  • icon-tool

    Apache Beam

Work History

5Years

Product Engineer - AI / ML

HROne
Jul, 2024 - Present1 yr 11 months
    Designed and deployed an enterprise-grade HR AI agent with RAG capabilities for automated HR interactions and knowledge retrieval. Implemented advanced context management and dynamic query planning using Langraph and ReAct agent patterns. Created custom vector indexes and hybrid retrieval systems for company policy and procedure documents. Reduced query response time by 85% and decreased HR team workload by 30% through intelligent task automation.

Data Scientist

Orion eSolutions
Aug, 2023 - Jul, 2024 11 months

Data Scientist

Redcliffe Labs
Jun, 2022 - Aug, 20231 yr 2 months
    Developed generative AI system for personalized health reports using Python and Azure TTS, reducing creation time by 90% and improving report quality. Built end-to-end computer vision solution with ResNet50 for identification and compliance verification with 95% accuracy. Implemented RFM analysis framework and predictive models boosting revenue by 50% through targeted engagement strategies.

Machine Learning Engineer

Amarujala.com
Apr, 2021 - Jun, 20221 yr 2 months
    Built production-grade recommendation engine using NLP embeddings and hybrid matrix factorization techniques, increasing user engagement by 70%. Developed ALS-based article recommendation model with A/B testing framework, achieving 200% CTR improvement using TensorFlow and distributed processing.

Education

  • Bachelor of Computer Applications

    Indira Gandhi National Open University (2020)

Certifications

  • Machine Learning

AI-interview Questions & Answers

Atir Mishra, I am a data scientist. I worked with Radcliffe Labs where I worked on building a recommendation system. And at Amaruzala, I also worked on a recommendation system, and I developed the recommendation system that caters to millions of users. At Radcliffe Labs, I designed a computer vision system for fluoro grooming detection. This is a brief introduction about myself. I did a BCA from IIGN and have certifications from IIT Madras in machine learning and TensorFlow, and also from Datachem as a power data scientist with Python skills.

Which I think has to be most efficient for indexing when working on machine learning problems.

What approach would you take to handle imbalanced classes in a dataset when training a deep learning model using PyTorch? You can basically apply techniques like ensemble methods, resampling techniques, data augmentation techniques, and weighted losses. PyTorch has inbuilt functionality of cross entropy loss, which is like a weighted loss function. It also has some other functionalities like precision recall and F1 score to evaluate.

One of the best use cases I found for Python's lambda function in preprocessing data for a machine learning model is that I can just build a function, sorted from a version of a function that's required for cleaning up and starting with specific data. Like, I wanted to do some scaling, or maybe I can also do some other things like normalization, or maybe even feature extraction, which can be done with a single line of code using a lambda function.

Regularization techniques in Python's scikit-learn library, such as L1 and L2 regularization, help prevent overfitting during model training.

I would use Recursive feature elimination or Lasso regression for feature selection in a large-scale machine learning project.

numeric_vector <- c(1, 2, 3, 4, 5) for (i in 1:length(numeric_vector)) { sum_value <- 0 for (j in 1:i) { sum_value <- sum_value + numeric_vector[j] } mean_value <- sum_value / i print(mean_value) } Corrected Code: numeric_vector <- c(1, 2, 3, 4, 5) sum_value <- 0 for (i in 1:length(numeric_vector)) { for (j in 1:i) { sum_value <- sum_value + numeric_vector[j] } mean_value <- sum_value / i print(mean_value) }

in the neural network, training loop written in the Python, identify the section of code. in neural network training loop written in that Python, identify that section of the loop code that might cause an error during back propagation. What is the and that import while touching what quite might cause in that provision and explain why. so it really is what you need to do. This might be problematic. Mod.eval. Okay. so how I should I do that? And then not relu. and are not linear. I've crossed 2, long dot tensor. The line target dot on this typo, it should be long tensor 1. That's

Which when hyper tuned are parameters for PyTorch model, which method do you prefer? Good search or random search? Hyperparameter tuning. I would use grid search.

When developing a PyTorch model, how do you ensure efficient memory usage during training, especially with a large dataset? Okay. When developing a pilot class model, how do you ensure efficient memory usage during training, especially with larger datasets. So, in that case, when developing a PyTorch model to ensure efficient memory usage, in PyTorch, I would consider using techniques like batch loading, data augmentation, optimizing your model architecture for memory. Additionally, monitoring GPU memory usage and clearing unnecessary variables can also help with memory effectively.