profile-pic
Vetted Talent

Aftab Alam

Vetted Talent
Expertise in Generative AI, AI fine-tuning, crafting multi-model applications, and working with Dialog Flow CX. Worked extensively on backend development. Proficient in Prompt Engineering, Machine Learning (ML), Deep Learning (DL), Natural Language Processing (NLP), and their associated libraries and frameworks.
  • Role

    Full Stack Developer

  • Years of Experience

    5.4 years

  • Professional Portfolio

    View here

Skillsets

  • Supabase
  • PostgreSQL
  • Scikit-learn
  • SQLite
  • Yolo
  • C
  • C++
  • HuggingFace
  • rag
  • React-native
  • Oracle
  • Airflow
  • GCP
  • image preprocessing
  • Kafka
  • LLM fine tuning
  • Milvus
  • MLFlow
  • Multimodal models
  • Qdrant
  • Azure
  • Prompt Engineering
  • AWS
  • Docker
  • PyTorch
  • TensorFlow
  • Python
  • Keras
  • OpenCV
  • Annoy
  • FastAPI
  • Bash
  • Chroma
  • CI/CD
  • FAISS
  • LangChain
  • LangGraph
  • MongoDB
  • MySQL

Vetted For

16Skills
  • Roles & Skills
  • Results
  • Details
  • icon-skill_image
    Machine Learning Engineer, AI/ML, Search & Discovery (Remote)AI Screening
  • 56%
    icon-arrow-down
  • Skills assessed :Collaboration, Communication, CI/CD, Data preprocessing, Deep Learning, Feature Engineering, Model evaluation, Natural Language Processing, PyTorch, Reinforcement Learning, TensorFlow, Good Team Player, machine_learning, NLP, Problem Solving Attitude, Python
  • Score: 50/90

Professional Summary

5.4Years
  • Sep, 2025 - Present 9 months

    Full Stack Developer

    Freelance
  • Dec, 2025 - Feb, 2026 2 months

    Gen AI - Consultant

    Bespoke Labs
  • Aug, 2025 - Nov, 2025 3 months

    AI Engineer

    Agelix Consulting
  • Mar, 2021 - Nov, 20243 yr 8 months

    AI Engineer

    Freelance
  • Nov, 2024 - May, 2025 6 months

    ML engineer

    Brenin

Applications & Tools Known

  • icon-tool

    Python

  • icon-tool

    Scikit-Learn

  • icon-tool

    PyTorch

  • icon-tool

    Keras

  • icon-tool

    Pandas

  • icon-tool

    NumPy

  • icon-tool

    NLTK

  • icon-tool

    MySQL

  • icon-tool

    Matplotlib

  • icon-tool

    Seaborn

  • icon-tool

    Pandas

  • icon-tool

    NLTK

  • icon-tool

    Visual Studio Code

  • icon-tool

    Git

  • icon-tool

    Hugging Face

  • icon-tool

    React

  • icon-tool

    Vector DB

Work History

5.4Years

Full Stack Developer

Freelance
Sep, 2025 - Present 9 months
    Developing I-Sapphire, an internal task management application for I-Sapphire company using react-native. The platform enables employees to create, assign and track tasks efficiently. It has voice note recording, past data for analysis, focus task and so on. Key Contributions - Responsive react frontend, supabase for database, and GCP services.

Gen AI - Consultant

Bespoke Labs
Dec, 2025 - Feb, 2026 2 months
    Designed and implemented LLM Evaluation frameworks to benchmark GenAI model performance on real world task solving. Key Contributions - Created benchmark datasets, prompts and evaluation metrics to measure accuracy, reasoning and response quality. Built pipelines on the Lunara project to push and execute the framework.

AI Engineer

Agelix Consulting
Aug, 2025 - Nov, 2025 3 months
    Developed AI-driven document intelligence systems to transform unstructured contracts into structured and searchable data using OCR and LLMs. Key Contributions Built LLM-powered document processing pipelines, added fastapi services for document ingestion, processing and inference.

ML engineer

Brenin
Nov, 2024 - May, 2025 6 months
    Worked on Amara - an AI powered digital hr platform integrating Generative AI, Conversational interfaces and automation workflows. Key Contributions Integrated Generative AI models and LLM APIs to power conversational HR assistants, Implemented RAG pipelines for HR knowledge retrieval and question answering, Developed biometric authentication and interview analysis AI modules etc.

AI Engineer

Freelance
Mar, 2021 - Nov, 20243 yr 8 months
    Delivered AI and Machine learning solutions across various domains including chatbots, computer vision, RAG systems and business analytics automation. Key Contributions Developed RAG-based chatbot systems and knowledge assistants for enterprise applications, Designed deep learning models for image classifications and object detection using YOLO, Implemented Email Automation using Langgraph etc.

Achievements

  • Developed various machine learning and deep learning projects
  • Optimized machine learning models using advanced techniques
  • Developed and fine-tuned supervised and unsupervised learning models using Python libraries such as Scikit-Learn, TensorFlow, and PyTorch.
  • Implemented neural networks for tasks such as image recognition, natural language processing, and time series prediction.
  • Conducted thorough data cleaning and preprocessing using tools like Pandas and NumPy, ensuring high-quality inputs for models.

Major Projects

4Projects

EMAIL-AUTOMATION

    Designed an automated email-handling system that interprets customer queries, requests missing details, analyzes attachments, retrieves customer records from the database and generates accurate support responses.

CONTRACT-DIGITIZATION

    Built a contract digitization system capable of processing diverse contract formats, including images and PDFs, to extract clauses and key business information.

AMARA APP

    Developed an AI-powered digital HR platform automating HR workflows, including applicant tracking system, intelligent resume screening, interview scheduling, and candidate shortlisting.

React-Native(Full Stack)

    A task tracking app built for internal employees of an I-Sapphire company to manage, assign and contribute to tasks easily and effectively.

Education

  • Master Of Computer Applications (MCA)

    Bikaner Technical University
  • Bachelor Of Computer Applications (BCA)

    Lachoo Memorial College Of Science And Technology
  • High School with PCM(Physics, Chemistry and Maths)

    Apna Sr. Sec. School

Certifications

  • Python training

  • React native the practical guide (2023)

  • Artificial intelligence tensorflow developer certificate in 2023: zero to mastery

  • Tensorflow certificate

  • Python certificate

  • Artificial intelligence certificate

  • React native certificate

AI-interview Questions & Answers

I have done my MCA from Technical University, and my major is in data science and machine learning. I also have a certificate from Intel, as well as completed courses on Udemy and many other online platforms. And I have worked on many projects in machine learning and deep learning, such as text classification, image classification, and others.

To make our machine learning model more accurate, we can lower the number of layers or use a grid search, random search to find the best parameters or use an optimizing technique which best suits our problem.

Tune hyperparameters for a deep learning model. We can use grid search or random search to find the best parameters that work with our machine learning model to predict the best transfers so as to achieve higher accuracy. You can use libraries such as TensorFlow and scikit-learn.

When we have imbalanced data, then we can balance it, we can use imputation or such methods as mean, median, mode, or KNN algorithm that will automatically correct our dataset. If we have categorical values, then we must first convert them into numerical values. Similarly, it goes with text and images. We can detect the skewness of the data by visualization through matplotlib or seaborn. We can also detect outliers if there are any in the dataset using a box plot, which will highly affect our machine learning model's prediction, or we can use normalization and standardization techniques on our datasets. So to make all our datasets have the same characteristics, such as 0 mean and a standard deviation of 1, we can use no standard scaler from scikit-learn to standardize our data and normalize it.

Well, we work with the RNN to process sequential data. We have a problem such as the RNN model. A simple RNN can't process or remember long-range dependencies in sequential data. The RNN model is specifically designed for sequential data, and we can also do classification as well as regression with RNN. However, based on transformer models such as T5, CHEDGBT, and other models based on transformers, they can have the ability to remember long-range dependencies and provide coherent and logical answers because of the hidden cell state it maintains over every time step of input. This is the drawback that transformer models cover.

To accurately select the future that you want to take in to train our model, we can use various metrics such as correlation metrics and covariance metrics to check the dependence of features on one another, how they influence each other, and how they will influence our model. This is a part of feature engineering. And similarly, if we are working with time series, then we have to look for a date range from where the data is starting and ending. To make a time frame, perform operations such as rolling mean or rolling statistics to define the window frame and predict the upcoming answer. Feature engineering is important because our model depends only on features to provide the best transfer. We can discard outliers because they are not needed in our machine learning models. And also, we can look for columns that will not be apart for our machine learning model to predict the output, such as date columns, index columns, or name of years and their countings, and so on.

Medical embedding. I'm encoding. Encoding. Get. Each category equals an index. Apply. Data frame column. Approaching code from the apply lambda function encoding. So this is the code. This code I see here will convert category variables into numerical embeddings. It suggests it in the line data frame. The bracket column equals the data frame column dot apply where it uses a lambda function to encode features. The second parameter that we give in the encode categorical definition function will convert this column when we apply this lambda function into numerical. First, we give the data frame as the first parameter, and the column that we want to convert from categorical to numerical as the second.

Prediction service. What we did? Is it class prediction service? Decorator static method. Load model. Model path, prediction service dot model, wish service dot model, from path. Another function. Previous permission. Load model as permission. Model path of the model. Static method. Predict input feature. This mode which we are loading a model here and then another definition we are predicting features. At last, within it, model is described. Features of prediction service dot model is none. Then we predict we can predict predictions. Prediction service predict the load model. To load the model prediction service That model, we are getting model from the prediction surface, and then we are loading it from another path, another path, load model path, If prediction it's telling if prediction service. Model is none. Model is not loaded. Yes. Right. Model dot predict input features. Prediction service dot model into futures. This point is not a problem here. I'm not sure if you address it. I think it will take time for me to take this code and run it in my notebook and then find what are the potential causes for any errors. Just by looking, I think I wouldn't be able to find what the problem here is.

When I have to work with TensorFlow, specifically for image classification, first, I will need my data. If I have data, then I can start working on it. And if I do not have data, then first, I have to collect data. This can be done through a YouTube app scraping tool. Or if I have data available, then I will use that data first to check whether it is in the right format or all the data types are in the right structure. Also, I will perform operations such as whether there are null values or not. And then after checking the format of images, whether all the images that I want to train in my model are right or not. Because it sometimes happens that some formats are not supported by our model. So, that leads to an error while we are loading our images with our model. And it comes to our attention later when we have finished our model architecture. To work with images, we generally use CNNs because they work best. In CNNs, we have convolutional neural networks. They have parameters such as filters, which capture information in the images. And we can define the filter size by the help of the kernel size parameter. The number of filters we can define. The activation function we have to define in the model architecture, and the input dimension, the dimension of the image that our model will train on, which will be defined in the first layer. A model architecture in CNN will start from the input layer, then the hidden layers will come. And then in the final layer, our output layer will come. We can also use a dropout layer to drop some neurons randomly. And then we can divide our dataset into a train-test split. After dividing, we can train our model by the model's fit method. We can pass our dataset and define the number of epochs. And then after our model will successfully train on the dataset, we can predict and test on our test dataset using the model's predict method. And finally, we can check the accuracy of the model on the training dataset and testing dataset.

For transfer learning, we can use models where we don't want to create a model for ourselves because sometimes, and it often happens when we're working with large datasets. Models consume too much space and time, and it's not for everyone to train complex models on their personal laptops because it requires complex hardware architectures such as GPUs and TPUs. So when we're working with large datasets and the data are in high dimension, it's better to use pre-trained models because they don't take time to train our model. We can customize the output layers just to set as trainable is equal to true, and then we can define our input and output layers in which we want our output, such as when we're working with 10 classes, then we can define 10 classes and output in a layer. We can use the Hugging Face library for leveraging transfer learning models, and we can customize according to our needs. And it will work similarly as I've said before. First, we have to gather old data and divide the data, then train our model on our data. After successful training, we will predict our data and check its accuracy. Yes. We can use transfer learning with GPT, BERT, and many other models such as ImageNet, VGG. It's based on the type of problem we're working on, whether we're working with text data or when we're working with images or videos.

Performance. What strategies could you apply to monitor performance of production and systems? You can use version controls to manage your code and provide continuous integration or continuous deployment. We can use GitLab, AWS to track our code and update our code, fix bugs. Monitoring the performance of production. Yes. That was all. Thank you.