Vetted Talent

Omjee Mishra

Vetted Talent

Strong experience in Hadoop and Spark ecosystems including Spark Core, Pyspark, SparkSQL, HDFS, Map-Reduce, Hive, Hbase, Sqoop Having good knowledge on programming language as Python and Scala. Proficient in performing SQL queries. Easily adopt, explore, learn and understand newer business domains and technology based on the requirements. Develop best practices for developing and deploying Hadoop applications and assist the team to manage compliance to the standards. Experienced with different file formats like Parquet, ORC, Sequence, CSV, JSON, Text files. Experienced Big Data/Hadoop and spark Developer with strong background with file distribution system in big data arena Execute change management activities supporting production deployment to Developers. Design, plan, and develop programs to perform automated extract, transform and load data between data sources when working with large data sets. Performed Hive operations on large datasets with proficiency in writing HiveQL queries using transactional and performance efficient concepts: Partitioning, Bucketing, efficient and effective Join operations. Gained exposure on various AWS services like Glue, S3, Lambda, RDS, Athena. Having sound knowledge on Data warehousing concepts.

Role
AI Engineer
Years of Experience
8.6 years

Skillsets

Google Cloud AI
UX/UI Design
TypeScript
Supabase
Streamlit
react
PostgreSQL
Pinecone
MongoDB
MLOps
LangChain
JavaScript
Git
FastAPI
Event-Driven Architectures
Docker
CI/CD
Aws ai tools
API development
JavaScript - 8 Years
JSON
Python - 8 Years

Vetted For

10Skills

Roles & Skills
Results
Details

Python Developer (AI/ML & Cloud Services) - RemoteAI Screening
47%

Skills assessed :GCP/Azure, Micro services, Django /Flask, Neo4j, Restful APIs, AWS, Docker, Kubernetes, machine_learning, Python
Score: 42/90

Professional Summary

8.6Years

Jul, 2025 - Present 10 months
AI Engineer / Software Developer
MTI
Jul, 2025 - Jul, 2025
AI Solutions Developer
Tech Mahindra

Applications & Tools Known

CSV
JSON
AWS Glue
S3
Lambda
RDS
PostgreSQL
Python
Databricks
Hadoop
HDFS
Hive
SQL
C#
Oracle 11g

Work History

8.6Years

AI Engineer / Software Developer

MTI

Jul, 2025 - Present 10 months

Designed and developed modular conversational AI systems leveraging private knowledge bases to deliver accurate, context-aware responses, enhancing customer interactions by 30%. Created operational AI tools and automations that streamlined business processes across web, mobile, and messaging platforms, resulting in a 25% reduction in processing time. Developed and maintained API integrations using FastAPI, ensuring seamless communication between various services and enhancing overall system performance. Implemented secure and scalable solutions that protected sensitive data while maintaining high performance, achieving compliance with industry standards. Collaborated with cross-functional teams to create user-configurable tools adaptable across diverse markets, increasing customer satisfaction scores by 20%.

AI Solutions Developer

Tech Mahindra

Jul, 2025 - Jul, 2025

Led the development of a self-service platform utilizing conversational AI and natural language processing, enabling clients to manage their own AI solutions effectively. Developed modular components for integration across various channels, ensuring extensibility and ease of use for end users. Participated in the design of multi-tenant architectures and webhook implementations, enhancing the platform's capabilities for diverse client needs. Contributed to the establishment of CI/CD pipelines, improving deployment efficiency and reducing time-to-market for new features.

Major Projects

2Projects

Conversational AI Chatbot

Jan, 1970 - Jan, 1970

Developed a chatbot using LangChain and FastAPI that integrated with multiple communication platforms, improving customer engagement by 40%.

Operational AI Automation Tool

Jan, 1970 - Jan, 1970

Created an automation tool that streamlined data processing workflows, resulting in a 50% increase in operational efficiency for a marketing firm.

Education

Bachelor of Engineering
VTU

Certifications

Big Data Developer Certification from Simplilearn
Big data developer certification from simplilearn
Data science with python from simplilearn
Machine learning advanced certification training from simplilearn
Tableau desktop 10 certification from simplilearn

AI-interview Questions & Answers

I'm glad we could schedule this interview today.

in data engineering, where I worked on data collection, data storage, data transformation. And actually, I rely on data to make informed decisions. As a data engineer, I always play a vital role in providing the necessary infrastructure and tools. And as a business grows, the data needs grow too. So as a data engineer, we build a scalable system to handle increasing data volumes. And I'm proficient in Python, SQL, and Scala. And I have knowledge of relational databases and NoSQL. And I have experience with cloud platforms like AWS, Azure, and GCP.

So, in terms of implementing a CI/CD pipeline for deploying a Python application on AWS. Okay. A Python application on AWS. So the service that, in terms of, we can use AWS CodeBuild and AWS CodeDeploy. Okay. Infrastructure as code is optional. AWS CloudWatch. Okay. AWS CodePipeline.

So, basically, if you'll be talking about how you use Docker containers to manage dependencies and streamline deployment for a Django-based machine learning service on AWS. So, basically, the entire application involves, including all dependencies like the Python library system. So, outline the various AWS services that can be used for deploying a Dockerized service, such as Amazon Elastic Container Service, Amazon Elastic Kubernetes Service, and Amazon Elastic Beanstalk. So, basically, we can say that streamlining the deployment process provides consistent. Okay. So, some options to consider include a simple diagram. Okay.

So basically, the benefit we can say in terms of Neo4j integration in machine learning workflow is that it can be used for graph-based feature engineering, which excels at representing complex relationships between entities. This can be leveraged to engineer powerful features for machine learning models, like network embedding, path-based features, and community detection. It can also improve model performance, as well as scalability and efficiency. The approach for integration involves data loading, feature engineering, model training, integrating with model training, model evaluation, and we can use Python code with Neo4j.

so basically we can break down how we can implement serverless microservice in AWS Lambda using Python so basically first we can say microservice is a small independent service in serverless architecture where we don't manage the server directly okay so first we need to create an AWS Lambda function where we can choose the runtime write function code and configure the handler then we can create an API Gateway and then we can configure the API Gateway then we can deploy and test so these are the process

So, there can be some potential issues like a lack of complete context in the query, hard-coded entity URL, no error handling, potential framework bottleneck, okay. So, we can recommend refining the query, parsing the entity URL argument, adding error handling, considering indexing, and optimizing SparkQL queries.

so first we need to do logging, then matrix, tracing, and in terms of reliability it can be error handling, retry, idempotency, fault tolerance, monitoring, and alerting, and some of the AWS specific considerations like AWS Step Functions, Lambda, and batch.

so basically if you will be talking about like you know some of the service components, such as API gateway, lambda function, model serving. The second one is high availability and fault tolerance in terms of high availability and fault tolerance, it can be API gateway, lambda function, model serving, okay, data storage. And in terms of Python implementation, it can be request handler, inference handler, monitoring, and logging, security, testing, and deployment.

In my last project, I worked on a Django application that hosted a set of RESTful APIs for a large e-commerce platform. We encountered a performance issue as the application failed to handle increasing traffic. To address this, I implemented several optimization strategies, including database optimization, API endpoint optimization, server-side optimization, and cloud-specific optimization. By implementing these optimizations, I was able to significantly improve the performance and scalability of the Django application.

so basically if you'll be talking about your previous project as you told me you worked on a larger scale e-commerce platform where we needed to process a high volume of orders and user interactions to achieve this, you implemented a task process using Celery, a popularly attributed task queue in Python. Some of the key aspects of the implementation were task definition, task queue integration, message broker, task scheduling, and error handling and retry.

Omjee Mishra

AI Engineer

8.6 years

Skillsets

Vetted For

Professional Summary

Applications & Tools Known

Work History

AI Engineer / Software Developer

AI Solutions Developer

Major Projects

Conversational AI Chatbot

Operational AI Automation Tool

Education

Bachelor of Engineering

Certifications

Big Data Developer Certification from Simplilearn

Big data developer certification from simplilearn

Data science with python from simplilearn

Machine learning advanced certification training from simplilearn

Tableau desktop 10 certification from simplilearn

AI-interview Questions & Answers