profile-pic
Vetted Talent

Omjee Mishra

Vetted Talent
Strong experience in Hadoop and Spark ecosystems including Spark Core, Pyspark, SparkSQL, HDFS, Map-Reduce, Hive, Hbase, Sqoop Having good knowledge on programming language as Python and Scala. Proficient in performing SQL queries. Easily adopt, explore, learn and understand newer business domains and technology based on the requirements. Develop best practices for developing and deploying Hadoop applications and assist the team to manage compliance to the standards. Experienced with different file formats like Parquet, ORC, Sequence, CSV, JSON, Text files. Experienced Big Data/Hadoop and spark Developer with strong background with file distribution system in big data arena Execute change management activities supporting production deployment to Developers. Design, plan, and develop programs to perform automated extract, transform and load data between data sources when working with large data sets. Performed Hive operations on large datasets with proficiency in writing HiveQL queries using transactional and performance efficient concepts: Partitioning, Bucketing, efficient and effective Join operations. Gained exposure on various AWS services like Glue, S3, Lambda, RDS, Athena. Having sound knowledge on Data warehousing concepts.
  • Role

    AI Engineer

  • Years of Experience

    8.6 years

Skillsets

  • Google Cloud AI
  • UX/UI Design
  • TypeScript
  • Supabase
  • Streamlit
  • react
  • PostgreSQL
  • Pinecone
  • MongoDB
  • MLOps
  • LangChain
  • JavaScript
  • Git
  • FastAPI
  • Event-Driven Architectures
  • Docker
  • CI/CD
  • Aws ai tools
  • API development
  • JavaScript - 8 Years
  • JSON
  • Python - 8 Years

Vetted For

10Skills
  • Roles & Skills
  • Results
  • Details
  • icon-skill_image
    Python Developer (AI/ML & Cloud Services) - RemoteAI Screening
  • 47%
    icon-arrow-down
  • Skills assessed :GCP/Azure, Micro services, Django /Flask, Neo4j, Restful APIs, AWS, Docker, Kubernetes, machine_learning, Python
  • Score: 42/90

Professional Summary

8.6Years
  • Jul, 2025 - Present 10 months

    AI Engineer / Software Developer

    MTI
  • Jul, 2025 - Jul, 2025

    AI Solutions Developer

    Tech Mahindra

Applications & Tools Known

  • icon-tool

    CSV

  • icon-tool

    JSON

  • icon-tool

    AWS Glue

  • icon-tool

    S3

  • icon-tool

    Lambda

  • icon-tool

    RDS

  • icon-tool

    PostgreSQL

  • icon-tool

    Python

  • icon-tool

    Databricks

  • icon-tool

    Hadoop

  • icon-tool

    HDFS

  • icon-tool

    Hive

  • icon-tool

    SQL

  • icon-tool

    C#

  • icon-tool

    Oracle 11g

Work History

8.6Years

AI Engineer / Software Developer

MTI
Jul, 2025 - Present 10 months
    Designed and developed modular conversational AI systems leveraging private knowledge bases to deliver accurate, context-aware responses, enhancing customer interactions by 30%. Created operational AI tools and automations that streamlined business processes across web, mobile, and messaging platforms, resulting in a 25% reduction in processing time. Developed and maintained API integrations using FastAPI, ensuring seamless communication between various services and enhancing overall system performance. Implemented secure and scalable solutions that protected sensitive data while maintaining high performance, achieving compliance with industry standards. Collaborated with cross-functional teams to create user-configurable tools adaptable across diverse markets, increasing customer satisfaction scores by 20%.

AI Solutions Developer

Tech Mahindra
Jul, 2025 - Jul, 2025
    Led the development of a self-service platform utilizing conversational AI and natural language processing, enabling clients to manage their own AI solutions effectively. Developed modular components for integration across various channels, ensuring extensibility and ease of use for end users. Participated in the design of multi-tenant architectures and webhook implementations, enhancing the platform's capabilities for diverse client needs. Contributed to the establishment of CI/CD pipelines, improving deployment efficiency and reducing time-to-market for new features.

Major Projects

2Projects

Conversational AI Chatbot

Jan, 1970 - Jan, 1970
    Developed a chatbot using LangChain and FastAPI that integrated with multiple communication platforms, improving customer engagement by 40%.

Operational AI Automation Tool

Jan, 1970 - Jan, 1970
    Created an automation tool that streamlined data processing workflows, resulting in a 50% increase in operational efficiency for a marketing firm.

Education

  • Bachelor of Engineering

    VTU

Certifications

  • Big Data Developer Certification from Simplilearn

  • Big data developer certification from simplilearn

  • Data science with python from simplilearn

  • Machine learning advanced certification training from simplilearn

  • Tableau desktop 10 certification from simplilearn

AI-interview Questions & Answers

I'm glad we could schedule this interview today.

in data engineering, where I worked on data collection, data storage, data transformation. And actually, I rely on data to make informed decisions. As a data engineer, I always play a vital role in providing the necessary infrastructure and tools. And as a business grows, the data needs grow too. So as a data engineer, we build a scalable system to handle increasing data volumes. And I'm proficient in Python, SQL, and Scala. And I have knowledge of relational databases and NoSQL. And I have experience with cloud platforms like AWS, Azure, and GCP.

So, in terms of implementing a CI/CD pipeline for deploying a Python application on AWS. Okay. A Python application on AWS. So the service that, in terms of, we can use AWS CodeBuild and AWS CodeDeploy. Okay. Infrastructure as code is optional. AWS CloudWatch. Okay. AWS CodePipeline.

So, basically, if you'll be talking about how you use Docker containers to manage dependencies and streamline deployment for a Django-based machine learning service on AWS. So, basically, the entire application involves, including all dependencies like the Python library system. So, outline the various AWS services that can be used for deploying a Dockerized service, such as Amazon Elastic Container Service, Amazon Elastic Kubernetes Service, and Amazon Elastic Beanstalk. So, basically, we can say that streamlining the deployment process provides consistent. Okay. So, some options to consider include a simple diagram. Okay.

So basically, the benefit we can say in terms of Neo4j integration in machine learning workflow is that it can be used for graph-based feature engineering, which excels at representing complex relationships between entities. This can be leveraged to engineer powerful features for machine learning models, like network embedding, path-based features, and community detection. It can also improve model performance, as well as scalability and efficiency. The approach for integration involves data loading, feature engineering, model training, integrating with model training, model evaluation, and we can use Python code with Neo4j.

so basically we can break down how we can implement serverless microservice in AWS Lambda using Python so basically first we can say microservice is a small independent service in serverless architecture where we don't manage the server directly okay so first we need to create an AWS Lambda function where we can choose the runtime write function code and configure the handler then we can create an API Gateway and then we can configure the API Gateway then we can deploy and test so these are the process

So, there can be some potential issues like a lack of complete context in the query, hard-coded entity URL, no error handling, potential framework bottleneck, okay. So, we can recommend refining the query, parsing the entity URL argument, adding error handling, considering indexing, and optimizing SparkQL queries.

so first we need to do logging, then matrix, tracing, and in terms of reliability it can be error handling, retry, idempotency, fault tolerance, monitoring, and alerting, and some of the AWS specific considerations like AWS Step Functions, Lambda, and batch.

so basically if you will be talking about like you know some of the service components, such as API gateway, lambda function, model serving. The second one is high availability and fault tolerance in terms of high availability and fault tolerance, it can be API gateway, lambda function, model serving, okay, data storage. And in terms of Python implementation, it can be request handler, inference handler, monitoring, and logging, security, testing, and deployment.

In my last project, I worked on a Django application that hosted a set of RESTful APIs for a large e-commerce platform. We encountered a performance issue as the application failed to handle increasing traffic. To address this, I implemented several optimization strategies, including database optimization, API endpoint optimization, server-side optimization, and cloud-specific optimization. By implementing these optimizations, I was able to significantly improve the performance and scalability of the Django application.

so basically if you'll be talking about your previous project as you told me you worked on a larger scale e-commerce platform where we needed to process a high volume of orders and user interactions to achieve this, you implemented a task process using Celery, a popularly attributed task queue in Python. Some of the key aspects of the implementation were task definition, task queue integration, message broker, task scheduling, and error handling and retry.