Vetted Talent

Amit Kumar

Amit Kumar is an experienced Cloud/Linux Engineer with a robust background in various aspects of system administration, cloud infrastructure management, and DevOps practices. With proficiency in a wide range of technologies including RedHat Linux, Debian, CentOS, Ubuntu, AWS, Azure, GCP, Docker, Ansible, and Jenkins, he has demonstrated expertise in provisioning servers, ensuring cloud security, and resolving customer issues promptly. Amit has a Bachelor's degree in Commerce with Computer Science and has worked at Grras Solutions and Ninehertz India, where he contributed to designing, developing, and maintaining cloud-based solutions, providing technical direction to implementation teams, and ensuring optimal performance and uptime of applications. With excellent problem-solving skills, strong technical acumen, and effective communication abilities, Amit is a valuable asset in any team environment, adept at meeting tight deadlines and working under pressure.

  • Role

    Site Reliability Engineer

  • Years of Experience

    8 years

Skillsets

  • cross-functional communication
  • VMware
  • Team Leadership
  • Root Cause Analysis
  • RedHat
  • Python Scripting
  • Prometheus
  • Patience
  • Nagios
  • Mentoring
  • Load Testing
  • Linux Administration
  • Grafana
  • GitHub
  • Disaster Recovery
  • AWS - 6 Years
  • Critical Thinking
  • Collaboration
  • Cloudflare
  • Cloud monitoring
  • Bitbucket
  • Adaptability
  • Troubleshooting
  • Jenkins
  • Ansible
  • CI/CD - 4 Years
  • Kubernetes - 4 Years
  • Terraform - 4 Years
  • Docker - 4 Years

Vetted For

12 Skills
  • Senior AWS DevOps Engineer (AI Screening)
  • Skills assessed: Agile principles, Jira, CI/CD Tools, Terraform, AWS, Docker, Java, Jenkins, Kubernetes, Embedded Linux, Python, Ruby on Rails
  • Score: 59/100

Professional Summary

8 Years
  • Oct, 2024 - Present (11 months)

    Site Reliability Engineer

    Wolfram Research
  • Nov, 2021 - Jun, 2024 (2 yr 7 months)

    DevOps Engineer

    NinehertzIndia Pvt Ltd
  • Jun, 2018 - Nov, 2021 (3 yr 5 months)

    Cloud Engineer

    Grras Solutions Pvt Ltd

Applications & Tools Known

  • AWS (Amazon Web Services)
  • Azure DevOps Server
  • Jenkins
  • Docker
  • Linux Admin
  • Terraform
  • Kubernetes
  • Debian
  • CentOS
  • Oracle VM
  • Ubuntu
  • Ansible
  • Azure
  • GCP
  • AWS EC2
  • VPC
  • CLI
  • S3
  • IAM
  • GitHub
  • Apache
  • Nginx
  • LAMP
  • MERN
  • MEAN
  • Oracle
  • SNS
  • Route53
  • EKS
  • ECS
  • CloudWatch
  • GitLab
  • OTRS
  • Jira
  • ELK Stack
  • Oracle Cloud

Work History

8 Years

Site Reliability Engineer

Wolfram Research
Oct, 2024 - Present (11 months)
    Implement and manage scalable infrastructure using cloud-native technologies and tools like Terraform and Ansible. Optimize system performance, troubleshoot incidents, and run blameless postmortems within defined error budgets. Collaborate with development teams to implement best practices in reliability and in fast development and deployment. Increase automation of operational tasks, reduce manual intervention, and enhance proactive monitoring solutions.

DevOps Engineer

NinehertzIndia Pvt Ltd
Nov, 2021 - Jun, 2024 (2 yr 7 months)
    Served as a subject matter expert in DevOps and infrastructure automation. Defined best practices for application deployment infrastructure, exceeded software delivery goals through CI/CD setup, and developed custom Ansible and Terraform scripts to extend functionality and automate workflows.

Cloud Engineer

Grras Solutions Pvt Ltd
Jun, 2018 - Nov, 2021 (3 yr 5 months)
    Provided 24/7 production support, performed Linux server provisioning, patch management, and disaster recovery. Worked on incident resolution, performance monitoring, technical troubleshooting, and collaborated with cross-functional teams.

Achievements

  • Automated deployments: Successfully implemented CI/CD pipelines (e.g., Jenkins, GitLab CI/CD) to automate deployments, leading to faster release cycles and reduced manual errors.
  • Cloud Migration: Successfully migrated on-premises infrastructure to the cloud (e.g., AWS, Azure) with minimal downtime, resulting in increased scalability, cost efficiency, and agility.
  • Security best practices: Implemented security best practices like infrastructure hardening, access control, and vulnerability management, improving the overall security posture of the infrastructure.
  • Knowledge sharing: Mentored junior engineers or colleagues on DevOps tools and best practices, fostering a culture of continuous learning and knowledge sharing within the team.

Major Projects

4 Projects

Other

Confidential
Aug, 2017 - May, 2024 (6 yr 9 months)
    • Deployed 100+ projects with multiple tech stacks such as PHP, Laravel, Node.js, ReactJS, and WordPress on multiple environments such as VMs, bare metal, EC2, containers, and clusters, using various cloud providers such as AWS, Azure, DigitalOcean, Oracle, and IBM.

SocialMedia Application

Confidential
Sep, 2022 - Feb, 2024 (1 yr 5 months)
    • A social media platform for dogs with over 3 lakh users.
    • As a DevOps engineer on this social media app, my responsibilities covered a wide range of tasks focused on automating, streamlining, and securing the development, deployment, and maintenance processes. Used AWS to provision and manage cloud resources such as servers, databases, storage, and networking components with Infrastructure as Code (IaC) tools such as Terraform or Ansible.

Multimedia Application With OTT

Confidential
Apr, 2022 - Jan, 2024 (1 yr 9 months)
    • A horror movie and podcast platform with 4 lakh+ users.
    • Implemented best practices such as CI/CD pipeline management (CodeCommit), containerization with Docker and Kubernetes (EKS), performance and scalability optimization, infrastructure hardening, object storage for media content (S3) with a CDN (CloudFront), and a multi-staging database (RDS).

Ecommerce(Magento)

Dec, 2021 - Oct, 2023 (1 yr 10 months)
    • E-commerce web application for an EU client.
    • The website backend is in Magento, deployed on a bare-metal server provisioned on Interserver.net.
    • Elasticsearch, Redis, and Varnish were implemented along with apache2. Block storage and a CDN were used for media assets.

Education

  • Bachelor of Commerce with Computer Science

    MGSU University Bikaner (2019)

Certifications

  • Project Management

    Great Learning Academy (Feb, 2024)
  • AWS Cloud Practitioner by Amazon

  • Elastic Stack by Great Learning Academy

  • Cloud Computing Architecture by Great Learning Academy

  • Microsoft Azure Essentials by Great Learning Academy

Interests

  • Trekking
  • Travelling
  • Learning
  • Exploring Places

AI-interview Questions & Answers

    Could you help me understand more about your background? Even a brief introduction of yourself.

    My name is Amit, and I'm working as a production engineer in Manors, India. Currently, we are working on multiple projects where we need to deploy applications on Kubernetes with auto scaling and high scalability. After the deployment, we need to take care of the monitoring, the security, and the continuous integration. These are the things we are currently working on. I have more than 5 years of experience now, and I keep learning new things and new skills.

    Okay. Let's suppose we have multiple tech stacks and we are not depending only on the AWS cloud. In that kind of scenario, we can consider Terraform as well. There are a few dependencies: if your Kubernetes cluster and everything else is running on a different cloud, then you can't reuse AWS CloudFormation. But if you are only working with AWS, you can definitely go with AWS CloudFormation. That's what I think about it.

    You need to automate a regular task that must be performed on your Linux server. Which scripting language would you prefer to use among Python, Ruby, and Perl, and why?

    Okay. If there are multiple Linux servers and I need to run the same task on them regularly, I will definitely go with Python first. I also use Ansible, which is Python-based and uses the YAML format, to automate the task across multiple servers in a single click. Thank you.

    Can you outline the steps to create a secure AWS architecture for a new application?

    The architecture is not mentioned in the question, so for now I'm talking about a scalable one. First of all, we can create a separate VPC. In that secure VPC, we can create a set of EC2 machines and apply auto scaling and a load balancer; the load balancer is public-facing. You can deploy your application via CodeCommit: we can use CodeCommit for the code side, and for the automation I will use CodePipeline. Also, if we are running on EC2 machines, there are multiple security tools, for DDoS and so on, that we can implement. For the secure layer, we can put Cloudflare in front as the first line of interaction. Cloudflare will handle the client first, and after that it will route the traffic to the load balancer and onward from there. In this way, we can build a secure setup for our new application. Thank you.

    How do you handle version control and deployment in a multi-cloud environment to ensure consistency across production and non-production setups?

    As I understand it, for Git, we can create multiple branches. Let's say we have a different cloud for production, a different one for development, and a different one for staging. We can divide the environments, production, development, and so on, into branches. Whenever a developer uploads code to a specific branch, the code will go to that particular environment. For example, if the development environment is on AWS, code on that branch will go to the AWS side. If production is on Azure and a developer uploads to the production branch, the code will go to the Azure side. In that way, I guess, we can handle these multiple setups. Thank you.
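
The branch-per-environment idea in this answer can be sketched as a small lookup table. This is only an illustration under stated assumptions: the branch names, the `Target` type, and the `resolve_target` function are hypothetical, and the AWS/Azure split simply mirrors the example the candidate gives.

```python
from typing import NamedTuple


class Target(NamedTuple):
    """Where a branch is deployed: an environment name and a cloud provider."""
    environment: str
    cloud: str


# Hypothetical mapping. The answer only says that development could live on
# AWS and production on Azure; the branch names here are assumptions.
BRANCH_TARGETS = {
    "develop": Target("development", "aws"),
    "main": Target("production", "azure"),
}


def resolve_target(branch: str) -> Target:
    """Return the environment and cloud that a pushed branch deploys to."""
    try:
        return BRANCH_TARGETS[branch]
    except KeyError:
        # Refuse to deploy branches that have no configured environment.
        raise ValueError(f"no deployment target configured for branch {branch!r}") from None
```

A CI job could call `resolve_target` with the pushed branch name and pick the matching cloud credentials and deployment step; keeping the mapping in one place is what keeps the environments consistent.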

    In your experience, has implementing high-level system design with an emphasis on Lean as well as Agile resulted in shorter time frames or fewer production errors?

    As I think about Agile, first we need to understand the requirement. Whatever the requirement is, whatever tech stack we are using, we have to be clear about it. Once we have a clear picture, we need to think a little about the users, the public who will be coming to our application in, let's say, the next few months. Accordingly, we have to give our application proper scalability and have a proper solution for monitoring. Security-wise, as a DevOps engineer, I can take care of the infrastructure, monitoring, security, and scalability. We implement all of this with our current tech stack in mind, and we can use the latest tools and technologies along with it. We can also check how many users we are going to face in the release phase and prepare accordingly.

    Given the below Java code, you have received a NullPointerException error at runtime. Find the possible root cause.

    Okay. So, public class, stream, public system. Wait a minute. I don't have exact Java knowledge because I'm not from a coding background. In this scenario, I will definitely reach out to a colleague who is working on it. I'll sit with them and discuss the issue, if they are not busy with their own specific task. I'll ask for help, talk to them, and try to get it fixed. Directly, I don't have the Java knowledge, but with my team I'll definitely work on it and make it run for sure. Thanks.

    The following Kubernetes YAML configuration for a deployment has an issue that will prevent successful deployment.

    Deployment. The nginx deployment, replicas 3, nginx 1.7.9, port. Okay. In this case, the first thing I can see is a syntax error in the container port: the "c" in the container spelling is in lowercase. The other thing is the matchLabels. I can see selector, template, and spec. There are 3 replicas with the matchLabels of nginx. Okay, the label will be nginx1. So here we define the label name as nginx1, but in matchLabels we just said nginx. That is also an issue: whenever a new replica is created, it will check the label, but the label name in the metadata is different from the name in matchLabels. Apart from that, I don't see anything else currently. The matchLabels mismatch is the issue, I guess: the label is nginx1 and the matchLabels name is only nginx. I think we need to fix that. Thank you.
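
The selector/label mismatch described in this answer can be checked mechanically. The sketch below is an assumption-laden illustration: the manifest shown in the interview is not part of this page, so the `broken` dictionary is a hypothetical reconstruction from the spoken description (the `app` key, the `nginx1` spelling, and port 80 are all guesses), and `selector_matches_template` is a hypothetical helper, not a Kubernetes API.

```python
def selector_matches_template(deployment: dict) -> bool:
    """Return True when every matchLabels pair also appears in the pod template labels."""
    spec = deployment.get("spec", {})
    match_labels = spec.get("selector", {}).get("matchLabels", {})
    template_labels = spec.get("template", {}).get("metadata", {}).get("labels", {})
    # An empty selector is treated as a failure in this sketch.
    if not match_labels:
        return False
    return all(template_labels.get(key) == value for key, value in match_labels.items())


# Hypothetical manifest, loosely reconstructed from the spoken description:
# the selector asks for "nginx" while the pod template is labelled "nginx1".
broken = {
    "apiVersion": "apps/v1",
    "kind": "Deployment",
    "spec": {
        "replicas": 3,
        "selector": {"matchLabels": {"app": "nginx"}},
        "template": {
            "metadata": {"labels": {"app": "nginx1"}},
            "spec": {
                "containers": [
                    {"name": "nginx", "image": "nginx:1.7.9", "ports": [{"containerPort": 80}]}
                ]
            },
        },
    },
}

print(selector_matches_template(broken))
```

For an `apps/v1` Deployment the selector has to match the pod template labels, so a check like this can catch the problem before the manifest is applied. Changing either side so that both read the same value makes the check pass.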

    Describe a challenging DevOps project where your depth of knowledge in Linux, cloud services, and scripting languages played a key role in your problem-solving approach.

    Okay. There was one project where the client was in-house at that time, and we needed to make a specific feature live in our application. The issue was that this feature was running on the staging side but not on the production side. We checked a few things to see what was missing. As mentioned in the description as well, I found that the Linux version running on staging was different at that point because it had just been updated, while on the production side the version was a little lower. First we checked everything from the DevOps perspective, including the DevOps tools and technologies, and everything seemed fine except one very small thing, which was the Linux version. With that specific version, some tools were not working properly. So we needed to go through the core Linux path and check what was missing. That's how we resolved it. It will take around 6 to 7,