profile-pic
Vetted Talent

Rottela Sudheer Kumar

Vetted Talent

Completed post-graduation in Master of Computer Applications (MCA). I used to listen motivational songs, watch mind blowing movies, read self-help and spiritual books. I'm a mentor for new aspirants and help in their career growth. Received 2nd rank during intermediate and achieved gold medal during graduation. I also used to write poems and received 2nd prize in Infosys poem competition.

Received customer delight award, game changer award, and many appreciations during

career journey.


Worked as a Site Reliability Engineer at Infosys BPM. The following are my key roles and responsibilities:


- Implementing practices like SLA's for accessibility, availability, and response time of systems/services to make them reliable.

- Creating automated processes from operational aspects by calculating and evaluating whether the systems/services are with in SLA or not.

- Monitoring and logging to measure performance of systems/services and detect issues before or at early stages.

- Providing on-call support to find the improvements required in existing systems.

- Finding Root Cause Analysis (RCA's) while detecting the issues and provide additional protection to systems.

- Documenting post incident reviews after issue or after outage for future reference.

- Working towards same principles/goals of DevOps for fast releases while allowing fast changes.

- Providing KT's for new joiners and guide them with domain specific technologies.

  • Role

    AWS DevOps Engineer (SRE)

  • Years of Experience

    5 years

Skillsets

  • Bash
  • ServiceNow
  • ELK Stack
  • Splunk
  • Unix
  • Puppet
  • Prometheus
  • Nagios
  • Grafana
  • Jira
  • Ansible - 1 Years
  • AWS
  • MySQL
  • Python - 2 Years
  • Kubernetes - 2 Years
  • Jenkins - 2 Years
  • Git - 2 Years
  • Docker - 1 Years
  • Terraform - 1 Years

Vetted For

26Skills
  • Roles & Skills
  • Results
  • Details
  • icon-skill_image
    Senior Software Engineer - Site Reliability(Remote)AI Screening
  • 28%
    icon-arrow-down
  • Skills assessed :Chef, gitlabci, OpenShift, PagerDuty, Pingdom, Puppet, Salt, smashtest, TravisCI, twelve factor development, Agile Methodology, Ansible, CircleCI, Infrastructure as Code (IaC), NewRelic, Terraform, C#, Cloud Server (Google / AWS), Docker, Git, JavaScript, Jenkins, Kubernetes, PHP, Python, Ruby
  • Score: 25/90

Professional Summary

5Years
  • Dec, 2023 - Present1 yr 7 months

    AWS DevOps Engineer (SRE)

    JPMC (Payroll Company: Snapminds)
  • Mar, 2019 - Nov, 20234 yr 8 months

    Technical Specialist (SRE)

    InfosysBPM

Applications & Tools Known

  • icon-tool

    Chef

  • icon-tool

    Nagios

  • icon-tool

    Prometheus

  • icon-tool

    Docker

  • icon-tool

    Kubernetes

  • icon-tool

    Ansible

  • icon-tool

    Terraform

  • icon-tool

    Puppet

  • icon-tool

    Git

  • icon-tool

    GitHub

  • icon-tool

    Jenkins

  • icon-tool

    ServiceNow

  • icon-tool

    Jira

  • icon-tool

    MySQL

  • icon-tool

    Slack

Work History

5Years

AWS DevOps Engineer (SRE)

JPMC (Payroll Company: Snapminds)
Dec, 2023 - Present1 yr 7 months
    Monitored and troubleshooted production issues, identified root causes, and implemented solutions to prevent recurrence, ensuring high system availability. Worked closely with cross-functional teams to identify and resolve performance bottlenecks, leveraging strong analytical and problem-solving skills. Developed and maintained comprehensive system documentation, including runbooks, standard operating procedures, and system diagrams. Participated in on-call rotations to provide 24/7 support for production systems, ensuring minimal downtime. Stayed updated with the latest advancements in Site Reliability Engineering, applying innovative approaches to maintain a competitive edge.

Technical Specialist (SRE)

InfosysBPM
Mar, 2019 - Nov, 20234 yr 8 months
    Leveraged cloud platforms such as AWS and Azure to deploy and manage scalable applications. Utilized automation tools like Chef to streamline operations and enhance system efficiency. Demonstrated strong understanding of Linux operating systems and command-line tools to manage and troubleshoot systems. Implemented and managed monitoring and logging solutions using tools like Nagios, Prometheus, and ELK stack to ensure system health and performance. Collaborated with development teams to optimize system performance and scalability, contributing to overall system reliability.

Achievements

  • Designed and implemented a comprehensive monitoring system using Prometheus and Grafana.
  • Automated alerting processes, reducing incident response time by 30%.
  • Integrated monitoring solutions with Slack for real-time notifications.
  • Conducted training sessions for team members on using the new monitoring tools.
  • Optimized Kubernetes deployments to improve scalability and reliability.
  • Reduced deployment times by 50% and increased system uptime by 20%.
  • Collaborated with development teams to ensure smooth rollouts and minimal disruptions.
  • Developed an automated backup solution using AWS Backup and Python scripts.
  • Ensured data integrity and compliance with company policies.
  • Conducted regular disaster recovery drills, reducing recovery times by 40%.
  • Documented all backup processes and trained junior engineers.

Major Projects

3Projects

Enterprise Monitoring System Implementation

Dec, 2023 - Present1 yr 7 months
    Designed and implemented a comprehensive monitoring system using Prometheus and Grafana. Automated alerting processes, reducing incident response time by 30%. Integrated monitoring solutions with Slack for real-time notifications. Conducted training sessions for team members on using the new monitoring tools.

Kubernetes Deployment Optimization

Mar, 2019 - Nov, 20234 yr 8 months
    Optimized Kubernetes deployments to improve scalability and reliability. Reduced deployment times by 50% and increased system uptime by 20%. Collaborated with development teams to ensure smooth rollouts and minimal disruptions.

Automated Backup and Recovery Solution

Mar, 2019 - Nov, 20234 yr 8 months
    Developed an automated backup solution using AWS Backup and Python scripts. Ensured data integrity and compliance with company policies. Conducted regular disaster recovery drills, reducing recovery times by 40%. Documented all backup processes and trained junior engineers.

Education

  • Master in Computer Applications (MCA)

    Sri Krishnadevaraya University (2019)

Certifications

  • Customer Delight Award

    Infosys BPM
  • Customer Delight Award

    Infosys BPM
  • Insta Award

    Infosys BPM