profile-pic

Rakesh Reddy

Rakesh Reddy

Experienced DevOps/S.R.E with 8 years of expertise in automating, deploying, and managing applications in cloud and on-premises environments. Proficient in leveraging a wide range of technologies, including AWS, GCP, Azure,Docker, Kubernetes, and Terraform, to streamline and enhance the software development lifecycle. Demonstrated success in optimizing CI/CD pipelines, improving system reliability, and fostering collaboration between development and operations teams. Adept at scripting with Python, Bash, and other tools to automate infrastructure and workflows, ensuring high availability and scalability of mission-critical systems. Strong background in monitoring, performance tuning, and security best practices, with a proven track record of delivering robust solutions that align with business objectives. Passionate about continuous learning and driving innovation in dynamic, fast-paced environments.

  • Role

    Cloud System Administration

  • Years of Experience

    8 years

Skillsets

  • CLI - 4 Years
  • Jenkins - 7 Years
  • Azure - 3 Years
  • EKS
  • AKS
  • BigQuery - 6.0 Years
  • Dynatrace
  • ELK
  • GitHub Actions
  • Scc
  • Gke cluster
  • Tanium
  • Container management
  • Cloud API - 4 Years
  • cloud computimg - 7 Years
  • Computer & Network Security - 7 Years
  • SOLID principles - 4 Years
  • Agile - 7 Years
  • Monitoring - 5 Years
  • Project management - 5 Years
  • Identity and Access Management (IAM) - 8.0 Years
  • SRE - 6.0 Years
  • GKE - 5.0 Years
  • GCP certification - 3.0 Years
  • Ansible
  • SDK - 3 Years
  • CI/CD - 7 Years
  • Kubernetes - 5 Years
  • Azure DevOps - 3 Years
  • YAML - 5 Years
  • Terraform - 5 Years
  • Python - 3 Years
  • AWS - 7 Years
  • GCP - 6.0 Years
  • Shell Scripting
  • Git
  • Splunk - 4 Years
  • Docker - 5 Years
  • Datadog - 3 Years
  • MLOps
  • Airflow
  • Dataflow
  • MongoDB
  • Emr
  • Container orchestration
  • Postgresql administration
  • Dataproc

Professional Summary

8Years
  • Jan, 2024 - Present1 yr 3 months

    Senior Site Reliability Engineer

    Brevan Howard
  • Aug, 2023 - Oct, 2023 2 months

    Senior Architect

    Devon Software Services
  • Dec, 2021 - Mar, 20231 yr 3 months

    Senior Software Engineer

    Zebra Technologies
  • Aug, 2015 - May, 20182 yr 9 months

    DevOps Engineer

    Accend Systems Private Limited
  • Sep, 2018 - Mar, 2019 6 months

    Process Associate

    Amazon
  • Jan, 2020 - Sep, 20211 yr 8 months

    Data Validation Engineer

    Alchemy Techsol

Applications & Tools Known

  • icon-tool

    AWS

  • icon-tool

    GCP

  • icon-tool

    Terraform

  • icon-tool

    Ansible

  • icon-tool

    Git

  • icon-tool

    GitHub Actions

  • icon-tool

    Jenkins

  • icon-tool

    Kubernetes

  • icon-tool

    Docker

  • icon-tool

    Datadog

  • icon-tool

    Crowdstrike

  • icon-tool

    Nessus

  • icon-tool

    Bigquery

  • icon-tool

    ELK

  • icon-tool

    EKS

  • icon-tool

    AKS

  • icon-tool

    Azure

  • icon-tool

    Dynatrace

  • icon-tool

    MongoDB

  • icon-tool

    Airflow

  • icon-tool

    EMR

  • icon-tool

    Dataproc

  • icon-tool

    Dataflow

  • icon-tool

    Tanium

  • icon-tool

    Looker

  • icon-tool

    Apache Tomcat

  • icon-tool

    Amazon Macie

  • icon-tool

    CloudWatch

  • icon-tool

    SNS

  • icon-tool

    Shell Scripting

  • icon-tool

    GitHub

  • icon-tool

    BigQuery

  • icon-tool

    SonarQube

  • icon-tool

    JFrog Artifactory

Work History

8Years

Senior Site Reliability Engineer

Brevan Howard
Jan, 2024 - Present1 yr 3 months
    Designed and implemented infrastructure solutions using Terraform to automate cloud resources provisioning on AWS and GCP. Managed containerized applications and services using Docker and Kubernetes, ensuring high availability and scalability. Developed and maintained CI/CD pipelines using GitHub Actions and Jenkins, resulting in streamlined deployments and faster release cycles. Automated configuration management tasks with Ansible, improving deployment efficiency and reducing manual intervention. Implemented monitoring and alerting solutions using Datadog, leading to improved system reliability and proactive issue resolution. Created Shell scripts for system administration tasks, including automated backups, system updates, and log management.

Senior Architect

Devon Software Services
Aug, 2023 - Oct, 2023 2 months
    Upgraded existing Terraform modules codebase from an older version to the latest stable version, improving infrastructure as code management. Migrated configuration management from Puppet to Ansible, streamlining configuration processes and improving system automation. Managed the upgrade of EKS clusters to version 1.24 on AWS, enhancing the performance, stability, and security of containerized applications. Integrated Twistlock into the CI/CD pipelines for image vulnerability scanning, ensuring security compliance and reducing risks associated with containerized applications. Implemented and maintained Jenkins-based CI/CD pipelines for automated build, test, and deployment processes. Developed and managed Docker container images for application deployments, ensuring consistent environments across development and production.

Senior Software Engineer

Zebra Technologies
Dec, 2021 - Mar, 20231 yr 3 months
    Business Unit: GMSS Implemented MLOps as part of the GMSS VSDA Team by leveraging GCP services such as BigQuery, Dataproc Clusters, and Dataflow jobs for real-time data streaming and migration from ELK Stack to BigQuery and Looker on GCP. Deployed ML Models as part of CI/CD pipelines using Jenkins and Google Cloud Build, automating the deployment of machine learning models and improving deployment efficiency. Created and Managed Projects across GCP, AWS, and Azure by setting up multiple parent and child projects for various environments using Terraform. Implemented Best Practices and Organizational Policies for GCP and AWS to ensure compliance with industry standards and organizational requirements. Monitored Production Workloads on Airflow Composer in GCP for job orchestration and workflow management, ensuring timely and reliable data processing. Remediated Security Vulnerabilities by addressing critical events within the given SLA on the Security Command Center in GCP, enhancing the security posture of cloud environments. Performed Security and OS Patching on all VMs in production and lower environments, ensuring systems are up-to-date and secure. Installed and Upgraded Security Agents like LogRhythm, Nessus, and Falcon Sensor, including on GKE and EKS clusters to ensure comprehensive security coverage. Documented Installation and Troubleshooting Procedures for both CoS and Normal clusters, creating and maintaining detailed documentation for support and operational purposes. Integrated OpsRamp Monitoring from the previous PagerDuty setup to enhance incident management and monitoring capabilities. Automated Patching Processes by transitioning from manual to automated patching using Ansible, streamlining system maintenance operations. Updated Apache Airflow Composer to the latest version, ensuring access to new features and improvements.

Data Validation Engineer

Alchemy Techsol
Jan, 2020 - Sep, 20211 yr 8 months
    Implemented Infrastructure as Code (IaC): Utilized Terraform to define and manage AWS infrastructure resources including EC2 instances, RDS databases, VPCs, and S3 buckets. Designed and Managed CI/CD Pipelines: Developed CI/CD pipelines with Jenkins for automating builds, tests, and deployments. Configured and Deployed Containerized Applications: Leveraged Docker for containerizing applications and Amazon ECS for managing containerized services. Orchestrated Containerized Workloads: Managed Kubernetes clusters using Amazon EKS for deploying, scaling, and maintaining applications. Implemented Monitoring and Logging Solutions: Set up AWS CloudWatch for monitoring resources and Elasticsearch for logging and data analysis. Automated Operational Tasks: Developed Shell Scripts and used Ansible for automating deployments, configurations, and updates. Enhanced Security and Compliance: Applied best practices for AWS security, including IAM roles and policies, and configured AWS Config for compliance. Optimized Infrastructure Costs: Conducted cost optimization and resource management, reducing AWS expenditures by 20%. Implemented Backup and Recovery Solutions: Configured AWS Backup and developed recovery procedures for data integrity and availability. Collaborated with Development Teams: Worked with teams to understand requirements, provide technical guidance, and support deployments.

Process Associate

Amazon
Sep, 2018 - Mar, 2019 6 months
    Assisted in Infrastructure Management: Supported the setup, configuration, and maintenance of cloud infrastructure using AWS services including EC2, S3, RDS, and VPC, ensuring stable and scalable environments. Contributed to CI/CD Pipeline Development: Assisted in the development and maintenance of CI/CD pipelines using Jenkins for automating the build, test, and deployment processes for various applications. Managed Source Code Repositories: Utilized Git for version control, managing code branches, performing merges, and resolving conflicts, ensuring code quality and integration. Automated Routine Tasks: Developed Shell Scripts and used Ansible for automating repetitive tasks such as software installations, configurations, and updates.

DevOps Engineer

Accend Systems Private Limited
Aug, 2015 - May, 20182 yr 9 months
    As a Junior DevOps Engineer at Verizon Data Services, a leading telecom company in the U.S. with a substantial customer base, I was involved in various facets of build automation, deployment, and continuous integration for applications within the Test Data Management (TDM) module and JITR (Just-in-Time Release) subsystem. This role required a combination of technical skills and collaborative efforts to streamline processes, manage environments, and support application development and deployment. Build Environment Setup: Set up and maintained build environments for various applications across Windows and Linux platforms. Build and Automation Tools: Edited and maintained existing Ant and Maven build files (build.xml, pom.xml); utilized Ant and Maven for build automation tasks. Source Code Management: Cloned code from Git repositories; performed checkouts, branching, and merging tasks using Git. CI/CD Pipeline Management: Managed and optimized Jenkins CI/CD pipelines for continuous integration and deployment. Application Deployment: Built and deployed EAR and WAR files using Maven; configured and deployed archives to Apache Tomcat and WebSphere Application Servers. Containerization and Orchestration: Worked with Docker components for creating, managing, and deploying containerized applications; managed containerized applications on AWS using Docker Containers and Kubernetes. Configuration Management: Implemented Chef Recipes and Cookbooks for deployment and configuration management on internal data center servers. AWS Cloud Services: Configured AWS resources including EC2 instances, Elastic Load Balancing, Auto Scaling, VPC, S3, CloudFront, and IAM roles; utilized AWS CloudFormation for infrastructure as code. Security and Compliance: Configured and maintained AWS Security Services such as Security Hub and Amazon Macie for security monitoring and data protection. Support and Troubleshooting: Provided day-to-day support for users; resolved build script issues and assisted with schema upgrades and message publications. Documentation and Process Improvement: Documented build processes, deployment procedures, and troubleshooting steps; converted Ant scripts to Maven to optimize build processes. Backup and Recovery: Managed backup and recovery tasks, including creating dump backups and restoring data for clients and offshore environments.

Education

  • B.Tech/B.E.

    PES School of Engineering, Bangalore (2015)

Certifications

  • Continuing professional education certificate - immersive labs