Designed, Executed, and maintained highly available, scalable, and cost-optimized cloud infrastructure on AWS and Azure following SRE and DevOps best practices like Cloud Security, Disaster Recovery and Network Security. Optimized AWS cloud costs using AWS Cost Explorer, billing dashboards, tagging strategies, and cost allocation reports, achieving 15% cost reduction. Configured AWS EC2 On-Demand instances, Auto Scaling Groups (ASG), custom AMIs, and load-based scaling policies to improve performance and fault tolerance which reduced downtime by 20%. Managed 3000+ AWS Instances. Setup Azure Virtual Machines and VM Scale Sets (VMSS) with autoscaling, alerts, and Azure Monitor to improve availability and reduce service impact. Managed AWS Route 53 DNS with public and private hosted zones, internal and external DNS records, and health checks for reliable traffic routing. Built and maintain CI/CD pipelines using GitHub Actions, Jenkins, and AWS CodeBuild for automated build, test, and deployment across AWS, Azure, and Kubernetes. Provisioned and managed multi-cloud infrastructure using Terraform (Infrastructure as Code), reducing infrastructure provisioning time by 30%. Automated server provisioning and configuration management using Ansible, reducing manual operational effort by 20%. Containerized applications using Docker by creating and optimizing Dockerfiles, integrating Docker builds into CI/CD pipelines, managing images in AWS ECR, and deploying to Kubernetes (EKS/AKS). Deployed, configure, and operated Kubernetes clusters to ensure high availability, scalability, and resilience of containerized workloads. Implemented GitOps workflows using Argo CD for automated Kubernetes deployments, version-controlled manifests, and rollback strategies. Managed Kubernetes application deployments using Helm charts with templating, versioning, and environment- specific configurations. Monitored Kubernetes pods, nodes, and services using Prometheus and Grafana to track resource utilization, latency, and error rates. Developed Bash and Python automation scripts for Linux system administration to reduce repetitive tasks and improve efficiency. Applied centralized monitoring and observability using AWS CloudWatch with AWS SNS, Prometheus, and Grafana for proactive issue detection.