Nirmal Jeet Singh

With over 13 years of experience, I am a Linux Engineer, System Administrator, and DevOps engineer with expertise in cloud computing, infrastructure automation, and system reliability. I specialize in resolving complex customer issues, drawing on a strong command of Python and Bash scripting. I debug and troubleshoot Linux servers to keep systems performing well, and I manage alarm and monitoring systems for VB fleets using Terraform and T2 metrics. My experience spans infrastructure migrations with a focus on security, performance analysis including network and database optimization, and build and deployment processes that support development teams in LRT testing. I have hands-on experience with CI/CD pipelines, Jenkins-based build and deployment testing, automated application deployment, and source code management with GitHub, including cloning, branching, and merging. I have also managed VM migrations with Shepherd tooling, applied patch updates to keep systems secure and functional, and collaborated with cross-functional teams to manage customer instances. My background further includes file system management with LVM, network administration (IP addressing, subnetting, VLANs, LAN, the OSI model, TCP/IP, and Ethernet), and package installation using RPM and YUM on Red Hat Linux.

  • Role

    Principal Member Technical Staff (SRE)

  • Years of Experience

    13 years

Skillsets

  • CDN
  • VPN
  • VPC
  • TLS
  • Terraform
  • OCI
  • Kerberos
  • GCP
  • Prometheus
  • PAT
  • Networking
  • NAT
  • Load balancers
  • iptables
  • DHCP
  • Ansible
  • SQL
  • Python
  • Nagios
  • MySQL
  • Linux
  • LDAP
  • Kubernetes
  • Jenkins
  • Grafana
  • Git
  • Docker
  • DNS
  • Bash

Professional Summary

13 Years
  • May, 2022 - Present (3 yr 5 months)

    Principal Member Technical Staff (SRE)

    Oracle
  • Oct, 2018 - May, 2022 (3 yr 7 months)

    Sr. Linux Developer

    SAP Labs
  • Sep, 2017 - Sep, 2018 (1 yr)

    Linux Administrator

    FIS
  • Jun, 2016 - Aug, 2017 (1 yr 2 months)

    Service Delivery Engineer

    Hewlett Packard
  • May, 2013 - May, 2016 (3 yr)

    Project Engineer

    Wipro

Work History

13 Years

Principal Member Technical Staff (SRE)

Oracle
May, 2022 - Present (3 yr 5 months)
    Debug and troubleshoot Linux servers to maintain reliability and performance across Red Hat Linux, CentOS, Ubuntu, and Solaris. Developed automated log analysis, configuration validation, patch compliance checks, and command-line tooling in Python and Bash. Resolve critical customer issues within strict timelines, improving client satisfaction. Implemented Linux hardening for 4000+ production and staging VMs across GCP and OCI, achieving a compliance score above 90%. Built CI/CD pipelines and builds using Jenkins and Git, reducing deployment time to 60%. Maintain large-scale systems and applications with a focus on reliability, efficiency, and scalability. Monitor Oracle database performance on Linux. Administer network configuration, including IP addressing and subnetting, and manage network technologies such as TCP/IP, UDP, DNS, and DHCP. Apply patch updates to keep systems secure and current with the latest releases. Optimized system performance and security by tuning kernel parameters.

Sr. Linux Developer

SAP Labs
Oct, 2018 - May, 2022 (3 yr 7 months)
    Handled system installation, configuration, and performance tuning, along with database and OS integration, backup and recovery, monitoring, and troubleshooting. Developed and administered Linux systems, using Bash and Python scripting to automate manual admin tasks. Implemented auto-check and auto-healing mechanisms in Linux environments using Nagios. Used Google Cloud Platform to improve cloud-based solutions and infrastructure and to manage a large fleet of Linux VMs (20000+). Built command-line tools for routine work such as fleet scans, backup checks, health checks, and job scheduling. Basic experience with LDAP, DNS, Kerberos, TLS, and load balancers. Authored wikis, SOPs, and runbooks to standardize team processes and improve knowledge sharing. Automated backup, log cleanup, and patch management using Bash and Python scripts, reducing manual effort by 70%. Deployed and maintained Nagios and Zabbix for performance monitoring and alerting on Linux infrastructure. Administered 200+ RHEL and Ubuntu servers supporting mission-critical workloads in hybrid data center and cloud environments. Hardened Linux systems based on CIS Level 1 policies: disabled unnecessary daemons, restricted root access, and enforced password policies and session timeouts.

Linux Administrator

FIS
Sep, 2017 - Sep, 2018 (1 yr)
    Focused on on-prem servers, patching, security compliance, and OS-level hardening. Maintained and managed Linux systems and servers (10000+) for stability, security, and performance. Continuously monitored server performance, focusing on resource utilization (CPU, memory, disk space), network traffic, and application health. Monitored system logs to detect and address suspicious activity and potential security breaches. Minimized downtime by engaging the right SMEs to address issues.

Service Delivery Engineer

Hewlett Packard
Jun, 2016 - Aug, 2017 (1 yr 2 months)
    Client: Deutsche Bank (banking domain). Performed system maintenance, including security patching, updates, and upgrades for the OS and software. Managed storage, backup, and recovery procedures to safeguard data integrity and availability. Handled client browser configuration and troubleshooting. Troubleshot hardware and software issues, identified root causes, and deployed resolutions. Wrote and maintained automation scripts for system administration, including server provisioning and configuration management. Prepared and updated technical documentation and SOPs in the team wiki. Troubleshot networking issues using Windows tools such as ping, ipconfig, and route.

Project Engineer

Wipro
May, 2013 - May, 2016 (3 yr)
    Client: British Petroleum. Worked on UNIX and Linux platforms including Red Hat, Solaris, and HP-UX. Prioritized and resolved tickets and alarms based on severity levels. Maintained clear communication with users and stakeholders on updates and issue resolution. Created and managed user accounts, including setting appropriate permissions and access controls, and provided user support for account management and access issues. Configured and troubleshot Samba (SMB), CIFS, and NFS shares and security protocols. Administered Unix core services and applications such as NIS, NFS, Automount, DNS, DHCP, SMB, FTP, Sendmail, Apache, NTP, PTP, sudo, LDAP, SSH, and iptables. Assisted in log monitoring and troubleshooting system crashes. Wrote shell scripts to automate log rotation, backup, and monitoring tasks.

Major Projects

1 Project

High Speed Automated Administrative Tool

    Developed a tool to automate bulk activities and eliminate manual work. Automated patch and firmware updates on Linux hosts. Automated the SUSE OS upgrade process and developed scripts for host deployment and migration in the cloud.

Education

  • MSc Computer Science

    Banaras Hindu University (2012)
  • BSc Computer Science and Mathematics

    Gorakhpur University (2010)