IBM

Site Reliability Engineering Professional (Compute SRE)

Shell, Python, Ansible
Description
As a Compute Operations Site Reliability Engineer, you will perform the following tasks:
  • Remotely administer Power Server hardware environments across numerous datacenter locations around the world (currently 18 datacenters and growing).  
  • Develop automation to reduce manual toil (repetitive tasks that can be automated) using shell scripts (Bash, etc.), Python, Ansible, and related tools and languages.
  • Perform code stack updates on infrastructure systems (VIOS, firmware, PowerVC, HMC, NovaLink, NIM servers) as well as cloud supporting systems (jump servers, sobox, network nodes, gateways, TSM servers).
  • Upload/maintain stock images.
  • Remotely administer AIX and Linux servers.
  • Maintain user IDs (add/delete) and passwords.
  • Monitor daily/weekly backups to ensure they are working.
  • Manage and maintain the Nagios monitoring environment, and troubleshoot scripts/plug-ins when issues arise.
  • Perform periodic LPMs, inactive migrations, or remote restarts of customer VMs to perform system maintenance, balance workloads, or free up resources.
  • Monitor and report the capacity utilized in each datacenter.
  • Attend scheduled meetings planned by the customer for cutover/maintenance windows.
  • Verify capacity requirements when customers report provisioning failures.
  • Work with customers to resolve any RSCT issues so that LPM activities can be performed without impacting customer workloads.
