Career Category
Information SystemsJob Description
ABOUT AMGEN
Amgen harnesses the best of biology and technology to fight the world’s toughest diseases, and make people’s lives easier, fuller and longer. We discover, develop, manufacture and deliver innovative medicines to help millions of patients. Amgen helped establish the biotechnology industry more than 40 years ago and remains on the cutting-edge of innovation, using technology and human genetic data to push beyond what’s known today.
ABOUT THE ROLE
Let’s do this. Let’s change the world.
We are looking for a Site Reliability Engineer/Cloud Engineer (SRE) to work on the performance optimization, standardization, and automation of Amgen’s critical infrastructure and systems. This role is crucial to ensuring the reliability, scalability, and cost-effectiveness of our production systems. The ideal candidate will work on operational excellence through automation, incident response, and proactive performance tuning, while also reducing infrastructure costs. You will work closely with cross-functional teams to establish best practices for service availability, efficiency, and cost control.
Roles & Responsibilities:
System Reliability, Performance Optimization & Cost Reduction: Ensure the reliability, scalability, and performance of Amgen’s infrastructure, platforms, and applications. Proactively identify and resolve performance bottlenecks and implement long-term fixes. Continuously evaluate system design and usage to identify opportunities for cost optimization, ensuring infrastructure efficiency without compromising reliability.
Automation & Infrastructure as Code (IaC): Drive the adoption of automation and Infrastructure as Code (IaC) across the organization to streamline operations, minimize manual interventions, and enhance scalability. Implement tools and frameworks (such as Terraform, Ansible, or Kubernetes) that increase efficiency and reduce infrastructure costs through optimized resource utilization.
Standardization of Processes & Tools: Establish standardized operational processes, tools, and frameworks across Amgen’s technology stack to ensure consistency, maintainability, and best-in-class reliability practices. Champion the use of industry standards to optimize performance and increase operational efficiency.
Monitoring, Incident Management & Continuous Improvement: Implement and maintain comprehensive monitoring, alerting, and logging systems to detect issues early and ensure rapid incident response. Lead the incident management process to minimize downtime, conduct root cause analysis, and implement preventive measures to avoid future occurrences. Foster a culture of continuous improvement by leveraging data from incidents and performance monitoring.
Collaboration & Cross-Functional Leadership: Partner with software engineering, and IT teams to integrate reliability, performance optimization, and cost-saving strategies throughout the development lifecycle. Act as a SME for SRE principles and advocate for best practices for assigned Projects.
Capacity Planning & Disaster Recovery: Execute capacity planning processes to support future growth, performance, and cost management. Maintain disaster recovery strategies to ensure system reliability and minimize downtime in the event of failures.
Basic Qualifications:
Master’s degree and 8 to 10 years of IT infrastructure, Site Reliability Engineering or related fields experience OR
Bachelor’s degree and 10 to 14 years of IT infrastructure, Site Reliability Engineering or related fields experience OR
Diploma and 14 to 18 years of IT infrastructure, Site Reliability Engineering or related fields experience.
Must-Have Skills:
Extensively experienced with AWS Cloud Services
Proficient in CI/CD (Jenkins/Gitlab), Observability, IAC, Gitops etc
Experience with containerization (Docker) and orchestration tools (Kubernetes) to optimize resource usage and improve scalability.
Identify and specify SRE tasks
Strong Hands-on SRE tasks and automate using Python/ Scripting language
Well Versed with FinOps, Infra-Ops, & Platform Operations.
Ability to learn new technologies quickly. Strong problem-solving and analytical skills. Excellent communication and teamwork skills.
Leadership skills are mandatory to lead a team of 4 to 5 to guide on Technical blockers
Good-to-Have Skills:
Knowledge of cloud-native technologies and strategies for cost optimization in multi-cloud environments.
Familiarity with distributed systems, databases, and large-scale system architectures.
Bachelor’s degree in computer science and engineering preferred, other Engineering field is considered
Databricks Knowledge/Exposure is good to have (need to upskill if hired)
Soft Skills:
Ability to foster a collaborative and innovative work environment.
Strong problem-solving abilities and attention to detail.
High degree of initiative and self-motivation.
EQUAL OPPORTUNITY STATEMENT
Amgen is an Equal Opportunity employer and will consider you without regard to your race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, or disability status.
We will ensure that individuals with disabilities are provided with reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request an accommodation
.Other Jobs from Amgen
Principal Platform Architect, MuleSoft
Sr. Engineer Drug Process Development Inspection
Test Automation Software Engineer - Copado Robotic Testing
Data Engineer
Principal Engineer - Thermal Engineering
Sr Associate IS Engineer - Visualization
Similar Jobs
Principal Engineer
Software Engineer - Backend
Automation Engineer Senior Analyst - HIH - Evernorth
Sr Staff Engineer - Java - REMOTE
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say