American Express

Senior Infrastructure Engineer I

London, UK (Remote, Hybrid)
Kubernetes, Python, Bash, AWS, Azure, GCP, Terraform, Docker
Description

You Lead the Way. We’ve Got Your Back.

With the right backing, people and businesses have the power to progress in incredible ways. When you join Team Amex, you become part of a global and diverse community of colleagues with an unwavering commitment to back our customers, communities and each other. Here, you’ll learn and grow as we help you create a career journey that’s unique and meaningful to you with benefits, programs, and flexibility that support you personally and professionally.

At American Express, you’ll be recognized for your contributions, leadership, and impact—every colleague has the opportunity to share in the company’s success. Together, we’ll win as a team, striving to uphold our company values and powerful backing promise to provide the world’s best customer experience every day. And we’ll do it with the utmost integrity, and in an environment where everyone is seen, heard and feels like they belong.

Join Team Amex and let's lead the way together.

Overview 

We are seeking a versatile and highly skilled Full Stack Infrastructure Engineer with expertise in Compute, Storage, Network and Cloud technologies. The ideal candidate will design, implement, and manage robust infrastructure solutions, ensuring reliability, scalability, and performance. 

Key Responsibilities:

  • Ensure the reliability, availability, and performance of the entire infrastructure stack including compute, storage, network and cloud components.
  • Lead incident response efforts across the infrastructure stack, coordinating with Application Support, SRE, and Engineering teams to minimize MTTD and MTTR.
  • Perform root cause analysis for infrastructure-related incidents and implement corrective actions.
  • Develop and maintain automation tools for managing infrastructure resources.
  • Collaborate with Engineering teams to plan and execute system upgrades and maintenance.
  • Conduct capacity planning and resource management for all infrastructure components.
  • Participate in on-call rotations to provide 24x7 support for all critical infrastructure issues.
  • Design and implement disaster recovery plans and business continuity strategies.
  • Implement best practices for monitoring, logging, and alerting across the infrastructure.
  • Foster a culture of continuous improvement and operational excellence.
  • Analyze complex infrastructure problems, design scalable and resilient solutions, and lead the implementation of these solutions.
  • Collaborate with architects and other engineers to design and enhance the architecture of infrastructure systems, ensuring alignment with business needs and technology standards.

Required Skills and Experience:

  • Proven experience managing and optimizing a diverse infrastructure stack.
  • Extensive knowledge of cloud platforms (AWS, Azure, GCP) and infrastructure as code (Terraform, CloudFormation).
  • Familiarity with service mesh technologies (Istio, Linkerd).
  • Solid understanding of virtualization (VMware, Hyper-V), containerization (Docker), and container orchestration (Kubernetes).
  • Understanding of storage solutions (SAN, NAS, cloud storage) and backup systems.
  • Strong understanding of network protocols, routing, switching, and firewalls.
  • Experience with load balancers (F5, HAProxy, Nginx) and network monitoring tools.
  • Experience in DNS management and troubleshooting.
  • Experience in network security best practices.
  • Proficiency in monitoring and observability tools (Prometheus, Grafana, Splunk).
  • Proficiency in at least one scripting language (Python, Bash) for automation.
  • Experience with CI/CD pipeline management and DevOps practices.
  • Strong understanding of disaster recovery and business continuity planning.
  • Experience with performance tuning and capacity planning.
  • Understanding of chaos engineering principles and practices.
  • Skills in cost optimization for cloud infrastructure.

Specific Tools and Techniques:

  • Experience using cloud-native monitoring tools such as AWS CloudWatch, Azure Monitor, and Google Cloud Operations Suite.
  • Experience with packet capture tools like Wireshark for troubleshooting network issues.
  • Experience in using traceroute utilities and performance analysis tools like perf for identifying and resolving bottlenecks.
  • Familiarity with tools such as ipconfig/ifconfig for viewing network configurations, flushing DNS, and diagnosing network issues.
  • Experience with SNMP-based tools for network device monitoring and performance management.
  • Experience in using NetFlow for network traffic analysis.
  • Experience with tools like iostat, vmstat, and dstat for monitoring storage and system performance.
  • Experience in tools like df, du, lsblk, and fdisk for managing and troubleshooting file systems and disk partitions.
  • Familiarity with tools like Prometheus and Grafana for monitoring and observability. 

We back our colleagues and their loved ones with benefits and programs that support their holistic well-being. That means we prioritize their physical, financial, and mental health through each stage of life. Benefits include:

  • Competitive base salaries 
  • Bonus incentives 
  • Support for financial well-being and retirement
  • Comprehensive medical, dental, vision, life insurance, and disability benefits (depending on location) 
  • Flexible working model with hybrid, onsite or virtual arrangements depending on role and business need 
  • Generous paid parental leave policies (depending on your location) 
  • Free access to global on-site wellness centers staffed with nurses and doctors (depending on location) 
  • Free and confidential counseling support through our Healthy Minds program 
  • Career development and training opportunities

Offer of employment with American Express is conditioned upon the successful completion of a background verification check, subject to applicable laws and regulations.
