Description

Join us as we work to create a thriving ecosystem that delivers accessible, high-quality, and sustainable healthcare for all.

athenahealth is a progressive, innovation-driven software product company dedicated to transforming healthcare through cutting-edge cloud solutions. We partner with healthcare organizations to improve clinical and financial outcomes by building modern technology on an open, connected ecosystem that drives meaningful insights for our customers and their patients. We take pride in our values-driven culture, offering a flexible work-life balance and fostering an environment of innovation. As a testament to our industry leadership and rapid growth, we were acquired by Bain Capital for $17B in 2021, and we continue to launch new strategic product initiatives to push the boundaries of healthcare technology.

We are headquartered in Boston, US, and our India offices are in Bangalore, Chennai, and Pune.

Position Summary: We are looking for a Lead Site Reliability Engineer - LMTS to join our Cloud Infrastructure Engineering division in Bangalore. Cloud Infrastructure Engineering ensures the continuous availability of the technologies and systems that are the foundation of athenahealth’s services. We are directly responsible for thousands of servers, petabytes of storage, and thousands of web requests per second, all while sustaining growth at a meteoric rate. We enable an operating system for the medical office that abstracts away administrative complexity, leaving doctors free to practice medicine.

About You: You are a seasoned engineer with a passion for identifying and resolving reliability and scalability challenges. You are a curious team player, someone who loves to explore, learn, and make things better. You are excited to uncover inefficiencies in business processes, creative in finding ways to automate solutions, and relentless in your pursuit of greatness. You are a nimble learner capable of quickly absorbing complex solutions and an excellent communicator who can help evangelize engineering excellence.

The Team: We are a team of Site Reliability Engineers who are passionate about reliability, automation, and scalability. We use an agile-based framework to execute our work, ensuring we are always focused on the most important and impactful needs of the business. We support systems in both private and public cloud environments and make data-driven decisions about which solutions best suit the needs of the business. We are relentless in automating away manual, repetitive work so we can focus on projects that help move the business forward.

Job Responsibilities

Cloud Infrastructure Leadership:

Lead the design, implementation, and maintenance of scalable and highly available cloud infrastructure using public cloud platforms (AWS).
Ensure the cloud infrastructure is resilient, fault-tolerant, and capable of supporting large-scale applications and services.
Provide technical guidance and leadership for cloud infrastructure projects, helping to drive the infrastructure strategy forward.
Apply a strong understanding of hybrid cloud setup, operations, and scaling.

Reliability and Availability:

Define, measure, and maintain Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for cloud services and infrastructure components.
Lead efforts to continuously improve system availability, fault tolerance, and disaster recovery capabilities.
Ensure proactive incident detection, efficient root cause analysis, and timely resolution of production incidents.
Participate in a 24x7 on-call rotation.

Automation and Infrastructure as Code (IaC):

Drive automation efforts to reduce manual intervention and streamline cloud infrastructure management.
Implement Infrastructure as Code (IaC) using tools like Terraform, AWS CloudFormation, and Ansible to provision, manage, and scale cloud resources.
Automate deployment, scaling, and monitoring processes to improve efficiency and reduce operational complexity.

Monitoring, Observability, and Performance Tuning:

Design and implement monitoring, logging, and alerting solutions to track cloud infrastructure health, performance, and security.
Use observability tools (e.g., Prometheus, Grafana, Amazon CloudWatch) to ensure continuous visibility into cloud infrastructure performance and capacity.
Identify bottlenecks and performance issues, proposing and implementing improvements to ensure optimal resource usage.

Security and Compliance:

Ensure that cloud infrastructure is built with security best practices in mind and meets all relevant compliance and regulatory requirements.
Collaborate with security teams to implement security controls and risk mitigation strategies across cloud environments.
Regularly audit and review cloud infrastructure for security vulnerabilities and compliance gaps.

Cost Optimization:

Optimize cloud resource usage and reduce costs without compromising performance or reliability.
Monitor cloud service usage and recommend strategies for optimizing cloud infrastructure spending.
Implement cost-tracking tools and reporting mechanisms to ensure the business remains within budget for cloud infrastructure.

Collaboration and Cross-Functional Leadership:

Work closely with development, DevOps, and operations teams to ensure cloud infrastructure aligns with application and business requirements.
Lead and mentor a team of Site Reliability Engineers, promoting best practices and fostering a culture of operational excellence.
Act as a key technical point of contact for cloud-related infrastructure and operations issues.

Incident Management and Post-Mortem:

Lead the incident response efforts for cloud infrastructure-related issues, ensuring that all incidents are managed effectively.
Conduct post-incident reviews (PIRs) to identify root causes and implement preventive measures.
Continuously refine incident management processes to reduce downtime and shorten recovery times.

Qualifications:

8-10 years of hands-on experience with cloud automation and configuration management tools (e.g., Terraform, AWS CloudFormation, Ansible) in a hybrid cloud setup.
7+ years of experience in a Site Reliability Engineering (SRE), Infrastructure Engineering, or DevOps role, with at least 3 years in a technical leadership capacity.
Deep knowledge of cloud services and technologies (e.g., EC2, S3, Lambda, Kubernetes).
Proficiency in scripting or programming languages (e.g., Python, Go, Bash).
Experience with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, Datadog, ELK stack).
Familiarity with Continuous Integration/Continuous Deployment (CI/CD) pipelines and cloud-native development practices.
Strong expertise in managing cloud infrastructure (AWS, Google Cloud, Azure) in production environments.
Experience with cloud-native architectures, microservices, and containerized environments (Kubernetes, Docker).
Proven experience in building and managing highly available, scalable, and fault-tolerant systems in the cloud.
Strong understanding of cloud networking, storage, compute services, on-premises infrastructure, and security best practices.

Behaviors & Abilities Required:

Strong technical leadership and mentoring abilities, with a track record of developing high-performance engineering teams.
Excellent problem-solving, troubleshooting, and diagnostic skills.
Ability to work in a cross-functional, collaborative environment.
Effective communication skills, with the ability to translate technical concepts to non-technical stakeholders.

About athenahealth

Here’s our vision: To create a thriving ecosystem that delivers accessible, high-quality, and sustainable healthcare for all. 

What’s unique about our locations?
From a historic 19th-century arsenal to a converted landmark power plant, all of athenahealth’s offices were carefully chosen to represent our innovative spirit and promote the most positive and productive work environment for our teams. Our 10 offices across the United States and India, plus numerous remote employees, all work together to modernize the healthcare experience.

Our company culture might be our best feature.
We don't take ourselves too seriously. But our work? That’s another story. athenahealth develops and implements products and services that support US healthcare: It’s our chance to create healthier futures for ourselves, for our family and friends, for everyone.

Our vibrant and talented employees — or athenistas, as we call ourselves — spark the innovation and passion needed to accomplish our goal. We continue to expand our workforce with amazing people who bring diverse backgrounds, experiences, and perspectives at every level, and foster an environment where every athenista feels comfortable bringing their best selves to work.

Our size makes a difference, too: We are small enough that your individual contributions will stand out — but large enough to grow your career with our resources and established business stability.

Giving back is integral to our culture. Our athenaGives platform strives to support food security, expand access to high-quality healthcare for all, and support STEM education to develop providers and technologists who will provide access to high-quality healthcare for all in the future. As part of the evolution of athenahealth’s Corporate Social Responsibility (CSR) program, we’ve selected nonprofit partners that align with our purpose and let us foster long-term partnerships for charitable giving, employee volunteerism, insight sharing, collaboration, and cross-team engagement.  

What can we do for you?
Along with health and financial benefits, athenistas enjoy perks specific to each location, including commuter support, employee assistance programs, tuition assistance, employee resource groups, and collaborative workspaces — some offices even welcome dogs.

In addition to our traditional benefits and perks, we sponsor events throughout the year, including book clubs, external speakers, and hackathons. And we provide athenistas with a company culture based on learning, the support of an engaged team, and an inclusive environment where all employees are valued.

We also encourage a better work-life balance for athenistas with our flexibility. While we know in-office collaboration is critical to our vision, we recognize that not all work needs to be done within an office environment, full-time. With consistent communication and digital collaboration tools, athenahealth enables employees to find a balance that feels fulfilling and productive for each individual situation.
