Join us as we work to create a thriving ecosystem that delivers accessible, high-quality, and sustainable healthcare for all.
We are looking for a Senior Site Reliability Engineer to join our Cloud Infrastructure Engineering & Operations division. Ultimately your work will focus on improving the performance and efficiency of our teams by joining our quest to continue building a world-class observability platform and contribute to the success of our business.
The Team:
The Logging, Metrics and Monitoring team is responsible for building and providing observability services and tools for engineering teams within the Cloud Engineering & Operations and Research & Development zones. Our services are highly visible and used every day to develop, monitor, troubleshoot and scale our web services. The team is responsible for collecting and hosting large volumes of metrics and log data; we do this by running large scale distributed, fault tolerant systems to collect and host all this data.
Our team has a big impact on productivity of hundreds of developers across athenaNation.
In a typical week, our engineers work on problems ranging from tuning performance and scaling services to debugging hard problems. We’re responsible for delivering new features and partnering with development teams to solve their pressing monitoring and logging issues. We work in an agile, sprint-based schedule running daily standups and work in both the private and public cloud
Job Responsibilities
· Automate the deployment of logging, metrics, and monitoring services through configuration management utilizing Puppet.
· Address and resolve production incidents by applying Linux administration and engineering expertise.
· Lead projects from inception to completion, including designing technical solutions, managing timelines, and executing deliverables.
· Design and implement metrics dashboards and alert criteria to effectively monitor and scale services.
· Participate in a week-long on-call rotation in collaboration with team members.
· Assist development teams in enhancing their logging and metrics collection processes.
· Demonstrate the ability to manage on-call rotations every few weeks.
Typical Qualifications
· Possess 5 to 8 years of prior experience in a production environment, exhibit strong system administration and DevOps skills for managing services within a Linux environment.
· Demonstrate hands-on experience with configuration management tools such as Puppet or Ansible.
· Strong experience troubleshooting production services in a Linux environment and participating in on-call rotations.
· Proficient in programming with experience writing and maintaining scripts in the following languages: Bash, Ruby, Python, Perl, C++, Java, and Golang.
· Experience developing Infrastructure as Code utilizing Terraform and CloudFormation.
· Display adaptability and flexibility in response to changing environmental and business demands.
Additional Qualifications
· Demonstrated experience in managing production server fleets at a scale of thousands.
· Subject matter expertise in relevant technologies, including FluentD, Kafka, Elasticsearch, Graphite, Clickhouse, Prometheus, Grafana, Graylog, Terraform, CloudFormation, Docker, Jenkins, and Git.
· Exposure to Amazon Web Services (AWS) for deploying, managing, and scaling applications, with a foundational understanding of AWS services, architecture, and best practices.
· Proficient in using protocol analyzers such as tcpdump and Wireshark.
About athenahealth
Here’s our vision: To create a thriving ecosystem that delivers accessible, high-quality, and sustainable healthcare for all.
What’s unique about our locations?
From an historic, 19th century arsenal to a converted, landmark power plant, all of athenahealth’s offices were carefully chosen to represent our innovative spirit and promote the most positive and productive work environment for our teams. Our 10 offices across the United States and India — plus numerous remote employees — all work to modernize the healthcare experience, together.
Our company culture might be our best feature.
We don't take ourselves too seriously. But our work? That’s another story. athenahealth develops and implements products and services that support US healthcare: It’s our chance to create healthier futures for ourselves, for our family and friends, for everyone.
Our vibrant and talented employees — or athenistas, as we call ourselves — spark the innovation and passion needed to accomplish our goal. We continue to expand our workforce with amazing people who bring diverse backgrounds, experiences, and perspectives at every level, and foster an environment where every athenista feels comfortable bringing their best selves to work.
Our size makes a difference, too: We are small enough that your individual contributions will stand out — but large enough to grow your career with our resources and established business stability.
Giving back is integral to our culture. Our athenaGives platform strives to support food security, expand access to high-quality healthcare for all, and support STEM education to develop providers and technologists who will provide access to high-quality healthcare for all in the future. As part of the evolution of athenahealth’s Corporate Social Responsibility (CSR) program, we’ve selected nonprofit partners that align with our purpose and let us foster long-term partnerships for charitable giving, employee volunteerism, insight sharing, collaboration, and cross-team engagement.
What can we do for you?
Along with health and financial benefits, athenistas enjoy perks specific to each location, including commuter support, employee assistance programs, tuition assistance, employee resource groups, and collaborative workspaces — some offices even welcome dogs.
In addition to our traditional benefits and perks, we sponsor events throughout the year, including book clubs, external speakers, and hackathons. And we provide athenistas with a company culture based on learning, the support of an engaged team, and an inclusive environment where all employees are valued.
We also encourage a better work-life balance for athenistas with our flexibility. While we know in-office collaboration is critical to our vision, we recognize that not all work needs to be done within an office environment, full-time. With consistent communication and digital collaboration tools, athenahealth enables employees to find a balance that feels fulfilling and productive for each individual situation.
Other Jobs from Athenahealth
Senior Engineering Manager, athenaCollector - AI Application Development
Software Engineer, athenaCollector – Collector Platform
Senior Member of Technical Staff(Java Backend + Infrastructure)
Engineering Manager
Senior Java Backend Developer - SMTS
Lead MLOps Engineer - LMTS
Similar Jobs
Site Reliability Engineer II - Real-Time
Site Reliability Engineer II - Real-Time
Site Reliability Engineer II - Real-Time
Site Reliability Engineer II - Real-Time
Software Development Engineer - US Federal
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say