Software Principal Engineer - SRE, Production Engineering
Location: India
Department: Engineering
About Boomi and What Makes Us Special
Are you ready to work at a fast-growing company where you can make a difference? Boomi aims to make the world a better place by connecting everyone to everything, anywhere. Our award-winning, intelligent integration and automation platform helps organizations power the future of business. At Boomi, you’ll work with world-class people and industry-leading technology. We hire trailblazers with an entrepreneurial spirit who can solve challenging problems, make a real impact, and want to be part of building something big. If this sounds like a good fit for you, check out boomi.com or visit our Boomi Careers page to learn more.
Boomi provides the foundation on which your business can evolve and innovate. According to a recent survey by Vanson Bourne, connected businesses are far outpacing their competitors. We help organizations connect everything and engage everywhere across any channel, device or platform. More than 18,000 organizations are using Boomi to run better, faster and smarter.
Working at Boomi means doing what you love. We hire trailblazers with an entrepreneurial spirit who can solve challenging problems, make a real impact in technology and want to build something big. If you are passionate about solving hard problems, enjoy working with world-class people and developing cutting edge technology, you should explore a career with Boomi. Learn more at http://www.boomi.com/ or visit Boomi Careers.
Join us as a Sr Site Reliability Engineer on our Reliability team to do the best work of your career and make a profound social impact.
What you’ll achieve
As a Senior Site Reliability Engineer,, you will be responsible for developing sophisticated systems and software based on the customer’s business goals, needs and general business environment. You will work with product management, other engineering teams, customer success and support on developing cutting edge new product features and enhancements across various areas of Boomi offerings.
You will:
- Participate actively in detecting, remediating and reporting on Production incidents, ensuring the SLAs/ SLOs are defined and met
- Participate in on-call rotation to ensure coverage for planned/unplanned events.
- Engage with other Engineering organizations to implement processes, identify improvements, and drive consistent results.
- Working with your SRE and Engineering counterparts for driving DR exercises, Game days, training and other response readiness efforts.
- Collaborate with Service Engineering organizations to build and automate tooling, implement best practices on Observability and manage the Boomi services in production and consistently achieve our market leading SLA.
- Improving the scalability and reliability of Boomi’s systems in production.
- Automate the provisioning and maintenance of Boomi’s infrastructure.
- Work independently with a minimal level of guidance from technical leadership
- Mentor other Boomi engineers, including design collaboration and code reviews
Take the first step towards your dream career with Boomi
Essential Requirements
Expert in defining, measuring, and improving Reliability Metrics (SLO/SLI/ Error budgets)
- Strong in implementing observability practices (Monitoring, Logging, Distributed Tracing etc.) preferably using New Relic and Splunk. Experience not limited to using the dashboards, but creating them from scratch.
- Passionate about SRD Automation and infrastructure platforms. Expert in developing Ansible playbooks and automation for Infrastructure as code using Terraform and Cloud Formation Templates and Python.
- Experience in conducting and automating DR exercise in AWS cloud thus validating RPOs and RTOs.
- Strong understanding and working experience with AWS components.
- Ability to design and implement API’s for use by internal teams.
Desirable Requirements
- 7+ years’ experience in the software engineering industry, with experience supporting large scale software systems in production.
- Experience actively in detecting, remediating and reporting on Production incidents, ensuring the SLAs/ SLOs are defined and metand participate in on-call rotation to ensure coverage for planned/unplanned events.
- Certified in Cloud (AWS/Azure/GCP/Oracle), experience in using services such as computers, containers and databases.
- Experience in Observability, creating dashboards for SLA/SLI/SLO
- Experience in Ansible/Terraform and Python.
- A grasp of Cloud Native concepts, containerization best practices and security awareness in Cloud will be a strong plus.
Be Bold. Be You. Be Boomi. We take pride in our culture and core values and are committed to being a place where everyone can be their true, authentic self. Our team members are our most valuable resources, and we look for and encourage diversity in backgrounds, thoughts, life experiences, knowledge, and capabilities.
All employment decisions are based on business needs, job requirements, and individual qualifications.
Boomi strives to create an inclusive and accessible environment for candidates and employees. If you need accommodation during the application or interview process, please submit a request to [email protected]. This inbox is strictly for accommodations, please do not send resumes or general inquiries.
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say