Astronomer designed Astro, an industry-leading data orchestration and observability platform for data teams. Powered by Airflow, Astro accelerates building reliable data products that unlock insights, unleash AI value, and drive data-driven applications.
We’re a globally-distributed and rapidly growing venture-backed team of learners, innovators and collaborators. Our mission is to empower data teams to bring mission-critical analytics, AI, and software to life. As a member of our team, you will be at the forefront of the industry as we strive to deliver the world's data.
Your background may be unconventional; as long as you have the essential qualifications, we encourage you to apply. While having "bonus" qualifications makes for a strong candidate, Astronomer values diverse experiences. Many of us at Astronomer haven't followed traditional career paths, and we welcome it if yours hasn't either.
About this role
This role is well suited to candidates early in their careers.
The Astronomer Customer Reliability Engineering (CRE) team is responsible for the success of our customers' usage of our managed Airflow service.
The CREs are responsible for operating, monitoring, and maintaining the platform to ensure availability, predictability, and reliable operations.
As an infrastructure specialist within the team, you will focus on the reliability of the underlying cloud infrastructure and Kubernetes clusters. This entails responding to incidents raised either by a customer or by our monitoring system, then taking further steps to ensure problems are permanently resolved or monitored. As owners of the observability platform, CRE has unlimited potential to improve the reliability of the product and deliver the best possible outcome for our customers.
This role is directly customer-facing and gives exposure to very diverse problems and requirements. CREs get the opportunity to interface with customers from a variety of industries and across different cloud providers, all with different expectations. Your contributions will directly impact customers' success with the Astronomer products, and you will be able to help make meaningful improvements to the customer experience.
What you get to do:
Provide solutions to customers to make them successful using our products.
Troubleshoot customer environments and engage in active triaging with customers.
Participate in the on-call rotation for weekend coverage.
Provide feedback to the product development teams on customer needs and pain points.
Build out our monitoring and alerting systems.
Build and maintain automation to ensure daily operational tasks are handled as efficiently as possible.
Help direct the architecture of the products and contribute where possible.
Own the customer experience, working directly with customers to prioritize and solve issues, meet SLAs, and provide “white glove” guidance on the path to production.
Participate remotely within a fully distributed team.
Enhance and enrich customer documentation.
Work with the latest technology and multi-cloud implementations.
What you bring to the role:
3-4 years of experience, preferably with large, complex SaaS infrastructures operating at scale
About 2 years of experience with Kubernetes
Experience managing a production distributed system with at least one major cloud provider (AWS, GCP, Azure)
Good networking experience with one of the major clouds
Good Linux experience
Knowledge of how to operate and monitor issues for distributed systems
Experience with observability tools
Previous experience handling customer issues (internal and external)
Good communication skills
DevOps or CI/CD experience
Python scripting
Good troubleshooting skills
Bonus points if you have:
Experience as a Site Reliability Engineer
Worked with Kubernetes Custom Resources
Depth of knowledge with Azure
Airflow/Big Data Orchestration experience
IaC experience
At Astronomer, we value diversity. We are an equal opportunity employer: we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. Astronomer is a remote-first company.