Astronomer designed Astro, an industry-leading data orchestration and observability platform for data teams. Powered by Airflow, Astro accelerates building reliable data products that unlock insights, unleash AI value, and drive data-driven applications.
We’re a globally-distributed and rapidly growing venture-backed team of learners, innovators and collaborators. Our mission is to empower data teams to bring mission-critical analytics, AI, and software to life. As a member of our team, you will be at the forefront of the industry as we strive to deliver the world's data.
Your background may be unconventional; as long as you have the essential qualifications, we encourage you to apply. While having "bonus" qualifications makes for a strong candidate, Astronomer values diverse experiences. Many of us at Astronomer haven't followed traditional career paths, and we welcome it if yours hasn't either.
About this role
At Astronomer, our R&D team is dedicated to providing an exceptional experience in managing Apache Airflow at scale. As a leading player in the industry, we welcome an experienced Software Engineer to work on the infrastructure team of our flagship Enterprise product, Astronomer Software.
Your goal will be to enhance scalability, performance, and reliability while minimizing operational overhead by leveraging your deep understanding of container orchestration (Kubernetes) and cloud platforms (AWS, Azure, GCP, Openshift, etc.); you will streamline our infrastructure to support seamless on-premise installations.
You will collaborate closely with cross-functional teams, including CRE, Platform, and QA, to drive continuous improvement initiatives. Your technical guidance and support will enable teams to adopt best practices and implement efficient infrastructure solutions.
Upholding the highest standards of security and compliance, you will implement robust measures to protect our infrastructure and customer data. Your proactive approach to security will ensure that Astronomer Software remains resilient against potential threats.
Utilizing monitoring tools and performance metrics such as ELK and Prometheus, you will identify areas for optimization and implement strategies to enhance system performance and resource utilization for a customer's on-premise installation.
What you get to do:
Serve as a primary point who is responsible for the overall health, performance, and capacity of our platform.
Assist in the roll-out and deployment of new product features and installations to facilitate our rapid iteration and growth.
Develop tools to improve our ability to rapidly deploy and effectively monitor applications in a large-scale environment.
Work closely with development teams to ensure the platform is designed with operability in mind.
Identify and lead efforts to improve automation.
Perform root cause analysis and document results in the form of post-mortems.
Write and maintain documentation around key systems and processes.
Participate in an on-call rotation with some of our customers.
Function well in a fast-paced, rapidly changing environment.
What you bring to the role:
5 years of hands-on experience operating Kubernetes clusters in a production environment.
Experience in managing and scaling distributed systems in one of the three major cloud providers (AWS, Azure, GCP).
Strong experience with at least one Continuous Integration system, such as CircleCI or Jenkins.
Understanding of the Linux Operating System, standard networking protocols, and components.
Experience with deploying, supporting, and monitoring new and existing services, platforms, and application stacks.
Automation/Scripting experience with Shell, Python, or similar.
Familiarity with Infrastructure as Code (IaC) tools (Terraform, Cloudformation, etc.).
Strong troubleshooting and problem-solving skills.
Bonus points if you have:
Experience with scale testing, disaster recovery, and capacity planning.
Experience with at least one of the following languages: NodeJS, Go.
Familiarity with Apache Airflow.
Experience with Openshift and the Red Hat marketplace.
Experience with the Prometheus/Grafana and ELK stacks.
At Astronomer, we value diversity. We are an equal opportunity employer: we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. Astronomer is a remote-first company.
0 applies
17 views
Other Jobs from Astronomer
Senior Software Engineer, Data Ingestion
Senior Software Engineer, Data Ingestion
Product Manager, Software
Senior Software Engineer, Observability
Senior Software Engineer, Observability
Senior Developer Advocate
Similar Jobs
Staff Software Engineer (Node+React)
Senior Staff Software Engineer
Senior Software Engineer
Staff Software Engineer
Senior Cloud Engineer
Software Engineer 2- AI Full Stack Development
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 401 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say