Stack AV

Site Reliability Engineer, Site Reliability Engineering

Remote Pittsburgh, PA
Kubernetes
Search for More Jobs Talk to a recruiter now 💪
Description

About

With customers at its core, Stack AV is focused on revolutionizing the way businesses transport goods, designing solutions to alleviate long-standing issues that have plagued the trucking industry including driver shortages, lagging efficiency in uptime per vehicle, overarching safety concerns, high operating costs, and elevated emission levels. By building safe and efficient autonomous trucking solutions, Stack AV is creating better and smarter supply chains for its partners, improving business outcomes for its customers, delivering goods to end-users faster, and ultimately moving the trucking industry forward.

What We're Looking for:

We are looking for people who are passionate about delivering self-driving (L4) products that make the way we move safer, faster and more efficient. We seek mission-driven, highly skilled people with deep experience in fast paced, rapidly growing, tech development environments.

Stack AV Site Reliability Engineers are responsible for enabling and ensuring our production systems meet their service-level objectives. Through the implementation of centralized observability and automation, the SRE team constantly ensures the health, reliability, scalability, and performance of Stack AV’s infrastructure. Members of the team are expected to contribute to a culture of continuous learning, provide consultation on architecting for high-availability, and ultimately drive the uptime and performance of our systems.

What Success Looks Like:

  • Experience building a centralized observability stack capable of scaling to meet the needs of engineering teams across the business. Focus on leveraging the system to increase debuggability and ensure maximum uptime of mission-critical production services.
  • A deep understanding of design tradeoffs and ability to articulate those tradeoffs in order to help teams build systems that meet their SLOs.
  • Experience implementing and debugging cloud native systems such as Kubernetes, etcd, and Prometheus across hybrid cloud environments in support of running highly-available workloads. 
  • Desire to build a culture of blameless postmortems and continuous learning through the implementation of a standard incident management framework and process. 
  • Experience building observability and alerting for hardware systems within private cloud environments.
  • Fundamental understanding of Linux OS internals, TCP/IP networking stack, and storage systems with a knack for curiosity and methodical approach to debugging issues.
  • Experience working in a diverse and distributed team, with a focus on communication and customer empathy. Desire to work cross-functionally within Stack to achieve required performance and reliability within budget.

 

We are proud to be an equal opportunity workplace. We believe that diverse teams produce the best ideas and outcomes. We are committed to building a culture of inclusion, entrepreneurship, and innovation across gender, race, age, sexual orientation, religion, disability, and identity.

Check out our Privacy Policy.

Please Note: Pursuant to its business activities and use of technology, Stack AV complies with all applicable U.S. national security laws, regulations, and administrative requirements, which can restrict Stack AV’s ability to employ certain persons in certain positions pursuant to a range of national security-related requirements. As such, this position may be contingent upon Stack AV verifying a candidate’s residence, U.S. person status, and/or citizenship status. This position may also involve working with software and technologies subject to U.S. export control regulations. Under these regulations, it may be necessary for Stack AV to obtain a U.S. government export license prior to releasing its technologies to certain persons. If Stack AV determines that a candidate’s residence, U.S. person status, and/or citizenship status will require a license, prohibit the candidate from working in this position, or otherwise be subject to national security-related restrictions, Stack AV expressly reserves the right to either consider the candidate for a different position that is not subject to such restrictions, on whatever terms and conditions Stack AV shall establish in its sole discretion, or, in the alternative, decline to move forward with the candidate’s application.

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 307 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

Cancel anytime / Money-back guarantee

Wall of love from fellow engineers