Staff DevOps Engineer
Team: Software
Location: San Francisco, California
Commitment: Full-time
Workplace Type: Hybrid
Salary:
Role Description
Join Terawatt’s innovative technology team as a DevOps Software Engineer and help drive the evolution of our platform to support one of the leading fleet EV charging solutions in North America. In this role, you will contribute to the development and reliability of Terawatt’s charging network management system and the capabilities that enable seamless charging and a high-quality site experience for customers.
Working closely with the Director of Software, the Product team, and other technical partners, you will play a key role in expanding our platform and delivering meaningful value in a rapidly evolving industry. This role will help scale our cloud infrastructure and support organizational growth by implementing best practices that improve the reliability, performance, and maintainability of our charging infrastructure and software systems.
If you are a DevOps or Cloud Infrastructure Software Engineer who is motivated to help build scalable, impactful technology with a collaborative and innovative team, we encourage you to apply. Your work at Terawatt will contribute directly to advancing the sustainable transportation infrastructure of the future.
Core Responsibilities
- Lead and architect the evolution of our cloud infrastructure using Terraform, building resilient and scalable systems to support business growth.
- Maintain Helm charts and deployment patterns that enable teams to manage the lifecycle of their services while adhering to established deployment standards.
- Build tooling to enable engineering teams to own the application deployment process through CI/CD pipelines using GitHub Actions.
- Promote security best practices across all layers of the stack, including software access, managed workloads, and services running in pre-production and production environments.
- Strengthen cloud and network security using industry-standard tools to detect vulnerabilities and anomalies, and help prevent suspicious or malicious activity.
- Advance observability practices using frameworks such as OpenTelemetry (OTel) and tools like Grafana Cloud for monitoring and alerting across services and infrastructure.
- Develop tooling that supports both local and remote container-based cloud development workflows.
- Create and automate simulated production scenarios used for testing during development and validating production releases.
- Implement automation and alerting to maintain security and compliance standards, including SOC 2 controls.
- Design and manage infrastructure that supports machine learning model training and deployment, ensuring scalable compute resources for ML workloads.
- Partner with the Data team to manage core data infrastructure, including our Databricks data lake and Kafka event streams (Aiven/AWS), while advising on scalable data architecture and infrastructure improvements.
- Contribute to building a highly available, web-based depot operations platform that supports the future of EV charging using NodeJS.
- Participate in a 24/7 on-call rotation to support the reliability of production systems.
Minimum Qualifications
- 8+ years of experience building and operating high-availability production software systems, preferably in DevOps or platform engineering teams.
- Experience building and maintaining scalable cloud-based infrastructure, including services running in managed Kubernetes (EKS).
- Experience building or maintaining CI/CD pipelines (e.g., GitHub Actions) to support reliable software delivery.
- Experience leading or contributing to SRE or DevOps initiatives supporting production cloud platforms.
- Experience with observability frameworks and tools (e.g., OpenTelemetry, Grafana, or similar platforms).
- Experience working with managed databases such as PostgreSQL, MongoDB, or similar systems.
- Strong communication skills and the ability to collaborate effectively across engineering, product, and infrastructure teams.
Preferred Qualifications
- Experience working with multi-region AWS infrastructure and Kubernetes (EKS) at scale.
- Experience improving security and compliance practices through automation and internal tooling.
- Experience implementing or scaling observability standards using OpenTelemetry and tools like Grafana Cloud.
- Experience maintaining or scaling data infrastructure, such as Databricks, Kafka (MSK), or similar streaming/data platforms.
- Proficiency in Python or NodeJS.
