Gretel

Senior Software Engineer, Site Reliability

Remote
USD 180k - 230k
Kubernetes Go PyTorch TensorFlow API AWS Docker Python Terraform
Description

Who we are

At Gretel, our mission is to build the world’s first developer platform for synthetic data. Our platform solves the data bottleneck problem for developers, data scientists, and AI/ML researchers across multiple modalities including tabular, time-series, relational, language and image. Gretel's APIs automatically fine-tune AI models to generate synthetic data on-demand while protecting privacy and maintaining the utility and accuracy of the original data.

As a Senior or Staff Site Reliability Engineer (SRE) at Gretel, you will ensure the safety, security, and reliability of our cloud infrastructure. This includes our compute infrastructure, container orchestration platform, deployment pipelines, and observability stack.

What you will do

  • Build and maintain Gretel's observability stack. Measure and monitor Gretel's availability, latency, and overall system health

  • Scale systems sustainably with automation and continuously improve and evolve systems

  • Manage and lead incident response, recovery, and blameless postmortems

  • Partner with software engineers to troubleshoot production issues

  • Build tools and frameworks that help Gretel engineers be more productive

  • Ship complex ML/AI models in partnership with Gretel's applied science and engineering teams

Minimum Qualifications

  • Experience with at least one cloud platform (we use AWS heavily)

  • Experience with Docker and Kubernetes

  • Ability to write software and tools in Python or Go

  • Experience with monitoring, alerting and operations

  • Experience operating highly available distributed systems in the cloud

  • Experience identifying, diagnosing, and responding to operational outages

Preferred Qualifications

  • Experience with infrastructure as code (Terraform, CloudFormation, etc.)

  • Experience with build systems such as Bazel

  • Experience shipping applications with complex dependencies (PyTorch, TensorFlow)

  • Software engineering skills beyond script writing (TDD, design patterns, etc.)

  • Experience with DevOps or CI/CD pipelines

We think the best ideas come from the blending of diverse perspectives and experiences, which will lead to a stronger company and advancements in technologies. We hire individuals whose peers call them subject matter experts, whose curiosity draws them to new edges of their field and who like to laugh. We are deeply collaborative, apolitical and mission-oriented.

Gretel is an equal opportunity employer. Individuals seeking employment and employees at Gretel are considered without regard to race, color, religion, national origin, age, sex, gender, gender identity, gender expression, sexual orientation, marital status, medical condition, ancestry, disability, military or veteran status, or any other characteristic protected by applicable law. 

Accommodations: We celebrate diversity and are committed to creating an inclusive environment for all candidates and employees. If you need assistance or an accommodation due to a disability, please let your recruiter know.

Compensation

Employee compensation will be determined based on interview performance, level of experience, specialization of skills, and market rate. During the offer discussion, your recruiter will review the finalized base salary, bonus (for applicable roles), benefits and perks (additional information available on our career site), and stock options as they’ll be reflected in the offer letter. 

Employees hired in the U.S. and Canada can expect the below information to reflect a reasonable estimate of the salary offered for this role. Salary ranges are updated regularly using premium market data. (Please note: it is unusual for new hires to receive a base salary at the top of the range. Additionally, the value of Gretel.ai’s stock options is not included in the salary bands and may represent a significant portion of your compensation.)

Senior or Staff Site Reliability Engineer: $180,000-$230,000 USD

Information Technology Privacy Software
