Barti

Senior Site Reliability Engineer

Remote
USD 150k - 200k
Python Go Bash GCP Kubernetes Docker Terraform CloudFormation Linux Prometheus Grafana Datadog PostgreSQL MySQL GitHub Actions CircleCI GitLab CI
Description

Senior Site Reliability Engineer

Department: Product & Engineering

Location: United States

Compensation: $150K – $200K

Employment Type: FullTime

About Barti

Barti is a venture-backed startup on a mission to build the future of eye care. We’re building groundbreaking, AI-powered software that transforms how practices document care, run their operations, and serve patients. Our goal is to craft exceptional user experiences that let eye care providers stay focused on delivering high-quality care.

We recently raised our Series A and are growing quickly as hundreds of practices adopt Barti to replace outdated legacy systems. It’s an exciting time to join as we expand into new parts of eye care and continue shaping the future of practice management.

About the Role

We are looking for a seasoned Senior Site Reliability Engineer to join our dynamic team in a foundational role, owning reliability and infrastructure as our first SRE. This role will involve ensuring the reliability, scalability, and performance of our production systems, leading infrastructure initiatives, and mentoring engineers on best practices. The ideal candidate will have a strong technical background in both software engineering and systems operations, demonstrate excellent problem-solving skills, and have a passion for building resilient, automated systems.

Responsibilities

System Reliability & Performance:

  • Lead and participate in the design, implementation, and maintenance of highly available and scalable infrastructure.

  • Monitor system health, performance metrics, and capacity planning to ensure optimal performance.

  • Establish and track SLIs, SLOs, and error budgets to measure and improve system reliability.

Infrastructure & Automation:

  • Design and implement Infrastructure as Code (IaC) solutions using tools like Terraform, Pulumi, or CloudFormation.

  • Build and maintain CI/CD pipelines to enable rapid, safe deployments.

  • Automate operational tasks and eliminate toil through scripting and tooling.

Incident Management:

  • Lead incident response efforts, including on-call rotation, post-mortem analysis, and remediation.

  • Debug and resolve complex production issues across the entire stack.

  • Implement monitoring, alerting, and observability solutions to detect and prevent issues proactively.

Technical Leadership:

  • Provide technical leadership and mentorship to engineers on reliability and infrastructure best practices.

  • Collaborate with cross-functional teams, including Engineering and Product to ensure reliable product delivery.

  • Lead the technical design of infrastructure solutions, ensuring alignment with architectural principles and business goals.

Continuous Improvement:

  • Stay updated with emerging technologies and industry trends in SRE, DevOps, and cloud infrastructure.

  • Propose and drive the adoption of best practices, tools, and processes to enhance system reliability and developer productivity.

  • Conduct chaos engineering experiments and disaster recovery drills to validate system resilience.

Security & Compliance:

  • Implement and maintain security best practices across infrastructure and applications.

  • Manage secrets, access controls, and security monitoring systems.

Collaboration & Communication:

  • Foster a collaborative environment within the engineering team and across departments.

  • Clearly communicate technical concepts and system health to both technical and non-technical stakeholders.

  • Work closely with engineering teams to define reliability requirements and ensure operational excellence.

Minimum Qualifications

  • 5+ years (ideally 7+) of relevant work experience in Site Reliability Engineering, DevOps, or Infrastructure roles

  • 1+ years of hands-on experience with either Python, Go, or Bash scripting

  • Experience with cloud platforms (ideally GCP) and container orchestration (Kubernetes, Docker)

  • Proficiency with Infrastructure as Code tools (Terraform, CloudFormation, or similar)

  • Strong understanding of Linux systems, networking, and distributed systems

  • Experience with monitoring and observability tools (Prometheus, Grafana, Datadog, or similar)

  • Excellent problem-solving and communication skills

  • Able to work independently and as part of a team

Preferred Qualifications

  • Background in healthcare technology or regulated industries

  • Experience with GCP, Cloud SQL, and Google Kubernetes Engine (GKE)

  • HIPAA compliance and security best practices experience

  • Experience with relational databases (Postgres, MySQL) performance tuning and high availability

  • Proficiency with CI/CD tools (GitHub Actions, CircleCI, GitLab CI)

  • Familiarity with APM tools and distributed tracing

Perks and Benefits

  • Be part of a mission-driven, rapidly scaling company changing the future of eye care

  • Work remotely from anywhere in the U.S.

  • Collaborate with a passionate, fun, and supportive team

  • Competitive salary - $150,000 - $200,000

  • Equity in a fast-growing startup

  • Health, vision, and dental benefits

  • Unlimited PTO

  • Annual professional development stipend

  • A high-impact role with plenty of room for growth, ownership, and creativity

We are an equal opportunity employer. We value a diverse workforce and an inclusive culture. We encourage applications from all qualified individuals without regard to race, color, religion, gender, sexual orientation, gender identity or expression, age, national origin, marital status, disability, and veteran status.

Barti
Barti

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say