SRE Engineer
Location: Tel Aviv/ Netanya, Israel
Department: R&D
At JFrog, we’re reinventing DevOps to help the world’s greatest companies innovate -- and we want you along for the ride. This is a special place with a unique combination of brilliance, spirit and just all-around great people. Here, if you’re willing to do more, your career can take off. And since software plays a central role in everyone’s lives, you’ll be part of an important mission. Thousands of customers, including the majority of the Fortune 100, trust JFrog to manage, accelerate, and secure their software delivery from code to production -- a concept we call “liquid software.” Wouldn't it be amazing if you could join us on our journey?
We are looking for a Site Reliability Engineer to join our SaaS Production team and help us ensure high availability, performance, and reliability across our global cloud environments.
As a Site Reliability Engineer in JFrog you will…
- Support the operation and reliability of JFrog’s large-scale, multi-cloud, Kubernetes-based SaaS environments
- Troubleshoot complex production issues across distributed systems and work closely with Engineering and Cloud teams to resolve them
- Contribute to improving system reliability, performance, scalability, and observability
- Apply SRE best practices, including incident response, service monitoring, capacity considerations, and continuous reliability improvements
- Participate in on-call rotations and take part in incident investigations and postmortems
- Build and enhance automation tools (primarily in Python or Go) to reduce operational toil and improve efficiency
- Assist in improving CI/CD workflows and deployment safety
- Design and develop AI-based tools and automation to improve operational efficiency and productivity for JFrog’s internal engineering and SaaS teams
- Support resilience initiatives, including disaster recovery validation and service readiness improvements
- Continuously learn and explore new technologies that improve operational excellence
To be a Site Reliability Engineer in JFrog you need…
- 1-3 years of experience in SRE, DevOps, Production Engineering, or a similar role in a production environment
- Hands-on experience operating Kubernetes-based containerized workloads in production
- Experience with at least one public cloud provider (AWS, GCP, or Azure)
- Strong troubleshooting and analytical skills with the ability to debug production issues methodically
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
