TripleTen for Business empowers companies to achieve their business goals by bridging talent gaps in Data Science, AI for professionals, Python Development, and Management.
Our transformative approach includes tailored training programs, informed by comprehensive pre-training assessments, ensuring precise alignment with client needs. With expert-led content and personalized mentoring, we help employees excel and achieve new levels of proficiency.
We are looking for a Senior Site Reliability Engineer. In this role, you will take ownership of ensuring service high availability*, documenting infrastructure details, and empowering developers through training and guidance on working with it*
- Develop infrastructure, and write solutions to simplify operations.
- Build processes to achieve and maintain 99.99% uptime, and improve the exercise process.
- Develop automation and service reliability, plan resources, and reduce ops in development.
- Build infrastructure and monitoring, help developers solve infrastructure problems, train developers to solve problems independently, and improve the observability of infrastructure, monitoring, schedules, and alerts.
- 2+ years of Site Reliability Experience.
- Experience working with Prometheus - must have.
- Experience working with Kubernetes, GitLab CI, and Ansible.
- Experience working with Unix systems (we have Ubuntu) and the console.
- Understanding the basics of TCP/IP to build networks, how web services work, REST API, and gRPC.
- Experience performing diagnostics, including interpreting the output of Ps, Top, Strace, Perf, and TCPDump.
- Understanding of how user applications interact with the operating system, including familiarity with system calls, processes, and threads.
- Willingness to build high-load systems and understanding of how to do that.
- Understanding of fault tolerance and service scaling.
- High degree of emotional intelligence, ability to find common ground with colleagues and work as part of a team.
- Must be professionally fluent in English
Nice to have:
- Experience working with AWS and Terraform.
- Experience programming in Python / Golang or desire to learn how.
- Full-time remote collaboration with a convenient schedule. Professional freedom, where we trust your experience instead of wasting each other's time and effort micromanaging;
- A diverse and tight-knit team. Our teammates are spread out across Serbia, the US, Israel, Georgia, Armenia, Latin America, and more. They’ve worked at all of big techs, ed-techs, design agencies, and cultural institutions;
- Comfortable digital workspace. We use Miro, Notion, Google Workspace, Jira, etc.— to make working together process seamless.
Similar Jobs
Principal Software Engineer
Staff Machine Learning Engineer
Staff Software Engineer, Machine Learning Infrastructure - Slack
Site Reliability Engineer 3 (TS&CG)
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say