Description
Thought Machine’s Site Reliability Engineers are the guardians of mission-critical systems for the world's most influential financial institutions. As a member of our elite, globally distributed team, you'll be entrusted with running and maintaining the robust production infrastructure that powers our customers' cutting-edge Core Banking and Payments platforms. This is an opportunity to make a tangible impact on the global financial landscape while collaborating with brilliant minds to solve complex engineering challenges. This role will be part of the Site Reliability Engineering team at Thought Machine HQ in London, tackling the challenges of automating complex fleet management operations, mentoring team members, promoting communities of best practice within engineering as well as designing operational processes that provide effective interfaces between Thought Machine and our Saas customers. The SRE team is deeply involved in tackling the technical challenges of executing Thought Machine’s growth ambitions - expect to be working with senior stakeholders in the organisation and with our customers, and working on programmes and initiatives that are critical to the success of the company.
Supporting the product engineering teams in building highly fault-tolerant, scalable applications by participating in design discussions, engaging in RFCs and code reviews
Executing various department strategies - contributing to the design and scoping work for team members around disaster recovery, backup, redundancy and capacity planning activities
Being part of a support rota responsible for resolving alerts generated by proactive monitoring and working closely with client-facing roles to provide L2 support for client-initiated support requests
Regular maintenance of production systems that host Vault products
Driving the evolution of our SaaS products by defining and designing features that foster exceptional reliability and an unparalleled user experience
Implementing and regularly testing DR strategies to ensure the highest level of resilience and fault tolerance of the platform
Maintain and promote high-quality written documentation of assets, processes and runbooks that are used by the team in their day-to-day operations
Working with your Manager in growing team members in their technical skills as well as their understanding of Vault Products
Requirements
You possess an up-to-date understanding of design patterns relevant to hosting and networking architectures
You proactively champion product development, driven by a desire to build truly exceptional products, not just solve immediate challenges
You’re a high-agency individual who can independently drive projects to completion by effectively scaling your individual output with the appropriate delegation of work to team members
You have a strong background working in either Python, Golang or Java, having used one of these programming languages to execute a significantly sized project or initiative
You have experience working with Kubernetes or other container orchestration systems
You have expertise in one or more of the following areas: Database Administration, Networking, Observability Tools (such as Prometheus, Jaeger) or automation infrastructure
You have extensive experience working with either GCP or AWS
Experience with automation/configuration management, e.g. Terraform, Puppet, Chef, Ansible
Benefits
- Highly competitive salary
- Pension plan (match up to 5%)
- Life insurance - three times annual salary
- Competitive maternity (six months fully paid) and paternity leave (four weeks fully paid)
- Shared parental leave (matched to our maternity leave for the same point in time)
- 25 days holiday and bank holidays
- Flexible working hours
- Cycle-to-work scheme
- Electric car scheme
- Season ticket loan
- Access to outstanding learning materials and courses
- Sports and hobby clubs, subsidised by Thought Machine
- All the latest tech you need
- Start the day properly with fresh fruit and cereals
- Huge range of healthy (and not-so-healthy) snacks, smoothies and drinks
- A talented and experienced team as your colleagues
- An environment where we encourage learning and progress
- Two charity days a year
- Weekly food pop-up
We actively hire candidates who demonstrate technical excellence in their field and welcome people of all ages and backgrounds, providing everyone with equal access to professional development. You are encouraged to apply even if your experience doesn't accurately match the job description. We also encourage applications from those with different abilities, including candidates with ADHD, autism, dyslexia or dyspraxia.

1 applies
12 views
Other Jobs from Thought Machine
Mid-Level Engineering Program Manager
Junior Engineering Program Manager
Engineering Program Manager
Cloud Support Engineer
Senior Software Engineer
Similar Jobs
Systems Development Engineer, Region Services
Software Engineer (Java/Python + DevOps CI/CD)
Loopnet - Lead DevOps Engineer
Site Reliability Engineer
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say