Senior Site Reliability Engineer
Team: Software Engineering
Location: Lisbon, Madrid, Barcelona, London, Cape Town
Commitment: Full Time
Workplace Type: remote
🚀 What you will do
- In the short term we need to increase the resiliency and reliability of our current PaaS solution with things such as:
- Improving the maintainability of our infrastructure as code
- Building dashboards, monitoring & alerting mechanisms with Datadog
- Load testing and performance tuning our production services
- Lifecycling and maintenance of our Kubernetes clusters
- In the medium to long term you’ll get to:
- Implement new and shiny technologies on top of Kubernetes as you see fit to ensure our tech can scale with the business.
- Develop and integrate solutions with a bias for automation in order to improve and maintain reliability across the production estate and make recovery easier.
- Design and track metrics for site uptime and performance ensuring high levels of visibility are maintained.
- Own the deployment pipelines and continuously improve our monitoring and alerting capabilities.
- Collaborate closely with all other engineering functions to provide timely feedback from our environments.
- Support Engineering on their journey to deliver better software, faster and more safely (think “It’s OK to deploy on Fridays” 😎).
💻 What you will be working with
- Typescript
- Node.js
- TypeORM, TypeDI, TypeGraphQL and routing-controllers
- React and NextJS hosted on Vercel
- Google Cloud Platform
- Postgres
- Redis
- Bull, BullMQ
- DataDog
- ArgoCD
- Kubernetes
- GitHub
- Jest
🧑🚀 About You
- Strong systems administration skills, know the difference between a container and a virtual machine, and know your way around a Linux terminal
- Platform engineering/SRE experience at leading startups or fast growing tech companies
- Either experience with some of our tech stack or are confident you can cross train and up skill quickly
- Experience working in a regulated industry
- Confident working with and guiding developers on monitoring and logging of complex systems at scale
- Worked on complex projects
- Work collaboratively with different teams i.e. Security, Data, Engineering
- Want to forge and own MoonPays reliability & recovery processes
- Have at least a basic understanding of complex reliability structures, theories, principles, and best practices
- Worked with JavaScript codebases and frameworks e.g Typescript, Node.JS and React
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
