Hazelcast

Senior Site Reliability Engineer

Remote United Kingdom
Kubernetes Microservices AWS Terraform Keycloak Prometheus Grafana OpenTelemetry ArgoCD Helm Golang Python Jenkins GitHub Actions GCP Azure Java
Description

Senior Cloud SRE

Department: Software Engineering

Employment Type: Permanent - Full Time

Location: Remote, UK, Remote

We are looking for an SRE, experienced in distributed systems, Kubernetes & microservices to join our Applications team. The team focuses on providing tooling to enrich the core Hazelcast Platform, making it easier to use, scale and provide greater functionality. Ensuring solutions to meet the most demanding customer needs.

Day to day, you’ll be leveraging your solid engineering fundamentals with a focus on performance, consistency, resilience and scale, bringing your passion for solving difficult problems to help realize the product vision.

Your role as a SRE is crucial in ensuring that Hazelcast Platform meets business objectives, is robust and scalable, and is depended upon by customers for mission-critical implementations.

WHAT YOU’LL DO

Keep Hazelcast cloud-based production systems running smoothly 24/7/365
  • Design and Development:
  • Design, develop, and maintain our cloud infrastructure to support both our end user management center and microservice based platform
  • Implement new solutions using AWS and terraform, improving scalability, throughput, and reliability.
  • Support and manage our Keycloak IDP ensuring it provides appropriate security while meeting the needs of the development team
  • Security and Integration:
    • Implement security measures to protect data integrity and confidentiality, including encryption, access control, and compliance with relevant regulations.
    • Work with our operations team to maintain our SOC2 & ISO27001 compliance, and keeping our environment secure
  • Monitoring and Maintenance:
    • Monitor the system for performance issues, errors, and potential failures, and implement maintenance procedures such as backups, data recovery, and disaster recovery plans.
    • Troubleshoot issues related to data storage, including performance bottlenecks, data corruption, or compatibility issues with other software components.
  • Collaboration:
    • Collaborate with cross-functional teams, including software developers, architects, and product managers, to ensure the effective integration and operation of the components within the overall software infrastructure.
    • Document design decisions, implementation details, and operational procedures to facilitate collaboration among team members and ensure the maintainability of the system.
  • Continuous Learning:
    • Stay updated with the latest developments in storage technologies, Java programming language, and software engineering best practices, and apply this knowledge to improve existing storage systems and develop new solutions.
  • On-call participation
    • Be part of our on-call rotation to respond to availability incidents and work with support and engineers on customer incidents

WHAT YOU HAVE

Experience of distributed systems, Kubernetes & microservices
  • Infrastructure as Code (Terraform)
  • Modern devops stack (K8s, Prometheus, Grafana, Opentelemetry, ArgoCD, helm)
  • Experience with at least one programming languages, preferably Golang or Python
  • Experience with CI and building CD pipelines (Jenkins, GitHub Actions)
  • A passion for automation and keeping our software delivery fast and efficient
  • Knowledge of following are desirable:
    • Mutli-cloud (AWS, GCP and/or Azure)
    • Experience working with software engineers in designing cloud-native applications or troubleshooting them
    • Experience as part of an on-call rota
  • Bachelor's degree in a relevant field of study (Computer Science, or related discipline). OR equivalent experience.

BENEFITS

  • 25 days annual leave + Bank holidays
  • Group Company Pension Plan
  • Private Medical Insurance
  • Private Dental Insurance
  • Life Insurance
  • EAP (Employee Assistance Program)

Hazelcast
Hazelcast

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say