What you will be doing:
We are seeking a highly skilled Senior Site Reliability Engineer (SRE) to join our Data Infrastructure team. You will be responsible for ensuring the reliability, availability, and performance of our critical data systems running on AWS and GCP. Your expertise in cloud infrastructure, automation, and operational excellence will be crucial in supporting our product across our global client base.
As a Senior Site Reliability Engineer, you will:
- Design, implement, and maintain highly available and reliable data infrastructure services, including SQL, NoSQL, Kafka, and Spark-based data layers. Define and monitor Service Level Objectives (SLOs) and Service Level Agreements (SLAs).
- Participate in an on-call rotation to respond to incidents and ensure rapid resolution of production issues. Conduct thorough post-incident reviews to identify root causes and implement preventative measures.
- Manage and automate cloud infrastructure using Terraform and Helm, adhering to GitOps principles.
- Implement and maintain comprehensive monitoring, logging, and tracing solutions to proactively identify and resolve performance and reliability issues.
- Monitor and manage data infrastructure capacity, plan for future growth, and optimize performance through tuning and automation.
- Develop and maintain automation scripts and tools to streamline operational tasks, improve efficiency, and reduce manual effort.
- Ensure the security and compliance of data infrastructure services, implementing best practices for access control, data protection, and vulnerability management.
- Collaborate with development and data engineering teams to ensure smooth deployments and operational support. Maintain thorough documentation of infrastructure configurations, processes, and procedures.
- Manage and maintain distributed databases running within a Kubernetes environment.
Our Tech Stack:
- Cloud-Based Infrastructure: Fully cloud-based with a Kubernetes-focused tech stack. Compute workloads run in Kubernetes clusters across multiple regions.
- Infrastructure Management: Heavy use of Terraform and Helm, adhering to GitOps paradigms for managing cloud infrastructure and Kubernetes applications.
- Core Technologies: Extensive use of Kafka, distributed PostgreSQL and Cassandra QL, Elasticsearch, and Databricks/Spark. Development of inter-cloud failover options to support multi-cloud plans.
- Wide Array of Applications: Teams build and release containerised applications for low latency APIs, machine learning models, and data processing pipelines.
About You:
- At least 5 years of experience as an SRE managing cloud infrastructure (AWS and/or GCP) and data systems (Apache Kafka, Apache Spark, Elasticsearch, PostgreSQL, Cassandra). Proven track record of improving reliability and availability in complex production environments.
- Extensive experience codifying infrastructure using Terraform and Helm charts.
- Proven experience managing and troubleshooting distributed databases within Kubernetes.
- Deep understanding of monitoring, logging, and tracing tools and techniques.
- Strong incident response and troubleshooting skills.
- Proficiency in scripting and automation tools.
- Understanding of security best practices for cloud infrastructure and data systems.
- Familiarity with CI tooling, test pipelines, and asset generation (e.g., Docker images, Helm charts).
Education:
- BSc/BA degree in computer science, engineering, or a related discipline, or equivalent experience in the required skills.
Nice to have:
- Familiarity with distributed SQL and NoSQL databases such as Yugabyte, Cockroach, Spanner, HBase, or CouchDB.
- Familiarity with data modelling, sharding, and indexing strategies for large-scale databases.
What’s in it for you?
- Equity, as we want you to own a part of what we are building
- Private medical insurance, ensuring peace of mind while you excel in your career
- Unlimited Time Off Policy: work-life balance and a focus on well-being are critical to keeping us performing at our best
- We embrace a hybrid approach that requires employees to be in the office for two days a week. We strongly believe that this approach fosters collaboration and enables the building of meaningful relationships
- You will also get a new starter budget to kit out your home office
- Opportunity to work on innovative projects with smart-minded people keen to share their knowledge and continuously improve
- Annual learning budget (prorated based on start date) to drive your performance and career development
About us:
ComplyAdvantage is the financial industry’s leading source of AI-driven financial crime risk data and detection technology. Our mission is to neutralise the risk of money laundering, terrorist financing, corruption, and other financial crime.
More than 1000 companies rely on us to understand the risk of who they’re doing business with through the world’s only global, real-time database of people and companies. Our solutions identify thousands of risk events daily from millions of structured and unstructured data points.
We have five global hubs in New York, London, Singapore, Lisbon and Cluj-Napoca and are backed by Goldman Sachs, Ontario Teachers, Index Ventures, and Balderton Capital.
Since 2014, we have raised over $100 million in funding, and in 2022 alone we grew by over 40% to over 500 people globally. Over the next 12 months, as our revenue increases, we plan to grow to 600.
At ComplyAdvantage, diversity fuels our rocket ship, and our commitment to inclusion across race, gender, age, religion, identity and experience drives us forward every day. We encourage everyone to apply and aspire to consider every application fairly.
We will handle your information in accordance with our Privacy Policy.
