Senior Site Reliability Engineer
Location: London, UK
Time Type: Full time
Job Description
As a leading financial services and healthcare technology company based on revenue, SS&C is headquartered in Windsor, Connecticut, and has 27,000+ employees in 35 countries. Some 20,000 financial services and healthcare organizations, from the world's largest companies to small and mid-market firms, rely on SS&C for expertise, scale, and technology.
SS&C is a global financial technology and software-enabled services company that provides mission-critical solutions primarily to the financial services and healthcare industries. It is headquartered in Windsor, Connecticut, USA, and is publicly listed on the NASDAQ. SS&C is widely regarded as one of the largest administrators of hedge fund and private equity operations and the largest mutual fund transfer agency globally.
About the role:
Operated within the SS&C WIT business, Genesis is an all-new investment operations platform that provides extensive asset class and functional support across the front, middle, and back office. Built natively for the cloud with advanced technology, Genesis features an innovative user experience, actionable monitors, notifications, and alerts infused with AI.
The role requires in-depth knowledge of observability principles and strong experience implementing the observability stack across infrastructure, data and application layers for real-time, compute-intensive, distributed environments. The Senior SRE Engineer will have a solid understanding of cloud platforms and container orchestration. They will have a comprehensive grasp of incident management and operational risk mitigation, and experience implementing automation frameworks to minimize toil and reduce MTTD/MTTR. They will have proven experience using infrastructure as code and familiarity with AI-driven operational tooling. They will be a logical thinker with strong problem-solving and communication skills and a desire to effect continuous improvement.
Your Responsibilities:
- Share ownership of production-level resilience and reliability for business-critical systems.
- Leverage industry-standard observability technologies to provide a centralized view of system and service health.
- Implement and continually improve monitoring and alerting based on harvested logs, metrics and traces.
- Lead incident response, post-incident reviews and post-remediation improvements.
- Define and establish KPIs, SLIs and SLOs in support of agreed service levels.
- Develop and maintain automation, and leverage generative AI technologies, to reduce operational toil and improve MTTD and MTTR.
- Take on support for additional technical service components as the service evolves.
- Support, mentor and train SRE Engineers.
- Work with other teams to maintain a sound knowledge of all aspects of the application technical architecture.
- Contribute to building up and maintaining a knowledge base in support of the technical role.
- Maintain an awareness of, comply with and champion the stated service controls required to achieve audit compliance.
Your Experience:
- Bachelor’s degree in Computer Science, Software Engineering, or a related field.
- ITIL foundation level or experience working in an ITIL framework preferred.
- 4+ years of Linux OS and Windows OS systems management experience.
- 4+ years of experience with observability technologies for system monitoring and alerting (e.g. Prometheus, Grafana, Loki).
- 2+ years working in a team environment with operational responsibilities for client facing applications.
- 2+ years of experience with containerization technologies and Kubernetes.
- Proven scripting skills in at least one of shell scripting (csh, ksh, Bash or Windows PowerShell), Ansible, Terraform or Python.
- Working experience with workload automation / enterprise scheduling tools such as Airflow.
- Working experience with, and a technical understanding of, NoSQL DBs such as MongoDB/Cassandra and traditional relational DBs such as SQL Server/Oracle/PostgreSQL.
- Working experience of a cloud self-service environment.
- Working experience of LLM or AI usage in monitoring and observability stacks.
EEO Statement / Non-agency Disclosure
We encourage applications from people of all backgrounds and particularly welcome applications from under-represented groups, to enable us to bring a diversity of perspectives to our thinking and conversation. It's important to us that we strive to have a workforce that is diverse in the widest sense.
Unless explicitly requested or approached by SS&C Technologies, Inc. or any of its affiliated companies, the company will not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services.
Applications will be accepted on an ongoing basis until the position is filled.
SS&C Technologies is an Equal Employment Opportunity employer and does not discriminate against any applicant for employment or employee on the basis of race, color, religious creed, gender, age, marital status, sexual orientation, national origin, disability, veteran status or any other classification protected by applicable discrimination laws.