Job Description:
We are hiring a Principal Site Reliability and Support Specialist, a motivated technologist and leader, to join our support organization. In this role, you will serve as a production support and SRE specialist for the FFIO Business Units' infrastructure and applications. Your key partners are Fund Accounting (Fixed Income and Equity, Money Market and Institutional Products), Pricing, Cash Hub, Trade Hub and DAL accounting Technology / Business Teams.
The team comes with a diverse technological background, and the responsibilities provide the opportunity for a variety of challenges. Ideal candidates will have a background in either software engineering or systems engineering with a desire to learn the other, or previous experience as an SRE. We are looking for a systems-thinking specialist who will help the teams scale through production insights, operational automation, developer guidance and real-time metrics. This is a great opportunity for anyone looking to lead, learn and use their Cloud, Database and Middle-tier technical skills and experience to drive production stability, reliability, and resiliency.
The Expertise You Have and The Skills You Bring
Bachelor’s degree or higher in a technology related field (like Engineering, Computer Science, Information Technology) required, master’s degree is a plus.
A minimum of 5 years of hybrid experience across Production Support, Development and SRE. Hands-on experience deploying and/or supporting highly distributed multi-tiered systems at scale.
A minimum of 2 years of experience in cloud development (AWS) and migration; experience building and operating highly resilient platforms in AWS Cloud environments.
2 - 4+ years of experience in software development with Python, NodeJS, Java with a focus on SDLC and automation.
A self-starter and team player who can independently manage multiple responsibilities in a dynamic environment.
Strong hands-on experience and ability to automate with various scripting languages such as Python, Shell Scripting, etc.
Solid understanding of Cloud Computing and DevOps concepts, including CI/CD pipelines.
Hands-On Kubernetes skills and knowledge.
Expert, hands-on experience with one or more observability tools (Prometheus, Grafana, ELK/OpenSearch, OpenTelemetry, Datadog, etc.).
Experience with instrumentation and with building and operating monitoring, logging and alerting services for distributed systems at scale.
Proven experience in maintaining scalability and resiliency in complex environments.
Proven experience in implementing advanced observability practices and techniques at scale.
Ability to triage, perform root cause analysis, and be decisive under pressure.
Experience managing and interpreting large datasets using query languages and visualization tools.
Excellent verbal and written communication skills, and the ability to tailor them to various audiences.
A high level of curiosity and a desire to learn new technologies and tools and bring them to our developers.
Ability to work with a variety of individuals and groups, both in person and virtually, in a constructive and collaborative manner to build and maintain effective relationships.
Familiarity with Agile Software Development Methodologies.
Highly effective business communication and influencing skills.
AWS and AWS / EKS certifications are a plus.
The Team
Our Site Reliability Engineering and production support services group within Enterprise Infrastructure for Fidelity Fund and Investment Operations (FFIO) combines Operations Excellence with the Development Experience to deliver services at high scale and high availability, with resilience, by using automation and Infrastructure as Code. We build reliability into our ecosystem by applying best practices in Resiliency Engineering, Automation and Observability, in addition to core production support such as Incident, Change, Problem and Release management.
We partner with our key stakeholders in Information Technology and business teams to deploy new functionalities, software fixes, SRE Features and support applications in a wide range of infrastructures and products.
Certifications:
Category:
Information Technology
Fidelity’s hybrid working model blends the best of both onsite and offsite work experiences. Working onsite is important for our business strategy and our culture. We also value the benefits that working offsite offers associates. Most hybrid roles require associates to work onsite every other week (all business days, M-F) in a Fidelity office.