Job Description:
Position Description:
Drives software production stability, reliability, and resiliency within operating system environments (Linux) using Oracle, Java, Python, and PL/SQL. Promotes code into production using Continuous Integration and Continuous Delivery (CI/CD) tools -- Concourse, Jenkins, and Udeploy. Improves Cloud operations and delivery -- Amazon Web Services (AWS) – using container orchestration tools (Kubernetes EKS and AKS). Debugs and addresses production failures for major system outages or issues using Extract Transform Load (ETL) tools -- Informatica. Supports the resolution of production support issues using Datadog and Sitescope. Provides ongoing technical production support for applications and software infrastructure.
Primary Responsibilities:
Supports Cloud-based and On-Prem database, middle-tier services, and applications.
Coordinates the planning and successful execution of special events and projects.
Drives incident management, root cause analysis, problem management, and organizational change processes.
Resolves production tickets and prioritizes critical production outages on crisis management calls based on the SLA and business impacts.
Performs batch cycle management – monitors, analyzes, and completes batch cycles within the SLA.
Implements alarms for batch failures or long running jobs, system outages, and performance issues to identify root cause and steps to mitigate the issue and restore service.
Designs and implements monitoring and alerting mechanisms for applications and infrastructure.
Supports deployment of applications on Cloud or on-Prem platforms.
Proposes modifications and improvements as part of the continuous improvement efforts.
Supports, manages, and provides evidence of successful disaster recovery executions for critical applications to ensure compliance with enterprise requirements.
Develops Standard Operating Procedures (SOPs) and documentation for teams to successfully support applications and infrastructure.
Supports major market driven events and holidays to maintain production stability and availability.
Drives metrics and reporting processes to produce reporting and metrics using SQL and ServiceNow.
Facilitates informed support and operational decisions.
Serves as the point of escalation for outages to ensure prompt escalation to engineering teams as required to minimize the business impact.
Participates in on-call rotations.
Education and Experience:
No degree and five (5) years of experience as a Senior Systems Engineer (or related occupation) providing product support for Cloud applications and infrastructure that requires batch cycle management, deployment and cloud infrastructure management within a financial services or banking or insurance domain.
Or, alternatively, Bachelor’s degree (or foreign education equivalent) in Computer Science, Engineering, Information Technology, Information Systems, Mathematics, Physics, or a closely related field and three (3) years of experience as a Senior Systems Engineer (or related occupation) providing product support for Cloud applications and infrastructure that requires batch cycle management, deployment and cloud infrastructure management within a financial services or banking or insurance domain.
Or, alternatively, Master’s degree (or foreign education equivalent) in Computer Science, Engineering, Information Technology, Information Systems, Mathematics, Physics, or a closely related field and one (1) year of experience as a Senior Systems Engineer (or related occupation) providing product support for Cloud applications and infrastructure that requires batch cycle management, deployment and cloud infrastructure management within a financial services or banking or insurance domain.
Skills and Knowledge:
Candidate must also possess:
Demonstrated Expertise (“DE”) providing production support for Cloud applications and infrastructure using Datadog or Splunk or Cloudwatch, Autosys or Control-M or CA-ESP, and uDeploy or Jenkins or Bamboo, within a financial services environment, or banking or insurance domain; Identifying root causes for applications and infrastructure hosted on AWS by providing production support for incident management using log aggregating tools (Cloudwatch or DataDog, or Splunk); and promoting new features and functionalities by supporting the deployment of applications in production and non-production environments, using CI/CD tools -- Jenkins or Udeploy or Bamboo.
DE triaging application or infrastructure issues for large scale enterprise systems by performing log analysis and reviewing application health dashboards and performance monitors to identify the root cause, patterns, and key operational metrics/trends using SQL, Venafi, and Splunk or Datadog; and applying and implementing Information Technology Infrastructure Library (ITIL) standards for operations, and incident, problem, request management, change, and release management using ITSM tools -- ServiceNow or HP Service manager or BMC Remedy.
DE providing installation and upgrade support for applications and infrastructure to development teams, using Amazon Web Services (AWS) Cloud platform; performing operational tasks in AWS to scale services -- Elastic Cloud Compute (EC2) and Auto Scaling Groups; and coordinating, monitoring, and supporting the successful completion of batch cycles within the SLA using Autosys, Control-M or CA-ESP.
DE automating manual operations and certificate management systems to renew and expire certificates in a timely manner in production and non-Production environments using Venafi and Jenkins or Bamboo pipelines; and building Standard Operating Procedures (SOPs), and supporting deployments using Operational tools -- uDeploy or Bamboo or Jenkins.
#PE1M2
Certifications:
Category:
Information TechnologyFidelity’s hybrid working model blends the best of both onsite and offsite work experiences. Working onsite is important for our business strategy and our culture. We also value the benefits that working offsite offers associates. Most hybrid roles require associates to work onsite every other week (all business days, M-F) in a Fidelity office.
Other Jobs from Fidelity
Director, Full Stack Engineering
Lead - Software Engineering
Lead - Software Engineering - Java Full Stack(Java, SQL, AWS)
Mobile Engineer (iOS or Android)
Senior Full Stack Engineer (Java, Spring, Angular)
Similar Jobs
Python ML Developer
Senior Data Infrastructure Engineer
Staff Software Engineer, Demand Applications
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 401 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say