First Abu Dhabi Bank

Senior Engineer, Alerting & Incident Management

Abu Dhabi
Description

Location: Abu Dhabi, United Arab Emirates

Company Description

Join the UAE’s largest bank and one of the world’s largest and safest financial institutions. Our focus is to create value for our employees, customers, shareholders and communities to grow through differentiation, agility and innovation. We are looking for top talent, and your success is our success. Accelerate your growth as you help us reach our goals and advance your career. Be ready to make your mark at a top company in an exciting and dynamic industry.

Job Description

Overall objectives

• To establish and maintain an effective, intelligent, and timely alerting framework across infrastructure, application, and business services.

• To coordinate and continuously improve the incident management lifecycle with a focus on early detection, rapid response, and root cause accountability.

• To integrate observability data (logs, metrics, traces) into a unified alerting and incident response workflow.

• To reduce Mean Time to Detect (MTTD) and Mean Time to Resolve (MTTR) through automation, clear escalation paths, and operational discipline.

Role specific responsibilities

• Manage and continuously improve the incident response process, including triage, escalation, status communications, and resolution tracking.

• Act as the incident commander during major outages or high-severity issues, coordinating technical teams toward resolution.

• Maintain and govern on-call schedules, escalation paths, and responder playbooks.

• Integrate observability tools with incident management platforms to enable real-time, contextual alerting.

• Lead and document root cause analysis (RCA) and ensure completion of follow-up actions and preventive measures.

• Report on incident metrics and trends, identifying areas for resilience and process improvement.

General functional responsibilities

• Maintain detailed documentation on alert rules, incident workflows, contact rosters, and escalation trees.

• Ensure compliance with regulatory, audit, and risk management requirements related to incident response and system availability.

• Collaborate with monitoring, logging, and APM peers to align telemetry signals with operational response.

• Work with development, infrastructure, and support teams to embed alert and incident management best practices in the SDLC and change management.

• Participate in regular incident simulations and on-call readiness drills.

• Drive continuous improvement through retrospective reviews, blameless post-mortems, and incident automation.

Qualifications

Core competencies required

Strong experience with alert management platforms such as Opsgenie, Splunk On-Call, ServiceNow Event Management, or VictorOps.

Familiarity with routing rules, escalation policies, noise suppression, on-call schedules, and alert deduplication.

Deep understanding of the end-to-end incident management process—detection, triage, escalation, communication, and closure.

Proficient in running major incident bridges, documenting timelines, and leading post-incident reviews (PIRs/RCAs).

Calm and assertive in high-pressure incident scenarios.

Excellent communicator, able to coordinate with technical and business stakeholders during incidents.
