Job Description:

Job Title – Lead - Software Engineering

The Purpose of This Role

Our Site Reliability Engineering group within Enterprise Infrastructure combines Operations Excellence with the Development Experience to deliver services at high scale and high availability, with resilience, by using automation and Infrastructure as Code. We build reliability into our ecosystem by applying best practices in Resiliency Engineering, Automation, Observability & Chaos Testing.

The team comes from diverse technical backgrounds, and the responsibilities offer a variety of challenges. Ideal candidates will have a background in either software engineering or systems engineering with a desire to learn the other, or previous experience as an SRE. We are looking for a systems-thinking SRE who has helped teams scale through production insights, operational automation, developer guidance, real-time metrics, and automation, automation, automation!

The Value You Deliver

This is an exciting opportunity to join a passionate SRE team dedicated to providing a truly predictable customer experience. In times of market volatility and high volumes, there is an increased expectation of a consistent service level. At Fidelity, we strive to meet this expectation by building reliability into our ecosystem. This will be achieved through defining & implementing practices in Resiliency Engineering, Automation, Observability & Chaos Testing while also ingraining a proactive culture of reliability-first design. You will also troubleshoot stack-wide engineering issues related to hardware, software, network, applications, and cloud service providers.

The Skills that are Key to this role

Ability to automate with various scripting languages (Python, Shell scripting, etc…)
Experience managing systems using infrastructure as code tools (IAM, ARM, Terraform, Chef, …)
Solid understanding of Cloud Computing and DevOps concepts including CI/CD pipelines
Hands-on experience with container orchestration, preferably with Kubernetes
Experience & Expertise in Performance Engineering
Hands-on experience with one or more observability tools (Prometheus, Grafana, ELK/OpenSearch, OpenTelemetry, Datadog, etc…)
Experience in instrumentation, with systems skills in building and operating monitoring, logging, and alerting services for distributed systems at scale
Proven experience in maintaining the scalability and resiliency of complex environments
Proven experience in implementing advanced observability practices and techniques at scale
Ability to triage, execute root cause analysis, and be decisive under pressure
Experience managing and interpreting large datasets using query languages and visualization tools
Proficient communication skills with an ability to reach both technical and non-technical audiences
Ability to learn new software, methods, and practices and bring them to our developers
Ability to work with a variety of individuals and groups, both in person and virtually, in a constructive and collaborative manner and build and maintain effective relationships
Proven experience performing chaos testing to build confidence in the system's capability to withstand turbulent conditions in production
Strong understanding of API testing tools (SoapUI, Postman, Soatest)
Understanding of Agile Methodology
Ability to provide enterprise Cloud and Platform Engineering support for production and non-production environments and to participate in an on-call rotation to provide solutions
Experience in Cloud development (AWS and Azure) and migration skills; experience with building and operating highly resilient platforms in public cloud environments

Behavioral:

Analytical Skills and Research capabilities
Ability to evaluate and propose best-of-breed tools and engineering best-practices
Deeply self-motivated with the ability to work independently, coordinating activities within cross-regional and multi-functional teams
A passion for excellence, innovation, and teamwork; eager to learn and adapt every day
Proven track record of quickly learning, adapting, and thriving in a fast-paced, dynamic, and deadline-driven environment
Excellent Communication Skills

Preferred:

Experience in Production Support on A

How Your Work Impacts the Organization

The SRE team comprises passionate experts dedicated to deriving and implementing site reliability practices across a number of key workstreams, including Observability, Resiliency, Chaos Engineering, and Operations.

You will have accountability for delivering strategic change across a diverse set of applications, technologies, and squads.

The Expertise We’re Looking For

Bachelor’s degree in Computer Science or any other discipline
5+ years of experience

Location: Bangalore

Shift timings: 11:00 am - 8:00 pm
