ING

Site Reliability Engineer

Bucharest, RO
Bash C# C++ Java JavaScript Python SQL Spring API Microservices Ansible Azure Docker GCP Git Kubernetes Prometheus Elasticsearch ITIL
Description

Site Reliability Engineer - WARF @ING Bank

Location: Bucharest, 10, RO

Employment Type: Full time

Discover ING Bank Romania

ING believes in a world where everyone has the right to grow and progress in their own way. We express this in our global tagline, “do your thing”. Perhaps more than in any other large company, we extend our belief in the power of autonomy to our own people. But there’s a catch. In return for great freedom, we expect people to do great things for our customers, our stakeholders, and ING at large.

To work here is to be surrounded by people who are energetic, ambitious, friendly and respectful: talented specialists who take the responsibility and autonomy to make great things happen. We stay curious, thrive on change, and seek new and better ways to make it happen. Active in Romania for 30 years, ING Bank pioneered and challenged the local banking industry. Technology and innovation are at the core of what we do, making our products relevant for our customers’ lives and businesses.

ING Bank Romania is the only bank with an organic growth within the top 10 local banks by assets, without acquisitions of client portfolios or other banks. ING Bank Romania is an universal bank with more than 1.8 million customers from three business segments: individuals (retail), SME and Mid-Corporate companies and Wholesale Banking.

Join us!

Mission

The SRE team is responsible to roll-out the SRE (Site Reliability Engineering) practices to improve the reliability of Critical Business Services for ING Bank Romania. The SRE team is responsible for defining, introducing, and promoting SRE processes and practices like Observability, Incident & Problem Management, Capacity & Performance Management, IT Service Continuity, Well-Architected Review Framework, Operational Resilience & Reliability Testing, Release Procedures & Change Management, Reliability reporting & error budgeting, etc.

This role is responsible for ‘Resilience by design’ and challenges & contributes to ING’s Well-Architected Framework and underlying reliability patterns (as developed by Enterprise Architecture).

As part of the SRE team, you will:

  • Steer patterns to implementation. This includes design and/or development of conformity bots in the CI/CD pipeline, policy-as-code validations for infrastructure provisioning, conformity monkey or other ways to validate implementation in production and perform drift detection.

  • Ensure proper documentation, training material and other ways to get the knowledge to our engineers across ING.

  • Contribute as a reliability expert to key operational activities with a focus on services or incidents touching multiple key areas. This includes performing Critical Business Service/critical chain reviews to identify weaknesses to be solved and supporting P1 incidents and Major Incidents (as expert) by providing expertise that ensures high quality root-cause analysis and by ensuring follow-up of structural (architectural/design-related) findings with Architects & DevOps.

Your day-to-day

The initial focus will be to challenge and to contribute to ING’s Well-Architected Framework (WARF) and underlying reliability patterns (as developed by Enterprise Architecture). The rest of the activities include:

  • Ensures that the architecture of IT Services that support CBSs is designed for resilience.

  • Prepares, facilitates, and coordinates the Well-Architected Review E2E to identify weaknesses to be solved. Organizes the review based on the process specific triggers, selects the System Experts and the Reviewers (including Lead Reviewer) that should be included in the review.

  • Ensures that the Reviewers challenge the design and implementation of IT Services based on best practices from the Well-Architected Framework.

  • Ensures weaknesses are identified during the Well-Architected Review if the case and documents the findings in the Review Document template. If actions are required, ensures that backlog items are created and follows-up on their resolution.

  • Ensures accurate reporting of the Well-Architected Reviews and related improvements.

  • Operates in strong cooperation with Architects, the rest of the SRE team, engineers and aligns with the Global Review Coordinator from Global SRE team.

  • Supports P1 Incidents and Major Incidents as expert and provides expertise to ensure high-quality root-cause analysis and follow-up of structural (architectural/design-related) findings with Architects and DevOps teams.

  • Explore AI technologies in order to gain efficiency through WAR outcomes.

  • Be a reliable partner for all Delivery squads and provide hands-on guidance to assure a great level of resilience.

What you bring to the team

  • Education: Bachelor's or Master's degree in computer science, information systems, or a related discipline.

  • Experience: 10+ Years in software engineering/IT operations and/or IT architect roles.

  • Technical skills:

    • Knowledgeable about technology in all levels in the technology stack (from infrastructure to front-end, from CI/CD to observability tooling) with expert knowledge & hands-on experience on one or more levels (e.g. infrastructure & back-end development and/or observability & CI/CD tooling).

    • In-depth knowledge of system design and experience with scalable and reliable infrastructure.

    • Understanding of network protocols, security best practices, and ability to implement secure and robust solutions.

    • Competence in using Cloud services.

    • ToolsING Private Cloud or Public Cloud (Azure or Google Cloud) and related VM/container stacks & tooling; application-level technologies & tooling heavily in use at ING e.g. spring boot, ING’s API SDK, Azure DevOps, Prometheus/ELK stack/Tracing or ING’s specific implementations (e.g. RTK2, Log4All, MDPL).

  • Proven experience or interest in the Site Reliability Engineering (SRE) methodology, IT security and compliance. Familiarity with DevOps culture and practices.

  • Proven experience with ITIL processes and ITSM tools .(ServiceNow, Azure DevOps, etc).

  • Strong analytical and problem-solving skills.

  • High accuracy in performing duties.

  • Ability to efficiently promote in the organization the SRE concepts and frameworks.

  • Effective communication, both written and verbal, to convey complex technical concepts in a clear and understandable manner.

  • Strong stakeholder management abilities.

What we offer

  • Impactful work in a fun and collaborative environment.

  • Open-concept offices designed for both teamwork and relaxation.

  • Corporate events and social gatherings.

  • Hybrid way of working with flexible working schedule and short week options.

  • Monthly budget on Benefit platform.

  • Extra annual leave days depending on the total length of working experience.

  • Growth opportunities through upskilling/ reskilling programs and a variety of learning and development platforms: ING Learning Centre, Udemy, Bookster, as well as through trainings and certifications.

  • Possibility to access Internal roles, International Short-Term Assignments or Long-Term Assignments.

  • Context to make an impact through Sustainability and Corporate Social Responsibility projects.

ING
ING

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say