Job Summary
Job Description
What will you do?
- Run the production environment by monitoring availability and taking a holistic view of system health
- Build tools to manage platform infrastructure and applications
- Debug production issues across services and levels of the stack and provide primary operational support and engineering for multiple large distributed software applications
- Help adopt and drive the creation of tools for health monitoring and alerting applications.
- Improve reliability, quality, and time-to-market of our suite of software solutions
- Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of application team needs, and innovating to continually improve
- Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding.
- Partner with development teams to improve services through rigorous testing and release procedures.
- Participate in system design consulting, platform management, and capacity planning.
- Create sustainable systems and services through automation and uplifts.
- Balance feature development speed and reliability with well-defined service level objectives.
Technical Leadership
- Provide SRE thought leadership on the squad level
- Perform code and non-functional (performance, security, maintainability) reviews of all production-bound SRE solutions
- Help drive transformation by continuously looking for ways to automate existing processes
- Run engineering mindset meetups accelerating breadth and depth of knowledge in the community
- Manage application assets and users (virtual machines, cloud instances, source code repositories, etc.)
- Publish technical design for SRE solutions
- Publish and/or review implementation plans for SRE solutions bound to production
- Explore new capabilities and technologies to drive innovation (including coding and publishing how-to documentation)
Production Support + Development
- Perform production support role, including off-hours support
- Assist in incident management and problem management for applications in scope
- Evaluate continuously – what went well, what went wrong, what can be done to improve and prevent in future
- Maintain technology currency (perform server patching, certificate renewal, etc.) with a keen eye on automating opportunities
- Ensure availability and uptime of applications in scope, as per service level objectives
- Ensure compliance with all systems and applications in scope, including maintaining segregation of duties
What Do You Need To Succeed?
Must have:
- Overall, 5- 7 years of support experience in Openshift, Azure & Kubernetes.
- 3 years of experience as an SRE supporting multiple applications
- Have very strong programming skills in Java/JavaScript/Typescript/Python
- SQL database operational experience in the cloud/on-premise and writing/understanding database queries (SQL and/or No-SQL)
- Object Oriented design and development
- Exposure to UCD, PCF (Pivotal Cloud Foundry), and GitHub is desirable
- Having a good overall understanding of networking-related areas like certificates, load balancers etc.
- Monitoring using Splunk, Dynatrace, RUM, Grafana & other related tools
- Experience with the operational aspects of software systems such as monitoring, centralized logging, and alerting.
- Experience in micro-services, public cloud (Azure preferred) & container technologies
- Working knowledge of Mainframes & JCL is nice to have
Nice-to-have:
- Knowledge of public cloud (Microsoft Azure and AWS) and private cloud (OpenShift) platforms and development of applications in multi-cloud, hybrid environments
- Knowledge of containers and orchestration (e.g: Docker, Kubernetes)
What's in it for you?
- A comprehensive Total Rewards Program including bonuses and flexible benefits, competitive compensation, commissions, and stock where applicable.
- Leaders who support your development through coaching and managing opportunities.
- Ability to make a difference and lasting impact.
- Work in a dynamic, collaborative, progressive, and high-performing team.
- Flexible work/life balance options.
- Opportunities to do challenging work.
- Opportunities to take on progressively greater accountabilities.
- Opportunities to build close relationships with clients.
#LI-Hybrid #LI-POST #TECHPJ
Job Skills
Critical Thinking, Customer Support Systems, Group Problem Solving, Installation Support, IT Service Level Management, IT Service Management (ITSM), IT Standards, Technical TroubleshootingAdditional Job Details
Address:
City:
Country:
Work hours/week:
Employment Type:
Platform:
Job Type:
Pay Type:
Posted Date:
Application Deadline:
Note: Applications will be accepted until 11:59 PM on the day prior to the application deadline date above
Inclusion and Equal Opportunity Employment
At RBC, we embrace diversity and inclusion for innovation and growth. We are committed to building inclusive teams and an equitable workplace for our employees to bring their true selves to work. We are taking actions to tackle issues of inequity and systemic bias to support our diverse talent, clients and communities.
We also strive to provide an accessible candidate experience for our prospective employees with different abilities. Please let us know if you need any accommodations during the recruitment process.
Join our Talent Community
Stay in-the-know about great career opportunities at RBC. Sign up and get customized info on our latest jobs, career tips and Recruitment events that matter to you.
Expand your limits and create a new future together at RBC. Find out how we use our passion and drive to enhance the well-being of our clients and communities at jobs.rbc.com.
Other Jobs from Royal Bank of Canada
Relationship Manager, Business Markets, Intern
Associate Director, Equity Derivatives Full-Stack Developer
Senior Data Scientist
Senior Network Security Engineer (Global Security)
Senior Quality Engineer
Associate Director, Cyber and IT Risk and Reporting
Similar Jobs
Lead Full Stack Engineer – Apollo Capital Solutions
Senior Backend Developer (GenAI Solutions)
Senior Software Engineer (Platform)
Software Engineering Architect
Lead Software Engineer
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say