Your Impact
Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. At Goldman Sachs, SRE is responsible for improving the availability and reliability of some of the firm’s most critical platform services and for ensuring they meet the requirements of our internal and external users. We are looking for engineers who are motivated to collaborate with our businesses to build and run sustainable production systems that can evolve and adapt to changes in our fast-paced, global business environment.
The SRE team develops and maintains platforms and tools that help other engineering teams in Goldman Sachs build and operate reliable and resilient systems. The platforms we offer range from central logging and tracing to monitoring and alerting. We also provide tools that drive adoption of, and improvements to, capacity planning, operational readiness assessments, production incident postmortems, SLIs / SLOs, and deployment automation, including canary releases.
The products and services we provide to our internal customers are used by thousands of engineers every day. We believe that reliability is the most important feature of any system, and we are devoted to giving our engineers the platforms and tools they need to build and operate reliable products.
How You Will Fulfil Your Potential
As a developer in the SRE team, you will work with internal customers, product owners, and SREs to design, develop, and support the platforms and tools that enable other engineering teams to run reliable, large-scale production systems spanning cloud and on-prem datacenters.
Responsibilities
- Design, develop, and support SRE platforms and tools
- Create and support automation solutions and build out monitoring and alerting to improve the reliability of the platforms and tools we operate
- Collaborate with other teams to onboard them onto SRE-owned platforms and tools, and help them implement SRE best practices
- Adhere to and drive SRE disciplines and processes across the global team
Basic Qualifications
- Degree in computer science or engineering with at least 3 years of industry experience
- Proficiency in at least one major programming language, preferably Java or Go, and JavaScript / TypeScript
- Excellent programming skills including debugging, testing, and optimizing code
- Strong problem-solving / analytical skills
- Experience with algorithms and data structures, as well as software and system design
- Experience automating operational tasks
- Comfortable with technical ownership, managing multiple stakeholders, and working as part of a global team
Preferred Experience
- Experience with distributed systems design, maintenance, and troubleshooting
- Experience with databases / data stores like PostgreSQL, MongoDB, and Elasticsearch
- Proficiency in using Terraform for infrastructure deployment and management
- Knowledge of cloud-native solutions in AWS or GCP
- Systems experience in Linux and networking, especially in scaling for performance and debugging complex distributed systems
- Experience with monitoring and alerting systems