Site Reliability Engineer
Location: San Francisco, CA, us
Job Description
Responsibilities:
• Perform deep dives into both systemic and latent reliability issues; partner with software and systems engineers across the organization to produce and roll out fixes.
• Troubleshoot issues across the entire stack. Solve problems relating to mission critical services and build automation to prevent problem recurrence; with the goal of automating response to all non-exceptional service conditions
• Identify and drive opportunities to improve automation
• Engage in service capacity planning and demand forecasting, software performance analysis and system tuning.
• Participate in periodic on call duties.
• Represent the SRE team in design reviews and operational readiness exercises for new and existing services
Minimum qualifications:
• BS degree in Computer Science or related technical field, or equivalent practical experience.
• Minimum 5+ years of managing services in an internet scale *nix environment
• Practical knowledge of various aspects of service design, including messaging protocols & behavior, caching strategies and software design practices
• Experience in one or more of: Java, Tomcat, Elastic Search, MySQL or scripting experience in Shell and Python.
• Experience working with Unix/Linux systems from kernel to shell and beyond, with experience working with system libraries, file systems, and client-server protocols.
• Strong hands on experience with configuration management tools like Ansible, Puppet, or Chef
• Experience with network theory e.g. TCP/IP, UDP, ICMP, etc., MAC addresses, IP packets, DNS, OSI layers, and load balancing.
• Must work well with and be able to influence myriad personalities at all levels
• Ability to prioritize tasks and work independently
• Must be adaptable and able to focus on the simplest, most efficient & reliable solutions
• Track record of successful practical problem solving, excellent written and interpersonal communication, and documentation skills
Desired qualifications:
• Expertise in designing, analyzing and troubleshooting large-scale distributed systems.
• In-depth knowledge of operating systems (processes, threads, concurrency issues, locks, mutexes, semaphores, monitors and how they work).
• Familiarity with algorithms, data structures and complexity analysis.
• Hands on Java and Apache optimization, performance tuning and configuration
• Systematic problem solving approach, coupled with a strong sense of ownership and drive.
Qualifications
Linux Administration,Tomcat. Puppet
Additional Information
Multiple Openings
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
