Groupon

Site Reliability Engineer III - Incident Management, Linux, Root Cause Analysis - US Shift

Bengaluru, India US
Python AWS Kubernetes MySQL Redis Ruby Node.js Next.js GCP Docker Elasticsearch Java
Description

Groupon’s mission is to become the daily habit in local commerce and fulfill our purpose of building strong communities through thriving small businesses. We connect people to a vibrant, global marketplace for local services and experiences. In the process, we’re positively impacting the lives of millions of customers and merchants globally. Even with thousands of employees spread across multiple continents, we still maintain a culture that inspires innovation, rewards risk-taking and celebrates success. If you want to take more ownership of your career, then you're ready to be part of Groupon. 

Are you a passionate, energetic and technology enthusiast eager to work at a rapid pace with the flexibility to work across our suite of technologies? Are you a problem solver; someone who enjoys debugging infrastructure platforms, resolving issues, and creating solutions for common problems? Do you get a little obsessed with the details? 

We are looking for a Site Reliability Engineer (Incident Management) to join our team to support and optimize the process, implementation, and operational support of internal systems that span business side and engineering departments. 

We're a "best of both worlds" kind of company. We're big enough to have resources and scale, but small enough that a single person has a surprising amount of autonomy and can make a meaningful impact. 

We're curious, fun, a little intense, and kind of obsessed with helping local businesses thrive. 

Does that sound like a compelling place to work? 

Our infrastructure ecosystem: 

• AWS/GCP Environment 

• Docker and Kubernetes 

• Elasticsearch 

• Pingdom, Opsgenie, Kibana and Wavefront monitoring tools 

• GitHub and JIRA

• Java, Ruby and Node.js and Next.js

• MySQL and PG databases

• Redis and Memcached

• Akamai CDN

• Python Tooling 

Some more details on the role: 

• You will leverage Site Reliability Engineering best practices and ITIL Solutions Architecture framework to devise incident management strategies. 

• Incident Commander, change manager, and a senior technical resource responsible for preventing, identifying, triaging, documenting, investigating, mitigating, and recovering from site/service impacting incidents across Groupon’s ~300+ globally dispersed services. 

• Facilitating the coordination and resolution of Post Mortems through best practices, and overseeing Problem Management. 

• Dedicated project time to work on a number of interesting and engaging projects. 

• Working as part of the Incident Management team (Shift Monday-Friday with one weekend primary on-call every 6 weeks) 

We’re excited about you if you have: 

• 6+ years administering Linux system environments, as well as complete root cause analysis of site impacting issues. 

• 6+ years experience with web applications operations and root cause analysis

• 4+ years of experience creating unique Splunk or Kibana search queries to identify, resolve, and prevent incidents and outages, and have experience owning all impacting events until resolution; including coordination with Subject Matter Experts, triage tasks, creating all associated documentation, complete action items, and Post Mortem. 

• 6+ years of experience developing policies and procedures that improve overall production stability. 

• Good communication, consulting, and collaboration skills interfacing with senior leadership teams. 

• Good communication, consulting, and collaboration skills interfacing with senior leadership teams. 

• Experience with one or more programming languages (Python, Ruby, Java) 

• A plus if you have a BS, MS or PhD in Computer Sciences or related fields. 

• A plus if you have designed and created tools to manage the site and services.

We value engineers who are: 

• Customer-focused: We believe that doing what’s right for the customer is ultimately what will drive our business forward. 

• Team players. You believe that more can be achieved together. You listen to feedback and also provide supportive feedback to help others grow/ improve. 

• Fast learners: We are willing to disrupt our existing business to trial new products and solutions. You love learning how to use new technologies and then rapidly apply them to new problems. 

• Pragmatic: We do things quickly to learn what our customers desire. You know when it’s appropriate to take shortcuts that don’t sacrifice quality or maintainability. 

• Owners: Engineers at Groupon know how to positively impact the business. Groupon’s purpose is to build strong communities through thriving small businesses. To learn more about the world’s largest local ecommerce marketplace, click here for the latest Groupon news. 


 

Groupon’s purpose is to build strong communities through thriving small businesses. To learn more about the world’s largest local ecommerce marketplace, click here. You can also find out more about us in the latest Groupon news as well as learning about our DEI approach. If all of this sounds like something that’s a great fit for you, then click apply and join us on a mission to become the ultimate destination for local experiences and services.

Beware of Recruitment Fraud: Groupon follows a merit-based recruitment process without charging job seekers any fees. We've noticed an increase in recruitment fraud, including fake job postings and fraudulent interviews and job offers aimed at stealing personal information or money. Be cautious of individuals falsely representing Groupon's Talent Acquisition team with fake job offers. If you encounter any suspicious job offers or interview calls demanding money, recognize these as scams. Groupon is not responsible for losses from such dealings. For legitimate job openings, always check our official careers website at grouponcareers.com.

Groupon
Groupon
E-Commerce Internet Retail Social Media

0 applies

9 views

Other Jobs from Groupon

Senior Jira and Automation Engineer

Remote Prague, Czech Republic

Frontend Engineer - SDE III

Dublin, Ireland Prague, Czech Republic

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 401 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say