Restaurant365

Site Reliability Engineer

Austin, TX; Akron, OH
USD 99k - 138k
Bash, Python, PowerShell, Terraform, Ansible, CloudFormation, Linux, Windows, Nginx, Apache Tomcat, GitLab, Git, Azure, AWS, GCP, EKS, ECS, AKS, Lambda, S3, Prometheus, Grafana, ELK, Site24x7, Nagios
Description

Site Reliability Engineer II

Team: DevOps

Location: Austin, TX; Akron, OH; Irvine, CA; SF Bay Area/Northern California; Silicon Valley Region; Denver, CO

Commitment: Full Time

Workplace Type: Hybrid

Restaurant365 is a SaaS company disrupting the restaurant industry! Our cloud-based platform provides a unique, centralized solution for accounting and back-office operations for restaurants. Restaurant365’s culture is focused on empowering team members to produce top-notch results while elevating their skills. We’re constantly evolving and improving to make sure we are and always will be “Best in Class” ... and we want that for you too!

This role requires a hybrid work schedule based out of one of our office locations: Austin, TX; Irvine, CA; or Akron, OH. 
 
The Site Reliability Engineer II will be responsible for supporting, enhancing, and maintaining Restaurant365’s cloud infrastructure and applications. Qualified candidates will demonstrate growing expertise in site reliability practices, with skills in incident response, system monitoring, automation, and performance troubleshooting. You will collaborate with DevOps, development, and infrastructure teams to resolve moderately complex issues, propose improvements, and strengthen the reliability, scalability, and security of our SaaS platform. 

How you'll add value:

Execution & Collaboration

  • Respond to production incidents, perform triage and troubleshooting, and contribute to post-incident analysis. 
  • Identify and automate manual processes to improve efficiency and reduce risk. 
  • Enhance and evolve monitoring tools and platforms to improve observability. 
  • Promote and apply best practices for reliability, scalability, and performance across engineering. 
  • Implement and support cloud automation using Terraform, Ansible, or CloudFormation. 
  • Work within change management protocols to provide maximum uptime for production systems. 
  • Participate in the on-call rotation, providing 24x7 support for incidents and contributing to root cause analysis.
  • Partner with developers, architects, vendors, and IT teams to ensure reliable system operations. 
  • Research and remediate vulnerabilities in coordination with security teams. 
  • Maintain documentation of infrastructure, monitoring, runbooks, and incident response procedures.

Standards & Process

  • Apply company policies and procedures when handling operational tasks and incidents. 
  • Suggest and implement improvements to operational processes and monitoring practices. 
  • Contribute to technical diagrams, documentation, and runbooks for system reliability.

Learning & Growth

  • Expand expertise in cloud services (Azure, AWS, or GCP) and container platforms (EKS, ECS, AKS). 
  • Build proficiency with observability and monitoring tools (Prometheus, Grafana, ELK, Site24x7, Nagios). 
  • Develop scripting and automation skills using Python, Bash, PowerShell, or similar. 
  • Participate in planning discussions by contributing technical input on system stability and reliability. 

What you'll need to be successful in this role:

  • BS in Computer Science, Information Systems, or related field (or equivalent experience). 
  • 2–4 years of experience in site reliability engineering, DevOps, or cloud operations. 
  • Experience with cloud platforms (Azure or AWS), including services such as AKS, ECS/EKS, Functions/Lambda, S3, and Blob storage. 
  • Proficiency with infrastructure-as-code and automation (Terraform, Ansible, YAML, Python, Bash, PowerShell). 
  • Strong Linux engineering skills; working knowledge of Windows administration. 
  • Experience supporting production environments and participating in on-call rotations. 
  • Familiarity with web servers and middleware (Nginx, Apache Tomcat). 
  • Experience with CI/CD and version control tools (GitLab, Git, or similar).
  • Strong written, oral, and interpersonal communication skills.

Preferred Qualifications

  • Experience with monitoring tools (Prometheus, Grafana, ELK, Site24x7, Nagios). 
  • Knowledge of performance analysis and system vulnerability remediation. 
  • Cloud certification (AWS or Azure).
  • Familiarity with restaurant industry SaaS platforms and customer-facing applications. 

R365 Team Member Benefits & Compensation

  • The expected salary range for this position is $98,583-$138,016 annually. The actual salary may vary based on several factors, including, but not limited to, relevant skills/experience, time in the role, business line, and geographic location. Restaurant365 focuses on equitable pay for our team and aims for transparency with our pay practices.
  • Comprehensive medical benefits, 100% paid for the employee
  • 401k + matching
  • Equity Option Grant
  • Unlimited PTO + Company holidays
  • Wellness initiatives
DYN365, Inc d/b/a Restaurant365 is an equal opportunity employer.