Dutchie

Staff Site Reliability Engineer

Remote
USD 190k
GCP Kubernetes Shell Go Terraform AWS Azure Python Docker
Description

About Dutchie

Founded in 2017, Dutchie is a comprehensive technology platform powering dispensary operations, while providing consumers with safe and easy access to cannabis. Dutchie aims to further support the positive societal change the cannabis industry brings to the world through wellness benefits, social justice, and empowering local communities through tax revenue. Powering thousands of dispensaries across 40+ markets throughout the United States and Canada, Dutchie is the leading technology company in the cannabis space and was named in Fast Company’s 10 Most Innovative Companies in North America and listed two years in a row on LinkedIn’s Top 50 Startups.

Dutchie has raised over $600M in funding to date, backed by D1 Capital Partners, Tiger Global, Dragoneer, DFJ Growth, Thrive Capital, Howard Schultz, Snoop Dogg’s Casa Verde Capital, Gron Ventures, members of the founding team at DoorDash, Kevin Durant’s Thirty Five Ventures, and other notable angel investors.

About This Job

We are seeking an experienced and highly technical Staff Site Reliability Engineer (SRE) to join our Reliability Engineering team. As a Staff SRE, you will be the technical lead on the team in developing and implementing innovative solutions to ensure the reliability, scalability, and performance of our critical systems. This is a highly impactful role, where you will help shape our SRE strategy, mentor team members, and drive significant improvements to our infrastructure and operations. You will work closely with cross-functional teams to design, build, and maintain systems that deliver exceptional user experiences and improve the uptime and availability of the company’s products and services.

What You'll Do...

  • Lead SRE Strategy: Define the overall technical direction and strategy for SRE at Dutchie, aligning with business goals and ensuring the highest levels of system reliability and stability.
  • Technical Leadership: Mentor and guide other engineers on best practices, emerging technologies, and industry trends, fostering a culture of continuous learning and improvement.
  • Project Execution: Drive the execution of key SRE projects, ensuring timely delivery, quality, and alignment with business objectives.
  • Operational Excellence: Collaborate with development and product teams to optimize system performance, reliability, and scalability.
  • Incident Management: Troubleshoot and resolve complex issues in production environments. Lead the resolution of critical incidents, conduct post-incident reviews, identify trends and implement preventative measures to minimize future disruptions.
  • Automation: Champion automation initiatives to streamline processes, reduce manual toil, and improve operational efficiency.
  • Performance Optimization: Continuously monitor system capacity and performance, identify bottlenecks, and implement optimization strategies to maximize efficiency and resource utilization.
  • Collaboration: Partner with stakeholders across the organization to understand their needs, communicate SRE initiatives, and foster a collaborative environment.
  • Mentorship: Provide technical guidance and mentorship to junior SREs, helping them develop their skills and grow professionally.
  • Maximize Observability: Drive successful adoption and use of observability tools (Datadog) and logging (Splunk) across the organization. Implement and manage monitoring, alerting and logging systems to ensure early detection of issues.
  • Business Continuity: Lead the design and implementation of disaster recovery and business continuity plans.
  • Support: Participate in on-call rotation to ensure 24/7 availability of our systems and services.

What You Bring...

  • Bachelor's degree in Computer Science, Information Technology, or a related field.
  • 10+ years of experience as a Site Reliability Engineer or a related role with a proven track record.
  • Strong expertise in cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes).
  • Strong technical expertise and leadership skills.
  • Proficient in scripting and automation using languages such as Python, Shell, or Go.
  • Solid understanding of networking, security, and infrastructure-as-code principles.
  • Experience with observability tools such as Datadog and logging solutions such as Splunk.
  • Proven track record of successfully leading incident response efforts and conducting post-mortems.
  • Experience in enabling application teams to enhance observability and reliability of their services.
  • Excellent communication and collaboration skills, with the ability to work effectively in a team environment.
  • Excellent problem-solving and troubleshooting skills.

It's a bonus if you...

  • Have a master's degree in Computer Science, Computer Engineering, or a related field
  • Have experience with containerization technologies (e.g., Docker, Kubernetes)
  • Have experience with Infrastructure as Code (IaC) tools (e.g., Pulumi, Terraform, CloudFormation)
  • Have experience with agile development methodologies (e.g., Scrum, Kanban)
  • Hold relevant industry certifications (e.g., CKAD)

You’ll Get…

We are targeting a starting salary of $190,000 based on the intended level for this role. There may be flexibility on individual compensation packages based on candidate skill set, experience, qualifications, and other position-related factors.

In addition to cash compensation, our total rewards package includes:

  • Full medical benefits including dental and vision plans to ensure you always have the best care.
  • Equity packages in the form of stock options to all employees.
  • Technology (hardware, software, reading materials, etc.) allowance
  • Flexible vacation and sick days

At Dutchie, we’re committed to providing an environment of mutual respect where equal employment opportunities are available to all applicants and teammates without regard to race, color, religion, sex, sexual orientation, gender identity or expression, pregnancy, age, national origin, disability status, genetic information, protected veteran status, or any other characteristic protected by law. Dutchie believes that diversity and inclusion among our teammates is critical to our success, and we seek to recruit, develop and retain the most talented people from a diverse candidate pool.
