DigitalOcean

Software Engineer I (Resiliency:Maintenance)

Canada
DigitalOcean Elasticsearch Ansible Python Go Shell Git
Description

Do you ever wonder what happens inside the cloud?

DigitalOcean (NYSE: DOCN) simplifies cloud computing so builders can spend more time creating software that changes the world. With our mission-critical infrastructure and fully managed offerings, DigitalOcean enables startups and small and medium-sized businesses (SMBs) to rapidly deploy and scale modern applications. As a remote-first organization, our employees, like our customers, are based around the world.

We want people who are passionate about designing and operating secure systems at scale.

We are looking for an experienced, motivated, adaptable, empathetic engineer who is comfortable working remotely. You will report to the Engineering Manager of the Maintenance team, and act as a contributor to the team’s mission. The Maintenance Mission is to improve cloud reliability, efficiency, and scalability by driving throughput via automation and optimization of flow through our bottlenecks.

Consider applying for this job if you enjoy:

  • Automating cumbersome processes
  • Digging deep to understand the true causes of bottlenecks
  • Making complex automation systems easier for other engineers to use and understand
  • Automating things for fun - writing deployment scripts for your homelab, or automating your lights using Home Assistant

More than anything, we’re looking for someone empathetic, motivated, and driven to grow with us.

Also, we’re looking to expand our team’s expertise in driving operational flow. The ability to help identify bottlenecks and optimize throughput will be extremely valuable for this role.

DigitalOcean’s Internal Culture and Tooling

DigitalOcean teams communicate primarily via Slack. The Maintenance team makes light use of Jira and GSuite. We strive to make our work-life balance comfortable, and aim to scope work appropriately so that everyone works at a healthy pace. You might expect to be on-call periodically, depending on the approach the team decides to take.

DigitalOcean’s observability platform comprises Prometheus, Grafana, Alertmanager, and Elasticsearch. Knowing any of these tools is a bonus, because every service at DO is generally expected to use this platform.

The Maintenance team exists within the Resiliency division, an arm of the Infrastructure department. We are aimed at driving high throughput of maintenance operations and minimal impact to customer availability. The Resiliency division is made up of a diverse group of nearly 40 engineers located across the US, Canada, Mexico, and Europe.

The Maintenance team is a new mission within DO and will be made up of roughly 8 engineers with various skill sets and backgrounds. Within Resiliency, there are growth opportunities along several tracks (i.e. Tech Leader, Subject Matter Expert (SME), Project Management, Engineering Manager, etc).

What You’ll Be Doing:

As an engineer, you will spend your day-to-day on:

  • Writing maintenance automation leveraging StackStorm, Ansible, and Python
  • Identifying and removing bottlenecks (where feasible)
  • Optimizing the throughput of constrained systems
  • Partnering closely with other engineering and operations teams to improve fleet health
  • Participating in large multi-quarter projects and efforts
  • Demonstrating your amazing work to tons of lovely engineers
  • Participating in our work prioritization processes
  • Responding to Slack messages & keeping up with various streams of conversation, including triaging user support requests
  • Participating in company-wide hackathons
  • Spending 5-20% of your time on innovation, education, or open source projects

As the team is just starting to come together, these are a few areas of initial focus:

  • Defining what strategies the Maintenance team will adopt to drive our mission
  • Working directly with operational teams to learn about the processes they follow to inform automation priorities
  • Developing insights into fleet health and how it impacts customer experience
  • Driving throughput for existing operational automation
  • Communicating effectively across teams and orgs

What We’ll Expect From You:

NB: If you don’t meet all of the expectations below, that’s okay! Submit an application, and be sure to include a cover letter telling us why you’d be a good fit for our team.

  • Experience driving positive outcomes via cross-functional engagement
  • Experience using or administering Linux systems
    • We use Ubuntu’s LTS releases
  • Experience automating things
  • Experience with Python and/or Golang
  • Familiarity with shell scripting
  • Familiarity with git
  • Familiarity with continuous integration systems and concepts
    • We use Github Actions & Concourse
  • Experience leveraging monitoring systems for data-driven outcomes
    • We use Grafana, Prometheus, Looker, and Elasticsearch
  • Excellent written and verbal English communication skills
  • Comfort executing in an asynchronous remote environment
  • Transparency, honesty, and openness to constructive feedback
  • A desire to work with a respectful and inclusive team

Why You’ll Like Working for DigitalOcean:

  • We reward our employees. The base salary range for this position is based on relevant years of experience and skills. The salary range for this position is based on relevant years of experience and skills. Employees may qualify for a bonus in addition to base salary; bonus amounts are determined based on company and individual performance. We also provide equity compensation to eligible employees including grants of equity upon hire and the option to participate in our Employee Stock Purchase Program.
  • We value development. You will work with some of the smartest and most interesting people in the industry. We are a high-performance organization that is always challenging our teams and employees to continuously grow. We maintain a growth mindset in everything we do and invest deeply in employee development through formalized mentorship and other internal programs. We provide all employees with reimbursement for relevant conferences, training, and education.
  • We care about your well-being. In addition to cash and equity compensation, we also offer employees a competitive array of benefits. In the United States, these include health insurance, flexible vacation, retirement benefits, a generous parental leave program, and additional resources to support employees' overall well-being. While the philosophy around our benefits is the same worldwide, specific benefits may vary in other countries due to local regulations and preferences.
  • We value diversity and inclusivity. We are an equal opportunity employer and we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

*This is a remote role

#LI-Remote

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

50,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 257 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

Cancel anytime / Money-back guarantee

Wall of love from fellow engineers