DigitalOcean

Staff Systems Development Engineering

USD 162k - 235k
This job is closed.
Description

Do you ever wonder what happens inside the cloud?

DigitalOcean (NYSE: DOCN) simplifies cloud computing so builders can spend more time creating software that changes the world. With our mission-critical infrastructure and fully managed offerings, DigitalOcean enables startups and small and medium-sized businesses (SMBs) to rapidly deploy and scale modern applications. As a remote-first organization, our employees, like our customers, are based around the world.

We are seeking a Staff Software Engineer to play a pivotal role in advancing our GPU Bare Metal offerings.

This position focuses on developing robust, high-performance systems tailored to meet customer needs in a dynamic environment. With a strong emphasis on SRE principles, the ideal candidate will drive innovations that enhance system reliability and efficiency, contributing to our cutting-edge technology landscape.

Key Areas of Focus:

  • Engineer robust GPU Bare Metal solutions that exceed customer expectations.
  • Implement and enhance monitoring systems to ensure optimal performance and rapid issue resolution.
  • Operate in a fast-paced, dynamic environment, actively addressing and fulfilling customer needs.
  • Apply Agile methodologies and SRE principles to ensure efficient and effective project delivery.
  • Focus on quality and speed to develop scalable and reliable technological solutions.
  • Play a key role in the development and scaling of complex solutions that captivate and expand our customer base.
  • Engage in continuous learning to maintain a growth mindset.
  • Utilize data-driven insights to enhance performance and customer satisfaction.

What You’ll Be Doing:

  • Design, develop, and refine monitoring systems for a world-class GPU offering.
  • Define and refine infrastructure requirements to support innovative AI/ML workloads.
  • Collaborate closely with technical leaders to manage large datasets effectively and troubleshoot system issues.
  • Support performance teams with industry-standard testing methodologies to optimize GPU fabric throughput.
  • Develop and integrate advanced storage solutions, including Object, NFS, and block storage technologies, ensuring their seamless performance and scalability within GPU infrastructure projects.
  • Participate in security improvement initiatives and contribute to internal review discussions.
  • Engage with DigitalOcean’s Architecture group to help shape engineering practices and approaches with an SRE focus.
  • Implement new infrastructure functions and technologies that bolster DigitalOcean’s AI/ML product capabilities.
  • Contribute to open-source communities relevant to our technology stack.

What We’ll Expect From You:

  • Proven expertise in leading, developing, and scaling complex solutions from inception to global scale, emphasizing speed, quality, and customer satisfaction.
  • Excellent leadership and collaborative skills, with a capacity to drive innovation in an Agile/Kanban environment.
  • Demonstrated ability to develop and optimize complex storage and networking solutions.
  • Strong analytical skills, with the ability to use data to improve system stability and performance and to inform decision-making and problem-solving.
  • Commitment to continuous learning and improvement, particularly in technologies that support AI/ML workloads.
  • A consistent focus on customer needs and feedback to drive product excellence.
  • Ability to quickly adapt to new technologies and challenges.
  • Excellent communication and collaboration skills to work effectively across teams.
  • Understanding of AI/ML workloads and overall industry trends.
  • Strong collaborator and consensus builder who can author and review design documentation.
  • Experience troubleshooting, analyzing, and debugging.
  • Experience as a software engineer/developer in a large-scale, distributed environment.
  • Experience writing secure, testable, and robust low-level code.
  • A critical thinker dedicated to solving problems and delivering solutions.
  • Deep understanding of operating systems, network protocols, virtualization technologies, and Linux internals.
  • Experience providing expert SRE support and troubleshooting for complex, global-scale situations.

Why You’ll Like Working for DigitalOcean:

  • We reward our employees. The base salary range for this position is $162,000 to $235,000, based on relevant years of experience and skills. The salary range for this role is specific to candidates located within the U.S. and will vary for candidates outside the U.S. Employees may qualify for a bonus in addition to base salary; bonus amounts are determined based on company and individual performance. We also provide equity compensation to eligible employees, including grants of equity upon hire and the option to participate in our Employee Stock Purchase Program.
  • We value development. You will work with some of the smartest and most interesting people in the industry. We are a high-performance organization that is always challenging our teams and employees to continuously grow. We maintain a growth mindset in everything we do and invest deeply in employee development through formalized mentorship and other internal programs. We provide all employees with reimbursement for relevant conferences, training, and education.
  • We care about your well-being. In addition to cash and equity compensation, we also offer employees a competitive array of benefits. In the United States, these include health insurance, flexible vacation, retirement benefits, a generous parental leave program, and additional resources to support employees' overall well-being. While the philosophy around our benefits is the same worldwide, specific benefits may vary in other countries due to local regulations and preferences.
  • We value diversity and inclusivity. We are an equal opportunity employer and we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

*This is a remote role

