Want to help us, help others? We’re hiring!
GoFundMe is a global community of over 150 million people who come together every day with the common purpose of helping one another. Our mission is to help people help each other through our best in class technology. In 2022, GoFundMe joined together with Classy, a leading nonprofit fundraising software company that enables nonprofits to connect supporters with the causes they care about. Together, we have empowered people and organizations to raise more than $30 billion since 2010. Our vision is to become the most helpful place in the world.
Join GoFundMe's Platform Infrastructure and Operations team as a Senior Cloud Ops Engineer. This crucial role focuses on building and maintaining an advanced cloud infrastructure vital for our online fundraising platform, which supports nonprofits worldwide. You will be instrumental in ensuring our infrastructure achieves 99.999% availability, meeting the high demands of our global payments platform.
Key Responsibilities:
- Design and implement robust, fault-tolerant cloud solutions to process billions of dollars annually, ensuring scalability, resilience, and compliance.
- Share expertise and foster a culture of continuous improvement, innovation, and learning within the team, contributing to technical mentorship and knowledge sharing.
- Participate in strategic decisions regarding cloud architecture, influencing the adoption of best practices and cutting-edge technologies.
- Work collaboratively to enhance system performance, observability, and reliability across the infrastructure, focusing on improving real-time monitoring and logging for operational excellence.
- Lead initiatives to improve infrastructure resiliency, leveraging tools like AWS Resilience Hub and Fault Injection Simulator to test and enhance system robustness.
- Drive application resilience by designing and executing load tests, simulating infrastructure faults, and analyzing results to improve fault tolerance.
- Incorporate scalability and performance testing as integral parts of service design, ensuring services meet reliability and performance goals under high transaction volumes.
- Embed testing phases within CI/CD pipelines to promote shift-left performance testing practices, improve efficiency, and reduce development cycle times.
- Contribute to implementing and analyzing DORA (DevOps Research and Assessment) metrics to enhance the efficiency and effectiveness of the development lifecycle.
- Participate in an on-call rotation to promptly address and resolve critical incidents, ensuring continuous operational excellence and rapid recovery during outages.
Job Requirements:
- Bachelor’s Degree in Computer Science, a related field, or 8+ years of equivalent practical experience.
- Minimum of 6 years of experience designing and managing scalable, cloud-based infrastructure, preferably in SaaS environments.
- Deep technical expertise with a strong foundation in computer science, sharp engineering skills, and a commitment to delivering high-quality solutions.
- Expert-level knowledge of AWS cloud services, container technologies like Docker and Kubernetes, and Infrastructure as Code (IaC) tools like Terraform and CloudFormation.
- Proficiency in software architecture, including asynchronous event-driven architecture and microservices.
- Experienced in performance and reliability testing using tools like Artillery, K6, or similar frameworks.
- Experience in defining, monitoring, and managing Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to ensure the cloud infrastructure consistently meets performance and availability targets.
- Proven expertise in disaster recovery planning and execution, including developing and implementing robust strategies to maintain business continuity and achieve rapid recovery in the event of an outage.
- Hands-on experience with application performance management (APM) tools like New Relic, DataDog, and Splunk.
- Advanced scripting and development skills in Bash, PHP, and NodeJS languages.
- Skilled in managing distributed data systems, troubleshooting complex issues under high load, and designing for high transaction volumes.
- Knowledgeable in compliance regulations, including PCI, SOC2, and GDPR.
Preferred Experience:
- AWS cloud certifications.
- Experience with fault-tolerant system design, large-scale distributed systems, and high-transaction environments.
- Familiarity with tools and processes for infrastructure resiliency and fault injection testing.
Traits:
- Strong collaborative skills with a track record of leading initiatives and working with cross-functional teams.
- Adaptable and thrives in a fast-paced and agile environment.
- Excellent communication skills, capable of effectively collaborating across diverse teams and cultural backgrounds.
Why you’ll love it here...
- Be a part of a mission-driven organization that positively impacts tens of millions of lives every year.
- Be a Leader in a high-impact product organization and drive business transformation through product.
- Collaborate with a diverse, passionate, and talented team in a fast-paced and innovative environment.
- You’ll be a part of a fun, supportive team that works hard and celebrates accomplishments together.
- We live by our core values: impatient to be great, find a way, earn trust every day, fueled by purpose.
GoFundMe is proud to be an equal opportunity employer that actively pursues candidates of diverse backgrounds and experiences. We are committed to providing diversity, equity, and inclusion training to all employees, and we do not discriminate on the basis of race, color, religion, ethnicity, nationality or national origin, sex, sexual orientation, gender, gender identity or expression, pregnancy status, marital status, age, medical condition, mental or physical disability, or military or veteran status.
If you require a reasonable accommodation to complete a job application or a job interview or to otherwise participate in the hiring process, please contact us at accommodationrequests@gofundme.com.
Dedication to Diversity
GoFundMe and Classy are committed to leveraging Diversity, Equity, Inclusion, and Belonging to cultivate a culture that embraces and supports the unique identities, experiences, and perspectives of our people and customers.
Our diversity recruiting priority is recognized under our first DEIB Driver: Opportunity Foster Diversity - we identify, recruit, and invest in top talent- ensure our people reflect the unique identities, experiences, and perspectives of the communities we serve and are all given the chance to grow.
Global Data Privacy Notice for Job Candidates and Applicants:
Depending on your location, the General Data Protection Regulation (GDPR) or certain US privacy laws may regulate the way we manage the data of job applicants. Our full notice outlining how data will be processed as part of the application procedure for applicable locations is available here. By submitting your application, you are agreeing to our use and processing of your data as required.
Learn more about GoFundMe:
We’re proud to partner with GoFundMe.org, an independent public charity, to extend the reach and impact of our generous community, while helping drive critical social change. You can learn more about GoFundMe.org’s activities and impact in their FY ‘24 annual report.
Our annual “Year in Help” report reflects our community’s impact in advancing our mission of helping people help each other.
For recent company news and announcements, visit our Newsroom.
Other Jobs from GoFundMe
SIte Reliability Engineer II
Senior Security Engineer
Senior DevEx Engineer
Staff Software Engineer
Senior Software Engineer
Similar Jobs
Data Engineer/Architect
Senior Software Engineer (Golang/PHP + Kubernetes+AWS)
Fullstack Engineer - Product
Staff Software Engineer
Test Automation Engineer (experienced/senior)
Sr. Software Engineer
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say