CrowdStrike

Senior Software Engineer, Cloud - Core Reliability Services (Remote)

Remote Sunnyvale, CA
Go Java Scala Kotlin Python Node.js Kafka Elasticsearch Cassandra Kubernetes AWS Microservices Distributed Systems Machine Learning
Description

Sr. Software Engineer, Product SRE

Location: USA - Sunnyvale, CA, USA - Austin, TX, USA - Remote, NY, USA - Redmond, WA

Time Type: Full time

Job Description

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. We work on large scale distributed systems, processing almost 3 trillion events per day and this traffic is growing daily. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you.

About the Role:

As a Senior Engineer (Typically equivalent to Staff or Sr Staff titles in other companies) in our Embedded Reliability team, you'll work directly within CrowdStrike product groups alongside product engineers and their leadership. You'll partner with engineering leaders to shape reliability roadmaps while doing hands-on work solving complex distributed systems problems at scale. This is hands-on systems engineering work focused on writing code, building foundational infrastructure, and solving complex problems rather than day-to-day operations or ticket management. While we embrace the SRE moniker, you'll find that it means something much more service-oriented at Crowdstrike, and affords you no shortage of Golang development initiatives, as well as the freedom to move up/down and laterally across the stack as & when needed. It is far and away our most self-driven & autonomous backend development role.

CrowdStrike Falcon processes trillions of events per day. You'll work on the critical production systems that power this platform by improving, rearchitecting, and scaling them to meet growing demands. You'll write production code, debug complex distributed systems issues, and tackle problems spanning scale and resiliency, performance engineering, foundational observability and instrumentation, cost optimization, and failure modeling.

Product engineers and engineering leaders will come to you for guidance on architectural decisions because you've earned credibility through hands-on work and delivering results. You'll ensure follow through on incident retrospectives with concrete improvements that eliminate entire classes of failures. You'll identify opportunities to extract common patterns into shared libraries or tools, or partner with platform teams on improvements that benefit multiple product groups. Recent examples from the team include resolving critical issues in leader election libraries and building infrastructure-as-code tools that eliminate manual deployment processes.

Why This Role Matters: CrowdStrike Falcon is the industry standard in cloud-native cybersecurity and threat hunting. Our customers depend on us to protect their businesses from sophisticated threats, and reliability isn't optional - it's fundamental to our mission. As an Embedded SRE, your work directly impacts whether organizations around the world can defend themselves against cyberattacks. You'll be working on problems that matter, at a scale that few companies can match, with the autonomy to make real architectural decisions.

What You'll Do:

  • Partner with engineering leadership to define and drive reliability roadmaps

  • Design and implement architectural improvements to services, libraries, and platforms that impact teams across CrowdStrike

  • Establish foundational observability practices: ensure teams instrument services properly, react to signals effectively, and leverage observability to drive automation like continuous delivery

  • Lead performance and cost optimization: profiling, bottleneck analysis, capacity planning, and efficiency improvements across cloud infrastructure

  • Define and implement service-level objectives that drive decision-making and prioritization

  • Conduct resilience engineering: chaos experiments, failure injection, and designing for graceful degradation

  • Provide technical leadership during complex incidents and drive systemic improvements

  • Mentor and coach engineers, building a culture of excellence and driving architectural standards across the organization

What You'll Need:

  • 7-10+ years building and operating distributed systems at scale

  • Expert-level proficiency in at least one programming language; willingness to become proficient in Go

  • Deep understanding of distributed systems: e.g. consensus algorithms, replication, consistency, failure modes, scalability patterns

  • Proven experience scaling backend systems: e.g sharding, partitioning, horizontal scaling, capacity planning, performance optimization

  • Track record of making impactful architectural decisions and seeing them through to production

  • Strong systems thinking and ability to influence without direct authority across organizational boundaries

  • Degree in Computer Science or equivalent experience in data structures/algorithms/distributed systems

Bonus Points: 

  • Experience driving reliability improvements in organizations with hundreds or thousands of microservices

  • Deep knowledge of Kubernetes, cloud platforms, or other large-scale orchestration systems

  • Experience with AWS, Cassandra, Kafka, OpenSearch, or similar large-scale distributed systems

  • Track record of building internal platforms or tools that other engineers use

  • Experience in infrastructure cost optimization at scale

  • Background in performance engineering: profiling, optimization, understanding system bottlenecks

  • Experience with chaos engineering or resilience testing practices

  • History of establishing SLO/SLI frameworks and error budgets in production environments

  • Background in cybersecurity or intelligence fields

  • Experience building developer platforms or improving developer experience

#LI-MP2

#LI-DG1

#LI-HTF

#HTF

This role will require the candidate to periodically undergo and pass additional background and fingerprint check(s) consistent with government customer requirements.

Benefits of Working at CrowdStrike:

  • Market leader in compensation and equity awards

  • Comprehensive physical and mental wellness programs

  • Competitive vacation and holidays for recharge

  • Paid parental and adoption leaves

  • Professional development opportunities for all employees regardless of level or role

  • Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections

  • Vibrant office culture with world class amenities

  • Great Place to Work Certified™ across the globe


CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program.

CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements.

If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at [email protected] for further assistance.

Find out more about your rights as an applicant.

CrowdStrike participates in the E-Verify program.

Notice of E-Verify Participation

Right to Work

CrowdStrike, Inc. is committed to fair and equitable compensation practices. Placement within the pay range is dependent on a variety of factors including, but not limited to, relevant work experience, skills, certifications, job level, supervisory status, and location. The base salary range for this position for all U.S. candidates is $140,000 - $215,000 per year, with eligibility for bonuses, equity grants and a comprehensive benefits package that includes health insurance, 401k and paid time off.

For detailed information about the U.S. benefits package, please click here.

 

CrowdStrike
CrowdStrike

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say