Site Reliability Engineer (AHT)
Location: United States-Ohio-Beavercreek, United States-Ohio-Wright-Patterson AFB
Time Type: Full time
Job Description
RELOCATION ASSISTANCE: Relocation assistance may be availableCLEARANCE REQUIRED FOR START: NoCLEARANCE TYPE: Top SecretTRAVEL: Yes, 10% of the TimeDescription
At Northrop Grumman, our employees have incredible opportunities to work on revolutionary systems that impact people's lives around the world today, and for generations to come. Our pioneering and inventive spirit has enabled us to be at the forefront of many technological advancements in our nation's history - from the first flight across the Atlantic Ocean, to stealth bombers, to landing on the moon. We look for people who have bold new ideas, courage and a pioneering spirit to join forces to invent the future, and have fun along the way. Our culture thrives on intellectual curiosity, cognitive diversity and bringing your whole self to work — and we have an insatiable drive to do what others think is impossible. Our employees are not only part of history, they're making history.Northrop Grumman’s Site Reliability Engineers are tasked with guaranteeing the dependability of applications they don’t build, setting user focused reliability targets, instrumenting those metrics, and developing the automation and runbooks needed to recover from failures. They work closely with development teams to embed operational quality early in the software lifecycle and assume primary responsibility for resolving production issues when they arise. Their skill set includes troubleshooting distributed systems, handling incidents, and converting those experiences into lasting reliability enhancements. On a daily basis, their work is organized around four key areas: incident management, reducing manual toil, evaluating reliability, and enabling platform capabilities.
This position may be filled at as a level 2 or 3 based on the qualifications below and is contingent on funding.
Basic Qualifications:
- Site Reliability Engineer (Level 2): Bachelor's degree in a STEM discipline with 2 years of related experience OR a master’s degree with 0 years of experience
- Principal Site Reliability Engineer (Level 3): Bachelor's degree in a STEM discipline with 5 years of related experience OR a master’s degree with 3 years of experience
- Must have the ability to obtain a Department of War Top Secret/Sensitive Compartmented Information security clearance (TS/SCI)
- Have or obtain IAT Level II certification such as SecurityX (formerly CASP+) or Security+ within 90 days.
- Minimum of three years of experience In Systems Administration.
- Linux/networking fundamentals
- Demonstrated understanding of systems thinking, including the ability to identify inter dependencies, assess potential blast radius, and analyze how multiple components can fail together to ensure resilient infrastructure.
- Basic knowledge of observability fundamentals, encompassing more than the three core signals (logs, metrics, traces) and demonstrating the ability to select, implement, and leverage telemetry data to optimize services and enhance engineering productivity.
- Fundamental software engineering skills, including the development of automation scripts and APIs, proficiency with Git branching and workflow conventions, and active participation in peer code reviews.
- Strong Communication, Collaboration, and Organizational Skills
Preferred Qualifications:
- Kubernetes, ArgoCD/GitOps, disaster recovery, capacity planning.
- OTel standards, Grafana/Perses, Tempo, Clickhouse, VictoriaMetrics.
- Scripting experience using tools such as Python, Bash, PowerShell.
- Experience with DevOps architecture leveraging automation, containerization, and GitOps, using GitLab/Jenkins/ArgoCD for CI/CD, Kubernetes (K8s) for orchestration, Istio for service mesh, and Nexus/Harbor for artifact repositories.
- SRE Certifications from The DevOps Institute, AWS Solution Architect, or similar.
- Dashboard quality, alert design, anomaly detection.
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
