Responsibilities:
- Utilize troubleshooting and scripting skills to improve the availability, performance, and security of Percipient.ai services
- Implement automated deployments, and operational tools
- Collaborate with product and engineering teams to plan and deploy product releases
- Ensure services are designed with 24/7 availability and operational readiness and rigor
- Implement proactive monitoring, alerting, and self-healing systems
- Participate in on-call rotations, driving restoration and repair of service-impacting issues
- Define non-functional requirements as part of the product lifecycle to influence the new designs, standards, and methods for scalable, highly available distributed systems
- Coding and automation of applications in the cloud
Requirements:
- BS in Computer Science or related field
- Ability to be onsite at the customer’s facility several days a week
- 8+ years of Systems/Applications automation in 24/7 production services environments
- Expert understanding of running a large-scale virtualized infrastructure in the cloud and on-premise
- Expertise with containerizing concepts like Docker, PaaS services on AWS, and Kubernetes or equivalent technologies
- Fluency with at least one current generation scripting language used by DevOps professionals such as Python, Bash, or Perl
- Deep experience operating on AWS (C2S) and infrastructure automation using Ansible and Terraform
- Excellent troubleshooting and problem-solving skills
- Demonstrated experience in analyzing and diagnosing large-scale distributed systems and Linux systems internals (system libraries, file systems, etc.)
- Experience with elastically scalable, fault tolerance and other cloud architecture patterns
- Experience with Continuous Integration and Continuous Delivery, including tools such as Cloudformation
- Experience in Linux and security triage and forensic analysis
- Excellent interpersonal and communication skills
- US citizenship and a national security background required
- Experience working with the DoD / IC community a plus
0 applies
235 views
Other Jobs from Percipient.ai
Sr. Back-End Software Engineer - Machine Learning
Senior Staff Front-End Software Engineer
Similar Jobs
Senior Solutions Architect, Platform Infrastructure - PST (Remote)
Senior DevOps Engineer - AI Infrastructure
Senior Solution Architect, HPC and AI - NVIS
Machine Learning Infrastructure Engineer
Senior Machine Learning Engineer, Decisions Alliance (m/f/x)
DevOps Engineer
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 401 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say