Compute Infrastructure Engineer, AI and Advanced Computing Institute
Team: AI
Location: New York, NY; Washington, D.C.; Bay Area, CA
Commitment: Full-Time
Workplace Type: onsite
Primary Technical Development
- Continually identify, evaluate, and deploy open source and proprietary technologies that meet the combined infrastructure requirements from an evolving list of projects.
- Implement performant solutions that meet industry compliance and security standards while enabling rapid development workflows.
- Collaborate with hardware and software vendors on methods and configurations that maximize system resource utilization.
Additional Schmidt Sciences Support
- Assist existing programs by providing infrastructure management advice, while working closely and collaboratively with multiple subject matter expert teams.
- Work with other members of the Schmidt Sciences technical team to implement new deployment strategies and application hosting capabilities in support of diverse applications and user audiences.
- Track industry trends in hardware and software tooling that simplifies infrastructure management while lowering the cost of deploying and supporting multi-tenant research applications.
- Participate in relevant industry events and forums, representing Schmidt Sciences’ presence on AI and advanced computing issues.
Required Knowledge, Skills, and Abilities
- A Bachelor’s degree from an accredited institution, with a focus on Computer Science, Information Technology, or a related field.
- 5+ years of professional experience managing production-grade compute clusters.
- Proficiency with code-management and infrastructure-provisioning tools and best practices.
- Hands-on experience with workload management using Slurm and Kubernetes.
- Proficiency with modern machine-learning hosting software frameworks, such as NVIDIA Dynamo, TensorFlow Serving, and Ray.
- Proficiency in building, deploying, and troubleshooting containerized Linux workloads, including GPU-accelerated configurations.
- In-depth knowledge of data center networking technologies and solutions.
- Understanding of the tech stack needed to design, train, deploy, and maintain state-of-the-art AI models at a production scale.
- Experience producing technical writing for expert and general audiences.
- Good track record of collaborative impact in high-intensity, team-based environments.
- Sense of controlled urgency in driving work to completion.
- The highest integrity and ability to maintain confidentiality.
- Ability to travel within the U.S. and internationally on a regular basis, as needed.
Preferred Knowledge, Skills, and Abilities
- Expert-level experience and industry credentials in the software and hardware frameworks that drive modern AI, and competence in at least one, and preferably multiple, fields of science impacted by modern AI.
- Prior leadership of data center infrastructure initiatives and projects, such as evaluating hardware scalability, securing data, or executing large-scale upgrades.
- Expertise in relevant technical focus areas, e.g., AI model performance monitoring or network and storage optimization.
- Ability to work with and effectively translate technical concepts across multiple scientific disciplines.
- Ability to critically evaluate scientific and technical publications and emerging methods in related disciplines.
- Experience working with science-focused institutions such as philanthropic organizations or academic/government research institutions.
