ABOUT BASETEN
We’re a growing team of builders backed by top-tier investors, including IVP, Spark Capital, Greylock, and Sarah Guo at Conviction. ML teams at enterprises and category-defining AI-native companies like Descript, Bland.ai, Patreon, Writer, and Robust Intelligence use Baseten to power their core production workloads with best-in-class performance, security, and reliability. We’ve reached product-market fit and secured Series B funding, but the ML infrastructure market is massive, and we’re just getting started. If you’re excited to work on engaging and relevant problems while building something new from the ground up, come join us!
THE ROLE
Are you passionate about building robust, scalable infrastructure that powers cutting-edge machine learning applications? We are looking for a Tech Lead Manager - Infrastructure to lead our infrastructure team in designing, developing, and optimizing the core systems that support our ML platform. This is an ideal role for someone with a deep technical background in infrastructure engineering who enjoys mentoring and leading a team. If you’re excited about the challenges of scaling infrastructure for ML workloads in a fast-paced startup environment, we’d love to meet you.
RESPONSIBILITIES:
Lead, manage, and mentor the infrastructure engineering team responsible for building the backbone of Baseten’s ML platform.
Define and drive the technical strategy for infrastructure, ensuring performance, security, and scalability of core systems.
Collaborate closely with ML teams and cross-functional stakeholders to ensure smooth integration of models into production environments.
Design and implement scalable infrastructure solutions, including CI/CD pipelines, container orchestration, and cloud infrastructure (AWS, GCP, etc.).
Dive deep into performance optimization of our systems, identifying and addressing bottlenecks to improve overall infrastructure efficiency.
Own end-to-end project management for infrastructure initiatives, from planning and execution to monitoring and maintenance.
Promote engineering best practices and a culture of continuous improvement within the team.
REQUIREMENTS:
Bachelor’s, Master’s, or Ph.D. in Computer Science, Engineering, or a related field.
5+ years of professional experience in infrastructure or software engineering, with at least 2 years in a technical leadership role.
Expertise in infrastructure design, including containerization (Docker), orchestration (Kubernetes), and cloud platforms (AWS, GCP).
Strong experience with CI/CD pipelines, infrastructure as code (Terraform, Ansible), and monitoring systems.
Solid understanding of networking, security, and high-availability infrastructure design.
Experience managing and scaling infrastructure for machine learning or similar high-performance workloads.
Proven track record of leading teams and delivering large-scale, production-level infrastructure solutions.
Excellent problem-solving skills and the ability to drive technical projects from idea to completion.
BONUS POINTS:
Experience optimizing infrastructure for machine learning workloads, including GPU utilization and distributed computing.
Familiarity with multi-cloud strategies and hybrid cloud deployments.
Deep understanding of security best practices in cloud-native environments.
Previous experience in a fast-paced startup environment, particularly in the ML or AI space.
BENEFITS:
Competitive compensation package (unlimited PTO, 401(k), covered healthcare premiums).
Opportunity to lead a talented infrastructure team in one of the most exciting engineering fields.
An inclusive and supportive work culture that fosters growth and continuous learning.
Exposure to cutting-edge ML infrastructure technologies and collaboration with top-tier ML teams and organizations.