Staff Software Engineer - Computer Vision Deployment
Department: Engineering
Location: San Francisco
Employment Type: FullTime
We're looking for a Staff Software Engineer – Computer Vision Deployment to build and scale the infrastructure that powers our AI-driven warehouse intelligence platform. You'll own the end-to-end lifecycle of computer vision models — from training pipelines through optimized cloud deployment — ensuring our cutting-edge computer vision and multi-modal AI systems run reliably and efficiently in production. Your work will directly enable the real-time perception and autonomous decision-making capabilities at the core of our platform.
This is a deeply technical role at the intersection of machine learning, distributed systems, and cloud infrastructure. You'll design scalable GPU compute clusters, build robust orchestration pipelines, and optimize model serving for low-latency inference at scale. You'll work closely with our research scientists, computer vision engineers, and product teams to bridge the gap between experimental models and production-ready systems that operate across diverse warehouse environments. We've found tremendous value in collaborative problem-solving, thus our team works from our SF office three days a week.
Responsibilities
Develop and maintain distributed cloud GPU infrastructure for large-scale world model training and low-latency inference.
Build end-to-end computer vision pipelines — from data ingestion and preprocessing through model training, evaluation, and deployment — and integrate them into core product workflows.
Deploy and optimize state-of-the-art machine learning models in the cloud using model serving platforms and inference optimization techniques, including VLMs and VLAs.
Design and operate orchestration systems that enable both engineers and non-engineers to build and manage data and ML pipelines.
Establish monitoring, benchmarking, and evaluation frameworks to ensure model performance and reliability in production environments.
Required Experience
B.S. / M.S. in Computer Science, Robotics, or similar technical field, or equivalent practical experience.
7+ years of professional software engineering experience, with at least 3 years in machine learning infrastructure — developing, scaling, training, deploying, and optimizing large-scale ML systems from data to model.
Track record of deploying computer vision models in production environments with real-world constraints.
Experience with distributed messaging and compute systems (Kafka, gRPC, ROS2, or similar).
Strong programming skills in Python with solid software engineering practices.
Preferred Experience
Experience developing, running, and managing orchestration systems (Flyte, Temporal, Airflow, or similar) for ML and data pipelines.
Proficiency with ML frameworks (PyTorch, TensorFlow, DeepSpeed) and model serving platforms (TorchServe, TensorFlow Serving, NVIDIA Triton Inference Server, or similar).
Deep understanding of state-of-the-art machine learning models such as auto-regressive transformers and familiarity with inference optimization techniques (TensorRT, quantization, custom kernels).
Experience with C++ or CUDA programming for GPU acceleration.
Prior experience working at autonomous vehicles or robotics companies.
Equal Opportunity Statement
We’re an equal opportunity employer that values diversity and inclusion. We welcome teammates of all backgrounds and don’t discriminate based on race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status.
Benefits
At Claryo, we offer a competitive benefits package that supports your health and well-being, including — top-tier medical, dental, and vision coverage, 401k with employer matching, parental leave, and unlimited vacation.
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
