About the Company
At Torc, we have always believed that autonomous vehicle technology will transform how we travel, move freight, and do business.
A leader in autonomous driving since 2007, Torc has spent over a decade commercializing our solutions with experienced partners. Now a part of the Daimler family, we are focused solely on developing software for automated trucks to transform how the world moves freight.
Join us and catapult your career with the company that helped pioneer autonomous technology, and the first AV software company with the vision to partner directly with a truck manufacturer.
Job Description Summary
The model development department is seeking an ML deployment engineer who will deploy our next-generation machine learning models for our autonomous driving stack.
As a senior engineer on the team, you apply machine learning science in a production-focused environment. You use machine learning models in both unimodal and multimodal contexts to solve all tasks across the functional autonomous driving stack. Training, validation, data science, and architectural design are your daily work. You are interested in understanding how your model performs in deployment, which is why you collaborate closely with deployment-focused teams. You mentor and guide more junior members of the team and are always interested in the newest trends in research, eager to translate scientific improvements into our production-grade machine learning pipelines.
Meet the team
Torc's Autonomy Applications software utilizes cutting-edge deep learning techniques to perceive the vehicle's environment, predict the movements of other vehicles, and execute accurate driving decisions. We are actively seeking an experienced ML deployment engineer to join our model development department. This is an exceptional opportunity for you to have a significant impact on the future of the autonomous vehicle industry by leveraging AI.
What You’ll Do:
Model Deployment & Optimization
Deploy and optimize machine learning models for production environments, ensuring real-time performance and resource efficiency on edge devices and automotive-grade hardware.
Implement model quantization, pruning, and compression techniques to enhance inference speed while maintaining accuracy.
Collaborate with ML engineers to transition research-grade code (e.g., PyTorch) into production-ready, scalable systems.
Inference Pipeline Development
Design and optimize end-to-end inference pipelines for embedded systems, leveraging frameworks like ONNX, TensorFlow Serving, or TorchServe.
Integrate model outputs with upstream and downstream systems (e.g., perception, control modules) via APIs or middleware.
Cross-Functional Collaboration
Partner with DevOps teams to build CI/CD pipelines for automated model deployment, testing, and rollback.
Work with hardware engineers to profile and optimize model performance on target devices (e.g., NVIDIA Jetson).
Monitoring & Maintenance
Develop tools and dashboards to monitor model performance, data drift, and system health in production.
Implement A/B testing and canary deployment strategies to validate model updates.
Infrastructure & Tools
Optimize data pipelines for low-latency inference, including preprocessing and postprocessing workflows.
Advocate for MLOps best practices (versioning, reproducibility, logging) across the ML lifecycle.
What You’ll Need to Succeed:
Education & Experience
Bachelor’s degree in computer science, engineering, or a related field with 2+ years of experience in deploying ML models (or a master’s degree with 1+ year).
Proven expertise in deploying models to edge devices or cloud platforms (AWS, Azure, GCP).
Technical Skills
Mastery of Python and C++; familiarity with CUDA, TensorRT, or OpenVINO for acceleration.
Experience with deployment frameworks (e.g., ONNX, TensorFlow Lite, PyTorch Mobile) and containerization (Docker, Kubernetes).
Knowledge of performance profiling tools (e.g., NVIDIA Nsight, VTune) and optimization techniques (e.g., layer fusion, memory management).
Domain Knowledge
Understanding of ML model lifecycle challenges (e.g., drift, scalability) and MLOps principles.
Familiarity with computer vision, LiDAR/radar data, or sensor fusion workflows is a plus.
Bonus Points!
Experience with NVIDIA libraries (CUDA, cuDNN, TensorRT) or embedded SDKs (JetPack, DeepStream).
Proficiency in distributed inference using Ray or Horovod.
Cloud certifications (AWS ML Specialty, Azure AI Engineer) or MLOps tools (MLflow, Kubeflow).
Knowledge of security practices for ML systems (e.g., adversarial defense, encrypted inference).
At Torc, we’re committed to building a diverse and inclusive workplace. We celebrate the uniqueness of our Torc’rs and do not discriminate based on race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, veteran status, or disabilities.
Even if you don’t meet 100% of the qualifications listed for this opportunity, we encourage you to apply.