Job Responsibilities:
- Investigate and develop advanced techniques for small LLMs, including transformer architectures and synthetic data generation for robust training.
- Explore methods for LLM fine-tuning and optimization, ensuring models are both high-performing and efficient.
- Collaborate with cross-functional teams to integrate LLM solutions into hardware platforms.
- Implement optimization techniques such as quantization, runtime adjustments, and inference speed improvements.
- Work with runtime deployment tools such as ONNX and TensorFlow Lite to optimize model performance on target hardware.
- Develop and evaluate retrieval-augmented generation (RAG) strategies to enhance model performance in dynamic, unstructured data scenarios.
- Document experimental findings, contribute to internal technical reports, and support potential publication efforts in top-tier conferences.
- Participate in team discussions, code reviews, and agile development cycles to continually refine and improve deployment strategies.
Minimum Qualifications:
- Currently enrolled in an MS/PhD program in CS, EE, Math, or a related field, with a strong focus on machine learning, deep learning, and natural language processing
- Proficiency in Python coding, shell scripting, and working within Linux environments
- Demonstrated experience in developing and training deep learning models, especially with transformer architectures and language models
- Extensive experience with deep learning frameworks such as PyTorch and TensorFlow
- Experience with runtime deployment and optimization tools, e.g. ONNX, TensorFlow Lite
- Basic understanding of hardware deployment challenges and familiarity with containerization tools like Docker
- Experience with cloud-based tools and platforms such as Azure, Databricks, and Apache Spark
- Knowledge of model optimization techniques such as quantization, inference optimization, and runtime performance enhancements
- Basic knowledge of MLOps practices, including experiment tracking and model versioning using tools such as MLflow
- Understanding of the ML workflow: preparing data, implementing and training ML models, evaluating results, and deploying inference on different platforms
- Experience with Git or other version control systems
Preferred Qualifications:
- Experience with synthetic data generation for ML applications
- Prior exposure to LLM fine-tuning and evaluation methodologies
- Hands-on experience with retrieval-augmented generation (RAG) systems
- Familiarity with processing unstructured data in real-world environments
- A record of publications or contributions to reputable AI/ML, CV, or NLP conferences and journals
- Curious, self-motivated, and excited about solving open-ended challenges at Mercedes-Benz