Key Responsibilities
- Design, develop, and maintain scalable data pipelines for large-scale multimodal and text-based foundation model training.
- Curate, clean, and validate diverse real-world datasets, ensuring high data quality and relevance.
- Develop and optimize data preprocessing models for automated cleaning, augmentation, and filtering.
- Implement robust data evaluation and benchmarking strategies to assess dataset quality and model impact.
- Develop scalable tools and frameworks for data ingestion, transformation, and annotation.
- Develop custom synthetic data generation techniques.
- Optimize dataset storage and retrieval strategies for efficient large-scale training.
- Work closely with research teams to integrate data improvements into model training workflows.
- Prototype and iterate on human-in-the-loop solutions to enhance dataset quality.
Required Qualifications
- Experience Level: B.S. + 5 years experience or M.S. + 3 years experience or Ph.D. + 1 year of experience.
- Relevant Experience: Some combination of at least 2 of the following across research and engineering is ideal: Software Engineering, Data Engineering, Machine Learning Engineering, and Research.
- Data Engineering: Expertise in data curation, cleaning, augmentation, and synthetic data generation techniques.
- Machine Learning Expertise: Ability to write and debug models in popular ML frameworks, and experience working with LLMs.
- Software Development: Strong programming skills in Python, with an emphasis on writing clean, maintainable, and scalable code.
- Cloud and Infrastructure: Experience with distributed computing frameworks and cloud storage solutions.
Preferred Qualifications
- M.S. or Ph.D. in Computer Science, Electrical Engineering, Math, or a related field.
- Experience fine-tuning or customizing LLMs or multimodal foundation models.
- First-author publications in top ML conferences (e.g. NeurIPS, ICML, ICLR).
- Contributions to popular open-source projects.

0 applies
21 views
Other Jobs from Liquid AI
Member of Technical Staff - Edge AI Inference Engineer
Member of Technical Staff - Machine Learning Research Engineer, Post-Training
Member of Technical Staff - Applied Machine Learning Lead
Member of Technical Staff - Machine Learning Engineer, Training Infrastructure
Member of Technical Staff - Applied Machine Learning Engineer
Similar Jobs
Data Engineer (R-17830)
Senior Software Engineer (L5) - Partner Ecosystem
Machine Learning Engineer
Machine Learning Scientist Intern
Software Engineer I (San Francisco)
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say