AI Researcher (Computer Vision/Multimodal/Generative AI)
Location: Hybrid (San Francisco, California, US)
Department: Engineering
About the Role
We are hiring ML researchers to develop novel approaches that advance the frontier of multimodal vision AI and create product-defining capabilities for SpreeAI. This role exists because current generative and vision models are not designed for photorealistic human representation, controllable try-on, or real-world deployment constraints. You will explore new architectures, algorithms, and training strategies that improve realism, controllability, efficiency, and multimodal understanding, with a direct path from research to production.
You will work on research problems across:
- photorealistic virtual try-on
- human-centric visual representation learning
- video-based modeling and temporal consistency
- multimodal reasoning and generative pipelines
- compute-efficient diffusion and generative architectures
This is a research role with product impact: successful work leads to platform capabilities, white papers, patents, and, most importantly, industry differentiation.
Why This Role Exists
Modern multimodal AI systems struggle with identity preservation, pose consistency, physical realism, and controllability under production constraints. We are building new approaches where:
- diffusion models must produce consistent outputs across poses, viewpoints, and garments,
- generative models must learn human and garment interactions realistically,
- research innovations must scale to real-world deployment environments.
This role is for researchers who want to see novel ideas become shipped systems used by real customers.
What You'll Do
- Develop novel architectures and training approaches for vision and multimodal AI.
- Advance generative modeling techniques including controllable diffusion and video generation.
- Design experiments improving realism, temporal consistency, and human representation.
- Collaborate with applied engineering teams to translate research into production systems.
- Publish white papers or research outputs aligned with product differentiation.
- Evaluate new model paradigms for scalability and efficiency.
Core Research Areas & Model Architectures
Candidates should have familiarity with or interest in advancing:
- Diffusion models and latent diffusion architectures.
- Transformer-based vision models (ViT, multimodal transformers).
- Image-to-image and video generation pipelines.
- Control mechanisms for generative models (conditioning, adapters, LoRA).
- Representation learning for human pose, geometry, or identity consistency.
- Multimodal architectures combining vision, text, and structured inputs.
Qualifications
- PhD in Computer Science, Artificial Intelligence, Robotics, Computer Vision, or a related field.
- Strong research background in computer vision, generative modeling, or multimodal AI.
- Strong programming skills in Python and familiarity with object-oriented languages.
- Experience with deep learning frameworks (PyTorch preferred).
- Strong foundations in machine learning theory and experimental design.
Preferred Qualifications
- Publications at top conferences (CVPR, ICCV, NeurIPS, ICLR, SIGGRAPH, etc.).
- Experience with diffusion-based generative models.
- Video modeling or temporal learning experience.
- Experience bridging research into production systems.
- Interest in compute efficiency, distillation, or scalable generative pipelines.
About the Company
SPREEAI is a fast-growing, innovative AI company at the forefront of fashion and e-commerce, revolutionizing how consumers engage with fashion through lifelike photorealistic try-on technology and hyper-personalized shopping experiences. Our mission is to redefine the retail landscape with cutting-edge AI solutions that blend high fashion and technology. We thrive in a dynamic, fast-paced environment where creativity meets technology to drive real impact. If you are passionate about innovation and shaping the future of fashion, SPREEAI offers a platform to make your mark.
