Intern, Research Foundational Models
Location: Toronto, ON, CAN
Time Type: Full time
Job Description
Job Requisition ID #
Position Overview
We are seeking a research intern to explore fundamental challenges in geometry, design understanding, and relative spatial reasoning for vision-language models (VLMs). While modern VLMs have shown strong performance on captioning, semantic understanding, and segmentation, they continue to struggle with geometric reasoning, layout understanding, and precise relative positioning—capabilities that are critical for design, engineering, and creation workflows.
During this internship, you will work closely with research mentors to investigate new modeling and training paradigms that move beyond one-shot visual reasoning. The project will focus on approaches such as reinforcement learning, test-time computation, and “thinking with images,” where models iteratively attend to visual evidence, reason over intermediate representations, and verify hypotheses through visual feedback. The goal is to advance state-of-the-art methods for spatially grounded reasoning and contribute insights relevant to both the research community and Autodesk’s long-term vision for intelligent design tools.
Over the course of the internship, you will define and drive a focused research project, including model development, experimental validation, and analysis, with the opportunity to publish results and present findings internally and externally.
Responsibilities
Define and execute a research project focused on geometric reasoning, spatial understanding, and layout awareness in vision-language models
Conduct literature reviews to identify limitations of existing VLMs and relevant prior work in multimodal reasoning and reinforcement learning
Design and implement novel training or inference strategies using reinforcement learning, test-time computation, or iterative visual reasoning
Develop model architectures, training pipelines, and evaluation benchmarks for spatial and geometric tasks
Run large-scale experiments, analyze results, and iterate on model designs based on empirical findings
Compare proposed approaches against strong baselines and state-of-the-art methods
Collaborate closely with research mentors and other researchers, sharing progress and incorporating feedback
Author a research paper suitable for submission to a top-tier machine learning or computer vision conference
Present research results internally at Autodesk and externally at academic venues
Minimum Qualifications
Currently enrolled in a PhD program in Computer Science, Machine Learning, Computer Vision, or a closely related field
Must have at least one academic remaining semester post internship
Strong publication record in top-tier ML or vision conferences (e.g., ICML, NeurIPS, ICLR, CVPR, ICCV, ECCV)
Hands-on experience training vision-language models and reinforcement learning algorithms
Strong implementation skills using modern deep learning frameworks (e.g., PyTorch, TRL, Ray)
Solid background in machine learning fundamentals and experimental research methodology
Ability to work independently on open-ended research problems and communicate results clearly
Preferred Qualifications
Experience with multimodal or embodied reasoning, test-time optimization, or iterative inference methods
Familiarity with geometric vision, spatial reasoning benchmarks, or synthetic visual datasets
Experience scaling experiments on distributed systems or large compute clusters
Strong written and verbal communication skills
Learn More
About Autodesk
Welcome to Autodesk! Amazing things are created every day with our software – from the greenest buildings and cleanest cars to the smartest factories and biggest hit movies. We help innovators turn their ideas into reality, transforming not only how things are made, but what can be made.
We take great pride in our culture here at Autodesk – it’s at the core of everything we do. Our culture guides the way we work and treat each other, informs how we connect with customers and partners, and defines how we show up in the world.
When you’re an Autodesker, you can do meaningful work that helps build a better world designed and made for all. Ready to shape the world and your future? Join us!
Salary transparency
Salary is one part of Autodesk’s competitive compensation package. Offers are based on the candidate’s experience, educational level, and geographic location.
Diversity & Belonging
We take pride in cultivating a culture of belonging where everyone can thrive. Learn more here: https://www.autodesk.com/company/diversity-and-belonging
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
