Tencent

Research Intern, Multimodal Large Language Models

Singapore
Python AI Machine Learning Deep Learning
Description

Tencent Youtu Lab Research Intern - Multimodal Large Language Models (2026 Project UP)

Location: Singapore-CapitaSky

Remote Type: Onsite

Time Type: Full time

Job Description

Business Unit

Cloud & Smart Industries Group (CSIG) is responsible for promoting the company's cloud and industry Internet strategy. CSIG explores the interactions between users and industries to create innovative solutions for smart industries via technological advancements such as cloud, AI, and network security. While driving the digitalization of retail, medical, education, transportation and other industries, CSIG helps companies serve users in smarter ways, building a new ecosystem of intelligent industries that connect users and businesses.

What the Role Entails

Tencent Youtu Lab is seeking a highly motivated Research Intern to join our 2026 Project UP initiative, focusing on Multimodal Large Language Models (MLLMs). This role is ideal for postgraduate students passionate about AI research and looking to contribute to the next generation of intelligent systems. The intern will engage in cutting-edge R&D involving the integration and understanding of diverse data types such as image, text, audio, and video. You will work alongside top researchers to explore advanced techniques like multimodal pre-training, long-video interaction, and visual reasoning, bridging theoretical innovation with real-world applications. This is a unique opportunity to develop impactful solutions, contribute to open-source or academic outputs, and push the frontier of AI at Tencent Cloud.

Responsibilities

  • Core Research & Development: Conduct advanced research on technical solutions for Multimodal Large Language Models (MLLMs), focusing on the perception, understanding, and interaction of mixed modalities including image, text, audio, and video. Explore cutting-edge topics such as native multimodal pre-training schemes, long-video interactive understanding, visual reasoning, and multimodal agents.
  • Innovation & Breakthroughs: Keep pace with state-of-the-art (SOTA) trends in the MLLM field. Combine theoretical research with real business scenarios to explore innovative solutions, achieve technical breakthroughs, and build industry-leading multimodal large language models.
  • Industry Impact: Produce influential research outcomes and advance MLLM technologies within the industry through high-quality papers or open-source projects.

Who We Look For

  • Currently pursuing a postgraduate degree (Master’s or PhD) in Computer Science, Artificial Intelligence, Machine Learning, or a related field, and familiar with key research achievements and mainstream open-source projects in the field of multimodal large language models. Strong academic research capabilities with a track record of results, and practical experience in solving concrete algorithmic problems.
  • Strong algorithm implementation skills with proficiency in Python. Extensive knowledge of the mainstream deep learning platforms and algorithmic frameworks used for Multimodal Large Language Models (MLLMs) and foundation models.
  • Excellent analytical and problem-solving skills. Passionate about tackling challenging technical problems, with a strong spirit of teamwork and collaboration.
  • Excellent proficiency in both English and Mandarin Chinese, written and spoken, to collaborate effectively with global research and engineering teams.



Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.

