Roche

Machine Learning Scientist, Scientific Reasoning Models

New York City, NY South San Francisco, CA
USD 141k - 274k
Python Machine Learning AI LLM
Description

Machine Learning Scientist, Scientific Reasoning Models, AI for Drug Discovery

Location: New York City, South San Francisco

Time Type: Full time

Job Description

A healthier future. It’s what drives us to innovate. To continuously advance science and ensure everyone has access to the healthcare they need today and for generations to come. Creating a world where we all have more time with the people we love. That’s what makes us Roche.

Advances in AI, data, and computational sciences are transforming drug discovery and development. Roche’s Research and Early Development organisations at Genentech (gRED) and Pharma (pRED) have demonstrated how these technologies accelerate R&D, leveraging data and novel computational models to drive impact. Seamless data sharing and access to models across gRED and pRED are essential to maximising these opportunities. The new Computational Sciences Center of Excellence (CoE) is a strategic, unified group whose goal is to harness the transformative power of data and Artificial Intelligence (AI) to assist our scientists in both pRED and gRED to deliver more innovative and transformative medicines for patients worldwide.

The Opportunity

At Roche's AI for Drug Discovery (AIDD) group, we are revolutionizing drug discovery with cutting-edge machine learning (ML) techniques. We are seeking a Machine Learning Scientist to join the Foundation Models team within Prescient Design (gRED). In this role, you will contribute to our internal reasoning Large Language Models (LLMs) and enable them to succeed at relevant drug discovery tasks, including biomolecular design. You will work at the intersection of engineering and research, designing and scaling large machine learning systems.

In this role, you will:

  • Scalable Systems & Engineering: Design, implement, and improve large-scale distributed machine learning systems, writing robust, performance-critical code and contributing to core infrastructure.

  • Model Improvement & Reasoning: Develop and execute strategies to systematically improve performance on scientific tasks, including long-horizon task completion and complex reasoning challenges.

  • Domain Translation: Translate biological and chemical domain knowledge into concrete machine learning objectives, training signals, and evaluation criteria.

  • Evaluation & Benchmarks: Design and implement evaluation methodologies to assess model capabilities relevant to biological research, working with domain experts to establish benchmarks and curate high-quality data.

  • Research-to-Production: Collaborate closely with researchers to translate ideas and prototypes into scalable, production-ready systems.

As a Machine Learning Scientist:

  • Focus: You focus on the execution of defined projects. You are responsible for writing clean, efficient code to test specific hypotheses regarding reasoning and alignment.

  • Engineering: You contribute to the maintenance of the training infrastructure and data pipelines, ensuring experiments run reliably on our clusters.

  • Collaboration: You work closely with senior scientists to implement novel algorithms, translating research papers into working prototypes.

Who you are

  • BS/MS in Computer Science, Statistics, Mathematics, Physics, or a related quantitative field with 2+ years of relevant work experience, or a PhD with 0-2 years of relevant work experience.

  • LLM Expertise: Experience developing and training large-scale machine learning models, including post-training techniques to enhance domain knowledge, reasoning capabilities, and model alignment.

  • Publication Record: A strong history of research excellence at top-tier venues (e.g., NeurIPS, ICLR, ICML).

  • Engineering: Strong software engineering skills and experience working with high-performance computing systems.

Preferred

  • Experience with molecular modalities (e.g., protein sequences, chemical graphs, and structured molecular data).

  • A public portfolio of research or significant contributions to open-source ML libraries.

  • A passion for applying frontier AI to drug discovery.

Relocation benefits are NOT available for this job posting

The expected salary range for this position, based on the primary location of New York City, is $141,100 - $262,100, and for San Francisco, $147,600 - $274,000. Actual pay will be determined based on experience, qualifications, geographic location, and other job-related factors permitted by law. A discretionary annual bonus may be available based on individual and Company performance. This position also qualifies for the benefits detailed at the link provided below.

Benefits

#ComputationCoE

#tech4lifeComputationalScience 

#tech4lifeAI

Genentech is an equal opportunity employer. It is our policy and practice to employ, promote, and otherwise treat any and all employees and applicants on the basis of merit, qualifications, and competence. The company's policy prohibits unlawful discrimination, including, but not limited to, discrimination on the basis of protected veteran status or disability status, consistent with all federal, state, and local laws.

If you have a disability and need an accommodation in relation to the online application process, please contact us by completing the Accommodations for Applicants form.
