RefinedScience

Bioinformatics Machine Learning Intern

Remote
USD 71k - 79k
Python R Bash Git Docker GCP PyTorch TensorFlow Scikit-learn API Machine Learning Deep Learning AI LLM
Description

Bioinformatics Machine Learning Intern

Location: Remote

Department: Bioinformatics

Bioinformatics Machine Learning Intern

RefinedScience | United States (hybrid or remote)

At RefinedScience, our mission is to advance care by bringing together the best science, data and minds – disease by disease, patient by patient, cell by cell to discover pathways to life beyond disease.

What We Are Looking For

We are seeking a highly motivated Bioinformatics Machine Learning Intern to join our team. This internship is designed for Ph.D. candidates with experience applying machine learning, deep learning, or generative AI methods to single-cell omics data. You will contribute to active projects spanning single-cell biology, multiomics integration, and computational approaches to precision medicine and drug development.

Our Bioinformatics team plays a crucial role in integrating computational biology, large-scale data analysis, and machine learning to drive discoveries in precision medicine and drug development.

Key Activities

  • Analyze single-cell and multiomics datasets to extract biological insights supporting precision medicine and drug development programs
  • Apply and evaluate machine learning and deep learning approaches to single-cell data for tasks such as cell type classification, biomarker discovery, and patient stratification
  • Explore and prototype generative AI and LLM-based approaches to accelerate biological data interpretation and scientific workflows
  • Collaborate with scientists, clinicians, and data scientists to design and execute data-driven research projects
  • Document and optimize computational workflows following reproducible research best practices
  • Present findings through technical reports, visualizations, and presentations to cross-functional teams

Must Haves

  • Current Ph.D. candidate in Bioinformatics, Computational Biology, Computer Science, Biostatistics, or a related quantitative field
  • Single-cell omics experience: Demonstrated ability to process, analyze, and interpret single-cell data (scRNA-seq, scATAC-seq, CITE-seq, or spatial transcriptomics) using frameworks such as Scanpy/scverse, Seurat, or Bioconductor
  • Machine learning expertise: Applied experience developing and evaluating ML/deep learning models on biological data, including neural network architectures (GNNs, transformers, autoencoders), model selection and benchmarking, and integration of ML approaches into analytical workflows
  • Programming proficiency: Python and/or R for data analysis, statistical modeling, and visualization
  • Statistical foundation: Understanding of statistical methods for biological data (hypothesis testing, differential expression, multiple testing correction, clustering)
  • Strong problem-solving skills and ability to communicate complex insights effectively

Desired Qualifications

Machine Learning & AI

  • Experience with deep learning frameworks (PyTorch, TensorFlow, JAX)
  • Familiarity with graph neural networks, attention mechanisms, or transformer architectures applied to biological data
  • Experience with ML experiment tracking and reproducibility (MLflow, Weights & Biases)
  • Exposure to representation learning, variational autoencoders, or contrastive learning methods
  • Familiarity with scikit-learn, XGBoost, or similar ML libraries
  • Interest in or experience with LLMs, RAG systems, or agentic AI tooling

Bioinformatics

  • Experience with multimodal single-cell integration (Seurat WNN, scvi-tools/MultiVI/totalVI, Muon)
  • Familiarity with spatial transcriptomics analysis (Squidpy, cell2location, nf-core/spatialvi)
  • Experience with cell-cell communication inference (CellChat, NicheNet, LIANA)
  • Knowledge of drug-gene interaction resources (CMap/LINCS, OpenTargets, ChEMBL)

Engineering & Infrastructure

  • Familiarity with Linux/Unix CLI and version control (Git/GitHub)
  • Experience with containerization (Docker, Singularity) and environment management (conda, venv)
  • Exposure to cloud computing platforms (GCP preferred)
  • Familiarity with workflow managers (Nextflow, Snakemake)
  • Adherence to best-practices for conduct reproducible computational research

Duration

8–10 weeks

Why You'll Love RefinedScience

Team + Values

At RefinedScience, we seamlessly integrate top-tier clinical and biological data with expert knowledge to provide unparalleled insights. We maximize patient impact with these unique insights by optimizing clinical trial probability of success and time to actionable results. We work across biopharma and we are a trusted partner in achieving better results, faster – working together to unlock strategic advantage.

Our Values

  • Act with Purpose – We believe in rigor through deliberate and thoughtful actions
  • Be Curious – Curiosity is the spark that ignites innovation and growth
  • Take Ownership – True ownership leads to pride and commitment in the work we do
  • Invest in Relationships – Building strong connections is the foundation for effective collaboration and trust for long term success
  • Embrace Agility – We celebrate agile thinking, resilience, and adaptability

Compensation

  • $34-$38 per hour

 

RefinedScience
RefinedScience

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say