Clarivate Analytics

Futures Technology Intern

US
R Machine Learning Python Pandas
Description

We are looking for a TDM Data Scientist / Machine Learning Intern to join our TDM Studio Team in Ann Arbor, MI. This is an amazing opportunity to work on TDM Studio and related Data Science research. The team consists of 10 people and is reporting to the TDM Studio Manager and Dev Director. We have a great skill set in building data management and science products and we would love to speak with you if you have skills in NLP and working with text data.

About You – experience, education, skills, and accomplishments

  • Actively pursuing or completed a Bachelor’s or graduate degree majoring in Engineering, Computer Engineering, Computer Science or other related field at an accredited school.
  • Past experience and projects with Machine Learning and Natural Language Processing.

It would be great if you also had . . .

  • Advanced experience with Machine Learning projects in Python. Familiarity with the important NLP Python libraries including NLTK, scikit-learn, and Pandas.
  • Some knowledge and familiarity with using and fine tuning large language models.
  • Experience working with text data in different data formats such as XML and CSV.
  • Past experience mapping large text data from one structure or format to another
  • Some experience working with and creating sample Jupyter Notebooks for Data Science research

What will you be doing in this role?

  • Develop Python system to map between different XML text datasets to ingest new content for TDM Studio.
  • Conduct machine learning research and development related to newspaper article segmentation. This will require both work with large language models and NLP tasks as well as opportunities for computer vision experiments. 
  • Create and implement from scratch new Data Science-related Python and R scripts for TDM Studio.

About the Team

During this internship, we will focus on R&D endeavors related to processing large-scale newspaper data as well as development work related to TDM Studio. The intern will gain great experience working on a real-world, Data Science product which supports NLP and Machine Learning research.

As part of the TDM Studio team, we will be working specifically on three tasks throughout the internship: (1) Exploring and Developing machine learning models to segment page-level newspaper data into article-level; (2) Creating Python and R sample coding notebooks for us by researchers in TDM Studio; and (3) Developing Python code to map different XML structures to standard text ingest structure.

The TDM Studio team consists of a development team as well as a product and user experience team. You will gain knowledge of working across our TDM Studio team in this internship position.

Hours of Work

This is a hybrid position with 2-3 days per week in the Clarivate Ann Arbor office. Ideally this internship will be full time during the summer months (35-40 hours per week) and part time in the fall months (10-15 hours per week).

Clarivate is an Equal Opportunity Employer Vets/Minorities/Women/Disabled

Clarivate Analytics
Clarivate Analytics
Analytics Information Services Information Technology Innovation Management

0 applies

40 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

50,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 232 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

Cancel anytime / Money-back guarantee

Wall of love from fellow engineers