We are looking for a TDM Data Scientist / Machine Learning Intern to join our TDM Studio Team in Ann Arbor, MI. This is an amazing opportunity to work on TDM Studio and related Data Science research. The team consists of 10 people and is reporting to the TDM Studio Manager and Dev Director. We have a great skill set in building data management and science products and we would love to speak with you if you have skills in NLP and working with text data.
About You – experience, education, skills, and accomplishments
- Actively pursuing or completed a Bachelor’s or graduate degree majoring in Engineering, Computer Engineering, Computer Science or other related field at an accredited school.
- Past experience and projects with Machine Learning and Natural Language Processing.
It would be great if you also had . . .
- Advanced experience with Machine Learning projects in Python. Familiarity with the important NLP Python libraries including NLTK, scikit-learn, and Pandas.
- Some knowledge and familiarity with using and fine tuning large language models.
- Experience working with text data in different data formats such as XML and CSV.
- Past experience mapping large text data from one structure or format to another
- Some experience working with and creating sample Jupyter Notebooks for Data Science research
What will you be doing in this role?
- Develop Python system to map between different XML text datasets to ingest new content for TDM Studio.
- Conduct machine learning research and development related to newspaper article segmentation. This will require both work with large language models and NLP tasks as well as opportunities for computer vision experiments.
- Create and implement from scratch new Data Science-related Python and R scripts for TDM Studio.
About the Team
During this internship, we will focus on R&D endeavors related to processing large-scale newspaper data as well as development work related to TDM Studio. The intern will gain great experience working on a real-world, Data Science product which supports NLP and Machine Learning research.
As part of the TDM Studio team, we will be working specifically on three tasks throughout the internship: (1) Exploring and Developing machine learning models to segment page-level newspaper data into article-level; (2) Creating Python and R sample coding notebooks for us by researchers in TDM Studio; and (3) Developing Python code to map different XML structures to standard text ingest structure.
The TDM Studio team consists of a development team as well as a product and user experience team. You will gain knowledge of working across our TDM Studio team in this internship position.
Hours of Work
This is a hybrid position with 2-3 days per week in the Clarivate Ann Arbor office. Ideally this internship will be full time during the summer months (35-40 hours per week) and part time in the fall months (10-15 hours per week).
Clarivate is an Equal Opportunity Employer Vets/Minorities/Women/Disabled
0 applies
40 views
Jobs from our Partners
Oracle Cloud Fusion BI Publisher Engineer – ETS Engineer III
Staff Software Engineer II (Hybrid)
Other Jobs from Clarivate Analytics
Senior Software Engineer
Associate Project Manager
Senior Director, Sales- Real World Data
Senior Healthcare Research & Data Analyst
Healthcare Research & Data Analyst
Full Stack Software Engineer
Similar Jobs
Senior Security Operations - Project Manager - CTJ - Poly
Senior Data Scientist
Site Reliability Engineer II
Associate Director Data Science
Data Scientist
QA Engineer
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
50,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 232 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
Cancel anytime / Money-back guarantee