Ripjar

Data Scientist - Core Analytics

London, UK
Python PyTorch NumPy Spark Hadoop Machine Learning
Description

At Ripjar, we help governments and organisations automate the detection, investigation, and monitoring of threats from criminal activity.

Ripjar originally span out from GCHQ and now has 140 staff based across Cheltenham, Bristol, London and Canberra, as well as a smaller presence in the USA. We have two successful, inter-related products; Labyrinth Screening and Labyrinth Intelligence. Labyrinth Screening allows companies to monitor their customers or suppliers for entities that they aren’t allowed to or do not want to do business with (for ethical or environmental reasons). Labyrinth Intelligence empowers organisation to perform deep investigations into varied datasets to find interesting patterns and relationships.

Data infuses everything Ripjar does. We work with a wide variety of datasets of all scales, including an always-growing archive of 10 billion news articles in (nearly!) every language in the world going back over 30 years, sanctions and watchlist data provided by governments, plus 250 million organisations and ownership data from global corporate registries.

This is a great time to join a growing group of highly talented technologists and data scientists who are building products that solve real world issues and are changing the way criminal activities are detected and prevented.

Team Mission

The data science team, which sits within the engineering team, enables the delivery of high-quality data science products and software to a variety of environments through technical skills, process implementation and software management, anchored in a continuous innovation culture.

What you'll be doing

We're looking for an experienced, highly motivated Data Scientist to support the research, development, and ongoing maintenance of Ripjar's analytics and data products. You will carry out data analysis tasks to develop Ripjar’s understanding of relevant data and will develop, evaluate and deploy machine learning models that integrate with Ripjar's software products and data processing pipelines. You will be working with Language models, machine learning tools and large-scale distributed clusters. This role is well suited to a Data Scientist with a strength in computing and engineering, who (as well as deriving insights) is keen to deploy data science products and continue their ongoing improvements through iteration.

You will have a strong technical and theoretical background, and be proficient in at least one programming language, preferably Python. You will have a good understanding of machine learning and large-scale data analysis, and will be comfortable working with complex data at scale. 

Some recent developments, Ripjar’s data scientists have been involved with:

Key Tasks:

  • Carry out data analysis tasks to develop Ripjar’s understanding of relevant data.
  • Make use of Ripar’s large-scale data processing and analysis infrastructure to analyse data sets in order to identify patterns and to produce statistical outputs to support the development of new analytics and models.
  • Develop and evaluate machine learning models to enhance Ripjar’s software and data products.
  • Integrate these models into our software and consider the lifecycle and practical use of each model.
  • Work with Ripjar's Data Engineers and engineering teams to support the scaling up and integration of new analytics and models into Ripjar's products and data processing pipelines.
  • Produce statistical tests and summarise test outputs.
  • Document analytics, models and test methodologies.
  • Provide support to stakeholders in understanding analytics, models and test results.
  • Support and maintain your models in production.

Key Skills

We value diversity of experience and thought and recognise successful candidates may not tick all the following boxes. If you If you think you have something to offer, then we'd love to chat to you and hear how you would contribute to this role.

  • A good understanding of machine learning and experience training and evaluating machine learning models.
  • Experience integrating data science models into products, with testing, and maintaining those models long term. 
  • Proficiency using Natural Language Processing techniques for solving problems, ideally including Large Language Models
  • Proficiency in Python, particularly with machine learning and data science libraries such as PyTorch, scikit-learn, numpy and scipy.
  • Good communication and interpersonal skills.
  • Experience working with large-scale data processing systems such as Spark and Hadoop.
  • Experience in software development in agile environments and an understanding of the software development lifecycle.
  • Experience using or implementing ML Operations approaches is valuable.
  • Working knowledge of statistics and experience with producing data visualisations.

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 401 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say