Aleph Alpha

Senior Data Engineer (f/m/d)

Berlin, Germany
GCP AWS Kubernetes API Machine Learning Java Go Azure Python Scala
Search for More Jobs Talk to a recruiter now 💪
Description

Overview:

Join our forward-thinking data team to drive the development of cutting-edge data solutions for Generative AI applications. We leverage data to empower our customers, fuel our own innovation, and support groundbreaking research. As a Senior Data Engineer, you will take a leadership role in designing, optimizing, and scaling our data infrastructure, ensuring that our solutions are not only robust but also capable of meeting the challenges of tomorrow.

Your Responsibilities:

  • Lead the design and implementation of scalable data architectures to handle large-scale datasets (terabytes of text) with a focus on storage, versioning, and documentation best practices.

  • Architect, develop, and oversee the maintenance of web services that enable efficient consumption of harvested data.

  • Partner with researchers, software engineers, and leadership to continuously refine data collection methodologies and identify new data opportunities.

  • Strategize and prepare large datasets for diverse Machine Learning use cases, with an emphasis on Generative AI.

  • Build, optimize, and automate advanced preprocessing pipelines tailored to specific applications, ensuring high performance and reliability.

  • Ensure data services are robust, scalable, and meet the needs of cross-functional teams developing new products on top of our data infrastructure.

  • Mentor and guide junior data engineers, fostering a culture of knowledge sharing and continuous improvement.

Your Profile:

  • You have 7+ years of experience as a Data Engineer, with a proven track record of architecting large-scale data systems.

  • You are an expert in Python and proficient in at least one other programming language (e.g., Java, Scala, or Golang).

  • You possess a deep understanding of distributed systems, with a demonstrated ability to design and manage efficient data pipelines in both cloud and on-prem environments.

  • You have a strong software engineering background, with a focus on writing clean, maintainable, and well-documented code.

  • You excel at data wrangling, including advanced techniques for extracting, transforming, cleaning, and standardizing data from multiple sources.

  • You bring expertise in Generative AI use cases and understand the pivotal role of data in developing cutting-edge AI solutions.

  • You have experience in driving projects, influencing stakeholders, and aligning data strategies with broader business goals.

Nice to Have:

  • Experience working in multi cloud environments (e.g., GCP, Azure, AWS) as well as on-premise data solutions.

  • Background in Machine Learning or Data Science, with a particular focus on applying data engineering principles to AI research.

  • Proficiency in Golang and an interest in adopting new technologies.

  • Familiarity with Kubernetes for container orchestration and managing scalable deployments.

Aleph Alpha
Aleph Alpha
Artificial Intelligence (AI) Generative AI Machine Learning Natural Language Processing Software

0 applies

4 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 401 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say