lithosquare

Data Engineer

Paris, France
Description

Department: Technology

Location: Paris Office

Employment Type: Full-time

About the company

The transition to a sustainable future requires discovering new mineral resources to power clean technologies and renewable energy solutions. From lithium for electric vehicle batteries, to copper for wind turbines, and rare earth elements for electronics — these minerals are the building blocks of our energy transition.

Lithosquare radically speeds up mineral exploration by combining foundational AI, geological expertise, and real-world data to reduce uncertainty, prioritize the right targets, cut costs, and accelerate discovery.

Based in Paris, Lithosquare has gathered an exceptional team of geologists, scientists, AI engineers, and data specialists who work as one, from field sampling to model optimization, to push the boundaries of what’s possible.

About the job

As a Data Engineer, you will architect the data engine powering our Geology OS, building the infrastructure to process planetary-scale datasets, from satellite imagery and LiDAR to complex geological surveys. Your mission is to transform massive, unstructured multi-source data into high-performance structured databases.

You will build intelligent pipelines leveraging GenAI to handle data variability and evolve our sovereign, open-source analytics stack to monitor global operations and quantify platform value. We seek an engineer with a passion for clean data modeling and expertise in deploying open-source tools in cloud environments.

The role is based in Paris with a flexible remote working policy.

What you’ll do

  • Build intelligent ingestion: design and scale robust pipelines to harvest data from diverse sources, including satellite imagery (multispectral), LiDAR point clouds, and public/private multimodal geological records;

  • Implement self-adjusting pipelines: integrate GenAI/LLMs into our data workflows to create auto-adjustable pipelines capable of handling schema shifts and unstructured document extraction;

  • Geospatial processing & tiling: architect high-performance systems for raster processing and vector tiling (COG, GeoJSON) to enable real-time 3D visualization and cartography;

  • Own the analytics stack: architect and deploy our internal analytics infrastructure using open-source tools to monitor mining operations and field processes;

  • Quantify product value: build data models and dashboards to track platform usage and quantify the scientific and economic value delivered to our geologists;

  • Lead data modeling: design and maintain scalable data schemas that serve as the single source of truth for the entire company;

  • Cross-functional collaboration: partner with AI engineers and geologists to align on data ingestion requirements, structural modeling, and analytics;

  • Production ownership: deploy and operate data services in production (cloud services), ensuring high availability, data observability, and strict security for sensitive exploration data;

  • Tech advocacy: continuously evaluate and implement emerging open-source data technologies to maintain our competitive edge in data processing.

Technical Stack

  • Languages: Python (expert level), SQL (GIS), Bash

  • Geospatial Libraries: GDAL/OGR, Rasterio, Shapely, Fiona, PyProj, GeoPandas

  • Data Formats & Tiling: GeoTIFF / COG, GeoParquet, LAS/LAZ, Zarr, Vector Tiles

  • Orchestration: Temporal.io, Airflow or Dagster

  • AI Integration: LLM orchestration, vector databases, prompt engineering for ETL

  • Cloud & Infrastructure: Docker, Kubernetes, Terraform

  • Analytics & BI: dbt, Metabase, open-source observability tools

What we are looking for

  • 5+ years of experience in Data Engineering, with a proven track record of building scalable production systems;

  • Geospatial & remote sensing expertise: deep proficiency in processing raster, vector, and point cloud data, with a solid understanding of coordinate reference systems (CRS) and geospatial indexing;

  • Expertise in Python & SQL: ability to write highly optimized code and complex analytical queries;

  • AI-driven engineering: proven experience integrating LLMs/GenAI into data pipelines to automate the extraction and classification of complex, unstructured documents;

  • Architectural vision: ability to build a modern analytics and geospatial stack from a blank slate, including tiling services (COG, MVT) for web visualization;

  • Rigorous data modeling: strong foundation in data warehousing concepts and performance optimization;

  • Infrastructure fluency: understanding of Kubernetes and containerized environments for deploying data workloads;

  • Mission-driven: a genuine passion for the energy transition and solving "hard" physical-world problems through digital innovation.

Perks & Benefits

  • 🏢 Offices located in the heart of Paris

  • 🌱 Strong culture of ownership & entrepreneurship, with clear growth paths as the company expands

  • 🌍 Opportunity to significantly contribute to the energy transition

  • 👥 Collaborative work environment with world-class experts in geology, AI, and data science

  • 🔄 Flexible work arrangements enabling work-life balance

  • 💰 Competitive salary package

  • 🍽️ Meal vouchers and premium health insurance coverage (Alan)

Join Lithosquare and become part of a passionate team driving innovation at the intersection of AI and Earth exploration. Let’s make a tangible difference together!
