Description
Join our team as we innovate the future of data platform architecture, enabling massive scaling and data processing for ML and Gen AI projects. You'll be at the forefront of processing vast unstructured data, building high-throughput APIs, and supporting distributed compute frameworks for seamless model deployment. Ready to dive into the heart of cutting-edge tech?
Your role in action
- Build our next-generation data platform tooling and servicesto support the ingestion and processing of billions of documents at scale.
- Improve and extend our Spark based distributed data processing pipeline.
- Improve and extend our Rust based distributed query engine used to request large amounts of document data.
- Create tools to automate and optimize processes across disciplines
- Actively participate in the on-call schedule to investigate and fix production issues related to our data processing pipeline or query engine.
- Participate in code reviews for projects written by your team
- Focus on quality through comprehensive unit and integration testing
Your Skills
- 4+ years of software development experience in writing performant, commercial-grade systems and applications
- Experience with monitoring and troubleshooting production environments
- Proficiency in programming languages used in high volume data processing and applications like Java or Scala and Python
- Experience building data pipelines with distributed compute frameworks like Hadoop. Spark, orDask
- Knowledge of Linux/Unix systems, Docker/Kubernetes and CI/CD including scripting in Python or other scripting languages to automate build and deployment processes
- Knowledge of professional software engineering practices & software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
- Leverages best practices and past experiences to mentor and improve the productivity of the team
We’d particularly love it if you have
- Deep experience building and debugging distributed data pipelines
- Experience with columnar databases and storage formats like Delta Lake and Parquet
- Experience deploying and managing services on Kubernetes
- Experience building with Rust
- If you don’t meet 100% of the above qualifications, you should still seriously consider applying.
#LI-Hybrid
Relativity
Computer
Ediscovery
Enterprise Software
Information Technology
Legal
Legal Tech
Software
1 applies
69 views
Jobs from our Partners
Sr Principal Software Engineer- Huntsville
Huntsville, AL
US
Software Developer (TEX) - Huntsville
Huntsville, AL
US
Automated Software Test Developer - Huntsville
Huntsville, AL
US
Power Platform Mobile Applications Developer - Remote (WFH)
Phoenix, AZ
US
Other Jobs from Relativity
Senior .NET Full-stack Engineer (Ingestion)
Remote
Wroclaw, Poland
Senior .NET Full-stack Engineer (Ingestion)
Remote
Warsaw, Poland
Similar Jobs
Principal Software Engineer/Developer
Boston, MA
US
Staff Software Engineer
London, UK
Lead Data Engineer - Data Platform (Bangkok-Based, Relocation Provided)
Bangkok, Thailand
Sr. SW Engineer -Java Full stack
Bengaluru, India
Senior Devops Engineer
Singapore
Software Engineer - Data & Machine Learning Platform
Remote
Washington, D.C.
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
50,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 257 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
Cancel anytime / Money-back guarantee