The Impact You Will Drive:
- As an engineer on the Open Source team at Onehouse, you'll play a pivotal role in shaping and realizing the vision and roadmap for Apache Hudi, while also shaping the future of data lakehouse space.
- Collaborate across multiple teams within Onehouse, serving as the vital bridge between the open-source Apache Hudi project and Onehouse's managed solution, ensuring seamless collaboration and integration.
- Engage closely with community partners and contributors, serving as a steward of the Apache Hudi project, fostering collaboration and guiding its evolution.
- Champion a culture of innovation, quality and timely execution, enabling the team to deliver on the vision of the next-generation data lakehouse.
- Architect and implement solutions that scale to accommodate the rapid growth of our customer base, open source community and the ever-expanding demands of the datalake ecosystem at large.
A Typical Day:
- Build, design and deliver features/improvements to Apache Hudi.
- Ensure high quality and timely delivery of innovations and improvements in Apache Hudi.
- Dive deep into the architectural details of data ingestion, data storage, data processing and data querying to ensure that Apache Hudi is built to be the most robust, scalable and interoperable data lakehouse.
- Own discussions and work with open source partners/vendors to: troubleshoot issues with Hudi, ensure Hudi support in for compute engines like Pretso/Trino and act as the face of Hudi to the community at large via meetups, customer meetings, talks etc.
- Partner with and mentor engineers on the team.
What You Bring to the Table:
- 5-7+ years building large-scale data systems.
- You embrace ambiguous/undefined problems with an ability to think abstractly and articulate technical challenges and solutions.
- Positive attitude towards seeking solutions to hard problems, with a bias towards action and forward progress.
- An ability to quickly prototype new directions, shape them into real projects and analyze large/complex data.
- Strong, object-oriented design and coding skills with Java, preferably on a UNIX or Linux platform.
- Experience with inner workings of distributed (multi-tiered) systems, algorithms, and relational databases.
- Experience with large scale data compute engines / processing frameworks.
- Experience building distributed and/or data storage systems or query engines.
- An ability to prioritize across feature development and tech debt, balancing urgency and speed.
- An ability to solve complex programming/optimization problems.
- Robust and clear communication skills.
- Nice to haves (but not required):
- Experience working with open source projects and communities.
- Experience in optimization mathematics (linear programming, nonlinear optimization).
- Existing publications of optimizing large-scale data systems in top-tier distributed system conferences.
- PhD or Masters degree in a related field with industry experience in solving and delivering high-impact optimization projects.
Other Jobs from Onehouse
Tech Lead Manager, Data Infrastructure (US)
Engineering Manager, Data Infrastructure (US)
Software Engineer (IN)
Tech Lead Manager, Data Infrastructure (India)
Staff Software Engineer, Open Source (US)
Similar Jobs
Director, Architecture
Backend Engineer, Admin Experience, Teams & Education (Open to remote across ANZ)
Senior Data Engineer
DATA ENGINEER
DATA ENGINEER
DATA ENGINEER
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 401 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say