Devtech Compute Engineer
Location: Beijing, China; Shanghai, China
Time Type: Full time
Job Description
We’re working on the next generation of recommendation tools and pushing the boundaries of accelerating model training and inference on GPUs. You’ll join a team of ML, HPC and Software Engineers and Applied Researchers developing a framework designed to make the productization of GPU-based recommender systems as simple and fast as possible.
What you'll be doing:
In your role as Devtech Compute Engineer or CUDA Performance Engineer, you will be primarily responsible for developing the performance-critical code of our deep learning applications, with the goal of establishing world-class performance for our customers.
This includes investigating the current performance and exploring optimization opportunities together with the global developers.
An important part of the work is integrating these solutions into our open source software libraries, such as ACCV-Lab and Recsys-Example, once optimal performance has been demonstrated.
With your knowledge of customer requirements and performance bottlenecks, you will also work with our GPU, CPU and network teams to define the next generation of hardware and software solutions.
Our coverage is wide, including LLMs, recommender systems, robotics and assisted driving.
What we need to see:
2+ years of experience in C++ code development in collaborative software development projects
Skilled at writing CUDA kernels and optimizing code
Basic knowledge of ML algorithms and deep learning
Basic knowledge and understanding of mathematical topics including linear algebra, calculus and statistics
Experience with algorithms and optimization
Python and Jupyter Notebook for analysis, algorithm exploration and processing
High standard for code quality and rigorous testing practices
Conversational level English proficiency
Some experience with Linux, OpenMP and MPI
Ways to stand out from the crowd:
Experience in C++ HPC code development / PhD in related fields
Able to perform in-depth performance analysis and to model performance using mathematical and statistical considerations
Linear algebra, calculus and statistics are second nature to you, as reflected in your background in mathematics, physics, applied science or an HPC-related field
Demonstrated ability to write CUDA kernels that utilize the hardware to its full potential
Write unit tests and validate the correctness of optimizations; strive for and propose optimal solutions and ambitious goals, and convince and help others to do the same
