AI Performance Engineer
Location: US - Milpitas
Department: Performance
About us
Graphcore is one of the world’s leading innovators in Artificial Intelligence compute.
It is developing hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs and power the widespread adoption of AI solutions across every industry.
As part of the SoftBank Group, Graphcore is a member of an elite family of companies responsible for some of the world’s most transformative technologies. Together, they share a bold vision: to enable Artificial Super Intelligence and ensure its benefits are accessible to everyone.
Graphcore’s teams are drawn from diverse backgrounds and bring a broad range of skills and perspectives. A melting pot of AI research specialists, silicon designers, software engineers and systems architects, Graphcore enjoys a culture of continuous learning and constant innovation.
Job Summary
Graphcore’s AI/ML training and inference infrastructure is rapidly scaling to meet the growing demands of AI workloads across mobile, edge, and datacenter environments. This role focuses on optimizing performance across ARM-based architectures and large-scale distributed systems, ensuring efficiency, scalability, and reliability across the full hardware-software stack.
The Team
The System Engineering Performance team architects and optimizes high-performance infrastructure for large-scale datacenter deployments. The team works across hardware, software, networking, and system architecture to deliver cutting-edge AI solutions and ensure optimal system performance at scale.
Responsibilities and Duties
- Analyze ML models’ compute and memory requirements using roofline analysis and simulations
- Collaborate across hardware and software teams to optimize large-scale AI workloads
- Benchmark, monitor, and troubleshoot system performance across distributed systems
- Optimize communication stacks including MPI, NCCL, UCX, RDMA, and networking fabrics
- Profile AI workloads to identify and resolve performance bottlenecks
- Develop high-quality, ARM-compatible code and documentation
Candidate Profile
Essential:
- BS/MS in Computer Science, Electrical Engineering, or a related field
- Experience with distributed systems and communication libraries (MPI, NCCL, UCX, libfabric)
- Strong programming skills in C++ and Python
- Experience profiling and optimizing HPC or AI/ML workloads
- Familiarity with ML benchmarks such as MLPerf
Desirable:
- Experience with GPUs or accelerated computing architectures
- Knowledge of HPC networking and interconnect technologies (InfiniBand, RoCE)
- Familiarity with ML frameworks such as PyTorch or TensorFlow
- Understanding of ARM architectures and toolchains
- Strong debugging, profiling, and performance optimization skills
