Job Details:
Job Description:
We are looking for a senior contributor to design, develop and optimize AI frameworks for Inference. In this role, you will work with cross-geo teams to enhance the inference stack to ensure competitive performance on deep learning inference models with a specific focus on the PyTorch framework.
The roles and responsibilities that you would need to perform may include the following:
- Design and develop SW techniques for AI frameworks - both HW-agnostic and HW-aware
- Contribute to enhancing and extending the Inference and Training capabilities in our Software stack
- Profile deep learning inference workloads as needed and identify optimization opportunities
Qualifications:
- BTech, MS or PhD in CS or related fields with overall experience of 5+ years
- At least 2 to 3 years of experience working on inference frameworks or tools for deep learning models that have been deployed to or used by customers
- Architecture/Design contributions to Inference systems
- Detailed understanding of machine learning systems optimization and deployment techniques such as quantization
- Experience with optimization techniques for deployment of Large Language Models (LLMs)
- Deep implementation knowledge of transformers and inference-specific optimizations
- Advanced programming skills in C++ and Python, plus parallel programming skills
- Ability to debug complex issues in multi-layered SW systems
- Understanding of SW integration across open source frameworks and internal framework layers
- Strong understanding of computer architecture
- Effective communication skills and experience working in a cross-geo setup
Preferred
- Experience working on and contributing to Inference serving solutions
- Knowledge of compiler algorithms for heterogeneous systems
- Knowledge of open source compiler infrastructure like LLVM or gcc
- Understanding of low-level kernels
Job Type:
Experienced Hire
Shift:
Shift 1 (India)
Primary Location:
India, Bangalore
Additional Locations:
Business group:
The Data Center & Artificial Intelligence Group (DCAI) is at the heart of Intel’s transformation from a PC company to a company that runs the cloud and billions of smart, connected computing devices. The data center is the underpinning for every data-driven service, from artificial intelligence to 5G to high-performance computing, and DCAI delivers the products and technologies (spanning software, processors, storage, I/O, and networking solutions) that fuel cloud, communications, enterprise, and government data centers around the world.
Posting Statement:
All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.
Position of Trust
N/A
Work Model for this Role
This role will be eligible for our hybrid work model, which allows employees to split their time between working on-site at their assigned Intel site and off-site.
* Job posting details (such as work model, location or time type) are subject to change.