- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience programming with at least one software programming language
- Prior experience in deploying training and inference workloads on cloud instances covering both CPU and multi-GPU.
- 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Bachelor's degree in computer science or equivalent
- Work experience deploying ML/DL work-loads in production for video or computer vision use-cases
Prime Video is changing the way millions of customers interact with video content. Every day we face the challenges of a fast paced market, expanding technology set and a wide range of viewing devices.
Prime Video is looking for a driven and talented ML engineer with prior expertise in deploying, optimizing, and maintaining ML and DL-based workloads. ML/DL solutions enable content-adaptive processing and encoding of video as well as on models that measure video quality. You will help deploy proven algorithms/architectures, optimize, re-train, expand coverage to additional encoding profiles or codecs, quantize the models (as necessary), and integrate such workloads at scale with the help of other orchestration teams on instances that offer the best cost and turn-around times. You will develop suitable monitoring dashboards and guardrails to ensure proper operation.
Key job responsibilities
As an ML-engineer, you will assist Research/Applied Scientists in the team to collect ground-truth data, clean data and labels, set up scalable training of such models to utilize multiple GPUs efficiently, deploy pre-trained inference with optimal performance on appropriate EC2 instances, work with SDEs to define suitable job queues and APIs for the inference workloads to integrate them as part of larger orchestration, and will develop suitable monitoring dashboards to keep track of the different training/inference jobs.
You will triage operational bottlenecks and failures related to ML/DL workloads.
You will identify the evolving best practices for running such workloads at scale with optimum performance.
You will define/refine suitable processes related to maintenance of large datasets, framework versions, code maintenance, mechanisms used to identify the right instance type for a given algorithm and ways to maximize utilization of availed compute instances while meeting SLA guarantees.
A day in the life
You will extract and maintain features from a large set of training videos to train classical ML models.
You may obtain and maintain ground-truth labels required for training ML-models.
You will develop or adopt tools to monitor progress during training. You may perform cross-validation in multiple folds to verify the performance of different ML models.
You may benchmark readily available ML/DL solutions (open or proprietary) and compare them against internal solutions.
You will work with stakeholders (e.g. product, studios, Applied Scientists, Engineering team members) to facilitate fully automated as well as human-in-the-loop type of workflows.
You will create appropriate tickets for known issues and will triage and root-cause such issues as per their severity.
About the team
Our mission is to build and operate the most innovative video streaming technology stack that provides the best customer-centric streaming experience for VOD and Live globally and supports all business use cases (subscription, transactional, ad-supported). We invent and implement technologies that deliver a flawless, engaging streaming experience for our customers, using the fewest bits possible. We commit to our values of respect and integrity by creating a work environment that is supportive, diverse, inspiring, and inclusive.
Other Jobs from Amazon
Software Development Engineer in Test II, Visual Search
Systems Development Engineer, Managed Operations
2025 Software Dev Engineer Intern - ZAF
Similar Jobs
Lead Product Manager, Trading
Data Engineering Internship
Senior Software Development Engineer - Payments
Software Development Engineer, Ads Campaign Management
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
π₯³π₯³π₯³ 401 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineersβ¦ in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. π οΈ
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. π
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. π―
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. π
What Fellow Engineers Say