Dialpad

Machine Learning Engineer

Vancouver, British Columbia
USD 148k - 173k
PyTorch gRPC Kubernetes Azure Machine Learning Python TensorFlow AWS GCP
Description

About Dialpad

Dialpad is the leading Ai-powered customer communications platform creating human-first, Ai-enhanced solutions that will drive the next wave of how businesses communicate with and serve their customers. Enterprise customers like Randstad, Remax, Mizuho, Cigna, T-Mobile, Johns Hopkins, Motorola, Warby Parker, Panera Bread, and Netflix use Dialpad and its Ai capabilities to deliver amazing customer experiences. Supported by notable investors such as Andreessen Horowitz, Google Ventures, and ICONIQ Capital, Dialpad is a dynamic force in Ai technology with a rapidly expanding presence. Visit dialpad.com to learn more.

About the team

Our NLP team at Dialpad is a diverse and vibrant group of scientists who come from varied fields such as linguistics, computational linguistics, machine learning, computer science, material science, biology, and quantum physics. We prioritize innovation and creativity in our pursuit to push the boundaries of Natural Language Processing and Artificial Intelligence.

We deliver natural language understanding (NLU) features for transcribed speech and typed inputs. The team's current focus is enhancing DialpadGPT, our in-house LLM designed specifically for business communication, which powers several features. Beyond technical skills, we're a team that values collaboration, continuous learning, and the application of diverse perspectives to solve complex problems.

Together, our goal is to revolutionize business communications, making them more efficient, accurate, and accessible by harnessing the power of ASR, NLP, and machine learning. Using a variety of tools and technologies, we're driven to continuously improve and refine our products, delivering top-tier solutions to our customers.

Your role

As a Machine Learning Engineer at Dialpad, you'll be an integral part of our NLP team, working on our in-house LLM, DialpadGPT, and other LLMs, and addressing the unique challenges of training large models on big data across third-party and on-prem distributed infrastructure. You will maintain robust multi-node training infrastructure and clusters, optimize distributed training workflows (including data loading and logging), debug complex system errors and hardware failures, and ensure efficient utilization of resources.

If you're passionate about large scale machine learning, generative AI, and contributing to a team that's changing the face of business communications, you'll find yourself right at home with us.

This position reports to the Manager of the NLP team and can be based in our office in Vancouver, BC or Kitchener, ON.

What you’ll do

  • Enhance LLM training efficiency by optimizing scripts and architectures, leveraging CUDA and advanced GPU acceleration techniques to improve performance and reduce training time.
  • Optimize preprocessing pipelines by resolving timeout issues and implementing caching strategies.
  • Ensure reliable distributed networking by addressing connection failures and implementing monitoring systems.
  • Minimize downtime on rental machines through efficient recovery workflows and off-hour error mitigation.
  • Streamline debugging of distributed systems, addressing node failures and network partitions.
  • Develop scalable logging frameworks for efficient error detection and resolution.
  • Optimize system scalability by balancing workloads and reducing communication overhead.
  • Design fault-tolerant systems to handle node crashes and ensure seamless job restarts.
  • Monitor system health and implement recovery strategies for unresponsive nodes.
  • Manage massive data sets for large-scale model training workflows.
  • Develop real-time monitoring frameworks for training processes to detect issues and raise alerts promptly.

Skills you’ll bring

  • Master’s or PhD degree in Computer Science, Machine Learning, or a related field.
  • 3+ years of experience in distributed systems or machine learning infrastructure.
  • Strong proficiency in Python and experience with distributed ML frameworks (e.g., PyTorch, TensorFlow, Horovod, or Ray).
  • Understanding of networking protocols and distributed communication libraries (e.g., NCCL, gRPC).
  • Hands-on experience with cloud platforms (AWS, GCP, Azure) and cluster orchestration tools (Kubernetes, Slurm).
  • Proven ability to debug and resolve issues in large-scale distributed systems.
  • Familiarity with fault tolerance, caching strategies, and scalable logging systems.
  • Excellent problem-solving and communication skills.

Dialpad benefits and perks

Equity, balance, and belonging

At Dialpad, we strive to ensure that Dialers are free to bring their full selves to work each day, and champion the intersectionality that exists between gender identity, ethnicity, age, disability status, and the many other aspects of our greater humanity.

Culture

We’ve been named a Top Workplace seven times, and a big part of this is because of our collaborative culture that elevates our teammates, celebrates wins, and brings together passion and talent. 

Compensation and equity

Teamwork makes the dream work, and Dialpad offers competitive salaries in addition to stock options because each and every Dialer participates in our success.

Benefits, time-off, and wellness

An apple a day keeps the doctor away, and it doesn’t hurt that we offer flexible time off and great options for medical, dental, and vision plans for all employees. Employees also receive a stipend to help cover their cell phone bill, home internet bill, home office equipment, gym membership costs, a variety of wellness events, and more!

Continuing education

Dialpad offers a yearly stipend for continued learning and education expenses.

For exceptional talent based in British Columbia, Canada the target base salary range for this position is posted below. Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the target range for new hire salaries for the position. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process. Please note that the compensation details listed in British Columbia role postings reflect the base salary only, and do not include bonus, equity, or benefits.

British Columbia, Canada Salary Range
$148,700 - $173,233 CAD

Diversity, Equity, and Inclusion (DEI) at Dialpad

At Dialpad, we are passionate about Doing the Right Thing. This means we are committed to building a values-driven culture that celebrates identity, inclusion and belonging. As a global company, it’s our responsibility to come together to create a culture where all Dialers can Work Beautifully, Delight Our Users, and Innovate Continuously to bring our world-class product to life.

Every Voice Matters at Dialpad. We build community through our Employee Resource Groups, company-wide celebrations, service days, and a robust internal learning & development program focused on the success of our Dialers.

Don’t meet every single requirement? Studies have shown that women and marginalized groups are less likely to apply to jobs unless they meet every single qualification. At Dialpad we are dedicated to building an inclusive and authentic workplace, so if you’re excited about this role but your past experience doesn’t align perfectly with every qualification in the job description, we encourage you to apply anyway. You may be just the right candidate for this or other roles.

Dialpad is an equal-opportunity employer. We are dedicated to creating a community of inclusion and an environment free from discrimination or harassment.
