ASAPP

Lead AI/ML Engineer

New York, NY Mountain View, CA
Python PyTorch TensorFlow AWS Docker Kubernetes OpenAI API Machine Learning AI gRPC Speech-to-Text Text-to-Speech
Description

Lead AI/ML Engineer

Team: Engineering

Location: New York, Mountain View

Commitment: Full-time

Workplace Type: hybrid

Salary:

Compensation package also includes a performance bonus on top of the listed salary range

Separately, we also offer a compelling equity grant comprised of stock options

At ASAPP, our mission is simple: deliver the best AI-powered customer experience—faster than anyone else. To achieve that, we’re guided by principles that shape how we think, build, and execute. We value customer obsession, purposeful speed, ownership, and a relentless focus on outcomes. ASAPP’s AI Engineering team is seeking an enterprising, talented and curious machine learning engineer. 

We are seeking a highly experienced Lead AI/ML Engineer to join our Core GenerativeAgent team. You will play a pivotal role in designing, building, and deploying cutting-edge AI systems that power mission-critical enterprise applications. This role is ideal for an individual who thrives in ambiguity, is deeply technical, and has a strong product sense paired with deep expertise in foundational models and enterprise AI systems.

You will lead the design and delivery of end-to-end voice AI solutions, combining large language models with speech technologies such as speech-to-text, text-to-speech, and real-time streaming audio pipelines. This role requires a hands-on technical leader who can architect low-latency, highly reliable conversational voice systems and guide a team through ambiguity toward production excellence.

We are looking for someone who understands the unique constraints of voice experiences, latency, turn-taking, interruption handling, streaming inference, and audio quality, and can translate these into scalable, enterprise-grade systems.

This is a hybrid role with weekly in-person responsibilities. We have offices in New York City and Mountain View, CA

What you'll do

  • Build real-time conversational AI systems, including voice interfaces powered by speech-to-text, text-to-speech, and streaming inference pipelines
  • Design and optimize low-latency inference workflows for multimodal applications involving text, speech, and real-time interactions
  • Integrate and apply foundation models from major providers (OpenAI, AWS Bedrock, Anthropic, etc.) for prototyping and production use cases
  • Adapt, evaluate, and optimize LLMs for domain-specific enterprise applications
  • Build and maintain infrastructure for experimentation, deployment, and monitoring of AI models in production
  • Improve model performance and inference workflows with attention to latency, cost, and reliability
  • Provide technical leadership within the team, mentoring engineers and promoting best practices in ML engineering
  • Partner with product and cross-functional stakeholders to translate requirements into scalable ML solutions
  • Contribute to the evolution of internal standards for experimentation, evaluation, and deployment

What you'll need

  • 6+ years of experience in Machine Learning or AI systems, with hands-on experience in LLMs, speech, or conversational AI systems
  • Experience building on integrating speech-to-text and text-to-speech systems
  • Strong experience integrating voice models into production applications
  • Proficiency on Python and ML frameworks like PyTorch or TensorFlow
  • Proven experience leading complex, cross-functional AI initiatives
  • Deep understanding of latency-sensitive system design and distributed architectures
  • Strong proficiency in Python and ML frameworks such as PyTorch or TensorFlow
  • Understanding of RAG pipelines, prompt engineering, and vector search
  • Experience deploying and scaling AI systems using AWS (required), Docker, Kubernetes, and CI/CD practices
  • Strong communication skills with the ability to align engineering, product, and executive stakeholders
  • Comfortable operating in fast-paced environments and driving clarity in ambiguous problem spaces

What we'd like to see

  • Experience with speech model fine-tuning and acoustic/language model optimization
  • Experience with production applications of S2S models
  • Hands-on experience with real-time or streaming audio systems (WebRTC, gRPC streaming, or similar architectures)
  • Experience optimizing TTS prosody, pronunciation control, and voice customization
  • Background in MLOps, experimentation platforms, or evaluation frameworks for speech and conversational systems
  • Contributions to open-source AI or speech tooling
  • Graduate degree (MS or PhD) in Computer Science, Machine Learning, Speech Processing, or related field
Benefits include:

Competitive compensation with stock options
Comprehensive medical, vision, and dental insurance
401k matching
Fitness and wellness stipend
Mental well-being benefits
Professional learning and development stipend
Parental leave, including adoptive and foster parents
3 weeks paid time off (increases with tenure) along with sick leave, bereavement and jury duty

ASAPP is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, disability, age, or veteran status. If you have a disability and need assistance with our employment application process, please email us at [email protected] to obtain assistance. #LI-SL1 #LI-Hybrid
ASAPP
ASAPP

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say