TikTok

Tech Expert/Backend Engineer - Global Live (LLM Model Serving)

Singapore
Deep Learning, Machine Learning, C++, Go, Kafka, Python, Streaming, TensorFlow, PyTorch, Redis
Description
TikTok is the leading destination for short-form mobile video. At TikTok, our mission is to inspire creativity and bring joy. TikTok's global headquarters are in Los Angeles and Singapore, and its offices include New York, London, Dublin, Paris, Berlin, Dubai, Jakarta, Seoul, and Tokyo.

Why Join Us
Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible.
Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day.
To us, every challenge, no matter how difficult, is an opportunity: to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.
At TikTok, we create together and grow together. That's how we drive impact - for ourselves, our company, and the communities we serve.
Join us.

About Our Team
The mission of the Global Live Service Architecture team is to build real-time interactive architecture and safeguard Global LIVE. We are seeking highly skilled and experienced Expert/Senior Engineers to join our TikTok Live Architecture team. TikTok Live is a worldwide leader in live streaming, with more than 50% of the market share. On the LLM team, you will have the chance to understand the most advanced LLMs and to design architecture that applies them in some of the world's largest businesses. We are people at the forefront of the world.

Responsibilities:
- Model Service Deployment: Responsible for converting large-scale deep learning models into scalable services that meet the diverse needs of TikTok's live streaming business.
- Performance Optimization: Optimize the performance of model inference, including but not limited to efficient utilization of computing resources, minimizing response time, and maximizing throughput.
- Cross-Team Collaboration: Work closely with algorithm and business teams to facilitate the deployment of models into production and resolve issues that arise in the production environment.
- Technical Innovation: Continuously monitor and explore new technologies and methods in the AI field to drive technological advancement in model services.

Minimum Qualifications:
- Bachelor's degree or higher in Computer Science, Software Engineering, Artificial Intelligence, or related fields.
- 3+ years of relevant work experience, including experience deploying and serving large-scale machine learning models.
- Proficiency in mainstream deep learning frameworks (such as TensorFlow, PyTorch, DeepSpeed) and their deployment in production environments.
- Familiarity with model inference optimization techniques, such as quantization, distillation, distributed inference, ONNX, ZeRO, etc.
- Familiarity with online service tech stacks, such as RPC, Redis, Kafka, etc.
- Strong programming skills, proficient in Python, C++ or Golang, with a deep understanding of system performance optimization.

Preferred Qualifications:
- Experience deploying and optimizing LLMs.

TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

