Deep Infra

Software Engineer

Palo Alto, CA
Python C++ CUDA NCCL NumPy Pandas SciPy TensorFlow PyTorch Git
Description

Software Engineer, Early Career

Location: Palo Alto, CA, USA

Department: Engineering

Location Type: IN_OFFICE

Employment Type: FULL_TIME

DeepInfra is looking for early-career Software Engineers (0-2 years of experience, including internships) to join our team. You’ll work closely with our experienced engineers to design, build, and scale infrastructure for serving top open-source AI models. This role is ideal for recent graduates or junior engineers who want to grow quickly while working on high-impact, real production AI systems.

If you’re excited about AI/ML, have taken related courses or built projects, and want to learn how to ship things at scale - we’d love to meet you.

What You’ll Do


  • Collaborate with engineers to design, develop, and test inference solutions for state-of-the-art AI models.
  • Implement, optimize, and evaluate AI models using Python, C++, CUDA, and NCCL (previous exposure helpful - deep expertise not required).
  • Monitor and maintain production model-serving systems.
  • Work on new features, fix bugs, and contribute to code reviews.
  • Participate in daily standups, design reviews, and team discussions.
  • Explore new AI/ML techniques and tools, and experiment with improving model performance.
  • Try new things. Ship stuff.


What You Bring


  • Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or a related field (completed or in final year).
  • Strong fundamentals in data structures, algorithms, and software design.
  • Proficiency in Python, including experience with AI/ML libraries and frameworks (e.g., NumPy, pandas, SciPy, TensorFlow, PyTorch).
  • Experience with AI/ML through coursework, research, personal projects, full-time employment, or internships.
  • Familiarity with AI models, Transformers and Diffusers.
  • Experience with version control systems (e.g., Git) and agile development methodologies.
  • Excellent problem-solving skills, with the ability to debug and optimize code.
  • Strong communication and collaboration skills.
  • Curiosity, willingness to learn, and desire to build real systems.

Bonus


  • Exposure to C++, CUDA, or AI inference.
  • Contributions to open-source ML projects.


Why DeepInfra


  • Work on cutting-edge AI model serving - the systems that power the next generation of LLMs and multimodal models.
  • Small team, huge impact: your work ships directly to customers.
  • Opportunity to learn from engineers building high-performance inference at scale.
  • Fast-paced environment with ownership, autonomy, and end-to-end responsibility.

Annual base salary range
$140,000 - $150,000

Deep Infra
Deep Infra

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say