Responsibilities
- Architect and maintain scalable, reusable AI platforms and tools that support the entire AI lifecycle, including data ingestion, model training or fine-tuning, evaluation, deployment, and monitoring for both traditional and generative AI models.
- Build standardized, production-ready AI workflows and templates using tools like Airflow and MLflow to enable rapid experimentation and deployment
- Implement robust CI/CD pipelines, Docker containerization, model registries, and experiment tracking to support reproducibility, scalability, and governance in ML and genAI
- Optimize genAI technologies, including transformers, embeddings, vector databases, and real-time retrieval-augmented generation (RAG) systems
- Automate and streamline ML and genAI model training, inference, deployment, and versioning workflows, ensuring consistency, reliability, and adherence to industry best practices
- Ensure reliability, observability, and scalability of production ML and genAI workloads by implementing comprehensive monitoring, alerting, and continuous performance evaluation
- Integrate infrastructure components such as Kubernetes orchestration, Ray and cloud solutions (AWS/Azure) for robust production environments
- Drive infrastructure optimization for generative AI use-cases, including efficient inference techniques (batching, caching, quantization), fine-tuning, prompt management, and model updates at scale
- Partner with data engineering, product, infrastructure, and other stakeholder teams to align AI platform initiatives with broader company goals, infrastructure strategy, and innovation roadmap
- Contribute actively to internal documentation, onboarding, and training programs, promoting platform adoption and continuous improvement
Qualifications
- Software engineering background with experience in building distributed systems or platforms designed for machine learning and AI workloads
- Expert-level proficiency in Python and familiarity with ML frameworks (TensorFlow, PyTorch), infrastructure tooling (MLflow, Kubeflow, Ray), and popular APIs (Hugging Face, OpenAI, LangChain)
- Experience implementing modern MLOps practices, including model lifecycle management, CI/CD, Docker, Kubernetes, model registries, and infrastructure-as-code tools (Terraform, Helm)
- Demonstrated experience working with cloud infrastructure, ideally AWS or GCP, including Kubernetes clusters (GKE/EKS), serverless architectures, and managed ML services (e.g., Vertex AI, SageMaker)
- Proven experience with generative AI technologies: transformers, embeddings, prompt engineering strategies, fine-tuning vs. prompt-tuning, vector databases, and retrieval-augmented generation (RAG) systems
- Experience designing and maintaining real-time inference pipelines, including integrations with feature stores, streaming data platforms (Kafka, Kinesis), and observability platforms
- Familiarity with SQL and data warehouse modeling; capable of managing complex data queries, joins, aggregations, and transformations
- Solid understanding of ML monitoring, including identifying model drift, decay, latency optimization, cost management, and scaling API-based genAI applications efficiently
Perks and Benefits @ Clari
- Flexible working hours and hybrid work opportunities
- Life and accidental coverage
- Mental health support provided by Silver Oak Health
- Pre-IPO stock options
- Well-being and professional development stipends
- 100% paid parental leave
- Discretionary paid time off, monthly ‘take a break’ days, and Focus Fridays
- Focus on culture: Charitable giving match, plus in-person and virtual events

0 applies
14 views
Other Jobs from Clari
Software Engineer II, DevOps (CI/CD) - Hybrid, Poland
Lead IT Support Engineer
Staff Software Engineer, Data Platform Connect - Hybrid, Poland
Senior Sales Engineer
Principal Machine Learning Engineer, Copilot
Similar Jobs
Software Engineer (Java)
Senior Site Reliability Engineer
Mid-Level Automation Developer
Senior Software Test Development Engineer - Deep Learning
Principal DevOps Engineer - IoT
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say