Production Engineer (DevOps & Backend) - Amari AI
Department: Amari AI
Location: San Francisco
Employment Type: FullTime
About Us
Global trade still runs on outdated, manual workflows - we are fixing that by building AI agents for the logistics industry. Our AI works alongside humans, automating document-heavy tasks so companies can process shipments faster and with fewer errors.
We have moved past the "zero-to-one" phase and have achieved clear product-market fit. We are currently seeing rapid traction with >100% MoM revenue growth and are already deployed with customers processing meaningful operational volume. We've raised $5M from First Round Capital and Pear VC and are now scaling our platform's breadth and depth. Our deeply technical team comes from Google, LinkedIn, Salesforce and top schools and AI research labs.
The Role
We are looking for a Production Engineer who lives at the intersection of software development and systems engineering. Your mission is to ensure our production environment is rock-solid, automated, and observable. You will own our CI/CD pipelines, manage our AI infrastructure, and build the internal tools that empower our development team to ship code faster and more reliably.
Key Responsibilities
1. Reliability & Infrastructure (The Core)
Availability: Own the "uptime" of our services. Design and implement self-healing systems to minimize downtime and manual intervention.
CI/CD & Deployments: Architect and manage robust deployment pipelines to ensure feature releases are seamless and reversible.
AI Infrastructure: Manage specialized pipelines for AI and human-in-the-loop systems
Databases and compliance: Manage database operations, performance tuning, backups, compliance.
Scalability: Monitor system performance and proactively scale infrastructure to handle traffic spikes.
2. Observability & Metrics
Monitoring: Build and maintain comprehensive dashboards using tools like Prometheus, Grafana, or Datadog.
Alerting: Define and implement "Golden Signals" (Latency, Traffic, Errors, and Saturation) to ensure we know about issues before our customers do.
Incident Response: Lead the "Post-Mortem" process - analyzing why things broke and writing code to ensure they never break the same way twice.
3. Internal Tooling & Backend Development
Custom Tooling: Use your backend skills (Python preferably) to build internal CLI tools, automated scripts, and status dashboards.
Developer Experience: Act as a bridge for the dev team, making "the right way to deploy" the "easiest way to deploy."
Technical Requirements
Backend Proficiency: Strong experience in at least one backend language (e.g., Python, Go, Java) to contribute to internal tools and understand application logic.
Infrastructure as Code (IaC): Hands-on experience with Terraform, CloudFormation, or Ansible.
Containerization: Deep knowledge of Docker and orchestration (Kubernetes/ECS).
Cloud Platforms: Good-level knowledge of GCP
CI/CD Tools: Experience with GitHub Actions, GitLab CI, or Jenkins.
Success Metrics (The "How We'll Measure You")
To be successful in this role, you will be responsible for improving and maintaining:
MTTD/MTTR: Mean Time to Detect and Mean Time to Recover from incidents.
Deployment Frequency: How often we can safely ship code to production.
Change Failure Rate: The percentage of deployments that result in a rollback or failure.
SLA/SLO Compliance: Meeting our uptime and performance targets for customers.
Is this the right fit?
You are a great fit if: You find yourself "automating away" repetitive tasks and get genuinely excited when you see a perfectly tuned Grafana dashboard. You don't just want to write code; you want to see that code survive and thrive in the wild.
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
