FlexAI

Senior DevOps Engineer/SRE

Bangalore, India
Docker Kubernetes Python Bash Go Rust AWS Azure GCP Terraform CI/CD
Description

Senior DevOps Engineer/SRE

Location: Bangalore, India

Department: InfraOps

Role Overview

FlexAI is looking for a Senior DevOps / SRE Engineer to build and operate the infrastructure powering our AI and PaaS platform.


You’ll work closely with developers to ensure our systems are reliable, performant, and scalable, while enabling fast product iteration. This role is hands-on and execution-focused, with opportunities to contribute to system design and reliability practices as we scale.


What You’ll Do

Build & Operate Infrastructure:

  • Build and maintain infrastructure for our AI and PaaS platform
  • Deploy and operate Kubernetes clusters and containerized services
  • Implement Infrastructure as Code using Pulumi (or similar tools)

Reliability & SRE Practices:

  • Help define and implement SLIs, SLOs, and error budgets
  • Improve system reliability, availability, and performance
  • Participate in on-call rotations, incident response, and postmortems

CI/CD & Automation:

  • Build and improve CI/CD pipelines for reliable and fast releases
  • Automate operational workflows and reduce manual toil
  • Contribute to GitOps and platform engineering practices

Observability & Performance:

  • Implement and maintain observability using VictoriaMetrics, Grafana (metrics, logs, traces)
  • Monitor systems and troubleshoot performance issues (latency, throughput, cost)

Collaboration:

  • Work closely with developers, platform, and AI teams to support production systems
  • Help debug issues across infrastructure and application layers
  • Contribute to improving engineering productivity and developer experience

What You’ll Need to Be Successful

  • 4+ years of experience in DevOps, SRE, or Infrastructure Engineering
  • Experience operating production systems at scale
  • Hands-on experience with:
    • Kubernetes & containers
    • Infrastructure as Code (Pulumi, Terraform, etc.)
    • Cloud or hybrid environments (AWS, GCP, Azure, or on-prem)
    • Observability tools (Prometheus, Grafana, OpenTelemetry)
  • Experience with CI/CD systems and automation
  • Proficiency in Python, Go, or Bash
  • Strong debugging and problem-solving skills
  • Familiarity with SLOs and reliability practices
  • Experience working in startup or fast-paced environments
  • Comfortable leveraging AI coding tools and agents

Nice to Have

  • Experience with AI/ML infrastructure or GPU workloads
  • Familiarity with distributed systems or compute platforms
  • Exposure to platform engineering concepts
  • Experience supporting systems from Beta to production

Why FlexAI

  • Work on cutting-edge AI infrastructure
  • Build systems that power developers and enterprises
  • High ownership, fast execution, real impact
  • Collaborative, high-caliber team

About the Company

About FlexAI

Build and Deploy AI the right way, anywhere.

The FlexAI Compute Infrastructure Platform provides an "end-to-end AI compute layer" for running and managing workloads across any cloud, any GPU, and any deployment model (public, hybrid, or on-prem). It brings together "1-click simplicity" for users with "enterprise-grade orchestration, security, and automation" under the hood.


Founded by Brijesh Tripathi, who bring experience from Nvidia, Apple, Tesla, Intel and Zoox, FlexAI is not just building a product – we’re shaping the future of AI. Our teams are strategically distributed across Silicon Valley and Bengaluru, united by a shared mission: to deliver more compute with less complexity.

 If you're passionate about shaping the future of artificial intelligence, driving innovation, and contributing to a sustainable and inclusive AI ecosystem, FlexAI is the place for you !

FlexAI
FlexAI

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say