Gather AI

Cloud Engineer, Platform & Infrastructure

Remote; Hyderabad, Telangana
Azure, AWS, Kubernetes, Docker, Terraform, GitHub Actions, GitLab CI, Prometheus, ELK, OpenTelemetry, Python
Description

Cloud Engineer (Platform & Infrastructure)

Location: Hyderabad, Telangana, India; Open to Remote (India)

Department: Full Stack

About Us

Are you ready to build the future of supply chain? At Gather AI, we're not just creating software; we're pioneering a new era of warehouse intelligence. We've developed a groundbreaking, vision-powered platform that uses autonomous drones and existing equipment to capture real-time data, completely digitizing workflows that have historically been manual and error-prone. This means facilities operate smarter, safer, and more efficiently, ultimately redefining "on-time, in full" delivery.

If you're looking for an opportunity to contribute to truly transformative technology and make a significant impact in a vital industry, Gather AI is the place for you. We're leading the charge in the rapidly evolving robotics industry, and we invite you to join us in reshaping the global supply chain, one intelligent warehouse at a time.

About the Team

This role sits within the Backend and Platform Engineering organization. You'll work day-to-day alongside the Fullstack Engineering team, ensuring application services have the cloud infrastructure they need to scale safely and deploy reliably. You'll also partner closely with the ML Systems Engineering (Ops) team, enabling the infrastructure capabilities required for production ML pipelines, model serving, and data workloads. Cross-functionally, you'll collaborate with QA, Release Engineering, and Platform and Security stakeholders to ensure cloud environments support stable testing pipelines, access control, secrets management, and operational governance.

About the Role

We are looking for a Cloud Engineer (Platform & Infrastructure) to help mature our cloud operations into a structured and scalable platform. Our foundational infrastructure is already in place and actively supporting production workloads, but many current practices evolved organically during earlier growth stages. Rather than building from scratch, you'll evolve an existing production environment by introducing stronger operational patterns, improving deployment safety, and ensuring our infrastructure layer reliably supports increasing system scale. This role offers meaningful ownership of the infrastructure backbone supporting a platform that combines real-time application systems with machine learning workloads, and the opportunity to influence how systems are deployed, operated, and scaled as the organization grows.

What You'll Do

  • Review and rationalize current Azure and AWS environments, identifying configuration drift, security gaps, and operational inconsistencies, and establish clear configuration standards across cloud accounts
  • Introduce repeatable Infrastructure-as-Code patterns to ensure cloud resources are provisioned, versioned, and audited through automated workflows
  • Strengthen CI/CD pipelines for infrastructure and application deployment to reduce manual operations and increase release safety across both application services and ML workloads
  • Establish consistent logging, metrics, and alerting practices across infrastructure and container workloads to improve operational visibility
  • Audit and improve cloud security practices, including IAM policies, secrets management, network segmentation, and operational access controls
  • Evaluate current infrastructure architecture and introduce patterns that enable workloads to operate portably across both Azure and AWS environments
  • Improve Kubernetes platform reliability by refining autoscaling policies, workload isolation, and cluster lifecycle management
  • Partner with Fullstack and ML teams to reduce infrastructure friction around environments, networking, and resource provisioning

What You'll Need

  • Bachelor's degree in Computer Science, Engineering, or a related field
  • 5+ years of experience operating production cloud infrastructure at scale
  • Deep experience with at least one major cloud provider (Azure or AWS) and working familiarity with the other
  • Hands-on experience with Kubernetes and Docker for running containerized workloads in production environments
  • Proficiency with Terraform or equivalent Infrastructure-as-Code tooling for provisioning and managing cloud infrastructure
  • Experience implementing automated deployment pipelines using tools such as GitHub Actions, GitLab CI, or similar platforms
  • Strong operational mindset with a focus on reliability, automation, and clear technical documentation

Nice to Have

  • Experience with observability tooling such as Prometheus, ELK, OpenTelemetry, or similar logging, metrics, and monitoring systems
  • Familiarity with supporting ML infrastructure workloads, including pipeline orchestration, model deployment, and scalable inference environments
  • Experience working in logistics, robotics-adjacent platforms, or real-time distributed systems
  • Track record of translating application requirements into secure, reliable, and operationally safe infrastructure architecture
  • Exposure to cloud cost visibility and optimization practices
  • Experience introducing infrastructure governance standards, including templates, security baselines, and operational documentation