What will you do?
- Design, deploy, and operate Kubernetes clusters across AWS, Azure, and GCP. Optimize cluster performance, ensure high availability, and implement robust security practices.
- Build and maintain cloud-native infrastructure components (load balancers, networking, storage, etc.) to support applications running on Kubernetes. Leverage Infrastructure as Code (IaC) with Terraform to automate and manage infrastructure provisioning and configuration.
- Embrace GitOps principles using ArgoCD to automate deployments and configuration changes and ensure consistency between the desired and actual system state.
- Establish comprehensive monitoring, logging, and alerting systems to gain insights into platform health and performance. Troubleshoot incidents swiftly and apply SRE principles to improve reliability and resilience.
- Develop automation scripts and tools (Python, Go, or other languages) to streamline workflows, eliminate manual tasks, and reduce operational overhead.
- Partner closely with development teams to understand their needs, provide guidance on platform best practices, and enable smooth integration and deployment of their applications.
- Implement and maintain stringent security measures for Kubernetes and cloud environments, ensuring compliance with industry standards and data protection regulations.
- Analyze resource usage and implement optimization strategies to maximize performance while controlling cloud costs.
- Participate in an on-call rotation, troubleshooting and resolving production issues promptly.
What makes you a match?
- 3+ years of experience working with Kubernetes in production environments. Deep understanding of cluster operations, networking, storage, and security within Kubernetes.
- Strong knowledge of AWS, Azure, and GCP, including core services, networking concepts, and security best practices.
- Proven experience implementing GitOps workflows with ArgoCD and managing infrastructure using Terraform.
- Fluency in at least one programming language (such as Python, Go, or Java) for automation, scripting, and tool development.
- Familiarity with SRE practices like SLOs (Service Level Objectives), error budgeting, and blameless postmortems.
- Excellent analytical and troubleshooting skills to identify and resolve issues in complex cloud environments.
- Ability to communicate effectively with development, operations, and security teams to drive cross-functional initiatives.
- Ability to work from 8:30 PM to 5:30 AM IST to provide coverage for US time zones.