Teraswitch

Senior Infrastructure Engineer, Kubernetes / Platform Engineering

Remote Pittsburgh, PA
Kubernetes, Platform Engineering, GitOps, CI/CD, Python, Bash, Ansible, PostgreSQL, MySQL, OpenTelemetry, Prometheus, Grafana, KVM, Linux, Networking
Description

Senior Infrastructure Engineer (Kubernetes / Platform Engineering)

Department: Infrastructure Engineering

Location: Pittsburgh, PA or Hybrid

Employment Type: Full-Time

Engineered to outperform, Teraswitch is on a mission to provide high-performance infrastructure services for critical workloads. With 20+ datacenter locations around the world interconnected by our low-latency global backbone network, we are the class leader in performance bare metal hosting and are rapidly expanding into additional infrastructure services.

The Job

The Infrastructure Engineering team at Teraswitch is responsible for the compute, storage, and platform infrastructure that powers our products and internal operations.

This senior/staff-level role will architect and lead our global, self-hosted Kubernetes deployment and help drive our cloud-native approach to both internal and customer-facing services. You’ll design for a self-hosted (bare metal) environment, without relying on cloud-managed control planes, load balancers, or databases. This role will also build reusable platform capabilities and operating models that make it easier for other teams to adopt Kubernetes and cloud-native practices across the organization.

While this role has a Kubernetes / platform focus, as a senior member of the Infrastructure Engineering team, you’ll also be expected to cross-train and contribute broadly across infrastructure domains as we grow the team.

What You’ll Do

  • Architect and lead our globally distributed, self-hosted Kubernetes deployment (including provisioning and management, multi-site app deployments, HA/DR strategy, failure domains, etc.)

  • Define and implement our Kubernetes storage and networking strategies

  • Define and implement our Kubernetes security posture: secure network policies, RBAC, container security, secrets management, vulnerability management, compliance-oriented controls and reporting

  • Drive modern, cloud-native observability/monitoring for the platform and workloads

  • Deliver platform capabilities for developers: namespaces/tenancy model, “paved road” patterns and templates, standard ingress/certs/secrets approaches, documentation

  • Collaborate with and support the Software team and other internal stakeholders on cloud-native deployments, acting as an internal cloud-native SME

  • Cross-train with the rest of the Infrastructure Engineering team and contribute broadly to the compute, storage, and platform infrastructure that powers Teraswitch products and internal operations

Basic Qualifications

  • Strong experience operating production Kubernetes, including cluster lifecycle responsibilities (e.g. provisioning, management, upgrades, observability, troubleshooting, storage, networking)

  • Experience in self-hosted Kubernetes architectures (i.e. without relying on cloud-managed control planes and managed apps)

  • Experience with internal platform capabilities (GitOps/CI/CD integration, paved roads, developer enablement)

  • Experience with cloud-native observability/monitoring (metrics, logs, traces, alerting)

  • Strong Linux systems and networking expertise

  • Comfortable working in a fast-paced, results-oriented environment

  • Committed to operational best practices and security by design

Preferred Skills/Experience

You do not need all of these; depth in a few areas plus strong fundamentals is sufficient:

  • Experience with multi-cluster, multi-region Kubernetes management (including app deployments and HA/DR strategy)

  • Deep Kubernetes storage knowledge; hands-on experience managing and integrating persistent software-defined cluster storage (e.g. Longhorn, Ceph, VAST)

  • Deep Kubernetes networking knowledge; hands-on experience with advanced cluster networking (e.g. BGP and other mechanisms for workload HA)

  • Solid understanding of and experience implementing Kubernetes security best practices (secure network policies / workload security, RBAC, secrets management, vulnerability management, compliance-oriented controls/reporting)

  • Cloud-native database self-hosting / management experience (MySQL, Postgres), for example using tools like CloudNativePG or Vitess

  • Experience with KubeVirt and/or other VM-on-Kubernetes deployments

  • Production-grade, cloud-native observability design (metrics/logs/traces correlation, OpenTelemetry pipelines, Prometheus/Grafana)

  • Service / hosting provider experience (multi-tenant systems, automation-first operations, scalable and secure design)

  • Automation experience: scripting (Python, Bash, etc.) and/or configuration management (Ansible, etc.)

  • Experience with CI/CD and/or GitOps deployment models and workflows

  • Experience in other Infrastructure team domains, e.g. distributed storage systems (block or object storage services), KVM-based virtualization (cloud services), and/or bare metal automation / fleet management

On-Call / Operations

Participate in an on-call rotation supporting critical production systems.

Location

Preference given to full-time onsite candidates in Pittsburgh, PA, followed by hybrid candidates.

Compensation and Benefits

Along with a competitive pay scale, full-time Teraswitch employees are eligible for the following benefits:

  • Health, Dental, and Vision Insurance

  • 401(k) with company profit sharing

  • PTO and 11 Company Paid Holidays
