Syndesus

Principal Data Architect

San Jose, CA Newport Beach, CA
Python SQL Scala Java Databricks Snowflake Spark Delta Lake Apache Iceberg Hudi Kafka Kinesis Airflow dbt Dagster Unity Catalog Alation Collibra API
Description

Principal Data Architect

Location: San Jose or Newport Beach, CA

Department: External Positions

Location Type: HYBRID

Employment Type: FULL_TIME

Principal Data Architect – Modern Data Platforms & Data Mesh San Jose, CA | Newport Beach, CA | Hybrid (2–3 days onsite)

Role Overview
We are seeking a highly experienced, hands-on Principal Data Architect to define the strategic vision for a next-generation, petabyte-scale data platform and lead the transition from centralized to decentralized data management. This is a principal-level role — the bar goes well beyond senior. We are looking for someone with demonstrated long tenure at complex, large-scale data environments who has driven real architectural transformation, not just advised on it.
The core mandate is data decentralization — moving from a monolithic data warehouse model to a Data Mesh paradigm — alongside modernizing data security, governance, and real-time capabilities. Consulting backgrounds and serial short-tenure candidates will face significant additional scrutiny; we want people who have gone deep and have the organizational impact to show for it.

What You'll Do
Strategic Architecture & Data Mesh
  • Lead the design and implementation of a scalable, decentralized Data Mesh architecture; define domain boundaries, data products, and federated governance standards
  • Drive the organization from centralized to decentralized data management — change management experience is as important as technical depth
  • Establish data contracts and self-service analytics capabilities across the organization
Hands-On Engineering Leadership
  • Architect and prototype resilient data pipelines using Databricks, Snowflake, and Spark, ensuring high availability and low latency
  • Drive adoption of open table formats (Delta Lake, Apache Iceberg) for ACID compliance, time travel, and schema evolution across the data lakehouse
  • Troubleshoot complex performance bottlenecks in distributed systems
Data Security & Governance
  • Architect advanced data security patterns including dynamic data masking, tokenization, and row-level security
  • Implement centralized discovery and access control using Unity Catalog or equivalent enterprise data catalogs
  • Implement technical controls for data privacy regulations (GDPR, CCPA) including encryption at rest and in transit
Real-Time & Event-Driven Systems
  • Design and deploy high-throughput streaming architectures using Kafka or Pulsar for real-time data ingestion and processing
  • Deep understanding of workflow orchestration tools (Airflow, dbt, Dagster)
MLOps & Analytics Integration
  • Build robust MLOps pipelines and feature stores bridging data engineering and production AI/ML
  • Collaborate with data scientists to operationalize machine learning models end-to-end
Technical Leadership & Culture
  • Mentor senior data engineers and architects; foster a culture of technical excellence and "Data as a Product" thinking
  • Treat data pipelines as code — strict CI/CD, unit testing, and version control practices expected

About You
Experience & Background
  • 10+ years of software and data engineering experience, with a significant portion in technical leadership or architecture
  • FAANG or Big Tech background strongly preferred
  • Proven track record at large-scale enterprise environments — petabyte-scale data infrastructure is the baseline expectation
  • Long tenure at key roles is a strong positive signal; we are not looking for contract-to-contract backgrounds or candidates with a pattern of 1–2 year stints
  • Consulting backgrounds are not disqualifying but will be scrutinized — be prepared to speak specifically to what you owned vs. what you touched
Technical Requirements
  • Data Mesh: proven experience transforming monolithic data warehouses into decentralized Data Mesh architectures including federated governance
  • Platforms: deep hands-on expertise with Databricks and Snowflake including compute optimization and cost management
  • Big Data: strong proficiency in Apache Spark, Delta Lake, Iceberg, and Hudi
  • Coding: expert-level Python, SQL, and Scala/Java nice to have
  • Streaming: Kafka or Kinesis; orchestration via Airflow, dbt, or Dagster
  • Governance: hands-on with Unity Catalog, Alation, or Collibra
  • MLOps: solid understanding of model registry, feature stores, and model serving

Location & Work Model
Hybrid position based out of San Jose, CA or Newport Beach, CA — candidates must be within commutable distance of one of these two offices. Onsite 2–3 days per week expected. Out-of-area candidates considered only in exceptional circumstances and held to a significantly higher bar.

Compensation
Competitive base compensation commensurate with experience. 25% target bonus. Full benefits including medical/dental/vision, retirement plans, paid parental leave, and paid time off.

Syndesus
Syndesus

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say