Senior Data Engineer - EDA Datacenter Analytics and Observability
Location: Santa Clara, CA, US; Westford, MA, US; Austin, TX, US; Hillsboro, OR, US
Time Type: Full time
Job Description
NVIDIA’s Hardware Infrastructure organization is seeking a Senior Data Engineer to build and evolve analytics-ready data platforms that power observability, reliability analysis, and capacity forecasting for EDA datacenters. In this role, you will focus on transforming large-scale observability and telemetry data into trusted, well-modeled datasets that enable data scientists, analysts, and engineers to drive insights across global CPU and GPU compute clusters. We work closely with observability, infrastructure, and data science teams to ensure that data from EDA workloads and datacenter hardware is high quality, accessible, and optimized for analytical and predictive use cases.
What You’ll Be Doing:
Design, build, and maintain analytics-focused data pipelines that ingest, transform, and curate observability data from EDA datacenters
Develop reliable ingestion pipelines for metrics, logs, traces, and hardware health telemetry generated by large-scale CPU and GPU clusters
Partner with observability engineers to integrate data from tools such as Prometheus, Grafana, Elastic/OpenSearch, and Spark-based platforms into unified analytical datasets
Model and organize data to support exploratory analysis, reliability modeling, forecasting, and long-term trend analysis
Build and optimize batch and streaming workflows that support both near-real-time analytics and historical analysis
Implement data quality checks, validation frameworks, and monitoring to ensure analytical accuracy and consistency
Define data retention, aggregation, and enrichment strategies that balance analysis needs, system performance, and storage costs
Enable self-service analytics by improving data discoverability, documentation, and usability
Collaborate with data scientists and analysts to understand analytical requirements and evolve datasets to support new models and insights
Continuously improve pipeline scalability, reliability, and performance as datacenter footprint and workload complexity grow
What We Need to See:
MS (preferred) or BS in Computer Science or a related field (or equivalent experience), with 10+ years of experience designing, building, and operating large-scale data pipelines and data platforms for distributed systems or infrastructure data
Proficiency in Python and SQL, with experience supporting analytical and exploratory workloads
Hands-on experience with distributed data processing frameworks such as Spark or similar technologies
Familiarity working with observability and telemetry data, including metrics, logs, traces, and time-series data
Experience designing data models and schemas that support flexible analysis and forecasting
Ability to take ownership of data engineering initiatives and drive them end-to-end in collaboration with multi-functional partners
Experience implementing data quality, validation, and monitoring for analytics pipelines
Strong communication and collaboration skills, particularly when working with engineering and infrastructure teams
Adaptability in fast-paced environments with evolving analytical and operational needs
Ways to Stand Out from the Crowd:
Experience supporting datacenter infrastructure analytics, hardware reliability programs, or workload performance analysis
Familiarity with EDA workflows, HPC environments, or GPU-accelerated compute platforms
Experience integrating or operating observability stacks (Prometheus, Grafana, Elastic/OpenSearch, Kafka, Spark, or similar tools)
Background in large-scale distributed systems or data platforms
A track record of improving analytics velocity and reliability through better data foundations
You will also be eligible for equity and benefits.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
