Auditoria AI

Data Engineering Intern

Santa Clara, CA
SQL Python Snowflake MySQL PostgreSQL dbt Airflow Pinecone Weaviate LLM AI RAG
Description

Data Engineering Internship

Location: Santa Clara, CA

Department: Engineering

Location Type: HYBRID

Employment Type: INTERN

About the Role

We're scaling an AI-native enterprise SaaS platform that powers agentic automation for corporate finance teams at Fortune 500 companies. As a Data Engineering Intern, you'll build the data infrastructure that makes our agents work.  Clean, well-modeled, LLM-ready data flowing from customer ERPs into Snowflake, through our semantic layer, and into the retrieval pipelines that ground every decision our agents make.

You'll work across the modern data stack and implement medallion architecture patterns that serve both operational systems and AI/ML workloads.

Key Responsibilities

  • Building ingestion pipelines from customer ERPs and finance systems into data warehouse
  • Writing transformations in our Bronze, Silver, Gold medallion architecture, with an eye toward making data LLM-ready: well-named, well-typed, well-documented, and semantically meaningful
  • Extending the semantic layer that powers natural-language analytics, this is what lets non-technical finance users ask questions and get grounded answers
  • Preparing and structuring data for retrieval, embeddings, vector search, and context assembly for RAG pipelines that feed our agents
  • Implementing data quality checks, lineage, and monitoring so agents never act on bad data
  • Tuning queries and warehouse usage for both cost and latency
  • Contributing to technical documentation and participating in code reviews


Qualifications

  • Pursuing (or recently graduated) a Bachelor's or Master's in Computer Science, Data Engineering, Statistics, or a related field
  • Solid SQL skills: joins, window functions, and a basic grasp of how to read a query plan
  • Hands-on experience with at least one relational database (MySQL, Postgres, or similar) through coursework, projects, or prior internships
  • Comfortable writing Python for data processing and scripting
  • Genuine interest in LLMs and AI systems, you've played with OpenAI/Anthropic APIs, built a RAG project, or thought seriously about how data shape affects model behavior
  • Excellent communication, you can explain what you built and why
  • Must be currently authorized to work in the United States without employer sponsorship, as we are unable to sponsor or transfer visas for this position
  • Must be located in or within commuting distance of Santa Clara, CA to be considered


Preferred Qualifications

  • A graduation date of 2026 or late 2025
  • Exposure to Snowflake, BigQuery, or Databricks
  • Experience with dbt, Airflow, or another orchestration/transformation tool
  • Experience with vector databases (Pinecone, Weaviate, pgvector, Snowflake Cortex Search) or embedding workflows
  • Understanding of dimensional modeling (star/snowflake schemas)
  • Any prior internship or substantive personal project in data engineering
  • Authorized to work in the United States without the need for future sponsorship
Auditoria AI
Auditoria AI

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say