Data Engineering Internship
Location: Santa Clara, CA
Department: Engineering
Location Type: HYBRID
Employment Type: INTERN
About the Role
Key Responsibilities
- Building ingestion pipelines from customer ERPs and finance systems into data warehouse
- Writing transformations in our Bronze, Silver, Gold medallion architecture, with an eye toward making data LLM-ready: well-named, well-typed, well-documented, and semantically meaningful
- Extending the semantic layer that powers natural-language analytics, this is what lets non-technical finance users ask questions and get grounded answers
- Preparing and structuring data for retrieval, embeddings, vector search, and context assembly for RAG pipelines that feed our agents
- Implementing data quality checks, lineage, and monitoring so agents never act on bad data
- Tuning queries and warehouse usage for both cost and latency
- Contributing to technical documentation and participating in code reviews
Qualifications
- Pursuing (or recently graduated) a Bachelor's or Master's in Computer Science, Data Engineering, Statistics, or a related field
- Solid SQL skills: joins, window functions, and a basic grasp of how to read a query plan
- Hands-on experience with at least one relational database (MySQL, Postgres, or similar) through coursework, projects, or prior internships
- Comfortable writing Python for data processing and scripting
- Genuine interest in LLMs and AI systems, you've played with OpenAI/Anthropic APIs, built a RAG project, or thought seriously about how data shape affects model behavior
- Excellent communication, you can explain what you built and why
- Must be currently authorized to work in the United States without employer sponsorship, as we are unable to sponsor or transfer visas for this position
- Must be located in or within commuting distance of Santa Clara, CA to be considered
Preferred Qualifications
- A graduation date of 2026 or late 2025
- Exposure to Snowflake, BigQuery, or Databricks
- Experience with dbt, Airflow, or another orchestration/transformation tool
- Experience with vector databases (Pinecone, Weaviate, pgvector, Snowflake Cortex Search) or embedding workflows
- Understanding of dimensional modeling (star/snowflake schemas)
- Any prior internship or substantive personal project in data engineering
- Authorized to work in the United States without the need for future sponsorship
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
