AI Data Engineer
Location: Sofia, Bulgaria
Time Type: Full time
Job Description
Role Description
We are seeking a mid-level Data Engineer to join our AI team. In this role, you will build, operate, and enhance the data infrastructure supporting our Agentic AI initiatives. You will collaborate with ML engineers, AI scientists, and product managers to deliver reliable data pipelines that enable autonomous and semi-autonomous AI agents. As part of the R&DS AI Innovation Program, you will contribute to production-ready, secure, and compliant data solutions while progressively growing toward deeper architectural ownership.
Key Responsibilities
Mandatory
Design, develop, and maintain scalable data pipelines and ETL/ELT processes supporting AI research, prototyping, and production use cases.
Collaborate with AI scientists and engineers to translate data requirements into ingestion, transformation, and serving solutions.
Apply data governance and security controls ensuring compliance, auditability, and protection of sensitive information.
Monitor, troubleshoot, and resolve data pipeline failures, performance issues, and schema changes.
Continuously improve reliability through testing, observability, documentation, and automation.
Design and maintain efficient data models (e.g., star schemas, feature-ready datasets, semantic layers) supporting analytics, ML workflows, and AI agent operations.
Implement automated data validation, schema checks, and pipeline testing to ensure high-quality data delivery across systems.
Preferred
Contribute to data architectures supporting agent workflows, including training data preparation, retrieval layers, and inference logging.
Build and enhance pipelines supporting near real-time agent interactions and feedback signals.
Strong SQL skills, with experience designing analytical queries and working with relational and NoSQL databases.
Implement and operate vector embedding stores, knowledge graph ingestion pipelines, and retrieval mechanisms.
Implement data quality controls suitable for ML/LLM pipelines in regulated environments.
Assist with performance tuning to reduce latency in agent-driven workflows.
Familiarity with infrastructure-as-code and automated deployment for data pipelines.
Qualifications
Education
Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field.
Experience
Typically 3+ years of professional experience in data engineering, including production-grade pipeline development.
Programming & Technologies
Strong proficiency in Python; working experience with Java or Scala.
Solid knowledge of SQL and experience with NoSQL databases.
Familiarity with data warehousing and lakehouse platforms.
Cloud & Data Platforms
Hands-on experience with at least one major cloud platform (AWS, Azure, or GCP).
Experience with orchestration frameworks and CI/CD practices for data pipelines.
Preferred Qualifications
Familiarity with vector databases and embedding lifecycle management.
Experience with containerization and orchestration tools (Docker, Kubernetes).
Understanding of RAG data pipelines, LLM fine-tuning datasets, and evaluation signals.
Exposure to streaming or event-driven data processing systems.
IQVIA is a leading global provider of clinical research services, commercial insights and healthcare intelligence to the life sciences and healthcare industries. We create intelligent connections to accelerate the development and commercialization of innovative medical treatments to help improve patient outcomes and population health worldwide. Learn more at https://jobs.iqvia.com
IQVIA is committed to integrity in our hiring process and maintains a zero tolerance policy for candidate fraud. All information and credentials submitted in your application must be truthful and complete. Any false statements, misrepresentations, or material omissions during the recruitment process will result in immediate disqualification of your application, or termination of employment if discovered later, in accordance with applicable law. We appreciate your honesty and professionalism.
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
