We are looking for a Lead Data Engineer to join our Content Tech Big Data Engineering Team in India. This is an amazing opportunity to work on Real World Data using big data technologies.
We would love to speak with you if you have skills in Python and Spark and experience building big data platforms.
About You – experience, education, skills, and accomplishments
Bachelor’s Degree or equivalent in computer science, software engineering, or a related field
At least 5 years of relevant experience.
Good experience working with Python, PySpark, AWS, AWS Glue, EMR and Delta Lake.
Good knowledge of ETL, including the ability to read and write efficient, robust code, follow or implement best practices and coding standards, design/implement common ETL strategies (CDC, SCD, etc.), and create reusable/maintainable jobs.
Solid background in database systems (such as Postgres, Oracle, Snowflake/Databricks) along with strong knowledge of PL/SQL and SQL.
Experience handling large volumes of data and building data pipelines.
Good knowledge of Agile and other SDLC methodologies.
Exposure to a data warehouse / BI project in the healthcare domain.
Strong oral and written communication skills.
It would be great if you also had . . .
Familiarity with
s would be added advantage.
Experience in building big data platforms.
Understanding of healthcare data.
What will you be doing in this role?
As a member of the Data Engineering Team, you’ll step into a key role on an expanding data engineering team to build our data platforms, data pipelines, and data transformation capabilities.
Define and implement our data platform strategy on the cloud, have a meaningful impact on our customers, and work in our high-energy, innovative, fast-paced Agile culture.
Drive rapid prototyping and development with Product and Technical teams in building and scaling high-value medical data capabilities.
Interface with other technology teams to extract, transform, and load data from a wide variety of data sources using the Apache suite (Airflow, Spark), SQL, Python, ETL, and AWS big data technologies.
Create and support batch and real-time data pipelines, along with ongoing data monitoring and validation, built on AWS/Snowflake/Apache technologies for medical data from many different sources.
Conduct functional and non-functional testing, including writing test scenarios and test scripts.
Evaluate existing applications to update and add new features to meet business requirements.
Product you will be developing
Big Data Platforms
About the Team
The stakeholders for the role are the Analytics team, Application Teams on the business side, Enterprise Solutions Teams, other cross-functional internal IT teams, and external vendors and partners. The team consists of 20+ engineers and reports to the Director of Technology.
Hours of Work
Full-time
40 hrs/week
Hybrid working model
At Clarivate, we are committed to providing equal employment opportunities for all persons with respect to hiring, compensation, promotion, training, and other terms, conditions, and privileges of employment. We comply with applicable laws and regulations governing non-discrimination in all locations.
