Tvarit

Data Engineer

Pune
Azure Databricks PySpark Apache Spark ETL SQL Server PostgreSQL InfluxDB MongoDB Cassandra Docker Kubernetes Python DevOps MLOps AWS GCP
Description

Data Engineer

Location: Pune

Department: Tech. (Steel)

Experience: 2-4

Job Title: Data Engineer
Location: Pune
Job Type: Full-Time ( WFO )

About TVARIT

TVARIT GmbH specializes in developing and delivering cutting-edge artificial intelligence (AI) solutions for the metal industry, including steel, aluminum, copper, cast iron, and more. Our software products empower customers to make intelligent, data-driven decisions, driving advancements in Predictive Quality (PsQ), Predictive Maintenance (PdM), and Energy Consumption Reduction (PsE), etc.
With a strong portfolio of renowned reference customers, state-of-the-art technology, a talented research team from prestigious universities, and recognition through esteemed awards such as the EU Horizon 2020 AI Prize, TVARIT is recognized as one of the most innovative AI companies in Germany and Europe.
We are seeking a self-motivated individual with a positive "can-do" attitude and excellent oral and written communication skills in English to join our team.

Job Description

We are looking for a Data Engineer with strong expertise in Azure Databricks, PySpark, and distributed computing to develop and optimize scalable ETL pipelines for manufacturing analytics. The role involves working with high-frequency industrial data to enable real-time and batch data processing.

Key Responsibilities

  • Build scalable real-time and batch processing workflows using Azure Databricks, PySpark, and Apache Spark.
  • Perform data pre-processing, including cleaning, transformation, deduplication, normalization, encoding, and scaling to ensure high-quality input for downstream analytics.
  • Design and maintain cloud-based data architectures, including data lakes, lakehouses, and warehouses, following Medallion Architecture.
  • Deploy and optimize data solutions on Azure (preferred), AWS, or GCP, with a focus on performance, security, and scalability.
  • Develop and optimize ETL/ELT pipelines for structured and unstructured data from IoT, MES, SCADA, LIMS, and ERP systems.
  • Automate data workflows using CI/CD and DevOps best practices, ensuring security and compliance with industry standards.
  • Monitor, troubleshoot, and enhance data pipelines for high availability and reliability.
  • Utilize Docker and Kubernetes for scalable data processing.
  • Collaborate with the automation team, data scientists, and engineers to provide clean, structured data for AI/ML models.

Desired Skills and Qualifications

  • Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field.
  • Minimum 2 years of experience in data engineering, with a strong focus on cloud platforms such as Azure (preferred), AWS, or GCP.
  • Proficiency in PySpark, Azure Databricks, Python, and Apache Spark.
  • Expertise in relational databases (e.g., SQL Server, PostgreSQL), time-series databases (e.g., InfluxDB), and NoSQL databases (e.g., MongoDB, Cassandra).
  • Experience in containerization (Docker, Kubernetes).
  • Strong analytical and problem-solving skills with attention to detail.
  • Good to have knowledge of MLOps, DevOps, and model lifecycle management.
  • Excellent communication and collaboration skills, with a proven ability to work effectively as a team player.
  • Comfortable working in a dynamic, fast-paced startup environment, adapting quickly to changing priorities and responsibilities.

Tvarit
Tvarit

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say