Citi

Python Data Engineer

Remote Chennai, Tamil Nadu
Python Spark Scala SQL PostgreSQL Oracle SQL Server MySQL Snowflake Google BigQuery AWS Redshift Delta Lake Apache Airflow Azure Data Factory AWS Step Functions GCP Cloud Composer AWS Azure GCP Kubernetes Git Linux
Description

Python Data Engineer - Engineer Intmd Analyst - C11 - CHENNAI

Location: Chennai Tamil Nadu India

Remote Type: Hybrid

Time Type: Full time

Job Description

The Engineer Intmd Analyst is an intermediate level position responsible for a variety of engineering activities including the design, acquisition and development of software and infrastructure in coordination with the Technology team. The overall objective of this role is to ensure quality standards are being met within existing and planned frameworks.

Responsibilities:

  • Design, develop, and optimize scalable data pipelines and ETL/ELT processes using Apache Spark (preferably with Scala or Python) to ingest, transform, and load large datasets from diverse sources.
  • Write, optimize, and troubleshoot complex SQL queries, stored procedures, and functions for data extraction, transformation, and reporting within relational and analytical databases.
  • Develop and maintain data models, schema definitions, and database objects in various data storage solutions (e.g., data warehouses, data lakes, operational databases).
  • Ensure data quality, integrity, accuracy, and consistency across all data assets through robust validation and monitoring mechanisms.
  • Collaborate closely with data scientists, data analysts, business intelligence developers, and application teams to understand data requirements and deliver appropriate data solutions.
  • Monitor data pipeline performance, identify bottlenecks, and implement optimizations to improve efficiency and reduce processing times.
  • Manage data lifecycle, including data archival, retention, and compliance with data governance policies and security standards.
  • Participate in code reviews, contribute to documentation, and adhere to engineering best practices.
  • Troubleshoot and resolve data-related issues in production environments.
  • Contribute to the evaluation and selection of new data technologies and tools.

Qualifications:

  • Experience: 5+ years of professional experience in data engineering, backend development with a strong data focus, or a related field.
  • Data Acumen: Strong understanding of data warehousing concepts, dimensional modeling, and data lake architectures.
  • Problem-Solving: Excellent analytical and problem-solving skills, with a keen attention to detail.
  • Communication: Good verbal and written communication skills, with the ability to articulate technical concepts to both technical and non-technical audiences.
  • Teamwork: Ability to work effectively in a collaborative team environment and contribute positively to team goals.
  • Agile: Experience working in an Agile/Scrum development methodology.

Education:

  • Bachelor’s degree/University degree or equivalent experience

Technical Skills

  • Big Data Processing: Strong proficiency with Apache Spark (DataFrames API, Spark SQL) using Scala or Python.
  • Databases: Expert-level SQL skills. Extensive experience with relational databases (e.g., PostgreSQL, Oracle, SQL Server, MySQL) and experience with cloud-native data warehouses (e.g., Snowflake, Google BigQuery, AWS Redshift) or data lake technologies (e.g., Delta Lake).
  • Programming Languages: Strong proficiency in Python or Scala.
  • ETL/ELT Tools: Experience with ETL/ELT methodologies and tools, including data orchestration tools (e.g., Apache Airflow, Azure Data Factory, AWS Step Functions, GCP Cloud Composer).
  • Cloud Platforms: Exposure to major cloud platforms (AWS, Azure, GCP) and their data services (e.g., S3, ADLS, GCS, EC2, Azure VMs, Kubernetes).
  • Version Control: Proficiency with Git and standard version control workflows.
  • Data Modeling: Experience in designing and implementing efficient and scalable data models.
  • Performance Tuning: Ability to optimize Spark jobs, SQL queries, and database performance.
  • Linux/Unix: Familiarity with Linux/Unix environments for scripting and job execution.

------------------------------------------------------

Job Family Group:

Technology

------------------------------------------------------

Job Family:

Systems & Engineering

------------------------------------------------------

Time Type:

Full time

------------------------------------------------------

Most Relevant Skills

Please see the requirements listed above.

------------------------------------------------------

Other Relevant Skills

For complementary skills, please see above and/or contact the recruiter.

------------------------------------------------------

Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.

 

If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.

View Citi’s EEO Policy Statement and the Know Your Rights poster.

Citi
Citi

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say