Citigroup

Machine Learning Operations Engineer

Irving, TX
USD 107k - 161k
Python Ray Tune MLflow Docker Kubernetes Apache Iceberg Apache Spark FLINK PostgreSQL Oracle MongoDB Kafka Prometheus Grafana Bash Go Helm Terraform TensorFlow PyTorch
Description

ML Operations Engineer - Associate Vice President

Location: Irving, Texas, United States

Employment Type: Regular

We are seeking an experienced MLOps Engineer to join our DevOps and Infrastructure Engineering team. This role is crucial for operationalizing, scaling, and maintaining our Artificial Intelligence (AI) and Machine Learning (ML) applications. The successful candidate will leverage their expertise to ensure seamless, scalable, and reliable deployment and management of AI/ML models, working closely with data scientists and ML engineers. This position requires strong proficiency in Python, hands-on experience with Ray Tune for hyperparameter optimization, and MLflow for experiment tracking and model lifecycle management.

Key Responsibilities:

  • ML Pipeline Development & Automation: Design, build, and maintain robust and scalable end-to-end ML pipelines for data ingestion, preprocessing, model training, validation, and deployment.

  • CI/CD for ML: Implement and manage Continuous Integration/Continuous Delivery (CI/CD) pipelines specifically tailored for machine learning workflows, ensuring automated testing, versioning, and deployment of ML artifacts.

  • Experiment Tracking & Model Management: Utilize MLflow extensively for experiment tracking, reproducible runs, managing model versions, and maintaining a centralized model registry.

  • Hyperparameter Optimization: Leverage Ray Tune for efficient and distributed hyperparameter optimization to enhance model performance and accelerate experimentation.

  • Containerization & Orchestration: Package ML models and their dependencies using Docker and deploy/manage them effectively on Kubernetes clusters.

  • Data Platform Integration: Integrate with and optimize existing data platforms, including Apache Iceberg, Apache Spark, and FLINK, to ensure efficient data processing and feature engineering for ML models.

  • Data Storage & Streaming: Work with PostgreSQL, Oracle, and MongoDB for diverse data storage needs, and utilize Kafka for real-time data streaming to support various ML applications.

  • Monitoring & Observability: Implement comprehensive monitoring, logging, and alerting solutions (e.g., Prometheus, Grafana) for ML models in production, tracking model performance, data drift, and infrastructure health to ensure reliability and facilitate automated retraining or rollback.

  • Scripting & Automation: Develop automation scripts and tools using Python and Bash/Go to streamline MLOps processes and integrate various systems.

  • Collaboration: Act as a vital link between data scientists, ML engineers, and infrastructure teams, facilitating clear communication and ensuring that ML solutions are production-ready.

Required Qualifications:

  • Experience: 3-5 years of hands-on experience in an MLOps, DevOps, or Machine Learning Engineering role, with a proven track record of deploying and managing ML models in production environments.

  • Programming: Expert-level proficiency in Python for ML development, scripting, and automation.

  • MLOps Tooling: Demonstrated hands-on experience with Ray Tune for hyperparameter optimization and AirFlow or MLflow for experiment tracking and model management.

  • Containerization & Orchestration: Strong experience with Docker and Kubernetes (including Helm).

  • CI/CD: Experience implementing CI/CD practices for software and/or ML pipelines.

  • Data Technologies: Familiarity with or experience with Apache Spark, Apache Iceberg, FLINK, and Kafka.

  • Databases: Experience with PostgreSQL, Oracle, and MongoDB.

  • Workflow Orchestration: Experience with Apache Airflow.

  • Infrastructure as Code: Experience with HashiCorp (Terraform).

  • Operating Systems: Proficiency in Linux/Unix environments. 

Desirable Skills:

  • Experience with cloud platforms (AWS, Azure, GCP) and managing cloud-native ML infrastructure.

  • Knowledge of deep learning frameworks such as TensorFlow or PyTorch.

  • Experience with generative AI technologies (e.g., LLMs, prompt engineering, RAG pipelines).

  • Understanding of distributed computing and big data processing techniques. 

------------------------------------------------------

Job Family Group:

Technology

------------------------------------------------------

Job Family:

Applications Development

------------------------------------------------------

Time Type:

Full time

------------------------------------------------------

Primary Location:

Irving Texas United States

------------------------------------------------------

Primary Location Full Time Salary Range:

$107,120.00 - $160,680.00


In addition to salary, Citi’s offerings may also include, for eligible employees, discretionary and formulaic incentive and retention awards. Citi offers competitive employee benefits, including: medical, dental & vision coverage; 401(k); life, accident, and disability insurance; and wellness programs. Citi also offers paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays. For additional information regarding Citi employee benefits, please visit citibenefits.com. Available offerings may vary by jurisdiction, job level, and date of hire.

------------------------------------------------------

Most Relevant Skills

Please see the requirements listed above.

------------------------------------------------------

Other Relevant Skills

For complementary skills, please see above and/or contact the recruiter.

------------------------------------------------------

Anticipated Posting Close Date:

Feb 12, 2026

------------------------------------------------------

Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.

 

If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.

View Citi’s EEO Policy Statement and the Know Your Rights poster.

Citigroup
Citigroup

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say