Senior Data Engineer
Location: Remote
Department: DE TP Colombia
About Fusemachines
Founded in 2013, Fusemachines is a global provider of enterprise AI products and services, on a mission to democratize AI. Leveraging proprietary AI Studio and AI Engines, the company helps drive the clients’ AI Enterprise Transformation, regardless of where they are in their Digital AI journeys. With offices in North America, Asia, and Latin America, Fusemachines provides a suite of enterprise AI offerings and specialty services that allow organizations of any size to implement and scale AI. Fusemachines serves companies in industries such as retail, manufacturing, and government.
Fusemachines continues to actively pursue the mission of democratizing AI for the masses by providing high-quality AI education in underserved communities and helping organizations achieve their full potential with AI.
Type: Remote Full-time
Senior Data Engineer
Are you an experienced Data Engineering professional with a passion for building scalable, reliable, and high-performance data systems? Do you have hands-on experience designing and optimizing end-to-end real-time and batch pipelines, and developing cloud-native data architectures using modern technologies such as AWS, GCP, Azure, Databricks, and Snowflake?
We are looking for a Senior Data Engineer to architect, design, and implement scalable, high-performance data solutions. The ideal candidate will be an expert in at least one major cloud data ecosystem (AWS, Azure, GCP, Snowflake, or Databricks) and possess a deep understanding of the end-to-end data lifecycle, from ingestion to business intelligence.
Qualification & Skill Set Requirements
Core Technical Competencies
Experience: 5+ years of hands-on data engineering experience in a production environment.
Languages: Strong proficiency in Python, SQL (complex queries, performance tuning), and PySpark/Apache Spark.
Data Modeling: Expert knowledge of data modeling (3NF, Star, Snowflake Schema) and Lakehouse/Warehouse architectures.
ETL/ELT & Orchestration: Proven experience building pipelines using tools like dbt, Airflow, Dagster, or native cloud orchestrators (Glue, Data Factory, Composer).
Integrations: Experienced in integrating data from diverse sources: APIs, RDBMS/NoSQL databases, flat files, and streaming platforms (Kafka, Kinesis, Pub/Sub).
Cloud Platform Expertise (Specialization-Specific)
Candidates should demonstrate deep expertise in anyone of the following:
Snowflake: SnowSQL, Streams, Tasks, Snowpark, and cost optimization.
Databricks: Delta Lake, Unity Catalog, Delta Live Tables (DLT), and Spark optimization.
GCP: BigQuery, Dataflow, Dataproc, Pub/Sub, and Cloud Functions.
Azure: Synapse Analytics, Data Factory, Azure Databricks, and Stream Analytics.
AWS: Redshift, S3, Lake Formation, Glue, and Lambda.
Professional Practices
SDLC & DevOps: Proficient in Git workflows, CI/CD pipelines (GitHub Actions, Azure DevOps, AWS CodePipeline), and IaC (Terraform/CloudFormation).
Data Governance: Strong understanding of data quality, lineage, observability, security (RBAC, encryption), and compliance frameworks.
Agile: Active experience in Agile/Scrum environments using Jira or Azure Boards.
Mentorship: Ability to lead projects and provide technical guidance to junior/mid-level engineers.
Responsibilities
Architecture: Architect, design, and implement scalable, reliable data solutions and pipelines aligned with business analytics needs.
Optimization: Manage and fine-tune cloud resources and workloads for maximum performance, reliability, and cost-efficiency.
Data Transformation: Lead the development of ETL/ELT processes for both batch and real-time data processing.
Collaboration: Partner with Product, Engineering, and Data Science teams to deliver effective, data-driven solutions.
Governance & Quality: Promote and enforce best practices in data governance, security, and data quality frameworks.
Mentorship: Provide technical leadership and mentorship to the team, ensuring architecture quality and best practices.
Documentation: Maintain comprehensive documentation of data architectures, configurations, and workflows.
Fusemachines is an Equal Opportunities Employer, committed to diversity and inclusion. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or any other characteristic protected by applicable federal, state, or local laws.
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
