ML Engineer (She/He/They)
Location: Poland
Department: Data & Analytics
Location: Warsaw, Poland - Hybrid
Capco Poland is a leading global technology and management consultancy, dedicated to driving digital transformation across the financial services industry. Our passion lies in helping our clients navigate the complexities of the financial world, and our expertise spans banking and payments, capital markets, wealth, and asset management. We pride ourselves on maintaining a nimble, agile, and entrepreneurial culture, and we are committed to growing our business by hiring top talent.
ROLE OVERWIEW:
This role is responsible for designing and implementing ingestion pipelines, document processing workflows, data normalization, and integrations across Azure services (SharePoint, Microsoft Graph, Azure AI Search, Azure Document Intelligence, App Insights). The engineer will also implement automated evaluation frameworks and quality monitoring mechanisms.
The primary objective of this role is to enable domain agents, task agents, and the master agent to operate on properly prepared, complete, indexed, and secure knowledge artifacts, in full alignment with enterprise security, compliance, and banking standards.
Key Responsibilities
-
Design and develop data connectors (SharePoint, OneDrive, Microsoft Graph, GCP sources, external APIs) to enable automated document and metadata ingestion.
-
Implement ingestion pipelines (batch and event-driven) and processing orchestration mechanisms.
-
Process documents using Azure Document Intelligence, including parsing, OCR, layout extraction, structured data extraction (tables, fields, confidence scoring).
-
Develop transformation and normalization workflows, including data cleaning, segmentation, PII masking, and generation of structured knowledge artifacts.
-
Index content into Azure AI Search and Knowledge Bases, including index design, indexers, skillsets, enrichment pipelines, embeddings, and vector stores.
-
Prepare evaluation datasets (baseline, ground truth, domain-specific test cases).
-
Automate quality evaluation of extraction and indexing processes (precision, recall, fidelity metrics, drift detection).
-
Implement instrumentation, logging, and monitoring using Application Insights and Log Analytics.
-
Optimize document processing costs (batch vs. on-demand processing strategies, layered caching, cost-per-document analysis).
-
Establish CI/CD pipelines and development standards (GitHub Actions, testing, code quality, linting, artifact registry, containerization).
-
Build and maintain containerized services (Docker, Azure Container Registry, Azure Container Apps / AKS / WebApp for Containers).
-
Automate ingestion and evaluation workflows using Azure Functions, Logic Apps, and Durable Functions.
Must-Have Skills & Experience
-
Strong Python skills (asyncio, FastAPI, Pydantic, multiprocessing).
-
Hands-on experience with Azure SDK for Python (Storage, Cognitive Services, AI Search, Application Insights).
-
Practical experience with Microsoft Graph API / SharePoint API (file and metadata retrieval).
-
Azure AI Search: index design, indexers, skillsets, embeddings, vector search.
-
Azure Document Intelligence (OCR, layout extraction, custom models).
-
Testing experience (pytest, integration tests, cloud service mocking).
-
GitHub Actions (build, test, scanning, artifact management, deployment).
-
Docker (image building, multi-stage builds, layer optimization).
-
Logging and monitoring (Application Insights, Log Analytics).
-
Experience working with large document collections (batch ingestion at scale).
-
Data security best practices (PII masking, RBAC, Entra ID integration).
Nice-to-Have
-
MCP Client / MCP Tooling (custom agent integrations).
-
Experience with LangChain or Semantic Kernel (RAG / agent pipelines).
-
GCP experience (BigQuery, Looker, Cloud Functions) within a multi-cloud Knowledge Management architecture.
-
AKS / Azure Container Apps / WebApp for Containers for scalable ingestion services.
-
Durable Functions for orchestrating large document processing workflows.
We offer a flexible collaboration model based on a B2B contract, with the opportunity to work on diverse projects.
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
