AI Infrastructure Engineer
Location: Metro Manila, Philippines
Workplace: remote
Description
At Umpisa Inc., our mission is to make the Philippines be known globally as a tech hub.
Umpisa Inc. is a progressive technology services company that partners with select industries, clients and people to work on pioneering and industry-changing solutions via digital transformation, modern software development and venture building.
We create a set of world-class and impactful products and solutions to help organizations and individuals live better lives. We offer demanding, challenging and rewarding careers in software development, product development, emerging technologies, and more for the right candidates.
Essential Skills:
- Aligns with our values: Excellence, Integrity, Professionalism, People Success, Customer Success, Fun, Innovation and Diversity
- Strong communication skills
- Strong problem solving and analytical skills
- Excellent problem-solving ability
- Would like to work as part of a self-organizing Scrum team in a scaled agile framework
- Must be a self-starter and loves to collaborate with the team and client
Job Summary
We are looking for a technical and hands-on AI Infrastructure Engineer to build and scale our AI platform from the ground up. You will work closely with Data Scientists and ML Engineers to design GPU environments, automate deployments, and ensure high-performance model training and inference.
Key Responsibilities
- Define AI infrastructure architecture strategy
- Lead cross-functional collaboration with Data Science and Security teams
- Design multi-region GPU cluster strategy
- Evaluate emerging AI infrastructure technologies
- Establish best practices and governance models
Generative AI Infrastructure & Inference Optimization
- Design and implement inference efficiency initiatives such as prompt/context caching.
- Build systems that allow fine-grained control over cache prefixes and retrieval strategies.
- Optimize latency and cost efficiency of large-scale LLM inference workloads.
- Support Retrieval-Augmented Generation (RAG) architectures.
Secure AI Systems & Encryption
- Architect and implement end-to-end encryption for cached AI content.
- Integrate customer-managed encryption keys (CMEK) within cloud environments.
- Ensure secure multi-tenant data isolation and compliance standards.
Vector Search & Ranking Systems
- Develop enterprise-ready vector similarity search systems.
- Optimize Approximate Nearest Neighbor (ANN) algorithms for scale and latency.
- Build ranking models for personalization, recommendation, and monetization.
- Contribute to highly scalable embedding search infrastructure.
Distributed Storage & Data Systems
- Design and maintain petabyte-scale distributed storage systems.
- Implement materialized views with consistent cross-datacenter updates.
- Support high-update throughput systems with low-latency point queries.
- Optimize large-scale table scans and distributed data processing.
Requirements
- 5+ years in Infrastructure/Cloud Engineering & IAM
- Extensive experience with large-scale distributed system
- Experience leading technical teams
- Strong architectural and documentation skills
- Knowledge of AI workload optimization
- Experience working with hyperscale cloud platforms such as Google Cloud Platform.
- Familiarity with vector databases and ANN indexing techniques.
- Exposure to LLM inference optimization techniques.
- Experience building infrastructure supporting generative AI applications.
- Background in storage engines similar to Google’s Mesa/Napa architecture.
- Strong systems design skills
- Performance optimization mindset
- Security-first engineering approach
- Experience building enterprise-ready cloud services
- Ability to work in high-scale, production-critical environments
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
