About Maximiliano
Spanish: Native or bilingual
Catalan: Native or bilingual
English: Fluent
Experience
- Deepdots, AI Tech Lead
  DIGITAL AND IT
  January 2025 - Today (1 year and 5 months)
  Copenhagen, Denmark
  • Led the full lifecycle of production LLM systems (Danish↔English translation, summarization and information extraction), from business definition to deployment: self-hosted open-source models on GCP Cloud Run on a single NVIDIA L4 GPU (24 GB VRAM), serving ~15K requests/day at ~1s latency per request.
  • Cut inference costs by 70% by migrating from third-party APIs to self-hosted open-source models, selecting and sizing candidates (Qwen3 14B, Mistral Small) based on multilingual quality and VRAM footprint.
  • Built an observability and cost-control layer with LiteLLM as a unified gateway: per-request logging, token/latency/spend tracking, fallbacks and weekly cost reporting per product line.
  • Designed RAG systems and agentic workflows that turned customer pain-points into production features, improving retention (from non-returning users to recurring usage every 1–2 days).
  • Defined the AI strategy and owned the technical leadership of development, aligning model and infrastructure decisions with commercial objectives.
  Tech stack: Python, Gemini, OpenAI, Google ADK, LangChain, LiteLLM, vector databases, FastAPI, GCP Cloud Run, Docker, open-source models (Qwen, Mistral).
- Saber, Machine Learning Engineer
  DIGITAL AND IT
  January 2024 - January 2025 (1 year)
  Amsterdam, Netherlands
  • Sole ML decision-maker in a startup environment: designed the platform's RAG architecture (naive, dense and hybrid retrieval strategies) and the agentic workflows with LLM orchestration.
  • Optimized RAG pipeline consumption, reducing cost and tokens per query through retrieval and context-management improvements.
  • Built a daily-signals feature generated from data collected each day, orchestrating collection, processing and automated delivery.
  • Reported directly to the CEO, acting as technical advisor on the AI product roadmap and feasibility assessments.
  Tech stack: TypeScript, Node.js, Python, OpenAI, GCP, MongoDB.
- Sciling, Machine Learning Engineer & Python Developer
  TECH
  January 2022 - January 2024 (2 years)
  Valencia, Spain
  • Developed NLP pipelines based on embeddings and classifiers (some trained and deployed by me) for information extraction in a regulated medical domain.
  • Built RAG systems achieving recall@K >90% in a specific domain, combining dense and hybrid retrieval.
  • Implemented knowledge graphs for expert-system information extraction.
  • Developed a conversational chatbot in the medical domain (IBM Watson Assistant + Speech-to-Text).
  • Fine-tuned open-source models for specific use cases; backend services with Python, FastAPI and Docker.
Education
- MSc, Universidad Europea, 2025
- MLOps Specialization, DeepLearning.AI, 2023