I will audit and optimize your rag vector search performance


About this gig
Your RAG is in production but returning bad results. Latency is slow. Costs are climbing. Hallucinations slip through. Sound familiar?
I audit and fix RAG pipelines that look right on paper but fail in the real world. 10+ years of production backend work, currently leading the AI search migration for one of Latin America's largest retailers (50K+ products, 1M+ daily users).
What I audit:
- Embedding model fit for your domain
- - Chunking strategy and overlap
- - Retrieval recall and precision (with eval set)
- - Reranking effectiveness
- - Hybrid search weights (keyword vs semantic)
- - Latency per stage and cost per query
- - Hallucination patterns
What you get:
- Written diagnostic with prioritized fixes
- - Code changes for top issues (Standard / Premium)
- - Eval set so you can measure progress
- - Monitoring setup (Premium)
Stack: Python, OpenAI, Anthropic, Pinecone, Weaviate, Qdrant, pgvector, LangChain.
Send me your stack and one example query that fails. I will tell you what is likely broken before you pay.
Get to know Martin Poli
Senior RAG and AI Search Engineer for Backend at Scale
- FromUruguay
- Member sinceMar 2020
Languages
English
My Portfolio
FAQ
Do you need access to my codebase?
For audit-only (Basic) no, I work from your description and example queries. For implementation (Standard/Premium) yes, read access to repo and a test environment.

