Research

Research & open data

Rigorous analyses with public data, transparent methodology, and open source code. Knowledge contributions anyone can verify.

Open DataFeaturedInteractiveApr 5, 2026

Interactive dashboard: 15 Mexican presidents under the same algorithm

Interactive quantitative analysis of the 15 Mexican presidential administrations (1934-2024). 9 structural variables (GDP, poverty, employment, violence, Gini, tax, investment, inflation, debt), LatAm-5 regional Z-score, Contextual Difficulty Index and 1,000-simulation bootstrap.

8 min read
datadashboardmexico
AI & Machine LearningApr 5, 2026

How to reduce fraud 73% with Machine Learning: a case study

Practical case study of implementing a financial fraud detection system with ML for a LATAM fintech. Architecture, features, model, and production results.

10 min read
machine-learningfintechfraud-detection
EngineeringApr 5, 2026

From monolith to microservices: a practical guide for growing companies

Technical guide for migrating from a monolithic architecture to microservices. Patterns, anti-patterns, domain decisions, and lessons learned in production.

15 min read
microservicesarchitecturedevops
Industry AnalysisApr 5, 2026

State of enterprise AI in Latin America 2026

Analysis of the current state of AI adoption in Latin American enterprises. Trends, barriers, success stories, and opportunities by sector.

8 min read
artificial-intelligencelatamdigital-transformation
Industry AnalysisFeaturedApr 19, 2026

EU AI Act: compliance guide for LATAM exporters before August 2, 2026

Fixed regulatory deadline: August 2, 2026. Fines up to €35M or 7% of global revenue. 20-point checklist, self-hosted stack for data residency, and remediation calendar for Latin American SaaS and fintech handling European data.

14 min read
ai-actcomplianceregulation
EngineeringFeaturedApr 26, 2026

Self-hosted AI stack on a $40 Digital Ocean droplet: Qdrant + Langfuse + LiteLLM + Redis

Production Docker Compose with Qdrant, Langfuse, LiteLLM Proxy, Redis 8, and Ollama on a $40 droplet. Nginx + Certbot, backups to Spaces, Grafana and Prometheus monitoring. Entire stack on a single server.

18 min readImplementation proposal
self-hostedqdrantlangfuse
AI & Machine LearningMay 3, 2026

Building a Spanish AI receptionist: Pipecat + Deepgram + Cartesia + Qdrant

Spanish-language voice agent for dental clinics and small law firms. OSS stack: Pipecat for orchestration, Deepgram/Whisper.cpp for STT, Cartesia/Coqui TTS for voice, Qdrant for business RAG, Langfuse for traces, Redis for context.

16 min readImplementation proposal
voice-agentpipecatvapi
EngineeringFeaturedMay 10, 2026

MCP templates: the missing standard for enterprise chatbots

Ten production-ready MCP servers: CRM, WhatsApp Business, CFDI invoicing, calendar, Postgres with RLS, and Qdrant RAG. Open repo numoru/mcp-templates-es with tests, OAuth, Docker, and Cloudflare Workers deployment.

20 min readImplementation proposal
mcptemplateschatbots
Industry AnalysisMay 17, 2026

GEO vs SEO in 2026: why 50% of your organic clicks are disappearing and how to measure it

Generative Engine Optimization explained with our own benchmark: 20 Mexican brands measured across ChatGPT, Perplexity, Gemini, and Claude. Measurement pipeline with Firecrawl, LiteLLM, and Qdrant.

13 min readImplementation proposal
geoseollmo
EngineeringMay 24, 2026

Auditing if ChatGPT cites your brand: an open-source Go tool with LiteLLM and Qdrant

Go script that runs 100 queries against 5 LLMs via LiteLLM proxy, vectorizes responses in Qdrant, and detects semantic citations. React dashboard. Published as numoru/geo-audit.

15 min readImplementation proposal
gogeolitellm
EngineeringMay 31, 2026

MCP from scratch: a Go server with Postgres, Gmail, and Calendar in 30 minutes

Practical guide to implementing a Model Context Protocol server in Go using mcp-go, integrating Postgres (row-level security), Gmail API, and Google Calendar. Includes OAuth, tests, and Cloudflare Workers deployment.

17 min readImplementation proposal
mcpgopostgres
EngineeringJun 7, 2026

Langfuse + Redis + Mem0 as production agent memory: the tiered memory pattern

Tiered memory pattern for LLM agents: Redis for working memory (1h TTL), Langfuse sessions for auditable history, Mem0 for long-term semantic memory. Go code, when to expire, summarize, or vectorize.

14 min readImplementation proposal
langfuseredismem0
Industry AnalysisFeaturedJun 14, 2026

Numoru survey: state of AI agents in LATAM enterprises 2026

Results of our own survey of 87 Latin American enterprises: frameworks used (LangGraph, CrewAI, Mastra), vector DBs (Qdrant, pgvector, Weaviate), observability (Langfuse, Phoenix), and real barriers. Only 23% report clear ROI.

16 min readImplementation proposal
agentssurveylatam
AI & Machine LearningJun 21, 2026

Context engineering: why your RAG breaks at 50k tokens and how to fix it

From naive RAG to production RAG: Chonkie for semantic chunking, Qdrant with hybrid search (dense + BM25), self-hosted BGE-reranker, Anthropic Contextual Retrieval, RAPTOR, RedisVL semantic cache, and Ragas + Langfuse evaluation.

18 min readImplementation proposal
ragcontext-engineeringqdrant
AI & Machine LearningJun 28, 2026

Orchestrating three agents for a dental clinic with LangGraph, Qdrant, Mem0, and Temporal

Full case: appointments agent + reminders agent + reviews agent, orchestrated with LangGraph and Postgres checkpointer. Mem0 semantic memory, Temporal long-running flows, healthcare guardrails, Langfuse traces.

19 min readImplementation proposal
langgraphqdrantmem0
EngineeringJul 5, 2026

Agent evals in CI/CD: Promptfoo + DeepEval + Langfuse + GitHub Actions

Pipeline running the full eval suite on every PR: golden dataset versioned in Qdrant, automatic regression detection, RAG metrics with DeepEval, Langfuse traces, and merge blocking on score drops.

15 min readImplementation proposal
evalspromptfoodeepeval
Industry AnalysisJul 12, 2026

AI adoption matrix by vertical in Mexico: where they pay more and where they pay less

Cross of Fed, Deloitte, and local observations: which vertical pays most for AI, which is underdigitized, what stack applies in each (health → on-prem Ollama, legal → BAML, retail → Mem0 + Redis). 2x2 entry and ticket matrix.

13 min readImplementation proposal
verticalsmexicoadoption
EngineeringJul 19, 2026

Secure vibe coding: a QA pipeline for code generated with Cursor, Aider, and Cline

How to audit AI code before production: Semgrep + Bearer + Trivy in pre-commit, SonarQube Community with AI-specific rules, DSPy for automatic test generation, Promptfoo for prompt validation in CI.

14 min readImplementation proposal
vibe-codingsecuritysemgrep
AI & Machine LearningJul 26, 2026

Fine-tuning Llama 3.3 for ICD in Spanish: dataset, costs, and benchmarks against Claude

Fine-tuning Llama 3.3 8B on the Spanish ICD-11 catalog: Unsloth for efficient training, vLLM for serving, Qdrant for complementary RAG, lm-eval-harness benchmark. Model published on Hugging Face.

17 min readImplementation proposal
fine-tuningllamaunsloth
AI & Machine LearningAug 2, 2026

n8n blueprint for real estate: WhatsApp + CRM + AI listings with Qdrant and Ollama

Exportable self-hosted n8n flow: semantic property search with Qdrant, description generation with Ollama (zero cost per listing), WhatsApp Business Cloud API, Langfuse traces. Repo numoru/n8n-blueprints.

15 min readImplementation proposal
n8nqdrantollama

More research coming soon...