Yuki Tanaka

Yuki is a former Stripe data engineer (2017-2022) who spent her last eighteen months there building the internal Airflow-to-dbt migration tooling used by the revenue team. She left to join a 12-person AI startup as employee #4, where she shipped a LangChain-based contract-review pipeline now processing roughly 40,000 documents a month for mid-market legal teams. She's deep on the retrieval side: chunking strategies that don't shred tables, hybrid BM25+dense rerankers, and the surprisingly hard problem of evaluating RAG quality without paying for human raters. Her PR adding parent-document retriever support to LlamaIndex's PostgreSQL store landed in late 2024. Nine years in data and ML platform work. Based in Seattle, mostly writes Python, reluctantly writes TypeScript when n8n forces her hand.

Website X / Twitter

Artikelen van Yuki Tanaka

Handleidingen May 25, 2026

Document Parsing voor RAG met Docling in Python: Complete Handleiding 2026

Bouw een productieklare RAG-parsing-pipeline met Docling 2.x in Python: PDF, tabellen en formules netjes naar Markdown, met LangChain- en Qdrant-voorbeelden.

Yuki Tanaka 11 min leestijd

Handleidingen Mar 22, 2026

AI Agents Bouwen met LangGraph in Python: Complete Handleiding

Leer hoe je AI-agents bouwt met LangGraph in Python. Van tool-calling en het ReAct-patroon tot geheugen, human-in-the-loop en multi-agent orchestratie. Inclusief werkende codevoorbeelden.

Yuki Tanaka 14 min leestijd

Handleidingen Mar 12, 2026

Prompt Caching voor LLM-Applicaties: Kosten tot 90% Verlagen met Python

Leer hoe prompt caching je LLM-kosten met 50–90% verlaagt. Stapsgewijze Python-implementaties voor OpenAI, Anthropic Claude en Google Gemini, inclusief productiepatronen voor RAG-pipelines.

Yuki Tanaka 14 min leestijd

Handleidingen Mar 10, 2026

Gestructureerde Output van LLMs met Python: Pydantic, Instructor en PydanticAI

Leer hoe je betrouwbare, gevalideerde datastructuren uit LLMs haalt met Python. Praktische handleiding over Pydantic, Instructor en PydanticAI met werkende codevoorbeelden voor productie-AI.

Yuki Tanaka 14 min leestijd

Handleidingen Feb 09, 2026

RAG-Pipelines Bouwen voor Productie: De Complete Gids voor 2026

Bouw een productiewaardige RAG-pipeline in 2026. Van chunkingstrategieën en vectordatabases tot Agentic RAG en GraphRAG — met praktische Python-codevoorbeelden en best practices.

Yuki Tanaka 20 min leestijd