Priya Ramaswamy

Priya spent four years at Zapier building the Tables product before leaving in 2023 to consult on agent infrastructure for Series A startups. She's shipped custom n8n nodes for two YC-backed companies (a clinical-trial logistics platform and a freight broker), and her PR adding streaming-token support to LangChain's Bedrock chat wrapper was merged in early 2024. Most of her current work is unglamorous: helping ops teams replace 40-step Make.com scenarios with a single LangGraph state machine, then arguing with their CFO about token budgets. She writes here about the parts of agent work that vendor blogs skip - eval harnesses that don't lie, retry logic that survives a rate-limited Anthropic endpoint at 2am, and why 'just add a vector DB' is almost always the wrong answer. Based in Toronto. Eight years total in workflow tooling.

Сайт X / Twitter

Статті автора Priya Ramaswamy

Інтеграція LLM API May 03, 2026

Prompt Caching у Claude API 2026: як зменшити витрати на 90% (повний гайд з кодом)

Anthropic тихо змінив TTL prompt cache з 60 до 5 хвилин — і це збільшило витрати багатьох команд на 30–60%. Розбираємо, як перебудувати кешування у 2026: коли 1-годинний TTL, як уникнути беззвучних промахів, де ставити cache_control, як виміряти hit rate. З Python-кодом і реальними розрахунками.

Priya Ramaswamy 10 хв читання