Talks

An archive of my talks at conferences and meetups on topics like Machine Learning, LLMs, and building AI-powered products.

Beyond LMArena: 3 Practical Insights for Custom LLM Benchmarks

Tokyo Tech Leads Circle | Oct 2025 | Slides

On “interviewing” LLMs on your actual tasks, not just public benchmarks. Sharing three practical insights for building your own benchmark: starting small, keeping the eval pipeline simple enough to debug, and using pairwise statistical tests to make confident model choices.

Intuitive Jargon-Free Mental Model for MCP

Tokyo AI | Jun 2025 | Slides

On finally getting why MCP (Model Context Protocol) exists. Grasping the core concept with a browser analogy, seeing how MCP modifies system prompts, and watching it all work in live demos with Roo Code.

AI-powered Automatic Replies in Customer Support

PyCon JP | Sep 2024 | YouTube | Slides | Details

On building a precision-focused auto-reply system for customer support. Detailing the process of fine-tuning a transformer for Japanese text classification, ensuring a good user experience, and measuring its real-world business impact.

AI for Efficient Routing of Customer Inquiries at Mercari

PyCon APAC | Oct 2023 | YouTube | Slides | Details

On translating the business need for better customer inquiry routing into an ML problem. Going from simple keyword matching to a fine-tuned transformer in production, and looking at its real-world impact on support operations.