MCPcopy
hub / github.com/ai-that-works/ai-that-works

github.com/ai-that-works/ai-that-works @main sqlite

repository ↗ · DeepWiki ↗
2,767 symbols 8,638 edges 667 files 781 documented · 28%
README

🦄 AI That Works

On Zoom, Tuesdays at 10 AM PST - an hour of live coding, Q&A, and production-ready AI engineering

Event Calendar Discord YouTube Playlist

🦄 Next Episode

Product Specs with AI

Tuesday, June 16, 2026 at 10 AM PST

We've talked a lot about design discussions for planning work with AI and getting leverage before writing the code, but this process has a common pitfall: it combines product decisions (how does it work, what is the user experience) with technical decisions (how do we build it, what patterns do we follow).

This complecting of concerns can cause important questions to be missed. On today's AI that works we'll dig into techniques to split out product vs. technical questions to enable less-technical folks to participate in product specification that is grounded in codebase research, and ensure technical depth is achieved without getting distracted by product questions.

Register Now



What We're About

Weekly conversations with @hellovai & @dexhorthy about getting the most juice out of today's models

When: Every Tuesday at 10 AM PST on Zoom
Duration: 1 hour of live coding, Q&A, and production-ready insights
Goal: Take your AI app from demo → production

Let's code together.


Pre-Reading & Setup

Before joining, get familiar with our toolkit:

### **Core Tools** - **Zoom** - Live sessions - **Cursor** - AI-powered IDE - **Git** - Version control - **Claude Code** - Agentic Coding - **CodeLayer** - Agentic Coding Tool ### **Languages** - **Python/TypeScript/Go** - Application logic - **BAML** - Prompting DSL - [Repository](https://github.com/boundaryml/baml) - [Getting Started Guide](https://gloochat.notion.site/benefits-of-baml) ### **Package Managers** - **Python:** [UV](https://docs.astral.sh/uv/getting-started/installation) - **TypeScript:** PNPM - **Go:** Go modules

Episodes & Workshops

From Demo to Production - One Episode at a Time

📅 Episode 📝 Description
UPCOMING 2026-06-16 #62: Product Specs with AI coderegister We've talked a lot about design discussions for planning work with AI and getting leverage before writing the code, but this process has a common pitfall: it combines product decisions (how does it work, what is the user experience) with technical decisions (how do we build it, what patterns do we follow). This complecting of concerns can cause important questions to be missed. On today's AI that works we'll dig into techniques to split out product vs. technical questions to enable less-technical folks to participate in product specification that is grounded in codebase research, and ensure technical depth is achieved without getting distracted by product questions.
PAST 2026-06-09 #61: Hands-on with Fable 5 watchcode We had agent observability on the schedule, but Anthropic shipped Fable 5 about twenty minutes before we went live, so we tossed the plan and got hands-on with the new model instead. Zero prep, all live. This is an unscripted look at exactly how we kick the tires on a fresh model release: take the hardest problem you're already deep in, hand it over, and watch whether it finds leverage you didn't. Vaibhav ran it against an in-progress design doc for the BAML VM's observability layer, Dex ran it against an old race-condition benchmark he keeps around, and we talk through what the model caught, what it missed, and why most model releases are more hype than the value they deliver day to day.
PAST 2026-06-02 #60: How to Build AI Agents that Work in Any Language watchcode In this episode, we discuss the challenge of building multilingual AI applications that perform consistently whether your user is interacting in English, Spanish, French, whatever. Can you simply run an English prompt through a basic translator? Or will that break down in production? We'll be breaking down practical engineering strategies for designing flexible, cross-lingual prompt architectures that maintain semantic alignment without forcing you to build and manage separate pipelines for every language.
PAST 2026-05-26 #59: No Vibes Allowed: Performance Engineering code This week on the podcast, we are doing another no vibes allowed episode focusing on performance engineering. We will dive into high-performance engineering on virtual machines to help you maximize efficiency out of your compute infrastructure. When you are deploying heavy LLM workloads or complex pipelines, an unoptimized VM can lead to crippling latency, memory bottlenecks, and ballooning costs that break your application in the real world. We break down practical techniques to configure your environments, manage resources, and eliminate overhead so your models run flawlessly under pressure.
PAST 2026-05-19 #58: How AI Agents Can Safely Ship Code to Production watchcode This week, the top headline is vibe coders realizing that they can use feature flags to ship experimental (read: slop) features to production without impacting all customers. Shipping code is a lot harder when everything is changing all the time. Feature flags can be a good technique to test various things, but how do you set that up? Do you feature flag new models? New prompts? New harnesses? We'll dive into details here and see where feature flags improve your product delivery vs. just giving you an excuse to ship more slop.
PAST 2026-05-12 #57: "Code Mode" Deep Dive watchcode On Monday, Pash from OpenAI shared that Codex has a secret "code mode" feature - an alternative to traditional tool calling. There's a lot of debate going on around the best way to give tools to models - skills vs. mcps, CLIs and bash vs custom tools, or letting the model write code for everything. In this episode we're going to cut through the hype and dive deep on the differences and tradeoffs between these methods. • What is "code mode" and how does it work • Tradeoffs between MCP vs. Bash+CLI vs. Code mode • Why it matters to agent or harness builders
PAST 2026-05-05 #56: OpenAI tells you not to build your own harness watchcode Harness engineering is all the hype now, so on this week on the podcast we're looking back to an article written by OpenAI in February about harness engineering, "Harness engineering: leveraging Codex in an agent-first world". In this article, they claim that the era of "hand-written code" is officially over. We break down their experiment of shipping a million-line product with zero manual coding, shifting the human role from "coder" to "environment designer."
PAST 2026-04-28 #55: No Vibes Allowed - Building Design Docs with AI watchcode In this month's no vibes allowed episode, Vaibhav will show how he uses AI to make design docs for complicated tasks by building out an actual design doc for a feature in BAML. As always for our no vibes allowed series, we will be solving real problems in real production systems.
PAST

Extension points exported contracts — how you extend this code

ThreadStore (Interface)
(no doc) [9 implementers]
2025-06-03-humans-as-tools-async/src/state.ts
ParserState (Interface)
* Parser state for incremental parsing
2025-11-05-event-driven-agents/demo/src/antml/AntmlParser.ts
LumaEvent (Interface)
(no doc)
tools/luma.ts
ZoomToken (Interface)
(no doc)
tools/zoom.ts
ValidationResult (Interface)
(no doc)
tools/validate-metadata.ts
KeyboardShortcutProps (Interface)
(no doc)
2026-04-14-agentic-coding-for-frontend-apps/02-storybook-riptide/src/components/keyboard-shortcut.tsx
User (Interface)
(no doc)
2026-04-14-agentic-coding-for-frontend-apps/03-wired-vs-pure/src/types.ts
Notification (Interface)
(no doc)
2025-10-28-ralph-wiggum-coding-agent-power-tools/webapp/src/components/notifications/NotificationList.tsx

Core symbols most depended-on inside this repo

get
called by 514
2025-06-03-humans-as-tools-async/src/state.ts
error
called by 313
2025-09-23-evals-for-classification/src/shared/logger.py
log
called by 168
2026-01-13-applying-12-factor-principles-to-coding-agent-sdks/src/utils.ts
info
called by 112
2025-09-23-evals-for-classification/src/shared/logger.py
exists
called by 106
2026-01-13-applying-12-factor-principles-to-coding-agent-sdks/src/store/order-store.ts
split
called by 87
2026-04-11-unconf-sf/src/talk_segmenter/transcript_splitter.py
write
called by 87
2026-04-11-unconf-sf/src/talk_segmenter/segment_writer.py
open
called by 86
2025-11-05-event-driven-agents/demo/src/server.ts

Shape

Function 1,484
Method 671
Class 321
Interface 243
Route 48

Languages

TypeScript60%
Python40%

Modules by API surface

2025-07-01-ai-content-pipeline-2/backend/main.py49 symbols
2025-09-23-evals-for-classification/tests/unit/classification/vector_store_test.py38 symbols
2025-06-24-ai-content-pipeline/backend/main.py37 symbols
2025-10-21-agentic-rag-context-engineering/tui.py32 symbols
2025-09-23-evals-for-classification/tests/unit/classification/embeddings_test.py32 symbols
2025-09-23-evals-for-classification/tests/unit/classification/selection_test.py28 symbols
2025-10-28-ralph-wiggum-coding-agent-power-tools/webapp/src/lib/email-notifications.ts27 symbols
2025-08-19-interruptible-agents/runtime.py27 symbols
2025-12-02-multimodal-evals/src/receipt_evaluator.py25 symbols
2025-10-28-ralph-wiggum-coding-agent-power-tools/webapp/src/app/actions/todos.ts25 symbols
2025-07-01-ai-content-pipeline-2/backend/models.py25 symbols
2025-09-23-evals-for-classification/tests/unit/classification/narrowing_test.py24 symbols

Dependencies from manifests, versioned

@anthropic-ai/claude-agent-sdk0.2.38 · 1×
@biomejs/biome2.2.0 · 1×
@boundaryml/baml0.88.0 · 1×
@boundaryml/baml-nextjs-plugin0.1.0 · 1×
@dagrejs/dagre1.1.5 · 1×
@eslint/eslintrc3 · 1×
@hono/node-server1.13.7 · 1×
@hookform/resolvers5.1.1 · 1×
@linear/sdk1.22.0 · 1×
@mdx-js/mdx3.1.0 · 1×
@prisma/client6.18.0 · 1×

For agents

$ claude mcp add ai-that-works \
  -- python -m otcore.mcp_server <graph>

⬇ download graph artifact