Building AI Agents and Workflows with LangGraph

This repository contains code for my live course: O'Reilly Live Online Training - Building AI Agents and Workflows with LangGraph

Instructor: Sinan Ozdemir

Overview

This course teaches you how to build production-ready AI agents and workflows using LangGraph. You'll progress from basic graph concepts through tool integration, RAG workflows, middleware for context engineering, evaluation-in-the-loop, and a full capstone deep research agent.

Prerequisites

Intermediate Python
Familiarity with LangChain basics (prompts, chains, LLMs)
An OpenRouter API key (required) — gateway to multiple LLM providers
A SerpAPI key (required, free tier available) — web search
Optional: LangSmith API key for tracing/observability

Notebooks

#	Notebook	Topic	Key Concepts
1	From Prompts to Workflows	From Prompts to Workflows	Single prompt vs. generate → critique → refine graph
2	LangGraph Basics	LangGraph Core Primitives	`StateGraph`, `TypedDict`, `add_messages`, conditional edges, checkpoints
3	Tools and Agents	Tools, ReAct Agents, MCP	`@tool`, `create_agent`, manual ReAct, MCP integration with custom server
4	RAG Workflow	RAG as a LangGraph Workflow	Ingestion pipeline (scrape → chunk → embed), Chroma + HuggingFace embeddings, document grading, web search fallback
5	Middleware	Context Engineering	`SummarizationMiddleware`, `HumanInTheLoopMiddleware`, stacking middleware, LangSmith tracing, custom `GuardrailMiddleware`
5.1	Context Summarization Benchmark	Compression Strategies (Advanced)	Rule-following stress test, five compression strategies, custom middleware in `middleware.py`, LLM-as-judge + heatmaps
6	Evaluation	Agent Evaluation	LLM-as-judge, trajectory scoring, evaluation-in-the-loop, self-correcting agents, multi-model comparison
6.1	Structured Grader Evaluation	Grader Benchmark (Advanced)	0–3 structured rubric, Pydantic `with_structured_output`, gold-label test set, multi-model grader accuracy, disagreement analysis
7	Deep Research Agent	Capstone	Planner → researcher → writer → reviewer with revision loop, `SummarizationMiddleware` + `ModelCallLimitMiddleware`, supervisor-with-tools pattern
8	Parallel Graphs	Fan-Out / Fan-In	`Send` API, dynamic parallel dispatch, `operator.add` reducer, sequential vs. parallel timing

Supporting files (run notebooks from the notebooks/ directory):

middleware.py — custom conversation-compression middleware used by notebook 5.1 (MapReduceSummarizationMiddleware, RefineSummarizationMiddleware, RulesFirstSummaryMiddleware, and more)
data/bulk_email_conversation.json — scripted ~84-message email-triage conversation for the notebook 5.1 rule-following benchmark
Notebook 6.1 writes data/structured_grader_evaluation_results.json and data/structured_grader_evaluation_summary.csv when you run the save cell

Notebook highlights

Notebook 1 uses a "generate → critique → refine" pattern (no tools needed) to motivate why graphs beat single prompts
Notebook 3 includes a working MCP server (mcp_server.py) and demonstrates loading external tools via langchain-mcp-adapters
Notebook 4 scrapes real blog posts from AI Office Hours and builds a LangGraph ingestion pipeline before the RAG query workflow
Notebook 5 walks through SummarizationMiddleware and HumanInTheLoopMiddleware, stacks multiple middleware on one agent, optional LangSmith tracing, and a custom GuardrailMiddleware (PII blocking) plus decorator-style hooks; the closing exercise points you at ModelCallLimitMiddleware in the docs
Notebook 5.1 benchmarks five compression strategies on a long scripted conversation — three user rules must survive summarization — using middleware.py and data/bulk_email_conversation.json; ends with LLM-as-judge pass/fail grids and matplotlib heatmaps across model pairings
Notebook 6 compares multiple models against a gold evaluation set using a structured rubric judge
Notebook 6.1 (advanced) benchmarks nine LLMs as graders on a 12-case gold set with a 0–3 correctness rubric, chain-of-thought structured outputs, accuracy tables, matplotlib charts, and a disagreement walkthrough — only OPENROUTER_API_KEY required
Notebook 7 hardens the researcher sub-agent with SummarizationMiddleware and ModelCallLimitMiddleware, then ends with a "supervisor with tools" multi-agent pattern (no langgraph-supervisor dependency — just a ReAct agent whose tools are sub-agents)
Notebook 8 demonstrates LangGraph's Send API for dynamic fan-out/fan-in with a timing comparison showing parallel vs. sequential speedup

Setup Instructions

Using Python 3.11 Virtual Environment

At the time of writing, we need a Python virtual environment with Python 3.11.

Option 1: Python 3.11 is Already Installed

Step 1: Verify Python 3.11 Installation

python3.11 --version

Step 2: Create and activate the virtual environment

python3.11 -m venv .venv
source .venv/bin/activate  # Linux/macOS
# .venv\Scripts\activate   # Windows

Step 3: Install dependencies

pip install -r requirements.txt

Step 4: Configure environment variables

cp .env.example .env
# Edit .env with your API keys

API Keys

All notebooks use OpenRouter as the LLM gateway (via ChatOpenAI with a custom base_url), so you can swap models by changing the model string (e.g., openai/gpt-4.1, anthropic/claude-sonnet-4, etc.).

Web search uses SerpAPI (free tier available).

Embeddings in the RAG notebook use HuggingFace (all-MiniLM-L6-v2) — runs locally, no API key needed.

Reference Repos

oreilly-ai-agents — evaluation, tool selection, positional bias
building-agentic-ai — deep research, multi-agent SDR, policy bots

Instructor

Sinan Ozdemir — LinkedIn | GitHub | AI Office Hours | Building Agentic AI Substack

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
images		images
notebooks		notebooks
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Building AI Agents and Workflows with LangGraph

Overview

Prerequisites

Notebooks

Notebook highlights

Setup Instructions

Using Python 3.11 Virtual Environment

Option 1: Python 3.11 is Already Installed

Step 1: Verify Python 3.11 Installation

Step 2: Create and activate the virtual environment

Step 3: Install dependencies

Step 4: Configure environment variables

API Keys

Reference Repos

Instructor

About

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Building AI Agents and Workflows with LangGraph

Overview

Prerequisites

Notebooks

Notebook highlights

Setup Instructions

Using Python 3.11 Virtual Environment

Option 1: Python 3.11 is Already Installed

Step 1: Verify Python 3.11 Installation

Step 2: Create and activate the virtual environment

Step 3: Install dependencies

Step 4: Configure environment variables

API Keys

Reference Repos

Instructor

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages