Skip to content

Commit d65a614

Browse files
LLM Zoomcamp: restructure for 2026 syllabus + Slack-derived FAQs
- Restructure modules to match the 2026 syllabus: module-2 is now Agents (was Vector Search), module-3 Vector Search (was Evaluation), module-4 Evaluation (was Monitoring), module-5 Monitoring (new), module-6 Best Practices. Old workshops-agents folded into module-2 (Agents) - Drop tooling that's no longer in the course: Qdrant content (not sponsoring this year), Elasticsearch troubleshooting (replaced), Arize/Phoenix (not used), Anaconda-specific install advice (uv is recommended now), 5 placeholder "Question" stubs, minsearch AppendableIndex (fixed in current minsearch) - Add 12 technical FAQs from #course-llm-zoomcamp Slack threads (filtered to genuinely-technical-and-not-already-covered): - module-1: OpenAI 429 quota -> Groq/Gemini/Ollama; minsearch wget->raw syntax error; non-OpenAI token counting; direnv for API keys - module-2 (Agents): asyncio in Jupyter; non-OpenAI tool calling; OpenAI Responses API on non-OpenAI providers - module-3 (Vector Search): question vs answer vs both embedding strategy - module-4 (Evaluation): JSONDecodeError on ground truth; Jupyter kernel crash; Gemini rate limits - project: orchestration tool optional; recommender RAG eval; chunking strategy; reproducibility (no secrets in repo) - Renumber sort_order sequentially within each section ML Zoomcamp: drop Saturn Cloud content - Remove 7 module-8 FAQs that were Saturn Cloud-specific (no longer sponsored). Update general/012 (special hardware) and misc/015 (free cloud alternatives) to drop Saturn Cloud references.
1 parent 52649f4 commit d65a614

96 files changed

Lines changed: 356 additions & 703 deletions

File tree

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

_questions/llm-zoomcamp/_metadata.yaml

Lines changed: 16 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -3,19 +3,27 @@ course_name: "LLM Zoomcamp"
33
sections:
44
- id: general
55
name: "General Course-Related Questions"
6+
comment: "Course logistics: cohort schedule, certificate, deadlines, leaderboard, project rules. Technical questions belong in the module sections."
67
- id: module-1
7-
name: "Module 1: Introduction"
8+
name: "Module 1: Introduction to LLMs and RAG"
9+
comment: "Basics of LLMs and RAG, OpenAI API, text search. Includes setup-related issues (API keys, ElasticSearch, Ollama, etc.)"
810
- id: module-2
9-
name: "Module 2: Vector Search"
11+
name: "Module 2: Agents"
12+
comment: "Adding agentic functionality to RAG, function calling, MCP, tool use"
1013
- id: module-3
11-
name: "Module 3: Evaluation"
14+
name: "Module 3: Vector Search"
15+
comment: "Vector search, embeddings, indexing, Qdrant, semantic search"
1216
- id: module-4
13-
name: "Module 4: Monitoring"
17+
name: "Module 4: Evaluation"
18+
comment: "Search evaluation, online vs offline evaluation, LLM as a judge, hit rate / MRR / ROUGE"
19+
- id: module-5
20+
name: "Module 5: Monitoring"
21+
comment: "Online evaluation techniques, Phoenix, dashboards, monitoring user feedback"
1422
- id: module-6
15-
name: "Module 6: X"
23+
name: "Module 6: Best Practices"
24+
comment: "Hybrid search, document reranking, advanced RAG techniques"
1625
- id: project
1726
name: "Capstone Project"
1827
- id: workshops-dlthub
19-
name: "Workshops: dlthub"
20-
- id: workshops-agents
21-
name: "Workshops: Agents"
28+
name: "Workshop: Open-Source Data Ingestion (dlt)"
29+
comment: "Workshop on using dlt for ingesting data into RAG systems"

_questions/llm-zoomcamp/general/013_de32098d49_what-other-alternatives-to-elasticsearch-are-there.md

Lines changed: 0 additions & 19 deletions
This file was deleted.

_questions/llm-zoomcamp/module-1/007_66ae44829d_elasticsearch-error-badrequesterror-badrequesterro.md

Lines changed: 0 additions & 51 deletions
This file was deleted.

_questions/llm-zoomcamp/module-1/012_8d23fe792f_authentication-safe-and-easy-way-to-store-and-load.md renamed to _questions/llm-zoomcamp/module-1/007_8d23fe792f_authentication-safe-and-easy-way-to-store-and-load.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
22
id: 8d23fe792f
33
question: 'Authentication: Safe and easy way to store and load API keys'
4-
sort_order: 12
4+
sort_order: 7
55
---
66

77
You can store your different API keys in a YAML file that you will add to your `.gitignore` file. Be careful to never push or share this file.

_questions/llm-zoomcamp/module-1/013_233dabe430_how-to-store-and-load-api-keys-using-env-file.md renamed to _questions/llm-zoomcamp/module-1/008_233dabe430_how-to-store-and-load-api-keys-using-env-file.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
22
id: 233dabe430
33
question: How to store and load API keys using .env file
4-
sort_order: 13
4+
sort_order: 8
55
---
66

77
Store the API key in a `.env` file, then use the following steps to load it:

_questions/llm-zoomcamp/module-1/008_a5a76b7c76_fix-badrequesterror-badrequesterror400-media_type_.md

Lines changed: 0 additions & 20 deletions
This file was deleted.

_questions/llm-zoomcamp/module-1/009_2db8d0cd4d_elasticsearch-error-elasticsearch-exited-unexpecte.md

Lines changed: 0 additions & 42 deletions
This file was deleted.

_questions/llm-zoomcamp/module-1/014_86d99bbf21_authentication-why-is-my-openai_api_key-not-found.md renamed to _questions/llm-zoomcamp/module-1/009_86d99bbf21_authentication-why-is-my-openai_api_key-not-found.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
22
id: 86d99bbf21
33
question: 'Authentication: Why is my OPENAI_API_KEY not found in the Jupyter notebook?'
4-
sort_order: 14
4+
sort_order: 9
55
---
66

77

_questions/llm-zoomcamp/module-1/010_eb4378abaa_elasticsearch-error-elasticsearchindex-got-an-unex.md

Lines changed: 0 additions & 8 deletions
This file was deleted.

_questions/llm-zoomcamp/module-1/015_fe8fed31e6_opensource-i-am-using-groq-and-it-doesnt-provide-a.md renamed to _questions/llm-zoomcamp/module-1/010_fe8fed31e6_opensource-i-am-using-groq-and-it-doesnt-provide-a.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@ id: fe8fed31e6
33
question: 'OpenSource: I am using Groq, and it doesn''t provide a tokenizer library
44
based on my research. How can we estimate the number of OpenAI tokens asked in homework
55
question 6?'
6-
sort_order: 15
6+
sort_order: 10
77
---
88

99
The question asks for the number of tokens in the GPT-4o model. `tiktoken` is a Python library that can be used to get the number of tokens. You don't need an OpenAI API key to get the number of tokens. You can use the code provided in the question to get the number of tokens.

0 commit comments

Comments
 (0)