chore: broaden .gitignore to stop committing runtime DB artifacts#1313
Open
kulcsarrudolf wants to merge 1 commit into
Open
chore: broaden .gitignore to stop committing runtime DB artifacts#1313kulcsarrudolf wants to merge 1 commit into
kulcsarrudolf wants to merge 1 commit into
Conversation
b46f109 to
ed4b91e
Compare
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
First, a big thank you to Ed for The Complete Agentic AI Engineering Course (2025). I learned a lot from it. I already built my own agents, and I am still working through the material. I also plan to check out the other courses on your list next, because the way you explain things (clear, practical, and hands-on) really works for me. Thanks for putting this together.
With that said, here is a small contribution back. When I first cloned the repo, it was noticeably slow. I looked into why, and found a lot of runtime-generated files adding to the size. That is what motivated this PR.
Problem
Cloning the repo pulls down ~97 MB of git data (working tree ~211 MB). A big chunk of that comes from runtime-generated database and binary files that contributors committed over time:
chroma.sqlite3,data_level0.bin,header.bin,length.bin,link_lists.bin*.db,*.db-wal,*.db-shm,*.sqlite*.wavThe existing
.gitignoreonly matched a narrow set of exact names (memory.db,memory.db-wal,memory.db-shm,6_mcp/accounts.db,6_mcp/memory/*.db), so contributors kept committing their local runtime state without realizing it.Changes
This PR is intentionally non-destructive:
.gitignoreto catch the patterns above going forward.git rm --cached, no file deletions. Existing forks, clones, and open PRs stay unaffected. Nothing disappears from anyone's working tree on pull.Effect: New contributions stop adding DB/vector-store artifacts. The repo stops growing from this class of mistake.
Why not also untrack existing files?
A
git rm --cachedsweep would:Given that the course has a large contributor base with active forks, the right sequencing is:
.gitignore(safe, no one notices).git filter-repo/ BFG) to actually shrink clones. This is disruptive for all forks and needs coordination.Test plan
.gitignorepatterns verified against representative tracked paths (3_crew/stock_picker/memory/chroma.sqlite3,4_langgraph/memory.db,*.wavunderNLP_Agent_Dinesh_Uthayakumar).git statuson a clean checkout shows no unintended deletions.