Skip to content

docs(registry): 14 novos packs + 3 refreshes (Phase 2 cadence)#10

Merged
lglucas merged 4 commits into
mainfrom
chore/registry-additions-2026-05-09
May 9, 2026
Merged

docs(registry): 14 novos packs + 3 refreshes (Phase 2 cadence)#10
lglucas merged 4 commits into
mainfrom
chore/registry-additions-2026-05-09

Conversation

@lglucas
Copy link
Copy Markdown
Owner

@lglucas lglucas commented May 9, 2026

Summary

  • 14 novos packs cataloged em docs/registry/packs/ seguindo o template padrão de uma página
  • 3 refreshes dos packs já existentes (fincept-terminal, auto-research-claw, ruflo) — bumped Last reviewed + version/star numbers
  • INDEX.md atualizado com 14 novas linhas em ordem alfabética + tag files cross-linkados (9 tags afetados)
  • Session log completo em session-log/2026-05-09-registry-additions.md

Os 14 novos packs

Slug Categoria License
agent-exchange agents / infra / experimental MIT
agent-skills-eval agents / tooling MIT
ascii-draw design / tooling GPL-3.0
claude-code-harness ecosystem / tooling MIT
crawlee tooling / ai / data-engineering Apache-2.0
cryptpad productivity / security AGPL-3.0
googleworkspace-cli agents / productivity / agents-marketplace Apache-2.0
isomiddleearth design / tooling check upstream
jeweledtech-agentic-framework agents / experimental MIT
locally-uncensored ai / tooling AGPL-3.0
mirothinker ai / agents / research Apache-2.0
openhuman agents / productivity / tooling GPL-3.0
quran-database learning / research / dataset check upstream
understand-anything tooling / productivity / ai / ecosystem MIT

Decisões importantes (ver session log pra detalhes)

  • MiroThinker ≠ MiroShark (orgs diferentes, problemas diferentes — coexistem com cross-references)
  • googleworkspace-cli ganhou tag agents-marketplace porque traz 100+ SKILL.md bundled com author signal forte (Google oficial)
  • isomiddleearth tem licença não declarada upstream — flag explícita "view-only reference" no Notes
  • Packs AGPL/GPL flagged consistentemente nos Notes (cryptpad, locally-uncensored, openhuman, fincept-terminal)
  • Tag #dataset introduzida como metadata-only (sem tags/dataset.md próprio ainda)

Test plan

  • Browse docs/registry/INDEX.md e verificar que as 14 linhas estão em ordem alfabética sensata
  • Spot-check 3 packs aleatórios (estrutura do template + tags + license badge)
  • Conferir que os 9 tag files atualizados linkam corretamente pros packs novos
  • (opcional) Rodar os-self-test skill após merge pra validar cross-references

Independente do Sprint 0

Esta PR não toca nada de course/, course/systems/, ou da branch sprint-0-systems-html (PR #9). Pode mergear em qualquer ordem.

🤖 Generated with Claude Code

Summary by CodeRabbit

  • Documentation
    • Added 14 new registry entries (agents, tooling, datasets, collaboration, encryption, scraping, and research resources).
    • Added a session-log entry documenting the additions and workflow.
    • Refreshed metadata for 3 existing registry entries and updated the registry index and tag categorizations.
  • Chores
    • Reformatted the markdown link-checker configuration for the CI workflow.

Phase 2 cadence — repos discovered through social-media browsing,
analyzed and cataloged using the standard one-pager template.

Novos packs:
- agent-exchange (marketplace for AI agents, MIT, experimental)
- agent-skills-eval (A/B test runner for skills, MIT)
- ascii-draw (GTK4 ASCII drawing, GPL-3.0)
- claude-code-harness (Plan-Work-Review-Release, MIT)
- crawlee (web scraping for LLM/RAG, Apache-2.0)
- cryptpad (E2E-encrypted office suite, AGPL-3.0)
- googleworkspace-cli (official Google CLI + 100+ SKILL.md, Apache-2.0)
- isomiddleearth (Next.js 16 reference impl, license unstated)
- jeweledtech-agentic-framework (multi-dept biz orchestration, MIT)
- locally-uncensored (local-first multi-modal desktop, AGPL-3.0)
- mirothinker (deep-research agent on Qwen + MCP, Apache-2.0)
- openhuman (personal AI agent, 118+ integrations, GPL-3.0)
- quran-database (MySQL Quran dataset, public domain text)
- understand-anything (codebase-as-knowledge-graph plugin, MIT)

Refreshed (already in registry):
- fincept-terminal (stars 18.3k -> 20.5k)
- auto-research-claw (stars 11.8k -> 12k)
- ruflo (stars 34.1k -> 47.7k, v3.6.10 -> v3.6.30)

INDEX + tag wiring:
- INDEX.md: 14 new rows, 3 refreshes, "Last index update" bumped
- tags/{ai,agents-marketplace,data-engineering,design,ecosystem,
  learning,productivity,research,security}.md: cross-links added

See session-log/2026-05-09-registry-additions.md for full rationale,
including: MiroThinker vs MiroShark distinction, googleworkspace-cli
as agents-marketplace tag rationale, AGPL flagging discipline.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@coderabbitai
Copy link
Copy Markdown

coderabbitai Bot commented May 9, 2026

Warning

Rate limit exceeded

@lglucas has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 52 minutes and 42 seconds before requesting another review.

You’ve run out of usage credits. Purchase more in the billing tab.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

ℹ️ Review info
⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: becc15f4-123a-46b7-8884-cbb47216e559

📥 Commits

Reviewing files that changed from the base of the PR and between 3719e94 and 28ea3cc.

📒 Files selected for processing (1)
  • .github/workflows/ci.yml
📝 Walkthrough

Walkthrough

This PR expands the registry with 14 new tool packs and refreshes 3 existing entries, adding comprehensive documentation pages, updating the main index, cross-referencing across 9 subject-area tags, and recording the work in a session log.

Changes

Registry Additions — 14 New Packs & 3 Metadata Refreshes

Layer / File(s) Summary
Registry Index
docs/registry/INDEX.md
Index date updated to 2026-05-09 with summary of 14 new and 3 refreshed packs; master pack table expanded with new rows and understand-anything addition.
Pack Documentation
docs/registry/packs/agent-exchange.md, docs/registry/packs/agent-skills-eval.md, docs/registry/packs/ascii-draw.md, docs/registry/packs/claude-code-harness.md, docs/registry/packs/crawlee.md, docs/registry/packs/cryptpad.md, docs/registry/packs/googleworkspace-cli.md, docs/registry/packs/isomiddleearth.md, docs/registry/packs/jeweledtech-agentic-framework.md, docs/registry/packs/locally-uncensored.md, docs/registry/packs/mirothinker.md, docs/registry/packs/openhuman.md, docs/registry/packs/quran-database.md, docs/registry/packs/understand-anything.md, docs/registry/packs/auto-research-claw.md, docs/registry/packs/fincept-terminal.md, docs/registry/packs/ruflo.md
14 new pack pages created with consistent structure (metadata, description, installation guidance, fit signals, conflicts/overlaps, notes). 3 existing packs refreshed with updated star counts and 2026-05-09 last-reviewed dates.
Tag Cross-References
docs/registry/tags/agents-marketplace.md, docs/registry/tags/ai.md, docs/registry/tags/data-engineering.md, docs/registry/tags/design.md, docs/registry/tags/ecosystem.md, docs/registry/tags/learning.md, docs/registry/tags/productivity.md, docs/registry/tags/research.md, docs/registry/tags/security.md
9 tag pages updated to include new pack cross-references organized by functional categories (agents-marketplace, ai, data-engineering, design, ecosystem, learning, productivity, research, security).
Session Log
session-log/2026-05-09-registry-additions.md, session-log/INDEX.md
New session-log entry for 2026-05-09 documenting 14 new packs, 3 refreshed entries, workflow rationale, decisions preserved, and branch/commit plan. INDEX updated with summary row.
CI Workflow
.github/workflows/ci.yml
Lychee link-checker args reformatted from single-line to a folded multi-line YAML block without changing behavior.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~12 minutes

Possibly related PRs

Poem

🐰 Fourteen new tools hop into the warren,
Each with a page, a purpose, a reason to care,
Tags cross-stitch the patchwork with care,
And log entries whisper: "we were here, we saw fair."

🚥 Pre-merge checks | ✅ 5
✅ Passed checks (5 passed)
Check name Status Explanation
Title check ✅ Passed The title clearly describes the main change: adding 14 new registry packs and refreshing 3 existing ones as part of Phase 2 cadence. It directly reflects the primary content of the changeset.
Description check ✅ Passed The description covers the required template sections: Summary (with bullet points), Type of change (marked 'docs'), OS coherence checklist items (session log updated, proper branch usage), Vibe-coder impact (implicit—documentation-only), and Test plan. The description is comprehensive and well-structured.
Docstring Coverage ✅ Passed No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
Linked Issues check ✅ Passed Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check ✅ Passed Check skipped because no linked issues were found for this pull request.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Commit unit tests in branch chore/registry-additions-2026-05-09

Tip

💬 Introducing Slack Agent: The best way for teams to turn conversations into code.

Slack Agent is built on CodeRabbit's deep understanding of your code, so your team can collaborate across the entire SDLC without losing context.

  • Generate code and open pull requests
  • Plan features and break down work
  • Investigate incidents and troubleshoot customer tickets together
  • Automate recurring tasks and respond to alerts with triggers
  • Summarize progress and report instantly

Built for teams:

  • Shared memory across your entire org—no repeating context
  • Per-thread sandboxes to safely plan and execute work
  • Governance built-in—scoped access, auditability, and budget controls

One agent for your entire SDLC. Right inside Slack.

👉 Get started


Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

Copy link
Copy Markdown

@coderabbitai coderabbitai Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 7

🧹 Nitpick comments (1)
docs/registry/packs/mirothinker.md (1)

11-12: ⚡ Quick win

Prefer evidence-neutral wording over “State-of-the-art.”

This superlative is time-sensitive and subjective. Consider replacing with benchmark-grounded phrasing (e.g., “reports strong performance on multiple research benchmarks”).

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@docs/registry/packs/mirothinker.md` around lines 11 - 12, The phrase
"State-of-the-art on multiple research benchmarks." is subjective and
time-sensitive; replace it with an evidence-neutral statement such as "reports
strong performance on multiple research benchmarks" and, where possible, add
specific benchmark names or citations to support the claim (edit the sentence
containing "State-of-the-art on multiple research benchmarks" in the mirothinker
pack description).
🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@docs/registry/packs/cryptpad.md`:
- Line 38: Replace the inline code snippet `cryptpad.fr` with a clickable HTTPS
link so readers can open it directly; specifically update the sentence
containing "Or use the public hosted instance at `cryptpad.fr`" to use a
markdown link like "https://cryptpad.fr" (displaying either the full URL or
"cryptpad.fr") so it renders as a clickable HTTPS link.

In `@docs/registry/packs/isomiddleearth.md`:
- Around line 4-7: Update the top-level metadata so the Status explicitly
encodes the restriction while the License still notes “check upstream”: replace
the current "Status: active (553 stars)" entry with a clear restriction like
"Status: reference-only until upstream license is explicit" (you can keep the
star count in parentheses if desired) and keep the "License: check upstream (not
explicitly stated)" line unchanged; edit the Status field in this document
(isomiddleearth.md) to make the adoption restriction explicit.

In `@docs/registry/packs/quran-database.md`:
- Around line 51-53: Update the three bullet points that currently assert "Quran
text itself is in the public domain", flag the repo license as unclear, and warn
about translations so they avoid absolute legal language: change the first
bullet to say the canonical Arabic text is often treated as public-domain in
many jurisdictions but recommend users verify local law and provenance; modify
the second bullet (currently saying "Software license not explicitly stated") to
instruct readers to check the repository's LICENSE or contact maintainers before
redistributing schema/scripts commercially; and adjust the third bullet to
explicitly state that translation editions may have separate copyrights and must
be checked per-edition before bundling. Ensure the new wording is
jurisdiction-dependent and recommends concrete verification steps.

In `@docs/registry/packs/understand-anything.md`:
- Around line 32-34: Replace the platform-specific shell command "open
https://github.com/lum1104/understand-anything" with a cross-platform plain URL
or a neutral prompt (e.g., "Visit:
https://github.com/lum1104/understand-anything") in the
docs/registry/packs/understand-anything.md file so Linux/Windows users aren't
blocked by the macOS-only open command; update the single-line code block
containing that URL accordingly.

In `@docs/registry/tags/learning.md`:
- Line 21: The registry entry for `quran-database` currently asserts "public
domain"; change that phrase to a neutral, non-definitive note such as "license
pending verification" or "license status under review" so it doesn't assert
final license status; update the table cell text for the `quran-database` row to
reflect neutral wording and, if desired, add a brief parenthetical like
"(upstream license verification required)" to make the uncertainty explicit.

In `@session-log/2026-05-09-registry-additions.md`:
- Line 61: The heading "AGPL-3.0 packs flagged consistently" is misleading
because the list includes GPL-3.0 and AGPL+commercial entries (e.g., cryptpad,
locally-uncensored, openhuman, fincept-terminal); change the heading to
"copyleft-licensed packs flagged consistently" or split into two subheadings
("AGPL-licensed packs" and "GPL-licensed packs") and move each pack (cryptpad,
locally-uncensored, openhuman, fincept-terminal) under the correct subheading so
the license grouping matches the listed pack licenses.
- Line 27: The table row for `quran-database` currently states "public domain"
but the PR marks license verification as pending; update the entry for
`quran-database` (the table row containing the symbol `quran-database`) to a
provisional phrasing such as "provisional — license verification pending" or
"pending verification (provisional)" so the session log matches the PR summary.

---

Nitpick comments:
In `@docs/registry/packs/mirothinker.md`:
- Around line 11-12: The phrase "State-of-the-art on multiple research
benchmarks." is subjective and time-sensitive; replace it with an
evidence-neutral statement such as "reports strong performance on multiple
research benchmarks" and, where possible, add specific benchmark names or
citations to support the claim (edit the sentence containing "State-of-the-art
on multiple research benchmarks" in the mirothinker pack description).
🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

  • Push a commit to this branch (recommended)
  • Create a new PR with the fixes

ℹ️ Review info
⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 16e8c7c4-2c2e-4231-b047-201ab4ed7d52

📥 Commits

Reviewing files that changed from the base of the PR and between 98458b7 and 398f773.

📒 Files selected for processing (29)
  • docs/registry/INDEX.md
  • docs/registry/packs/agent-exchange.md
  • docs/registry/packs/agent-skills-eval.md
  • docs/registry/packs/ascii-draw.md
  • docs/registry/packs/auto-research-claw.md
  • docs/registry/packs/claude-code-harness.md
  • docs/registry/packs/crawlee.md
  • docs/registry/packs/cryptpad.md
  • docs/registry/packs/fincept-terminal.md
  • docs/registry/packs/googleworkspace-cli.md
  • docs/registry/packs/isomiddleearth.md
  • docs/registry/packs/jeweledtech-agentic-framework.md
  • docs/registry/packs/locally-uncensored.md
  • docs/registry/packs/mirothinker.md
  • docs/registry/packs/openhuman.md
  • docs/registry/packs/quran-database.md
  • docs/registry/packs/ruflo.md
  • docs/registry/packs/understand-anything.md
  • docs/registry/tags/agents-marketplace.md
  • docs/registry/tags/ai.md
  • docs/registry/tags/data-engineering.md
  • docs/registry/tags/design.md
  • docs/registry/tags/ecosystem.md
  • docs/registry/tags/learning.md
  • docs/registry/tags/productivity.md
  • docs/registry/tags/research.md
  • docs/registry/tags/security.md
  • session-log/2026-05-09-registry-additions.md
  • session-log/INDEX.md

Comment thread docs/registry/packs/cryptpad.md Outdated
Comment thread docs/registry/packs/isomiddleearth.md
Comment thread docs/registry/packs/quran-database.md Outdated
Comment thread docs/registry/packs/understand-anything.md Outdated
Comment thread docs/registry/tags/learning.md Outdated
Comment thread session-log/2026-05-09-registry-additions.md Outdated
Comment thread session-log/2026-05-09-registry-additions.md Outdated
lglucas and others added 2 commits May 9, 2026 19:31
Domínios que respondem com 999 (LinkedIn), 403 (Perestroika), ou
timeout (Lume UFRGS) consistentemente — não são bugs de link, são
anti-bot policies de hosts grandes que bloqueiam crawlers genéricos.

URLs continuam válidas pra navegação humana. Sem exclusions o CI
falhava em PRs que nem tocavam nesses arquivos (caso da PR #10).

Também adicionado --accept 200..=299,403,429 pra cobrir respostas
defensivas de outros sites no futuro sem nova edição.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
- cryptpad.md: cryptpad.fr inline -> link clicavel
- isomiddleearth.md: Status active -> reference-only ate license explicita
- quran-database.md: linguagem absoluta sobre public domain -> jurisdiction-dependent
- understand-anything.md: open URL (macOS-only) -> Visit URL (cross-platform)
- tags/learning.md: row do quran sem assertiva de public domain
- session-log: alinhado com PR (license pending) + heading copyleft no lugar de AGPL-only

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Copy link
Copy Markdown

@coderabbitai coderabbitai Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In @.github/workflows/ci.yml:
- Around line 62-70: The workflow currently passes --accept '200..=299,403,429'
in the args which globally treats 403/429 as successful; remove that global
--accept for 403/429 and instead add one or both mitigations: use retry/backoff
flags (e.g., add --max-retries and --retry-wait-time to the args) to retry
transient 429/403 responses, and/or stop accepting those codes and add
--exclude-path '.lycheeignore' to maintain a list of known anti-bot/problematic
URLs; if you must keep accepting 403/429, add an inline comment documenting the
risk and consider periodic manual link audits.
🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

  • Push a commit to this branch (recommended)
  • Create a new PR with the fixes

ℹ️ Review info
⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 1d492d79-a2eb-459d-b951-ff7146368c0e

📥 Commits

Reviewing files that changed from the base of the PR and between 398f773 and 3719e94.

📒 Files selected for processing (7)
  • .github/workflows/ci.yml
  • docs/registry/packs/cryptpad.md
  • docs/registry/packs/isomiddleearth.md
  • docs/registry/packs/quran-database.md
  • docs/registry/packs/understand-anything.md
  • docs/registry/tags/learning.md
  • session-log/2026-05-09-registry-additions.md
✅ Files skipped from review due to trivial changes (5)
  • docs/registry/tags/learning.md
  • docs/registry/packs/isomiddleearth.md
  • docs/registry/packs/understand-anything.md
  • docs/registry/packs/quran-database.md
  • docs/registry/packs/cryptpad.md

Comment thread .github/workflows/ci.yml
Aceitar 403/429 globalmente esconde links genuinamente quebrados.
As exclusions especificas (linkedin/lume/perestroika) ja cobrem os
hosts problematicos conhecidos. Mantemos retry com backoff pra
rate-limits transientes mas sem deixar passar quebras reais.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@lglucas lglucas merged commit a0d1951 into main May 9, 2026
2 checks passed
@lglucas lglucas deleted the chore/registry-additions-2026-05-09 branch May 9, 2026 23:01
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant