Add embed-openclip-rn101-yfcc15m model (FP16, 86-tag default vocab) by andriiryzhkov · Pull Request #38 · darktable-org/darktable-ai

andriiryzhkov · 2026-06-23T08:50:59Z

Adds embed-openclip-rn101-yfcc15m – a ResNet-101 image embedder for tag suggestion and image-similarity search.

Why this one and not a stronger CLIP variant: every other CLIP training corpus (LAION, WIT-400M, DataComp, MetaCLIP, WebLI) is a web scrape with no per-image consent. YFCC15M is 15M Flickr photos uploaded under Creative Commons – the one option that meets the project's consent-based training-data criterion. The cost is a lower benchmark score (~31% ImageNet zero-shot vs ~67% for LAION ViT-B-32), but in actual photo-library use the gap is much smaller than that number suggests.

Ships model.onnx (60 MB FP16, mean/std + L2 norm baked in) plus tags.json – 86 precomputed centroids for cold-start tag suggestions before users have enough data of their own. Text encoder runs at convert time only, not shipped.

Add embed-openclip-rn101-yfcc15m model (FP16, 86-tag default vocab)

7e18cc4

andriiryzhkov force-pushed the open_clip_yfcc branch from c2c86fc to 7e18cc4 Compare June 25, 2026 07:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add embed-openclip-rn101-yfcc15m model (FP16, 86-tag default vocab)#38

Add embed-openclip-rn101-yfcc15m model (FP16, 86-tag default vocab)#38
andriiryzhkov wants to merge 1 commit into
darktable-org:masterfrom
andriiryzhkov:open_clip_yfcc

andriiryzhkov commented Jun 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

andriiryzhkov commented Jun 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant