Move field table to vignette; add app screenshot to README
- Replaces the 25-row field table in README with a brief summary and
link to the rendered vignette, which now carries the full table in
the Python API section
- Adds streamlitapp.png screenshot to README
- Gitignores README.html, README_files/, and *.xlsx to prevent
accidental commits of render artefacts and local files
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
-|`architecture`| str |`decoder-only`, `encoder-decoder`, or `mixture-of-experts`|
-|`license`| str | License name |
-|`open_weights`| bool | Model weights are publicly available |
-|`open_training_data`| bool | Training data is publicly available |
-|`intermediate_checkpoints`| bool | Intermediate training checkpoints have been released |
-|`open_code`| bool | Training code is publicly available |
-|`multilingual`| bool | Officially supports more than one language |
-|`num_languages`| int | Number of officially supported languages |
-|`languages`| list[str]| Names of officially supported languages |
-|`has_instruct_version`| bool | An instruction-tuned variant exists (or the model is itself instruction-tuned) |
-|`model_type`| str | Model release type: `"base"`, `"instruct"`, or `"reasoning"`|
-|`has_think_version`| bool | A chain-of-thought / think variant exists (or the model is itself a reasoning model) |
-|`notes`| str *(optional)*| Additional context; present only for models where extra clarification is needed (e.g. post-trained models where `training_tokens_b` is null for structural reasons) |
-|`foundational_paper`| str | URL of the foundational paper (arXiv for most models; non-arXiv for GPT-J 6B, Grok-1, Mixtral 8x22B, and Sarvam 30B) |
-|`huggingface_id`| str | HuggingFace model identifier |
-|`openness_score`| int | Computed 0–5 score: sum of `open_weights` + `open_training_data` + `intermediate_checkpoints` + `open_code` + permissive license (Apache 2.0 or MIT) |
-
-Languages reflect officially supported languages as documented by the model creators, not partial or limited capabilities (e.g. Falcon supports German, Spanish and French officially, but has only limited capabilities in several other languages which are not included).
+Each model record contains 25 fields covering identity, size, training scale, context window, modality, architecture, license, openness flags, language support, and links to the foundational paper and HuggingFace page. Most records are base models; a small number are instruct or reasoning variants. See the [rendered vignette](https://htmlpreview.github.io/?https://github.com/Programming-The-Next-Step-2026/openllm-selector/blob/week-4/docs/vignette.html) for the full field reference.

 ## Installation

@@ -49,6 +22,8 @@ To run the interactive Streamlit app locally:
-This gives access to the Python API (`load_models`, `filter_models`, and related functions) without any further steps. To run the interactive Streamlit app, clone the repository and launch it from the project root:
+This gives access to the Python API (`load_models`, `filter_models`, and related functions) without any further steps.
+
+To run the interactive Streamlit app, clone the repository and launch it from the project root:
+|`architecture`| str |`decoder-only`, `encoder-decoder`, or `mixture-of-experts`|
+|`license`| str | License name |
+|`open_weights`| bool | Model weights are publicly available |
+|`open_training_data`| bool | Training data is publicly available |
+|`intermediate_checkpoints`| bool | Intermediate training checkpoints have been released |
+|`open_code`| bool | Training code is publicly available |
+|`multilingual`| bool | Officially supports more than one language |
+|`num_languages`| int | Number of officially supported languages |
+|`languages`| list[str]| Names of officially supported languages |
+|`has_instruct_version`| bool | An instruction-tuned variant exists (or the model is itself instruction-tuned) |
+|`model_type`| str | Model release type: `"base"`, `"instruct"`, or `"reasoning"`|
+|`has_think_version`| bool | A chain-of-thought / think variant exists (or the model is itself a reasoning model) |
+|`notes`| str *(optional)*| Additional context; present only for models where extra clarification is needed (e.g. post-trained models where `training_tokens_b` is null for structural reasons) |
+|`foundational_paper`| str | URL of the foundational paper (arXiv for most models; non-arXiv for GPT-J 6B, Grok-1, Mixtral 8x22B, and Sarvam 30B) |
+|`huggingface_id`| str | HuggingFace model identifier |
+|`openness_score`| int | Computed 0–5 score: sum of `open_weights` + `open_training_data` + `intermediate_checkpoints` + `open_code` + permissive license (Apache 2.0 or MIT) |
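
The `openness_score` rule in the table can be illustrated with a small sketch. It is not taken from the package: the function name `compute_openness_score`, the constant names, and the exact license strings (`"Apache 2.0"`, `"MIT"`) are assumptions for illustration, and the package's own implementation and license spellings may differ. The example record is hypothetical.

```python
from typing import Any, Mapping

# Assumption: a license counts as permissive only if its name matches exactly.
PERMISSIVE_LICENSES = {"Apache 2.0", "MIT"}

# The four boolean openness flags listed in the field table.
OPENNESS_FLAGS = (
    "open_weights",
    "open_training_data",
    "intermediate_checkpoints",
    "open_code",
)


def compute_openness_score(record: Mapping[str, Any]) -> int:
    """Return a 0-5 score: one point per true flag, plus one for a permissive license."""
    flag_points = sum(1 for flag in OPENNESS_FLAGS if record.get(flag) is True)
    license_point = 1 if record.get("license") in PERMISSIVE_LICENSES else 0
    return flag_points + license_point


# Hypothetical record, not a real entry from the dataset.
example_record = {
    "license": "Apache 2.0",
    "open_weights": True,
    "open_training_data": False,
    "intermediate_checkpoints": False,
    "open_code": True,
}

print(compute_openness_score(example_record))  # 2 flags + permissive license = 3
```

Missing flags are treated as `False` here, so an empty record scores 0; that is a design choice of this sketch rather than documented behaviour.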