Skip to content

Commit f90d687

Browse files
doc: adapt read me
1 parent 997a4f8 commit f90d687

1 file changed

Lines changed: 2 additions & 2 deletions

File tree

README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -10,8 +10,8 @@ A unified, extensible framework for text classification with categorical variabl
1010
- **ValueEncoder**: Pass raw string categorical values and labels directly — no manual integer encoding required. Build a `ValueEncoder` from `DictEncoder` or sklearn `LabelEncoder` instances once, and the wrapper handles encoding at train time and label decoding after prediction automatically.
1111
- **Unified yet highly customizable**:
1212
- Use any tokenizer from HuggingFace or the original fastText's ngram tokenizer.
13-
- Manipulate the components (`TextEmbedder`, `CategoricalVariableNet`, `ClassificationHead`) to easily create custom architectures - including **self-attention**. All of them are `torch.nn.Module` !
14-
- The `TextClassificationModel` class combines these components and can be extended for custom behavior.
13+
- Text embedding is split into two composable stages: **`TokenEmbedder`** (token → per-token vectors, with optional self-attention) and **`SentenceEmbedder`** (aggregation: mean / first / last / label attention). Combine them with `CategoricalVariableNet` and `ClassificationHead` — all are `torch.nn.Module`.
14+
- The `TextClassificationModel` class assembles these components and can be extended for custom behavior.
1515
- **Multiclass / multilabel classification support**: Support for both multiclass (only one label is true) and multi-label (several labels can be true) classification tasks.
1616
- **PyTorch Lightning**: Automated training with callbacks, early stopping, and logging
1717
- **Easy experimentation**: Simple API for training, evaluating, and predicting with minimal code:

0 commit comments

Comments
 (0)