You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A unified, extensible framework for text classification built on [PyTorch](https://pytorch.org/) and [PyTorch Lightning](https://lightning.ai/docs/pytorch/stable/).
3
+
A unified, extensible framework for text classification with categorical variables built on [PyTorch](https://pytorch.org/) and [PyTorch Lightning](https://lightning.ai/docs/pytorch/stable/).
4
4
5
5
## 🚀 Features
6
6
7
-
-**Unified API**: Consistent interface for different classifier wrappers
8
-
-**Extensible**: Easy to add new classifier implementations through wrapper pattern
9
-
-**FastText Support**: Built-in FastText classifier with n-gram tokenization
10
-
-**Flexible Preprocessing**: Each classifier can implement its own text preprocessing approach
7
+
-**Mixed input support**: Handle text data alongside categorical variables seamlessly.
8
+
-**Unified yet highly customizable**:
9
+
- Use any tokenizer from HuggingFace or the original fastText's ngram tokenizer.
10
+
- Manipulate the components (`TextEmbedder`, `CategoricalVariableNet`, `ClassificationHead`) to easily create custom architectures - including **self-attention**. All of them are `torch.nn.Module` !
11
+
- The `TextClassificationModel` class combines these components and can be extended for custom behavior.
11
12
-**PyTorch Lightning**: Automated training with callbacks, early stopping, and logging
13
+
-**Easy experimentation**: Simple API for training, evaluating, and predicting with minimal code:
14
+
- The `torchTextClassifiers` wrapper class orchestrates the tokenizer and the model for you
15
+
-**Additional features**: explainability using Captum
0 commit comments