Commit f326819

update docs and bumped version
Rustem committed
1 parent 78cce51 commit f326819

5 files changed

Lines changed: 8 additions & 2 deletions

MANIFEST.in

Lines changed: 1 addition & 0 deletions
@@ -1,6 +1,7 @@
 # Include documentation
 include README.md
 include LICENSE
+recursive-include docs *.png *.jpg *.jpeg *.gif
 
 # Include package data
 recursive-include textnano *.py
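
The new `recursive-include docs ...` rule selects image files anywhere under `docs/` for the source distribution. The sketch below approximates that selection with shell-style matching on the file name; the helper `matched_by_rule` is hypothetical and is not how setuptools implements the rule internally.

```python
from fnmatch import fnmatchcase
from pathlib import PurePosixPath

# Patterns copied from the added MANIFEST.in line.
PATTERNS = ["*.png", "*.jpg", "*.jpeg", "*.gif"]


def matched_by_rule(path: str, directory: str = "docs") -> bool:
    """Approximate `recursive-include <directory> <patterns>` (hypothetical helper)."""
    p = PurePosixPath(path)
    # The file must live somewhere below `directory`, at any depth.
    inside = len(p.parts) > 1 and p.parts[0] == directory
    # Only the file name is compared against each pattern.
    return inside and any(fnmatchcase(p.name, pat) for pat in PATTERNS)


for candidate in ["docs/reddit-crawling.png", "docs/img/a.gif",
                  "docs/notes.md", "textnano/logo.png"]:
    print(candidate, matched_by_rule(candidate))
```

Under this approximation, `docs/reddit-crawling.png` is selected, while Markdown files in `docs/` and images outside `docs/` are not.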

README.md

Lines changed: 5 additions & 0 deletions
@@ -117,6 +117,11 @@ textnano urls wikipedia_urls.txt wiki_dataset/
 textnano reddit reddit_urls/ --output reddit_urls.txt --max 5000
 # 3. Build dataset:
 textnano urls reddit_urls.txt reddit_dataset/
+```
+
+![Reddit URL Extraction Example](docs/reddit-crawling.png)
+
+```bash
 
 # Project Gutenberg
 # 1. Generate URLs (checks each book ID):

docs/reddit-crawling.png

681 KB

pyproject.toml

Lines changed: 1 addition & 1 deletion
@@ -4,7 +4,7 @@ build-backend = "setuptools.build_meta"
 
 [project]
 name = "textnano"
-version = "0.2.0"
+version = "0.2.1"
 description = "A minimal text dataset builder inspired by lazynlp. Perfect for ML students who just want clean text datasets quickly."
 readme = "README.md"
 requires-python = ">=3.10"

setup.py

Lines changed: 1 addition & 1 deletion
@@ -12,7 +12,7 @@
 
 setup(
     name="textnano",
-    version="0.2.0",
+    version="0.2.1",
     description="Minimal text dataset builder - Zero dependencies, single file, perfect for ML students",
     long_description=long_description,
     long_description_content_type="text/markdown",
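
This commit bumps the version in both `pyproject.toml` and `setup.py`, so the two strings have to be kept in sync by hand. A minimal sketch of a consistency check follows, assuming each file carries a single `version = "..."` assignment; the helpers `find_version` and `versions_agree` are hypothetical and not part of textnano.

```python
import re
from typing import Optional

# Matches `version = "0.2.1"` (TOML) as well as `version="0.2.1",` (setup.py).
VERSION_RE = re.compile(r'^\s*version\s*=\s*["\']([^"\']+)["\']', re.MULTILINE)


def find_version(text: str) -> Optional[str]:
    """Return the first version string found in the text, or None."""
    match = VERSION_RE.search(text)
    return match.group(1) if match else None


def versions_agree(pyproject_text: str, setup_text: str) -> bool:
    """True only when both texts declare the same version string."""
    first = find_version(pyproject_text)
    second = find_version(setup_text)
    return first is not None and first == second


# Inline samples mirroring the two hunks in this commit.
pyproject_text = '[project]\nname = "textnano"\nversion = "0.2.1"\n'
setup_text = 'setup(\n    name="textnano",\n    version="0.2.1",\n)\n'
print(versions_agree(pyproject_text, setup_text))
```

In a real checkout the two strings would be read from the files on disk; the inline samples keep the sketch self-contained.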
