Commit 17a318a
committed
fix(deps): remove nltk to resolve CVE-2026-54293
No patched version of nltk is available for the URL-encoded path
traversal vulnerability (CVE-2026-54293). Remove it by:
- Replacing UnstructuredHTMLLoader (unstructured -> nltk) with
BSHTMLLoader (beautifulsoup4) in process_html.py
BSHTMLLoader (beautifulsoup4) in process_html.py
- Removing unstructured==0.18.18 and nltk==3.9.4 from pyproject.toml
- Promoting beautifulsoup4 from dev to main dependencies
- Deleting the now-unnecessary post_install.py NLTK data downloader
- Removing the post_install.py step from both Dockerfiles
Signed-off-by: Jack Luar <jluar@precisioninno.com>1 parent f266ead commit 17a318a
6 files changed
Lines changed: 1956 additions & 2257 deletions
File tree
- backend
- src
- tools
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
20 | 20 | | |
21 | 21 | | |
22 | 22 | | |
23 | | - | |
| 23 | + | |
24 | 24 | | |
25 | 25 | | |
26 | 26 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
21 | 21 | | |
22 | 22 | | |
23 | 23 | | |
24 | | - | |
25 | | - | |
| 24 | + | |
26 | 25 | | |
27 | 26 | | |
28 | 27 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
29 | 29 | | |
30 | 30 | | |
31 | 31 | | |
32 | | - | |
33 | | - | |
| 32 | + | |
34 | 33 | | |
35 | 34 | | |
36 | 35 | | |
| |||
50 | 49 | | |
51 | 50 | | |
52 | 51 | | |
53 | | - | |
| 52 | + | |
54 | 53 | | |
55 | 54 | | |
56 | 55 | | |
57 | 56 | | |
58 | | - | |
59 | 57 | | |
60 | 58 | | |
61 | 59 | | |
| |||
This file was deleted.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
6 | 6 | | |
7 | 7 | | |
8 | 8 | | |
9 | | - | |
| 9 | + | |
10 | 10 | | |
11 | 11 | | |
12 | 12 | | |
| |||
43 | 43 | | |
44 | 44 | | |
45 | 45 | | |
46 | | - | |
| 46 | + | |
47 | 47 | | |
48 | 48 | | |
49 | 49 | | |
| |||
0 commit comments