Skip to content

Wordcloud tokenisation#2024

Merged
lukavdplas merged 8 commits into
developfrom
feature/wordcloud-tokenisation
Apr 8, 2026
Merged

Wordcloud tokenisation#2024
lukavdplas merged 8 commits into
developfrom
feature/wordcloud-tokenisation

Conversation

@lukavdplas
Copy link
Copy Markdown
Contributor

@lukavdplas lukavdplas commented Mar 3, 2026

close #2023

This requests the tokenised text for the wordcloud from Elasticsearch, which is more consistent than using the CountVectorizer.

@lukavdplas lukavdplas marked this pull request as ready for review March 9, 2026 10:11
@lukavdplas lukavdplas linked an issue Mar 9, 2026 that may be closed by this pull request
@lukavdplas lukavdplas marked this pull request as draft March 9, 2026 10:11
@lukavdplas lukavdplas changed the title Feature/wordcloud tokenisation Wordcloud tokenisation Mar 9, 2026
Base automatically changed from feature/batch-termvectors-requests to develop March 26, 2026 13:07
@lukavdplas lukavdplas marked this pull request as ready for review March 26, 2026 13:16
@lukavdplas lukavdplas requested a review from Meesch March 26, 2026 13:44
Copy link
Copy Markdown
Contributor

@Meesch Meesch left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice work!

@lukavdplas lukavdplas merged commit e18eba0 into develop Apr 8, 2026
4 checks passed
@lukavdplas lukavdplas deleted the feature/wordcloud-tokenisation branch April 8, 2026 14:25
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Use ES tokenizer in wordcloud

2 participants