Fix handling of word boundaries in Filter cog by Evanroby · Pull Request #6725 · Cog-Creators/Red-DiscordBot

Evanroby · 2026-04-01T15:26:25Z

Description of the changes

refactors word list pattern generation to use a dedicated _build_word_pattern helper, for a more flexible and accurate word boundary handling

Have the changes in this PR been tested?

Yes

Jackenmen

Using negative lookahead/lookbehind with \w for word boundaries seems like an improvement over \b since, unlike \b, it always checks that the preceding character is not a word character instead of determining the behaviour based on the first/last character of the word.

For example, the current implementation (as seen in stable releases) for a word such as <h> has:

false positives: "x<h>y"
false negatives: " <h> "

This PR handles both of these correctly.

With that said, this case is also handled fine by a simpler:

rf"(?<!\w){re.escape(w)}(?!\w)"

I don't think we should really be deciding whether a word boundary should be checked based on whether the first/last character is a word character. Do you have some concrete cases where this would actually be an improvement rather than just make things possibly more confusing?

Evanroby · 2026-05-21T05:57:22Z

Applied change!

[Filter]: fix for unicode symbols and more

7e5db2d

github-actions Bot added the Category: Cogs - Filter This is related to the Filter cog. label Apr 1, 2026

Jackenmen requested changes May 21, 2026

View reviewed changes

Jackenmen added this to the 3.5.x milestone May 21, 2026

Jackenmen added the Type: Bug Unexpected behavior, result, or exception. In case of PRs, it is a fix for the foregoing. label May 21, 2026

suggested changes applied

8502dd9

Evanroby requested a review from Jackenmen May 21, 2026 05:57

Jackenmen modified the milestones: 3.5.x, 3.5.25 May 23, 2026

Jackenmen approved these changes May 23, 2026

View reviewed changes

Jackenmen changed the title ~~[Filter]: Fix for unicode symbols and more~~ Fix handling of word boundaries in Filter cog May 23, 2026

Jackenmen merged commit 17fea2f into Cog-Creators:V3/develop May 23, 2026
18 checks passed

red-githubbot Bot added the Changelog Entry: Pending Changelog entry for this PR hasn't been added by repo maintainers yet. label May 23, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Fix handling of word boundaries in Filter cog#6725

Fix handling of word boundaries in Filter cog#6725
Jackenmen merged 2 commits into
Cog-Creators:V3/developfrom
Evanroby:filter-boundary

Evanroby commented Apr 1, 2026

Uh oh!

Jackenmen left a comment

Uh oh!

Evanroby commented May 21, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Uh oh!

Conversation

Evanroby commented Apr 1, 2026

Description of the changes

Have the changes in this PR been tested?

Uh oh!

Jackenmen left a comment

Choose a reason for hiding this comment

Uh oh!

Evanroby commented May 21, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants