Skip to content

RegExp Unicode property escapes missing codepoints from Unicode v16/v17 #5377

@Oichkatzelesfrettschen

Description

@Oichkatzelesfrettschen

The Unicode property tables in boa are out of date. Codepoints added in Unicode v16 and v17 are not matched by \p{} property escapes.

Repro:

// U+001C8A GEORGIAN MTAVRULI CAPITAL LETTER AN -- added Unicode v16
/\p{General_Category=Lowercase_Letter}/u.test('\u{1C8A}'); // expected true, got false

// U+2B738 CJK UNIFIED IDEOGRAPH-2B738 -- added Unicode v16
/\p{Script=Han}/u.test('\u{2B738}'); // expected true, got false

Expected: true
Got: false

The boa_unicode property tables need regeneration from Unicode v17 data (current Unicode version as of 2025).

Failing test262: built-ins/RegExp/property-escapes/ (150 cases)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions