Skip to content

lexer: encode \u escapes using locale encoding#51

Open
Franklin-Qi wants to merge 1 commit into
uutils:mainfrom
Franklin-Qi:feature-task#40-Lexer-Numeric-escaping-u-for-different-locales
Open

lexer: encode \u escapes using locale encoding#51
Franklin-Qi wants to merge 1 commit into
uutils:mainfrom
Franklin-Qi:feature-task#40-Lexer-Numeric-escaping-u-for-different-locales

Conversation

@Franklin-Qi

Copy link
Copy Markdown
Contributor

Detect charset from LC_ALL/LC_CTYPE/LANG and encode \u sequences into the locale multibyte encoding (UTF-8, ISO-8859-1, ASCII-only for C/POSIX), matching gawk. Unrepresentable or invalid code points become '?'.

Closes: #40

Detect charset from LC_ALL/LC_CTYPE/LANG and encode \u sequences into
the locale multibyte encoding (UTF-8, ISO-8859-1, ASCII-only for C/POSIX),
matching gawk. Unrepresentable or invalid code points become '?'.

Closes: uutils#40
@Alonely0

Copy link
Copy Markdown
Collaborator

Super nice, tysm! Let me review this on Friday, I'm a bit busy this week.

@Alonely0 Alonely0 self-requested a review June 17, 2026 09:46
@Alonely0

Copy link
Copy Markdown
Collaborator

Do you mind solving the merge conflicts created after merging the other tests? Thanks in advance.

@Franklin-Qi

Copy link
Copy Markdown
Contributor Author

Thank you for the review. I will address this conflict.

Super nice, tysm! Let me review this on Friday, I'm a bit busy this week.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Lexer: Numeric escaping \u, for different locales

2 participants