Skip to content

email / idn-email: scope questions - 4.5.3.1 length limits and control / noncharacter codepoints #948

Description

@vtushar06

Two scope questions came up while adding idn-email tests. Both are about what the email / idn-email format should and should not check, and I would rather get a ruling than guess.

1. RFC 5321 section 4.5.3.1 length limits

The email format is the "Mailbox" ABNF (RFC 5321 section 4.1.2; RFC 6531 section 3.3 for idn-email). That grammar has no length limits. The 64-octet local part, 255-octet domain and 63-octet label limits live in section 4.5.3.1, which is about transport size limits, not the Mailbox grammar.

My reading is that format validation should follow the grammar and not enforce the 4.5.3.1 limits - a 65-octet local part would still be a valid email per section 4.1.2. Is that the intended scope, or should the suite test the 4.5.3.1 limits?

2. Control and noncharacter codepoints in the local part (idn-email)

RFC 6531 extends the local part with UTF8-non-ascii (any well-formed non-ASCII), and the "MUST NOT contain any of the ASCII graphics or control characters" clause only bars ASCII controls. By that text, a C1 control (U+0085) or a noncharacter (U+FFFF) is permitted in the local part.

RFC 6531 errata 4996 (Held) questions whether that clause was meant to exclude the 128-255 range too, so the wording is not fully settled. Should idn-email treat a C1 control / noncharacter in the local part as valid (the literal grammar reading) or invalid?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Fields

    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions