Parquet buffered writer is too sensitive regarding unions and unknown fields #1878

@asfimport

Description

Parquet prevents records with unknown union fields from being written, because writing them would create a TProtocol violation. But it also rejects records in which a union's field itself contains an unknown field (which is acceptable if that field is a struct).

The recursive check should be removed: a record should be deemed invalid only when a field of a union is itself unknown.
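
The proposed rule can be illustrated with a minimal, language-neutral sketch. This is not Parquet's actual code: `ThriftType`, `is_writable`, and the dict-based record layout (field id mapped to value) are hypothetical names and structures chosen for illustration. The sketch still walks the record to locate nested unions, but it rejects a record only when an unknown field id belongs directly to a union; unknown fields in a struct are ignored, even when that struct sits inside a union.

```python
from dataclasses import dataclass, field
from typing import Any, Dict


@dataclass
class ThriftType:
    """Hypothetical, simplified schema node (not a Parquet class)."""
    kind: str  # "struct", "union", or "primitive"
    fields: Dict[int, "ThriftType"] = field(default_factory=dict)  # field id -> type


def is_writable(value: Any, schema: ThriftType) -> bool:
    """Return False only if some union in the record carries an unknown field."""
    if schema.kind == "primitive":
        return True
    for field_id, child in value.items():
        child_schema = schema.fields.get(field_id)
        if child_schema is None:
            if schema.kind == "union":
                return False  # unknown union field: record is invalid
            continue  # unknown struct field: safe to skip
        if not is_writable(child, child_schema):
            return False
    return True


# Schema: record { 1: union { 1: struct { 1: primitive } } }
inner = ThriftType("struct", {1: ThriftType("primitive")})
union = ThriftType("union", {1: inner})
record_schema = ThriftType("struct", {1: union})

# The union's field is known; the struct inside it has an extra field 99.
nested_unknown = {1: {1: {1: "x", 99: "extra"}}}
# The union itself is set to a field id the schema does not know.
unknown_union_field = {1: {42: "?"}}

print(is_writable(nested_unknown, record_schema))
print(is_writable(unknown_union_field, record_schema))
```

Under these assumptions, the first record is accepted and the second is rejected, which is the distinction the issue asks for.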

Reporter: Laurent Goujon / @laurentgo

Note: This issue was originally created as PARQUET-362. Please see the migration documentation for further details.
