Can we convert dictionary-encoded parquet columns to Categorical Dtype? #435

Description

@delucchi-cmu

For a column like band, we know there are only a few possible values. Parquet knows this too, and dictionary-encodes those columns. Can we use this hint to convert those columns to Categorical Dtype and save space in memory?

Are there in-memory savings to be gained?
