Consider adding sharding to the Parquet exported BigQuery content #130

@fedorov

As suggested by @mhalle, sharding the exported Parquet files could make queries against the bucket faster, possibly for a significant share of queries. We could shard by collection and perhaps also by modality.

Related discussion with Claude: https://claude.ai/share/88b80074-de62-4553-a02b-d22d331cf5d2
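
To make the proposal concrete, here is a minimal, standard-library-only sketch of what sharding by collection and modality could look like. It assumes a Hive-style `key=value` directory layout and the column names `collection_id` and `Modality`; the sample rows and helper names are hypothetical, and it models only the path layout, not the actual Parquet export.

```python
from collections import defaultdict
from urllib.parse import quote

# Assumed shard columns; the real export may use different names.
SHARD_KEYS = ("collection_id", "Modality")


def shard_dir(row, keys=SHARD_KEYS):
    """Return a Hive-style directory for a row, e.g. 'collection_id=nlst/Modality=CT'."""
    return "/".join(f"{k}={quote(str(row[k]), safe='')}" for k in keys)


def group_rows_by_shard(rows, keys=SHARD_KEYS):
    """Group rows by shard directory; each group would become one or more Parquet files."""
    shards = defaultdict(list)
    for row in rows:
        shards[shard_dir(row, keys)].append(row)
    return dict(shards)


def prune_shards(shard_dirs, **filters):
    """Keep only the shard directories whose key=value segments match every filter."""
    wanted = {f"{k}={quote(str(v), safe='')}" for k, v in filters.items()}
    return [d for d in shard_dirs if wanted <= set(d.split("/"))]


# Hypothetical sample rows, for illustration only.
rows = [
    {"collection_id": "nlst", "Modality": "CT", "SeriesInstanceUID": "1.2.3"},
    {"collection_id": "nlst", "Modality": "CT", "SeriesInstanceUID": "1.2.4"},
    {"collection_id": "nlst", "Modality": "SR", "SeriesInstanceUID": "1.2.5"},
    {"collection_id": "tcga_luad", "Modality": "CT", "SeriesInstanceUID": "1.2.6"},
]

shards = group_rows_by_shard(rows)
ct_only = prune_shards(sorted(shards), Modality="CT")
```

A query that filters on collection or modality would then only need to read the matching directories rather than scanning every file in the bucket, which is where the speedup would come from.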

Labels: enhancement (New feature or request)
