Skip to content

Prepare keyword extraction dataset #211

@ktagowski

Description

@ktagowski
  • Parse metadata and extract keywords / subjects:
  • Create initial HF dataset with documents and extracted keywords
  • Check whether keywords are extracted from text (explicit keywords) and keywords which do not appears explicitly in text. (Option to load by default explicit keywords)
  • Add keywords assigned by Clarin

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions