Skip to content

describegpt: make it CKAN-aware #2943

@jqnatividad

Description

@jqnatividad

So users can customize LLM prompts to refer to associated CKAN instances:

  • to fetch the Dataset from CKAN if it's not available locally
  • to use existing Data Dictionaries, Summary Stats and Frequency tables (in case they have DP+ with the DRUF installed)
  • to issue catalog searches
  • to fetch for additional metadata context (both DCAT 3 and Croissant)
  • to check for related datasets (e.g. while looking up an NYC address; check NYPD COMPstat data)
  • to check and use controlled vocabularies
  • to check for existing tags, themes and groups
  • to get lookup tables, and enrich/normalize LLM answers from these tables (e.g. with the Building ID number, get the address; with an NYC address, get associated data (e.g. Community Board, Borough, Police and Fire Precincts, Congressional District, Census tables, etc.) )
  • to check private datasets, if provided with the requisite CKAN token

Preferably, via MCP and/or tool-use.

Metadata

Metadata

Assignees

No one assigned

    Labels

    CKANinteroperability with CKAN Data Management SystemCroissantmetadata standard for describing ML datasetsDCAT3metadata standardDRUFfor Data Resource Upload First workflowdatapusher+for Datapusher+enhancementNew feature or request. Once marked with this label, its in the backlog.

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions