Skip to content

Distribute graph TSV enabling remote querying (e.g, with DuckDB) as per Monarch KG #20

@caufieldjh

Description

@caufieldjh

From @cmungall on Slack:

I would be great if KG-Hub was distributed in the same way as the Monarch KG facilitating remote queries; e.g.

$ duckdb


select predicate, count(*) from 'https://data.monarchinitiative.org/monarch-kg/2024-03-18/monarch-kg-denormalized-edges.tsv.gz' group by predicate;

┌────────────────────────────────────────────────────┬──────────────┐
│                     predicate                      │ count_star() │
│                      varchar                       │    int64     │
├────────────────────────────────────────────────────┼──────────────┤
│ biolink:orthologous_to                             │       551239 │
│ biolink:related_to                                 │       179842 │
│ biolink:part_of                                    │        94177 │
│ biolink:actively_involved_in                       │       799846 │
│ biolink:acts_upstream_of_or_within                 │       181004 │
│ biolink:acts_upstream_of_negative_effect           │          174 │
│ biolink:expressed_in                               │      2307867 │
│ biolink:located_in                                 │       504697 │
│ biolink:acts_upstream_of_or_within_negative_effect │          181 │
│ biolink:treats_or_applied_or_studied_to_treat      │         5640 │
│ biolink:interacts_with                             │      2913608 │
│ biolink:colocalizes_with                           │         4051 │
│ biolink:acts_upstream_of_positive_effect           │          509 │
│ biolink:acts_upstream_of_or_within_positive_effect │          507 │
│ biolink:gene_associated_with_condition             │         8004 │
│ biolink:has_phenotype                              │      1171787 │
│ biolink:subclass_of                                │       493276 │
│ biolink:acts_upstream_of                           │         9343 │
│ biolink:has_mode_of_inheritance                    │         8492 │
│ biolink:enables                                    │       845645 │
│ biolink:active_in                                  │       145898 │
│ biolink:contributes_to                             │         7418 │
│ biolink:participates_in                            │       271247 │
│ biolink:causes                                     │         6670 │
├────────────────────────────────────────────────────┴──────────────┤
│ 24 rows                                                 2 columns │
└───────────────────────────────────────────────────────────────────┘

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions