Skip to content

Wrong gene annotation for ENSG00000135070 (ISCA1) #1016

@h-escoffier

Description

@h-escoffier

Description

The annotation of gene ENSG00000135070 (ISCA1, iron-sulfur cluster assembly 1) appears to contain incorrect field values for uniprot, hgnc.symbol, and ncbigene. The UniProt accession Q9BUE6 has been placed under the ensembl list, and the uniprot, hgnc.symbol and ncbigene fields contain swapped/incorrect values.

This appears to be present across the files: genes.tsv, Human-GEM.xml, Human-GEM.xlsx.


Current annotation

{
  'sbo': 'SBO:0000243',
  'ensembl': ['ENSG00000135070', 'ENST00000375991.9', 'ENST00000326094.4', 'ENST00000311534.6', 'ENST00000637705.1', 'Q9BUE6'],
  'uniprot': 'ISCA1',
  'hgnc.symbol': '81689',
  'ncbigene': 'iron-sulfur cluster assembly 1'
}

Expected annotation

{
  'sbo': 'SBO:0000243',
  'ensembl': ['ENSG00000135070', 'ENST00000375991.9', 'ENST00000326094.4', 'ENST00000311534.6', 'ENST00000637705.1'],
  'uniprot': 'Q9BUE6',
  'hgnc.symbol': '28660',
  'ncbigene': '81689'
}

Changes

Field Before After
ensembl included Q9BUE6 (UniProt accession) transcript IDs only
uniprot ISCA1 Q9BUE6
hgnc.symbol 81689 28660
ncbigene iron-sulfur cluster assembly 1 81689

References: UniProt | HGNC | NCBI

I have already applied the fix across affected files in a dedicated branch on my fork and can open a pull request if that would be helpful.

Best,
Hugues

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions