LOEUF compatibility with gnomAD 4.1.1 by likhitha-surapaneni · Pull Request #837 · Ensembl/VEP_plugins

likhitha-surapaneni · 2026-05-07T08:39:49Z

jamie-m-a

There are some issues with the file around RefSeq transcripts, and the current sort drops a line.

jamie-m-a · 2026-05-08T08:02:08Z

+ These files can be tabix-processed by:
+ zcat gnomad.v4.1.1.constraint_metrics.tsv.bgz | (head -n 1 && tail -n +2  | sort -t$'\t' -k 9,9 -k 10,10n ) > loeuf_temp.tsv
+ sed '1s/.*/#&/' loeuf_temp.tsv > loeuf_dataset.tsv
+ bgzip loeuf_dataset.tsv


OK, first up: the sort and the zip can be combined, and the current sort is losing a line. This is better and a bit faster:
zcat gnomad.v4.1.1.constraint_metrics.tsv.bgz | (sed -u 1q; sort -k 9,9 -k 10,10n) | sed '1s/.*/#&/' | bgzip -c - > loeuf_dataset.tsv.bgz
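
A minimal sketch of the header-preserving sort on toy data, which may help when checking that no row is dropped. The helper name `sort_keep_header` and the three-column toy table are assumptions for illustration and are not part of the PR. It takes the header with the shell's `read` rather than `sed -u 1q`, and passes the tab delimiter to `sort` explicitly, as the original command in the diff did.

```shell
# Hypothetical helper (not from the PR): sort a TSV stream by a chromosome
# column and a numeric start column, keeping the header as the first line
# and prefixing it with '#'.
sort_keep_header() {
    chrom_col=$1
    start_col=$2
    tab=$(printf '\t')
    # read takes only the header line from the stream, so every data row
    # is still there for sort.
    IFS= read -r header || return 1
    printf '#%s\n' "$header"
    sort -t "$tab" -k "${chrom_col},${chrom_col}" -k "${start_col},${start_col}n"
}

# Toy input standing in for the constraint table (columns: tx, chrom, start).
toy_sort=$(printf 'tx\tchrom\tstart\nA\tchr2\t50\nB\tchr1\t200\nC\tchr1\t30\n')
sorted=$(printf '%s\n' "$toy_sort" | sort_keep_header 2 3)

# Equal line counts in and out mean nothing was lost.
lines_in=$(printf '%s\n' "$toy_sort" | wc -l | tr -d ' ')
lines_out=$(printf '%s\n' "$sorted" | wc -l | tr -d ' ')
echo "lines in: $lines_in, lines out: $lines_out"
```

For the real file, the helper would presumably sit between `zcat` and `bgzip -c` with columns 9 and 10, matching the sort keys in the diff.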

jamie-m-a · 2026-05-08T08:04:24Z

+ zcat gnomad.v4.1.1.constraint_metrics.tsv.bgz | (head -n 1 && tail -n +2  | sort -t$'\t' -k 9,9 -k 10,10n ) > loeuf_temp.tsv
+ sed '1s/.*/#&/' loeuf_temp.tsv > loeuf_dataset.tsv
+ bgzip loeuf_dataset.tsv
+ tabix -f -s 9 -b 10 -e 11 loeuf_dataset.tsv.gz


However, when you try to tabix either file you run into a bunch of errors, because not every row in the file has chromosome and position data. Basically only the ENSG rows have that; the RefSeq rows have NA, which breaks tabix. We either have to skip the RefSeq entries (grep -v NM* on transcript_id) or insert the correct coordinates for them.
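
As a rough sketch of the "skip" option, rows could be filtered on the coordinate columns themselves rather than on the transcript ID prefix. The helper name `drop_rows_without_coords`, the toy IDs, and the toy column layout are assumptions for illustration; in the real file the chromosome and start columns would be 9 and 10, going by the tabix flags in the diff.

```shell
# Hypothetical helper (not from the PR): keep the header plus every row whose
# chromosome and start columns are both something other than "NA".
drop_rows_without_coords() {
    awk -F '\t' -v c="$1" -v s="$2" 'NR == 1 || ($c != "NA" && $s != "NA")'
}

# Toy input: one Ensembl row with coordinates, one RefSeq row without.
toy_coords=$(printf '#tx\tchrom\tstart\tend\nENST0001\tchr1\t100\t900\nNM_0001\tNA\tNA\tNA\n')
kept=$(printf '%s\n' "$toy_coords" | drop_rows_without_coords 2 3)
printf '%s\n' "$kept"
```

Filtering on the NA values directly would also catch any non-RefSeq row that happens to lack coordinates, which a pattern match on the ID would miss.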


Hi @jamie-m-a, I compared the RefSeq transcripts against the Ensembl IDs to see which are unique: it seems about 282 transcripts are unique to RefSeq.
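
The comment does not say how the comparison was done. One possible way, sketched here under the assumption that rows are matched by gene symbol, is to list non-ENST transcripts whose gene has no ENST row at all. The helper name `refseq_only_transcripts`, the toy IDs, and the two-column layout are hypothetical.

```shell
# Hypothetical helper (not from the PR): print transcript IDs that do not
# start with ENST and whose gene has no ENST transcript row.
refseq_only_transcripts() {
    awk -F '\t' -v g="$1" -v t="$2" '
        NR == 1      { next }                      # skip the header line
        $t ~ /^ENST/ { has_ensembl[$g] = 1; next } # gene is covered by Ensembl
                     { gene_of[$t] = $g }          # remember non-ENST rows
        END {
            for (tx in gene_of)
                if (!(gene_of[tx] in has_ensembl)) print tx
        }
    ' | sort
}

# Toy input: GENE1 has both sources, GENE2 is RefSeq only, GENE3 is Ensembl only.
toy_genes=$(printf 'gene\ttranscript\nGENE1\tENST0001\nGENE1\tNM_0001\nGENE2\tNM_0002\nGENE3\tENST0003\n')
only_refseq=$(printf '%s\n' "$toy_genes" | refseq_only_transcripts 1 2)
printf '%s\n' "$only_refseq"
```

Piping the result through `wc -l` would give a count comparable to the figure quoted above, provided the matching rule is the same one the author used.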

LOEUF compatibility with gnomAD 4.1.1

5672b1d

jamie-m-a self-requested a review May 7, 2026 12:01

jamie-m-a self-assigned this May 7, 2026

jamie-m-a requested changes May 8, 2026

likhitha-surapaneni wants to merge 1 commit into Ensembl:postreleasefix/116 from likhitha-surapaneni:update/LOEUF

likhitha-surapaneni May 11, 2026
