Skip to content

Latest commit

 

History

History
40 lines (27 loc) · 1.4 KB

File metadata and controls

40 lines (27 loc) · 1.4 KB

BEDbase loader

This repository contains GitHub Actions workflows for automated BED file uploads to BEDbase.

For complete documentation on the data loading architecture, see: BEDbase Data Loading

Accbase Loading

Accbase workflows queue ATAC-seq samples for downstream processing on Rivanna HPC.

Workflow Schedule Purpose
accbase_queue_cron.yml Daily 19:00 UTC Queue new samples from accbase namespace
accbase_queue_gse.yml Manual Queue samples from a specific GSE

Unlike BEDbase (which processes files in GitHub Actions + Fargate), Accbase only queues samples in the sample_queue table. Heavy processing (PEPATAC pipeline) runs on Rivanna HPC.

Configuration: accbase_config.yaml

Required Secrets

For Accbase workflows:

  • ACCBASE_POSTGRES_HOST - PostgreSQL host for accbase
  • ACCBASE_POSTGRES_USER - Database user
  • ACCBASE_POSTGRES_PASSWORD - Database password
  • PEPHUB_API_KEY - Optional, for higher rate limits

Queue Consumer

The sample_queue table is consumed on Rivanna:

  1. Consumer queries for samples with status='pending'
  2. Generates looper submission from PEPhub PEP
  3. Submits SLURM jobs via looper
  4. Updates status to 'processing'
  5. On completion, updates status to 'completed'

This decouples data discovery (GitHub Actions) from heavy processing (Rivanna HPC).