BGC-Finder is a context-aware deep learning framework that leverages protein language models to predict and annotate biosynthetic gene clusters (BGCs).
- 📄 Preprint: [bioRxiv](https://www.biorxiv.org/content/10.1101/2025.04.29.651206v1)
- 🧠 Model Weights: Hugging Face
You can annotate your own GenBank or FASTA files online via Google Colab:
👉 Colab Notebook for Annotation
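
Before uploading a file, it can help to confirm which of the two formats it is in. The snippet below is a minimal sketch and not part of BGC-Finder: the helper names `sniff_sequence_format` and `count_fasta_records` are hypothetical. It assumes the format can be recognized from the first non-blank line, since FASTA records start with `>` and GenBank flat files start with a `LOCUS` line.

```python
def sniff_sequence_format(path):
    """Guess 'fasta', 'genbank', or 'unknown' from the first non-blank line.

    Hypothetical helper for illustration; it does not validate the whole file.
    """
    with open(path, "r", encoding="utf-8", errors="replace") as handle:
        for line in handle:
            stripped = line.strip()
            if not stripped:
                continue  # skip leading blank lines
            if stripped.startswith(">"):
                return "fasta"
            if stripped.startswith("LOCUS"):
                return "genbank"
            return "unknown"
    return "unknown"  # empty file


def count_fasta_records(path):
    """Count FASTA header lines, i.e. the number of sequences or contigs."""
    with open(path, "r", encoding="utf-8", errors="replace") as handle:
        return sum(1 for line in handle if line.startswith(">"))
```

A call such as `sniff_sequence_format("my_genome.fasta")` (the file name is a placeholder) returns the guessed format as a string, which is enough to catch a mislabeled file before starting an annotation run.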
To run BGC-Finder on your own FASTA-formatted genomes or contigs, follow the step-by-step guide:
| Name | Email | Position and Affiliation |
|---|---|---|
| Zixin Kang | 29590kang@gmail.com | Graduate Student, School of Life Science and Technology, HUST |
| Haohong Zhang | haohongzh@gmail.com | PhD Student, School of Life Science and Technology, HUST |
| Kang Ning | ningkang@hust.edu.cn | Professor, School of Life Science and Technology, HUST |