## Description Flash Attention 2 is a library that provides attention operation kernels for faster and more memory efficient inference and training: ## References - [list known implementations](https://github.com/Dao-AILab/flash-attention)
Description
Flash Attention 2 is a library that provides attention operation kernels for faster and more memory efficient inference and training:
References