Hi @dinhanhx, we integrated flashattention2 recently. You can learn more about the integration here. Note that not every model supports flashattention2.