Skip to main content

DeepSeek R1, a state-of-the-art open model, is now available. Try it now or read our DeepSeek quickstart!

FireAttention V2: 12x faster to make Long Contexts practical for Online Inference

FireAttention V2: 12x faster to make Long Contexts practical for Online Inference

By Dmytro Ivchenko|6/20/2024

Loading...