Stars
YOLOv12: Attention-Centric Real-Time Object Detectors
MoBA: Mixture of Block Attention for Long-Context LLMs
SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".