LinB203
🎯 Focusing

Organizations

@PKU-YuanGroup


DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

Python 681 35 Updated Nov 13, 2024

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Python 1,412 105 Updated Nov 13, 2024

The Dawn of Video Generation: Preliminary Explorations with SORA-like Models

116 4 Updated Nov 12, 2024

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 963 44 Updated Nov 13, 2024

The official CLIP training codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss". A highly memory-efficient CLIP training scheme.

Python 168 7 Updated Oct 30, 2024

The best OSS video generation models

Python 1,935 194 Updated Nov 13, 2024
Python 636 39 Updated Nov 12, 2024

The paper collections for the autoregressive models in vision.

166 5 Updated Nov 12, 2024

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 572 40 Updated Oct 31, 2024

Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen

327 18 Updated Oct 19, 2024

MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts

Python 133 3 Updated Oct 16, 2024

MoH: Multi-Head Attention as Mixture-of-Head Attention

Python 150 5 Updated Oct 29, 2024

Code of Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 2,249 215 Updated Nov 13, 2024
Python 20 Updated Oct 9, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 3,057 185 Updated Oct 4, 2024

Next-Token Prediction is All You Need

Python 1,806 70 Updated Oct 24, 2024

A collection of awesome video generation studies.

TeX 340 13 Updated Nov 12, 2024

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 686 54 Updated Nov 11, 2024

Official code for VEnhancer: Generative Space-Time Enhancement for Video Generation

Python 458 25 Updated Sep 16, 2024

Mainly records multimodal-related knowledge for large language model (LLM) algorithm (application) engineers

HTML 83 1 Updated May 12, 2024

PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 700 26 Updated Nov 11, 2024

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,018 44 Updated Nov 11, 2024

[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models

Python 490 23 Updated Jan 18, 2024

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 995 55 Updated Sep 27, 2024

Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"

Python 497 21 Updated Aug 16, 2024

✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM

Python 955 58 Updated Oct 24, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,833 112 Updated Jul 29, 2024

Official implementation of Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle

181 7 Updated Aug 10, 2024

Video-LLaVA fine-tune for CinePile evaluation

Jupyter Notebook 38 4 Updated Aug 8, 2024

Scaling Diffusion Transformers with Mixture of Experts

Python 202 9 Updated Sep 9, 2024