LinB203

🎯

Focusing

lb203 LinB203

273 followers · 9 following

@PKU-YuanGroup
Shenzhen
07:36 (UTC +08:00)
@LinBin46984

Stars

wenqsun / DimensionX

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

Python 681 35 Updated Nov 13, 2024

aigc-apps / EasyAnimate

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Python 1,412 105 Updated Nov 13, 2024

AILab-CVC / VideoGen-Eval

The Dawn of Video Generation: Preliminary Explorations with SORA-like Models

116 4 Updated Nov 12, 2024

deepseek-ai / Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 963 44 Updated Nov 13, 2024

DAMO-NLP-SG / Inf-CLIP

The official CLIP training codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss". A highly memory-efficient CLIP training scheme.

Python 168 7 Updated Oct 30, 2024

genmoai / models

The best OSS video generation models

Python 1,935 194 Updated Nov 13, 2024

Stability-AI / sd3.5

Python 636 39 Updated Nov 12, 2024

ChaofanTao / Autoregressive-Models-in-Vision-Survey

The paper collections for the autoregressive models in vision.

166 5 Updated Nov 12, 2024

rhymes-ai / Allegro

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 572 40 Updated Oct 31, 2024

facebookresearch / MovieGenBench

Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen

327 18 Updated Oct 19, 2024

SkyworkAI / MoE-plus-plus

MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts

Python 133 3 Updated Oct 16, 2024

SkyworkAI / MoH

MoH: Multi-Head Attention as Mixture-of-Head Attention

Python 150 5 Updated Oct 29, 2024

jy0205 / Pyramid-Flow

Code of Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 2,249 215 Updated Nov 13, 2024

qiujihao19 / Artemis

Python 20 Updated Oct 9, 2024

QwenLM / Qwen2-VL

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 3,057 185 Updated Oct 4, 2024

baaivision / Emu3

Next-Token Prediction is All You Need

Python 1,806 70 Updated Oct 24, 2024

AlonzoLeeeooo / awesome-video-generation

A collection of awesome video generation studies.

TeX 340 13 Updated Nov 12, 2024

xdit-project / xDiT

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 686 54 Updated Nov 11, 2024

Vchitect / VEnhancer

Official code for VEnhancer: Generative Space-Time Enhancement for Video Generation

Python 458 25 Updated Sep 16, 2024

wdndev / mllm_interview_note

Notes mainly covering multimodal knowledge for large language model (LLM) algorithm (application) engineers

HTML 83 1 Updated May 12, 2024

lucidrains / transfusion-pytorch

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 700 26 Updated Nov 11, 2024

showlab / Show-o

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,018 44 Updated Nov 11, 2024

TianxingWu / FreeInit

[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models

Python 490 23 Updated Jan 18, 2024

LTH14 / mar

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 995 55 Updated Sep 27, 2024

Alpha-VLLM / Lumina-mGPT

Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"

Python 497 21 Updated Aug 16, 2024

VITA-MLLM / VITA

✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM

Python 955 58 Updated Oct 24, 2024

facebookresearch / chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,833 112 Updated Jul 29, 2024

PKU-YuanGroup / Cycle3D

Official implementation of Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle

181 7 Updated Aug 10, 2024

mfarre / Video-LLaVA-7B-hf-CinePile

Video-LLaVA fine-tune for CinePile evaluation

Jupyter Notebook 38 4 Updated Aug 8, 2024

feizc / DiT-MoE

Scaling Diffusion Transformers with Mixture of Experts

Python 202 9 Updated Sep 9, 2024
