wenyihong
  • Tsinghua University


GLM-4 series: Open Multilingual Multimodal Chat LMs

Python 4,808 398 Updated Oct 6, 2024

Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer

Python 368 18 Updated Jul 5, 2024

ScreenAgent: A Computer Control Agent Driven by Visual Language Large Model (IJCAI-24)

Python 276 27 Updated Aug 22, 2024
Python 3 Updated Nov 9, 2022

A state-of-the-art open visual language model | multimodal pretrained model

Python 5,931 407 Updated May 29, 2024
Python 11 2 Updated Aug 4, 2024

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 133,155 26,584 Updated Oct 10, 2024

The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" [ICLR 2024 Spotlight]

Python 266 19 Updated Apr 29, 2024
Python 779 162 Updated Apr 23, 2024
Python 737 46 Updated Jul 8, 2024

Chinese and English multimodal conversational language model

Python 4,078 416 Updated Aug 23, 2024

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation

Python 1,126 62 Updated Oct 3, 2024

Neighborhood Attention Transformer, arXiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arXiv 2022

Python 1,045 85 Updated May 15, 2024

[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators

Python 4,001 346 Updated May 6, 2023

A collection of resources and papers on Diffusion Models

HTML 10,860 936 Updated Aug 1, 2024

[ICCV 2023] A latent space for stochastic diffusion models

Python 562 35 Updated Dec 31, 2023

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Python 2,250 178 Updated Oct 9, 2024

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Python 7,656 607 Updated Jul 25, 2023

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 7,945 746 Updated Oct 10, 2024

SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.

Python 974 92 Updated Sep 22, 2024

FILM: Frame Interpolation for Large Motion, in ECCV 2022.

Python 2,833 280 Updated Aug 10, 2024

Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners

Python 2,592 342 Updated Jul 25, 2023

Download DeepMind's Kinetics dataset.

Jupyter Notebook 19 10 Updated Feb 28, 2020

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 5,737 1,139 Updated Jul 30, 2024

Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".

Python 1,695 174 Updated Sep 25, 2023

A collection of awesome resources in Human Pose estimation.

2,427 413 Updated Oct 12, 2022

OOP Course Material & QA

149 18 Updated Feb 28, 2020