Highlights
Lists (1)
Sort Name ascending (A-Z)
Stars
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
AnkiDroid: Anki flashcards on Android. Your secret trick to achieve superhuman information retention.
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
MambaOut: Do We Really Need Mamba for Vision?
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"
The official implementation for "Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising".
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
[ICCV2023] DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models
COYO-700M: Large-scale Image-Text Pair Dataset
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
Official Code for DragGAN (SIGGRAPH 2023)
Example models using DeepSpeed
Ongoing research training transformer models at scale
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
Generate 3D objects conditioned on text or images
Panda项目是于2023年5月启动的开源海外中文大语言模型项目,致力于大模型时代探索整个技术栈,旨在推动中文自然语言处理领域的创新和合作。
Query your Apple Health data with natural language 💬 🩺
[CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".