NEO/VLMEvalKit at main · CloudEngineHub/NEO


A Toolkit for Evaluating Large Vision-Language Models.


🏆 OC Leaderboard • 🏗️Quickstart • 📊Datasets & Models • 🛠️Development

🤗 HF Leaderboard • 🤗 Evaluation Records • 🤗 HF Video Leaderboard

🔊 Discord • 📝 Report • 🎯Goal • 🖊️Citation

VLMEvalKit (Python package name: vlmeval) is an open-source evaluation toolkit for large vision-language models (LVLMs). It supports one-command evaluation of LVLMs on various benchmarks, removing the need to prepare data across multiple repositories. VLMEvalKit uses generation-based evaluation for all LVLMs and reports results obtained with both exact matching and LLM-based answer extraction.
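To illustrate what exact matching on generated text can look like for a multiple-choice benchmark, here is a minimal sketch. It is not VLMEvalKit's own implementation: the function names `extract_choice` and `exact_match_accuracy`, the A-D option set, and the regular expression are all assumptions made for this example.

```python
import re
from typing import Optional, Sequence


def extract_choice(prediction: str, choices: str = "ABCD") -> Optional[str]:
    """Return the option letter in `prediction`, or None if none or several appear.

    Hypothetical helper: matches standalone uppercase letters such as
    "B", "(B)", or "B." and rejects outputs that name more than one option.
    """
    found = set(re.findall(rf"\b([{choices}])\b", prediction))
    return found.pop() if len(found) == 1 else None


def exact_match_accuracy(predictions: Sequence[str], answers: Sequence[str]) -> float:
    """Fraction of predictions whose extracted option equals the reference answer."""
    if len(predictions) != len(answers):
        raise ValueError("predictions and answers must have the same length")
    if not answers:
        return 0.0
    hits = sum(extract_choice(p) == a for p, a in zip(predictions, answers))
    return hits / len(answers)


# Made-up model outputs, for illustration only.
predictions = ["The answer is (B).", "C", "Either A or D"]
answers = ["B", "C", "A"]
accuracy = exact_match_accuracy(predictions, answers)
```

The third output names two options, so the rule-based extractor returns `None` and the sample counts as wrong. Free-form outputs of this kind are the cases where an LLM-based answer extractor can recover an answer that pattern matching misses.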

🏗️ QuickStart

See [QuickStart | Quick Start (Chinese)] for a quick start guide.

📊 Demonstration

# Demo
from vlmeval.config import supported_VLM
model = supported_VLM['NEO1_0-2B-SFT']()
# Forward Single Image
ret = model.generate(['assets/apple.jpg', 'What is in this image?'])
print(ret)  # The image features a red apple with a leaf on it.
# Forward Multiple Images
ret = model.generate(['assets/apple.jpg', 'assets/apple.jpg', 'How many apples are there in the provided images? '])
print(ret)  # There are two apples in the provided images.
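In the demo above, `model.generate` receives a flat list in which the image paths come first and the text prompt comes last. The sketch below wraps that convention in a small helper so the same call shape can be reused for any number of images. The name `build_message` is hypothetical and not part of vlmeval; it only assembles the list and does not load a model.

```python
from typing import List, Sequence


def build_message(image_paths: Sequence[str], question: str) -> List[str]:
    """Assemble the flat [image, ..., prompt] list used in the demo above.

    Hypothetical helper, not a vlmeval API: image paths are kept in order
    and the whitespace-trimmed question is appended as the final element.
    """
    question = question.strip()
    if not question:
        raise ValueError("question must not be empty")
    return [*image_paths, question]


# Same inputs as the multi-image demo call.
message = build_message(
    ["assets/apple.jpg", "assets/apple.jpg"],
    "How many apples are there in the provided images? ",
)
```

The resulting `message` can then be passed to `model.generate(message)` in the same way as the hand-written lists in the demo.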
