Evaluating Visual and Cultural Interpretation: The K-Viscuit Benchmark with Human-VLM Collaboration.

AllBooks Images Videos Maps News Shopping

The K-Viscuit Benchmark with Human-VLM Collaboration

Jun 24, 2024 · We propose a semi-automated pipeline for constructing cultural VLM benchmarks to enhance diversity and efficiency. This pipeline leverages human ...

The K-Viscuit Benchmark with Human-VLM Collaboration

www.researchgate.net › publication › 38...

Jun 27, 2024 · This pipeline leverages human-VLM collaboration, where VLMs generate questions based on guidelines, human-annotated examples, and image-wise ...

Evaluating Visual and Cultural Interpretation: The K-Viscuit ...

www.aimodels.fyi › papers › arxiv › eval...

Jun 25, 2024 · The K-ViScuit benchmark is designed to test VLMs' ability to understand and interpret visual scenes in a culturally-aware manner, going beyond ...

‪Yujin Baek‬ - ‪Google Scholar‬

scholar.google.dk › citations

Evaluating Visual and Cultural Interpretation: The K-Viscuit Benchmark with Human-VLM Collaboration. Y Baek, CH Park, J Kim, YJ Heo, DS Chang, J Choo. arXiv ...

Benchmarking Vision Language Models for Cultural Understanding

arxiv-sanity-lite.com › ...

Evaluating Visual and Cultural Interpretation: The K-Viscuit Benchmark with Human-VLM Collaboration ... benchmark, CDEval, aimed at evaluating the cultural ...

Yujin Baek's research works - ResearchGate

www.researchgate.net › Yujin-Baek-225...

Yujin Baek's 3 research works with 2 citations and 18 reads, including: Evaluating Visual and Cultural Interpretation: The K-Viscuit Benchmark with ...

Yu-Jung Heo - dblp

dblp.org › Persons

Evaluating Visual and Cultural Interpretation: The K-Viscuit Benchmark with Human-VLM Collaboration. ... Criteria for Human-Compatible AI in Two-Player ...

Vision-Language Models under Cultural and Inclusive Considerations

www.aimodels.fyi › papers › arxiv › visi...

Jul 8, 2024 · For example, the K-VisCuit benchmark evaluates how accurately the models can understand the cultural significance of images, while the See It ...

The K-Viscuit Benchmark with Human-VLM Collaboration

paperreading.club › page

We propose a semi-automated pipeline for constructing cultural VLM benchmarks to enhance diversity and efficiency. This pipeline leverages human ...

Yu-Jung Heo | Papers With Code

paperswithcode.com › author › yu-jung-...

We propose a semi-automated pipeline for constructing cultural VLM benchmarks to enhance diversity and efficiency. Diversity · Visual Reasoning.