Keyword: multi-view : Search

research-article

Simplified multi-view graph neural network for multilingual knowledge graph completion

Frontiers of Computer Science: Selected Publications from Chinese Universities (FCS), Volume 19, Issue 7, https://doi.org/10.1007/s11704-024-3577-3

Abstract

Knowledge graph completion (KGC) aims to fill in missing entities and relations within knowledge graphs (KGs) to address their incompleteness. Most existing KGC models suffer from limited knowledge coverage as they are designed to operate within a single ...

Article

Spatio-Temporal Heterogeneous Graph Neural Network With Multi-view Learning For Traffic Prediction

Pattern Recognition, Pages 35–52, https://doi.org/10.1007/978-3-031-78183-4_3

Abstract

Among various traffic data modeling and predicting methods, graph learning-based models attract more attention, because of their powerful representation ability for modeling spatial and temporal dependencies with graph neural networks. Despite ...

research-article

Mitigating World Biases: A Multimodal Multi-View Debiasing Framework for Fake News Video Detection

MM '24: Proceedings of the 32nd ACM International Conference on Multimedia, Pages 6492–6500, https://doi.org/10.1145/3664647.3681673

Short videos have become an important channel for public sharing, as well as a fertile ground for fake news. Fake news video detection aims to judge the veracity of news based on its different modal information, such as video, audio, text, ...

research-article

3D Human Pose Estimation from Multiple Dynamic Views via Single-view Pretraining with Procrustes Alignment

MM '24: Proceedings of the 32nd ACM International Conference on Multimedia, Pages 10363–10372, https://doi.org/10.1145/3664647.3680990

3D human pose estimation from multiple cameras with unknown calibration has received less attention than it should. The few existing data-driven solutions do not fully exploit 3D training data that are available on the market, and typically train from ...

research-article

Embedding Irregular Urban Regions With Multi-view Fusion Network

ICCPR '24: Proceedings of the 2024 13th International Conference on Computing and Pattern Recognition, Pages 279–286, https://doi.org/10.1145/3704323.3704349

The functions of urban regions are diverse and complex, so accurately understanding and identifying these functions is crucial for urban planning and management. However, previous methods usually delineate urban regions based on regular network ...

Article

ThyGraph: A Graph-Based Approach for Thyroid Nodule Diagnosis from Ultrasound Studies

Medical Image Computing and Computer Assisted Intervention – MICCAI 2024, Pages 753–763, https://doi.org/10.1007/978-3-031-72083-3_70

Abstract

Improved thyroid nodule risk stratification from ultrasound (US) can mitigate overdiagnosis and unnecessary biopsies. Previous studies often train deep learning models using manually selected single US frames; these approaches deviate from ...

research-article

Visibility-guided Human Body Reconstruction from Uncalibrated Multi-view Cameras

ICMR '24: Proceedings of the 2024 International Conference on Multimedia Retrieval, Pages 589–598, https://doi.org/10.1145/3652583.3658110

We present a novel method for 3D human body reconstruction with multi-view images from calibration-free cameras by multi-view fusion with explicit visibility modelling. Existing multi-view methods usually establish geometric constraints by using accurate ...

research-article

Multi-view Subspace Clustering via An Adaptive Consensus Graph Filter

ICMR '24: Proceedings of the 2024 International Conference on Multimedia Retrieval, Pages 776–784, https://doi.org/10.1145/3652583.3658009

Multiview subspace clustering (MVSC) has attracted an increasing amount of attention in recent years. Most existing MVSC methods first collect complementary information from different views and consequently derive a consensus reconstruction coefficient ...

short-paper

MV-HEVC: How to optimize compression of immersive 3D content

MHV '24: Proceedings of the 3rd Mile-High Video Conference, Page 87, https://doi.org/10.1145/3638036.3640246

Multiview High Efficiency Video Coding (MV-HEVC) is an HEVC extension focused on efficiently coding spatially related images, such as the left-eye and right-eye views of 3D stereoscopic content. MV-HEVC was released in the second version of HEVC back in ...

research-article

AdaBoost-Based 3D Object Classification from Surface and Depth Map Descriptors

ICMLC '24: Proceedings of the 2024 16th International Conference on Machine Learning and Computing, Pages 441–446, https://doi.org/10.1145/3651671.3651716

In this work, we aim to advance the traditional approach to 3D object classification, given its lower computation and memory costs compared with the deep learning based approach. Specifically, we propose a novel algorithm that uses multiple handcrafted descriptors ...

tutorial

Open Access

Contrastive learning: Big Data Foundations and Applications

CODS-COMAD '24: Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD), Pages 493–497, https://doi.org/10.1145/3632410.3633291

Contrastive learning (CL) has exploded in popularity due to its ability to learn effective representations using vast quantities of unlabelled data across multiple domains. CL underlies some of the most impressive applications of generative AI for the ...

research-article

Multi-view multi-input CNN-based architecture for diagnosis of Alzheimer's disease in its prodromal stages

International Journal of Biometrics (IJOB), Volume 16, Issue 6, Pages 601–613, https://doi.org/10.1504/ijbm.2024.141948

Alzheimer's disease (AD) is a progressive neurodegenerative brain disorder, the leading cause of dementia, characterised by memory loss and cognitive decline affecting daily life. Early detection is crucial for effective treatment. 18F-FDG-PET is the ...

research-article

ActRay: Online Active Ray Sampling for Radiance Fields

SA '23: SIGGRAPH Asia 2023 Conference Papers, Article No.: 97, Pages 1–10, https://doi.org/10.1145/3610548.3618254

Thanks to the high-quality reconstruction and photorealistic rendering, the Neural Radiance Field (NeRF) has garnered extensive attention and has been continuously improved. Despite its high visual quality, the prohibitive training time limits its ...

research-article

Open Access

Multi-View Predicate Recognition for Solving Semantic Ambiguity Problem in Scene Graph Generation

McGE '23: Proceedings of the 1st International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice, Pages 105–113, https://doi.org/10.1145/3607541.3616817

Recent works on Scene Graph Generation (SGG) have been concentrating on solving the problem of long-tailed distribution. While these methods are making significant improvements on the tail predicate categories, they sacrifice the performance of the head ...

research-article

Multi-View Graph Convolutional Network for Multimedia Recommendation

MM '23: Proceedings of the 31st ACM International Conference on Multimedia, Pages 6576–6585, https://doi.org/10.1145/3581783.3613915

Multimedia recommendation has received much attention in recent years. It models user preferences based on both behavior information and item multimodal information. Though current GCN-based methods achieve notable success, they suffer from two ...

research-article

Open Access

OccluBEV: Occlusion Aware Spatiotemporal Modeling for Multi-view 3D Object Detection

MM '23: Proceedings of the 31st ACM International Conference on Multimedia, Pages 4074–4083, https://doi.org/10.1145/3581783.3613798

Bird's-Eye-View (BEV) based 3D visual perception, which formulates a unified space for multi-view representation, has received wide attention in autonomous driving due to its scalability for downstream tasks. However, view transform in transformer-based ...

research-article

Multi-View Representation Learning via View-Aware Modulation

MM '23: Proceedings of the 31st ACM International Conference on Multimedia, Pages 3876–3886, https://doi.org/10.1145/3581783.3612494

Multi-view (representation) learning derives an entity's representation from its multiple observable views to facilitate various downstream tasks. The most challenging topic is how to model unobserved entities and their relationships to specific views. ...

research-article

Robust grasping across diverse sensor qualities: The GraspNet-1Billion dataset

International Journal of Robotics Research (RBRS), Volume 42, Issue 12, Pages 1094–1103, https://doi.org/10.1177/02783649231193710

Robust object grasping in cluttered scenes is vital to all robotic prehensile manipulation. In this paper, we present the GraspNet-1Billion benchmark that contains rich real-world captured cluttered scenarios and abundant annotations. This benchmark aims ...

research-article

MiTFM: A multi-view information fusion method based on transformer for Next Activity Prediction of Business Processes

Internetware '23: Proceedings of the 14th Asia-Pacific Symposium on Internetware, Pages 281–291, https://doi.org/10.1145/3609437.3609442

Recent research introduces deep learning algorithms such as recurrent neural networks (RNNs) to predict the next activity, one of the most challenging tasks in predictive business process monitoring. However, the RNN-based models use only the last ...

research-article

Open Access

Nerfstudio: A Modular Framework for Neural Radiance Field Development

SIGGRAPH '23: ACM SIGGRAPH 2023 Conference Proceedings, Article No.: 72, Pages 1–12, https://doi.org/10.1145/3588432.3591516

Neural Radiance Fields (NeRF) are a rapidly growing area of research with wide-ranging applications in computer vision, graphics, robotics, and more. In order to streamline the development and deployment of NeRF research, we propose a modular PyTorch ...
