Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleDecember 2024
Simplified multi-view graph neural network for multilingual knowledge graph completion
Frontiers of Computer Science: Selected Publications from Chinese Universities (FCS), Volume 19, Issue 7https://doi.org/10.1007/s11704-024-3577-3AbstractKnowledge graph completion (KGC) aims to fill in missing entities and relations within knowledge graphs (KGs) to address their incompleteness. Most existing KGC models suffer from knowledge coverage as they are designed to operate within a single ...
- ArticleDecember 2024
Spatio-Temporal Heterogeneous Graph Neural Network With Multi-view Learning For Traffic Prediction
AbstractAmong various traffic data modeling and predicting methods, graph learning-based models attract more attention, because of their powerful representation ability for modeling spatial and temporal dependencies with graph neural networks. Despite ...
- research-articleOctober 2024
Mitigating World Biases: A Multimodal Multi-View Debiasing Framework for Fake News Video Detection
MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 6492–6500https://doi.org/10.1145/3664647.3681673Short videos turn into an important channel for public sharing, as well as they've become a fertile ground for fake news. Fake news video detection is to judge the veracity of news based on its different modal information, such as video, audio, text, ...
- research-articleOctober 2024
3D Human Pose Estimation from Multiple Dynamic Views via Single-view Pretraining with Procrustes Alignment
MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 10363–10372https://doi.org/10.1145/3664647.36809903D Human pose estimation from multiple cameras with unknown calibration has received less attention than it should. The few existing data-driven solutions do not fully exploit 3D training data that are available on the market, and typically train from ...
- research-articleJanuary 2025
Embedding Irregular Urban Regions With Multi-view Fusion Network
ICCPR '24: Proceedings of the 2024 13th International Conference on Computing and Pattern RecognitionPages 279–286https://doi.org/10.1145/3704323.3704349The functions of urban regions are diverse and complex, making the accurate understanding and identifying these functions is crucial for urban planning and management. However, previous methods usually delineate urban regions based on regular network ...
-
- ArticleOctober 2024
ThyGraph: A Graph-Based Approach for Thyroid Nodule Diagnosis from Ultrasound Studies
- Ashwath Radhachandran,
- Alekhya Vittalam,
- Vedrana Ivezic,
- Vivek Sant,
- Shreeram Athreya,
- Chace Moleta,
- Maitraya Patel,
- Rinat Masamed,
- Corey Arnold,
- William Speier
Medical Image Computing and Computer Assisted Intervention – MICCAI 2024Pages 753–763https://doi.org/10.1007/978-3-031-72083-3_70AbstractImproved thyroid nodule risk stratification from ultrasound (US) can mitigate overdiagnosis and unnecessary biopsies. Previous studies often train deep learning models using manually selected single US frames; these approaches deviate from ...
- research-articleJune 2024
Visibility-guided Human Body Reconstruction from Uncalibrated Multi-view Cameras
ICMR '24: Proceedings of the 2024 International Conference on Multimedia RetrievalPages 589–598https://doi.org/10.1145/3652583.3658110We present a novel method for 3D human body reconstruction with multi-view images from calibration-free cameras by multi-view fusion with explicit visibility modelling. Existing multi-view methods usually establish geometric constraints by using accurate ...
- research-articleJune 2024
Multi-view Subspace Clustering via An Adaptive Consensus Graph Filter
ICMR '24: Proceedings of the 2024 International Conference on Multimedia RetrievalPages 776–784https://doi.org/10.1145/3652583.3658009Multiview subspace clustering (MVSC) has attracted an increasing amount of attention in recent years. Most existing MVSC methods first collect complementary information from different views and consequently derive a consensus reconstruction coefficient ...
- short-paperMarch 2024
MV-HEVC: How to optimize compression of immersive 3D content
MHV '24: Proceedings of the 3rd Mile-High Video ConferencePage 87https://doi.org/10.1145/3638036.3640246Multiview High Efficiency Video Coding (MV-HEVC) is an HEVC extension focused on efficiently coding spatially related images, such as a left eye and right eye views of 3D stereoscopic content. MV-HEVC was released in the second version of HEVC back in ...
- research-articleJune 2024
AdaBoost-Based 3D Object Classification from Surface and Depth Map Descriptors
ICMLC '24: Proceedings of the 2024 16th International Conference on Machine Learning and ComputingPages 441–446https://doi.org/10.1145/3651671.3651716In this work, we aim to advance the traditional approach to 3D object classification for its lighter computation and memory costs than the deep learning based approach. Specifically, we propose a novel algorithm that uses multiple handcrafted descriptors ...
- tutorialJanuary 2024
Contrastive learning: Big Data Foundations and Applications
CODS-COMAD '24: Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD)Pages 493–497https://doi.org/10.1145/3632410.3633291Contrastive learning (CL) has exploded in popularity due to its ability to learn effective representations using vast quantities of unlabelled data across multiple domains. CL underlies some of the most impressive applications of generative AI for the ...
- research-articleOctober 2024
Multi-view multi-input CNN-based architecture for diagnosis of Alzheimer's disease in its prodromal stages
International Journal of Biometrics (IJOB), Volume 16, Issue 6Pages 601–613https://doi.org/10.1504/ijbm.2024.141948Alzheimer's disease (AD) is a progressive neurodegenerative brain disorder, the leading cause of dementia, characterised by memory loss and cognitive decline affecting daily life. Early detection is crucial for effective treatment. 18F-FDG-PET is the ...
- research-articleDecember 2023
ActRay: Online Active Ray Sampling for Radiance Fields
SA '23: SIGGRAPH Asia 2023 Conference PapersArticle No.: 97, Pages 1–10https://doi.org/10.1145/3610548.3618254Thanks to the high-quality reconstruction and photorealistic rendering, the Neural Radiance Field (NeRF) has garnered extensive attention and has been continuously improved. Despite its high visual quality, the prohibitive training time limits its ...
- research-articleOctober 2023
Multi-View Predicate Recognition for Solving Semantic Ambiguity Problem in Scene Graph Generation
McGE '23: Proceedings of the 1st International Workshop on Multimedia Content Generation and Evaluation: New Methods and PracticePages 105–113https://doi.org/10.1145/3607541.3616817Recent works on Scene Graph Generation (SGG) have been concentrating on solving the problem of long-tailed distribution. While these methods are making significant improvements on the tail predicate categories, they sacrifice the performance of the head ...
- research-articleOctober 2023
Multi-View Graph Convolutional Network for Multimedia Recommendation
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 6576–6585https://doi.org/10.1145/3581783.3613915Multimedia recommendation has received much attention in recent years. It models user preferences based on both behavior information and item multimodal information. Though current GCN-based methods achieve notable success, they suffer from two ...
- research-articleOctober 2023
OccluBEV: Occlusion Aware Spatiotemporal Modeling for Multi-view 3D Object Detection
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 4074–4083https://doi.org/10.1145/3581783.3613798Bird's-Eye-View (BEV) based 3D visual perception, which formulates a unified space for multi-view representation, has received wide attention in autonomous driving due to its scalability for downstream tasks. However, view transform in transformer-based ...
- research-articleOctober 2023
Multi-View Representation Learning via View-Aware Modulation
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 3876–3886https://doi.org/10.1145/3581783.3612494Multi-view (representation) learning derives an entity's representation from its multiple observable views to facilitate various downstream tasks. The most challenging topic is how to model unobserved entities and their relationships to specific views. ...
- research-articleOctober 2023
Robust grasping across diverse sensor qualities: The GraspNet-1Billion dataset
International Journal of Robotics Research (RBRS), Volume 42, Issue 12Pages 1094–1103https://doi.org/10.1177/02783649231193710Robust object grasping in cluttered scenes is vital to all robotic prehensile manipulation. In this paper, we present the GraspNet-1Billion benchmark that contains rich real-world captured cluttered scenarios and abundant annotations. This benchmark aims ...
- research-articleOctober 2023
MiTFM: A multi-view information fusion method based on transformer for Next Activity Prediction of Business Processes
Internetware '23: Proceedings of the 14th Asia-Pacific Symposium on InternetwarePages 281–291https://doi.org/10.1145/3609437.3609442Recent research introduces deep learning algorithms such as recurrent neural networks (RNNs) to predict the next activity, one of the most challenging tasks in predictive business process monitoring. However, the RNN-based models use only the last ...
- research-articleJuly 2023
Nerfstudio: A Modular Framework for Neural Radiance Field Development
- Matthew Tancik,
- Ethan Weber,
- Evonne Ng,
- Ruilong Li,
- Brent Yi,
- Terrance Wang,
- Alexander Kristoffersen,
- Jake Austin,
- Kamyar Salahi,
- Abhik Ahuja,
- David Mcallister,
- Justin Kerr,
- Angjoo Kanazawa
SIGGRAPH '23: ACM SIGGRAPH 2023 Conference ProceedingsArticle No.: 72, Pages 1–12https://doi.org/10.1145/3588432.3591516Neural Radiance Fields (NeRF) are a rapidly growing area of research with wide-ranging applications in computer vision, graphics, robotics, and more. In order to streamline the development and deployment of NeRF research, we propose a modular PyTorch ...