Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- ArticleDecember 2024
M-RAT: a Multi-grained Retrieval Augmentation Transformer for Image Captioning
AbstractCurrent encoder-decoder methods for image captioning mai-nly consist of an object detection module (two-stage), or rely on big models with large-scale datasets to improve the effectiveness, which leads to increasing computation costs and cannot ...
- research-articleDecember 2024
Feature-weighted Multi-stage Bayesian Prototype for Few-shot Classification
MMAsia '24: Proceedings of the 6th ACM International Conference on Multimedia in AsiaArticle No.: 82, Pages 1–7https://doi.org/10.1145/3696409.3700244Few-shot classification aims to recognize the query sample through a limited amount of support data, where a prototype classifier is commonly applied. However, although the prototype classifier is simple and non-parametric, it does not fully utilize the ...
- research-articleDecember 2024
FreqFormer: A Frequency Transformer for Semantic Segmentation of Remote Sensing Images
MMAsia '24: Proceedings of the 6th ACM International Conference on Multimedia in AsiaArticle No.: 16, Pages 1–8https://doi.org/10.1145/3696409.3700176Semantic segmentation of remote sensing images (RSIs) is vital for geospatial intelligence. However, traditional methods face challenges with mixed pixels and complex land cover types. Convolutional neural networks and transformers have led the field of ...
- posterDecember 2024
A Encoder-Decoder Framework for Foundation Model-based Remote Sensing Semantic Segmentation
MMAsia '24 Workshops: Proceedings of the 6th ACM International Conference on Multimedia in Asia WorkshopsArticle No.: 16, Pages 1–7https://doi.org/10.1145/3700410.3702132Remote sensing semantic segmentation technology has been widely applied in various fields such as urban planning, land source management and agriculture. Due to the large number of ground objects with different scales and boundaries in remote sensing ...
-
- ArticleDecember 2024
[inline-graphic not available: see fulltext] : A Voting-Based Paradigm for Enhancing Retrieval Augmented Generation
AbstractRetrieval Augmented Generation (RAG) has become a common practice to alleviate the hallucination of Large Language Models (LLMs). The retrieval phase of RAG, however, usually solely depends on the original query, which, to some extent, suffers ...
- research-articleNovember 2024JUST ACCEPTED
A Simple Data Augmentation for Graph Classification: A Perspective of Equivariance and Invariance
ACM Transactions on Knowledge Discovery from Data (TKDD), Just Accepted https://doi.org/10.1145/3706062In graph classification, the out-of-distribution (OOD) issue is attracting great attention. To address this issue, a prevailing idea is to learn stable features, on the assumption that they are substructures causally determining the label and that their ...
- research-articleNovember 2024JUST ACCEPTED
- research-articleNovember 2024
Saliency and edge features-guided end-to-end network for salient object detection
Expert Systems with Applications: An International Journal (EXWA), Volume 257, Issue Chttps://doi.org/10.1016/j.eswa.2024.125016AbstractThe rapid development of the Vision Transformer backbones has enabled the capture of feature information with global dependencies, leading to excellent performance in salient object detection tasks. However, it fails to adequately emphasize fine ...
Highlights- Vision Transformer lacks in its ability to focus on local information.
- The Hungarian algorithm is effectively employed for feature fusion.
- Utilizing random edge neighborhood sampling for localization of local details.
- The ...
- research-articleNovember 2024
A cross-modal feature aggregation and enhancement network for hyperspectral and LiDAR joint classification
Expert Systems with Applications: An International Journal (EXWA), Volume 258, Issue Chttps://doi.org/10.1016/j.eswa.2024.125145AbstractAdvancements in Earth observation technologies have greatly enhanced the potential of integrating hyperspectral (HS) images with Light Detection and Ranging (LiDAR) data for land use and land cover classification. Despite this, most existing ...
Highlights- A newly Cross-modal feature aggregation is introduced for HS and LiDAR.
- Two feature aggregation strategies refine spatial location details.
- INNs feature enhancement is to preserve local contextual integrity.
- research-articleNovember 2024
Numerical quadrature for Gregory triangles
Journal of Computational and Applied Mathematics (JCAM), Volume 453, Issue Chttps://doi.org/10.1016/j.cam.2024.116149AbstractThis paper presents quadrature rules for the space of functions underlying triangular Gregory patches, also called Gregory triangles. We provide numerical and where available symbolic quadrature rules not only for the space spanned by the fifteen ...
- research-articleJanuary 2025
Immersive human-machine teleoperation framework for precision agriculture: Integrating UAV-based digital mapping and virtual reality control
Computers and Electronics in Agriculture (COEA), Volume 226, Issue Chttps://doi.org/10.1016/j.compag.2024.109444Highlights- TCP/IP-based closed-loop system enables immersive VR control of machinery.
- Remote operation framework enhances agricultural safety and productivity.
- The VR system provides 3D spatial information suitable for remote control.
- ...
In agricultural settings, the unstructured nature of certain production environments, along with the high complexity and inherent risks of production tasks, poses significant challenges to achieving full automation and effective on-site machine ...
- research-articleJanuary 2025
Detection of rice panicle density for unmanned harvesters via RP-YOLO
Computers and Electronics in Agriculture (COEA), Volume 226, Issue Chttps://doi.org/10.1016/j.compag.2024.109371Highlights- The method of detecting rice panicle density for unmanned harvesters is developed.
- An improved rice panicle detection model (RP-YOLO) based on YOLOv5n is proposed.
- World coordinates are converted to image coordinates and rice ...
Rice panicle density is one of the essential bases for the automatic speed regulation of unmanned harvesters, making density detection crucial for intelligent upgrades. Currently, existing methods for detecting rice panicle density do not meet ...
- research-articleJanuary 2025
Efficient object recognition under cluttered scenes via descriptor-based matching and single point voting
Computer Aided Geometric Design (CAGD), Volume 114, Issue Chttps://doi.org/10.1016/j.cagd.2024.102394AbstractThis paper addresses the problem of recognizing multiple objects and multiple instances from point clouds. Whereas existing methods utilize descriptors on 3D fields or pointwise voting to achieve this task, our framework takes advantage of both ...
Highlights- Addresses the problem of recognizing multiple objects and multiple instances from point clouds.
- Design a new robust descriptor called Direction-Enhanced Fast Point Feature Histogram (OE-FPFH).
- Propose a novel single point voting ...
- research-articleJanuary 2025
GeoHi-GNN: Geometry-aware hierarchical graph representation learning for normal estimation
Computer Aided Geometric Design (CAGD), Volume 114, Issue Chttps://doi.org/10.1016/j.cagd.2024.102390AbstractNormal estimation has been one of the key tasks in point cloud analysis, while it is challenging when facing with severe noises or complex regions. The challenges mainly come from the selection of supporting points for estimation, that is, ...
Graphical abstract Highlights- Proposing a hierarchically geometric-aware fitting scheme for normal estimation on point cloud.
- Proposing HG module for aggregating and updating features when shrinking graph scale.
- Introducing patch-perceptual loss to infer the ...
- research-articleJanuary 2025
Separating the predictable part of returns with CNN-GRU-attention from inputs to predict stock returns
AbstractThe noise and high randomness of the stock market are primary obstacles to profitability. These factors cause stock returns to consist of short-term predictable and stock-specific residual parts. Therefore, it is beneficial to separate and model ...
Highlights- We propose to separate stock returns into predictable and residual parts.
- We present a novel stock movement separation and prediction method.
- Our method significantly outperforms the best baselines.
- Our method are verified ...
- research-articleOctober 2024
An Active Masked Attention Framework for Many-to-Many Cross-Domain Recommendations
MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 9680–9689https://doi.org/10.1145/3664647.3681435Cross-Domain Recommendation (CDR) has been proposed to improve the recommendation accuracy in the target domain (the sparser dataset) by benefiting from the auxiliary information transferred or the knowledge learned from one or many source domains (the ...
- research-articleOctober 2024
Multiscale Representation Enhanced Temporal Flow Fusion Model for Long-Term Workload Forecasting
- Shiyu Wang,
- Zhixuan Chu,
- Yinbo Sun,
- Yu Liu,
- Yuliang Guo,
- Yang Chen,
- Huiyang Jian,
- Lintao Ma,
- Xingyu Lu,
- Jun Zhou
CIKM '24: Proceedings of the 33rd ACM International Conference on Information and Knowledge ManagementPages 4948–4956https://doi.org/10.1145/3627673.3680072Accurate workload forecasting is critical for efficient resource management in cloud computing systems, enabling effective scheduling and autoscaling. Despite recent advances with transformer-based forecasting models, challenges remain due to the non-...
- short-paperOctober 2024
Multi-view Temporal Knowledge Graph Reasoning
CIKM '24: Proceedings of the 33rd ACM International Conference on Information and Knowledge ManagementPages 4263–4267https://doi.org/10.1145/3627673.3679970Temporal Knowledge Graph (TKG) reasoning is a crucial task that aims to predict future facts based on historical information. In the process of reasoning over TKGs, we identify two types of facts that need to be predicted: 1) recurring facts and 2) ...