Author: Jin, Rong : Search

short-paper

Free

EmbSum: Leveraging the Summarization Capabilities of Large Language Models for Content-Based Recommendations

RecSys '24: Proceedings of the 18th ACM Conference on Recommender SystemsPages 1010–1015https://doi.org/10.1145/3640457.3688185

Content-based recommendation systems play a crucial role in delivering personalized content to users in the digital world. In this work, we introduce EmbSum, a novel framework that enables offline pre-computations of users and candidate items while ...

research-article

FusionSF: Fuse Heterogeneous Modalities in a Vector Quantized Framework for Robust Solar Power Forecasting

KDD '24: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data MiningPages 5532–5543https://doi.org/10.1145/3637528.3671509

Accurate solar power forecasting is crucial to integrate photovoltaic plants into the electric grid, schedule and secure the power grid safety. This problem becomes more demanding for those newly installed solar plants which lack sufficient operational ...

retraction

Retraction Note: Spatial-temporal deep learning model based rumor source identification in social networks

Journal of Combinatorial Optimization (SPJCO), Volume 47, Issue 3https://doi.org/10.1007/s10878-024-01136-8

research-article

HyRSM++: Hybrid relation guided temporal set matching for few-shot action recognition

Pattern Recognition (PATT), Volume 147, Issue Chttps://doi.org/10.1016/j.patcog.2023.110110

Abstract

Few-shot action recognition is a challenging but practical problem aiming to learn a model that can be easily adapted to identify new action categories with only a few labeled samples. However, existing attempts still suffer from two drawbacks: (...

Highlights

A new temporal coherence regularization on videos is proposed.
Capturing the intra- and inter-relations inside the episodic task.
Reformulating the query-support metric as a set matching problem.

research-article

OneNet: enhancing time series forecasting models under concept drift by online ensembling

NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 3066, Pages 69949–69980

Online updating of time series forecasting models aims to address the concept drifting problem by efficiently updating forecasting models based on streaming data. Many algorithms are designed for online time series forecasting, with some exploiting cross-...

research-article

One fits all: power general time series analysis by pretrained LM

NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 1877, Pages 43322–43355

Although we have witnessed great success of pre-trained models in natural language processing (NLP) and computer vision (CV), limited progress has been made for general time series analysis. Unlike NLP and CV where a unified model can be used to perform ...

research-article

Self-Supervised Learning from Untrimmed Videos via Hierarchical Consistency

IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 45, Issue 10Pages 12408–12426https://doi.org/10.1109/TPAMI.2023.3273415

Natural untrimmed videos provide rich visual content for self-supervised learning. Yet most previous efforts to learn spatio-temporal representations rely on manually trimmed videos, such as Kinetics dataset (Carreira and Zisserman 2017), resulting in ...

research-article

AdaNPC: exploring non-parametric classifier for test-time adaptation

ICML'23: Proceedings of the 40th International Conference on Machine LearningArticle No.: 1748, Pages 41647–41676

Many recent machine learning tasks focus to develop models that can generalize to unseen distributions. Domain generalization (DG) has become one of the key topics in various fields. Several literatures show that DG can be arbitrarily hard without ...

research-article

FeDXL: provable federated learning for deep X-risk optimization

ICML'23: Proceedings of the 40th International Conference on Machine LearningArticle No.: 479, Pages 11934–11966

In this paper, we tackle a novel federated learning (FL) problem for optimizing a family of X-risks, to which no existing FL algorithms are applicable. In particular, the objective has the form of $\mathbb{E}_{\mathbf{z}\sim \mathcal{S}_1} f(\mathbb{E}_{\...

research-article

What Limits the Performance of Local Self-attention?

International Journal of Computer Vision (IJCV), Volume 131, Issue 10Pages 2516–2528https://doi.org/10.1007/s11263-023-01813-x

Abstract

Although self-attention is powerful in modeling long-range dependencies, the performance of local self-attention (LSA) is just similar to depth-wise convolution, which puzzles researchers on whether to use LSA or its counterparts, which one is ...

research-article

Achieving Human Parity on Visual Question Answering

ACM Transactions on Information Systems (TOIS), Volume 41, Issue 3Article No.: 79, Pages 1–40https://doi.org/10.1145/3572833

The Visual Question Answering (VQA) task utilizes both visual image and language analysis to answer a textual question with respect to an image. It has been a popular research topic with an increasing number of real-world applications in the last decade. ...

research-article

RETRACTED ARTICLE: Spatial-temporal deep learning model based rumor source identification in social networks

Journal of Combinatorial Optimization (SPJCO), Volume 45, Issue 3https://doi.org/10.1007/s10878-023-01018-5

Abstract

Rumor source detection has long been an important but difficult problem. Due to the complexity of the underlying propagation model, most existing methods only rely on the limit observation of a single batch of single snapshot during the ...

research-article

ParamCrop: Parametric Cubic Cropping for Video Contrastive Learning

IEEE Transactions on Multimedia (TOM), Volume 25Pages 9002–9014https://doi.org/10.1109/TMM.2023.3244126

The central idea of contrastive learning is to discriminate between different instances and force different views from the same instance to share the same representation. To avoid trivial solutions, augmentation plays an important role in generating ...

research-article

Stability and generalization analysis of gradient methods for shallow neural networks

NIPS '22: Proceedings of the 36th International Conference on Neural Information Processing SystemsArticle No.: 2794, Pages 38557–38570

While significant theoretical progress has been achieved, unveiling the generalization mystery of overparameterized neural networks still remains largely elusive. In this paper, we study the generalization behavior of shallow neural networks (SNNs) by ...

research-article

Improved fine-tuning by better leveraging pre-training data

NIPS '22: Proceedings of the 36th International Conference on Neural Information Processing SystemsArticle No.: 2360, Pages 32568–32581

As a dominant paradigm, fine-tuning a pre-trained model on the target data is widely used in many deep learning applications, especially for small data sets. However, recent studies have empirically shown that training from scratch has the final ...

research-article

Robust graph structure learning via multiple statistical tests

NIPS '22: Proceedings of the 36th International Conference on Neural Information Processing SystemsArticle No.: 2325, Pages 32083–32096

Graph structure learning aims to learn connectivity in a graph from data. It is particularly important for many computer vision related tasks since no explicit graph structure is available for images for most cases. A natural way to construct a graph ...

research-article

Grow and merge: a unified framework for continuous categories discovery

NIPS '22: Proceedings of the 36th International Conference on Neural Information Processing SystemsArticle No.: 1991, Pages 27455–27468

Although a number of studies are devoted to novel category discovery, most of them assume a static setting where both labeled and unlabeled data are given at once for finding new categories. In this work, we focus on the application scenarios where ...

research-article

FiLM: frequency improved legendre memory model for long-term time series forecasting

NIPS '22: Proceedings of the 36th International Conference on Neural Information Processing SystemsArticle No.: 921, Pages 12677–12690

Recent studies have shown that deep learning models such as RNNs and Transformers have brought significant performance gains for long-term forecasting of time series because they effectively utilize historical information. We found, however, that there ...

Article

KVT: k-NN Attention for Boosting Vision Transformers

Computer Vision – ECCV 2022Pages 285–302https://doi.org/10.1007/978-3-031-20053-3_17

Abstract

Convolutional Neural Networks (CNNs) have dominated computer vision for years, due to its ability in capturing locality and translation invariance. Recently, many vision transformer architectures have been proposed and they show promising ...

Article

TransFGU: A Top-Down Approach to Fine-Grained Unsupervised Semantic Segmentation

Computer Vision – ECCV 2022Pages 73–89https://doi.org/10.1007/978-3-031-19818-2_5

Abstract

Unsupervised semantic segmentation aims to obtain high-level semantic representation on low-level visual features without manual annotations. Most existing methods are bottom-up approaches that try to group pixels into regions based on their ...

Applied Filters

People

Names

Institutions

Authors

Editors

Advisors

Reviewers

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Save to Binder

Upcoming Conferences