- research-article, July 2024
Short Video Ordering via Position Decoding and Successor Prediction
SIGIR '24: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, Pages 2167–2176, https://doi.org/10.1145/3626772.3657795
Short video collection is an easy way for users to consume coherent content on various online short video platforms, such as TikTok, YouTube, Douyin, and WeChat Channel. These collections cover a wide range of content, including online courses, TV series,...
- research-article, January 2024
A Systematic Review of Human Activity Recognition Based on Mobile Devices: Overview, Progress and Trends
IEEE Communications Surveys & Tutorials (IEEE_ICST), Volume 26, Issue 2, Pages 890–929, https://doi.org/10.1109/COMST.2024.3357591
Due to the ever-growing powers in sensing, computing, communicating and storing, mobile devices (e.g., smartphone, smartwatch, smart glasses) become ubiquitous and an indispensable part of people’s daily life. Until now, mobile devices have been ...
- research-article, October 2023
Learning Event-Specific Localization Preferences for Audio-Visual Event Localization
MM '23: Proceedings of the 31st ACM International Conference on Multimedia, Pages 3446–3454, https://doi.org/10.1145/3581783.3612506
Audio-Visual Event Localization (AVEL) aims to locate events that are both visible and audible in a video. Existing AVEL methods primarily focus on learning generic localization patterns that are applicable to all events. However, events often exhibit ...
- research-article, October 2023
Towards Real-Time Sign Language Recognition and Translation on Edge Devices
MM '23: Proceedings of the 31st ACM International Conference on Multimedia, Pages 4502–4512, https://doi.org/10.1145/3581783.3611820
To provide instant communication for hearing-impaired people, it is essential to achieve real-time sign language processing anytime anywhere. Therefore, in this paper, we propose a Region-aware Temporal Graph based neural Network (RTG-Net), aiming to ...
- research-article, August 2023
Contrastive learning for sign language recognition and translation
IJCAI '23: Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, Article No.: 85, Pages 763–772, https://doi.org/10.24963/ijcai.2023/85
There are two problems that widely exist in current end-to-end sign language processing architecture. One is the CTC spike phenomenon which weakens the visual representational ability in Continuous Sign Language Recognition (CSLR). The other one is the ...
- research-article, July 2023
Unsupervised Readability Assessment via Learning from Weak Readability Signals
SIGIR '23: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, Pages 1324–1334, https://doi.org/10.1145/3539618.3591695
Unsupervised readability assessment aims to evaluate the reading difficulty of text without any manually-labeled data for model training. This is a challenging task because the absence of labeled data makes it difficult for the model to understand what ...
- research-article, July 2023
Acoustic-Based Lip Reading for Mobile Devices: Dataset, Benchmark and a Self Distillation-Based Approach
IEEE Transactions on Mobile Computing (ITMV), Volume 23, Issue 5, Pages 4548–4565, https://doi.org/10.1109/TMC.2023.3294416
Speech is a natural communication way between people and a good way for human-computer interaction. However, speech with audible voices often faces the following problems, e.g., being affected by surrounding noises, breaking the quiet environment, leaking ...
- research-article, July 2023
Efficient Algorithms for Stochastic Ride-Pooling Assignment with Mixed Fleets
Transportation Science (TRNPS), Volume 57, Issue 4, Pages 908–936, https://doi.org/10.1287/trsc.2021.0349
Ride-pooling, which accommodates multiple passenger requests in a single trip, has the potential to substantially enhance the throughput of mobility-on-demand (MoD) systems. This paper investigates MoD systems that operate mixed fleets composed of “basic ...
- research-article, April 2023
Learning Robust Multi-Modal Representation for Multi-Label Emotion Recognition via Adversarial Masking and Perturbation
WWW '23: Proceedings of the ACM Web Conference 2023, Pages 1510–1518, https://doi.org/10.1145/3543507.3583258
Recognizing emotions from multi-modal data is an emotion recognition task that requires strong multi-modal representation ability. The general approach to this task is to naturally train the representation model on training data without intervention. ...
- research-article, April 2023
A Consistent Dual-MRC Framework for Emotion-cause Pair Extraction
ACM Transactions on Information Systems (TOIS), Volume 41, Issue 4, Article No.: 105, Pages 1–27, https://doi.org/10.1145/3558548
Emotion-cause pair extraction (ECPE) is a recently proposed task that aims to extract the potential clause pairs of emotions and its corresponding causes in a document. In this article, we propose a new paradigm for the ECPE task. We cast the task as a ...
- research-article, March 2023
The Dual Effects of Team Contest Design on On-Demand Service Work Schedules
Emerging on-demand service platforms (OSPs) have recently embraced teamwork as a strategy for stimulating workers’ productivity and mediating temporal supply and demand imbalances. This research investigates the team contest scheme design problem ...
- research-article, February 2023
Controlling class layout for deep ordinal classification via constrained proxies learning
AAAI'23/IAAI'23/EAAI'23: Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence and Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence and Thirteenth Symposium on Educational Advances in Artificial Intelligence, Article No.: 276, Pages 2483–2491, https://doi.org/10.1609/aaai.v37i2.25345
For deep ordinal classification, learning a well-structured feature space specific to ordinal classification is helpful to properly capture the ordinal nature among classes. Intuitively, when Euclidean distance metric is used, an ideal ordinal layout in ...
- research-article, May 2022
Strategic Information Perturbation for an Online In-Vehicle Coordinated Routing Mechanism for Connected Vehicles Under Mixed-Strategy Congestion Game
IEEE Transactions on Intelligent Transportation Systems (ITS-TRANSACTIONS), Volume 23, Issue 5, Pages 4541–4555, https://doi.org/10.1109/TITS.2020.3045907
The increased market penetration of route guidance tools–relaying real-time traffic information to drivers–can have damaging effects on transportation networks, including traffic congestion oscillation resulting from the overreaction ...
- research-article, January 2022
Learning to Classify Open Intent via Soft Labeling and Manifold Mixup
IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), Volume 30, Pages 635–645, https://doi.org/10.1109/TASLP.2022.3145308
Open intent classification is a practical yet challenging task in dialogue systems. Its objective is to accurately classify samples of known intents while at the same time detecting those of open (unknown) intents. Existing methods usually use outlier ...
- research-article, December 2021
Handwriting-Assistant: Reconstructing Continuous Strokes with Millimeter-level Accuracy via Attachable Inertial Sensors
Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), Volume 5, Issue 4, Article No.: 146, Pages 1–25, https://doi.org/10.1145/3494956
Pen-based handwriting has become one of the major human-computer interaction methods. Traditional approaches either require writing on the specific supporting device like the touch screen, or limit the way of using the pen to pure rotation or ...
- research-article, November 2021
Equilibrium Analysis of Urban Traffic Networks with Ride-Sourcing Services
Transportation Science (TRNPS), Volume 55, Issue 6, Pages 1260–1279, https://doi.org/10.1287/trsc.2021.1078
Ride-sourcing services play an increasingly important role in meeting mobility needs in many metropolitan areas. Yet, aside from delivering passengers from their origins to destinations, ride-sourcing vehicles generate a significant number of vacant trips ...
- research-article, November 2021
A Control Theoretic Approach to Simultaneously Estimate Average Value of Time and Determine Dynamic Price for High-Occupancy Toll Lanes
IEEE Transactions on Intelligent Transportation Systems (ITS-TRANSACTIONS), Volume 22, Issue 11, Pages 7293–7305, https://doi.org/10.1109/TITS.2020.3007160
The dynamic pricing problem of a freeway corridor with high-occupancy toll (HOT) lanes was formulated and solved based on a point queue abstraction of the traffic system [1]. However, existing pricing strategies ...
- research-article, October 2021
Skeleton-Aware Neural Sign Language Translation
MM '21: Proceedings of the 29th ACM International Conference on Multimedia, Pages 4353–4361, https://doi.org/10.1145/3474085.3475577
As an essential communication way for deaf-mutes, sign languages are expressed by human actions. To distinguish human actions for sign language understanding, the skeleton which contains position information of human pose can provide an important cue, ...