Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 55 results for author: Jensen, C S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.09592  [pdf, other

    cs.LG cs.AI cs.CE

    A Survey of Generative Techniques for Spatial-Temporal Data Mining

    Authors: Qianru Zhang, Haixin Wang, Cheng Long, Liangcai Su, Xingwei He, Jianlong Chang, Tailin Wu, Hongzhi Yin, Siu-Ming Yiu, Qi Tian, Christian S. Jensen

    Abstract: This paper focuses on the integration of generative techniques into spatial-temporal data mining, considering the significant growth and diverse nature of spatial-temporal data. With the advancements in RNNs, CNNs, and other non-generative techniques, researchers have explored their application in capturing temporal and spatial dependencies within spatial-temporal data. However, the emergence of g… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 19 pages

  2. arXiv:2404.14999  [pdf, other

    cs.DB cs.LG

    A Unified Replay-based Continuous Learning Framework for Spatio-Temporal Prediction on Streaming Data

    Authors: Hao Miao, Yan Zhao, Chenjuan Guo, Bin Yang, Kai Zheng, Feiteng Huang, Jiandong Xie, Christian S. Jensen

    Abstract: The widespread deployment of wireless and mobile devices results in a proliferation of spatio-temporal data that is used in applications, e.g., traffic prediction, human mobility mining, and air quality prediction, where spatio-temporal prediction is often essential to enable safety, predictability, or reliability. Many recent proposals that target deep learning for spatio-temporal prediction suff… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: Accepted by ICDE 2024

  3. arXiv:2404.13990  [pdf, other

    cs.LG cs.DB

    QCore: Data-Efficient, On-Device Continual Calibration for Quantized Models -- Extended Version

    Authors: David Campos, Bin Yang, Tung Kieu, Miao Zhang, Chenjuan Guo, Christian S. Jensen

    Abstract: We are witnessing an increasing availability of streaming data that may contain valuable information on the underlying processes. It is thus attractive to be able to deploy machine learning models on edge devices near sensors such that decisions can be made instantaneously, rather than first having to transmit incoming data to servers. To enable deployment on edge devices with limited storage and… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 15 pages. An extended version of "QCore: Data-Efficient, On-Device Continual Calibration for Quantized Models" accepted at PVLDB 2024

  4. arXiv:2403.20150  [pdf, other

    cs.LG cs.AI cs.CY

    TFB: Towards Comprehensive and Fair Benchmarking of Time Series Forecasting Methods

    Authors: Xiangfei Qiu, Jilin Hu, Lekui Zhou, Xingjian Wu, Junyang Du, Buang Zhang, Chenjuan Guo, Aoying Zhou, Christian S. Jensen, Zhenli Sheng, Bin Yang

    Abstract: Time series are generated in diverse domains such as economic, traffic, health, and energy, where forecasting of future values has numerous important applications. Not surprisingly, many forecasting methods are being proposed. To ensure progress, it is essential to be able to study and compare such methods empirically in a comprehensive and reliable manner. To achieve this, we propose TFB, an auto… ▽ More

    Submitted 18 June, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

    Comments: Directly accepted by PVLDB 2024

  5. arXiv:2403.16656  [pdf, other

    cs.LG cs.IR

    Graph Augmentation for Recommendation

    Authors: Qianru Zhang, Lianghao Xia, Xuheng Cai, Siuming Yiu, Chao Huang, Christian S. Jensen

    Abstract: Graph augmentation with contrastive learning has gained significant attention in the field of recommendation systems due to its ability to learn expressive user representations, even when labeled data is limited. However, directly applying existing GCL models to real-world recommendation environments poses challenges. There are two primary issues to address. Firstly, the lack of consideration for… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 13 pages and accepted by ICDE 2024

    Journal ref: ICDE 2024

  6. arXiv:2402.14041  [pdf

    cs.LG cs.AI cs.DB

    E2USD: Efficient-yet-effective Unsupervised State Detection for Multivariate Time Series

    Authors: Zhichen Lai, Huan Li, Dalin Zhang, Yan Zhao, Weizhu Qian, Christian S. Jensen

    Abstract: Cyber-physical system sensors emit multivariate time series (MTS) that monitor physical system processes. Such time series generally capture unknown numbers of states, each with a different duration, that correspond to specific conditions, e.g., "walking" or "running" in human-activity monitoring. Unsupervised identification of such states facilitates storage and processing in subsequent data anal… ▽ More

    Submitted 27 May, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted by The Web Conference 2024 (WWW 2024)

  7. arXiv:2402.09434  [pdf, other

    eess.SP cs.LG

    Disentangling Imperfect: A Wavelet-Infused Multilevel Heterogeneous Network for Human Activity Recognition in Flawed Wearable Sensor Data

    Authors: Mengna Liu, Dong Xiang, Xu Cheng, Xiufeng Liu, Dalin Zhang, Shengyong Chen, Christian S. Jensen

    Abstract: The popularity and diffusion of wearable devices provides new opportunities for sensor-based human activity recognition that leverages deep learning-based algorithms. Although impressive advances have been made, two major challenges remain. First, sensor data is often incomplete or noisy due to sensor placement and other issues as well as data transmission failure, calling for imputation of missin… ▽ More

    Submitted 26 January, 2024; originally announced February 2024.

    Comments: 14 pages, 7 figures

  8. arXiv:2402.07232  [pdf, other

    cs.LG

    UVTM: Universal Vehicle Trajectory Modeling with ST Feature Domain Generation

    Authors: Yan Lin, Jilin Hu, Shengnan Guo, Bin Yang, Christian S. Jensen, Youfang Lin, Huaiyu Wan

    Abstract: Vehicle movement is frequently captured in the form of trajectories, i.e., sequences of timestamped locations. Numerous methods exist that target different tasks involving trajectories such as travel-time estimation, trajectory recovery, and trajectory prediction. However, most methods target only one specific task and cannot be applied universally. Existing efforts to create a universal trajector… ▽ More

    Submitted 23 April, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

  9. arXiv:2312.16403  [pdf, other

    cs.LG cs.AI

    Learning Time-aware Graph Structures for Spatially Correlated Time Series Forecasting

    Authors: Minbo Ma, Jilin Hu, Christian S. Jensen, Fei Teng, Peng Han, Zhiqiang Xu, Tianrui Li

    Abstract: Spatio-temporal forecasting of future values of spatially correlated time series is important across many cyber-physical systems (CPS). Recent studies offer evidence that the use of graph neural networks to capture latent correlations between time series holds a potential for enhanced forecasting. However, most existing methods rely on pre-defined or self-learning graphs, which are either static o… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: published in ICDE 2024

  10. arXiv:2312.16355  [pdf, other

    cs.DB

    Efficient Cost Modeling of Space-filling Curves

    Authors: Guanli Liu, Lars Kulik, Christian S. Jensen, Tianyi Li, Jianzhong Qi

    Abstract: A space-filling curve (SFC) maps points in a multi-dimensional space to one-dimensional points by discretizing the multi-dimensional space into cells and imposing a linear order on the cells. This way, an SFC enables the indexing of multi-dimensional data using a one-dimensional index such as a B+-tree. Choosing an appropriate SFC is crucial, as different SFCs have different effects on query perfo… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

  11. arXiv:2311.11204  [pdf, other

    cs.DB

    Collectively Simplifying Trajectories in a Database: A Query Accuracy Driven Approach

    Authors: Zheng Wang, Cheng Long, Gao Cong, Christian S. Jensen

    Abstract: Increasing and massive volumes of trajectory data are being accumulated that may serve a variety of applications, such as mining popular routes or identifying ridesharing candidates. As storing and querying massive trajectory data is costly, trajectory simplification techniques have been introduced that intuitively aim to reduce the sizes of trajectories, thus reducing storage and speeding up quer… ▽ More

    Submitted 13 December, 2023; v1 submitted 18 November, 2023; originally announced November 2023.

    Comments: This paper has been accepted by ICDE 2024

  12. arXiv:2311.07344  [pdf, other

    cs.DB cs.LG

    Missing Value Imputation for Multi-attribute Sensor Data Streams via Message Propagation (Extended Version)

    Authors: Xiao Li, Huan Li, Hua Lu, Christian S. Jensen, Varun Pandey, Volker Markl

    Abstract: Sensor data streams occur widely in various real-time applications in the context of the Internet of Things (IoT). However, sensor data streams feature missing values due to factors such as sensor failures, communication errors, or depleted batteries. Missing values can compromise the quality of real-time analytics tasks and downstream applications. Existing imputation methods either make strong a… ▽ More

    Submitted 14 November, 2023; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Accepted at VLDB 2024

  13. arXiv:2311.00960  [pdf, other

    cs.DB

    Trajectory Similarity Measurement: An Efficiency Perspective

    Authors: Yanchuan Chang, Egemen Tanin, Gao Cong, Christian S. Jensen, Jianzhong Qi

    Abstract: Trajectories that capture object movement have numerous applications, in which similarity computation between trajectories often plays a key role. Traditionally, the similarity between two trajectories is quantified by means of heuristic measures, e.g., Hausdorff or ERP, that operate directly on the trajectories. In contrast, recent studies exploit deep learning to map trajectories to d-dimensiona… ▽ More

    Submitted 11 June, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: Accepted by VLDB 2024

  14. arXiv:2310.06119  [pdf, other

    cs.LG cs.AI

    Exploring Progress in Multivariate Time Series Forecasting: Comprehensive Benchmarking and Heterogeneity Analysis

    Authors: Zezhi Shao, Fei Wang, Yongjun Xu, Wei Wei, Chengqing Yu, Zhao Zhang, Di Yao, Guangyin Jin, Xin Cao, Gao Cong, Christian S. Jensen, Xueqi Cheng

    Abstract: Multivariate Time Series (MTS) widely exists in real-word complex systems, such as traffic and energy systems, making their forecasting crucial for understanding and influencing these systems. Recently, deep learning-based approaches have gained much popularity for effectively modeling temporal and spatial dependencies in MTS, specifically in Long-term Time Series Forecasting (LTSF) and Spatial-Te… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

  15. arXiv:2307.10171  [pdf, other

    cs.LG cs.AI cs.DB

    LightPath: Lightweight and Scalable Path Representation Learning

    Authors: Sean Bin Yang, Jilin Hu, Chenjuan Guo, Bin Yang, Christian S. Jensen

    Abstract: Movement paths are used widely in intelligent transportation and smart city applications. To serve such applications, path representation learning aims to provide compact representations of paths that enable efficient and accurate operations when used for different downstream tasks such as path ranking and travel cost estimation. In many cases, it is attractive that the path representation learnin… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: This paper has been accepted by ACM SIGKDD-23

  16. arXiv:2307.03048  [pdf, other

    cs.LG

    Origin-Destination Travel Time Oracle for Map-based Services

    Authors: Yan Lin, Huaiyu Wan, Jilin Hu, Shengnan Guo, Bin Yang, Youfang Lin, Christian S. Jensen

    Abstract: Given an origin (O), a destination (D), and a departure time (T), an Origin-Destination (OD) travel time oracle~(ODT-Oracle) returns an estimate of the time it takes to travel from O to D when departing at T. ODT-Oracles serve important purposes in map-based services. To enable the construction of such oracles, we provide a travel-time estimation (TTE) solution that leverages historical trajectori… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: 15 pages, 12 figures, accepted by SIGMOD International Conference on Management of Data 2024

  17. arXiv:2303.06213  [pdf, other

    cs.LG cs.AI

    CHGNN: A Semi-Supervised Contrastive Hypergraph Learning Network

    Authors: Yumeng Song, Yu Gu, Tianyi Li, Jianzhong Qi, Zhenghao Liu, Christian S. Jensen, Ge Yu

    Abstract: Hypergraphs can model higher-order relationships among data objects that are found in applications such as social networks and bioinformatics. However, recent studies on hypergraph learning that extend graph convolutional networks to hypergraphs cannot learn effectively from features of unlabeled data. To such learning, we propose a contrastive hypergraph neural network, CHGNN, that exploits self-… ▽ More

    Submitted 28 May, 2024; v1 submitted 10 March, 2023; originally announced March 2023.

    Comments: Accepted by TKDE

  18. arXiv:2302.13022  [pdf, other

    cs.DB

    Data Imputation for Sparse Radio Maps in Indoor Positioning (Extended Version)

    Authors: Xiao Li, Huan Li, Harry Kai-Ho Chan, Hua Lu, Christian S. Jensen

    Abstract: Indoor location-based services rely on the availability of sufficiently accurate positioning in indoor spaces. A popular approach to positioning relies on so-called radio maps that contain pairs of a vector of Wi-Fi signal strength indicator values (RSSIs), called a fingerprint, and a location label, called a reference point (RP), in which the fingerprint was observed. The positioning accuracy dep… ▽ More

    Submitted 28 February, 2023; v1 submitted 25 February, 2023; originally announced February 2023.

    Comments: Accepted at ICDE 2023

  19. arXiv:2302.12721  [pdf, other

    cs.LG cs.DB

    LightTS: Lightweight Time Series Classification with Adaptive Ensemble Distillation -- Extended Version

    Authors: David Campos, Miao Zhang, Bin Yang, Tung Kieu, Chenjuan Guo, Christian S. Jensen

    Abstract: Due to the sweeping digitalization of processes, increasingly vast amounts of time series data are being produced. Accurate classification of such time series facilitates decision making in multiple domains. State-of-the-art classification accuracy is often achieved by ensemble learning where results are synthesized from multiple base models. This characteristic implies that ensemble learning need… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: 15 pages. An extended version of "LightTS: Lightweight Time Series Classification with Adaptive Ensemble Distillation" accepted at SIGMOD 2023

    Journal ref: Proceedings of the ACM on Management of Data 1, 2 (2023), 171:1-171:27

  20. arXiv:2302.11974  [pdf, other

    cs.LG cs.AI cs.DB

    LightCTS: A Lightweight Framework for Correlated Time Series Forecasting

    Authors: Zhichen Lai, Dalin Zhang, Huan Li, Christian S. Jensen, Hua Lu, Yan Zhao

    Abstract: Correlated time series (CTS) forecasting plays an essential role in many practical applications, such as traffic management and server load control. Many deep learning models have been proposed to improve the accuracy of CTS forecasting. However, while models have become increasingly complex and computationally intensive, they struggle to improve accuracy. Pursuing a different direction, this stud… ▽ More

    Submitted 27 February, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: accepted by ACM SIGMOD 2023

  21. arXiv:2212.10306  [pdf, other

    cs.LG

    A Pattern Discovery Approach to Multivariate Time Series Forecasting

    Authors: Yunyao Cheng, Chenjuan Guo, Kaixuan Chen, Kai Zhao, Bin Yang, Jiandong Xie, Christian S. Jensen, Feiteng Huang, Kai Zheng

    Abstract: Multivariate time series forecasting constitutes important functionality in cyber-physical systems, whose prediction accuracy can be improved significantly by capturing temporal and multivariate correlations among multiple time series. State-of-the-art deep learning methods fail to construct models for full time series because model complexity grows exponentially with time series length. Rather, t… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

  22. arXiv:2211.16126  [pdf, other

    cs.LG cs.AI cs.DB

    Joint Neural Architecture and Hyperparameter Search for Correlated Time Series Forecasting

    Authors: Xinle Wu, Dalin Zhang, Miao Zhang, Chenjuan Guo, Bin Yang, Christian S. Jensen

    Abstract: Sensors in cyber-physical systems often capture interconnected processes and thus emit correlated time series (CTS), the forecasting of which enables important applications. The key to successful CTS forecasting is to uncover the temporal dynamics of time series and the spatial correlations among time series. Deep learning-based solutions exhibit impressive performance at discerning these aspects.… ▽ More

    Submitted 27 February, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: accepted by SIGMOD 2023

  23. arXiv:2209.04635  [pdf, other

    cs.LG cs.AI

    A Comparative Study on Unsupervised Anomaly Detection for Time Series: Experiments and Analysis

    Authors: Yan Zhao, Liwei Deng, Xuanhao Chen, Chenjuan Guo, Bin Yang, Tung Kieu, Feiteng Huang, Torben Bach Pedersen, Kai Zheng, Christian S. Jensen

    Abstract: The continued digitization of societal processes translates into a proliferation of time series data that cover applications such as fraud detection, intrusion detection, and energy management, where anomaly detection is often essential to enable reliability and safety. Many recent studies target anomaly detection for time series data. Indeed, area of time series anomaly detection is characterized… ▽ More

    Submitted 10 September, 2022; originally announced September 2022.

  24. arXiv:2208.10498  [pdf, other

    cs.LG cs.AI cs.SE

    Design Automation for Fast, Lightweight, and Effective Deep Learning Models: A Survey

    Authors: Dalin Zhang, Kaixuan Chen, Yan Zhao, Bin Yang, Lina Yao, Christian S. Jensen

    Abstract: Deep learning technologies have demonstrated remarkable effectiveness in a wide range of tasks, and deep learning holds the potential to advance a multitude of applications, including in edge computing, where deep models are deployed on edge devices to enable instant data processing and response. A key challenge is that while the application of deep models often incurs substantial memory and compu… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

  25. arXiv:2207.14539  [pdf, other

    cs.CV cs.LG

    Pre-training General Trajectory Embeddings with Maximum Multi-view Entropy Coding

    Authors: Yan Lin, Huaiyu Wan, Shengnan Guo, Jilin Hu, Christian S. Jensen, Youfang Lin

    Abstract: Spatio-temporal trajectories provide valuable information about movement and travel behavior, enabling various downstream tasks that in turn power real-world applications. Learning trajectory embeddings can improve task performance but may incur high computational costs and face limited training data availability. Pre-training learns generic embeddings by means of specially constructed pretext tas… ▽ More

    Submitted 25 December, 2023; v1 submitted 29 July, 2022; originally announced July 2022.

    Comments: 15 pages, 7 figures, accepted by IEEE Trans. on Knowledge and Data Engineering

  26. arXiv:2206.09112  [pdf, other

    cs.LG

    Decoupled Dynamic Spatial-Temporal Graph Neural Network for Traffic Forecasting

    Authors: Zezhi Shao, Zhao Zhang, Wei Wei, Fei Wang, Yongjun Xu, Xin Cao, Christian S. Jensen

    Abstract: We all depend on mobility, and vehicular transportation affects the daily lives of most of us. Thus, the ability to forecast the state of traffic in a road network is an important functionality and a challenging task. Traffic data is often obtained from sensors deployed in a road network. Recent proposals on spatial-temporal graph neural networks have achieved great progress at modeling complex sp… ▽ More

    Submitted 4 September, 2022; v1 submitted 18 June, 2022; originally announced June 2022.

    Comments: Accepted by VLDB 2022

  27. arXiv:2204.10203  [pdf, ps, other

    cs.DB cs.SI

    Maximizing the Influence of Bichromatic Reverse k Nearest Neighbors in Geo-Social Networks

    Authors: Pengfei Jin, Lu Chen, Yunjun Gao, Xueqin Chang, Zhanyu Liu, Christian S. Jensen

    Abstract: Geo-social networks offer opportunities for the marketing and promotion of geo-located services. In this setting, we explore a new problem, called Maximizing the Influence of Bichromatic Reverse k Nearest Neighbors (MaxInfBRkNN). The objective is to find a set of points of interest (POIs), which are geo-textually and socially attractive to social influencers who are expected to largely promote the… ▽ More

    Submitted 25 April, 2022; v1 submitted 21 April, 2022; originally announced April 2022.

  28. arXiv:2204.03341  [pdf, other

    cs.LG cs.DB

    Robust and Explainable Autoencoders for Unsupervised Time Series Outlier Detection---Extended Version

    Authors: Tung Kieu, Bin Yang, Chenjuan Guo, Christian S. Jensen, Yan Zhao, Feiteng Huang, Kai Zheng

    Abstract: Time series data occurs widely, and outlier detection is a fundamental problem in data mining, which has numerous applications. Existing autoencoder-based approaches deliver state-of-the-art performance on challenging real-world data but are vulnerable to outliers and exhibit low explainability. To address these two limitations, we propose robust and explainable unsupervised autoencoder frameworks… ▽ More

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: This paper has been accepted by IEEE ICDE 2022

  29. arXiv:2203.16110  [pdf, other

    cs.LG cs.AI

    Weakly-supervised Temporal Path Representation Learning with Contrastive Curriculum Learning -- Extended Version

    Authors: Sean Bin Yang, Chenjuan Guo, Jilin Hu, Bin Yang, Jian Tang, Christian S. Jensen

    Abstract: In step with the digitalization of transportation, we are witnessing a growing range of path-based smart-city applications, e.g., travel-time estimation and travel path ranking. A temporal path(TP) that includes temporal information, e.g., departure time, into the path is fundamental to enable such applications. In this setting, it is essential to learn generic temporal path representations(TPRs)… ▽ More

    Submitted 15 April, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: This paper has been accepted by IEEE ICDE-22

  30. arXiv:2203.14241  [pdf, other

    cs.SI

    Influence-aware Task Assignment in Spatial Crowdsourcing (Technical Report)

    Authors: Xuanhao Chen, Yan Zhao, Kai Zheng, Bin Yang, Christian S. Jensen

    Abstract: With the widespread diffusion of smartphones, Spatial Crowdsourcing (SC), which aims to assign spatial tasks to mobile workers, has drawn increasing attention in both academia and industry. One of the major issues is how to best assign tasks to workers. Given a worker and a task, the worker will choose to accept the task based on her affinity towards the task, and the worker can propagate the info… ▽ More

    Submitted 27 March, 2022; originally announced March 2022.

  31. arXiv:2112.11174  [pdf, other

    cs.LG cs.AI

    AutoCTS: Automated Correlated Time Series Forecasting -- Extended Version

    Authors: Xinle Wu, Dalin Zhang, Chenjuan Guo, Chaoyang He, Bin Yang, Christian S. Jensen

    Abstract: Correlated time series (CTS) forecasting plays an essential role in many cyber-physical systems, where multiple sensors emit time series that capture interconnected processes. Solutions based on deep learning that deliver state-of-the-art CTS forecasting performance employ a variety of spatio-temporal (ST) blocks that are able to model temporal dependencies and spatial correlations among time seri… ▽ More

    Submitted 21 December, 2021; originally announced December 2021.

    Comments: to appear in PVLDB 2022

  32. arXiv:2112.09339  [pdf, ps, other

    cs.LG cs.DB

    Deep Spatially and Temporally Aware Similarity Computation for Road Network Constrained Trajectories

    Authors: Ziquan Fang, Yuntao Du, Xinjun Zhu, Lu Chen, Yunjun Gao, Christian S. Jensen

    Abstract: Trajectory similarity computation has drawn massive attention, as it is core functionality in a wide range of applications such as ride-sharing, traffic analysis, and social recommendation. Motivated by the recent success of deep learning technologies, researchers start devoting efforts to learning-based similarity analyses to overcome the limitations (i.e., high cost and poor adaptability) of tra… ▽ More

    Submitted 26 February, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

  33. Unsupervised Time Series Outlier Detection with Diversity-Driven Convolutional Ensembles -- Extended Version

    Authors: David Campos, Tung Kieu, Chenjuan Guo, Feiteng Huang, Kai Zheng, Bin Yang, Christian S. Jensen

    Abstract: With the sweeping digitalization of societal, medical, industrial, and scientific processes, sensing technologies are being deployed that produce increasing volumes of time series data, thus fueling a plethora of new or improved applications. In this setting, outlier detection is frequently important, and while solutions based on neural networks exist, they leave room for improvement in terms of b… ▽ More

    Submitted 22 November, 2021; originally announced November 2021.

    Comments: 14 pages. An extended version of "Unsupervised Time Series Outlier Detection with Diversity-Driven Convolutional Ensembles", to appear in PVLDB 2022

    Journal ref: Proceedings of the VLDB Endowment, 15, 3 (2022), 611-623

  34. arXiv:2109.11609  [pdf, ps, other

    cs.DB

    Evolutionary Clustering of Streaming Trajectories

    Authors: Tianyi Li, Lu Chen, Christian S. Jensen, Torben Bach Pedersen, Jilin Hu

    Abstract: The widespread deployment of smartphones and location-enabled, networked in-vehicle devices renders it increasingly feasible to collect streaming trajectory data of moving objects. The continuous clustering of such data can enable a variety of real-time services, such as identifying representative paths or common moving trends among objects in real-time. However, little attention has so far been g… ▽ More

    Submitted 23 September, 2021; originally announced September 2021.

  35. arXiv:2107.05537  [pdf, other

    cs.DB

    PM-LSH: a fast and accurate in-memory framework for high-dimensional approximate NN and closest pair search

    Authors: Bolong Zheng, Xi Zhao, Lianggui Weng, Nguyen Quoc Viet Hung, Hang Liu, Christian S. Jensen

    Abstract: Nearest neighbor (NN) search is inherently computationally expensive in high-dimensional spaces due to the curse of dimensionality. As a well-known solution, locality-sensitive hashing (LSH) is able to answer c-approximate NN (c-ANN) queries in sublinear time with constant probability. Existing LSH methods focus mainly on building hash bucket-based indexing such that the candidate points can be re… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

  36. arXiv:2104.13321  [pdf, other

    cs.LG cs.DB

    UniTE -- The Best of Both Worlds: Unifying Function-Fitting and Aggregation-Based Approaches to Travel Time and Travel Speed Estimation

    Authors: Tobias Skovgaard Jepsen, Christian S. Jensen, Thomas Dyhre Nielsen

    Abstract: Travel time or speed estimation are part of many intelligent transportation applications. Existing estimation approaches rely on either function fitting or aggregation and represent different trade-offs between generalizability and accuracy. Function-fitting approaches learn functions that map feature vectors of, e.g., routes, to travel time or speed estimates, which enables generalization to unse… ▽ More

    Submitted 27 April, 2021; originally announced April 2021.

  37. arXiv:2101.08929  [pdf, other

    cs.DB

    REPOSE: Distributed Top-k Trajectory Similarity Search with Local Reference Point Tries

    Authors: Bolong Zheng, Lianggui Weng, Xi Zhao, Kai Zeng, Xiaofang Zhou, Christian S. Jensen

    Abstract: Trajectory similarity computation is a fundamental component in a variety of real-world applications, such as ridesharing, road planning, and transportation optimization. Recent advances in mobile devices have enabled an unprecedented increase in the amount of available trajectory data such that efficient query processing can no longer be supported by a single machine. As a result, means of perfor… ▽ More

    Submitted 26 January, 2021; v1 submitted 21 January, 2021; originally announced January 2021.

  38. arXiv:2009.12157  [pdf, other

    cs.DB eess.SP

    SOUP: Spatial-Temporal Demand Forecasting and Competitive Supply

    Authors: Bolong Zheng, Qi Hu, Lingfeng Ming, Jilin Hu, Lu Chen, Kai Zheng, Christian S. Jensen

    Abstract: We consider a setting with an evolving set of requests for transportation from an origin to a destination before a deadline and a set of agents capable of servicing the requests. In this setting, an assignment authority is to assign agents to requests such that the average idle time of the agents is minimized. An example is the scheduling of taxis (agents) to meet incoming requests for trips while… ▽ More

    Submitted 18 January, 2021; v1 submitted 24 September, 2020; originally announced September 2020.

  39. Relational Fusion Networks: Graph Convolutional Networks for Road Networks

    Authors: Tobias Skovgaard Jepsen, Christian S. Jensen, Thomas Dyhre Nielsen

    Abstract: The application of machine learning techniques in the setting of road networks holds the potential to facilitate many important intelligent transportation applications. Graph Convolutional Networks (GCNs) are neural networks that are capable of leveraging the structure of a network. However, many implicit assumptions of GCNs do not apply to road networks. We introduce the Relational Fusion Network… ▽ More

    Submitted 14 September, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: IEEE Transactions on Intelligent Transportation Systems (2020). arXiv admin note: substantial text overlap with arXiv:1908.11567

  40. arXiv:2005.03468  [pdf, ps, other

    cs.DB

    Indexing Metric Spaces for Exact Similarity Search

    Authors: Lu Chen, Yunjun Gao, Xuan Song, Zheng Li, Yifan Zhu, Xiaoye Miao, Christian S. Jensen

    Abstract: With the continued digitization of societal processes, we are seeing an explosion in available data. This is referred to as big data. In a research setting, three aspects of the data are often viewed as the main sources of challenges when attempting to enable value creation from big data: volume, velocity, and variety. Many studies address volume or velocity, while fewer studies concern the variet… ▽ More

    Submitted 23 May, 2022; v1 submitted 7 May, 2020; originally announced May 2020.

  41. arXiv:2003.08031  [pdf, other

    cs.DB

    PolyFit: Polynomial-based Indexing Approach for Fast Approximate Range Aggregate Queries

    Authors: Zhe Li, Tsz Nam Chan, Man Lung Yiu, Christian S. Jensen

    Abstract: Range aggregate queries find frequent application in data analytics. In some use cases, approximate results are preferred over accurate results if they can be computed rapidly and satisfy approximation guarantees. Inspired by a recent indexing approach, we provide means of representing a discrete point data set by continuous functions that can then serve as compact index structures. More specifica… ▽ More

    Submitted 10 February, 2021; v1 submitted 17 March, 2020; originally announced March 2020.

    Comments: 13 pages

  42. On Network Embedding for Machine Learning on Road Networks: A Case Study on the Danish Road Network

    Authors: Tobias Skovgaard Jepsen, Christian S. Jensen, Thomas Dyhre Nielsen

    Abstract: Road networks are a type of spatial network, where edges may be associated with qualitative information such as road type and speed limit. Unfortunately, such information is often incomplete; for instance, OpenStreetMap only has speed limits for 13% of all Danish road segments. This is problematic for analysis tasks that rely on such information for machine learning. To enable machine learning in… ▽ More

    Submitted 15 November, 2019; v1 submitted 14 November, 2019; originally announced November 2019.

    Comments: Best Paper at the 3rd IEEE International Workshop on Big Spatial Data (BSD 2018)

    Journal ref: 2018 IEEE International Conference on Big Data (Big Data), 2018, pp. 3422-3431

  43. arXiv:1908.11567  [pdf, other

    cs.LG cs.DB stat.ML

    Graph Convolutional Networks for Road Networks

    Authors: Tobias Skovgaard Jepsen, Christian S. Jensen, Thomas Dyhre Nielsen

    Abstract: Machine learning techniques for road networks hold the potential to facilitate many important transportation applications. Graph Convolutional Networks (GCNs) are neural networks that are capable of leveraging the structure of a road network by utilizing information of, e.g., adjacent road segments. While state-of-the-art GCNs target node classification tasks in social, citation, and biological ne… ▽ More

    Submitted 22 July, 2020; v1 submitted 30 August, 2019; originally announced August 2019.

    Comments: Ten-page pre-print version of a four-page ACM SIGSPATIAL 2019 poster paper

  44. arXiv:1811.05157  [pdf, other

    cs.LG stat.ML

    Recurrent Multi-Graph Neural Networks for Travel Cost Prediction

    Authors: Jilin Hu, Chenjuan Guo, Bin Yang, Christian S. Jensen, Lu Chen

    Abstract: Origin-destination (OD) matrices are often used in urban planning, where a city is partitioned into regions and an element (i, j) in an OD matrix records the cost (e.g., travel time, fuel consumption, or travel speed) from region i to region j. In this paper, we partition a day into multiple intervals, e.g., 96 15-min intervals and each interval is associated with an OD matrix which represents the… ▽ More

    Submitted 13 November, 2018; originally announced November 2018.

  45. arXiv:1802.07980  [pdf, ps, other

    cs.LG

    Learning to Route with Sparse Trajectory Sets---Extended Version

    Authors: Chenjuan Guo, Bin Yang, Jilin Hu, Christian S. Jensen

    Abstract: Motivated by the increasing availability of vehicle trajectory data, we propose learn-to-route, a comprehensive trajectory-based routing solution. Specifically, we first construct a graph-like structure from trajectories as the routing infrastructure. Second, we enable trajectory-based routing given an arbitrary (source, destination) pair. In the first step, given a road network and a collection… ▽ More

    Submitted 22 February, 2018; originally announced February 2018.

  46. arXiv:1711.02476  [pdf, other

    cs.DB

    SWOOP: Top-k Similarity Joins over Set Streams

    Authors: Willi Mann, Nikolaus Augsten, Christian S. Jensen

    Abstract: We provide efficient support for applications that aim to continuously find pairs of similar sets in rapid streams of sets. A prototypical example setting is that of tweets. A tweet is a set of words, and Twitter emits about half a billion tweets per day. Our solution makes it possible to efficiently maintain the top-$k$ most similar tweets from a pair of rapid Twitter streams, e.g., to discover s… ▽ More

    Submitted 2 December, 2019; v1 submitted 7 November, 2017; originally announced November 2017.

  47. arXiv:1607.08681  [pdf, other

    cs.DB

    A Density-Based Approach to the Retrieval of Top-K Spatial Textual Clusters

    Authors: Dingming Wu, Christian S. Jensen

    Abstract: Keyword-based web queries with local intent retrieve web content that is relevant to supplied keywords and that represent points of interest that are near the query location. Two broad categories of such queries exist. The first encompasses queries that retrieve single spatial web objects that each satisfy the query arguments. Most proposals belong to this category. The second category, to which t… ▽ More

    Submitted 28 July, 2016; originally announced July 2016.

  48. arXiv:1510.02886  [pdf, ps, other

    cs.DB

    Efficient and Accurate Path Cost Estimation Using Trajectory Data

    Authors: Jian Dai, Bin Yang, Chenjuan Guo, Christian S. Jensen

    Abstract: Using the growing volumes of vehicle trajectory data, it becomes increasingly possible to capture time-varying and uncertain travel costs in a road network, including travel time and fuel consumption. The current paradigm represents a road network as a graph, assigns weights to the graph's edges by fragmenting trajectories into small pieces that fit the underlying edges, and then applies a routing… ▽ More

    Submitted 3 December, 2015; v1 submitted 10 October, 2015; originally announced October 2015.

    Comments: 16pages, 42 figures

  49. arXiv:1411.3212  [pdf, other

    cs.DB cs.DC cs.DS

    Manycore processing of repeated range queries over massive moving objects observations

    Authors: Francesco Lettich, Salvatore Orlando, Claudio Silvestri, Christian S. Jensen

    Abstract: The ability to timely process significant amounts of continuously updated spatial data is mandatory for an increasing number of applications. Parallelism enables such applications to face this data-intensive challenge and allows the devised systems to feature low latency and high scalability. In this paper we focus on a specific data-intensive problem, concerning the repeated processing of huge am… ▽ More

    Submitted 12 November, 2014; originally announced November 2014.

    ACM Class: D.1.3; C.1.2

  50. arXiv:1308.0484  [pdf, ps, other

    cs.LG cs.DB

    Using Incomplete Information for Complete Weight Annotation of Road Networks -- Extended Version

    Authors: Bin Yang, Manohar Kaul, Christian S. Jensen

    Abstract: We are witnessing increasing interests in the effective use of road networks. For example, to enable effective vehicle routing, weighted-graph models of transportation networks are used, where the weight of an edge captures some cost associated with traversing the edge, e.g., greenhouse gas (GHG) emissions or travel time. It is a precondition to using a graph model for routing that all edges have… ▽ More

    Submitted 15 August, 2013; v1 submitted 2 August, 2013; originally announced August 2013.

    Comments: This is an extended version of "Using Incomplete Information for Complete Weight Annotation of Road Networks," which is accepted for publication in IEEE TKDE