Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleNovember 2024
A Unified Framework for Mining Batch and Periodic Batch in Data Streams
IEEE Transactions on Knowledge and Data Engineering (IEEECS_TKDE), Volume 36, Issue 11Pages 5544–5561https://doi.org/10.1109/TKDE.2024.3399024<italic>Batch</italic> is an important pattern in data streams, which refers to a group of identical items that arrive closely. We find that some special batches that arrive periodically are of great value. In this paper, we formally define a new pattern, ...
- research-articleAugust 2024
Enabling space-time efficient range queries with REncoder
- Zhuochen Fan,
- Bowen Ye,
- Ziwei Wang,
- Zheng Zhong,
- Jiarui Guo,
- Yuhan Wu,
- Haoyu Li,
- Tong Yang,
- Yaofeng Tu,
- Zirui Liu,
- Bin Cui
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 33, Issue 6Pages 1837–1859https://doi.org/10.1007/s00778-024-00873-wAbstractA range filter is a data structure to answer range membership queries. Range queries are common in modern applications, and range filters have gained rising attention for improving the performance of range queries by ruling out empty range ...
- research-articleAugust 2024
X-former elucidator: reviving efficient attention for long context language modeling
IJCAI '24: Proceedings of the Thirty-Third International Joint Conference on Artificial IntelligenceArticle No.: 904, Pages 8179–8187https://doi.org/10.24963/ijcai.2024/904Transformer-based LLMs are becoming increasingly important in various AI applications. However, apart from the success of LLMs, the explosive demand of long context handling capabilities is a key and in-time problem for both academia and industry. Due to ...
- research-articleJune 2024
BurstBalancer: Do Less, Better Balance for Large-Scale Data Center Traffic
- Zirui Liu,
- Yikai Zhao,
- Zhuochen Fan,
- Tong Yang,
- Xiaodong Li,
- Ruwen Zhang,
- Kaicheng Yang,
- Zihan Jiang,
- Zheng Zhong,
- Yi Huang,
- Cong Liu,
- Jing Hu,
- Gaogang Xie,
- Bin Cui
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 35, Issue 6Pages 932–949https://doi.org/10.1109/TPDS.2023.3295454Layer-3 load balancing is a key topic in the networking field. It is well acknowledged that flowlet is the most promising solution because of its good trade-off between load balance and packet reordering. However, we find its one significant limitation: ...
-
- ArticleOctober 2023
Graph-Enforced Neural Network for Attributed Graph Clustering
AbstractGraph clustering aims to discover cluster structures in graphs. This task becomes more challenging when each node in the graph is associated with an attribute vector (i.e., the attributed graph). Recently, methods built on Graph Auto-Encoder (GAE) ...
- research-articleJanuary 2023
Hole Inpainting Algorithm for Half-Organized Point Cloud Obtained by Structured-Light Section System
IEEE Transactions on Multimedia (TOM), Volume 25Pages 8170–8182https://doi.org/10.1109/TMM.2022.3233254The advances in sensors and data processing technologies enrich the types of 3D point clouds acquirement, empowering numerous extensive and novel applications such as 3D reconstruction in various scenarios. However, the hole defects affect the accuracy ...
- research-articleNovember 2022
PCG: a privacy preserving collaborative graph neural network training framework
- research-articleJuly 2022
ICS-GNN: lightweight interactive community search via graph neural network
- research-articleJuly 2022
VolcanoML: speeding up end-to-end AutoML via scalable search space decomposition
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 32, Issue 2Pages 389–413https://doi.org/10.1007/s00778-022-00752-2AbstractEnd-to-end AutoML has attracted intensive interests from both academia and industry which automatically searches for ML pipelines in a space induced by feature engineering, algorithm/model selection, and hyper-parameter tuning. Existing AutoML ...
- research-articleMay 2021
Memory-aware framework for fast and scalable second-order random walk over billion-edge natural graphs
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 30, Issue 5Pages 769–797https://doi.org/10.1007/s00778-021-00669-2AbstractSecond-order random walk is an important technique for graph analysis. Many applications including graph embedding, proximity measure and community detection use it to capture higher-order patterns in the graph, thus improving the model accuracy. ...
- research-articleApril 2021
Model averaging in distributed machine learning: a case study with Apache Spark
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 30, Issue 4Pages 693–712https://doi.org/10.1007/s00778-021-00664-7AbstractThe increasing popularity of Apache Spark has attracted many users to put their data into its ecosystem. On the other hand, it has been witnessed in the literature that Spark is slow when it comes to distributed machine learning (ML). One resort ...
- ArticleAugust 2020
Densely-Connected Transformer with Co-attentive Information for Matching Text Sequences
AbstractSentence matching, which aims to capture the semantic relationship between two sequences, is a crucial problem in NLP research. It plays a vital role in various natural language tasks such as question answering, natural language inference and ...
- research-articleAugust 2019
Inductance Characteristics of the High-Frequency Transformer in Dual Active Bridge Converters
2019 22nd International Conference on Electrical Machines and Systems (ICEMS)Pages 1–5https://doi.org/10.1109/ICEMS.2019.8921741The high-frequency transformer (HFT) is an important component of dual active bridge (DAB) converters to achieve galvanic isolation and bidirectional power transmission, which is widely utilized in renewable energy, distribution grids, railway traction ...
- ArticleApril 2019
Sparse Gradient Compression for Distributed SGD
AbstractCommunication bandwidth is a bottleneck in distributed machine learning, and limits the system scalability. The transmission of gradients often dominates the communication in distributed SGD. One promising technique is using the gradient ...
- ArticleJuly 2018
CUTE: Querying Knowledge Graphs by Tabular Examples
AbstractKnowledge graphs and the query language SPARQL have opened up the possibility of retrieving information, acquiring knowledge and building applications over large linked data. However, due to the unfamiliarity with both SPARQL and the datasets, ...
- research-articleOctober 2017
Research on a dual active bridge based power electronics transformer using nanocrystalline and Silicon Carbide
IECON 2017 - 43rd Annual Conference of the IEEE Industrial Electronics SocietyPages 1470–1475https://doi.org/10.1109/IECON.2017.8216250A power electronics transformer (PET) based on dual active bridge (DAB) with input series and output parallel (ISOP) cascaded topology is studied in this paper. Fe-based nanocrystalline alloys transformer cores and Silicon Carbide (SiC) power MOSFETs are ...
- research-articleOctober 2016
Adapting to User Interest Drift for POI Recommendation
IEEE Transactions on Knowledge and Data Engineering (IEEECS_TKDE), Volume 28, Issue 10Pages 2566–2581https://doi.org/10.1109/TKDE.2016.2580511Point-of-Interest recommendation is an essential means to help people discover attractive locations, especially when people travel out of town or to unfamiliar regions. While a growing line of research has focused on modeling user geographical ...
- bookJune 2016
Spatio-Temporal Recommendation in Social Media
This book covers the major fundamentals of and the latest research on next-generation spatio-temporal recommendation systems in social media. It begins by describing the emerging characteristics of social media in the era of mobile internet, and ...
- research-articleJune 2015
Heterogeneous Environment Aware Streaming Graph Partitioning
IEEE Transactions on Knowledge and Data Engineering (IEEECS_TKDE), Volume 27, Issue 6Pages 1560–1572https://doi.org/10.1109/TKDE.2014.2377743With the increasing availability of graph data and widely adopted cloud computing paradigm, graph partitioning has become an efficient pre-processing technique to balance the computing workload and cope with the large scale of input data. Since the cost ...