Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 65 results for author: Tai, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19791  [pdf, other

    cs.RO

    Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene Understanding

    Authors: Yifan Tang, Cong Tai, Fangxing Chen, Wanting Zhang, Tao Zhang, Xueping Liu, Yongjin Liu, Long Zeng

    Abstract: Most existing robotic datasets capture static scene data and thus are limited in evaluating robots' dynamic performance. To address this, we present a mobile robot oriented large-scale indoor dataset, denoted as THUD (Tsinghua University Dynamic) robotic dataset, for training and evaluating their dynamic scene understanding algorithms. Specifically, the THUD dataset construction is first detailed,… ▽ More

    Submitted 30 June, 2024; v1 submitted 28 June, 2024; originally announced June 2024.

    Comments: This version has been accepted by ICRA2024 and the dataset has been published, where the link can be found in the paper

    Journal ref: IEEE International Conference on Robotics & Automation,2024

  2. arXiv:2406.10118  [pdf, other

    cs.CL

    SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

    Authors: Holy Lovenia, Rahmad Mahendra, Salsabil Maulana Akbar, Lester James V. Miranda, Jennifer Santoso, Elyanah Aco, Akhdan Fadhilah, Jonibek Mansurov, Joseph Marvin Imperial, Onno P. Kampman, Joel Ruben Antony Moniz, Muhammad Ravi Shulthan Habibi, Frederikus Hudi, Railey Montalan, Ryan Ignatius, Joanito Agili Lopo, William Nixon, Börje F. Karlsson, James Jaya, Ryandito Diandaru, Yuze Gao, Patrick Amadeus, Bin Wang, Jan Christian Blaise Cruz, Chenxi Whitehouse , et al. (36 additional authors not shown)

    Abstract: Southeast Asia (SEA) is a region rich in linguistic diversity and cultural variety, with over 1,300 indigenous languages and a population of 671 million people. However, prevailing AI models suffer from a significant lack of representation of texts, images, and audio datasets from SEA, compromising the quality of AI models for SEA languages. Evaluating models for SEA languages is challenging due t… ▽ More

    Submitted 5 July, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: https://github.com/SEACrowd

  3. arXiv:2405.08049  [pdf, other

    eess.IV cs.CV

    Optimizing Synthetic Correlated Diffusion Imaging for Breast Cancer Tumour Delineation

    Authors: Chi-en Amy Tai, Alexander Wong

    Abstract: Breast cancer is a significant cause of death from cancer in women globally, highlighting the need for improved diagnostic imaging to enhance patient outcomes. Accurate tumour identification is essential for diagnosis, treatment, and monitoring, emphasizing the importance of advanced imaging technologies that provide detailed views of tumour characteristics and disease. Synthetic correlated diffus… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  4. arXiv:2405.07869  [pdf, other

    eess.IV cs.CV

    Enhancing Clinically Significant Prostate Cancer Prediction in T2-weighted Images through Transfer Learning from Breast Cancer

    Authors: Chi-en Amy Tai, Alexander Wong

    Abstract: In 2020, prostate cancer saw a staggering 1.4 million new cases, resulting in over 375,000 deaths. The accurate identification of clinically significant prostate cancer is crucial for delivering effective treatment to patients. Consequently, there has been a surge in research exploring the application of deep neural networks to predict clinical significance based on magnetic resonance images. Howe… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  5. arXiv:2405.07861  [pdf, other

    eess.IV cs.CV

    Improving Breast Cancer Grade Prediction with Multiparametric MRI Created Using Optimized Synthetic Correlated Diffusion Imaging

    Authors: Chi-en Amy Tai, Alexander Wong

    Abstract: Breast cancer was diagnosed for over 7.8 million women between 2015 to 2020. Grading plays a vital role in breast cancer treatment planning. However, the current tumor grading method involves extracting tissue from patients, leading to stress, discomfort, and high medical costs. A recent paper leveraging volumetric deep radiomic features from synthetic correlated diffusion imaging (CDI$^s$) for br… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  6. arXiv:2405.07854  [pdf, other

    eess.IV cs.CV

    Using Multiparametric MRI with Optimized Synthetic Correlated Diffusion Imaging to Enhance Breast Cancer Pathologic Complete Response Prediction

    Authors: Chi-en Amy Tai, Alexander Wong

    Abstract: In 2020, 685,000 deaths across the world were attributed to breast cancer, underscoring the critical need for innovative and effective breast cancer treatment. Neoadjuvant chemotherapy has recently gained popularity as a promising treatment strategy for breast cancer, attributed to its efficacy in shrinking large tumors and leading to pathologic complete response. However, the current process to r… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  7. arXiv:2405.07814  [pdf, other

    cs.CV

    NutritionVerse-Direct: Exploring Deep Neural Networks for Multitask Nutrition Prediction from Food Images

    Authors: Matthew Keller, Chi-en Amy Tai, Yuhao Chen, Pengcheng Xi, Alexander Wong

    Abstract: Many aging individuals encounter challenges in effectively tracking their dietary intake, exacerbating their susceptibility to nutrition-related health complications. Self-reporting methods are often inaccurate and suffer from substantial bias; however, leveraging intelligent prediction methods can automate and enhance precision in this process. Recent work has explored using computer vision predi… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  8. arXiv:2403.05217  [pdf, other

    cs.CL cs.AI cs.IR

    Harnessing Multi-Role Capabilities of Large Language Models for Open-Domain Question Answering

    Authors: Hongda Sun, Yuxuan Liu, Chengwei Wu, Haiyu Yan, Cheng Tai, Xin Gao, Shuo Shang, Rui Yan

    Abstract: Open-domain question answering (ODQA) has emerged as a pivotal research spotlight in information systems. Existing methods follow two main paradigms to collect evidence: (1) The \textit{retrieve-then-read} paradigm retrieves pertinent documents from an external corpus; and (2) the \textit{generate-then-read} paradigm employs large language models (LLMs) to generate relevant documents. However, nei… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: TheWebConf 2024 (WWW 2024) oral, code repo: https://github.com/EthanLeo-LYX/LLMQA

  9. arXiv:2401.08598  [pdf, other

    cs.CV

    NutritionVerse-Real: An Open Access Manually Collected 2D Food Scene Dataset for Dietary Intake Estimation

    Authors: Chi-en Amy Tai, Saeejith Nair, Olivia Markham, Matthew Keller, Yifan Wu, Yuhao Chen, Alexander Wong

    Abstract: Dietary intake estimation plays a crucial role in understanding the nutritional habits of individuals and populations, aiding in the prevention and management of diet-related health issues. Accurate estimation requires comprehensive datasets of food scenes, including images, segmentation masks, and accompanying dietary intake metadata. In this paper, we introduce NutritionVerse-Real, an open acces… ▽ More

    Submitted 20 November, 2023; originally announced January 2024.

  10. arXiv:2312.06192  [pdf, other

    cs.CV

    NutritionVerse-Synth: An Open Access Synthetically Generated 2D Food Scene Dataset for Dietary Intake Estimation

    Authors: Saeejith Nair, Chi-en Amy Tai, Yuhao Chen, Alexander Wong

    Abstract: Manually tracking nutritional intake via food diaries is error-prone and burdensome. Automated computer vision techniques show promise for dietary monitoring but require large and diverse food image datasets. To address this need, we introduce NutritionVerse-Synth (NV-Synth), a large-scale synthetic food image dataset. NV-Synth contains 84,984 photorealistic meal images rendered from 7,082 dynamic… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 6 pages

  11. arXiv:2312.03540  [pdf, other

    cs.CV

    FoodFusion: A Latent Diffusion Model for Realistic Food Image Generation

    Authors: Olivia Markham, Yuhao Chen, Chi-en Amy Tai, Alexander Wong

    Abstract: Current state-of-the-art image generation models such as Latent Diffusion Models (LDMs) have demonstrated the capacity to produce visually striking food-related images. However, these generated images often exhibit an artistic or surreal quality that diverges from the authenticity of real-world food representations. This inadequacy renders them impractical for applications requiring realistic food… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  12. arXiv:2312.02966  [pdf, other

    cs.CV

    Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection

    Authors: Cheng-Ju Ho, Chen-Hsuan Tai, Yen-Yu Lin, Ming-Hsuan Yang, Yi-Hsuan Tsai

    Abstract: Semi-supervised object detection is crucial for 3D scene understanding, efficiently addressing the limitation of acquiring large-scale 3D bounding box annotations. Existing methods typically employ a teacher-student framework with pseudo-labeling to leverage unlabeled point clouds. However, producing reliable pseudo-labels in a diverse 3D space still remains challenging. In this work, we propose D… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: Accepted in NeurIPS 2023. Code is available at https://github.com/luluho1208/Diffusion-SS3D

  13. arXiv:2311.18612  [pdf, other

    eess.IV cs.CV

    Cancer-Net PCa-Gen: Synthesis of Realistic Prostate Diffusion Weighted Imaging Data via Anatomic-Conditional Controlled Latent Diffusion

    Authors: Aditya Sridhar, Chi-en Amy Tai, Hayden Gunraj, Yuhao Chen, Alexander Wong

    Abstract: In Canada, prostate cancer is the most common form of cancer in men and accounted for 20% of new cancer cases for this demographic in 2022. Due to recent successes in leveraging machine learning for clinical decision support, there has been significant interest in the development of deep neural networks for prostate cancer diagnosis, prognosis, and treatment planning using diffusion weighted imagi… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  14. arXiv:2311.17677  [pdf, other

    eess.IV cs.CV

    COVIDx CXR-4: An Expanded Multi-Institutional Open-Source Benchmark Dataset for Chest X-ray Image-Based Computer-Aided COVID-19 Diagnostics

    Authors: Yifan Wu, Hayden Gunraj, Chi-en Amy Tai, Alexander Wong

    Abstract: The global ramifications of the COVID-19 pandemic remain significant, exerting persistent pressure on nations even three years after its initial outbreak. Deep learning models have shown promise in improving COVID-19 diagnostics but require diverse and larger-scale datasets to improve performance. In this paper, we introduce COVIDx CXR-4, an expanded multi-institutional open-source benchmark datas… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  15. arXiv:2311.11656  [pdf, other

    eess.IV cs.CV

    Double-Condensing Attention Condenser: Leveraging Attention in Deep Learning to Detect Skin Cancer from Skin Lesion Images

    Authors: Chi-en Amy Tai, Elizabeth Janes, Chris Czarnecki, Alexander Wong

    Abstract: Skin cancer is the most common type of cancer in the United States and is estimated to affect one in five Americans. Recent advances have demonstrated strong performance on skin cancer detection, as exemplified by state of the art performance in the SIIM-ISIC Melanoma Classification Challenge; however these solutions leverage ensembles of complex deep neural architectures requiring immense storage… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  16. arXiv:2311.11647  [pdf, other

    cs.CV

    Cancer-Net PCa-Data: An Open-Source Benchmark Dataset for Prostate Cancer Clinical Decision Support using Synthetic Correlated Diffusion Imaging Data

    Authors: Hayden Gunraj, Chi-en Amy Tai, Alexander Wong

    Abstract: The recent introduction of synthetic correlated diffusion (CDI$^s$) imaging has demonstrated significant potential in the realm of clinical decision support for prostate cancer (PCa). CDI$^s$ is a new form of magnetic resonance imaging (MRI) designed to characterize tissue characteristics through the joint correlation of diffusion signal attenuation across different Brownian motion sensitivities.… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  17. arXiv:2309.07726  [pdf, other

    cs.RO

    GRID: Scene-Graph-based Instruction-driven Robotic Task Planning

    Authors: Zhe Ni, Xiaoxin Deng, Cong Tai, Xinyue Zhu, Qinghongbing Xie, Weihang Huang, Xiang Wu, Long Zeng

    Abstract: Recent works have shown that Large Language Models (LLMs) can facilitate the grounding of instructions for robotic task planning. Despite this progress, most existing works have primarily focused on utilizing raw images to aid LLMs in understanding environmental information. However, this approach not only limits the scope of observation but also typically necessitates extensive multimodal data co… ▽ More

    Submitted 10 March, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: 8 pages, 10 figures

  18. arXiv:2309.07704  [pdf, other

    cs.CV cs.AI

    NutritionVerse: Empirical Study of Various Dietary Intake Estimation Approaches

    Authors: Chi-en Amy Tai, Matthew Keller, Saeejith Nair, Yuhao Chen, Yifan Wu, Olivia Markham, Krish Parmar, Pengcheng Xi, Heather Keller, Sharon Kirkpatrick, Alexander Wong

    Abstract: Accurate dietary intake estimation is critical for informing policies and programs to support healthy eating, as malnutrition has been directly linked to decreased quality of life. However self-reporting methods such as food diaries suffer from substantial bias. Other conventional dietary assessment techniques and emerging alternative approaches such as mobile applications incur high time costs an… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  19. arXiv:2307.16081  [pdf, other

    cs.CL

    Roll Up Your Sleeves: Working with a Collaborative and Engaging Task-Oriented Dialogue System

    Authors: Lingbo Mo, Shijie Chen, Ziru Chen, Xiang Deng, Ashley Lewis, Sunit Singh, Samuel Stevens, Chang-You Tai, Zhen Wang, Xiang Yue, Tianshu Zhang, Yu Su, Huan Sun

    Abstract: We introduce TacoBot, a user-centered task-oriented digital assistant designed to guide users through complex real-world tasks with multiple steps. Covering a wide range of cooking and how-to tasks, we aim to deliver a collaborative and engaging dialogue experience. Equipped with language understanding, dialogue management, and response generation components supported by a robust search engine, Ta… ▽ More

    Submitted 29 July, 2023; originally announced July 2023.

  20. arXiv:2305.14215  [pdf, other

    cs.CL

    Exploring Chain-of-Thought Style Prompting for Text-to-SQL

    Authors: Chang-You Tai, Ziru Chen, Tianshu Zhang, Xiang Deng, Huan Sun

    Abstract: In-context learning with large language models (LLMs) has recently caught increasing attention due to its superior few-shot performance on various tasks. However, its performance on text-to-SQL parsing still has much room for improvement. In this paper, we hypothesize that a crucial aspect of LLMs to improve for text-to-SQL parsing is their multi-step reasoning ability. Thus, we systematically stu… ▽ More

    Submitted 27 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 main; long paper

  21. arXiv:2304.05899  [pdf, other

    cs.CV

    Cancer-Net BCa-S: Breast Cancer Grade Prediction using Volumetric Deep Radiomic Features from Synthetic Correlated Diffusion Imaging

    Authors: Chi-en Amy Tai, Hayden Gunraj, Alexander Wong

    Abstract: The prevalence of breast cancer continues to grow, affecting about 300,000 females in the United States in 2023. However, there are different levels of severity of breast cancer requiring different treatment strategies, and hence, grading breast cancer has become a vital component of breast cancer diagnosis and treatment planning. Specifically, the gold-standard Scarff-Bloom-Richardson (SBR) grade… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2211.05308

  22. arXiv:2304.05623  [pdf, other

    eess.IV cs.CV q-bio.QM

    A Multi-Institutional Open-Source Benchmark Dataset for Breast Cancer Clinical Decision Support using Synthetic Correlated Diffusion Imaging Data

    Authors: Chi-en Amy Tai, Hayden Gunraj, Alexander Wong

    Abstract: Recently, a new form of magnetic resonance imaging (MRI) called synthetic correlated diffusion (CDI$^s$) imaging was introduced and showed considerable promise for clinical decision support for cancers such as prostate cancer when compared to current gold-standard MRI techniques. However, the efficacy for CDI$^s$ for other forms of cancers such as breast cancer has not been as well-explored nor ha… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

  23. arXiv:2304.05620  [pdf, other

    cs.CV

    NutritionVerse-Thin: An Optimized Strategy for Enabling Improved Rendering of 3D Thin Food Models

    Authors: Chi-en Amy Tai, Jason Li, Sriram Kumar, Saeejith Nair, Yuhao Chen, Pengcheng Xi, Alexander Wong

    Abstract: With the growth in capabilities of generative models, there has been growing interest in using photo-realistic renders of common 3D food items to improve downstream tasks such as food printing, nutrition prediction, or management of food wastage. Despite 3D modelling capabilities being more accessible than ever due to the success of NeRF based view-synthesis, such rendering methods still struggle… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

  24. arXiv:2304.05619  [pdf, other

    cs.CV

    NutritionVerse-3D: A 3D Food Model Dataset for Nutritional Intake Estimation

    Authors: Chi-en Amy Tai, Matthew Keller, Mattie Kerrigan, Yuhao Chen, Saeejith Nair, Pengcheng Xi, Alexander Wong

    Abstract: 77% of adults over 50 want to age in place today, presenting a major challenge to ensuring adequate nutritional intake. It has been reported that one in four older adults that are 65 years or older are malnourished and given the direct link between malnutrition and decreased quality of life, there have been numerous studies conducted on how to efficiently track nutritional intake of food. Recent a… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

  25. RAPID: Enabling Fast Online Policy Learning in Dynamic Public Cloud Environments

    Authors: Drew Penney, Bin Li, Lizhong Chen, Jaroslaw J. Sydir, Anna Drewek-Ossowicka, Ramesh Illikkal, Charlie Tai, Ravi Iyer, Andrew Herdrich

    Abstract: Resource sharing between multiple workloads has become a prominent practice among cloud service providers, motivated by demand for improved resource utilization and reduced cost of ownership. Effective resource sharing, however, remains an open challenge due to the adverse effects that resource contention can have on high-priority, user-facing workloads with strict Quality of Service (QoS) require… ▽ More

    Submitted 3 September, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: Accepted in Neurocomputing

  26. arXiv:2212.09273  [pdf, other

    cs.CV

    Learning Object-level Point Augmentor for Semi-supervised 3D Object Detection

    Authors: Cheng-Ju Ho, Chen-Hsuan Tai, Yi-Hsuan Tsai, Yen-Yu Lin, Ming-Hsuan Yang

    Abstract: Semi-supervised object detection is important for 3D scene understanding because obtaining large-scale 3D bounding box annotations on point clouds is time-consuming and labor-intensive. Existing semi-supervised methods usually employ teacher-student knowledge distillation together with an augmentation strategy to leverage unlabeled point clouds. However, these methods adopt global augmentation wit… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: BMVC2022

  27. arXiv:2211.10039  [pdf, other

    cs.LG cs.AI

    Why the pseudo label based semi-supervised learning algorithm is effective?

    Authors: Zeping Min, Qian Ge, Cheng Tai

    Abstract: Recently, pseudo label based semi-supervised learning has achieved great success in many fields. The core idea of the pseudo label based semi-supervised learning algorithm is to use the model trained on the labeled data to generate pseudo labels on the unlabeled data, and then train a model to fit the previously generated pseudo labels. We give a theory analysis for why pseudo label based semi-sup… ▽ More

    Submitted 24 January, 2023; v1 submitted 18 November, 2022; originally announced November 2022.

  28. arXiv:2211.05997  [pdf, other

    cs.CV

    LiDAL: Inter-frame Uncertainty Based Active Learning for 3D LiDAR Semantic Segmentation

    Authors: Zeyu Hu, Xuyang Bai, Runze Zhang, Xin Wang, Guangyuan Sun, Hongbo Fu, Chiew-Lan Tai

    Abstract: We propose LiDAL, a novel active learning method for 3D LiDAR semantic segmentation by exploiting inter-frame uncertainty among LiDAR frames. Our core idea is that a well-trained model should generate robust results irrespective of viewpoints for scene scanning and thus the inconsistencies in model predictions across frames provide a very reliable measure of uncertainty for active sample selection… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

    Comments: ECCV 2022, supplementary materials included

  29. arXiv:2211.05308  [pdf, other

    cs.CV

    Enhancing Clinical Support for Breast Cancer with Deep Learning Models using Synthetic Correlated Diffusion Imaging

    Authors: Chi-en Amy Tai, Hayden Gunraj, Nedim Hodzic, Nic Flanagan, Ali Sabri, Alexander Wong

    Abstract: Breast cancer is the second most common type of cancer in women in Canada and the United States, representing over 25\% of all new female cancer cases. As such, there has been immense research and progress on improving screening and clinical support for breast cancer. In this paper, we investigate enhancing clinical support for breast cancer with deep learning models using a newly introduced magne… ▽ More

    Submitted 4 August, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  30. arXiv:2206.01875  [pdf, other

    cs.IR

    Prospective Preference Enhanced Mixed Attentive Model for Session-based Recommendation

    Authors: Bo Peng, Chang-Yu Tai, Srinivasan Parthasarathy, Xia Ning

    Abstract: Session-based recommendation aims to generate recommendations for the next item of users' interest based on a given session. In this manuscript, we develop prospective preference enhanced mixed attentive model (P2MAM) to generate session-based recommendations using two important factors: temporal patterns and estimates of users' prospective preferences. Unlike existing methods, P2MAM models the te… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

    Comments: Under review by IEEE Transactions on Knowledge and Data Engineering (TKDE)

  31. PoseCoach: A Customizable Analysis and Visualization System for Video-based Running Coaching

    Authors: Jingyuan Liu, Nazmus Saquib, Zhutian Chen, Rubaiat Habib Kazi, Li-Yi Wei, Hongbo Fu, Chiew-Lan Tai

    Abstract: Videos are an accessible form of media for analyzing sports postures and providing feedback to athletes. Existing sport-specific systems embed bespoke human pose attributes and thus can be hard to scale for new attributes, especially for users without programming experiences. Some systems retain scalability by directly showing the differences between two poses, but they might not clearly visualize… ▽ More

    Submitted 27 February, 2023; v1 submitted 19 April, 2022; originally announced April 2022.

  32. arXiv:2204.00164  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Filter-based Discriminative Autoencoders for Children Speech Recognition

    Authors: Chiang-Lin Tai, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang

    Abstract: Children speech recognition is indispensable but challenging due to the diversity of children's speech. In this paper, we propose a filter-based discriminative autoencoder for acoustic modeling. To filter out the influence of various speaker types and pitches, auxiliary information of the speaker and pitch features is input into the encoder together with the acoustic features to generate phonetic… ▽ More

    Submitted 23 May, 2022; v1 submitted 31 March, 2022; originally announced April 2022.

    Comments: Published in EUSIPCO 2022

  33. arXiv:2203.11496  [pdf, other

    cs.CV

    TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers

    Authors: Xuyang Bai, Zeyu Hu, Xinge Zhu, Qingqiu Huang, Yilun Chen, Hongbo Fu, Chiew-Lan Tai

    Abstract: LiDAR and camera are two important sensors for 3D object detection in autonomous driving. Despite the increasing popularity of sensor fusion in this field, the robustness against inferior image conditions, e.g., bad illumination and sensor misalignment, is under-explored. Existing fusion methods are easily affected by such conditions, mainly due to a hard association of LiDAR points and image pixe… ▽ More

    Submitted 22 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR2022; Code at \url{https://github.com/XuyangBai/TransFusion}; Based on this work, we achieve the 1st place in the leaderboard of nuScenes tracking

  34. arXiv:2203.08906  [pdf, other

    cs.AR cs.DC cs.NI

    ORCA: A Network and Architecture Co-design for Offloading us-scale Datacenter Applications

    Authors: Yifan Yuan, Jinghan Huang, Yan Sun, Tianchen Wang, Jacob Nelson, Dan R. K. Ports, Yipeng Wang, Ren Wang, Charlie Tai, Nam Sung Kim

    Abstract: Responding to the "datacenter tax" and "killer microseconds" problems for datacenter applications, diverse solutions including Smart NIC-based ones have been proposed. Nonetheless, they often suffer from high overhead of communications over network and/or PCIe links. To tackle the limitations of the current solutions, this paper proposes ORCA, a holistic network and architecture co-design solution… ▽ More

    Submitted 17 October, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: This paper has been accepted by HPCA'23. This arxiv paper is not the final camera-ready version

  35. PROMPT: Learning Dynamic Resource Allocation Policies for Network Applications

    Authors: Drew Penney, Bin Li, Jaroslaw Sydir, Lizhong Chen, Charlie Tai, Stefan Lee, Eoin Walsh, Thomas Long

    Abstract: A growing number of service providers are exploring methods to improve server utilization and reduce power consumption by co-scheduling high-priority latency-critical workloads with best-effort workloads. This practice requires strict resource allocation between workloads to reduce contention and maintain Quality-of-Service (QoS) guarantees. Prior work demonstrated promising opportunities to dynam… ▽ More

    Submitted 24 March, 2023; v1 submitted 19 January, 2022; originally announced January 2022.

    Comments: Accepted in Future Generation Computer Systems (FGCS)

  36. arXiv:2112.09215  [pdf, other

    cs.CL cs.AI

    Hyperbolic Disentangled Representation for Fine-Grained Aspect Extraction

    Authors: Chang-You Tai, Ming-Yao Li, Lun-Wei Ku

    Abstract: Automatic identification of salient aspects from user reviews is especially useful for opinion analysis. There has been significant progress in utilizing weakly supervised approaches, which require only a small set of seed words for training aspect classifiers. However, there is always room for improvement. First, no weakly supervised approaches fully utilize latent hierarchies between words. Seco… ▽ More

    Submitted 16 December, 2021; originally announced December 2021.

  37. arXiv:2108.08771  [pdf, other

    cs.CV

    Learning to Match Features with Seeded Graph Matching Network

    Authors: Hongkai Chen, Zixin Luo, Jiahui Zhang, Lei Zhou, Xuyang Bai, Zeyu Hu, Chiew-Lan Tai, Long Quan

    Abstract: Matching local features across images is a fundamental problem in computer vision. Targeting towards high accuracy and efficiency, we propose Seeded Graph Matching Network, a graph neural network with sparse structure to reduce redundant connectivity and learn compact representation. The network consists of 1) Seeding Module, which initializes the matching by generating a small set of reliable mat… ▽ More

    Submitted 19 August, 2021; originally announced August 2021.

    Comments: Accepted by ICCV2021, code to be realeased at https://github.com/vdvchen/SGMNet

  38. arXiv:2107.13824  [pdf, other

    cs.CV

    VMNet: Voxel-Mesh Network for Geodesic-Aware 3D Semantic Segmentation

    Authors: Zeyu Hu, Xuyang Bai, Jiaxiang Shang, Runze Zhang, Jiayu Dong, Xin Wang, Guangyuan Sun, Hongbo Fu, Chiew-Lan Tai

    Abstract: In recent years, sparse voxel-based methods have become the state-of-the-arts for 3D semantic segmentation of indoor scenes, thanks to the powerful 3D CNNs. Nevertheless, being oblivious to the underlying geometry, voxel-based methods suffer from ambiguous features on spatially close objects and struggle with handling complex and irregular geometries due to the lack of geodesic information. In vie… ▽ More

    Submitted 25 July, 2022; v1 submitted 29 July, 2021; originally announced July 2021.

    Comments: V1: ICCV2021(Oral), supplementary materials included V2: TPAMI(ICCV2021 SI), supplementary materials included

  39. arXiv:2103.10891  [pdf, other

    cs.LG cs.DC cs.PF

    Accelerating SLIDE Deep Learning on Modern CPUs: Vectorization, Quantizations, Memory Optimizations, and More

    Authors: Shabnam Daghaghi, Nicholas Meisburger, Mengnan Zhao, Yong Wu, Sameh Gobriel, Charlie Tai, Anshumali Shrivastava

    Abstract: Deep learning implementations on CPUs (Central Processing Units) are gaining more traction. Enhanced AI capabilities on commodity x86 architectures are commercially appealing due to the reuse of existing hardware and virtualization ease. A notable work in this direction is the SLIDE system. SLIDE is a C++ implementation of a sparse hash table based back-propagation, which was shown to be significa… ▽ More

    Submitted 5 March, 2021; originally announced March 2021.

  40. arXiv:2103.05465  [pdf, other

    cs.CV

    PointDSC: Robust Point Cloud Registration using Deep Spatial Consistency

    Authors: Xuyang Bai, Zixin Luo, Lei Zhou, Hongkai Chen, Lei Li, Zeyu Hu, Hongbo Fu, Chiew-Lan Tai

    Abstract: Removing outlier correspondences is one of the critical steps for successful feature-based point cloud registration. Despite the increasing popularity of introducing deep learning methods in this field, spatial consistency, which is essentially established by a Euclidean transformation between point clouds, has received almost no individual attention in existing learning frameworks. In this paper,… ▽ More

    Submitted 9 March, 2021; originally announced March 2021.

    Comments: Accepted to CVPR 2021, supplementary materials included

  41. arXiv:2010.07125  [pdf, other

    cs.SI

    Influence Maximization Based on Dynamic Personal Perception in Knowledge Graph

    Authors: Ya-Wen Teng, Yishuo Shi, Chih-Hua Tai, De-Nian Yang, Wang-Chien Lee, Ming-Syan Chen

    Abstract: Viral marketing on social networks, also known as Influence Maximization (IM), aims to select k users for the promotion of a target item by maximizing the total spread of their influence. However, most previous works on IM do not explore the dynamic user perception of promoted items in the process. In this paper, by exploiting the knowledge graph (KG) to capture dynamic user perception, we formula… ▽ More

    Submitted 30 September, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

  42. arXiv:2008.08947  [pdf, other

    cs.IT eess.SP

    An Overview of Generalized Frequency Division Multiplexing (GFDM)

    Authors: Ching-Lun Tai, Tzu-Han Wang, Yu-Hua Huang

    Abstract: As a candidate waveform for next-generation wireless communications, generalized frequency division multiplexing (GFDM) features several decent properties which make it promising. In this paper, we systematically overview the research about GFDM. We start with GFDM transceivers with their main components, which consist of prototype filter design, low-complexity transceiver implementation, and symb… ▽ More

    Submitted 20 August, 2020; originally announced August 2020.

    Comments: 14 pages, 5 figures

  43. JSENet: Joint Semantic Segmentation and Edge Detection Network for 3D Point Clouds

    Authors: Zeyu Hu, Mingmin Zhen, Xuyang Bai, Hongbo Fu, Chiew-lan Tai

    Abstract: Semantic segmentation and semantic edge detection can be seen as two dual problems with close relationships in computer vision. Despite the fast evolution of learning-based 3D semantic segmentation methods, little attention has been drawn to the learning of 3D semantic edge detectors, even less to a joint learning method for the two tasks. In this paper, we tackle the 3D semantic edge detection ta… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

    Comments: Accepted to ECCV 2020, supplementary materials included

  44. arXiv:2007.04552  [pdf, other

    cs.AR cs.OS

    IOCA: High-Speed I/O-Aware LLC Management for Network-Centric Multi-Tenant Platform

    Authors: Yifan Yuan, Mohammad Alian, Yipeng Wang, Ilia Kurakin, Ren Wang, Charlie Tai, Nam Sung Kim

    Abstract: In modern server CPUs, last-level cache (LLC) is a critical hardware resource that exerts significant influence on the performance of the workloads, and how to manage LLC is a key to the performance isolation and QoS in the cloud with multi-tenancy. In this paper, we argue that besides CPU cores, high-speed network I/O is also important for LLC management. This is because of an Intel architectural… ▽ More

    Submitted 4 March, 2021; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: Accepted by the 48th IEEE/ACM International Symposium on Computer Architecture (ISCA'21). The title is "Don't Forget the I/O When Allocating Your LLC"

  45. MVIN: Learning Multiview Items for Recommendation

    Authors: Chang-You Tai, Meng-Ru Wu, Yun-Wei Chu, Shao-Yu Chu, Lun-Wei Ku

    Abstract: Researchers have begun to utilize heterogeneous knowledge graphs (KGs) as auxiliary information in recommendation systems to mitigate the cold start and sparsity issues. However, utilizing a graph neural network (GNN) to capture information in KG and further apply in RS is still problematic as it is unable to see each item's properties from multiple perspectives. To address these issues, we propos… ▽ More

    Submitted 26 May, 2020; originally announced May 2020.

  46. arXiv:2003.05855  [pdf, other

    cs.CV

    End-to-End Learning Local Multi-view Descriptors for 3D Point Clouds

    Authors: Lei Li, Siyu Zhu, Hongbo Fu, Ping Tan, Chiew-Lan Tai

    Abstract: In this work, we propose an end-to-end framework to learn local multi-view descriptors for 3D point clouds. To adopt a similar multi-view representation, existing studies use hand-crafted viewpoints for rendering in a preprocessing stage, which is detached from the subsequent descriptor learning stage. In our framework, we integrate the multi-view rendering into neural networks by using a differen… ▽ More

    Submitted 16 March, 2020; v1 submitted 12 March, 2020; originally announced March 2020.

    Comments: CVPR 2020. Webpage: https://github.com/craigleili/3DLocalMultiViewDesc

  47. arXiv:2003.03164  [pdf, other

    cs.CV

    D3Feat: Joint Learning of Dense Detection and Description of 3D Local Features

    Authors: Xuyang Bai, Zixin Luo, Lei Zhou, Hongbo Fu, Long Quan, Chiew-Lan Tai

    Abstract: A successful point cloud registration often lies on robust establishment of sparse matches through discriminative 3D local features. Despite the fast evolution of learning-based 3D feature descriptors, little attention has been drawn to the learning of 3D feature detectors, even less for a joint learning of the two tasks. In this paper, we leverage a 3D fully convolutional network for 3D point clo… ▽ More

    Submitted 6 March, 2020; originally announced March 2020.

    Comments: Accepted to CVPR 2020, supplementary materials included

  48. arXiv:2001.05744  [pdf, other

    cs.CV

    SketchDesc: Learning Local Sketch Descriptors for Multi-view Correspondence

    Authors: Deng Yu, Lei Li, Youyi Zheng, Manfred Lau, Yi-Zhe Song, Chiew-Lan Tai, Hongbo Fu

    Abstract: In this paper, we study the problem of multi-view sketch correspondence, where we take as input multiple freehand sketches with different views of the same object and predict as output the semantic correspondence among the sketches. This problem is challenging since the visual features of corresponding points at different views can be very different. To this end, we take a deep learning approach a… ▽ More

    Submitted 10 August, 2020; v1 submitted 16 January, 2020; originally announced January 2020.

    Comments: Accepted by IEEE Transactions on Circuits and Systems for Video Technology

  49. arXiv:1909.13342  [pdf, other

    cs.IT eess.SP

    Interference-Precancelled Pilot Design for LMMSE Channel Estimation of GFDM

    Authors: Ching-Lun Tai, Borching Su, Cai Jia

    Abstract: Generalized frequency division multiplexing (GFDM) is a promising candidate waveform for next-generation wireless communication systems. However, GFDM channel estimation is still challenging due to the inherent interference. In this paper, we formulate a pilot design framework with linear minimum mean square error (LMMSE) channel estimation for GFDM, and propose a novel pilot design to achieve int… ▽ More

    Submitted 29 September, 2019; originally announced September 2019.

    Comments: 5 pages, 6 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  50. Greedy Algorithms for Hybrid Compressed Sensing

    Authors: Ching-Lun Tai, Sung-Hsien Hsieh, Chun-Shien Lu

    Abstract: Compressed sensing (CS) is a technique which uses fewer measurements than dictated by the Nyquist sampling theorem. The traditional CS with linear measurements achieves efficient recovery performances, but it suffers from the large bit consumption due to the huge storage occupied by those measurements. Then, the one-bit CS with binary measurements is proposed and saves the bit budget, but it is in… ▽ More

    Submitted 17 August, 2019; originally announced August 2019.

    Comments: 13 pages, 6 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible