Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 61 results for author: Min, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10845  [pdf, other

    cs.CV

    LAIP: Learning Local Alignment from Image-Phrase Modeling for Text-based Person Search

    Authors: Haiguang Wang, Yu Wu, Mengxia Wu, Cao Min, Min Zhang

    Abstract: Text-based person search aims at retrieving images of a particular person based on a given textual description. A common solution for this task is to directly match the entire images and texts, i.e., global alignment, which fails to deal with discerning specific details that discriminate against appearance-similar people. As a result, some works shift their attention towards local alignment. One g… ▽ More

    Submitted 23 June, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

  2. arXiv:2406.05059  [pdf, other

    cs.CV

    GenHeld: Generating and Editing Handheld Objects

    Authors: Chaerin Min, Srinath Sridhar

    Abstract: Grasping is an important human activity that has long been studied in robotics, computer vision, and cognitive science. Most existing works study grasping from the perspective of synthesizing hand poses conditioned on 3D or 2D object representations. We propose GenHeld to address the inverse problem of synthesizing held objects conditioned on 3D hand model or 2D image. Given a 3D model of hand, Ge… ▽ More

    Submitted 14 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

  3. arXiv:2405.04390  [pdf, other

    cs.CV

    DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving

    Authors: Chen Min, Dawei Zhao, Liang Xiao, Jian Zhao, Xinli Xu, Zheng Zhu, Lei Jin, Jianshu Li, Yulan Guo, Junliang Xing, Liping Jing, Yiming Nie, Bin Dai

    Abstract: Vision-centric autonomous driving has recently raised wide attention due to its lower cost. Pre-training is essential for extracting a universal representation. However, current vision-centric pre-training typically relies on either 2D or 3D pre-text tasks, overlooking the temporal characteristics of autonomous driving as a 4D scene understanding task. In this paper, we address this challenge by i… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Accepted by CVPR2024

  4. arXiv:2405.03520  [pdf, other

    cs.CV

    Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond

    Authors: Zheng Zhu, Xiaofeng Wang, Wangbo Zhao, Chen Min, Nianchen Deng, Min Dou, Yuqi Wang, Botian Shi, Kai Wang, Chi Zhang, Yang You, Zhaoxiang Zhang, Dawei Zhao, Liang Xiao, Jian Zhao, Jiwen Lu, Guan Huang

    Abstract: General world models represent a crucial pathway toward achieving Artificial General Intelligence (AGI), serving as the cornerstone for various applications ranging from virtual environments to decision-making systems. Recently, the emergence of the Sora model has attained significant attention due to its remarkable simulation capabilities, which exhibits an incipient comprehension of physical law… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: This survey will be regularly updated at: https://github.com/GigaAI-research/General-World-Models-Survey

  5. arXiv:2404.15067  [pdf, other

    cs.CL

    Enhancing Textual Personality Detection toward Social Media: Integrating Long-term and Short-term Perspectives

    Authors: Haohao Zhu, Xiaokun Zhang, Junyu Lu, Youlin Wu, Zewen Bai, Changrong Min, Liang Yang, Bo Xu, Dongyu Zhang, Hongfei Lin

    Abstract: Textual personality detection aims to identify personality characteristics by analyzing user-generated content toward social media platforms. Numerous psychological literature highlighted that personality encompasses both long-term stable traits and short-term dynamic states. However, existing studies often concentrate only on either long-term or short-term personality representations, without eff… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 11 pages, 9 figures

  6. arXiv:2404.05181  [pdf, other

    cs.CV

    Adaptive Learning for Multi-view Stereo Reconstruction

    Authors: Qinglu Min, Jie Zhao, Zhihao Zhang, Chen Min

    Abstract: Deep learning has recently demonstrated its excellent performance on the task of multi-view stereo (MVS). However, loss functions applied for deep MVS are rarely studied. In this paper, we first analyze existing loss functions' properties for deep depth based MVS approaches. Regression based loss leads to inaccurate continuous results by computing mathematical expectation, while classification bas… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  7. arXiv:2403.17863  [pdf, other

    cs.DC

    An AI-Native Runtime for Multi-Wearable Environments

    Authors: Chulhong Min, Utku Günay Acer, SiYoung Jang, Sangwon Choi, Diana A. Vasile, Taesik Gong, Juheon Yi, Fahim Kawsar

    Abstract: The miniaturization of AI accelerators is paving the way for next-generation wearable applications within wearable technologies. We introduce Mojito, an AI-native runtime with advanced MLOps designed to facilitate the development and deployment of these applications on wearable devices. It emphasizes the necessity of dynamic orchestration of distributed resources equipped with ultra-low-power AI a… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 7 pages, 4 figures

  8. arXiv:2403.00889  [pdf, other

    cs.CR cs.LG eess.SP

    Time-bound Contextual Bio-ID Generation for Minimalist Wearables

    Authors: Adiba Orzikulova, Diana A. Vasile, Fahim Kawsar, Chulhong Min

    Abstract: As wearable devices become increasingly miniaturized and powerful, a new opportunity arises for instant and dynamic device-to-device collaboration and human-to-device interaction. However, this progress presents a unique challenge: these minimalist wearables lack inherent mechanisms for real-time authentication, posing significant risks to data privacy and overall security. To address this, we int… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  9. arXiv:2401.14132  [pdf, other

    cs.CV cs.DC

    Enabling Cross-Camera Collaboration for Video Analytics on Distributed Smart Cameras

    Authors: Chulhong Min, Juheon Yi, Utku Gunay Acer, Fahim Kawsar

    Abstract: Overlapping cameras offer exciting opportunities to view a scene from different angles, allowing for more advanced, comprehensive and robust analysis. However, existing visual analytics systems for multi-camera streams are mostly limited to (i) per-camera processing and aggregation and (ii) workload-agnostic centralized processing architectures. In this paper, we present Argus, a distributed video… ▽ More

    Submitted 26 January, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: 18 pages, under review

  10. arXiv:2401.08637  [pdf, other

    cs.DC cs.LG

    Synergy: Towards On-Body AI via Tiny AI Accelerator Collaboration on Wearables

    Authors: Taesik Gong, Si Young Jang, Utku Günay Acer, Fahim Kawsar, Chulhong Min

    Abstract: The advent of tiny artificial intelligence (AI) accelerators enables AI to run at the extreme edge, offering reduced latency, lower power cost, and improved privacy. When integrated into wearable devices, these accelerators open exciting opportunities, allowing various AI apps to run directly on the body. We present Synergy that provides AI apps with best-effort performance via system-driven holis… ▽ More

    Submitted 2 July, 2024; v1 submitted 11 December, 2023; originally announced January 2024.

  11. arXiv:2312.10920  [pdf, other

    cs.LG stat.ME

    Domain adaption and physical constrains transfer learning for shale gas production

    Authors: Zhaozhong Yang, Liangjie Gou, Chao Min, Duo Yi, Xiaogang Li, Guoquan Wen

    Abstract: Effective prediction of shale gas production is crucial for strategic reservoir development. However, in new shale gas blocks, two main challenges are encountered: (1) the occurrence of negative transfer due to insufficient data, and (2) the limited interpretability of deep learning (DL) models. To tackle these problems, we propose a novel transfer learning methodology that utilizes domain adaptat… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

  12. arXiv:2312.06255  [pdf, ps, other

    cs.LG

    Ensemble Interpretation: A Unified Method for Interpretable Machine Learning

    Authors: Chao Min, Guoyong Liao, Guoquan Wen, Yingjun Li, Xing Guo

    Abstract: To address the issues of stability and fidelity in interpretable learning, a novel interpretable methodology, ensemble interpretation, is presented in this paper which integrates multi-perspective explanation of various interpretation methods. On one hand, we define a unified paradigm to describe the common mechanism of different interpretation methods, and then integrate the multiple interpretati… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  13. arXiv:2311.17878  [pdf, other

    cs.CV

    TSDF-Sampling: Efficient Sampling for Neural Surface Field using Truncated Signed Distance Field

    Authors: Chaerin Min, Sehyun Cha, Changhee Won, Jongwoo Lim

    Abstract: Multi-view neural surface reconstruction has exhibited impressive results. However, a notable limitation is the prohibitively slow inference time when compared to traditional techniques, primarily attributed to the dense sampling, required to maintain the rendering quality. This paper introduces a novel approach that substantially reduces the number of samplings by incorporating the Truncated Sign… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  14. arXiv:2309.10402  [pdf, other

    cs.LG stat.ML

    Minimum width for universal approximation using ReLU networks on compact domain

    Authors: Namjun Kim, Chanho Min, Sejun Park

    Abstract: It has been shown that deep neural networks of a large enough width are universal approximators but they are not if the width is too small. There were several attempts to characterize the minimum width $w_{\min}$ enabling the universal approximation property; however, only a few of them found the exact values. In this work, we show that the minimum width for $L^p$ approximation of $L^p$ functions… ▽ More

    Submitted 5 March, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

  15. arXiv:2308.07241  [pdf, other

    cs.RO cs.AI

    Context-Aware Planning and Environment-Aware Memory for Instruction Following Embodied Agents

    Authors: Byeonghwi Kim, Jinyeon Kim, Yuyeong Kim, Cheolhong Min, Jonghyun Choi

    Abstract: Accomplishing household tasks requires to plan step-by-step actions considering the consequences of previous actions. However, the state-of-the-art embodied agents often make mistakes in navigating the environment and interacting with proper objects due to imperfect learning by imitating experts or algorithmic planners without such knowledge. To improve both visual navigation and object interactio… ▽ More

    Submitted 12 March, 2024; v1 submitted 14 August, 2023; originally announced August 2023.

    Comments: ICCV 2023 (Project page: https://bhkim94.github.io/projects/CAPEAM)

  16. arXiv:2308.07234  [pdf, other

    cs.CV cs.RO

    UniWorld: Autonomous Driving Pre-training via World Models

    Authors: Chen Min, Dawei Zhao, Liang Xiao, Yiming Nie, Bin Dai

    Abstract: In this paper, we draw inspiration from Alberto Elfes' pioneering work in 1989, where he introduced the concept of the occupancy grid as World Models for robots. We imbue the robot with a spatial-temporal world model, termed UniWorld, to perceive its surroundings and predict the future behavior of other participants. UniWorld involves initially predicting 4D geometric occupancy as the World Models… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: 8 pages, 5 figures. arXiv admin note: substantial text overlap with arXiv:2305.18829

  17. arXiv:2307.10198  [pdf

    cs.AI

    Has China caught up to the US in AI research? An exploration of mimetic isomorphism as a model for late industrializers

    Authors: Chao Min, Yi Zhao, Yi Bu, Ying Ding, Caroline S. Wagner

    Abstract: Artificial Intelligence (AI), a cornerstone of 21st-century technology, has seen remarkable growth in China. In this paper, we examine China's AI development process, demonstrating that it is characterized by rapid learning and differentiation, surpassing the export-oriented growth propelled by Foreign Direct Investment seen in earlier Asian industrializers. Our data indicates that China current… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

  18. arXiv:2306.03553  [pdf, other

    cs.AI

    An Approach to Solving the Abstraction and Reasoning Corpus (ARC) Challenge

    Authors: Tan John Chong Min

    Abstract: We utilise the power of Large Language Models (LLMs), in particular GPT4, to be prompt engineered into performing an arbitrary task. Here, we give the model some human priors via text, along with some typical procedures for solving the ARC tasks, and ask it to generate the i) broad description of the input-output relation, ii) detailed steps of the input-output mapping, iii) use the detailed steps… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: 14 pages

  19. arXiv:2306.01988  [pdf, other

    cs.CV

    Lightweight Structure-aware Transformer Network for VHR Remote Sensing Image Change Detection

    Authors: Tao Lei, Yetong Xu, Hailong Ning, Zhiyong Lv, Chongdan Min, Yaochu Jin, Asoke K. Nandi

    Abstract: Popular Transformer networks have been successfully applied to remote sensing (RS) image change detection (CD) identifications and achieve better results than most convolutional neural networks (CNNs), but they still suffer from two main problems. First, the computational complexity of the Transformer grows quadratically with the increase of image spatial resolution, which is unfavorable to very h… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

  20. arXiv:2305.18829  [pdf, other

    cs.CV cs.MM cs.RO

    UniScene: Multi-Camera Unified Pre-training via 3D Scene Reconstruction for Autonomous Driving

    Authors: Chen Min, Liang Xiao, Dawei Zhao, Yiming Nie, Bin Dai

    Abstract: Multi-camera 3D perception has emerged as a prominent research field in autonomous driving, offering a viable and cost-effective alternative to LiDAR-based solutions. The existing multi-camera algorithms primarily rely on monocular 2D pre-training. However, the monocular 2D pre-training overlooks the spatial and temporal correlations among the multi-camera system. To address this limitation, we pr… ▽ More

    Submitted 27 April, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: Accepted by RAL2024

  21. arXiv:2305.04446  [pdf, other

    cs.CL cs.AI

    Facilitating Fine-grained Detection of Chinese Toxic Language: Hierarchical Taxonomy, Resources, and Benchmarks

    Authors: Junyu Lu, Bo Xu, Xiaokun Zhang, Changrong Min, Liang Yang, Hongfei Lin

    Abstract: The widespread dissemination of toxic online posts is increasingly damaging to society. However, research on detecting toxic language in Chinese has lagged significantly. Existing datasets lack fine-grained annotation of toxic types and expressions, and ignore the samples with indirect toxicity. In addition, it is crucial to introduce lexical knowledge to detect the toxicity of posts, which has be… ▽ More

    Submitted 7 May, 2023; originally announced May 2023.

    Comments: 13 pages, 4 figures. The paper has been accepted in ACL 2023

  22. arXiv:2305.03249  [pdf, other

    cs.GR cs.LG

    PMP: Learning to Physically Interact with Environments using Part-wise Motion Priors

    Authors: Jinseok Bae, Jungdam Won, Donggeun Lim, Cheol-Hui Min, Young Min Kim

    Abstract: We present a method to animate a character incorporating multiple part-wise motion priors (PMP). While previous works allow creating realistic articulated motions from reference data, the range of motion is largely limited by the available samples. Especially for the interaction-rich scenarios, it is impractical to attempt acquiring every possible interacting motion, as the combination of physical… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: 13 pages, 11 figures

  23. arXiv:2304.08878  [pdf, other

    cs.CV

    Deep Collective Knowledge Distillation

    Authors: Jihyeon Seo, Kyusam Oh, Chanho Min, Yongkeun Yun, Sungwoo Cho

    Abstract: Many existing studies on knowledge distillation have focused on methods in which a student model mimics a teacher model well. Simply imitating the teacher's knowledge, however, is not sufficient for the student to surpass that of the teacher. We explore a method to harness the knowledge of other students to complement the knowledge of the teacher. We propose deep collective knowledge distill… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

  24. arXiv:2304.03295  [pdf, other

    cs.SD cs.HC eess.AS

    Automatic Detection of Reactions to Music via Earable Sensing

    Authors: Euihyoek Lee, Chulhong Min, Jeaseung Lee, Jin Yu, Seungwoo Kang

    Abstract: We present GrooveMeter, a novel system that automatically detects vocal and motion reactions to music via earable sensing and supports music engagement-aware applications. To this end, we use smart earbuds as sensing devices, which are already widely used for music listening, and devise reaction detection techniques by leveraging an inertial measurement unit (IMU) and a microphone on earbuds. To e… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

  25. arXiv:2302.11195  [pdf

    cs.LG

    Prediction of single well production rate in water-flooding oil fields driven by the fusion of static, temporal and spatial information

    Authors: Chao Min, Yijia Wang, Huohai Yang, Wei Zhao

    Abstract: It is very difficult to forecast the production rate of oil wells as the output of a single well is sensitive to various uncertain factors, which implicitly or explicitly show the influence of the static, temporal and spatial properties on the oil well production. In this study, a novel machine learning model is constructed to fuse the static geological information, dynamic well production history… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

  26. arXiv:2212.10718  [pdf

    cs.LG math.NA stat.ME

    Interpretability and causal discovery of the machine learning models to predict the production of CBM wells after hydraulic fracturing

    Authors: Chao Min, Guoquan Wen, Liangjie Gou, Xiaogang Li, Zhaozhong Yang

    Abstract: Machine learning approaches are widely studied in the production prediction of CBM wells after hydraulic fracturing, but merely used in practice due to the low generalization ability and the lack of interpretability. A novel methodology is proposed in this article to discover the latent causality from observed data, which is aimed at finding an indirect way to interpret the machine learning result… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

  27. arXiv:2212.06144  [pdf, other

    cs.LG cs.AI

    Optimizing Learning Rate Schedules for Iterative Pruning of Deep Neural Networks

    Authors: Shiyu Liu, Rohan Ghosh, John Tan Chong Min, Mehul Motani

    Abstract: The importance of learning rate (LR) schedules on network pruning has been observed in a few recent works. As an example, Frankle and Carbin (2019) highlighted that winning tickets (i.e., accuracy preserving subnetworks) can not be found without applying a LR warmup schedule and Renda, Frankle and Carbin (2020) demonstrated that rewinding the LR to its initial state at the end of each pruning cycl… ▽ More

    Submitted 30 December, 2022; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: 23 Pages. arXiv admin note: text overlap with arXiv:2110.08764

  28. arXiv:2208.10145  [pdf, other

    cs.CV

    STS: Surround-view Temporal Stereo for Multi-view 3D Detection

    Authors: Zengran Wang, Chen Min, Zheng Ge, Yinhao Li, Zeming Li, Hongyu Yang, Di Huang

    Abstract: Learning accurate depth is essential to multi-view 3D object detection. Recent approaches mainly learn depth from monocular images, which confront inherent difficulties due to the ill-posed nature of monocular depth learning. Instead of using a sole monocular depth method, in this work, we propose a novel Surround-view Temporal Stereo (STS) technique that leverages the geometry correspondence betw… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

  29. DropNet: Reducing Neural Network Complexity via Iterative Pruning

    Authors: John Tan Chong Min, Mehul Motani

    Abstract: Modern deep neural networks require a significant amount of computing time and power to train and deploy, which limits their usage on edge devices. Inspired by the iterative weight pruning in the Lottery Ticket Hypothesis, we propose DropNet, an iterative pruning method which prunes nodes/filters to reduce network complexity. DropNet iteratively removes nodes/filters with the lowest average post-a… ▽ More

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: Published at ICML 2020. Code can be found at https://github.com/tanchongmin/DropNet

    Journal ref: Proceedings of the 37th International Conference on Machine Learning, PMLR 119:9356-9366, 2020 https://proceedings.mlr.press/v119/tan20a.html

  30. arXiv:2207.05991  [pdf, other

    cs.LG cs.AI

    Brick Tic-Tac-Toe: Exploring the Generalizability of AlphaZero to Novel Test Environments

    Authors: John Tan Chong Min, Mehul Motani

    Abstract: Traditional reinforcement learning (RL) environments typically are the same for both the training and testing phases. Hence, current RL methods are largely not generalizable to a test environment which is conceptually similar but different from what the method has been trained on, which we term the novel test environment. As an effort to push RL research towards algorithms which can generalize to… ▽ More

    Submitted 13 July, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: AlphaZero, Generalization

  31. arXiv:2206.09907  [pdf, other

    cs.CV

    ORFD: A Dataset and Benchmark for Off-Road Freespace Detection

    Authors: Chen Min, Weizhong Jiang, Dawei Zhao, Jiaolong Xu, Liang Xiao, Yiming Nie, Bin Dai

    Abstract: Freespace detection is an essential component of autonomous driving technology and plays an important role in trajectory planning. In the last decade, deep learning-based free space detection methods have been proved feasible. However, these efforts were focused on urban road environments and few deep learning-based methods were specifically designed for off-road free space detection due to the la… ▽ More

    Submitted 26 June, 2022; v1 submitted 20 June, 2022; originally announced June 2022.

    Comments: Accepted by ICRA2022

  32. arXiv:2206.09900  [pdf, other

    cs.CV

    Occupancy-MAE: Self-supervised Pre-training Large-scale LiDAR Point Clouds with Masked Occupancy Autoencoders

    Authors: Chen Min, Xinli Xu, Dawei Zhao, Liang Xiao, Yiming Nie, Bin Dai

    Abstract: Current perception models in autonomous driving heavily rely on large-scale labelled 3D data, which is both costly and time-consuming to annotate. This work proposes a solution to reduce the dependence on labelled 3D training data by leveraging pre-training on large-scale unlabeled outdoor LiDAR point clouds using masked autoencoders (MAE). While existing masked point autoencoding methods mainly f… ▽ More

    Submitted 9 October, 2023; v1 submitted 20 June, 2022; originally announced June 2022.

    Comments: Accepted by TIV

  33. arXiv:2205.08756  [pdf

    cs.DL

    Team formation and team performance: The balance between team freshness and repeat collaboration

    Authors: Meijun Liu, Ajay Jaiswal, Yi Bu, Chao Min, Sijie Yang, Zhibo Liu, Daniel Daniel Acuña, Ying Ding

    Abstract: Incorporating fresh members in teams is considered a pathway to team creativity. However, whether freshness improves team performance or not remains unclear, as well as the optimal involvement of fresh members for team performance. This study uses a group of authors on the byline of a publication as a proxy for a scientific team. We extend an indicator, i.e., team freshness, to measure the extent… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

  34. arXiv:2203.15121  [pdf, ps, other

    cs.CR

    Tightly Seal Your Sensitive Pointers with PACTight

    Authors: Mohannad Ismail, Andrew Quach, Christopher Jelesnianski, Yeongjin Jang, Changwoo Min

    Abstract: ARM is becoming more popular in desktops and data centers, opening a new realm in terms of security attacks against ARM. ARM has released Pointer Authentication, a new hardware security feature that is intended to ensure pointer integrity with cryptographic primitives. In this paper, we utilize Pointer Authentication (PA) to build a novel scheme to completely prevent any misuse of security-sensiti… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: Accepted for publication to USENIX Security 2022

  35. Enabling Volatile Caches for Energy Harvesting Systems

    Authors: Jianping Zeng, Jongouk Choi, Xinwei Fu, Ajay Paddayuru Shreepathi, Dongyoon Lee, Changwoo Min, Changhee Jung

    Abstract: Energy harvesting systems have shown their unique benefit of ultra-long operation time without maintenance and are expected to be more prevalent in the era of Internet of Things. However, due to the batteryless nature, they suffer unpredictable frequent power outages. They thus require a lightweight mechanism for crash consistency since saving/restoring checkpoints across the outages can limit for… ▽ More

    Submitted 18 February, 2022; originally announced February 2022.

    Comments: 13 pages and 19 figures

  36. arXiv:2202.08418  [pdf, other

    cs.CV cs.AI

    Neural Marionette: Unsupervised Learning of Motion Skeleton and Latent Dynamics from Volumetric Video

    Authors: Jinseok Bae, Hojun Jang, Cheol-Hui Min, Hyungun Choi, Young Min Kim

    Abstract: We present Neural Marionette, an unsupervised approach that discovers the skeletal structure from a dynamic sequence and learns to generate diverse motions that are consistent with the observed motion dynamics. Given a video stream of point cloud observation of an articulated body under arbitrary motion, our approach discovers the unknown low-dimensional skeletal relationship that can effectively… ▽ More

    Submitted 16 February, 2022; originally announced February 2022.

    Comments: 7 pages (main), 10 pages (appendix) and to be appeared in AAAI2022

  37. arXiv:2202.00758  [pdf, other

    cs.LG cs.AI cs.HC

    ColloSSL: Collaborative Self-Supervised Learning for Human Activity Recognition

    Authors: Yash Jain, Chi Ian Tang, Chulhong Min, Fahim Kawsar, Akhil Mathur

    Abstract: A major bottleneck in training robust Human-Activity Recognition models (HAR) is the need for large-scale labeled sensor datasets. Because labeling large amounts of sensor data is an expensive task, unsupervised and semi-supervised learning techniques have emerged that can learn good features from the data without requiring any labels. In this paper, we extend this line of research and present a n… ▽ More

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: Accepted to ACM IMWUT 2022

  38. arXiv:2109.03947  [pdf, other

    cs.LG

    SensiX++: Bringing MLOPs and Multi-tenant Model Serving to Sensory Edge Devices

    Authors: Chulhong Min, Akhil Mathur, Utku Gunay Acer, Alessandro Montanari, Fahim Kawsar

    Abstract: We present SensiX++ - a multi-tenant runtime for adaptive model execution with integrated MLOps on edge devices, e.g., a camera, a microphone, or IoT sensors. SensiX++ operates on two fundamental principles - highly modular componentisation to externalise data operations with clear abstractions and document-centric manifestation for system-wide orchestration. First, a data coordinator manages the… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

    Comments: 13 pages, 15 figures

  39. arXiv:2108.03824  [pdf, other

    cs.CV

    AA-RMVSNet: Adaptive Aggregation Recurrent Multi-view Stereo Network

    Authors: Zizhuang Wei, Qingtian Zhu, Chen Min, Yisong Chen, Guoping Wang

    Abstract: In this paper, we present a novel recurrent multi-view stereo network based on long short-term memory (LSTM) with adaptive aggregation, namely AA-RMVSNet. We firstly introduce an intra-view aggregation module to adaptively extract image features by using context-aware convolution and multi-scale aggregation, which efficiently improves the performance on challenging regions, such as thin objects an… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

  40. arXiv:2107.09176  [pdf

    cs.DL cs.CY

    Temporal search in the scientific space predicts breakthrough inventions

    Authors: Chao Min, Qing Ke

    Abstract: The development of inventions is theorized as a process of searching and recombining existing knowledge components. Previous studies under this theory have examined myriad characteristics of recombined knowledge and their performance implications. One feature that has received much attention is technological knowledge age. Yet, little is known about how the age of scientific knowledge influences t… ▽ More

    Submitted 19 July, 2021; originally announced July 2021.

  41. arXiv:2106.15328  [pdf, other

    cs.CV

    Deep Learning for Multi-View Stereo via Plane Sweep: A Survey

    Authors: Qingtian Zhu, Chen Min, Zizhuang Wei, Yisong Chen, Guoping Wang

    Abstract: 3D reconstruction has lately attracted increasing attention due to its wide application in many areas, such as autonomous driving, robotics and virtual reality. As a dominant technique in artificial intelligence, deep learning has been successfully adopted to solve various computer vision problems. However, deep learning for 3D reconstruction is still at its infancy due to its unique challenges an… ▽ More

    Submitted 29 July, 2021; v1 submitted 18 June, 2021; originally announced June 2021.

  42. arXiv:2105.03386  [pdf, other

    cs.DM cs.CG

    Topology and Routing Problems: The Circular Frame

    Authors: Rak-Kyeong Seong, Chanho Min, Sang-Hoon Han, Jaeho Yang, Seungwoo Nam, Kyusam Oh

    Abstract: In this work, we solve the problem of finding non-intersecting paths between points on a plane with a new approach by borrowing ideas from geometric topology, in particular, from the study of polygonal schema in mathematics. We use a topological transformation on the 2-dimensional planar routing environment that simplifies the routing problem into a problem of connecting points on a circle with st… ▽ More

    Submitted 7 May, 2021; originally announced May 2021.

    Comments: 15 pages, 10 figures

  43. arXiv:2104.04275  [pdf, other

    cs.CV cs.LG cs.RO

    GATSBI: Generative Agent-centric Spatio-temporal Object Interaction

    Authors: Cheol-Hui Min, Jinseok Bae, Junho Lee, Young Min Kim

    Abstract: We present GATSBI, a generative model that can transform a sequence of raw observations into a structured latent representation that fully captures the spatio-temporal context of the agent's actions. In vision-based decision-making scenarios, an agent faces complex high-dimensional observations where multiple entities interact with each other. The agent requires a good scene representation of the… ▽ More

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: accepted to CVPR'2021 as an oral presentation. Code and video will be released soon

  44. Attentional Graph Neural Network for Parking-slot Detection

    Authors: Chen Min, Jiaolong Xu, Liang Xiao, Dawei Zhao, Yiming Nie, Bin Dai

    Abstract: Deep learning has recently demonstrated its promising performance for vision-based parking-slot detection. However, very few existing methods explicitly take into account learning the link information of the marking-points, resulting in complex post-processing and erroneous detection. In this paper, we propose an attentional graph neural network based parking-slot detection method, which refers th… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

    Comments: Accepted by RAL

    Journal ref: IEEE Robotics and Automation Letters, vol.6, pp. 3445-3450, 2021

  45. Enhancing Application Performance by Memory Partitioning in Android Platforms

    Authors: Geunsik Lim, Changwoo Min, Young Ik Eom

    Abstract: This paper suggests a new memory partitioning scheme that can enhance process lifecycle, while avoiding Low Memory Killer and Out-of-Memory Killer operations on mobile devices. Our proposed scheme offers the complete concept of virtual memory nodes in operating systems of Android devices.

    Submitted 26 January, 2021; originally announced January 2021.

  46. arXiv:2101.09359  [pdf, other

    cs.DC

    Load-Balancing for Improving User Responsiveness on Multicore Embedded Systems

    Authors: Geunsik Lim, Changwoo Min, YoungIk Eom

    Abstract: Most commercial embedded devices have been deployed with a single processor architecture. The code size and complexity of applications running on embedded devices are rapidly increasing due to the emergence of application business models such as Google Play Store and Apple App Store. As a result, a high-performance multicore CPUs have become a major trend in the embedded market as well as in the p… ▽ More

    Submitted 20 January, 2021; originally announced January 2021.

  47. User-Aware Power Management for Mobile Devices

    Authors: Geunsik Lim, Changwoo Min, Dong Hyun Kang, Young Ik Eom

    Abstract: The power management techniques to extend battery lifespan is becoming increasingly important due to longer user applications' running time in mobile devices. Even when users do not use any applications, battery lifespan decreases continually. It occurs because of service daemons of mobile platform and network-based data synchronization operations. In this paper, we propose a new power management… ▽ More

    Submitted 21 January, 2021; originally announced January 2021.

  48. arXiv:2101.08877  [pdf

    cs.AR cs.OS cs.PF

    Virtual Memory Partitioning for Enhancing Application Performance in Mobile Platforms

    Authors: Geunsik Lim, Changwoo Min, Young Ik Eom

    Abstract: Recently, the amount of running software on smart mobile devices is gradually increasing due to the introduction of application stores. The application store is a type of digital distribution platform for application software, which is provided as a component of an operating system on a smartphone or tablet. Mobile devices have limited memory capacity and, unlike server and desktop systems, due to… ▽ More

    Submitted 21 January, 2021; originally announced January 2021.

  49. arXiv:2101.08577  [pdf

    cs.DL

    References of References: How Far is the Knowledge Ancestry

    Authors: Chao Min, Jiawei Xu, Tao Han, Yi Bu

    Abstract: Scientometrics studies have extended from direct citations to high-order citations, as simple citation count is found to tell only part of the story regarding scientific impact. This extension is deemed to be beneficial in scenarios like research evaluation, science history modeling, and information retrieval. In contrast to citations of citations (forward citation generations), references of refe… ▽ More

    Submitted 1 April, 2021; v1 submitted 21 January, 2021; originally announced January 2021.

  50. arXiv:2012.06086  [pdf, ps, other

    cs.SE

    WITCHER : Detecting Crash Consistency Bugs in Non-volatile Memory Programs

    Authors: Xinwei Fu, Wook-Hee Kim, Ajay Paddayuru Shreepathi, Mohannad Ismail, Sunny Wadkar, Changwoo Min, Dongyoon Lee

    Abstract: The advent of non-volatile main memory (NVM) enables the development of crash-consistent software without paying storage stack overhead. However, building a correct crash-consistent program remains very challenging in the presence of a volatile cache. This paper presents WITCHER, a crash consistency bug detector for NVM software, that is (1) scalable -- does not suffer from test space explosion, (… ▽ More

    Submitted 10 December, 2020; originally announced December 2020.