Showing 1–50 of 384 results for author: Kong, H

  1. arXiv:2409.14888  [pdf, other]

    cs.CV

    Advancing Video Quality Assessment for AIGC

    Authors: Xinli Yue, Jianhui Sun, Han Kong, Liangchao Yao, Tianyi Wang, Lei Li, Fengyun Rao, Jing Lv, Fan Xia, Yuetang Deng, Qian Wang, Lingchen Zhao

    Abstract: In recent years, AI generative models have made remarkable progress across various domains, including text generation, image generation, and video generation. However, assessing the quality of text-to-video generation is still in its infancy, and existing evaluation frameworks fall short when compared to those for natural videos. Current video quality assessment (VQA) methods primarily focus on ev…

    Submitted 23 September, 2024; originally announced September 2024.

    Comments: 5 pages, 1 figure

  2. arXiv:2409.08824  [pdf, other]

    cs.CV

    Pathfinder for Low-altitude Aircraft with Binary Neural Network

    Authors: Kaijie Yin, Tian Gao, Hui Kong

    Abstract: A prior global topological map (e.g., the OpenStreetMap, OSM) can boost the performance of autonomous mapping by a ground mobile robot. However, the prior map is usually incomplete due to lacking labeling in partial paths. To solve this problem, this paper proposes an OSM maker using airborne sensors carried by low-altitude aircraft, where the core of the OSM maker is a novel efficient pathfinder…

    Submitted 22 September, 2024; v1 submitted 13 September, 2024; originally announced September 2024.

  3. arXiv:2408.12527  [pdf, other]

    cs.RO cs.CV

    UMAD: University of Macau Anomaly Detection Benchmark Dataset

    Authors: Dong Li, Lineng Chen, Cheng-Zhong Xu, Hui Kong

    Abstract: Anomaly detection is critical in surveillance systems and patrol robots by identifying anomalous regions in images for early warning. Depending on whether reference data are utilized, anomaly detection can be categorized into anomaly detection with reference and anomaly detection without reference. Currently, anomaly detection without reference, which is closely related to out-of-distribution (OoD…

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: Accepted by the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2024, project code at https://github.com/IMRL/UMAD

  4. arXiv:2408.09504  [pdf]

    cs.RO

    Design and Experimental Study of Vacuum Suction Grabbing Technology to Grasp Fabric Piece

    Authors: Ray Wai Man Kong, Mingyi Liu, Theodore Ho Tin Kong

    Abstract: The primary objective of this study was to design the grabbing technique used to determine the vacuum suction gripper and its design parameters for the pocket welting operation in apparel manufacturing. It presents the application of vacuum suction in grabbing technology, a technique that has revolutionized the handling and manipulation to grasp the various fabric materials in a range of garment i…

    Submitted 18 August, 2024; originally announced August 2024.

    Comments: 9 Pages, 3 figures, 6 diagrams, 1 table

  5. arXiv:2407.17078  [pdf, other]

    cs.RO

    Active Loop Closure for OSM-guided Robotic Mapping in Large-Scale Urban Environments

    Authors: Wei Gao, Zezhou Sun, Mingle Zhao, Cheng-Zhong Xu, Hui Kong

    Abstract: The autonomous mapping of large-scale urban scenes presents significant challenges for autonomous robots. To mitigate the challenges, global planning, such as utilizing prior GPS trajectories from OpenStreetMap (OSM), is often used to guide the autonomous navigation of robots for mapping. However, due to factors like complex terrain, unexpected body movement, and sensor noise, the uncertainty of t…

    Submitted 24 July, 2024; originally announced July 2024.

  6. arXiv:2407.12867  [pdf, other]

    astro-ph.HE gr-qc

    Swift-BAT GUANO follow-up of gravitational-wave triggers in the third LIGO-Virgo-KAGRA observing run

    Authors: Gayathri Raman, Samuele Ronchini, James Delaunay, Aaron Tohuvavohu, Jamie A. Kennea, Tyler Parsotan, Elena Ambrosi, Maria Grazia Bernardini, Sergio Campana, Giancarlo Cusumano, Antonino D'Ai, Paolo D'Avanzo, Valerio D'Elia, Massimiliano De Pasquale, Simone Dichiara, Phil Evans, Dieter Hartmann, Paul Kuin, Andrea Melandri, Paul O'Brien, Julian P. Osborne, Kim Page, David M. Palmer, Boris Sbarufatti, Gianpiero Tagliaferri , et al. (1797 additional authors not shown)

    Abstract: We present results from a search for X-ray/gamma-ray counterparts of gravitational-wave (GW) candidates from the third observing run (O3) of the LIGO-Virgo-KAGRA (LVK) network using the Swift Burst Alert Telescope (Swift-BAT). The search includes 636 GW candidates received in low latency, 86 of which have been confirmed by the offline analysis and included in the third cumulative Gravitational-Wav…

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: 50 pages, 10 figures, 4 tables

  7. arXiv:2407.04519  [pdf, other]

    cs.CV

    Success or Failure? Analyzing Segmentation Refinement with Few-Shot Segmentation

    Authors: Seonghyeon Moon, Haein Kong, Muhammad Haris Khan

    Abstract: The purpose of segmentation refinement is to enhance the initial coarse masks generated by segmentation algorithms. The refined masks are expected to capture the details and contours of the target objects. Research on segmentation refinement has developed as a response to the need for high-quality initial masks. However, to our knowledge, no method has been developed that can determine the success…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 4 pages

  8. arXiv:2405.19813  [pdf, other]

    cs.RO

    SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization

    Authors: Jiang Wang, Yuanzheng He, Daobilige Su, Katsutoshi Itoyama, Kazuhiro Nakadai, Junfeng Wu, Shoudong Huang, Youfu Li, He Kong

    Abstract: Robot audition systems with multiple microphone arrays have many applications in practice. However, accurate calibration of multiple microphone arrays remains challenging because there are many unknown parameters to be identified, including the relative transforms (i.e., orientation, translation) and asynchronous factors (i.e., initial time offset and sampling clock difference) between microphone…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: This paper was accepted to and going to appear in the IEEE Transactions on Robotics

  9. arXiv:2405.16593  [pdf, other]

    astro-ph.CO

    The Construction of Large-scale Structure Catalogs for the Dark Energy Spectroscopic Instrument

    Authors: A. J. Ross, J. Aguilar, S. Ahlen, S. Alam, A. Anand, S. Bailey, D. Bianchi, S. Brieden, D. Brooks, E. Burtin, A. Carnero Rosell, E. Chaussidon, T. Claybaugh, S. Cole, K. Dawson, A. de la Macorra, A. de Mattia, Arjun Dey, Biprateep Dey, P. Doel, K. Fanning, S. Ferraro, J. Ereza, A. Font-Ribera, J. E. Forero-Romero , et al. (61 additional authors not shown)

    Abstract: We present the technical details on how large-scale structure (LSS) catalogs are constructed from redshifts measured from spectra observed by the Dark Energy Spectroscopic Instrument (DESI). The LSS catalogs provide the information needed to determine the relative number density of DESI tracers as a function of redshift and celestial coordinates and, e.g., determine clustering statistics. We produ…

    Submitted 18 July, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

    Comments: Accepted (by JCAP) version of supporting publication of DESI 2024II: Sample definitions, characteristics, and two-point clustering statistics

  10. arXiv:2405.16299  [pdf, other]

    astro-ph.CO

    Forward modeling fluctuations in the DESI LRGs target sample using image simulations

    Authors: Hui Kong, Ashley J. Ross, Klaus Honscheid, Dustin Lang, Anna Porredon, Arnaud de Mattia, Mehdi Rezaie, Rongpu Zhou, Edward Schlafly, John Moustakas, Alberto Rosado-Marin, Jessica Nicole Aguilar, Steven Ahlen, David Brooks, Edmond Chaussidon, Todd Claybaugh, Shaun Cole, Axel de la Macorra, Arjun Dey, Biprateep Dey, Peter Doel, Kevin Fanning, Jaime E. Forero-Romero, Enrique Gaztanaga, Satya Gontcho A Gontcho , et al. (28 additional authors not shown)

    Abstract: We use the forward modeling pipeline, Obiwan, to study the imaging systematics of the Luminous Red Galaxies (LRGs) targeted by the Dark Energy Spectroscopic Instrument (DESI). We update the Obiwan pipeline, which had previously been developed to simulate the optical images used to target DESI data, to further simulate WISE images in the infrared. This addition makes it possible to simulate the DES…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 46 pages, 26 figures

  11. arXiv:2405.02817  [pdf, other]

    cs.CL

    Labeling supervised fine-tuning data with the scaling law

    Authors: Huanjun Kong

    Abstract: This paper introduces a multi-stage manual annotation calibrated by the scaling law, offering a high-quality Supervised Fine-Tuning data acquisition method for environments with constrained resources like GPU poor, limited GPT access, and funding restrictions. We have preprocessed 58k authentic chat data and manually annotated 2.3k questions. After this, we conducted fine-tuning on Qwen models, ra…

    Submitted 16 August, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

    Comments: 5 pages, 3 tables, 3 figures

  12. arXiv:2405.02145  [pdf, other]

    cs.RO

    Characterized Diffusion and Spatial-Temporal Interaction Network for Trajectory Prediction in Autonomous Driving

    Authors: Haicheng Liao, Xuelin Li, Yongkang Li, Hanlin Kong, Chengyue Wang, Bonan Wang, Yanchen Guan, KaHou Tam, Zhenning Li, Chengzhong Xu

    Abstract: Trajectory prediction is a cornerstone in autonomous driving (AD), playing a critical role in enabling vehicles to navigate safely and efficiently in dynamic environments. To address this task, this paper presents a novel trajectory prediction model tailored for accuracy in the face of heterogeneous and uncertain traffic scenarios. At the heart of this model lies the Characterized Diffusion Module…

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: Accepted by IJCAI 2024

  13. arXiv:2404.19385  [pdf]

    cond-mat.mtrl-sci

    High-performance solid-state electrochemical thermal switches with earth-abundant cerium oxide

    Authors: Ahrong Jeong, Mitsuki Yoshimura, Hyeonjun Kong, Zhiping Bian, Jason Tam, Bin Feng, Yuichi Ikuhara, Takashi Endo, Yasutaka Matsuo, Hiromichi Ohta

    Abstract: Thermal switches, which electrically turn heat flow on and off, have attracted attention as thermal management devices. Electrochemical reduction/oxidation switches the thermal conductivity ($\kappa$) of active metal oxide films. The performance of the previously proposed electrochemical thermal switches is low; on/off $\kappa$-ratio is mostly less than 5 and $\kappa$-switching width is less th…

    Submitted 22 August, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: 19 pages, 7 figures with supporting information (12 pages, 11 figures, 1 table)

  14. arXiv:2404.17520  [pdf, other]

    cs.RO

    A Cognitive-Driven Trajectory Prediction Model for Autonomous Driving in Mixed Autonomy Environment

    Authors: Haicheng Liao, Zhenning Li, Chengyue Wang, Bonan Wang, Hanlin Kong, Yanchen Guan, Guofa Li, Zhiyong Cui, Chengzhong Xu

    Abstract: As autonomous driving technology progresses, the need for precise trajectory prediction models becomes paramount. This paper introduces an innovative model that infuses cognitive insights into trajectory prediction, focusing on perceived safety and dynamic decision-making. Distinct from traditional approaches, our model excels in analyzing interactions and behavior patterns in mixed autonomy traff…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: Accepted by IJCAI 2024

  15. arXiv:2404.11313  [pdf, other]

    eess.IV cs.AI

    NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results

    Authors: Xin Li, Kun Yuan, Yajing Pei, Yiting Lu, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Wei Sun, Haoning Wu, Zicheng Zhang, Jun Jia, Zhichao Zhang, Linhan Cao, Qiubo Chen, Xiongkuo Min, Weisi Lin, Guangtao Zhai, Jianhui Sun, Tianyi Wang, Lei Li, Han Kong, Wenxuan Wang, Bing Li, Cheng Luo , et al. (43 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 Challenge on Shortform UGC Video Quality Assessment (S-UGC VQA), where various excellent solutions are submitted and evaluated on the collected dataset KVQ from popular short-form video platform, i.e., Kuaishou/Kwai Platform. The KVQ database is divided into three parts, including 2926 videos for training, 420 videos for validation, and 854 videos for testing. The…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR2024 Workshop. The challenge report for CVPR NTIRE2024 Short-form UGC Video Quality Assessment Challenge

  16. arXiv:2404.05364  [pdf, other]

    astro-ph.HE gr-qc

    Autoregressive Search of Gravitational Waves: Denoising

    Authors: Sangin Kim, C. Y. Hui, Jianqi Yan, Alex P. Leung, Kwangmin Oh, A. K. H. Kong, L. C. -C. Lin, Kwan-Lok Li

    Abstract: Because of the small strain amplitudes of gravitational-wave (GW) signals, unveiling them in the presence of detector/environmental noise is challenging. For visualizing the signals and extracting its waveform for a comparison with theoretical prediction, a frequency-domain whitening process is commonly adopted for filtering the data. In this work, we propose an alternative template-free framework…

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Phys. Rev. D in press, 16 pages, 11 figures, 1 table

  17. arXiv:2404.04248  [pdf, other]

    astro-ph.HE gr-qc

    Observation of Gravitational Waves from the Coalescence of a $2.5\text{-}4.5~M_\odot$ Compact Object and a Neutron Star

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, I. Abouelfettouh, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, D. Agarwal, M. Agathos, M. Aghaei Abchouyeh, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, S. Akçay, T. Akutsu, S. Albanesi, R. A. Alfaidi, A. Al-Jodah , et al. (1771 additional authors not shown)

    Abstract: We report the observation of a coalescing compact binary with component masses $2.5\text{-}4.5~M_\odot$ and $1.2\text{-}2.0~M_\odot$ (all measurements quoted at the 90% credible level). The gravitational-wave signal GW230529_181500 was observed during the fourth observing run of the LIGO-Virgo-KAGRA detector network on 2023 May 29 by the LIGO Livingston Observatory. The primary component of the so…

    Submitted 26 July, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

    Comments: 45 pages (10 pages author list, 13 pages main text, 1 page acknowledgements, 13 pages appendices, 8 pages bibliography), 17 figures, 16 tables. Update to match version published in The Astrophysical Journal Letters. Data products available from https://zenodo.org/records/10845779

    Report number: LIGO-P2300352

    Journal ref: ApJL 970, L34 (2024)

  18. arXiv:2404.02405  [pdf, other]

    cs.CV

    TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate Expression

    Authors: Ho-Joong Kim, Jung-Ho Hong, Heejo Kong, Seong-Whan Lee

    Abstract: In this paper, we investigate that the normalized coordinate expression is a key factor as reliance on hand-crafted components in query-based detectors for temporal action detection (TAD). Despite significant advancements towards an end-to-end framework in object detection, query-based detectors have been limited in achieving full end-to-end modeling in TAD. To address this issue, we propose \mode…

    Submitted 3 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  19. arXiv:2403.15026  [pdf, other]

    cs.CV

    VRSO: Visual-Centric Reconstruction for Static Object Annotation

    Authors: Chenyao Yu, Yingfeng Cai, Jiaxin Zhang, Hui Kong, Wei Sui, Cong Yang

    Abstract: As a part of the perception results of intelligent driving systems, static object detection (SOD) in 3D space provides crucial cues for driving environment understanding. With the rapid deployment of deep neural networks for SOD tasks, the demand for high-quality training samples soars. The traditional, also reliable, way is manual labelling over the dense LiDAR point clouds and reference images.…

    Submitted 29 August, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

    Comments: Accepted at 2024 IEEE International Conference on Intelligent Robots and Systems (IROS)

  20. arXiv:2403.05791  [pdf, other]

    eess.AS cs.SD

    Asynchronous Microphone Array Calibration using Hybrid TDOA Information

    Authors: Chengjie Zhang, Jiang Wang, He Kong

    Abstract: Asynchronous microphone array calibration is a prerequisite for most audition robot applications. A popular solution to the above calibration problem is the batch form of Simultaneous Localisation and Mapping (SLAM), using the time difference of arrival measurements between two microphones (TDOA-M), and the robot (which serves as a moving sound source during calibration) odometry information. In t…

    Submitted 19 March, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  21. arXiv:2403.04350  [pdf, other]

    gr-qc astro-ph.CO astro-ph.IM

    Extract non-Gaussian Features in Gravitational Wave Observation Data Using Self-Supervised Learning

    Authors: Yu-Chiung Lin, Albert K. H. Kong

    Abstract: We propose a self-supervised learning model to denoise gravitational wave (GW) signals in the time series strain data without relying on waveform information. Denoising GW data is a crucial intermediate process for machine-learning-based data analysis techniques, as it can simplify the model for downstream tasks such as detections and parameter estimations. We use the blind-spot neural network and…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 39 pages, 15 figures in the main article, and 43 figures in the appendix

  22. arXiv:2403.03004  [pdf, other]

    astro-ph.CO gr-qc hep-ph

    Ultralight vector dark matter search using data from the KAGRA O3GK run

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi , et al. (1778 additional authors not shown)

    Abstract: Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 20 pages, 5 figures

    Report number: LIGO-P2300250

  23. arXiv:2402.07945  [pdf, other]

    cs.HC cs.AI cs.CV

    ScreenAgent: A Vision Language Model-driven Computer Control Agent

    Authors: Runliang Niu, Jindong Li, Shiqi Wang, Yali Fu, Xiyu Hu, Xueyuan Leng, He Kong, Yi Chang, Qi Wang

    Abstract: Existing Large Language Models (LLM) can invoke a variety of tools and APIs to complete complex tasks. The computer, as the most powerful and universal tool, could potentially be controlled directly by a trained LLM agent. Powered by the computer, we can hopefully build a more generalized agent to assist humans in various daily digital works. In this paper, we construct an environment for a Vision…

    Submitted 8 February, 2024; originally announced February 2024.

  24. arXiv:2402.03649  [pdf, ps, other]

    math.AT math.CT

    Group completions and the homotopical monadicity theorem

    Authors: Hana Jia Kong, J. Peter May, Foling Zou

    Abstract: We abstract and generalize homotopical monadicity statements, placing in a single conceptual framework a range of old and recent recognition and characterization principles in iterated loop space theory in classical, equivariant, and multiplicative frameworks. Some of the examples are new and some are old, but all are illuminated by the coherent framework, which we feel certain will encompass exam…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 88 pages

    MSC Class: 55P48; 55P91; 18M60; 18C15

  25. arXiv:2402.00330  [pdf, other]

    cs.RO

    Night-Rider: Nocturnal Vision-aided Localization in Streetlight Maps Using Invariant Extended Kalman Filtering

    Authors: Tianxiao Gao, Mingle Zhao, Chengzhong Xu, Hui Kong

    Abstract: Vision-aided localization for low-cost mobile robots in diverse environments has attracted widespread attention recently. Although many current systems are applicable in daytime environments, nocturnal visual localization is still an open problem owing to the lack of stable visual information. An insight from most nocturnal scenes is that the static and bright streetlights are reliable visual info…

    Submitted 3 March, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

  26. AscDAMs: Advanced SLAM-based channel detection and mapping system

    Authors: Tengfei Wang, Fucheng Lu, Jintao Qin, Taosheng Huang, Hui Kong, Ping Shen

    Abstract: Obtaining high-resolution, accurate channel topography and deposit conditions is the prior challenge for the study of channelized debris flow. Currently, wide-used mapping technologies including satellite imaging and drone photogrammetry struggle to precisely observe channel interior conditions of mountainous long-deep gullies, particularly those in the Wenchuan Earthquake region. SLAM is an emerg…

    Submitted 24 January, 2024; originally announced January 2024.

  27. arXiv:2401.08772  [pdf, other]

    cs.CL

    HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance

    Authors: Huanjun Kong, Songyang Zhang, Jiaying Li, Min Xiao, Jun Xu, Kai Chen

    Abstract: In this work, we present HuixiangDou, a technical assistant powered by Large Language Models (LLM). This system is designed to assist algorithm developers by providing insightful responses to questions related to open-source algorithm projects, such as computer vision and deep learning projects from OpenMMLab. We further explore the integration of this assistant into the group chats of instant mes…

    Submitted 12 April, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: 13 pages, 4 figures

  28. arXiv:2401.04872  [pdf, other]

    cs.CV cs.LG cs.RO

    Knowledge-aware Graph Transformer for Pedestrian Trajectory Prediction

    Authors: Yu Liu, Yuexin Zhang, Kunming Li, Yongliang Qiao, Stewart Worrall, You-Fu Li, He Kong

    Abstract: Predicting pedestrian motion trajectories is crucial for path planning and motion control of autonomous vehicles. Accurately forecasting crowd trajectories is challenging due to the uncertain nature of human motions in different environments. For training, recent deep learning-based prediction approaches mainly utilize information like trajectory history and interactions between pedestrians, among…

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: This paper was accepted to and presented at the 26th IEEE International Conference on Intelligent Transportation Systems (ITSC), September 2023

  29. arXiv:2401.00496  [pdf, other]

    cs.CV cs.AI cs.LG

    SAR-RARP50: Segmentation of surgical instrumentation and Action Recognition on Robot-Assisted Radical Prostatectomy Challenge

    Authors: Dimitrios Psychogyios, Emanuele Colleoni, Beatrice Van Amsterdam, Chih-Yang Li, Shu-Yu Huang, Yuchong Li, Fucang Jia, Baosheng Zou, Guotai Wang, Yang Liu, Maxence Boels, Jiayu Huo, Rachel Sparks, Prokar Dasgupta, Alejandro Granados, Sebastien Ourselin, Mengya Xu, An Wang, Yanan Wu, Long Bai, Hongliang Ren, Atsushi Yamada, Yuriko Harai, Yuto Ishikawa, Kazuyuki Hayashi , et al. (25 additional authors not shown)

    Abstract: Surgical tool segmentation and action recognition are fundamental building blocks in many computer-assisted intervention applications, ranging from surgical skills assessment to decision support systems. Nowadays, learning-based action recognition and segmentation approaches outperform classical methods, relying, however, on large, annotated datasets. Furthermore, action recognition and tool segme…

    Submitted 23 January, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

  30. arXiv:2312.08746  [pdf, other]

    cs.CV

    DreamDrone: Text-to-Image Diffusion Models are Zero-shot Perpetual View Generators

    Authors: Hanyang Kong, Dongze Lian, Michael Bi Mi, Xinchao Wang

    Abstract: We introduce DreamDrone, a novel zero-shot and training-free pipeline for generating unbounded flythrough scenes from textual prompts. Different from other methods that focus on warping images frame by frame, we advocate explicitly warping the intermediate latent code of the pre-trained text-to-image diffusion model for high-quality image generation and generalization ability. To further enhance t…

    Submitted 24 September, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 16 pages, 12 figures, project page: https://hyokong.github.io/dreamdrone-page/

  31. arXiv:2310.17750  [pdf, other]

    cs.CL

    A Framework for Automated Measurement of Responsible AI Harms in Generative AI Applications

    Authors: Ahmed Magooda, Alec Helyar, Kyle Jackson, David Sullivan, Chad Atalla, Emily Sheng, Dan Vann, Richard Edgar, Hamid Palangi, Roman Lutz, Hongliang Kong, Vincent Yun, Eslam Kamal, Federico Zarfati, Hanna Wallach, Sarah Bird, Mei Chen

    Abstract: We present a framework for the automated measurement of responsible AI (RAI) metrics for large language models (LLMs) and associated products and services. Our framework for automatically measuring harms from LLMs builds on existing technical and sociotechnical expertise and leverages the capabilities of state-of-the-art LLMs, such as GPT-4. We use this framework to run through several case studie…

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: This is a living document

  32. arXiv:2310.17450  [pdf, other]

    astro-ph.HE

    Rapid Generation of Kilonova Light Curves Using Conditional Variational Autoencoder

    Authors: Surojit Saha, Michael J. Williams, Laurence Datrier, Fergus Hayes, Matt Nicholl, Albert K. H. Kong, Martin Hendry, IK Siong Heng, Gavin P. Lamb, En-Tzu Lin, Daniel Williams

    Abstract: The discovery of the optical counterpart, along with the gravitational waves from GW170817, of the first binary neutron star merger, opened up a new era for multi-messenger astrophysics. Combining the GW data with the optical counterpart, also known as AT2017gfo, classified as a kilonova, has revealed the nature of compact binary merging systems by extracting enriched information about the total b…

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 19 pages, 7 figures (3 additional figures in appendix), accepted to ApJ

  33. arXiv:2310.10511  [pdf]

    physics.plasm-ph

    A linear parameters study of ion cyclotron emission using drift ring beam distribution

    Authors: Haozhe Kong, Huasheng Xie, Jizhong Sun

    Abstract: Ion cyclotron emission (ICE) holds great potential as a diagnostic tool for fast ions in fusion devices. The theory of magnetoacoustic cyclotron instability (MCI), as an emission mechanism for ICE, states that MCI is driven by a velocity distribution of fast ions that approximates a drift ring beam. The influence of key parameters on the linear MCI is systematically investigated using the linear k…

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 14 pages, 21 figures

    Journal ref: Nucl. Fusion 63 (2023) 126034

  34. arXiv:2308.14480  [pdf, other]

    cs.CV cs.MM

    Priority-Centric Human Motion Generation in Discrete Latent Space

    Authors: Hanyang Kong, Kehong Gong, Dongze Lian, Michael Bi Mi, Xinchao Wang

    Abstract: Text-to-motion generation is a formidable task, aiming to produce human motions that align with the input text while also adhering to human capabilities and physical laws. While there have been advancements in diffusion models, their application in discrete spaces remains underexplored. Current methods often overlook the varying significance of different motions, treating them uniformly. It is ess…

    Submitted 30 August, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: Accepted by ICCV2023

  35. arXiv:2308.13666  [pdf, other]

    astro-ph.HE

    A Joint Fermi-GBM and Swift-BAT Analysis of Gravitational-Wave Candidates from the Third Gravitational-wave Observing Run

    Authors: C. Fletcher, J. Wood, R. Hamburg, P. Veres, C. M. Hui, E. Bissaldi, M. S. Briggs, E. Burns, W. H. Cleveland, M. M. Giles, A. Goldstein, B. A. Hristov, D. Kocevski, S. Lesage, B. Mailyan, C. Malacaria, S. Poolakkil, A. von Kienlin, C. A. Wilson-Hodge, The Fermi Gamma-ray Burst Monitor Team, M. Crnogorčević, J. DeLaunay, A. Tohuvavohu, R. Caputo, S. B. Cenko , et al. (1674 additional authors not shown)

    Abstract: We present Fermi Gamma-ray Burst Monitor (Fermi-GBM) and Swift Burst Alert Telescope (Swift-BAT) searches for gamma-ray/X-ray counterparts to gravitational wave (GW) candidate events identified during the third observing run of the Advanced LIGO and Advanced Virgo detectors. Using Fermi-GBM on-board triggers and sub-threshold gamma-ray burst (GRB) candidates found in the Fermi-GBM ground analyses,…

    Submitted 25 August, 2023; originally announced August 2023.

  36. arXiv:2308.06287  [pdf, other]

    astro-ph.HE

    Chandra Observation of NGC 1559: Eight Ultraluminous X-ray Sources Including a Compact Binary Candidate

    Authors: Chen-Hsun Ma, Kwan-Lok Li, You-Hua Chu, Albert K. H. Kong

    Abstract: Despite the 30-year history of ultra-luminous X-ray sources (ULXs) studies, issues like the majority of their physical natures (i.e., neutron stars, stellar-mass black holes, or intermediate black holes) as well as the accretion mechanisms are still under debate. Expanding the ULX sample size in the literature is clearly a way to help. To this end, we investigated the X-ray source population, ULXs…

    Submitted 10 August, 2023; originally announced August 2023.

    Comments: Accepted for publication in ApJ

  37. Influences of dynamical disruptions on the evolution of pulsars in globular clusters

    Authors: Kwangmin Oh, C. Y. Hui, Jongsuk Hong, J. Takata, A. K. H. Kong, Pak-Hin Thomas Tam, Kwan-Lok Li, K. S. Cheng

    Abstract: By comparing the physical properties of pulsars hosted by core-collapsed (CCed) and non-core-collapsed (Non-CCed) globular clusters (GCs), we find that pulsars in CCed GCs rotate significantly slower than their counterparts in Non-CCed GCs. Additionally, radio luminosities at 1.4 GHz in CCed GCs are higher. These findings are consistent with the scenario that dynamical interactions in GCs can inte…

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: 9 pages, 8 figures, 3 tables, Accepted in MNRAS

  38. arXiv:2308.04793  [pdf, other]

    astro-ph.HE astro-ph.GA

    Cosmic ray calorimetry in star-forming galaxy populations and implications for their contribution to the extra-galactic $γ$-ray background

    Authors: Ellis R. Owen, Albert K. H. Kong, Kuo-Chuan Pan

    Abstract: Star-forming galaxies (SFGs) have been established as an important source population in the extra-galactic $γ$-ray background (EGB). Their intensive star-formation creates an abundance of environments able to accelerate particles, and these build-up a rich sea of cosmic rays (CRs). Above GeV energies, CR protons can undergo hadronic interactions with their environment to produce $γ$-rays. SFGs can…

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: 8 pages, 4 figures, 1 table. Presented at the 38th International Cosmic Ray Conference (ICRC2023)

    Journal ref: PoS (ICRC2023), 554

  39. arXiv:2308.03822  [pdf, other]

    astro-ph.HE

    Search for Eccentric Black Hole Coalescences during the Third Observing Run of LIGO and Virgo

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, et al. (1750 additional authors not shown)

    Abstract: Despite the growing number of confident binary black hole coalescences observed through gravitational waves so far, the astrophysical origin of these binaries remains uncertain. Orbital eccentricity is one of the clearest tracers of binary formation channels. Identifying binary eccentricity, however, remains challenging due to the limited availability of gravitational waveforms that include effect…

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: 24 pages, 5 figures

    Report number: LIGO-P2300080

  40. arXiv:2308.01873  [pdf, other]

    math.AT

    A deformation of Borel equivariant homotopy

    Authors: Gabriel Angelini-Knoll, Mark Behrens, Eva Belmont, Hana Jia Kong

    Abstract: We describe a deformation of the $\infty$-category of Borel $G$-spectra for a finite group $G$. This provides a new presentation of the $a$-complete real Artin--Tate motivic stable homotopy category when $G=C_2$ and gives a new interpretation of the $a$-completed $C_2$-effective slice spectral sequence. As a new computational tool, we present a modified Adams--Novikov spectral sequence which compu…

    Submitted 27 September, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: updated introduction and references; 49 pages, comments welcome!

    MSC Class: 55P91; 14F42; 55P42; 55T15

  41. arXiv:2308.00867  [pdf, ps, other]

    astro-ph.SR

    Evidence of stellar oscillations in the post-common envelop binary candidate ASASSN-V J205543.90+240033.5

    Authors: J. Takata, A. K. H. Kong, X. F. Wang, F. F. Song, J. Mao, X. Hou, C. -P. Hu, L. C. -C. Lin, K. L. Li, C. Y. Hui

    Abstract: ASASSN-V J205543.90+240033.5 (ASJ2055) is a possible post-common envelope binary system. Its optical photometric data show an orbital variation of about $0.52$ days and a fast period modulation of $P_0\sim 9.77$ minutes, whose origin is unknown. In this Letter, we report evidence of stellar oscillation of the companion star as the origin of the fast period modulation. We analyze the phot…

    Submitted 3 August, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

    Comments: 9 pages, 5 figures, 2 tables. Accepted for publication in ApJ Letter

  42. arXiv:2307.01753  [pdf, other]

    astro-ph.CO cs.LG physics.comp-ph physics.data-an

    Local primordial non-Gaussianity from the large-scale clustering of photometric DESI luminous red galaxies

    Authors: Mehdi Rezaie, Ashley J. Ross, Hee-Jong Seo, Hui Kong, Anna Porredon, Lado Samushia, Edmond Chaussidon, Alex Krolewski, Arnaud de Mattia, Florian Beutler, Jessica Nicole Aguilar, Steven Ahlen, Shadab Alam, Santiago Avila, Benedict Bahr-Kalus, Jose Bermejo-Climent, David Brooks, Todd Claybaugh, Shaun Cole, Kyle Dawson, Axel de la Macorra, Peter Doel, Andreu Font-Ribera, Jaime E. Forero-Romero, Satya Gontcho A Gontcho, et al. (24 additional authors not shown)

    Abstract: We use angular clustering of luminous red galaxies from the Dark Energy Spectroscopic Instrument (DESI) imaging surveys to constrain the local primordial non-Gaussianity parameter $f_{\rm NL}$. Our sample comprises over 12 million targets, covering 14,000 square degrees of the sky, with redshifts in the range $0.2< z < 1.35$. We identify Galactic extinction, survey depth, and astronomical seeing as the…

    Submitted 25 June, 2024; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: 21 pages, 17 figures, 7 tables (Appendix excluded). Published in MNRAS

  43. arXiv:2306.14222  [pdf, other]

    cs.CL cs.AI q-fin.ST

    Unveiling the Potential of Sentiment: Can Large Language Models Predict Chinese Stock Price Movements?

    Authors: Haohan Zhang, Fengrui Hua, Chengjin Xu, Hao Kong, Ruiting Zuo, Jian Guo

    Abstract: The rapid advancement of Large Language Models (LLMs) has spurred discussions about their potential to enhance quantitative trading strategies. LLMs excel in analyzing sentiments about listed companies from financial news, providing critical insights for trading decisions. However, the performance of LLMs in this task varies substantially due to their inherent characteristics. This paper introduce…

    Submitted 4 May, 2024; v1 submitted 25 June, 2023; originally announced June 2023.

  44. arXiv:2305.14038  [pdf, other]

    cs.CV cs.RO

    Why semantics matters: A deep study on semantic particle-filtering localization in a LiDAR semantic pole-map

    Authors: Yuming Huang, Yi Gu, Chengzhong Xu, Hui Kong

    Abstract: In most urban and suburban areas, pole-like structures such as tree trunks or utility poles are ubiquitous. These structural landmarks are very useful for the localization of autonomous vehicles given their geometrical locations in maps and measurements from sensors. In this work, we aim at creating an accurate map for autonomous vehicles or robots with pole-like structures as the dominant localiz…

    Submitted 23 May, 2023; originally announced May 2023.

  45. arXiv:2305.07931  [pdf, other]

    cs.CV

    GSB: Group Superposition Binarization for Vision Transformer with Limited Training Samples

    Authors: Tian Gao, Cheng-Zhong Xu, Le Zhang, Hui Kong

    Abstract: Vision Transformer (ViT) has performed remarkably in various computer vision tasks. Nonetheless, affected by the massive amount of parameters, ViT usually suffers from serious overfitting problems with a relatively limited number of training samples. In addition, ViT generally demands heavy computing resources, which limit its deployment on resource-constrained devices. As a type of model-compress…

    Submitted 18 January, 2024; v1 submitted 13 May, 2023; originally announced May 2023.

    Comments: Accepted by Neural Networks

  46. arXiv:2305.06086  [pdf]

    physics.plasm-ph

    Enhancement of Fusion Reactivity under Non-Maxwellian Distributions: Effects of Drift-Ring-Beam, Slowing-Down, and Kappa Super-Thermal Distributions

    Authors: Haozhe Kong, Huasheng Xie, Bing Liu, Muzhi Tan, Di Luo, Zhi Li, Jizhong Sun

    Abstract: Non-Maxwellian distributions of particles are commonly observed in fusion studies, especially for magnetic confinement fusion plasmas. The particle distribution has a direct effect on fusion reactivity, which is the focus of this study. We investigate the effects of three types of non-Maxwellian distributions, namely drift-ring-beam, slowing-down, and kappa super-thermal distributions, on the fusi…

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: 12 pages, 18 figures

    Journal ref: Plasma Phys. Control. Fusion 66 (2024) 015009

  47. arXiv:2304.08393  [pdf, other]

    gr-qc astro-ph.CO astro-ph.HE

    Search for gravitational-lensing signatures in the full third observing run of the LIGO-Virgo network

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, R. Abbott, H. Abe, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, C. Alléné, A. Allocca, P. A. Altin, et al. (1670 additional authors not shown)

    Abstract: Gravitational lensing by massive objects along the line of sight to the source causes distortions of gravitational wave-signals; such distortions may reveal information about fundamental physics, cosmology and astrophysics. In this work, we have extended the search for lensing signatures to all binary black hole events from the third observing run of the LIGO--Virgo network. We search for repeated…

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: 28 pages, 11 figures

    Report number: LIGO-P2200031

  48. A journey from the hard to the soft state: How do QPOs evolve in the 2021 outburst of GX 339-4?

    Authors: H. Stiele, A. K. H. Kong

    Abstract: We investigated the snapshots of five NICER observations of the black hole transient GX 339-4 when the source transited from the hard state into the soft state during its outburst in 2021. In this paper, we focused our study on the evolution of quasi-periodic oscillations (QPOs) and noise components using power-density spectra. In addition, we derived hardness ratios comparing count rates above an…

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: 6 pages, 5 figures, supplementary online material as appendices (13 pages), accepted for publication in MNRAS

  49. arXiv:2303.15937  [pdf, other]

    cs.CV

    PosterLayout: A New Benchmark and Approach for Content-aware Visual-Textual Presentation Layout

    Authors: HsiaoYuan Hsu, Xiangteng He, Yuxin Peng, Hao Kong, Qing Zhang

    Abstract: Content-aware visual-textual presentation layout aims at arranging spatial space on the given canvas for pre-defined elements, including text, logo, and underlay, which is a key to automatic template-free creative graphic design. In practical applications, e.g., poster designs, the canvas is originally non-empty, and both inter-element relationships as well as inter-layer relationships should be c…

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR 2023. Dataset and code are available at https://github.com/PKU-ICST-MIPL/PosterLayout-CVPR2023

  50. arXiv:2302.09123  [pdf, other]

    math.AT

    The $\mathbb C$-motivic Adams-Novikov spectral sequence for topological modular forms

    Authors: Daniel C. Isaksen, Hana Jia Kong, Guchuan Li, Yangyang Ruan, Heyi Zhu

    Abstract: We analyze the $\mathbb{C}$-motivic (and classical) Adams-Novikov spectral sequence for the $\mathbb{C}$-motivic modular forms spectrum $\mathit{mmf}$ (and for the classical topological modular forms spectrum $\mathit{tmf}$). We primarily use purely algebraic techniques, with a few exceptions. Along the way, we settle a previously unresolved detail about the multiplicative structure of the homotop…

    Submitted 17 February, 2023; originally announced February 2023.

    Comments: 38 pages, 5 figures. Comments welcome!

    Report number: HIM-Spectral-2022

    MSC Class: 14F42; 55T15; 55Q10