-
Commissioning results from the Robo-AO-2 facility for rapid visible and near-infrared AO imaging
Authors:
Christoph Baranec,
James Ou,
Reed Riddle,
Ruihan Zhang,
Luke Mckay,
Rachel Rampy,
Morgan Bonnet,
Iven Hamilton,
Greg Ching,
Jessica Young,
Maïssa Salama,
Paul Barnes,
Shane Jacobson,
Peter Onaka,
Mark Chun,
Zachary Werber,
Keith Powell,
Marcos A. van Dam,
Benjamin Shappee
Abstract:
We installed the next-generation automated laser adaptive optics system, Robo-AO-2, on the University of Hawaii 2.2-m telescope on Maunakea in 2023. We engineered Robo-AO-2 to deliver robotic, diffraction-limited observations at visible and near-infrared wavelengths in unprecedented numbers. This new instrument takes advantage of upgraded components, manufacturing techniques, and control, and includes a parallel reconfigurable natural guide star wavefront sensor with which to explore hybrid wavefront sensing techniques. We present the results of commissioning in 2023 and 2024.
Submitted 30 June, 2024;
originally announced July 2024.
-
Compensate Quantization Errors: Make Weights Hierarchical to Compensate Each Other
Authors:
Yifei Gao,
Jie Ou,
Lei Wang,
Yuting Xiao,
Zhiyuan Xiang,
Ruiting Dai,
Jun Cheng
Abstract:
Emergent Large Language Models (LLMs) distinguish themselves from traditional language models through their extraordinary performance and powerful deduction capacity. However, the computational and storage costs of these LLMs are enormous, so quantization has become a trending topic. To address the accuracy decay caused by quantization, two streams of work in post-training quantization stand out. One uses other weights to compensate for existing quantization error, while the other transfers the quantization difficulty to other parts of the model. Combining the merits of both, we introduce Learnable Singular value Increment (LSI) as an advanced solution. LSI uses Singular Value Decomposition to extract the singular values of the weights and makes them learnable, helping weights compensate for each other conditioned on activation. Incorporating LSI with existing techniques, we achieve state-of-the-art performance in diverse quantization settings, whether in weight-only, weight-activation, or extremely low-bit scenarios. By unleashing the potential of LSI, efficient finetuning on quantized models is no longer a prohibitive problem.
Submitted 23 June, 2024;
originally announced June 2024.
-
Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data
Authors:
Jingyang Ou,
Shen Nie,
Kaiwen Xue,
Fengqi Zhu,
Jiacheng Sun,
Zhenguo Li,
Chongxuan Li
Abstract:
Discrete diffusion models with absorbing processes have shown promise in language modeling. The key quantities to be estimated are the ratios between the marginal probabilities of two transitive states at all timesteps, called the concrete score. In this paper, we reveal that the concrete score in absorbing diffusion can be expressed as conditional probabilities of clean data, multiplied by a time-dependent scalar in an analytic form. Motivated by the finding, we propose reparameterized absorbing discrete diffusion (RADD), a dedicated diffusion model that characterizes the time-independent conditional probabilities. Besides its simplicity, RADD can reduce the number of function evaluations (NFEs) by caching the output of the time-independent network when the noisy sample remains unchanged in a sampling interval. Empirically, RADD is up to 3.5 times faster while consistently achieving a better performance than the strongest baseline. Built upon the new factorization of the concrete score, we further prove a surprising result that the exact likelihood of absorbing diffusion can be rewritten to a simple form (named denoising cross-entropy) and then estimated efficiently by the Monte Carlo method. The resulting approach also applies to the original parameterization of the concrete score. It significantly advances the state-of-the-art discrete diffusion on 5 zero-shot language modeling benchmarks (measured by perplexity) at the GPT-2 scale.
Submitted 6 June, 2024;
originally announced June 2024.
-
Data quality control system and long-term performance monitor of the LHAASO-KM2A
Authors:
Zhen Cao,
F. Aharonian,
Axikegu,
Y. X. Bai,
Y. W. Bao,
D. Bastieri,
X. J. Bi,
Y. J. Bi,
W. Bian,
A. V. Bukevich,
Q. Cao,
W. Y. Cao,
Zhe Cao,
J. Chang,
J. F. Chang,
A. M. Chen,
E. S. Chen,
H. X. Chen,
Liang Chen,
Lin Chen,
Long Chen,
M. J. Chen,
M. L. Chen,
Q. H. Chen,
S. Chen
, et al. (263 additional authors not shown)
Abstract:
The KM2A is the largest sub-array of the Large High Altitude Air Shower Observatory (LHAASO). It consists of 5216 electromagnetic particle detectors (EDs) and 1188 muon detectors (MDs). The data recorded by the EDs and MDs are used to reconstruct primary information of cosmic ray and gamma-ray showers. This information is used for physical analysis in gamma-ray astronomy and cosmic ray physics. To ensure the reliability of the LHAASO-KM2A data, a three-level quality control system has been established. It is used to monitor the status of detector units, stability of reconstructed parameters and the performance of the array based on observations of the Crab Nebula and Moon shadow. This paper will introduce the control system and its application on the LHAASO-KM2A data collected from August 2021 to July 2023. During this period, the pointing and angular resolution of the array were stable. From the observations of the Moon shadow and Crab Nebula, the results achieved using the two methods are consistent with each other. According to the observation of the Crab Nebula at energies from 25 TeV to 100 TeV, the time averaged pointing errors are estimated to be $-0.003^{\circ} \pm 0.005^{\circ}$ and $0.001^{\circ} \pm 0.006^{\circ}$ in the R.A. and Dec directions, respectively.
Submitted 13 June, 2024; v1 submitted 20 May, 2024;
originally announced May 2024.
-
HAFFormer: A Hierarchical Attention-Free Framework for Alzheimer's Disease Detection From Spontaneous Speech
Authors:
Zhongren Dong,
Zixing Zhang,
Weixiang Xu,
Jing Han,
Jianjun Ou,
Björn W. Schuller
Abstract:
Automatically detecting Alzheimer's Disease (AD) from spontaneous speech plays an important role in its early diagnosis. Recent approaches rely heavily on Transformer architectures due to their efficiency in modelling long-range context dependencies. However, the quadratic increase in the computational complexity of self-attention with audio length poses a challenge when deploying such models on edge devices. In this context, we construct a novel framework, namely Hierarchical Attention-Free Transformer (HAFFormer), to better deal with long speech for AD detection. Specifically, we employ an attention-free module of Multi-Scale Depthwise Convolution to replace the self-attention and thus avoid the expensive computation, and a GELU-based Gated Linear Unit to replace the feedforward layer, aiming to automatically filter out redundant information. Moreover, we design a hierarchical structure to force it to learn a variety of information grains, from the frame level to the dialogue level. In extensive experiments on the ADReSS-M dataset, HAFFormer achieves results competitive with other recent work (82.6% accuracy), but with a significant reduction in computational complexity and model size compared to the standard Transformer. This shows the efficiency of HAFFormer in dealing with long audio for AD detection.
Submitted 6 May, 2024;
originally announced May 2024.
-
Bootstrap 3D Reconstructed Scenes from 3D Gaussian Splatting
Authors:
Yifei Gao,
Jie Ou,
Lei Wang,
Jun Cheng
Abstract:
Recent developments in neural rendering techniques have greatly enhanced the rendering of photo-realistic 3D scenes across both academic and commercial fields. The latest method, known as 3D Gaussian Splatting (3D-GS), has set new benchmarks for rendering quality and speed. Nevertheless, the limitations of 3D-GS become pronounced in synthesizing new viewpoints, especially for views that greatly deviate from those seen during training. Additionally, issues such as dilation and aliasing arise when zooming in or out. These challenges can all be traced back to a single underlying issue: insufficient sampling. In our paper, we present a bootstrapping method that significantly addresses this problem. This approach employs a diffusion model to enhance the rendering of novel views using trained 3D-GS, thereby streamlining the training process. Our results indicate that bootstrapping effectively reduces artifacts and yields clear improvements on the evaluation metrics. Furthermore, we show that our method is versatile and can be easily integrated, allowing various 3D reconstruction projects to benefit from our approach.
Submitted 12 May, 2024; v1 submitted 29 April, 2024;
originally announced April 2024.
-
Inductive-Deductive Strategy Reuse for Multi-Turn Instructional Dialogues
Authors:
Jiao Ou,
Jiayu Wu,
Che Liu,
Fuzheng Zhang,
Di Zhang,
Kun Gai
Abstract:
Aligning large language models (LLMs) with human expectations requires high-quality instructional dialogues, which can be achieved by raising diverse, in-depth, and insightful instructions that deepen interactions. Existing methods target instructions from real instruction dialogues as a learning goal and fine-tune a user simulator for posing instructions. However, the user simulator struggles to implicitly model complex dialogue flows and pose high-quality instructions. In this paper, we take inspiration from the cognitive abilities inherent in human learning and propose the explicit modeling of complex dialogue flows through instructional strategy reuse. Specifically, we first induce high-level strategies from various real instruction dialogues. These strategies are applied to new dialogue scenarios deductively, where the instructional strategies facilitate high-quality instructions. Experimental results show that our method can generate diverse, in-depth, and insightful instructions for a given dialogue history. The constructed multi-turn instructional dialogues can outperform competitive baselines on the downstream chat model.
Submitted 17 April, 2024;
originally announced April 2024.
-
Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding
Authors:
Jie Ou,
Yueming Chen,
Wenhong Tian
Abstract:
While Large Language Models (LLMs) have shown remarkable abilities, they are hindered by significant resource consumption and considerable latency due to autoregressive processing. In this study, we introduce Adaptive N-gram Parallel Decoding (ANPD), an innovative and lossless approach that accelerates inference by allowing the simultaneous generation of multiple tokens. ANPD incorporates a two-stage approach: it begins with a rapid drafting phase that employs an N-gram module, which adapts based on the current interactive context, followed by a verification phase, during which the original LLM assesses and confirms the proposed tokens. Consequently, ANPD preserves the integrity of the LLM's original output while enhancing processing speed. We further leverage a multi-level architecture for the N-gram module to enhance the precision of the initial draft, consequently reducing inference latency. ANPD eliminates the need for retraining or extra GPU memory, making it an efficient and plug-and-play enhancement. In our experiments, models such as LLaMA and its fine-tuned variants have shown speed improvements up to 3.67x, validating the effectiveness of our proposed ANPD.
Submitted 10 April, 2024;
originally announced April 2024.
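The draft-then-verify loop described in this abstract can be illustrated with a toy sketch. Everything here is an assumption made for illustration: `toy_next_token` is a hypothetical deterministic stand-in for one greedy LLM step, the draft module is a plain bigram table built from the running context, and verification is simulated token by token, whereas a real system would presumably check all drafted tokens in one batched forward pass. It is not the authors' implementation.

```python
def toy_next_token(context):
    """Hypothetical stand-in for one greedy decoding step of an LLM."""
    phrase = ["the", "cat", "sat", "on", "the", "mat", "."]
    return phrase[len(context) % len(phrase)]


def greedy_decode(next_token, prompt, n_new):
    """Baseline: plain autoregressive decoding, one model call per token."""
    out = list(prompt)
    for _ in range(n_new):
        out.append(next_token(out))
    return out


def draft_verify_decode(next_token, prompt, n_new, draft_len=3):
    """Draft tokens with a context-adaptive bigram table, then verify them.

    Assumes a non-empty prompt. Returns (tokens, verification_passes).
    """
    out = list(prompt)
    table = {}  # previous token -> most recently observed follower
    for prev, nxt in zip(out, out[1:]):
        table[prev] = nxt
    target = len(prompt) + n_new
    passes = 0
    while len(out) < target:
        # Drafting phase: chain bigram lookups from the last emitted token.
        draft, last = [], out[-1]
        while len(draft) < draft_len and last in table:
            last = table[last]
            draft.append(last)
        # Verification phase: keep the longest prefix the model agrees with.
        passes += 1
        accepted = []
        for tok in draft:
            if next_token(out + accepted) != tok:
                break
            accepted.append(tok)
        # The model always contributes one token of its own: either the
        # correction at the first mismatch or a bonus token after a full match.
        accepted.append(next_token(out + accepted))
        for tok in accepted:
            if len(out) >= target:
                break
            table[out[-1]] = tok  # adapt the table to the current context
            out.append(tok)
    return out, passes


tokens, passes = draft_verify_decode(toy_next_token, ["the"], 20)
print(tokens == greedy_decode(toy_next_token, ["the"], 20), passes)
```

Because every emitted token is one the model itself would have produced for that prefix, the output matches plain greedy decoding; any speedup comes only from needing fewer verification passes than tokens.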
-
Dynamic Resolution Guidance for Facial Expression Recognition
Authors:
Jie Ou,
Xu Li,
Tianxiang Jiang,
Yuanlun Xie
Abstract:
Facial expression recognition (FER) is vital for human-computer interaction and emotion analysis, yet recognizing expressions in low-resolution images remains challenging. This paper introduces a practical method called Dynamic Resolution Guidance for Facial Expression Recognition (DRGFER) to effectively recognize facial expressions in images with varying resolutions without compromising FER model accuracy. Our framework comprises two main components: the Resolution Recognition Network (RRN) and the Multi-Resolution Adaptation Facial Expression Recognition Network (MRAFER). The RRN determines the image resolution and outputs a binary vector, and the MRAFER assigns images to suitable facial expression recognition networks based on resolution. We evaluated DRGFER on the widely used RAFDB and FERPlus datasets, demonstrating that our method retains optimal model performance at each resolution and outperforms alternative resolution approaches. The proposed framework exhibits robustness against resolution variations and facial expressions, offering a promising solution for real-world applications.
Submitted 9 April, 2024;
originally announced April 2024.
-
Longitudinal tri-foci Metalens empowered multiple-magnification and diffraction-limited microscope
Authors:
Chuang Sun,
Zixuan Wang,
Kian Shen Kiang,
Jun-Yu Ou,
Jize Yan
Abstract:
The dielectric metalens has emerged as an attractive device for advanced imaging systems because of its powerful ability to manipulate light beams, small volume, and light weight. However, the applications of silicon nitride (Si3N4) metalenses are limited by the low refractive index of Si3N4, and a multi-foci metalens has not yet been realized in Si3N4. Here, we explore in depth the working mechanism of a truncated waveguide meta-atom and obtain a Si3N4 metalens with three longitudinal diffraction-limited focal points. By utilizing the metalens sample as a condenser lens, a commercial microscope can obtain three magnifications based on a single objective lens. Finally, an infinity-corrected microscope with three high magnifications (9.5X, 10X, and 29X) and diffraction-limited resolution is integrated into centimetre dimensions for the first time by using the tri-foci metalens sample as an objective lens. This research would boost the scaling up of metalens microscopes as well as the multifunctional application of Si3N4 metalenses.
Submitted 18 February, 2024;
originally announced February 2024.
-
Near-infrared metalens empowered dual-mode high resolution and large FOV microscope
Authors:
Chuang Sun,
Hailong Pi,
Kian Shen Kiang,
Jize Yan,
Jun-Yu Ou
Abstract:
The spiral phase contrast microscope can clearly distinguish the morphological information of low-contrast objects (e.g., biological samples) because of the isotropic edge-enhancement effect, while the bright field microscope can image the overall morphology of amplitude objects. However, the imaging resolution, magnification, and field of view of conventional spiral phase contrast microscopes based on the 4f filtering configuration are limited by the system's complexity. Here, we report compact dual-mode microscopes working in the near-infrared using an engineered metalens that can be tuned between spiral phase contrast imaging and bright field imaging by polarization control. The metalens combines the high-resolution objective lens and polarization-controlled phase filter into a single-layer nanofin array. We demonstrate two infinity-corrected microscope systems that achieve subwavelength resolution (0.7 times the wavelength), large magnification (58X), and a large field of view (600 μm × 800 μm). Unstained onion epidermis is imaged by the microscope to show the dual-mode imaging ability for biological samples. Finally, a singlet dual-mode microscope system is demonstrated to show the edge-detection application for industrial standards. Our results could open new opportunities in applications of biological imaging, industrial machine vision, and semiconductor inspection.
Submitted 18 February, 2024;
originally announced February 2024.
-
Enhancing Role-playing Systems through Aggressive Queries: Evaluation and Improvement
Authors:
Yihong Tang,
Jiao Ou,
Che Liu,
Fuzheng Zhang,
Di Zhang,
Kun Gai
Abstract:
The advent of Large Language Models (LLMs) has propelled dialogue generation into new realms, particularly in the field of role-playing systems (RPSs). While enhanced with ordinary role-relevant training dialogues, existing LLM-based RPSs still struggle to align with roles when handling intricate and trapped queries in boundary scenarios. In this paper, we design the Modular ORchestrated Trap-setting Interaction SystEm (MORTISE) to benchmark and improve the role-playing LLMs' performance. MORTISE can produce highly role-relevant aggressive queries through the collaborative effort of multiple LLM-based modules, and formulate corresponding responses to create an adversarial training dataset via a consistent response generator. We select 190 Chinese and English roles to construct aggressive queries to benchmark existing role-playing LLMs. Through comprehensive evaluation, we find that existing models exhibit a general deficiency in role alignment capabilities. We further select 180 of the roles to collect an adversarial training dataset (named RoleAD) and retain the other 10 roles for testing. Experiments on models improved by RoleAD indicate that our adversarial dataset ameliorates this deficiency, with the improvements demonstrating a degree of generalizability in ordinary scenarios.
Submitted 15 June, 2024; v1 submitted 16 February, 2024;
originally announced February 2024.
-
Tunable on-chip optical traps for levitating particles based on single-layer metasurface
Authors:
Chuang Sun,
Hailong Pi,
Kian Shen Kiang,
Tiberius S. Georgescu,
Jun-Yu Ou,
Hendrik Ulbricht,
Jize Yan
Abstract:
Multiple optically levitated nanoparticles have emerged as a platform for studying complex fundamental physics such as non-equilibrium phenomena, quantum entanglement, and light-matter interaction, which could be applied to sensing weak forces and torques with high sensitivity and accuracy. An optical trapping landscape of increased complexity is needed to engineer the interaction between levitated particles beyond the single harmonic trap. However, existing platforms based on spatial light modulators for studying interactions between levitated particles suffer from low efficiency, instability at focal points, complex optical systems, and limited scalability for sensing applications. Here, we experimentally demonstrate that a metasurface which forms two diffraction-limited focal points with a high numerical aperture (0.9) and high efficiency (31%) can generate tunable optical potential wells without any intensity fluctuations. A bistable potential and double potential wells were observed in the experiment by varying the distance between the focal points, and two nanoparticles were levitated in double potential wells for hours, which could be used for investigating the nonlinear dynamics, thermal dynamics, and optical binding of levitated particles. This would pave the way for scaling the number of levitated optomechanical devices or realizing parallel levitated sensors.
Submitted 16 January, 2024;
originally announced January 2024.
-
PMFSNet: Polarized Multi-scale Feature Self-attention Network For Lightweight Medical Image Segmentation
Authors:
Jiahui Zhong,
Wenhong Tian,
Yuanlun Xie,
Zhijia Liu,
Jie Ou,
Taoran Tian,
Lei Zhang
Abstract:
Current state-of-the-art medical image segmentation methods prioritize accuracy but often at the expense of increased computational demands and larger model sizes. Applying these large-scale models to the relatively limited scale of medical image datasets tends to induce redundant computation, complicating the process without the necessary benefits. This approach not only adds complexity but also presents challenges for the integration and deployment of lightweight models on edge devices. For instance, recent transformer-based models have excelled in 2D and 3D medical image segmentation due to their extensive receptive fields and high parameter count. However, their effectiveness comes with a risk of overfitting when applied to small datasets, and they often neglect the vital inductive biases of Convolutional Neural Networks (CNNs), which are essential for local feature representation. In this work, we propose PMFSNet, a novel medical imaging segmentation model that effectively balances global and local feature processing while avoiding the computational redundancy typical in larger models. PMFSNet streamlines the UNet-based hierarchical structure and simplifies the self-attention mechanism's computational complexity, making it suitable for lightweight applications. It incorporates a plug-and-play PMFS block, a multi-scale feature enhancement module based on attention mechanisms, to capture long-term dependencies. Extensive results demonstrate that even with fewer than 1 million parameters, our method achieves superior performance in various segmentation tasks across different data scales. It achieves IoU scores of 84.68%, 82.02%, and 78.82% on public datasets of teeth CT (CBCT), ovarian tumor ultrasound (MMOTU), and skin lesion dermoscopy images (ISIC 2018), respectively. The source code is available at https://github.com/yykzjh/PMFSNet.
Submitted 15 January, 2024;
originally announced January 2024.
-
The dimension of polynomial growth holomorphic functions and forms on gradient Kähler Ricci shrinkers
Authors:
Fei He,
Jianyu Ou
Abstract:
We study polynomial growth holomorphic functions and forms on complete gradient shrinking Ricci solitons. By relating to the spectral data of the $f$-Laplacian, we show that the dimension of the space of polynomial growth holomorphic functions or holomorphic $(p,0)$-forms is finite. In particular, we obtain a sharp dimension estimate for the space of linear growth holomorphic functions. Under some additional curvature assumption, we prove a sharp estimate for the frequency of polynomial growth holomorphic functions, which we use to obtain a dimension upper bound as a power function of the polynomial order.
Submitted 5 January, 2024;
originally announced January 2024.
-
DialogBench: Evaluating LLMs as Human-like Dialogue Systems
Authors:
Jiao Ou,
Junda Lu,
Che Liu,
Yihong Tang,
Fuzheng Zhang,
Di Zhang,
Kun Gai
Abstract:
Large language models (LLMs) have achieved remarkable breakthroughs in new dialogue capabilities by leveraging instruction tuning, which refreshes human impressions of dialogue systems. The long-standing goal of dialogue systems is to be human-like enough to establish long-term connections with users. Therefore, there has been an urgent need to evaluate LLMs as human-like dialogue systems. In this paper, we propose DialogBench, a dialogue evaluation benchmark that contains 12 dialogue tasks to probe the capabilities that LLMs should have as human-like dialogue systems. Specifically, we prompt GPT-4 to generate evaluation instances for each task. We first design the basic prompt based on widely used design principles and further mitigate the existing biases to generate higher-quality evaluation instances. Our extensive tests of 26 LLMs on English and Chinese DialogBench show that instruction tuning improves the human likeness of LLMs to a certain extent, but most LLMs still have much room for improvement as human-like dialogue systems. Interestingly, results also show that the positioning of assistant AI can make instruction tuning weaken the human emotional perception of LLMs and their mastery of information about human daily life.
Submitted 29 March, 2024; v1 submitted 2 November, 2023;
originally announced November 2023.
-
Exploring the Limits of Historical Information for Temporal Knowledge Graph Extrapolation
Authors:
Yi Xu,
Junjie Ou,
Hui Xu,
Luoyi Fu,
Lei Zhou,
Xinbing Wang,
Chenghu Zhou
Abstract:
Temporal knowledge graphs, representing the dynamic relationships and interactions between entities over time, have been identified as a promising approach for event forecasting. However, a limitation of most temporal knowledge graph reasoning methods is their heavy reliance on the recurrence or periodicity of events, which brings challenges to inferring future events related to entities that lack historical interaction. In fact, the current state of affairs is often the result of a combination of historical information and underlying factors that are not directly observable. To this end, we investigate the limits of historical information for temporal knowledge graph extrapolation and propose a new event forecasting model called Contrastive Event Network (CENET) based on a novel training framework of historical contrastive learning. CENET learns both the historical and non-historical dependency to distinguish the most potential entities that best match the given query. Simultaneously, by launching contrastive learning, it trains representations of queries to probe whether the current moment is more dependent on historical or non-historical events. These representations further help train a binary classifier, whose output is a boolean mask, indicating the related entities in the search space. During the inference process, CENET employs a mask-based strategy to generate the final results. We evaluate our proposed model on five benchmark graphs. The results demonstrate that CENET significantly outperforms all existing methods in most metrics, achieving at least 8.3% relative improvement of Hits@1 over previous state-of-the-art baselines on event-based datasets.
Submitted 28 August, 2023;
originally announced August 2023.
-
On bicyclic graphs with maximal Graovac-Ghorbani index
Authors:
Rui Song,
Saihua Liu,
Jianping Ou
Abstract:
The Graovac-Ghorbani index is a new version of the atom-bond connectivity index. D. Pacheco et al. [D. Pacheco, L. de Lima, C. S. Oliveira, On the Graovac-Ghorbani Index for Bicyclic Graph with No Pendent Vertices, MATCH Commun. Math. Comput. Chem. 86 (2021) 429-448] conjectured sharp lower and upper bounds on the Graovac-Ghorbani index for all bicyclic graphs. Motivated by their work, in this paper we determine the maximal Graovac-Ghorbani index of bicyclic graphs and characterize the corresponding extremal graphs, which settles one of their conjectures.
Submitted 23 August, 2023;
originally announced August 2023.
-
A Robust Integrated Multi-Strategy Bus Control System via Deep Reinforcement Learning
Authors:
Qinghui Nie,
Jishun Ou,
Haiyang Zhang,
Jiawei Lu,
Shen Li,
Haotian Shi
Abstract:
An efficient urban bus control system has the potential to significantly reduce travel delays and streamline the allocation of transportation resources, thereby offering enhanced and user-friendly transit services to passengers. However, bus operation efficiency can be impacted by bus bunching. This problem is notably exacerbated when the bus system operates along a signalized corridor with unpredictable travel demand. To mitigate this challenge, we introduce a multi-strategy fusion approach for the longitudinal control of connected and automated buses. The approach is driven by a physics-informed deep reinforcement learning (DRL) algorithm and takes into account a variety of traffic conditions along urban signalized corridors. Taking advantage of connected and autonomous vehicle (CAV) technology, the proposed approach can leverage real-time information regarding bus operating conditions and road traffic environment. By integrating the aforementioned information into the DRL-based bus control framework, our designed physics-informed DRL state fusion approach and reward function efficiently embed prior physics and leverage the merits of equilibrium and consensus concepts from control theory. This integration enables the framework to learn and adapt multiple control strategies to effectively manage complex traffic conditions and fluctuating passenger demands. Three control variables, i.e., dwell time at stops, speed between stations, and signal priority, are formulated to minimize travel duration and ensure bus stability with the aim of avoiding bus bunching. We present simulation results to validate the effectiveness of the proposed approach, underlining its superior performance when subjected to sensitivity analysis, specifically considering factors such as traffic volume, desired speed, and traffic signal conditions.
Submitted 16 August, 2023;
originally announced August 2023.
-
Zwicky Transient Facility and Globular Clusters: the gr-Band Period-Luminosity Relations for Mira Variables at Maximum Light and Their Applications to Local Galaxies
Authors:
Chow-Choong Ngeow,
Jia-Yu Ou,
Anupam Bhardwaj,
Josiah Purdum,
Ben Rusholme,
Avery Wold
Abstract:
Based on 14 Miras located in 7 globular clusters, we derived the first gr-band period-luminosity (PL) relations at maximum light for the large-amplitude Mira variables using the multi-year light-curve data collected from the Zwicky Transient Facility (ZTF). Since Miras are red variables, we applied a color-term correction to subsets of ZTF light curves, and found that such corrections do not have a large impact on period determinations. We applied our derived PL relations to the known extragalactic Miras in five local galaxies (Sextans, Leo I, Leo II, NGC6822 and IC1613), and determined their Mira-based distances. We demonstrated that our PL relations can be applied to short-period (<300 days) Miras, including those in the two most distant galaxies (NGC6822 and IC1613) in our sample, even when only a portion of the light curve around maximum light has detections. We have also shown that the long-period extragalactic Miras do not follow the PL relations extrapolated to longer periods. Hence, our derived PL relations are only applicable to the short-period Miras, which will be discovered in abundance in local galaxies in the era of the Vera C. Rubin Observatory's Legacy Survey of Space and Time.
Submitted 13 July, 2023;
originally announced July 2023.
-
Pragmatic Inference with a CLIP Listener for Contrastive Captioning
Authors:
Jiefu Ou,
Benno Krojer,
Daniel Fried
Abstract:
We propose a simple yet effective and robust method for contrastive captioning: generating discriminative captions that distinguish target images from very similar alternative distractor images. Our approach is built on a pragmatic inference procedure that formulates captioning as a reference game between a speaker, which produces possible captions describing the target, and a listener, which selects the target given the caption. Unlike previous methods that derive both speaker and listener distributions from a single captioning model, we leverage an off-the-shelf CLIP model to parameterize the listener. Compared with captioner-only pragmatic models, our method benefits from rich vision-language alignment representations from CLIP when reasoning over distractors. Like previous methods for discriminative captioning, our method uses a hyperparameter to control the tradeoff between the informativity (how likely captions are to allow a human listener to discriminate the target image) and the fluency of the captions. However, we find that our method is substantially more robust to the value of this hyperparameter than past methods, which allows us to automatically optimize the captions for informativity, outperforming past methods for discriminative captioning by 11% to 15% accuracy in human evaluations.
Submitted 14 June, 2023;
originally announced June 2023.
-
Entangled Photons Enabled Ultrafast Stimulated Raman Spectroscopy for Molecular Dynamics
Authors:
Joel Jiahao Fan,
Zhe-Yu Jeff Ou,
Zhedong Zhang
Abstract:
Quantum entanglement has emerged as a great resource for interactions between molecules and radiation. We propose a new paradigm of stimulated Raman scattering with entangled photons. A quantum ultrafast Raman spectroscopy is developed for condensed-phase molecules, to monitor the exciton populations and coherences. Analytic results are obtained, showing a time-frequency scale not attainable by classical light. The Raman signal presents an unprecedented selectivity of molecular correlation functions, as a result of the Hong-Ou-Mandel interference. This is a typical quantum nature, advancing the spectroscopy for clarity. Our work suggests a new scheme of optical signals and spectroscopy, with potential to unveil advanced information about complex materials.
Submitted 24 May, 2023; v1 submitted 23 May, 2023;
originally announced May 2023.
-
Ensemble Reinforcement Learning: A Survey
Authors:
Yanjie Song,
P. N. Suganthan,
Witold Pedrycz,
Junwei Ou,
Yongming He,
Yingwu Chen,
Yutong Wu
Abstract:
Reinforcement Learning (RL) has emerged as a highly effective technique for addressing various scientific and applied problems. Despite its success, certain complex tasks remain challenging to be addressed solely with a single model and algorithm. In response, ensemble reinforcement learning (ERL), a promising approach that combines the benefits of both RL and ensemble learning (EL), has gained widespread popularity. ERL leverages multiple models or training algorithms to comprehensively explore the problem space and possesses strong generalization capabilities. In this study, we present a comprehensive survey on ERL to provide readers with an overview of recent advances and challenges in the field. Firstly, we provide an introduction to the background and motivation for ERL. Secondly, we conduct a detailed analysis of strategies such as model selection and combination that have been successfully implemented in ERL. Subsequently, we explore the application of ERL, summarize the datasets, and analyze the algorithms employed. Finally, we outline several open questions and discuss future research directions of ERL. By offering guidance for future scientific research and engineering applications, this survey significantly contributes to the advancement of ERL.
Submitted 13 December, 2023; v1 submitted 5 March, 2023;
originally announced March 2023.
-
Wavelength dependence of laser-induced nanowelded microstructures assembled from metal nanoparticles
Authors:
Ariel Rogers,
Isabelle I. Niyonshuti,
Jun Ou,
Diksha Shrestha,
Jingyi Chen,
Yong Wang
Abstract:
Light-based nanowelding of metallic nanoparticles is of particular interest because it provides convenient and controlled means for the conversion of nanoparticles into microstructures and fabrication of nanodevices. Here, we demonstrated the wavelength dependence of laser-induced nanowelded shapes of silver nanoparticles (AgNPs). We observed that the nanowelded microstructures illuminated by the 405 nm laser only were more branched than those formed with illumination of both the 405 nm and 532 nm lasers. We quantified this observation by several compactness descriptors and examined the dependence of the power of the 532 nm laser. More importantly, to understand the experimental observations, we formulated and tested a hypothesis by calculating the wavelength-dependent electric field enhancement due to surface plasmon resonance of the AgNPs and nanowelded microstructures when illuminated with lights at the two wavelengths. Based on the different patterns of "hot spots" for welding AgNPs from these calculations, numerical simulations successfully "reproduced" the different shapes of nanowelded microstructures, supporting our hypothesis. This work suggests the possibility of light-based controlling the shapes of laser-induced nanowelded microstructures of metallic nanoparticles. This work is expected to facilitate the development of better nanowelding strategies of metallic nanoparticles for broader applications.
Submitted 21 February, 2023;
originally announced February 2023.
-
Hierarchical Event Grounding
Authors:
Jiefu Ou,
Adithya Pratapa,
Rishubh Gupta,
Teruko Mitamura
Abstract:
Event grounding aims at linking mention references in text corpora to events from a knowledge base (KB). Previous work on this task focused primarily on linking to a single KB event, thereby overlooking the hierarchical aspects of events. Events in documents are typically described at various levels of spatio-temporal granularity (Glavas et al. 2014). These hierarchical relations are utilized in downstream tasks of narrative understanding and schema construction. In this work, we present an extension to the event grounding task that requires tackling hierarchical event structures from the KB. Our proposed task involves linking a mention reference to a set of event labels from a subevent hierarchy in the KB. We propose a retrieval methodology that leverages event hierarchy through an auxiliary hierarchical loss (Murty et al. 2018). On an automatically created multilingual dataset from Wikipedia and Wikidata, our experiments demonstrate the effectiveness of the hierarchical loss against retrieve and re-rank baselines (Wu et al. 2020; Pratapa, Gupta, and Mitamura 2022). Furthermore, we demonstrate the systems' ability to aid hierarchical discovery among unseen events.
Submitted 8 February, 2023;
originally announced February 2023.
-
A Distance Measurement to M33 Using Optical Photometry of Mira Variables
Authors:
Jia-Yu Ou,
Chow-Choong Ngeow,
Anupam Bhardwaj,
Matthew J. Graham,
Russ R. Laher,
Frank J. Masci,
Reed Riddle
Abstract:
We present a systematic analysis to determine and improve the pulsation periods of 1637 known long-period Mira variables in M33 using $gri$-band light curves spanning $\sim18$~years from several surveys, including M33 variability survey, Panoramic Survey Telescope and Rapid Response System, Palomar Transient Factory (PTF), intermediate PTF, and Zwicky Transient Facility. Based on these collections of light curves, we found that optical band light curves that are as complete as possible are crucial to determine the periods of distant Miras. We demonstrated that machine learning techniques can be used to classify Miras into O-rich and C-rich based on the $(J-K_s)$ period--color plane. Finally, using O-rich Miras at maximum light together with our improved periods, we derived a distance modulus to M33 of $24.67 \pm 0.06$~mag, which is in good agreement with the recommended value given in the literature.
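For readers who want the quoted distance modulus in physical units, the following is a minimal sketch and not part of the paper. It assumes the standard relation $d = 10^{\mu/5 + 1}$ pc with extinction already accounted for, and a first-order error propagation. The function names are illustrative.

```python
import math

def distance_pc(mu):
    """Distance in parsecs from a distance modulus mu = 5*log10(d / 10 pc)."""
    return 10.0 ** (mu / 5.0 + 1.0)

def distance_error_pc(mu, sigma_mu):
    """First-order uncertainty: sigma_d = d * ln(10) / 5 * sigma_mu."""
    return distance_pc(mu) * math.log(10.0) / 5.0 * sigma_mu

# Values quoted in the abstract.
mu_m33 = 24.67
sigma_mu_m33 = 0.06

d_m33 = distance_pc(mu_m33)
sigma_d_m33 = distance_error_pc(mu_m33, sigma_mu_m33)
print(f"d = {d_m33 / 1e3:.0f} kpc, sigma_d = {sigma_d_m33 / 1e3:.0f} kpc")
```

Under these assumptions the quoted modulus corresponds to roughly 0.86 Mpc, with an uncertainty of a few percent.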
Submitted 6 February, 2023;
originally announced February 2023.
-
TESS Asteroseismic Analysis of HD 76920: The Giant Star Hosting An Extremely Eccentric Exoplanet
Authors:
Chen Jiang,
Tao Wu,
Adina D. Feinstein,
Keivan G. Stassun,
Timothy R. Bedding,
Dimitri Veras,
Enrico Corsaro,
Derek L. Buzasi,
Dennis Stello,
Yaguang Li,
Savita Mathur,
Rafael A. Garcia,
Sylvain N. Breton,
Mia S. Lundkvist,
Przemyslaw J. Mikolajczyk,
Charlotte Gehan,
Tiago L. Campante,
Diego Bossini,
Stephen R. Kane,
Jia Mian Joel Ong,
Mutlu Yildiz,
Cenk Kayhan,
Zeynep Celik Orhan,
Sibel Ortel,
Xinyi Zhang
, et al. (8 additional authors not shown)
Abstract:
The Transiting Exoplanet Survey Satellite (TESS) mission searches for new exoplanets. The observing strategy of TESS results in high-precision photometry of millions of stars across the sky, allowing for detailed asteroseismic studies of individual systems. In this work, we present a detailed asteroseismic analysis of the giant star HD 76920 hosting a highly eccentric giant planet ($e = 0.878$) with an orbital period of 415 days, using five sectors of TESS light curves that cover around 140 days of data. Solar-like oscillations in HD 76920 are detected around $52 \, μ$Hz by TESS for the first time. By utilizing asteroseismic modeling that takes classical observational parameters and stellar oscillation frequencies as constraints, we determine improved measurements of the stellar mass ($1.22 \pm 0.11\, M_\odot$), radius ($8.68 \pm 0.34\,R_\odot$), and age ($5.2 \pm 1.4\,$Gyr). With the updated parameters of the host star, we update the semi-major axis and mass of the planet as $a=1.165 \pm 0.035$ au and $M_{\rm p}\sin{i} = 3.57 \pm 0.22\,M_{\rm Jup}$. With an orbital pericenter of $0.142 \pm 0.005$ au, we confirm that the planet is currently far enough away from the star to experience negligible tidal decay until being engulfed in the stellar envelope. We also confirm that this event will occur within about 100\,Myr, depending on the stellar model used.
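The orbital numbers quoted above can be cross-checked with a short calculation. This is a sketch and not the authors' analysis. It assumes Kepler's third law in solar units with the planet mass neglected, a pericenter distance of $a(1-e)$, and an approximate value of 215.032 solar radii per au; the function and constant names are illustrative.

```python
import math

# Values quoted in the abstract.
A_AU = 1.165          # semi-major axis [au]
M_STAR_MSUN = 1.22    # stellar mass [solar masses]
ECC = 0.878           # orbital eccentricity
R_STAR_RSUN = 8.68    # stellar radius [solar radii]

# Assumed conversion constants (not taken from the abstract).
DAYS_PER_YEAR = 365.25
RSUN_PER_AU = 215.032  # approximate number of solar radii in 1 au

def orbital_period_days(a_au, m_star_msun):
    """Kepler's third law in solar units: P[yr]^2 = a[au]^3 / M[Msun]."""
    return math.sqrt(a_au ** 3 / m_star_msun) * DAYS_PER_YEAR

def pericenter_au(a_au, ecc):
    """Closest approach of an elliptical orbit: q = a * (1 - e)."""
    return a_au * (1.0 - ecc)

period_days = orbital_period_days(A_AU, M_STAR_MSUN)
q_au = pericenter_au(A_AU, ECC)
q_in_stellar_radii = q_au * RSUN_PER_AU / R_STAR_RSUN

print(f"P = {period_days:.1f} d, q = {q_au:.3f} au "
      f"= {q_in_stellar_radii:.1f} stellar radii")
```

With these inputs the period comes out close to the quoted 415 days and the pericenter close to the quoted 0.142 au, which is only a few stellar radii from the star's center.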
Submitted 6 February, 2023; v1 submitted 2 February, 2023;
originally announced February 2023.
-
Temporal Knowledge Graph Reasoning with Historical Contrastive Learning
Authors:
Yi Xu,
Junjie Ou,
Hui Xu,
Luoyi Fu
Abstract:
Temporal knowledge graph, serving as an effective way to store and model dynamic relations, shows promising prospects in event forecasting. However, most temporal knowledge graph reasoning methods are highly dependent on the recurrence or periodicity of events, which brings challenges to inferring future events related to entities that lack historical interaction. In fact, the current moment is often the combined effect of a small part of historical information and those unobserved underlying factors. To this end, we propose a new event forecasting model called Contrastive Event Network (CENET), based on a novel training framework of historical contrastive learning. CENET learns both the historical and non-historical dependency to distinguish the most potential entities that can best match the given query. Simultaneously, it trains representations of queries to investigate whether the current moment depends more on historical or non-historical events by launching contrastive learning. The representations further help train a binary classifier whose output is a boolean mask to indicate related entities in the search space. During the inference process, CENET employs a mask-based strategy to generate the final results. We evaluate our proposed model on five benchmark graphs. The results demonstrate that CENET significantly outperforms all existing methods in most metrics, achieving at least $8.3\%$ relative improvement of Hits@1 over previous state-of-the-art baselines on event-based datasets.
Submitted 2 December, 2022; v1 submitted 20 November, 2022;
originally announced November 2022.
-
Counterfactual Data Augmentation via Perspective Transition for Open-Domain Dialogues
Authors:
Jiao Ou,
Jinchao Zhang,
Yang Feng,
Jie Zhou
Abstract:
The construction of open-domain dialogue systems requires high-quality dialogue datasets. The dialogue data admits a wide variety of responses for a given dialogue history, especially responses with different semantics. However, collecting such a high-quality dataset is labor-intensive and time-consuming in most scenarios. In this paper, we propose a data augmentation method to automatically augment high-quality responses with different semantics by counterfactual inference. Specifically, given an observed dialogue, our counterfactual generation model first infers semantically different responses by replacing the observed reply perspective with substituted ones. Furthermore, our data selection method filters out detrimental augmented responses. Experimental results show that our data augmentation method can augment high-quality responses with different semantics for a given dialogue history, and can outperform competitive baselines on multiple downstream tasks.
Submitted 30 October, 2022;
originally announced October 2022.
-
Photonic Metamaterial Analogue of a Continuous Time Crystal
Authors:
Tongjun Liu,
Jun-Yu Ou,
Kevin F. MacDonald,
Nikolay I. Zheludev
Abstract:
Time crystals are an eagerly sought phase of matter with broken time-translation symmetry. Quantum time crystals with discretely broken time-translation symmetry have been demonstrated in trapped ions, atoms and spins while continuously broken time-translation symmetry has been observed in an atomic condensate inside an optical cavity. Here we report that a classical metamaterial nanostructure, a two-dimensional array of plasmonic metamolecules supported on flexible nanowires, can be driven to a state possessing all of the key features of a continuous time crystal: continuous coherent illumination by light resonant with the metamolecules' plasmonic mode triggers a spontaneous phase transition to a superradiant-like state of transmissivity oscillations, resulting from many-body interactions among the metamolecules, characterized by long-range order in space and time. The phenomenon is of interest to the study of dynamic classical many-body states in the strongly correlated regime and applications in all-optical modulation, frequency conversion and timing.
Submitted 30 January, 2023; v1 submitted 1 September, 2022;
originally announced September 2022.
-
Liouville Theorem on Ricci shrinkers with constant scalar curvature and its application
Authors:
Weixiong Mai,
Jianyu Ou
Abstract:
In this paper we consider harmonic functions on gradient shrinking Ricci solitons with constant scalar curvature. A Liouville theorem is proved without using gradient estimates: any bounded harmonic function is constant on gradient shrinking Ricci solitons with constant scalar curvature. As an application, we show that the space of harmonic functions with polynomial growth has finite dimension.
Submitted 15 August, 2022;
originally announced August 2022.
-
Searching for Binary Asteroids in Pan-STARRS1 Archival Images
Authors:
James Ou,
Christoph Baranec,
Schelte J. Bus
Abstract:
We developed two different point spread function (PSF) analysis techniques for discovering wide separation binary asteroids in wide field surveys. We then applied these techniques to images of main belt asteroids in the 4 to 60 km size range captured by Pan-STARRS1. Johnston (2019) lists fewer than 10 known binaries in this size range with separations greater than 10% of the primary's Hill radius, so discovering more wide binary asteroids is crucial for understanding the limits of binary stability and improving our knowledge of asteroid masses. We analyzed each image by: i) comparing the major axis orientation of the asteroid's elliptical PSF to its non-sidereal rate on the sky, and ii) comparing the one-dimensional median profile created by collapsing the image along the asteroid's direction of motion to that of nearby field stars. For both methods, we flagged any results that deviated significantly from the expected measurements of single asteroids, and those targets with the most flags were identified as binary candidates for confirmation with high-acuity imaging.
Submitted 20 June, 2022;
originally announced June 2022.
-
ET White Paper: To Find the First Earth 2.0
Authors:
Jian Ge,
Hui Zhang,
Weicheng Zang,
Hongping Deng,
Shude Mao,
Ji-Wei Xie,
Hui-Gen Liu,
Ji-Lin Zhou,
Kevin Willis,
Chelsea Huang,
Steve B. Howell,
Fabo Feng,
Jiapeng Zhu,
Xinyu Yao,
Beibei Liu,
Masataka Aizawa,
Wei Zhu,
Ya-Ping Li,
Bo Ma,
Quanzhi Ye,
Jie Yu,
Maosheng Xiang,
Cong Yu,
Shangfei Liu,
Ming Yang
, et al. (142 additional authors not shown)
Abstract:
We propose to develop a wide-field and ultra-high-precision photometric survey mission, temporarily named "Earth 2.0 (ET)". This mission is designed to measure, for the first time, the occurrence rate and the orbital distributions of Earth-sized planets. ET consists of seven 30cm telescopes, to be launched to the Earth-Sun's L2 point. Six of these are transit telescopes with a field of view of 500 square degrees. Staring in the direction that encompasses the original Kepler field for four continuous years, this monitoring will return tens of thousands of transiting planets, including the elusive Earth twins orbiting solar-type stars. The seventh telescope is a 30cm microlensing telescope that will monitor an area of 4 square degrees toward the galactic bulge. This, combined with simultaneous ground-based KMTNet observations, will measure masses for hundreds of long-period and free-floating planets. Together, the transit and the microlensing telescopes will revolutionize our understandings of terrestrial planets across a large swath of orbital distances and free space. In addition, the survey data will also facilitate studies in the fields of asteroseismology, Galactic archeology, time-domain sciences, and black holes in binaries.
Submitted 14 June, 2022;
originally announced June 2022.
-
Ballistic Dynamics of Flexural Thermal Movements in a Nano-membrane Revealed with Subatomic Resolution
Authors:
Tongjun Liu,
Jun-Yu Ou,
Nikitas Papasimakis,
Kevin F. MacDonald,
Vitalyi E. Gusev,
Nikolay I. Zheludev
Abstract:
Flexural oscillations of free-standing films, nano-membranes and nano-wires are attracting growing attention for their importance to the thermal, electrical and mechanical properties of 2D materials. Here we report on the observation of short-timescale ballistic motion in the flexural mode of a nano-membrane cantilever, driven by thermal fluctuation of flexural phonons, including measurements of ballistic velocities and displacements performed with sub-atomic resolution, using a new free electron edge-scattering technique. Within intervals <10 μs, the membrane moves ballistically at a constant velocity, typically ~300 μm/s, while Brownian-like dynamics emerge for longer observation periods. Access to the ballistic regime provides verification of the equipartition theorem and Maxwell-Boltzmann statistics for flexural modes, and can be used in fast thermometry and mass sensing during atomic absorption/desorption processes on the membrane. We argue that the ballistic regime should be accounted for in understanding the electrical, optical, thermal and mechanical properties of 2D materials.
Submitted 3 May, 2022;
originally announced May 2022.
-
Picophotonics -- Subatomic Optical Localization Beyond Thermal Fluctuations
Authors:
Tongjun Liu,
Cheng-Hung Chi,
Jun-Yu Ou,
Jie Xu,
Eng Aik Chan,
Kevin F. MacDonald,
Nikolay I. Zheludev
Abstract:
Despite recent tremendous progress in optical imaging and metrology, the resolution gap between atomic scale transmission electron microscopy and optical techniques has not been closed. Is optical imaging and metrology of nanostructures exhibiting Brownian motion possible with resolution beyond thermal fluctuations? Here we report on an experiment in which the average position of a nanowire with a thermal oscillation amplitude of ~150 pm is resolved in single-shot measurements with precision of 92 pm using light at a wavelength of λ = 488 nm, providing the first example of such sub-Brownian metrology with ~λ/5,300 precision. To localize the nanowire, we employ a deep learning analysis of the scattering of topologically structured light, which is highly sensitive to the nanowire's position. As a non-invasive optical metrology with sub-Brownian absolute errors, down to a fraction of the typical size of an atom (Si: 220 pm diameter), it opens the exciting field of picophotonics.
Submitted 30 January, 2023; v1 submitted 3 May, 2022;
originally announced May 2022.
-
An Adaptive Optics Census of Companions to Northern Stars Within 25 pc with Robo-AO
Authors:
Maissa Salama,
Carl Ziegler,
Christoph Baranec,
Michael C. Liu,
Nicholas M. Law,
Reed Riddle,
Todd J. Henry,
Jennifer G. Winters,
Wei-Chun Jao,
James Ou,
Arcelia Hermosillo Ruiz
Abstract:
In order to assess the multiplicity statistics of stars across spectral types and populations in a volume-limited sample, we censused nearby stars for companions with Robo-AO. We report on observations of 1157 stars of all spectral types within 25 pc with decl. $>-13^{\circ}$ searching for tight companions. We detected 154 companion candidates with separations ranging from $\sim$0.15$''$ to 4.0$''$ and magnitude differences up to $Δ$m$_{\textit{i'}}\le$7 using the robotic adaptive optics instrument Robo-AO. We confirmed physical association from Gaia EDR3 astrometry for 53 of the companion candidates, 99 remain to be confirmed, and 2 were ruled out as background objects. We complemented the high-resolution imaging companion search with a search for co-moving objects with separations out to 10,000 AU in Gaia EDR3, which resulted in an additional 147 companions registered. Of the 301 total companions reported in this study, 49 of them are new discoveries. Out of the 191 stars with significant acceleration measurements in the Hipparcos-Gaia catalog of accelerations, we detect companions around 115 of them, with the significance of the acceleration increasing as the companion separation decreases. From this survey, we report the following multiplicity fractions (compared to literature values): 40.9%$\pm$3.0% (44%) for FGK stars and 28.2%$\pm$2.3% (27%) for M stars, as well as higher-order fractions of 5.5%$\pm$1.1% (11%) and 3.9%$\pm$0.9% (5%) for FGK stars and M type stars, respectively.
Submitted 21 March, 2022;
originally announced March 2022.
-
Difference of photometric properties between regular and non-regular Miras in the Magellanic Clouds
Authors:
Jia-Yu Ou,
Chow-Choong Ngeow
Abstract:
Mira variables are asymptotic giant branch pulsating stars with long pulsation periods and large amplitudes in optical bands. By applying the random forest algorithm to the I-band light curves for the Miras in the Magellanic Clouds, we have classified these Miras into regular Miras and non-regular Miras, where non-regular Miras exhibit a long-term variation in addition to their primary pulsation periods. Our results confirm that the period-luminosity relation for maximum light has a smaller dispersion, but only for the regular oxygen-rich (O-rich) Miras, which we recommend applying in future distance scale work. We have also collected multi-band photometry for these Miras to perform a spectral-energy-distribution (SED) fitting with stellar and dust components, showing that a significant fraction of dust is present around the non-regular Miras. According to our results, we believe that the periodic long-term variations seen in the non-regular Miras might be due to the presence of dust.
Submitted 2 March, 2022;
originally announced March 2022.
-
Influence maximization under limited network information: Seeding high-degree neighbors
Authors:
Jiamin Ou,
Vincent Buskens,
Arnout Van De Rijt,
Debabrata Panja
Abstract:
The diffusion of information, norms, and practices across a social network can be initiated by compelling a small number of seed individuals to adopt first. Strategies proposed in previous work assume either full network information or a large degree of control over what information is collected. However, privacy settings on the Internet and high non-response in surveys often severely limit available connectivity information. Here we propose a seeding strategy for scenarios with limited network information: Only the degrees and connections of some random nodes are known. This new strategy is a modification of "random neighbor sampling" and seeds the highest-degree neighbors of randomly selected nodes. In simulations of a linear threshold model on a range of synthetic and real-world networks, we find that this new strategy outperforms other seeding strategies, including high-degree seeding and clustered seeding.
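The seeding rule described in this abstract (seed the highest-degree neighbor of each randomly selected node) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names, the adjacency-dict representation, the alphabetical tie-breaking, and the toy graph are all assumptions made for the example.

```python
import random


def seed_high_degree_neighbors(adj, sampled_nodes, budget):
    """For each sampled node, seed its highest-degree neighbor.

    adj: dict mapping node -> set of neighbor nodes (hypothetical format).
    sampled_nodes: the randomly selected nodes whose neighborhoods are known.
    budget: maximum number of distinct seeds to return.
    """
    seeds = []
    for node in sampled_nodes:
        neighbors = adj.get(node, ())
        if not neighbors:
            continue  # isolated node: nothing to seed
        # Ties are broken alphabetically (an assumption of this sketch).
        best = max(sorted(neighbors), key=lambda n: len(adj[n]))
        if best not in seeds:
            seeds.append(best)
        if len(seeds) == budget:
            break
    return seeds


def sample_and_seed(adj, n_samples, budget, rng=None):
    """Draw n_samples random nodes, then apply the neighbor-seeding rule."""
    rng = rng or random.Random()
    sampled = rng.sample(sorted(adj), n_samples)
    return seed_high_degree_neighbors(adj, sampled, budget)


# Toy graph: one hub connected to four nodes, plus one extra edge c-d.
adj = {
    "hub": {"a", "b", "c", "d"},
    "a": {"hub"},
    "b": {"hub"},
    "c": {"hub", "d"},
    "d": {"hub", "c"},
}

seeds_from_leaves = seed_high_degree_neighbors(adj, ["a", "b"], budget=2)
seeds_from_hub = seed_high_degree_neighbors(adj, ["hub"], budget=2)
```

Sampling low-degree nodes still reaches the hub, which is the intuition behind neighbor-based sampling: a random node's neighbors tend to have higher degree than the node itself.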
Submitted 8 February, 2022;
originally announced February 2022.
-
ESAN: Efficient Sentiment Analysis Network of A-Shares Research Reports for Stock Price Prediction
Authors:
Tuo Sun,
Wanrong Zheng,
Shufan Yu,
Mengxun Li,
Jiarui Ou
Abstract:
In this paper, we develop a natural language processing model to help predict stocks in the long term. The whole network includes two modules. The first module is a natural language processing model which seeks out reliable factors from input reports, while the other is a time-series forecasting model which takes the factors as input and aims to predict stock earnings yield. To indicate the efficiency of our model in combining the sentiment analysis module and the time-series forecasting module, we name our method ESAN.
Submitted 2 December, 2021;
originally announced December 2021.
-
Volatile optical bistability enabled by mechanical nonlinearity
Authors:
Dimitrios Papas,
Jun-Yu Ou,
Eric Plum,
Nikolay I. Zheludev
Abstract:
Optical devices with metastable states controlled with light (optical flip-flops) are needed in data storage, signal processing and displays. Although non-volatile optical memory relying on structural phase transitions in chalcogenide glasses has been widely used for optical data storage, beyond that, weak optical nonlinearities have hindered the development of low-power bistable devices. Here we report on a new type of volatile optical bistability in a resonant hybrid nano-optomechanical device comprising a pair of anchored nanowires decorated with plasmonic metamolecules. The nonlinearity resides in the mechanical properties of the nanowires and is transduced to the device's optical response by reconfiguring the plasmonic metamolecules. Such a system can be driven to a bistable response by acoustic signals modulated at the natural mechanical resonance of the nanowire. The memory of such a device is volatile and can be erased by removing the acoustic signal, but in its presence it can be switched between bistable optical states with microwatts of optical power. We argue that the demonstration of hybrid nano-optomechanical bistability opens new opportunities to develop practical low-power bistable devices.
Submitted 21 December, 2021;
originally announced December 2021.
-
Three Balls Theorem for Eigenfunctions of Dirac Operator in Clifford Analysis
Authors:
Weixiong Mai,
Jianyu Ou
Abstract:
In this paper we establish the three balls theorem for functions $u$ satisfying $Du=λu$ in Clifford analysis, where $D$ is the Dirac operator. As an application, we generalize Hadamard's three circles theorem to monogenic functions in $\mathbb R^{n+1}.$
Submitted 13 April, 2023; v1 submitted 28 October, 2021;
originally announced October 2021.
-
An Adaptive Sampling and Edge Detection Approach for Encoding Static Images for Spiking Neural Networks
Authors:
Peyton Chandarana,
Junlin Ou,
Ramtin Zand
Abstract:
Current state-of-the-art methods of image classification using convolutional neural networks are often constrained by both latency and power consumption. This places a limit on the devices, particularly low-power edge devices, that can employ these methods. Spiking neural networks (SNNs) are considered to be the third generation of artificial neural networks, which aim to address these latency and power constraints by taking inspiration from biological neuronal communication processes. Before data such as images can be input into an SNN, however, they must first be encoded into spike trains. Herein, we propose a method for encoding static images into temporal spike trains using edge detection and an adaptive signal sampling method for use in SNNs. The edge detection process consists of first performing Canny edge detection on the 2D static images and then converting the edge-detected images into two signals, X and Y, using an image-to-signal conversion method. The adaptive signaling approach consists of sampling the signals such that they maintain enough detail and are sensitive to abrupt changes in the signal. Temporal encoding mechanisms such as threshold-based representation (TBR) and step-forward (SF) can then be used to convert the sampled signals into spike trains. We use various error and indicator metrics to optimize and evaluate the efficiency and precision of the proposed image encoding approach. Comparisons between the original signals and the signals reconstructed from spike trains generated using edge detection and the adaptive temporal encoding mechanism show 18x and 7x reductions in average root mean square error (RMSE) compared to conventional SF and TBR encoding, respectively, when used to encode the MNIST dataset.
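For readers unfamiliar with step-forward (SF) encoding, the following is a minimal sketch of the scheme as it is commonly described: a moving baseline is compared against each sample, and a positive or negative spike is emitted whenever the signal moves more than a threshold away from it. This is not the adaptive sampling method proposed in the paper; the function names, the fixed threshold, and the example signal are assumptions chosen for illustration.

```python
def step_forward_encode(signal, threshold):
    """Encode a 1D signal as a train of +1 / -1 / 0 spikes (SF scheme)."""
    if not signal:
        return []
    baseline = signal[0]
    spikes = [0]  # no spike for the first sample; it only sets the baseline
    for x in signal[1:]:
        if x > baseline + threshold:
            spikes.append(1)       # signal rose by more than the threshold
            baseline += threshold
        elif x < baseline - threshold:
            spikes.append(-1)      # signal fell by more than the threshold
            baseline -= threshold
        else:
            spikes.append(0)
    return spikes


def step_forward_decode(spikes, threshold, start):
    """Reconstruct an approximation of the signal from its spike train."""
    value = start
    reconstructed = []
    for s in spikes:
        value += s * threshold
        reconstructed.append(value)
    return reconstructed


# Hypothetical example signal and threshold.
signal = [0.0, 0.3, 0.9, 1.4, 1.0, 0.2]
spikes = step_forward_encode(signal, threshold=0.5)
approx = step_forward_decode(spikes, threshold=0.5, start=signal[0])
```

The reconstruction error between `signal` and `approx` is the kind of quantity an RMSE comparison such as the one in the abstract would measure.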
Submitted 19 October, 2021;
originally announced October 2021.
-
Boost Neural Networks by Checkpoints
Authors:
Feng Wang,
Guoyizhe Wei,
Qiao Liu,
Jinxiang Ou,
Xian Wei,
Hairong Lv
Abstract:
Training multiple deep neural networks (DNNs) and averaging their outputs is a simple way to improve predictive performance. Nevertheless, the multiplied training cost prevents this ensemble method from being practical and efficient. Several recent works attempt to save and ensemble the checkpoints of DNNs, which requires only the same computational cost as training a single network. However, these methods suffer from either marginal accuracy improvements due to the low diversity of checkpoints or a high risk of divergence due to the cyclical learning rates they adopt. In this paper, we propose a novel method to ensemble the checkpoints, where a boosting scheme is utilized to accelerate model convergence and maximize the checkpoint diversity. We theoretically prove that it converges by reducing exponential loss. The empirical evaluation also indicates that our proposed ensemble outperforms a single model and existing ensembles in terms of accuracy and efficiency. With the same training budget, our method achieves 4.16% lower error on CIFAR-100 and 6.96% on Tiny-ImageNet with the ResNet-110 architecture. Moreover, the adaptive sample weights in our method make it an effective solution for imbalanced class distributions. In the experiments, it yields up to 5.02% higher accuracy than a single EfficientNet-B0 on the imbalanced datasets.
Submitted 25 October, 2021; v1 submitted 3 October, 2021;
originally announced October 2021.
-
Searching for orbital decay in a heartbeat star system KIC 3766353
Authors:
Jian-Wen Ou,
Cong Yu,
Chen Jiang,
Ming Yang,
Hubiao Niu
Abstract:
Theory suggests that the orbits of a large fraction of binary systems, including planet-star binary systems, shrink by a few orders of magnitude after formation. But so far, only one hot Jupiter with tidally-driven orbital decay has been found by transit timing variations. We propose to search for orbital decay companions in heartbeat star systems because the orbital angular momentum is effectively transferred to the host star, causing tidal dissipation. KIC 3766353 is one of the heartbeat stars with tidally excited oscillations. We acquired the primary and the secondary eclipse time variations from the \textit{Kepler} photometric light curves. Timing analysis shows that KIC 3766353 is a hierarchical triple system with a hidden third body and a red dwarf (mass $0.35\ M_{\odot}$, radius $0.34\ R_{\odot}$) in its inner orbit. The minimum mass of the third body is $\sim 0.26 \ M_{\odot}$, and the distance from the inner orbit is $\sim 111.4 \ R_{\odot}$. The period decay rate of the red dwarf is approximately 358 ms yr$^{-1}$. The combined effects of the light-travel time and the orbital decay lead to the observed timing variations. Future monitoring with long time-baseline observations is required to delve into the contributions of these two effects.
Submitted 25 September, 2021;
originally announced September 2021.
-
Constructing Emotion Consensus and Utilizing Unpaired Data for Empathetic Dialogue Generation
Authors:
Lei Shen,
Jinchao Zhang,
Jiao Ou,
Xiaofang Zhao,
Jie Zhou
Abstract:
Research on dialogue empathy aims to endow an agent with the capacity to understand emotions accurately and respond to them properly. Existing models for empathetic dialogue generation focus on the emotion flow in one direction, that is, from the context to the response. We argue that conducting an empathetic conversation is a bidirectional process, where empathy occurs when the emotions of two interlocutors converge on the same point, i.e., reach an emotion consensus. In addition, we find that the empathetic dialogue corpus is extremely limited, which further restricts model performance. To address the above issues, we propose a dual-generative model, Dual-Emp, to simultaneously construct the emotion consensus and utilize external unpaired data. Specifically, our model integrates a forward dialogue model, a backward dialogue model, and a discrete latent variable representing the emotion consensus into a unified architecture. Then, to alleviate the constraint of paired data, we extract unpaired emotional data from open-domain conversations and employ Dual-Emp to produce pseudo-paired empathetic samples, which is more efficient and lower-cost than human annotation. Automatic and human evaluations demonstrate that our method outperforms competitive baselines in producing coherent and empathetic responses.
Submitted 18 September, 2021; v1 submitted 16 September, 2021;
originally announced September 2021.
-
The Measurement of Dynamic Tidal Contribution to Apsidal Motion in Heartbeat Star KIC 4544587
Authors:
Jian-wen Ou,
Cong Yu,
Ming Yang,
Chen Jiang,
Bo Ma,
Guan-fu Liu,
Shang-fei Liu,
Juan-juan Luo
Abstract:
Apsidal motion is a gradual shift in the position of periastron. The impact of dynamic tides on apsidal motion has long been debated, because the contribution could not be quantified due to the lack of high quality observations. KIC 4544587, which shows tidally excited oscillations, has been observed by \textit{Kepler} with high-precision photometry over a long time baseline in short-cadence mode. In this paper, we compute the rate of apsidal motion that arises from the dynamic tides as $19.05\pm 1.70$ mrad yr$^{-1}$ by tracking the orbital phase shifts of tidally excited oscillations. We also calculate the precession rate of the orbit due to the Newtonian and general relativistic contributions as $21.49 \pm 2.8$ and $2.4 \pm 0.06$ mrad yr$^{-1}$, respectively. The sum of these three factors is in excellent agreement with the total observational rate of apsidal motion $42.97 \pm 0.18$ mrad yr$^{-1}$ measured by eclipse timing variations. The tidal effect accounts for about 44\% of the overall observed apsidal motion and is comparable to that of the Newtonian term. Dynamic tides thus make a significant contribution to the apsidal motion. The analysis method presented in this paper offers an alternative approach to measuring the contribution of the dynamic tides quantitatively.
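The arithmetic behind the quoted agreement can be checked directly from the numbers in the abstract. The short sketch below sums the three contributions and computes the tidal fraction; combining the uncertainties in quadrature assumes the three terms are independent, which is an assumption of this example and not something the abstract states. Variable names are illustrative.

```python
import math

# (value, uncertainty) in mrad per year, taken from the abstract.
dynamic_tides = (19.05, 1.70)
newtonian = (21.49, 2.8)
relativistic = (2.4, 0.06)
observed = (42.97, 0.18)

terms = [dynamic_tides, newtonian, relativistic]

# Sum of the three modeled contributions.
total = sum(value for value, _ in terms)

# Quadrature sum of uncertainties (assumes independent errors).
total_err = math.sqrt(sum(err ** 2 for _, err in terms))

# Share of the observed apsidal motion attributed to dynamic tides.
tidal_fraction = dynamic_tides[0] / observed[0]

print(f"modeled total: {total:.2f} +/- {total_err:.2f} mrad/yr")
print(f"observed:      {observed[0]:.2f} +/- {observed[1]:.2f} mrad/yr")
print(f"tidal share:   {100 * tidal_fraction:.0f}%")
```

The modeled sum lands within 0.03 mrad yr$^{-1}$ of the observed rate, well inside the combined uncertainty, and the tidal share reproduces the quoted figure of about 44%.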
Submitted 2 September, 2021; v1 submitted 1 September, 2021;
originally announced September 2021.
-
Hidden dependence of spreading vulnerability on topological complexity
Authors:
Mark M. Dekker,
Raoul D. Schram,
Jiamin Ou,
Debabrata Panja
Abstract:
Many dynamical phenomena in complex systems concern spreading that plays out on top of networks with changing architecture over time -- commonly known as temporal networks. A complex system's proneness to facilitate spreading phenomena, which we abbreviate as its `spreading vulnerability', is often surmised to be related to the topology of the temporal network featured by the system. Yet, cleanly extracting spreading vulnerability of a complex system directly from the topological information of the temporal network remains a challenge. Here, using data from a diverse set of real-world complex systems, we develop the `entropy of temporal entanglement' as a novel and insightful quantity to measure topological complexities of temporal networks. We show that this parameter-free quantity naturally allows for topological comparisons across vastly different complex systems. Importantly, by simulating three different types of stochastic dynamical processes playing out on top of temporal networks, we demonstrate that the entropy of temporal entanglement serves as a quantitative embodiment of the systems' spreading vulnerability, irrespective of the details of the processes. In being able to do so, i.e., in being able to quantitatively extract a complex system's proneness to facilitate spreading phenomena from topology, this entropic measure opens itself for applications in a wide variety of natural, social, biological and engineered systems.
Submitted 14 April, 2022; v1 submitted 4 July, 2021;
originally announced July 2021.
-
Quantifying agent impacts on contact sequences in social interactions
Authors:
Mark M. Dekker,
Tessa F. Blanken,
Fabian Dablander,
Jiamin Ou,
Denny Borsboom,
Debabrata Panja
Abstract:
Human social behavior plays a crucial role in how pathogens like SARS-CoV-2 or fake news spread in a population. Social interactions determine the contact network among individuals, while spreading, requiring individual-to-individual transmission, takes place on top of the network. Studying the topological aspects of a contact network, therefore, not only has the potential of leading to valuable insights into how the behavior of individuals impacts spreading phenomena, but it may also open up possibilities for devising effective behavioral interventions. Because of the temporal nature of interactions - since the topology of the network, containing who is in contact with whom, when, for how long, and in which precise sequence, varies (rapidly) in time - analyzing them requires developing network methods and metrics that respect temporal variability, in contrast to those developed for static (i.e., time-invariant) networks. Here, by means of event mapping, we propose a method to quantify how quickly agents mingle by transforming temporal network data of agent contacts. We define a novel measure called 'contact sequence centrality', which quantifies the impact of an individual on the contact sequences, reflecting the individual's behavioral potential for spreading. Comparing contact sequence centrality across agents allows for ranking the impact of agents and identifying potential 'behavioral super-spreaders'. The method is applied to social interaction data collected at an art fair in Amsterdam. We relate the measure to the existing network metrics, both temporal and static, and find that (mostly at longer time scales) traditional metrics lose their resemblance to contact sequence centrality. Our work highlights the importance of accounting for the sequential nature of contacts when analyzing social interactions.
Submitted 14 April, 2022; v1 submitted 3 July, 2021;
originally announced July 2021.
-
Exploring Discourse Structures for Argument Impact Classification
Authors:
Xin Liu,
Jiefu Ou,
Yangqiu Song,
Xin Jiang
Abstract:
Discourse relations among arguments reveal logical structures of a debate conversation. However, no prior work has explicitly studied how the sequence of discourse relations influences a claim's impact. This paper empirically shows that the discourse relations between two arguments along the context path are essential factors for identifying the persuasive power of an argument. We further propose DisCOC to inject and fuse the sentence-level structural discourse information with contextualized features derived from large-scale language models. Experimental results and extensive analysis show that the attention and gate mechanisms that explicitly model contexts and texts can indeed help the argument impact classification task defined by Durmus et al. (2019), and discourse structures along the context path of the claim to be classified can further boost the performance.
Submitted 2 June, 2021;
originally announced June 2021.
-
Full-Resolution Encoder-Decoder Networks with Multi-Scale Feature Fusion for Human Pose Estimation
Authors:
Jie Ou,
Mingjian Chen,
Hong Wu
Abstract:
To achieve more accurate 2D human pose estimation, we extend the successful encoder-decoder network, simple baseline network (SBN), in three ways. To reduce the quantization errors caused by the large output stride size, two more decoder modules are appended to the end of the simple baseline network to get full output resolution. Then, the global context blocks (GCBs) are added to the encoder and decoder modules to enhance them with global context features. Furthermore, we propose a novel spatial-attention-based multi-scale feature collection and distribution module (SA-MFCD) to fuse and distribute multi-scale features to boost the pose estimation. Experimental results on the MS COCO dataset indicate that our network can remarkably improve the accuracy of human pose estimation over SBN. Our network using ResNet34 as the backbone can even achieve the same accuracy as SBN with ResNet152, and our networks achieve superior results with larger backbone networks.
Submitted 1 June, 2021;
originally announced June 2021.