Search | arXiv e-print repository

XENONnT WIMP Search: Signal & Background Modeling and Statistical Inference

Authors: XENON Collaboration, E. Aprile, J. Aalbers, K. Abe, S. Ahmed Maouloud, L. Althueser, B. Andrieu, E. Angelino, D. Antón Martin, F. Arneodo, L. Baudis, M. Bazyk, L. Bellagamba, R. Biondi, A. Bismark, K. Boese, A. Brown, G. Bruno, R. Budnik, J. M. R. Cardoso, A. P. Cimental Chávez, A. P. Colijn, J. Conrad, J. J. Cuenca-García, V. D'Andrea , et al. (139 additional authors not shown)

Abstract: The XENONnT experiment searches for weakly-interacting massive particle (WIMP) dark matter scattering off a xenon nucleus. In particular, XENONnT uses a dual-phase time projection chamber with a 5.9-tonne liquid xenon target, detecting both scintillation and ionization signals to reconstruct the energy, position, and type of recoil. A blind search for nuclear recoil WIMPs with an exposure of 1.1 t… ▽ More The XENONnT experiment searches for weakly-interacting massive particle (WIMP) dark matter scattering off a xenon nucleus. In particular, XENONnT uses a dual-phase time projection chamber with a 5.9-tonne liquid xenon target, detecting both scintillation and ionization signals to reconstruct the energy, position, and type of recoil. A blind search for nuclear recoil WIMPs with an exposure of 1.1 tonne-years yielded no signal excess over background expectations, from which competitive exclusion limits were derived on WIMP-nucleon elastic scatter cross sections, for WIMP masses ranging from 6 GeV/$c^2$ up to the TeV/$c^2$ scale. This work details the modeling and statistical methods employed in this search. By means of calibration data, we model the detector response, which is then used to derive background and signal models. The construction and validation of these models is discussed, alongside additional purely data-driven backgrounds. We also describe the statistical inference framework, including the definition of the likelihood function and the construction of confidence intervals. △ Less

Submitted 19 June, 2024; originally announced June 2024.

Comments: 20 pages, 10 figures

arXiv:2406.02096 [pdf, other]

MS-Mapping: Multi-session LiDAR Mapping with Wasserstein-based Keyframe Selection

Authors: Xiangcheng Hu, Jin Wu, Jianhao Jiao, Wei Zhang, Ping Tan

Abstract: Large-scale multi-session LiDAR mapping plays a crucial role in various applications but faces significant challenges in data redundancy and pose graph scalability. This paper present MS-Mapping, a novel multi-session LiDAR mapping system that combines an incremental mapping scheme with support for various LiDAR-based odometry, enabling high-precision and consistent map assembly in large-scale env… ▽ More Large-scale multi-session LiDAR mapping plays a crucial role in various applications but faces significant challenges in data redundancy and pose graph scalability. This paper present MS-Mapping, a novel multi-session LiDAR mapping system that combines an incremental mapping scheme with support for various LiDAR-based odometry, enabling high-precision and consistent map assembly in large-scale environments. Our approach introduces a real-time keyframe selection method based on the Wasserstein distance, which effectively reduces data redundancy and pose graph complexity. We formulate the LiDAR point cloud keyframe selection problem using a similarity method based on Gaussian mixture models (GMM) and tackle the real-time challenge by employing an incremental voxel update method. Extensive experiments on large-scale campus scenes and over \SI{12.8}{km} of public and self-collected datasets demonstrate the efficiency, accuracy, and consistency of our map assembly approach. To facilitate further research and development in the community, we make our code https://github.com/JokerJohn/MS-Mapping and datasets publicly available. △ Less

Submitted 4 June, 2024; originally announced June 2024.

Comments: 5 pages, 4 figures

arXiv:2406.01467 [pdf, other]

RaDe-GS: Rasterizing Depth in Gaussian Splatting

Authors: Baowen Zhang, Chuan Fang, Rakesh Shrestha, Yixun Liang, Xiaoxiao Long, Ping Tan

Abstract: Gaussian Splatting (GS) has proven to be highly effective in novel view synthesis, achieving high-quality and real-time rendering. However, its potential for reconstructing detailed 3D shapes has not been fully explored. Existing methods often suffer from limited shape accuracy due to the discrete and unstructured nature of Gaussian splats, which complicates the shape extraction. While recent tech… ▽ More Gaussian Splatting (GS) has proven to be highly effective in novel view synthesis, achieving high-quality and real-time rendering. However, its potential for reconstructing detailed 3D shapes has not been fully explored. Existing methods often suffer from limited shape accuracy due to the discrete and unstructured nature of Gaussian splats, which complicates the shape extraction. While recent techniques like 2D GS have attempted to improve shape reconstruction, they often reformulate the Gaussian primitives in ways that reduce both rendering quality and computational efficiency. To address these problems, our work introduces a rasterized approach to render the depth maps and surface normal maps of general 3D Gaussian splats. Our method not only significantly enhances shape reconstruction accuracy but also maintains the computational efficiency intrinsic to Gaussian Splatting. It achieves a Chamfer distance error comparable to NeuraLangelo on the DTU dataset and maintains similar computational efficiency as the original 3D GS methods. Our method is a significant advancement in Gaussian Splatting and can be directly integrated into existing Gaussian Splatting-based methods. △ Less

Submitted 24 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

arXiv:2405.14979 [pdf, other]

CraftsMan: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner

Authors: Weiyu Li, Jiarui Liu, Rui Chen, Yixun Liang, Xuelin Chen, Ping Tan, Xiaoxiao Long

Abstract: We present a novel generative 3D modeling system, coined CraftsMan, which can generate high-fidelity 3D geometries with highly varied shapes, regular mesh topologies, and detailed surfaces, and, notably, allows for refining the geometry in an interactive manner. Despite the significant advancements in 3D generation, existing methods still struggle with lengthy optimization processes, irregular mes… ▽ More We present a novel generative 3D modeling system, coined CraftsMan, which can generate high-fidelity 3D geometries with highly varied shapes, regular mesh topologies, and detailed surfaces, and, notably, allows for refining the geometry in an interactive manner. Despite the significant advancements in 3D generation, existing methods still struggle with lengthy optimization processes, irregular mesh topologies, noisy surfaces, and difficulties in accommodating user edits, consequently impeding their widespread adoption and implementation in 3D modeling software. Our work is inspired by the craftsman, who usually roughs out the holistic figure of the work first and elaborates the surface details subsequently. Specifically, we employ a 3D native diffusion model, which operates on latent space learned from latent set-based 3D representations, to generate coarse geometries with regular mesh topology in seconds. In particular, this process takes as input a text prompt or a reference image and leverages a powerful multi-view (MV) diffusion model to generate multiple views of the coarse geometry, which are fed into our MV-conditioned 3D diffusion model for generating the 3D geometry, significantly improving robustness and generalizability. Following that, a normal-based geometry refiner is used to significantly enhance the surface details. This refinement can be performed automatically, or interactively with user-supplied edits. Extensive experiments demonstrate that our method achieves high efficacy in producing superior-quality 3D assets compared to existing methods. HomePage: https://craftsman3d.github.io/, Code: https://github.com/wyysf-98/CraftsMan △ Less

Submitted 23 May, 2024; originally announced May 2024.

Comments: HomePage: https://craftsman3d.github.io/, Code: https://github.com/wyysf-98/CraftsMan

arXiv:2405.14198 [pdf, other]

Enabling Sustainable Freight Forwarding Network via Collaborative Games

Authors: Pang-Jin Tan, Shih-Fen Cheng, Richard Chen

Abstract: Freight forwarding plays a crucial role in facilitating global trade and logistics. However, as the freight forwarding market is extremely fragmented, freight forwarders often face the issue of not being able to fill the available shipping capacity. This recurrent issue motivates the creation of various freight forwarding networks that aim at exchanging capacities and demands so that the resource… ▽ More Freight forwarding plays a crucial role in facilitating global trade and logistics. However, as the freight forwarding market is extremely fragmented, freight forwarders often face the issue of not being able to fill the available shipping capacity. This recurrent issue motivates the creation of various freight forwarding networks that aim at exchanging capacities and demands so that the resource utilization of individual freight forwarders can be maximized. In this paper, we focus on how to design such a collaborative network based on collaborative game theory, with the Shapley value representing a fair scheme for profit sharing. Noting that the exact computation of Shapley values is intractable for large-scale real-world scenarios, we incorporate the observation that collaboration among two forwarders is only possible if their service routes and demands overlap. This leads to a new class of collaborative games called the Locally Collaborative Games (LCGs), where agents can only collaborate with their neighbors. We propose an efficient approach to compute Shapley values for LCGs, and numerically demonstrate that our approach significantly outperforms the state-of-the-art approach for a wide variety of network structures. △ Less

Submitted 23 May, 2024; originally announced May 2024.

Comments: Accepted to the 33rd International Joint Conference on Artificial Intelligence (IJCAI-24)

arXiv:2405.11616 [pdf, other]

Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention

Authors: Peng Li, Yuan Liu, Xiaoxiao Long, Feihu Zhang, Cheng Lin, Mengfei Li, Xingqun Qi, Shanghang Zhang, Wenhan Luo, Ping Tan, Wenping Wang, Qifeng Liu, Yike Guo

Abstract: In this paper, we introduce Era3D, a novel multiview diffusion method that generates high-resolution multiview images from a single-view image. Despite significant advancements in multiview generation, existing methods still suffer from camera prior mismatch, inefficacy, and low resolution, resulting in poor-quality multiview images. Specifically, these methods assume that the input images should… ▽ More In this paper, we introduce Era3D, a novel multiview diffusion method that generates high-resolution multiview images from a single-view image. Despite significant advancements in multiview generation, existing methods still suffer from camera prior mismatch, inefficacy, and low resolution, resulting in poor-quality multiview images. Specifically, these methods assume that the input images should comply with a predefined camera type, e.g. a perspective camera with a fixed focal length, leading to distorted shapes when the assumption fails. Moreover, the full-image or dense multiview attention they employ leads to an exponential explosion of computational complexity as image resolution increases, resulting in prohibitively expensive training costs. To bridge the gap between assumption and reality, Era3D first proposes a diffusion-based camera prediction module to estimate the focal length and elevation of the input image, which allows our method to generate images without shape distortions. Furthermore, a simple but efficient attention layer, named row-wise attention, is used to enforce epipolar priors in the multiview diffusion, facilitating efficient cross-view information fusion. Consequently, compared with state-of-the-art methods, Era3D generates high-quality multiview images with up to a 512*512 resolution while reducing computation complexity by 12x times. Comprehensive experiments demonstrate that Era3D can reconstruct high-quality and detailed 3D meshes from diverse single-view input images, significantly outperforming baseline multiview diffusion methods. Project page: https://penghtyx.github.io/Era3D/. △ Less

Submitted 29 May, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

arXiv:2405.05814 [pdf]

MSDiff: Multi-Scale Diffusion Model for Ultra-Sparse View CT Reconstruction

Authors: Pinhuang Tan, Mengxiao Geng, Jingya Lu, Liu Shi, Bin Huang, Qiegen Liu

Abstract: Computed Tomography (CT) technology reduces radiation haz-ards to the human body through sparse sampling, but fewer sampling angles pose challenges for image reconstruction. Score-based generative models are widely used in sparse-view CT re-construction, performance diminishes significantly with a sharp reduction in projection angles. Therefore, we propose an ultra-sparse view CT reconstruction me… ▽ More Computed Tomography (CT) technology reduces radiation haz-ards to the human body through sparse sampling, but fewer sampling angles pose challenges for image reconstruction. Score-based generative models are widely used in sparse-view CT re-construction, performance diminishes significantly with a sharp reduction in projection angles. Therefore, we propose an ultra-sparse view CT reconstruction method utilizing multi-scale dif-fusion models (MSDiff), designed to concentrate on the global distribution of information and facilitate the reconstruction of sparse views with local image characteristics. Specifically, the proposed model ingeniously integrates information from both comprehensive sampling and selectively sparse sampling tech-niques. Through precise adjustments in diffusion model, it is capable of extracting diverse noise distribution, furthering the understanding of the overall structure of images, and aiding the fully sampled model in recovering image information more effec-tively. By leveraging the inherent correlations within the projec-tion data, we have designed an equidistant mask, enabling the model to focus its attention more effectively. Experimental re-sults demonstrated that the multi-scale model approach signifi-cantly improved the quality of image reconstruction under ultra-sparse angles, with good generalization across various datasets. △ Less

Submitted 9 May, 2024; originally announced May 2024.

arXiv:2405.03976 [pdf]

Anomalous Gate-tunable Capacitance in Graphene Moiré Heterostructures

Authors: Linshang Chen, Haoran Long, Heng Wu, Rui Mei, Zhengyu Su, Mengjie Feng, Jiang-Bin Wu, Kenji Watanabe, Takashi Taniguchi, Xuewei Cao, Zhongming Wei, Ping-Heng Tan, Yanmeng Shi

Abstract: Interface engineered ferroelectricity in van der Waals heterostructures is of broad interest both fundamentally and technologically for the applications in neuromorphic computing and so on. In particular, the moiré ferroelectricity in graphene/hexagonal boron nitride (hBN) heterostructures driven by charge ordering instead of traditional lattice displacement has drawn considerable attention becaus… ▽ More Interface engineered ferroelectricity in van der Waals heterostructures is of broad interest both fundamentally and technologically for the applications in neuromorphic computing and so on. In particular, the moiré ferroelectricity in graphene/hexagonal boron nitride (hBN) heterostructures driven by charge ordering instead of traditional lattice displacement has drawn considerable attention because of its fascinating properties and promising high-frequency programmable electrical polarization switching. Yet, the underlying mechanism of the electronic ferroelectricity is still under debate. On the other hand, combining the interface engineered ferroelectricity and strong correlations in moiré heterostructures could enable the realization of novel quantum states such as ferroelectric superconductivity and multiferroicity. Here we study the electronic transport properties of twisted double bilayer graphene (TDBLG), aligned with one of the neighbouring hBN. We observe a strong gating hysteresis and ferroelectric-like behaviour, as well as the electronic ratchet effect. We find that the top gate is anomalously screened. On the contrary, the back gate is anomalously doubly efficient in injecting charges into graphene, that is, the effective back gate capacitance is two times larger than its geometry capacitance. This unexpected gate-tunable capacitance causes a dramatic change of electric fields between forward and backward scans. The asymmetric gating behaviours and anomalous change in capacitance could be explained with a simple model involved with a spontaneous electric polarization between top hBN and graphene. Our work provides more insights into the mysterious ferroelectricity in graphene/hBN moiré heterostructures and paves the way to the understanding of the underlying mechanism. △ Less

Submitted 6 May, 2024; originally announced May 2024.

Comments: 20 pages, 13 figures

arXiv:2404.17675 [pdf, other]

Ideal noncrystals: A possible new class of ordered matter without apparent broken symmetry

Authors: Xinyu Fan, Ding Xu, Jianhua Zhang, Hao Hu, Peng Tan, Ning Xu, Hajime Tanaka, Hua Tong

Abstract: Order and disorder constitute two fundamental and opposite themes in condensed matter physics and materials science. Crystals are considered the epitome of order characterized by long-range translational order. The discovery of quasicrystals, with no periodicity but rotational symmetries forbidden for crystals, leads to a paradigm shift in solid-state physics. Moving one step forward, it is intrig… ▽ More Order and disorder constitute two fundamental and opposite themes in condensed matter physics and materials science. Crystals are considered the epitome of order characterized by long-range translational order. The discovery of quasicrystals, with no periodicity but rotational symmetries forbidden for crystals, leads to a paradigm shift in solid-state physics. Moving one step forward, it is intriguing to ask whether ordered matter exists without apparent symmetry breaking. The same question may arise in the pursuit of how ordered amorphous (noncrystalline) solids can be. Here we report the finding of ideal noncrystals in two dimensions, which are disordered in the conventional sense without Bragg peaks but highly ordered according to the steric order. We find that such ideal noncrystals have vibrational modes the same as phonons following the Debye law. The elastic responses are fully affine, which is again characteristic of crystals, and the spatial fluctuations of local volume fractions approach hyperuniformity. Therefore, ideal noncrystals represent an anomalous form of matter with a mixed nature of noncrystalline structure but crystal-like properties. Since such states are found to be thermodynamically favorable, we identify them as a possible new class of ordered matter without apparent broken symmetry. Our results thus extend the scope of the ordered state of matter and may impact the understanding of entropy-driving ordering also in generic amorphous materials. △ Less

Submitted 26 April, 2024; originally announced April 2024.

arXiv:2404.15121 [pdf, other]

Taming Diffusion Probabilistic Models for Character Control

Authors: Rui Chen, Mingyi Shi, Shaoli Huang, Ping Tan, Taku Komura, Xuelin Chen

Abstract: We present a novel character control framework that effectively utilizes motion diffusion probabilistic models to generate high-quality and diverse character animations, responding in real-time to a variety of dynamic user-supplied control signals. At the heart of our method lies a transformer-based Conditional Autoregressive Motion Diffusion Model (CAMDM), which takes as input the character's his… ▽ More We present a novel character control framework that effectively utilizes motion diffusion probabilistic models to generate high-quality and diverse character animations, responding in real-time to a variety of dynamic user-supplied control signals. At the heart of our method lies a transformer-based Conditional Autoregressive Motion Diffusion Model (CAMDM), which takes as input the character's historical motion and can generate a range of diverse potential future motions conditioned on high-level, coarse user control. To meet the demands for diversity, controllability, and computational efficiency required by a real-time controller, we incorporate several key algorithmic designs. These include separate condition tokenization, classifier-free guidance on past motion, and heuristic future trajectory extension, all designed to address the challenges associated with taming motion diffusion probabilistic models for character control. As a result, our work represents the first model that enables real-time generation of high-quality, diverse character animations based on user interactive control, supporting animating the character in multiple styles with a single unified model. We evaluate our method on a diverse set of locomotion skills, demonstrating the merits of our method over existing character controllers. Project page and source codes: https://aiganimation.github.io/CAMDM/ △ Less

Submitted 23 April, 2024; originally announced April 2024.

Comments: Accepted by SIGGRAPH 2024 (Conference Track). Project page and source codes: https://aiganimation.github.io/CAMDM/

arXiv:2404.14850 [pdf, other]

Simple, Efficient and Scalable Structure-aware Adapter Boosts Protein Language Models

Authors: Yang Tan, Mingchen Li, Bingxin Zhou, Bozitao Zhong, Lirong Zheng, Pan Tan, Ziyi Zhou, Huiqun Yu, Guisheng Fan, Liang Hong

Abstract: Fine-tuning Pre-trained protein language models (PLMs) has emerged as a prominent strategy for enhancing downstream prediction tasks, often outperforming traditional supervised learning approaches. As a widely applied powerful technique in natural language processing, employing Parameter-Efficient Fine-Tuning techniques could potentially enhance the performance of PLMs. However, the direct transfe… ▽ More Fine-tuning Pre-trained protein language models (PLMs) has emerged as a prominent strategy for enhancing downstream prediction tasks, often outperforming traditional supervised learning approaches. As a widely applied powerful technique in natural language processing, employing Parameter-Efficient Fine-Tuning techniques could potentially enhance the performance of PLMs. However, the direct transfer to life science tasks is non-trivial due to the different training strategies and data forms. To address this gap, we introduce SES-Adapter, a simple, efficient, and scalable adapter method for enhancing the representation learning of PLMs. SES-Adapter incorporates PLM embeddings with structural sequence embeddings to create structure-aware representations. We show that the proposed method is compatible with different PLM architectures and across diverse tasks. Extensive evaluations are conducted on 2 types of folding structures with notable quality differences, 9 state-of-the-art baselines, and 9 benchmark datasets across distinct downstream tasks. Results show that compared to vanilla PLMs, SES-Adapter improves downstream task performance by a maximum of 11% and an average of 3%, with significantly accelerated training speed by a maximum of 1034% and an average of 362%, the convergence rate is also improved by approximately 2 times. Moreover, positive optimization is observed even with low-quality predicted structures. The source code for SES-Adapter is available at https://github.com/tyang816/SES-Adapter. △ Less

Submitted 23 April, 2024; originally announced April 2024.

Comments: 30 pages, 4 figures, 8 tables

arXiv:2404.11367 [pdf]

Phonon Directionality Determines the Polarization of the Band-Edge Exciton Emission in Two-Dimensional Metal Halide Perovskites

Authors: Roman Krahne, Alexander Schleusener, Mehrdad Faraji, Lin-Han Li, Miao-Ling Lin, Ping-Heng Tan

Abstract: Two-dimensional metal-halide perovskites are highly versatile for light-driven applications due to their exceptional variety in material composition, which can be exploited for tunability of mechanical and optoelectronic properties. The band edge emission is governed by the exciton fine structure that is defined by structure and composition of both organic and inorganic layers. Moreover, electroni… ▽ More Two-dimensional metal-halide perovskites are highly versatile for light-driven applications due to their exceptional variety in material composition, which can be exploited for tunability of mechanical and optoelectronic properties. The band edge emission is governed by the exciton fine structure that is defined by structure and composition of both organic and inorganic layers. Moreover, electronic and elastic properties are intricately connected in these materials. Electron-phonon coupling plays a crucial role in the recombination dynamics. However, the nature of the electron-phonon coupling, as well as which kind of phonons are involved, is still under debate. Here we investigate the emission and phonon response from single two-dimensional lead-iodide microcrystals with angle-resolved polarized spectroscopy. We find an intricate dependence of the emission polarization with the vibrational directionality in the materials, which clearly reveals that several bands of the low-frequency phonons of the inorganic lead-iodide perovskite lattice play the key role in the band edge emission. Our findings demonstrate how the emission spectrum and polarization of two-dimensional layered perovskites can be designed by their material composition, which is essential for optoelectronic applications, where fine control on the spectral and structural properties of the light is desired. △ Less

Submitted 17 April, 2024; originally announced April 2024.

Comments: 20 pages, 4 figures

arXiv:2404.02788 [pdf, other]

GenN2N: Generative NeRF2NeRF Translation

Authors: Xiangyue Liu, Han Xue, Kunming Luo, Ping Tan, Li Yi

Abstract: We present GenN2N, a unified NeRF-to-NeRF translation framework for various NeRF translation tasks such as text-driven NeRF editing, colorization, super-resolution, inpainting, etc. Unlike previous methods designed for individual translation tasks with task-specific schemes, GenN2N achieves all these NeRF editing tasks by employing a plug-and-play image-to-image translator to perform editing in th… ▽ More We present GenN2N, a unified NeRF-to-NeRF translation framework for various NeRF translation tasks such as text-driven NeRF editing, colorization, super-resolution, inpainting, etc. Unlike previous methods designed for individual translation tasks with task-specific schemes, GenN2N achieves all these NeRF editing tasks by employing a plug-and-play image-to-image translator to perform editing in the 2D domain and lifting 2D edits into the 3D NeRF space. Since the 3D consistency of 2D edits may not be assured, we propose to model the distribution of the underlying 3D edits through a generative model that can cover all possible edited NeRFs. To model the distribution of 3D edited NeRFs from 2D edited images, we carefully design a VAE-GAN that encodes images while decoding NeRFs. The latent space is trained to align with a Gaussian distribution and the NeRFs are supervised through an adversarial loss on its renderings. To ensure the latent code does not depend on 2D viewpoints but truly reflects the 3D edits, we also regularize the latent code through a contrastive learning scheme. Extensive experiments on various editing tasks show GenN2N, as a universal framework, performs as well or better than task-specific specialists while possessing flexible generative power. More results on our project page: https://xiangyueliu.github.io/GenN2N/ △ Less

Submitted 3 April, 2024; originally announced April 2024.

Comments: Accepted to CVPR 2024. Project page: https://xiangyueliu.github.io/GenN2N/

arXiv:2404.01543 [pdf, other]

Efficient 3D Implicit Head Avatar with Mesh-anchored Hash Table Blendshapes

Authors: Ziqian Bai, Feitong Tan, Sean Fanello, Rohit Pandey, Mingsong Dou, Shichen Liu, Ping Tan, Yinda Zhang

Abstract: 3D head avatars built with neural implicit volumetric representations have achieved unprecedented levels of photorealism. However, the computational cost of these methods remains a significant barrier to their widespread adoption, particularly in real-time applications such as virtual reality and teleconferencing. While attempts have been made to develop fast neural rendering approaches for static… ▽ More 3D head avatars built with neural implicit volumetric representations have achieved unprecedented levels of photorealism. However, the computational cost of these methods remains a significant barrier to their widespread adoption, particularly in real-time applications such as virtual reality and teleconferencing. While attempts have been made to develop fast neural rendering approaches for static scenes, these methods cannot be simply employed to support realistic facial expressions, such as in the case of a dynamic facial performance. To address these challenges, we propose a novel fast 3D neural implicit head avatar model that achieves real-time rendering while maintaining fine-grained controllability and high rendering quality. Our key idea lies in the introduction of local hash table blendshapes, which are learned and attached to the vertices of an underlying face parametric model. These per-vertex hash-tables are linearly merged with weights predicted via a CNN, resulting in expression dependent embeddings. Our novel representation enables efficient density and color predictions using a lightweight MLP, which is further accelerated by a hierarchical nearest neighbor search method. Extensive experiments show that our approach runs in real-time while achieving comparable rendering quality to state-of-the-arts and decent results on challenging expressions. △ Less

Submitted 1 April, 2024; originally announced April 2024.

Comments: In CVPR2024. Project page: https://augmentedperception.github.io/monoavatar-plus

arXiv:2403.16053 [pdf, other]

Quantitatively predicting angle-resolved polarized Raman intensity of black phosphorus flakes

Authors: Tao Liu, Jia-Liang Xie, Yu-Chen Leng, Heng Wu, Jiahong Wang, Yang Li, Xue-Feng Yu, Miao-Ling Lin, Ping-Heng Tan

Abstract: In-plane anisotropic layered materials (ALMs), such as black phosphorus (BP), exhibit unique angle-resolved polarized Raman (ARPR) spectroscopy characteristics, as attributed to birefringence, linear dichroism and complex Raman tensor. Moreover, the ARPR intensity profiles of BP flakes deposited on multilayer dielectrics are notably sensitive to their thickness, owing to interference effects. The… ▽ More In-plane anisotropic layered materials (ALMs), such as black phosphorus (BP), exhibit unique angle-resolved polarized Raman (ARPR) spectroscopy characteristics, as attributed to birefringence, linear dichroism and complex Raman tensor. Moreover, the ARPR intensity profiles of BP flakes deposited on multilayer dielectrics are notably sensitive to their thickness, owing to interference effects. The intricate anisotropic effects present challenges in accurately predicting the ARPR intensity of BP flakes. In this study, we propose a comprehensive strategy for predicting the ARPR intensity of BP flakes by explicitly considering optical anisotropy, encompassing birefringence, linear dichroism, and anisotropic cavity interference effects within multilayered structures. Through this approach, we have identified the intrinsic complex Raman tensors for phonon modes, independent of the BP flake thickness. By leveraging this methodology, we have elucidated the flake thickness-dependent effective complex Raman tensor elements, allowing for precise prediction of the observed ARPR intensity profile for the BP flake. This work provides a profound understanding of ARPR behaviors for ALM flakes. △ Less

Submitted 24 March, 2024; originally announced March 2024.

Comments: 6 pages, 4 figures

arXiv:2403.14878 [pdf, other]

Offline tagging of radon-induced backgrounds in XENON1T and applicability to other liquid xenon detectors

Authors: E. Aprile, J. Aalbers, K. Abe, S. Ahmed Maouloud, L. Althueser, B. Andrieu, E. Angelino, J. R. Angevaare, D. Antón Martin, F. Arneodo, L. Baudis, A. L. Baxter, M. Bazyk, L. Bellagamba, R. Biondi, A. Bismark, E. J. Brookes, A. Brown, G. Bruno, R. Budnik, T. K. Bui, J. M. R. Cardoso, A. P. Cimental Chavez, A. P. Colijn, J. Conrad , et al. (142 additional authors not shown)

Abstract: This paper details the first application of a software tagging algorithm to reduce radon-induced backgrounds in liquid noble element time projection chambers, such as XENON1T and XENONnT. The convection velocity field in XENON1T was mapped out using $^{222}\text{Rn}$ and $^{218}\text{Po}$ events, and the root-mean-square convection speed was measured to be $0.30 \pm 0.01$ cm/s. Given this velocity… ▽ More This paper details the first application of a software tagging algorithm to reduce radon-induced backgrounds in liquid noble element time projection chambers, such as XENON1T and XENONnT. The convection velocity field in XENON1T was mapped out using $^{222}\text{Rn}$ and $^{218}\text{Po}$ events, and the root-mean-square convection speed was measured to be $0.30 \pm 0.01$ cm/s. Given this velocity field, $^{214}\text{Pb}$ background events can be tagged when they are followed by $^{214}\text{Bi}$ and $^{214}\text{Po}$ decays, or preceded by $^{218}\text{Po}$ decays. This was achieved by evolving a point cloud in the direction of a measured convection velocity field, and searching for $^{214}\text{Bi}$ and $^{214}\text{Po}$ decays or $^{218}\text{Po}$ decays within a volume defined by the point cloud. In XENON1T, this tagging system achieved a $^{214}\text{Pb}$ background reduction of $6.2^{+0.4}_{-0.9}\%$ with an exposure loss of $1.8\pm 0.2 \%$, despite the timescales of convection being smaller than the relevant decay times. We show that the performance can be improved in XENONnT, and that the performance of such a software-tagging approach can be expected to be further improved in a diffusion-limited scenario. Finally, a similar method might be useful to tag the cosmogenic $^{137}\text{Xe}$ background, which is relevant to the search for neutrinoless double-beta decay. △ Less

Submitted 19 June, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

Comments: 17 pages, 19 figures

arXiv:2403.12013 [pdf, other]

GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image

Authors: Xiao Fu, Wei Yin, Mu Hu, Kaixuan Wang, Yuexin Ma, Ping Tan, Shaojie Shen, Dahua Lin, Xiaoxiao Long

Abstract: We introduce GeoWizard, a new generative foundation model designed for estimating geometric attributes, e.g., depth and normals, from single images. While significant research has already been conducted in this area, the progress has been substantially limited by the low diversity and poor quality of publicly available datasets. As a result, the prior works either are constrained to limited scenar… ▽ More We introduce GeoWizard, a new generative foundation model designed for estimating geometric attributes, e.g., depth and normals, from single images. While significant research has already been conducted in this area, the progress has been substantially limited by the low diversity and poor quality of publicly available datasets. As a result, the prior works either are constrained to limited scenarios or suffer from the inability to capture geometric details. In this paper, we demonstrate that generative models, as opposed to traditional discriminative models (e.g., CNNs and Transformers), can effectively address the inherently ill-posed problem. We further show that leveraging diffusion priors can markedly improve generalization, detail preservation, and efficiency in resource usage. Specifically, we extend the original stable diffusion model to jointly predict depth and normal, allowing mutual information exchange and high consistency between the two representations. More importantly, we propose a simple yet effective strategy to segregate the complex data distribution of various scenes into distinct sub-distributions. This strategy enables our model to recognize different scene layouts, capturing 3D geometry with remarkable fidelity. GeoWizard sets new benchmarks for zero-shot depth and normal prediction, significantly enhancing many downstream applications such as 3D reconstruction, 2D content creation, and novel viewpoint synthesis. △ Less

Submitted 18 March, 2024; originally announced March 2024.

Comments: Project page: https://fuxiao0719.github.io/projects/geowizard/

arXiv:2403.11270 [pdf, other]

Bilateral Propagation Network for Depth Completion

Authors: Jie Tang, Fei-Peng Tian, Boshi An, Jian Li, Ping Tan

Abstract: Depth completion aims to derive a dense depth map from sparse depth measurements with a synchronized color image. Current state-of-the-art (SOTA) methods are predominantly propagation-based, which work as an iterative refinement on the initial estimated dense depth. However, the initial depth estimations mostly result from direct applications of convolutional layers on the sparse depth map. In thi… ▽ More Depth completion aims to derive a dense depth map from sparse depth measurements with a synchronized color image. Current state-of-the-art (SOTA) methods are predominantly propagation-based, which work as an iterative refinement on the initial estimated dense depth. However, the initial depth estimations mostly result from direct applications of convolutional layers on the sparse depth map. In this paper, we present a Bilateral Propagation Network (BP-Net), that propagates depth at the earliest stage to avoid directly convolving on sparse data. Specifically, our approach propagates the target depth from nearby depth measurements via a non-linear model, whose coefficients are generated through a multi-layer perceptron conditioned on both \emph{radiometric difference} and \emph{spatial distance}. By integrating bilateral propagation with multi-modal fusion and depth refinement in a multi-scale framework, our BP-Net demonstrates outstanding performance on both indoor and outdoor scenes. It achieves SOTA on the NYUv2 dataset and ranks 1st on the KITTI depth completion benchmark at the time of submission. Experimental results not only show the effectiveness of bilateral propagation but also emphasize the significance of early-stage propagation in contrast to the refinement stage. Our code and trained models will be available on the project page. △ Less

Submitted 1 April, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

Comments: Accepted by CVPR 2024

arXiv:2403.09917 [pdf]

doi 10.1016/j.pss.2024.105863

The Equilibrium Vapor Pressures of Ammonia and Oxygen Ices at Outer Solar System Temperatures

Authors: B. P. Blakley, Will M. Grundy, Jordan K. Steckloff, Sugata P. Tan, Jennifer Hanley, Anna E. Engle, Stephen C. Tegler, Gerrick E. Lindberg, Shae M. Raposa, Kendall J. Koga, Cecilia L. Thieberger

Abstract: Few laboratory studies have investigated the vapor pressures of the volatiles that may be present as ices in the outer solar system; even fewer studies have investigated these species at the temperatures and pressures suitable to the surfaces of icy bodies in the Saturnian and Uranian systems ($\lt$100 K, $\lt10^{-9}$ bar). This study adds to the work of Grundy et al. (2024) in extending the known… ▽ More Few laboratory studies have investigated the vapor pressures of the volatiles that may be present as ices in the outer solar system; even fewer studies have investigated these species at the temperatures and pressures suitable to the surfaces of icy bodies in the Saturnian and Uranian systems ($\lt$100 K, $\lt10^{-9}$ bar). This study adds to the work of Grundy et al. (2024) in extending the known equilibrium vapor pressures of outer solar system ices through laboratory investigations at very low temperatures. Our experiments with ammonia and oxygen ices provide new thermodynamic models for these species' respective enthalpies of sublimation. We find that ammonia ice, and to a lesser degree oxygen ice, are stable at higher temperatures than extrapolations in previous literature have predicted. Our results show that these ices should be retained over longer periods of time than previous extrapolations would predict, and a greater amount of these solids is required to support observation in exospheres of airless bodies in the outer solar system. △ Less

Submitted 14 March, 2024; originally announced March 2024.

Comments: 29 pages, 9 figures, to be published in Planetary and Space Science

arXiv:2403.05012 [pdf]

Ultrafast Dynamics of Bilayer and Trilayer Nickelate Superconductors

Authors: Y. D. Li, Y. T. Cao, L. Y. Liu, P. Peng, H. Lin, C. Y. Pei, M. X. Zhang, H. Wu, X. Du, W. X. Zhao, K. Y. Zhai, J. K. Zhao, M. -L. Lin, P. H. Tan, Y. P. Qi, G. Li, H. J. Guo, Luyi Yang, L. X. Yang

Abstract: In addition to the pressurized high-temperature superconductivity, bilayer and trilayer nickelate superconductors Lan+1NinO3n+1 (n = 2 and 3) exhibit many intriguing properties at ambient pressure, such as orbital-dependent electronic correlation, non-Fermi liquid behavior, and density-wave transitions. Here, using ultrafast reflectivity measurement, we observe a drastic difference between the ult… ▽ More In addition to the pressurized high-temperature superconductivity, bilayer and trilayer nickelate superconductors Lan+1NinO3n+1 (n = 2 and 3) exhibit many intriguing properties at ambient pressure, such as orbital-dependent electronic correlation, non-Fermi liquid behavior, and density-wave transitions. Here, using ultrafast reflectivity measurement, we observe a drastic difference between the ultrafast dynamics of the bilayer and trilayer nickelates at ambient pressure. Firstly, we observe a coherent phonon mode in La4Ni3O10 involving the collective vibration of La, Ni, and O atoms, which is absent in La3Ni2O7. Secondly, the temperature-dependent relaxation time diverges near the density-wave transition temperature of La4Ni3O10, in drastic contrast to kink-like changes in La3Ni2O7. Moreover, we estimate the electron-phonon coupling constants to be 0.05~0.07 and 0.12~0.16 for La3Ni2O7 and La4Ni3O10, respectively, suggesting a relatively minor role of electron-phonon coupling in the electronic properties of Lan+1NinO3n+1. Our work not only sheds light on the relevant microscopic interaction but also establishes a foundation for further studying the interplay between superconductivity and density-wave transitions in nickelate superconductors. △ Less

Submitted 7 March, 2024; originally announced March 2024.

arXiv:2402.16479 [pdf, other]

Edge Detectors Can Make Deep Convolutional Neural Networks More Robust

Authors: Jin Ding, Jie-Chao Zhao, Yong-Zhi Sun, Ping Tan, Jia-Wei Wang, Ji-En Ma, You-Tong Fang

Abstract: Deep convolutional neural networks (DCNN for short) are vulnerable to examples with small perturbations. Improving DCNN's robustness is of great significance to the safety-critical applications, such as autonomous driving and industry automation. Inspired by the principal way that human eyes recognize objects, i.e., largely relying on the shape features, this paper first employs the edge detectors… ▽ More Deep convolutional neural networks (DCNN for short) are vulnerable to examples with small perturbations. Improving DCNN's robustness is of great significance to the safety-critical applications, such as autonomous driving and industry automation. Inspired by the principal way that human eyes recognize objects, i.e., largely relying on the shape features, this paper first employs the edge detectors as layer kernels and designs a binary edge feature branch (BEFB for short) to learn the binary edge features, which can be easily integrated into any popular backbone. The four edge detectors can learn the horizontal, vertical, positive diagonal, and negative diagonal edge features, respectively, and the branch is stacked by multiple Sobel layers (using edge detectors as kernels) and one threshold layer. The binary edge features learned by the branch, concatenated with the texture features learned by the backbone, are fed into the fully connected layers for classification. We integrate the proposed branch into VGG16 and ResNet34, respectively, and conduct experiments on multiple datasets. Experimental results demonstrate the BEFB is lightweight and has no side effects on training. And the accuracy of the BEFB integrated models is better than the original ones on all datasets when facing FGSM, PGD, and C\&W attacks. Besides, BEFB integrated models equipped with the robustness enhancing techniques can achieve better classification accuracy compared to the original models. The work in this paper for the first time shows it is feasible to enhance the robustness of DCNNs through combining both shape-like features and texture features. △ Less

Submitted 26 February, 2024; originally announced February 2024.

Comments: 26 pages, 18 figures, 7 tables. submitted to Neural Networks, under review

arXiv:2402.10551 [pdf, other]

Personalised Drug Identifier for Cancer Treatment with Transformers using Auxiliary Information

Authors: Aishwarya Jayagopal, Hansheng Xue, Ziyang He, Robert J. Walsh, Krishna Kumar Hariprasannan, David Shao Peng Tan, Tuan Zea Tan, Jason J. Pitt, Anand D. Jeyasekharan, Vaibhav Rajan

Abstract: Cancer remains a global challenge due to its growing clinical and economic burden. Its uniquely personal manifestation, which makes treatment difficult, has fuelled the quest for personalized treatment strategies. Thus, genomic profiling is increasingly becoming part of clinical diagnostic panels. Effective use of such panels requires accurate drug response prediction (DRP) models, which are chall… ▽ More Cancer remains a global challenge due to its growing clinical and economic burden. Its uniquely personal manifestation, which makes treatment difficult, has fuelled the quest for personalized treatment strategies. Thus, genomic profiling is increasingly becoming part of clinical diagnostic panels. Effective use of such panels requires accurate drug response prediction (DRP) models, which are challenging to build due to limited labelled patient data. Previous methods to address this problem have used various forms of transfer learning. However, they do not explicitly model the variable length sequential structure of the list of mutations in such diagnostic panels. Further, they do not utilize auxiliary information (like patient survival) for model training. We address these limitations through a novel transformer based method, which surpasses the performance of state-of-the-art DRP models on benchmark data. We also present the design of a treatment recommendation system (TRS), which is currently deployed at the National University Hospital, Singapore and is being evaluated in a clinical trial. △ Less

Submitted 16 February, 2024; originally announced February 2024.

arXiv:2402.10446 [pdf, other]

The XENONnT Dark Matter Experiment

Authors: XENON Collaboration, E. Aprile, J. Aalbers, K. Abe, S. Ahmed Maouloud, L. Althueser, B. Andrieu, E. Angelino, J. R. Angevaare, V. C. Antochi, D. Antón Martin, F. Arneodo, M. Balata, L. Baudis, A. L. Baxter, M. Bazyk, L. Bellagamba, R. Biondi, A. Bismark, E. J. Brookes, A. Brown, S. Bruenner, G. Bruno, R. Budnik, T. K. Bui , et al. (170 additional authors not shown)

Abstract: The multi-staged XENON program at INFN Laboratori Nazionali del Gran Sasso aims to detect dark matter with two-phase liquid xenon time projection chambers of increasing size and sensitivity. The XENONnT experiment is the latest detector in the program, planned to be an upgrade of its predecessor XENON1T. It features an active target of 5.9 tonnes of cryogenic liquid xenon (8.5 tonnes total mass in… ▽ More The multi-staged XENON program at INFN Laboratori Nazionali del Gran Sasso aims to detect dark matter with two-phase liquid xenon time projection chambers of increasing size and sensitivity. The XENONnT experiment is the latest detector in the program, planned to be an upgrade of its predecessor XENON1T. It features an active target of 5.9 tonnes of cryogenic liquid xenon (8.5 tonnes total mass in cryostat). The experiment is expected to extend the sensitivity to WIMP dark matter by more than an order of magnitude compared to XENON1T, thanks to the larger active mass and the significantly reduced background, improved by novel systems such as a radon removal plant and a neutron veto. This article describes the XENONnT experiment and its sub-systems in detail and reports on the detector performance during the first science run. △ Less

Submitted 15 February, 2024; originally announced February 2024.

Comments: 32 pages, 19 figures

arXiv:2402.02004 [pdf]

Enhancing the efficiency of protein language models with minimal wet-lab data through few-shot learning

Authors: Ziyi Zhou, Liang Zhang, Yuanxi Yu, Mingchen Li, Liang Hong, Pan Tan

Abstract: Accurately modeling the protein fitness landscapes holds great importance for protein engineering. Recently, due to their capacity and representation ability, pre-trained protein language models have achieved state-of-the-art performance in predicting protein fitness without experimental data. However, their predictions are limited in accuracy as well as interpretability. Furthermore, such deep le… ▽ More Accurately modeling the protein fitness landscapes holds great importance for protein engineering. Recently, due to their capacity and representation ability, pre-trained protein language models have achieved state-of-the-art performance in predicting protein fitness without experimental data. However, their predictions are limited in accuracy as well as interpretability. Furthermore, such deep learning models require abundant labeled training examples for performance improvements, posing a practical barrier. In this work, we introduce FSFP, a training strategy that can effectively optimize protein language models under extreme data scarcity. By combining the techniques of meta-transfer learning, learning to rank, and parameter-efficient fine-tuning, FSFP can significantly boost the performance of various protein language models using merely tens of labeled single-site mutants from the target protein. The experiments across 87 deep mutational scanning datasets underscore its superiority over both unsupervised and supervised approaches, revealing its potential in facilitating AI-guided protein design. △ Less

Submitted 2 February, 2024; originally announced February 2024.

arXiv:2401.14427 [pdf, other]

Beimingwu: A Learnware Dock System

Authors: Zhi-Hao Tan, Jian-Dong Liu, Xiao-Dong Bi, Peng Tan, Qin-Cheng Zheng, Hai-Tian Liu, Yi Xie, Xiao-Chuan Zou, Yang Yu, Zhi-Hua Zhou

Abstract: The learnware paradigm proposed by Zhou [2016] aims to enable users to reuse numerous existing well-trained models instead of building machine learning models from scratch, with the hope of solving new user tasks even beyond models' original purposes. In this paradigm, developers worldwide can submit their high-performing models spontaneously to the learnware dock system (formerly known as learnwa… ▽ More The learnware paradigm proposed by Zhou [2016] aims to enable users to reuse numerous existing well-trained models instead of building machine learning models from scratch, with the hope of solving new user tasks even beyond models' original purposes. In this paradigm, developers worldwide can submit their high-performing models spontaneously to the learnware dock system (formerly known as learnware market) without revealing their training data. Once the dock system accepts the model, it assigns a specification and accommodates the model. This specification allows the model to be adequately identified and assembled to reuse according to future users' needs, even if they have no prior knowledge of the model. This paradigm greatly differs from the current big model direction and it is expected that a learnware dock system housing millions or more high-performing models could offer excellent capabilities for both planned tasks where big models are applicable; and unplanned, specialized, data-sensitive scenarios where big models are not present or applicable. This paper describes Beimingwu, the first open-source learnware dock system providing foundational support for future research of learnware paradigm.The system significantly streamlines the model development for new user tasks, thanks to its integrated architecture and engine design, extensive engineering implementations and optimizations, and the integration of various algorithms for learnware identification and reuse. Notably, this is possible even for users with limited data and minimal expertise in machine learning, without compromising the raw data's security. Beimingwu supports the entire process of learnware paradigm. The system lays the foundation for future research in learnware-related algorithms and systems, and prepares the ground for hosting a vast array of learnwares and establishing a learnware ecosystem. △ Less

Submitted 24 January, 2024; originally announced January 2024.

arXiv:2401.05478 [pdf, other]

Population Graph Cross-Network Node Classification for Autism Detection Across Sample Groups

Authors: Anna Stephens, Francisco Santos, Pang-Ning Tan, Abdol-Hossein Esfahanian

Abstract: Graph neural networks (GNN) are a powerful tool for combining imaging and non-imaging medical information for node classification tasks. Cross-network node classification extends GNN techniques to account for domain drift, allowing for node classification on an unlabeled target network. In this paper we present OTGCN, a powerful, novel approach to cross-network node classification. This approach l… ▽ More Graph neural networks (GNN) are a powerful tool for combining imaging and non-imaging medical information for node classification tasks. Cross-network node classification extends GNN techniques to account for domain drift, allowing for node classification on an unlabeled target network. In this paper we present OTGCN, a powerful, novel approach to cross-network node classification. This approach leans on concepts from graph convolutional networks to harness insights from graph data structures while simultaneously applying strategies rooted in optimal transport to correct for the domain drift that can occur between samples from different data collection sites. This blended approach provides a practical solution for scenarios with many distinct forms of data collected across different locations and equipment. We demonstrate the effectiveness of this approach at classifying Autism Spectrum Disorder subjects using a blend of imaging and non-imaging data. △ Less

Submitted 10 January, 2024; originally announced January 2024.

Comments: To appear ICDM DMBIH workshop 2023

arXiv:2311.02900 [pdf, other]

doi 10.1109/ICRA48506.2021.9561575

Initialisation of Autonomous Aircraft Visual Inspection Systems via CNN-Based Camera Pose Estimation

Authors: Xueyan Oh, Leonard Loh, Shaohui Foong, Zhong Bao Andy Koh, Kow Leong Ng, Poh Kang Tan, Pei Lin Pearlin Toh, U-Xuan Tan

Abstract: General Visual Inspection is a manual inspection process regularly used to detect and localise obvious damage on the exterior of commercial aircraft. There has been increasing demand to perform this process at the boarding gate to minimize the downtime of the aircraft and automating this process is desired to reduce the reliance on human labour. This automation typically requires the first step of… ▽ More General Visual Inspection is a manual inspection process regularly used to detect and localise obvious damage on the exterior of commercial aircraft. There has been increasing demand to perform this process at the boarding gate to minimize the downtime of the aircraft and automating this process is desired to reduce the reliance on human labour. This automation typically requires the first step of estimating a camera's pose with respect to the aircraft for initialisation. However, localisation methods often require infrastructure, which can be very challenging when performed in uncontrolled outdoor environments and within the limited turnover time (approximately 2 hours) on an airport tarmac. In addition, access to commercial aircraft can be very restricted, causing development and testing of solutions to be a challenge. Hence, this paper proposes an on-site infrastructure-less initialisation method, by using the same pan-tilt-zoom camera used for the inspection task to estimate its own pose. This is achieved using a Deep Convolutional Neural Network trained with only synthetic images to regress the camera's pose. We apply domain randomisation when generating our dataset for training our network and improve prediction accuracy by introducing a new component to an existing loss function that leverages on known aircraft geometry to relate position and orientation. Experiments are conducted and we have successfully regressed camera poses with a median error of 0.22 m and 0.73 degrees. △ Less

Submitted 6 November, 2023; originally announced November 2023.

Comments: This paper has been accepted by 2021 IEEE International Conference on Robotics and Automation (ICRA) with DOI: 10.1109/ICRA48506.2021.9561575

arXiv:2310.17415 [pdf, other]

PETA: Evaluating the Impact of Protein Transfer Learning with Sub-word Tokenization on Downstream Applications

Authors: Yang Tan, Mingchen Li, Pan Tan, Ziyi Zhou, Huiqun Yu, Guisheng Fan, Liang Hong

Abstract: Large protein language models are adept at capturing the underlying evolutionary information in primary structures, offering significant practical value for protein engineering. Compared to natural language models, protein amino acid sequences have a smaller data volume and a limited combinatorial space. Choosing an appropriate vocabulary size to optimize the pre-trained model is a pivotal issue.… ▽ More Large protein language models are adept at capturing the underlying evolutionary information in primary structures, offering significant practical value for protein engineering. Compared to natural language models, protein amino acid sequences have a smaller data volume and a limited combinatorial space. Choosing an appropriate vocabulary size to optimize the pre-trained model is a pivotal issue. Moreover, despite the wealth of benchmarks and studies in the natural language community, there remains a lack of a comprehensive benchmark for systematically evaluating protein language model quality. Given these challenges, PETA trained language models with 14 different vocabulary sizes under three tokenization methods. It conducted thousands of tests on 33 diverse downstream datasets to assess the models' transfer learning capabilities, incorporating two classification heads and three random seeds to mitigate potential biases. Extensive experiments indicate that vocabulary sizes between 50 and 200 optimize the model, whereas sizes exceeding 800 detrimentally affect the model's representational performance. Our code, model weights and datasets are available at https://github.com/ginnm/ProteinPretraining. △ Less

Submitted 26 October, 2023; originally announced October 2023.

Comments: 46 pages, 4figures, 9 tables

arXiv:2310.16857 [pdf]

Improvement in Alzheimer's Disease MRI Images Analysis by Convolutional Neural Networks Via Topological Optimization

Authors: Peiwen Tan

Abstract: This research underscores the efficacy of Fourier topological optimization in refining MRI imagery, thereby bolstering the classification precision of Alzheimer's Disease through convolutional neural networks. Recognizing that MRI scans are indispensable for neurological assessments, but frequently grapple with issues like blurriness and contrast irregularities, the deployment of Fourier topologic… ▽ More This research underscores the efficacy of Fourier topological optimization in refining MRI imagery, thereby bolstering the classification precision of Alzheimer's Disease through convolutional neural networks. Recognizing that MRI scans are indispensable for neurological assessments, but frequently grapple with issues like blurriness and contrast irregularities, the deployment of Fourier topological optimization offered enhanced delineation of brain structures, ameliorated noise, and superior contrast. The applied techniques prioritized boundary enhancement, contrast and brightness adjustments, and overall image lucidity. Employing CNN architectures VGG16, ResNet50, InceptionV3, and Xception, the post-optimization analysis revealed a marked elevation in performance. Conclusively, the amalgamation of Fourier topological optimization with CNNs delineates a promising trajectory for the nuanced classification of Alzheimer's Disease, portending a transformative impact on its diagnostic paradigms. △ Less

Submitted 24 October, 2023; originally announced October 2023.

arXiv:2310.12363 [pdf]

AI-Based Decadal Predictive Analysis of Twenty Infectious Diseases in China with an Improved BSTS-MCMC Model

Authors: Peiwen Tan

Abstract: This study embarks on a comprehensive exploration of the decadal trends and future trajectories of twenty distinct infectious diseases in China from 1998 to 2021. A refined Hybrid Bayesian Structural Time Series (BSTS)-Markov Chain Monte Carlo (MCMC) model is employed, intertwining with Long Short-Term Memory (LSTM) networks to dissect intricate relationships amidst population demographics, econom… ▽ More This study embarks on a comprehensive exploration of the decadal trends and future trajectories of twenty distinct infectious diseases in China from 1998 to 2021. A refined Hybrid Bayesian Structural Time Series (BSTS)-Markov Chain Monte Carlo (MCMC) model is employed, intertwining with Long Short-Term Memory (LSTM) networks to dissect intricate relationships amidst population demographics, economic indices, and the evolution of infectious diseases. The findings reveal the persistent prevalence of high incidence diseases in future 10 years, like AIDS, Gonorrhea, and Syphilis, and stable occurrences of middle incidence rate diseases such as Brucellosis and Scarlet Fever, while also foretelling the potential disappearance of lower incidence rate diseases like Cholera, Encephalitis B, and Measles. The study particularly underscores the transformative impact of the COVID-19 pandemic, showcasing its extensive implications on the incidences and management of a plethora of diseases, urging a deeper probe into the nuanced alterations in disease transmission, testing, and reporting modalities amidst global health crises. This research accentuates the critical role of advanced predictive analytics in fostering global preparedness and response mechanisms, and in fortifying the resilience and adaptability of China public health framework against burgeoning infectious disease threats. △ Less

Submitted 6 October, 2023; originally announced October 2023.

arXiv:2310.10377 [pdf, ps, other]

doi 10.1103/PhysRevA.109.013706

Direct measurement of coherent light proportion from a practical laser source

Authors: Xi Jie Yeo, Eva Ernst, Alvin Leow, Jaesuk Hwang, Lijiong Shen, Christian Kurtsiefer, Peng Kian Tan

Abstract: We present a technique to estimate the proportion of coherent emission in the light emitted by a practical laser source without spectral filtering. The technique is based on measuring interferometric photon correlations between the output ports of an asymmetric Mach-Zehnder interferometer. With this, we characterize the fraction of coherent emission in the light emitted by a laser diode when trans… ▽ More We present a technique to estimate the proportion of coherent emission in the light emitted by a practical laser source without spectral filtering. The technique is based on measuring interferometric photon correlations between the output ports of an asymmetric Mach-Zehnder interferometer. With this, we characterize the fraction of coherent emission in the light emitted by a laser diode when transiting through the lasing threshold. △ Less

Submitted 16 October, 2023; originally announced October 2023.

Comments: 7 pages, 4 figures

Journal ref: Phys. Rev. A 109, 013706 (2024)

arXiv:2310.05456 [pdf]

Ensemble-based Hybrid Optimization of Bayesian Neural Networks and Traditional Machine Learning Algorithms

Authors: Peiwen Tan

Abstract: This research introduces a novel methodology for optimizing Bayesian Neural Networks (BNNs) by synergistically integrating them with traditional machine learning algorithms such as Random Forests (RF), Gradient Boosting (GB), and Support Vector Machines (SVM). Feature integration solidifies these results by emphasizing the second-order conditions for optimality, including stationarity and positive… ▽ More This research introduces a novel methodology for optimizing Bayesian Neural Networks (BNNs) by synergistically integrating them with traditional machine learning algorithms such as Random Forests (RF), Gradient Boosting (GB), and Support Vector Machines (SVM). Feature integration solidifies these results by emphasizing the second-order conditions for optimality, including stationarity and positive definiteness of the Hessian matrix. Conversely, hyperparameter tuning indicates a subdued impact in improving Expected Improvement (EI), represented by EI(x). Overall, the ensemble method stands out as a robust, algorithmically optimized approach. △ Less

Submitted 9 October, 2023; originally announced October 2023.

arXiv:2310.03602 [pdf, other]

Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints

Authors: Chuan Fang, Xiaotao Hu, Kunming Luo, Ping Tan

Abstract: Text-driven 3D indoor scene generation could be useful for gaming, film industry, and AR/VR applications. However, existing methods cannot faithfully capture the room layout, nor do they allow flexible editing of individual objects in the room. To address these problems, we present Ctrl-Room, which is able to generate convincing 3D rooms with designer-style layouts and high-fidelity textures from… ▽ More Text-driven 3D indoor scene generation could be useful for gaming, film industry, and AR/VR applications. However, existing methods cannot faithfully capture the room layout, nor do they allow flexible editing of individual objects in the room. To address these problems, we present Ctrl-Room, which is able to generate convincing 3D rooms with designer-style layouts and high-fidelity textures from just a text prompt. Moreover, Ctrl-Room enables versatile interactive editing operations such as resizing or moving individual furniture items. Our key insight is to separate the modeling of layouts and appearance. %how to model the room that takes into account both scene texture and geometry at the same time. To this end, Our proposed method consists of two stages, a `Layout Generation Stage' and an `Appearance Generation Stage'. The `Layout Generation Stage' trains a text-conditional diffusion model to learn the layout distribution with our holistic scene code parameterization. Next, the `Appearance Generation Stage' employs a fine-tuned ControlNet to produce a vivid panoramic image of the room guided by the 3D scene layout and text prompt. In this way, we achieve a high-quality 3D room with convincing layouts and lively textures. Benefiting from the scene code parameterization, we can easily edit the generated room model through our mask-guided editing module, without expensive editing-specific training. Extensive experiments on the Structured3D dataset demonstrate that our method outperforms existing methods in producing more reasonable, view-consistent, and editable 3D rooms from natural language prompts. △ Less

Submitted 8 October, 2023; v1 submitted 5 October, 2023; originally announced October 2023.

arXiv:2310.02596 [pdf, other]

SweetDreamer: Aligning Geometric Priors in 2D Diffusion for Consistent Text-to-3D

Authors: Weiyu Li, Rui Chen, Xuelin Chen, Ping Tan

Abstract: It is inherently ambiguous to lift 2D results from pre-trained diffusion models to a 3D world for text-to-3D generation. 2D diffusion models solely learn view-agnostic priors and thus lack 3D knowledge during the lifting, leading to the multi-view inconsistency problem. We find that this problem primarily stems from geometric inconsistency, and avoiding misplaced geometric structures substantially… ▽ More It is inherently ambiguous to lift 2D results from pre-trained diffusion models to a 3D world for text-to-3D generation. 2D diffusion models solely learn view-agnostic priors and thus lack 3D knowledge during the lifting, leading to the multi-view inconsistency problem. We find that this problem primarily stems from geometric inconsistency, and avoiding misplaced geometric structures substantially mitigates the problem in the final outputs. Therefore, we improve the consistency by aligning the 2D geometric priors in diffusion models with well-defined 3D shapes during the lifting, addressing the vast majority of the problem. This is achieved by fine-tuning the 2D diffusion model to be viewpoint-aware and to produce view-specific coordinate maps of canonically oriented 3D objects. In our process, only coarse 3D information is used for aligning. This "coarse" alignment not only resolves the multi-view inconsistency in geometries but also retains the ability in 2D diffusion models to generate detailed and diversified high-quality objects unseen in the 3D datasets. Furthermore, our aligned geometric priors (AGP) are generic and can be seamlessly integrated into various state-of-the-art pipelines, obtaining high generalizability in terms of unseen shapes and visual appearance while greatly alleviating the multi-view inconsistency problem. Our method represents a new state-of-the-art performance with an 85+% consistency rate by human evaluation, while many previous methods are around 30%. Our project page is https://sweetdreamer3d.github.io/ △ Less

Submitted 20 October, 2023; v1 submitted 4 October, 2023; originally announced October 2023.

Comments: Project page: https://sweetdreamer3d.github.io/

arXiv:2309.13814 [pdf, other]

DVI-SLAM: A Dual Visual Inertial SLAM Network

Authors: Xiongfeng Peng, Zhihua Liu, Weiming Li, Ping Tan, SoonYong Cho, Qiang Wang

Abstract: Recent deep learning based visual simultaneous localization and mapping (SLAM) methods have made significant progress. However, how to make full use of visual information as well as better integrate with inertial measurement unit (IMU) in visual SLAM has potential research value. This paper proposes a novel deep SLAM network with dual visual factors. The basic idea is to integrate both photometric… ▽ More Recent deep learning based visual simultaneous localization and mapping (SLAM) methods have made significant progress. However, how to make full use of visual information as well as better integrate with inertial measurement unit (IMU) in visual SLAM has potential research value. This paper proposes a novel deep SLAM network with dual visual factors. The basic idea is to integrate both photometric factor and re-projection factor into the end-to-end differentiable structure through multi-factor data association module. We show that the proposed network dynamically learns and adjusts the confidence maps of both visual factors and it can be further extended to include the IMU factors as well. Extensive experiments validate that our proposed method significantly outperforms the state-of-the-art methods on several public datasets, including TartanAir, EuRoC and ETH3D-SLAM. Specifically, when dynamically fusing the three factors together, the absolute trajectory error for both monocular and stereo configurations on EuRoC dataset has reduced by 45.3% and 36.2% respectively. △ Less

Submitted 26 May, 2024; v1 submitted 24 September, 2023; originally announced September 2023.

Comments: Accepted to ICRA2024

Journal ref: The 2024 IEEE International Conference on Robotics and Automation (ICRA2024)

arXiv:2309.11996 [pdf, other]

doi 10.1140/epjc/s10052-023-12296-y

Design and performance of the field cage for the XENONnT experiment

Authors: E. Aprile, K. Abe, S. Ahmed Maouloud, L. Althueser, B. Andrieu, E. Angelino, J. R. Angevaare, V. C. Antochi, D. Antón Martin, F. Arneodo, L. Baudis, A. L. Baxter, M. Bazyk, L. Bellagamba, R. Biondi, A. Bismark, E. J. Brookes, A. Brown, S. Bruenner, G. Bruno, R. Budnik, T. K. Bui, C. Cai, J. M. R. Cardoso, D. Cichon , et al. (139 additional authors not shown)

Abstract: The precision in reconstructing events detected in a dual-phase time projection chamber depends on an homogeneous and well understood electric field within the liquid target. In the XENONnT TPC the field homogeneity is achieved through a double-array field cage, consisting of two nested arrays of field shaping rings connected by an easily accessible resistor chain. Rather than being connected to t… ▽ More The precision in reconstructing events detected in a dual-phase time projection chamber depends on an homogeneous and well understood electric field within the liquid target. In the XENONnT TPC the field homogeneity is achieved through a double-array field cage, consisting of two nested arrays of field shaping rings connected by an easily accessible resistor chain. Rather than being connected to the gate electrode, the topmost field shaping ring is independently biased, adding a degree of freedom to tune the electric field during operation. Two-dimensional finite element simulations were used to optimize the field cage, as well as its operation. Simulation results were compared to ${}^{83m}\mathrm{Kr}$ calibration data. This comparison indicates an accumulation of charge on the panels of the TPC which is constant over time, as no evolution of the reconstructed position distribution of events is observed. The simulated electric field was then used to correct the charge signal for the field dependence of the charge yield. This correction resolves the inconsistent measurement of the drift electron lifetime when using different calibrations sources and different field cage tuning voltages. △ Less

Submitted 21 September, 2023; originally announced September 2023.

Journal ref: Eur. Phys. J. C 84, 138 (2024)

arXiv:2309.06404 [pdf]

doi 10.1021/acs.nanolett.3c00780

Tunable Circular Photogalvanic and Photovoltaic Effect in 2D Tellurium with Different Chirality

Authors: Chang Niu, Shouyuan Huang, Neil Ghosh, Pukun Tan, Mingyi Wang, Wenzhuo Wu, Xianfan Xu, Peide D. Ye

Abstract: Chirality arises from the asymmetry of matters, where two counterparts are the mirror image of each other. The interaction between circular-polarization light and quantum materials is enhanced in chiral space groups due to the structural chirality. Tellurium (Te) possesses the simplest chiral crystal structure, with Te atoms covalently bonded into a spiral atomic chain (left- or right-handed) with… ▽ More Chirality arises from the asymmetry of matters, where two counterparts are the mirror image of each other. The interaction between circular-polarization light and quantum materials is enhanced in chiral space groups due to the structural chirality. Tellurium (Te) possesses the simplest chiral crystal structure, with Te atoms covalently bonded into a spiral atomic chain (left- or right-handed) with a periodicity of three. Here, we investigate the tunable circular photo-electric responses in 2D Te field-effect transistor with different chirality, including the longitudinal circular photogalvanic effect induced by the radial spin texture (electron-spin polarization parallel to the electron momentum direction) and the circular photovoltaic induced by the chiral crystal structure (helical Te atomic chains). Our work demonstrates the controllable manipulation of the chirality degree of freedom in materials. △ Less

Submitted 12 September, 2023; originally announced September 2023.

Comments: 30 pages

Journal ref: Nano Letters 23, no. 8 (2023): 3599-3606

arXiv:2309.05078 [pdf]

doi 10.1016/j.icarus.2023.115767

Laboratory Measurement of Volatile Ice Vapor Pressures with a Quartz Crystal Microbalance

Authors: W. M. Grundy, S. C. Tegler, J. K. Steckloff, S. P. Tan, M. J. Loeffler, A. V. Jasko, K. J. Koga, B. P. Blakley, S. M. Raposa, A. E. Engle, C. L. Thieberger, J. Hanley, G. E. Lindberg, M. D. Gomez, A. O. Madden-Watson

Abstract: Nitrogen, carbon monoxide, and methane are key materials in the far outer Solar System where their high volatility enables them to sublimate, potentially driving activity at very low temperatures. Knowledge of their vapor pressures and latent heats of sublimation at relevant temperatures is needed to model the processes involved. We describe a method for using a quartz crystal microbalance to meas… ▽ More Nitrogen, carbon monoxide, and methane are key materials in the far outer Solar System where their high volatility enables them to sublimate, potentially driving activity at very low temperatures. Knowledge of their vapor pressures and latent heats of sublimation at relevant temperatures is needed to model the processes involved. We describe a method for using a quartz crystal microbalance to measure the sublimation flux of these volatile ices in the free molecular flow regime, accounting for the simultaneous sublimation from and condensation onto the quartz crystal to derive vapor pressures and latent heats of sublimation. We find vapor pressures to be somewhat lower than previous estimates in literature, with carbon monoxide being the most discrepant of the three species, almost an order of magnitude lower than had been thought. These results have important implications across a variety of astrophysical and planetary environments. △ Less

Submitted 21 September, 2023; v1 submitted 10 September, 2023; originally announced September 2023.

arXiv:2308.11162 [pdf, other]

A Preliminary Investigation into Search and Matching for Tumour Discrimination in WHO Breast Taxonomy Using Deep Networks

Authors: Abubakr Shafique, Ricardo Gonzalez, Liron Pantanowitz, Puay Hoon Tan, Alberto Machado, Ian A Cree, Hamid R. Tizhoosh

Abstract: Breast cancer is one of the most common cancers affecting women worldwide. They include a group of malignant neoplasms with a variety of biological, clinical, and histopathological characteristics. There are more than 35 different histological forms of breast lesions that can be classified and diagnosed histologically according to cell morphology, growth, and architecture patterns. Recently, deep… ▽ More Breast cancer is one of the most common cancers affecting women worldwide. They include a group of malignant neoplasms with a variety of biological, clinical, and histopathological characteristics. There are more than 35 different histological forms of breast lesions that can be classified and diagnosed histologically according to cell morphology, growth, and architecture patterns. Recently, deep learning, in the field of artificial intelligence, has drawn a lot of attention for the computerized representation of medical images. Searchable digital atlases can provide pathologists with patch matching tools allowing them to search among evidently diagnosed and treated archival cases, a technology that may be regarded as computational second opinion. In this study, we indexed and analyzed the WHO breast taxonomy (Classification of Tumours 5th Ed.) spanning 35 tumour types. We visualized all tumour types using deep features extracted from a state-of-the-art deep learning model, pre-trained on millions of diagnostic histopathology images from the TCGA repository. Furthermore, we test the concept of a digital "atlas" as a reference for search and matching with rare test cases. The patch similarity search within the WHO breast taxonomy data reached over 88% accuracy when validating through "majority vote" and more than 91% accuracy when validating using top-n tumour types. These results show for the first time that complex relationships among common and rare breast lesions can be investigated using an indexed digital archive. △ Less

Submitted 21 August, 2023; originally announced August 2023.

arXiv:2308.06313 [pdf, other]

doi 10.22331/q-2024-02-12-1247

Qibolab: an open-source hybrid quantum operating system

Authors: Stavros Efthymiou, Alvaro Orgaz-Fuertes, Rodolfo Carobene, Juan Cereijo, Andrea Pasquale, Sergi Ramos-Calderer, Simone Bordoni, David Fuentes-Ruiz, Alessandro Candido, Edoardo Pedicillo, Matteo Robbiati, Yuanzheng Paul Tan, Jadwiga Wilkens, Ingo Roth, José Ignacio Latorre, Stefano Carrazza

Abstract: We present Qibolab, an open-source software library for quantum hardware control integrated with the Qibo quantum computing middleware framework. Qibolab provides the software layer required to automatically execute circuit-based algorithms on custom self-hosted quantum hardware platforms. We introduce a set of objects designed to provide programmatic access to quantum control through pulses-orien… ▽ More We present Qibolab, an open-source software library for quantum hardware control integrated with the Qibo quantum computing middleware framework. Qibolab provides the software layer required to automatically execute circuit-based algorithms on custom self-hosted quantum hardware platforms. We introduce a set of objects designed to provide programmatic access to quantum control through pulses-oriented drivers for instruments, transpilers and optimization algorithms. Qibolab enables experimentalists and developers to delegate all complex aspects of hardware implementation to the library so they can standardize the deployment of quantum computing algorithms in a extensible hardware-agnostic way, using superconducting qubits as the first officially supported quantum technology. We first describe the status of all components of the library, then we show examples of control setup for superconducting qubits platforms. Finally, we present successful application results related to circuit-based algorithms. △ Less

Submitted 5 February, 2024; v1 submitted 11 August, 2023; originally announced August 2023.

Comments: 20 pages, 9 figures, accepted in Quantum, code available at https://github.com/qiboteam/qibolab

Report number: TIF-UNIMI-2023-14, CERN-TH-2023-142

Journal ref: Quantum 8, 1247 (2024)

arXiv:2308.03492 [pdf, other]

Learning Photometric Feature Transform for Free-form Object Scan

Authors: Xiang Feng, Kaizhang Kang, Fan Pei, Huakeng Ding, Jinjiang You, Ping Tan, Kun Zhou, Hongzhi Wu

Abstract: We propose a novel framework to automatically learn to aggregate and transform photometric measurements from multiple unstructured views into spatially distinctive and view-invariant low-level features, which are fed to a multi-view stereo method to enhance 3D reconstruction. The illumination conditions during acquisition and the feature transform are jointly trained on a large amount of synthetic… ▽ More We propose a novel framework to automatically learn to aggregate and transform photometric measurements from multiple unstructured views into spatially distinctive and view-invariant low-level features, which are fed to a multi-view stereo method to enhance 3D reconstruction. The illumination conditions during acquisition and the feature transform are jointly trained on a large amount of synthetic data. We further build a system to reconstruct the geometry and anisotropic reflectance of a variety of challenging objects from hand-held scans. The effectiveness of the system is demonstrated with a lightweight prototype, consisting of a camera and an array of LEDs, as well as an off-the-shelf tablet. Our results are validated against reconstructions from a professional 3D scanner and photographs, and compare favorably with state-of-the-art techniques. △ Less

Submitted 7 August, 2023; originally announced August 2023.

arXiv:2307.13282 [pdf, other]

High-Resolution Volumetric Reconstruction for Clothed Humans

Authors: Sicong Tang, Guangyuan Wang, Qing Ran, Lingzhi Li, Li Shen, Ping Tan

Abstract: We present a novel method for reconstructing clothed humans from a sparse set of, e.g., 1 to 6 RGB images. Despite impressive results from recent works employing deep implicit representation, we revisit the volumetric approach and demonstrate that better performance can be achieved with proper system design. The volumetric representation offers significant advantages in leveraging 3D spatial conte… ▽ More We present a novel method for reconstructing clothed humans from a sparse set of, e.g., 1 to 6 RGB images. Despite impressive results from recent works employing deep implicit representation, we revisit the volumetric approach and demonstrate that better performance can be achieved with proper system design. The volumetric representation offers significant advantages in leveraging 3D spatial context through 3D convolutions, and the notorious quantization error is largely negligible with a reasonably large yet affordable volume resolution, e.g., 512. To handle memory and computation costs, we propose a sophisticated coarse-to-fine strategy with voxel culling and subspace sparse convolution. Our method starts with a discretized visual hull to compute a coarse shape and then focuses on a narrow band nearby the coarse shape for refinement. Once the shape is reconstructed, we adopt an image-based rendering approach, which computes the colors of surface points by blending input images with learned weights. Extensive experimental results show that our method significantly reduces the mean point-to-surface (P2S) precision of state-of-the-art methods by more than 50% to achieve approximately 2mm accuracy with a 512 volume resolution. Additionally, images rendered from our textured model achieve a higher peak signal-to-noise ratio (PSNR) compared to state-of-the-art methods. △ Less

Submitted 25 July, 2023; originally announced July 2023.

arXiv:2307.12682 [pdf]

Pro-PRIME: A general Temperature-Guided Language model to engineer enhanced Stability and Activity in Proteins

Authors: Pan Tan, Mingchen Li, Yuanxi Yu, Fan Jiang, Lirong Zheng, Banghao Wu, Xinyu Sun, Liqi Kang, Jie Song, Liang Zhang, Yi Xiong, Wanli Ouyang, Zhiqiang Hu, Guisheng Fan, Yufeng Pei, Liang Hong

Abstract: Designing protein mutants of both high stability and activity is a critical yet challenging task in protein engineering. Here, we introduce Pro-PRIME, a deep learning zero-shot model, which can suggest protein mutants of improved stability and activity without any prior experimental mutagenesis data. By leveraging temperature-guided language modelling, Pro-PRIME demonstrated superior predictive po… ▽ More Designing protein mutants of both high stability and activity is a critical yet challenging task in protein engineering. Here, we introduce Pro-PRIME, a deep learning zero-shot model, which can suggest protein mutants of improved stability and activity without any prior experimental mutagenesis data. By leveraging temperature-guided language modelling, Pro-PRIME demonstrated superior predictive power compared to current state-of-the-art models on the public mutagenesis dataset over 33 proteins. Furthermore, we carried out wet experiments to test Pro-PRIME on five distinct proteins to engineer certain physicochemical properties, including thermal stability, rates of RNA polymerization and DNA cleavage, hydrolase activity, antigen-antibody binding affinity, or even the nonnatural properties, e.g., the ability to polymerize non-natural nucleic acid or resilience to extreme alkaline conditions. Surprisingly, about 40% AI-designed mutants show better performance than the one before mutation for all five proteins studied and for all properties targeted for engineering. Hence, Pro-PRIME demonstrates the general applicability in protein engineering. △ Less

Submitted 13 May, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

Comments: arXiv admin note: text overlap with arXiv:2304.03780

arXiv:2306.16340 [pdf, other]

Cosmogenic background simulations for the DARWIN observatory at different underground locations

Authors: M. Adrover, L. Althueser, B. Andrieu, E. Angelino, J. R. Angevaare, B. Antunovic, E. Aprile, M. Babicz, D. Bajpai, E. Barberio, L. Baudis, M. Bazyk, N. Bell, L. Bellagamba, R. Biondi, Y. Biondi, A. Bismark, C. Boehm, A. Breskin, E. J. Brookes, A. Brown, G. Bruno, R. Budnik, C. Capelli, J. M. R. Cardoso , et al. (158 additional authors not shown)

Abstract: Xenon dual-phase time projections chambers (TPCs) have proven to be a successful technology in studying physical phenomena that require low-background conditions. With 40t of liquid xenon (LXe) in the TPC baseline design, DARWIN will have a high sensitivity for the detection of particle dark matter, neutrinoless double beta decay ($0νββ$), and axion-like particles (ALPs). Although cosmic muons are… ▽ More Xenon dual-phase time projections chambers (TPCs) have proven to be a successful technology in studying physical phenomena that require low-background conditions. With 40t of liquid xenon (LXe) in the TPC baseline design, DARWIN will have a high sensitivity for the detection of particle dark matter, neutrinoless double beta decay ($0νββ$), and axion-like particles (ALPs). Although cosmic muons are a source of background that cannot be entirely eliminated, they may be greatly diminished by placing the detector deep underground. In this study, we used Monte Carlo simulations to model the cosmogenic background expected for the DARWIN observatory at four underground laboratories: Laboratori Nazionali del Gran Sasso (LNGS), Sanford Underground Research Facility (SURF), Laboratoire Souterrain de Modane (LSM) and SNOLAB. We determine the production rates of unstable xenon isotopes and tritium due to muon-included neutron fluxes and muon-induced spallation. These are expected to represent the dominant contributions to cosmogenic backgrounds and thus the most relevant for site selection. △ Less

Submitted 28 June, 2023; originally announced June 2023.

arXiv:2306.12361 [pdf, other]

Sigma-point Kalman Filter with Nonlinear Unknown Input Estimation via Optimization and Data-driven Approach for Dynamic Systems

Authors: Junn Yong Loo, Ze Yang Ding, Vishnu Monn Baskaran, Surya Girinatha Nurzaman, Chee Pin Tan

Abstract: Most works on joint state and unknown input (UI) estimation require the assumption that the UIs are linear; this is potentially restrictive as it does not hold in many intelligent autonomous systems. To overcome this restriction and circumvent the need to linearize the system, we propose a derivative-free Unknown Input Sigma-point Kalman Filter (SPKF-nUI) where the SPKF is interconnected with a ge… ▽ More Most works on joint state and unknown input (UI) estimation require the assumption that the UIs are linear; this is potentially restrictive as it does not hold in many intelligent autonomous systems. To overcome this restriction and circumvent the need to linearize the system, we propose a derivative-free Unknown Input Sigma-point Kalman Filter (SPKF-nUI) where the SPKF is interconnected with a general nonlinear UI estimator that can be implemented via nonlinear optimization and data-driven approaches. The nonlinear UI estimator uses the posterior state estimate which is less susceptible to state prediction error. In addition, we introduce a joint sigma-point transformation scheme to incorporate both the state and UI uncertainties in the estimation of SPKF-nUI. An in-depth stochastic stability analysis proves that the proposed SPKF-nUI yields exponentially converging estimation error bounds under reasonable assumptions. Finally, two case studies are carried out on a simulation-based rigid robot and a physical soft robot, i.e., robots made of soft materials with complex dynamics to validate effectiveness of the proposed filter on nonlinear dynamic systems. Our results demonstrate that the proposed SPKF-nUI achieves the lowest state and UI estimation errors when compared to the existing nonlinear state-UI filters. △ Less

Submitted 24 June, 2024; v1 submitted 21 June, 2023; originally announced June 2023.

arXiv:2306.11871 [pdf, other]

Search for events in XENON1T associated with Gravitational Waves

Authors: XENON Collaboration, E. Aprile, K. Abe, S. Ahmed Maouloud, L. Althueser, B. Andrieu, E. Angelino, J. R. Angevaare, V. C. Antochi, D. Antoń Martin, F. Arneodo, L. Baudis, A. L. Baxter, M. Bazyk, L. Bellagamba, R. Biondi, A. Bismark, E. J. Brookes, A. Brown, S. Bruenner, G. Bruno, R. Budnik, T. K. Bui, C. Cai, J. M. R. Cardoso , et al. (138 additional authors not shown)

Abstract: We perform a blind search for particle signals in the XENON1T dark matter detector that occur close in time to gravitational wave signals in the LIGO and Virgo observatories. No particle signal is observed in the nuclear recoil, electronic recoil, CE$ν$NS, and S2-only channels within $\pm$ 500 seconds of observations of the gravitational wave signals GW170104, GW170729, GW170817, GW170818, and GW1… ▽ More We perform a blind search for particle signals in the XENON1T dark matter detector that occur close in time to gravitational wave signals in the LIGO and Virgo observatories. No particle signal is observed in the nuclear recoil, electronic recoil, CE$ν$NS, and S2-only channels within $\pm$ 500 seconds of observations of the gravitational wave signals GW170104, GW170729, GW170817, GW170818, and GW170823. We use this null result to constrain mono-energetic neutrinos and Beyond Standard Model particles emitted in the closest coalescence GW170817, a binary neutron star merger. We set new upper limits on the fluence (time-integrated flux) of coincident neutrinos down to 17 keV at 90% confidence level. Furthermore, we constrain the product of coincident fluence and cross section of Beyond Standard Model particles to be less than $10^{-29}$ cm$^2$/cm$^2$ in the [5.5-210] keV energy range at 90% confidence level. △ Less

Submitted 27 October, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

arXiv:2306.06972 [pdf, other]

Local laser heating effects in diamond probed by photoluminescence of SiV centers at low temperature

Authors: YuanFei Gao, JiaMin Lai, ZhenYao Li, PingHeng Tan, ChongXin Shan, Jun Zhang

Abstract: Diamond is generally considered to have high thermal conductivity, so little attention has been paid to the laser heating effects at low excitation power. However, defects during the growth process can result in a great degradation of thermal conductivity, especially at low temperatures. Here, we observed the dynamic redshift and broadening of zero phonon line (ZPL) of silicon-vacancy (SiV) center… ▽ More Diamond is generally considered to have high thermal conductivity, so little attention has been paid to the laser heating effects at low excitation power. However, defects during the growth process can result in a great degradation of thermal conductivity, especially at low temperatures. Here, we observed the dynamic redshift and broadening of zero phonon line (ZPL) of silicon-vacancy (SiV) centers in diamondin the experiment. Utilizing the intrinsic temperature response of the fine structure spectra of SiV as a probe, we confirmed that the laser heating effect appears and the temperature rising results from high defect concentration. By simulating the thermal diffusion process, we have estimated the thermal conductivity of around 1 W/(mK) at the local site, which is a two order magnitude lower than that of single-crystal diamond. Our results provide a feasible scheme for characterizing the laser heating effect of diamond at low temperatures. △ Less

Submitted 12 June, 2023; originally announced June 2023.

arXiv:2306.04919 [pdf, other]

Unsupervised Cross-Domain Soft Sensor Modelling via Deep Physics-Inspired Particle Flow Bayes

Authors: Junn Yong Loo, Ze Yang Ding, Surya G. Nurzaman, Chee-Ming Ting, Vishnu Monn Baskaran, Chee Pin Tan

Abstract: Data-driven soft sensors are essential for achieving accurate perception through reliable state inference. However, developing representative soft sensor models is challenged by issues such as missing labels, domain adaptability, and temporal coherence in data. To address these challenges, we propose a deep Particle Flow Bayes (DPFB) framework for cross-domain soft sensor modeling in the absence o… ▽ More Data-driven soft sensors are essential for achieving accurate perception through reliable state inference. However, developing representative soft sensor models is challenged by issues such as missing labels, domain adaptability, and temporal coherence in data. To address these challenges, we propose a deep Particle Flow Bayes (DPFB) framework for cross-domain soft sensor modeling in the absence of target state labels. In particular, a sequential Bayes objective is first formulated to perform the maximum likelihood estimation underlying the cross-domain soft sensing problem. At the core of the framework, we incorporate a physics-inspired particle flow that optimizes the sequential Bayes objective to perform an exact Bayes update of the model extracted latent and hidden features. As a result, these contributions enable the proposed framework to learn a rich approximate posterior feature representation capable of characterizing complex cross-domain system dynamics and performing effective time series unsupervised domain adaptation (UDA). Finally, we validate the framework on a complex industrial multiphase flow process system with complex dynamics and multiple operating conditions. The results demonstrate that the DPFB framework achieves superior cross-domain soft sensing performance, outperforming state-of-the-art deep UDA and normalizing flow approaches. △ Less

Submitted 8 July, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

arXiv:2305.18163 [pdf, other]

Compact Real-time Radiance Fields with Neural Codebook

Authors: Lingzhi Li, Zhongshu Wang, Zhen Shen, Li Shen, Ping Tan

Abstract: Reconstructing neural radiance fields with explicit volumetric representations, demonstrated by Plenoxels, has shown remarkable advantages on training and rendering efficiency, while grid-based representations typically induce considerable overhead for storage and transmission. In this work, we present a simple and effective framework for pursuing compact radiance fields from the perspective of co… ▽ More Reconstructing neural radiance fields with explicit volumetric representations, demonstrated by Plenoxels, has shown remarkable advantages on training and rendering efficiency, while grid-based representations typically induce considerable overhead for storage and transmission. In this work, we present a simple and effective framework for pursuing compact radiance fields from the perspective of compression methodology. By exploiting intrinsic properties exhibiting in grid models, a non-uniform compression stem is developed to significantly reduce model complexity and a novel parameterized module, named Neural Codebook, is introduced for better encoding high-frequency details specific to per-scene models via a fast optimization. Our approach can achieve over 40 $\times$ reduction on grid model storage with competitive rendering quality. In addition, the method can achieve real-time rendering speed with 180 fps, realizing significant advantage on storage cost compared to real-time rendering methods. △ Less

Submitted 29 May, 2023; originally announced May 2023.

Comments: Accepted by ICME 2023

arXiv:2305.17445 [pdf, other]

Synthesizing Speech Test Cases with Text-to-Speech? An Empirical Study on the False Alarms in Automated Speech Recognition Testing

Authors: Julia Kaiwen Lau, Kelvin Kai Wen Kong, Julian Hao Yong, Per Hoong Tan, Zhou Yang, Zi Qian Yong, Joshua Chern Wey Low, Chun Yong Chong, Mei Kuan Lim, David Lo

Abstract: Recent studies have proposed the use of Text-To-Speech (TTS) systems to automatically synthesise speech test cases on a scale and uncover a large number of failures in ASR systems. However, the failures uncovered by synthetic test cases may not reflect the actual performance of an ASR system when it transcribes human audio, which we refer to as false alarms. Given a failed test case synthesised fr… ▽ More Recent studies have proposed the use of Text-To-Speech (TTS) systems to automatically synthesise speech test cases on a scale and uncover a large number of failures in ASR systems. However, the failures uncovered by synthetic test cases may not reflect the actual performance of an ASR system when it transcribes human audio, which we refer to as false alarms. Given a failed test case synthesised from TTS systems, which consists of TTS-generated audio and the corresponding ground truth text, we feed the human audio stating the same text to an ASR system. If human audio can be correctly transcribed, an instance of a false alarm is detected. In this study, we investigate false alarm occurrences in five popular ASR systems using synthetic audio generated from four TTS systems and human audio obtained from two commonly used datasets. Our results show that the least number of false alarms is identified when testing Deepspeech, and the number of false alarms is the highest when testing Wav2vec2. On average, false alarm rates range from 21% to 34% in all five ASR systems. Among the TTS systems used, Google TTS produces the least number of false alarms (17%), and Espeak TTS produces the highest number of false alarms (32%) among the four TTS systems. Additionally, we build a false alarm estimator that flags potential false alarms, which achieves promising results: a precision of 98.3%, a recall of 96.4%, an accuracy of 98.5%, and an F1 score of 97.3%. Our study provides insight into the appropriate selection of TTS systems to generate high-quality speech to test ASR systems. Additionally, a false alarm estimator can be a way to minimise the impact of false alarms and help developers choose suitable test inputs when evaluating ASR systems. The source code used in this paper is publicly available on GitHub at https://github.com/julianyonghao/FAinASRtest. △ Less

Submitted 18 July, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

Comments: 13 pages, Accepted at ISSTA2023

Showing 1–50 of 333 results for author: Tan, P