Search | arXiv e-print repository

B-ary Tree Push-Pull Method is Provably Efficient for Distributed Learning on Heterogeneous Data

Abstract: This paper considers the distributed learning problem where a group of agents cooperatively minimizes the summation of their local cost functions based on peer-to-peer communication. Particularly, we propose a highly efficient algorithm, termed ``B-ary Tree Push-Pull'' (BTPP), that employs two B-ary spanning trees for distributing the information related to the parameters and stochastic gradients… ▽ More This paper considers the distributed learning problem where a group of agents cooperatively minimizes the summation of their local cost functions based on peer-to-peer communication. Particularly, we propose a highly efficient algorithm, termed ``B-ary Tree Push-Pull'' (BTPP), that employs two B-ary spanning trees for distributing the information related to the parameters and stochastic gradients across the network. The simple method is efficient in communication since each agent interacts with at most $(B+1)$ neighbors per iteration. More importantly, BTPP achieves linear speedup for smooth nonconvex objective functions with only $\tilde{O}(n)$ transient iterations, significantly outperforming the state-of-the-art results to the best of our knowledge. △ Less

Submitted 6 June, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

arXiv:2312.12835 [pdf, ps, other]

doi 10.1609/aaai.v38i15.29584

Near-Optimal Resilient Aggregation Rules for Distributed Learning Using 1-Center and 1-Mean Clustering with Outliers

Authors: Yuhao Yi, Ronghui You, Hong Liu, Changxin Liu, Yuan Wang, Jiancheng Lv

Abstract: Byzantine machine learning has garnered considerable attention in light of the unpredictable faults that can occur in large-scale distributed learning systems. The key to secure resilience against Byzantine machines in distributed learning is resilient aggregation mechanisms. Although abundant resilient aggregation rules have been proposed, they are designed in ad-hoc manners, imposing extra barri… ▽ More Byzantine machine learning has garnered considerable attention in light of the unpredictable faults that can occur in large-scale distributed learning systems. The key to secure resilience against Byzantine machines in distributed learning is resilient aggregation mechanisms. Although abundant resilient aggregation rules have been proposed, they are designed in ad-hoc manners, imposing extra barriers on comparing, analyzing, and improving the rules across performance criteria. This paper studies near-optimal aggregation rules using clustering in the presence of outliers. Our outlier-robust clustering approach utilizes geometric properties of the update vectors provided by workers. Our analysis show that constant approximations to the 1-center and 1-mean clustering problems with outliers provide near-optimal resilient aggregators for metric-based criteria, which have been proven to be crucial in the homogeneous and heterogeneous cases respectively. In addition, we discuss two contradicting types of attacks under which no single aggregation rule is guaranteed to improve upon the naive average. Based on the discussion, we propose a two-phase resilient aggregation framework. We run experiments for image classification using a non-convex loss function. The proposed algorithms outperform previously known aggregation rules by a large margin with both homogeneous and heterogeneous data distributions among non-faulty workers. Code and appendix are available at https://github.com/jerry907/AAAI24-RASHB. △ Less

Submitted 31 March, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

Comments: 17 pages, 4 figures. Accepted by the 38th Annual AAAI Conference on Artificial Intelligence (AAAI'24)

Journal ref: AAAI 2024, 38, 16469-16477

arXiv:2308.08225 [pdf]

Investigation on the Compressibility Characteristics of Low Mach Number Laminar Flow in Rotating Channel

Authors: Junxin Che, Ruquan You, Wenbin Chen, Haiwang Li

Abstract: In high-speed rotating channels, significant compressive effects are observed, resulting in distinct flow characteristics compared to incompressible flows. In this study, we employed a finite volume method based on the simple algorithm to solve for low-speed compressible laminar flow within rotating channels using an orthogonal uniform grid. The governing equations include the full Navier-Stokes e… ▽ More In high-speed rotating channels, significant compressive effects are observed, resulting in distinct flow characteristics compared to incompressible flows. In this study, we employed a finite volume method based on the simple algorithm to solve for low-speed compressible laminar flow within rotating channels using an orthogonal uniform grid. The governing equations include the full Navier-Stokes equations and the energy equation. Contrary to stationary channel, the alterations in flow within rotating channel are primarily influenced by the compressive effects of centrifugal force and the compressibility of fluid within the flow's normal section. The first effect involves a reduction in the velocity due to centrifugal force, leading to an increasing influence of the Coriolis force compared to inertial forces along the flow direction. This trend in axial changes aligns closely with the increase in rotation speed. The second effect arises from the increase in Mach number and the Coriolis compression, resulting in slight density differences within the cross-section. Strong centrifugal forces generate significant centrifugal additional force (buoyancy force). Consequently, under the same local rotation number, the velocity profiles of the mainstream experience considerable changes. Additionally, higher Mach number significantly impact wall shear stress, with the leading side being notably affected. For instance, at a cross-sectional Ro = 0.6 and Ma = 0.035, the dimensionless shear stress on the leading side decreased by 13%. Furthermore, while an increase in Mach number has minimal impact on the cross-sectional secondary flow structure, changes in mainstream velocity profiles influence secondary flow intensity, resulting in an enhanced velocity peak and a shift towards the trailing side. △ Less

Submitted 26 September, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

arXiv:2307.15877 [pdf]

High-order Discontinuity Detection Physics-Informed Neural Network

Authors: Ruquan You, Shuming Zhang, Tinglin Kong, Haiwang Li

Abstract: In order to solve the problem of the difficult direct measurement of temperature field in fluid machinery under high-speed compressible conditions, this study combines high-order finite difference numerical format, Weighted Essentially Non-Oscillatory (WENO) discontinuity detection, and traditional Physics-Informed Neural Network (PINN) to develop a high-order discontinuity detection PINN (Hodd-PI… ▽ More In order to solve the problem of the difficult direct measurement of temperature field in fluid machinery under high-speed compressible conditions, this study combines high-order finite difference numerical format, Weighted Essentially Non-Oscillatory (WENO) discontinuity detection, and traditional Physics-Informed Neural Network (PINN) to develop a high-order discontinuity detection PINN (Hodd-PINN) that can achieve temperature field inversion with a small number of measurement points. When dealing with pure convection problems, Hodd-PINN introduces a 7th-order discretization for the convection term, reducing an additional 9.7% error compared to traditional low-order discretization methods. When dealing with pure diffusion problems, Hodd-PINN introduces an 8th-order discretization for the diffusion term, reducing an additional 12.8% error compared to traditional low-order discretization methods. In addition, this paper develops a loss function based on WENO discontinuity detection technology, which helps eliminate false discontinuities, allowing Hodd-PINN to successfully identify sparse waves that are easily overlooked in PINN's predicted results, reducing the error by 24.2%. Through extensive testing, this paper points out that the Hodd-PINN, which incorporates high-order discretization and discontinuity detection technology, can further reduce the prediction error of PINN, effectively reducing the data requirement, and can effectively solve the problem of false discontinuities. This method has important value for the inversion of temperature and velocity fields in fluid machinery under high-speed compressible conditions. △ Less

Submitted 28 July, 2023; originally announced July 2023.

arXiv:2305.17908 [pdf]

Study of the Effect of a Novel Dimensionless Parameter -- the Centrifugal Work Number(CW), on Spanwise Rotating channel Low-speed Compressible Flow

Authors: Junxin Che, Ruquan You, Fei Zeng, Haiwang Li, Wenbin Chen, Zhi Tao

Abstract: In the study of rotating channel flow, the key dimensionless parameters typically include the Reynolds number, rotation number, Prandtl number and buoyancy number. Our research focused on comparing the flow characteristics between the enlarged model, analyzed under the rotating similarity theory, and the original channel flow. Significantly different flow behaviors were observed between these two… ▽ More In the study of rotating channel flow, the key dimensionless parameters typically include the Reynolds number, rotation number, Prandtl number and buoyancy number. Our research focused on comparing the flow characteristics between the enlarged model, analyzed under the rotating similarity theory, and the original channel flow. Significantly different flow behaviors were observed between these two cases. Through theoretical derivation and dimensional analysis, we identified a new significant parameter - the centrifugal work number (CW). This parameter characterizes the ratio of centrifugal work to gas enthalpy in the rotating channel and plays a crucial role in measuring the compressibility of fluids within the rotating channel. Additionally, we utilized large eddy simulation(LES) to validate the impact of the centrifugal work ratio on the flow state of the rotating channel, thus enhancing the similarity theory of rotating channel compressible flow. △ Less

Submitted 17 September, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

Comments: 14 pages, 8 figures

arXiv:2304.07511 [pdf, other]

Pilgrimage to Pureland: Art, Perception and the Wutai Mural VR Reconstruction

Authors: Rongxuan Mu, Yuhe Nie, Kent Cao, Ruoxin You, Yinzong Wei, Xin Tong

Abstract: Virtual reality (VR) supports audiences to engage with cultural heritage proactively. We designed an easy-to-access and guided Pilgrimage To Pureland VR reconstruction of Dunhuang Mogao Grottoes to offer the general public an accessible and engaging way to explore the Dunhuang murals. We put forward an immersive VR reconstruction paradigm that can efficiently convert complex 2D artwork into a VR e… ▽ More Virtual reality (VR) supports audiences to engage with cultural heritage proactively. We designed an easy-to-access and guided Pilgrimage To Pureland VR reconstruction of Dunhuang Mogao Grottoes to offer the general public an accessible and engaging way to explore the Dunhuang murals. We put forward an immersive VR reconstruction paradigm that can efficiently convert complex 2D artwork into a VR environment. We reconstructed the Mt. Wutai pilgrimage mural in Cave 61, Mogao Grottoes, Dunhuang, into an immersive VR environment and created a plot-based and interactive experience that offers users a more accessible solution to visit, understand and appreciate the complex religious, historical, and artistic value of Dunhuang murals. \textcolor{black}{Our system remarkably smoothed users' approaches to those elusive cultural heritages. Appropriate adaptation of plots and 3D VR transfer consistent with the original art style could enhance the accessibility of cultural heritages. △ Less

Submitted 15 April, 2023; originally announced April 2023.

arXiv:2210.16819 [pdf, other]

Relative Attention-based One-Class Adversarial Autoencoder for Continuous Authentication of Smartphone Users

Authors: Mingming Hu, Kun Zhang, Ruibang You, Bibo Tu

Abstract: Behavioral biometrics-based continuous authentication is a promising authentication scheme, which uses behavioral biometrics recorded by built-in sensors to authenticate smartphone users throughout the session. However, current continuous authentication methods suffer some limitations: 1) behavioral biometrics from impostors are needed to train continuous authentication models. Since the distribut… ▽ More Behavioral biometrics-based continuous authentication is a promising authentication scheme, which uses behavioral biometrics recorded by built-in sensors to authenticate smartphone users throughout the session. However, current continuous authentication methods suffer some limitations: 1) behavioral biometrics from impostors are needed to train continuous authentication models. Since the distribution of negative samples from diverse attackers are unknown, it is a difficult problem to solve in real-world scenarios; 2) most deep learning-based continuous authentication methods need to train two models to improve authentication performance. A deep learning model for deep feature extraction, and a machine learning-based classifier for classification; 3) weak capability of capturing users' behavioral patterns leads to poor authentication performance. To solve these issues, we propose a relative attention-based one-class adversarial autoencoder for continuous authentication of smartphone users. First, we propose a one-class adversarial autoencoder to learn latent representations of legitimate users' behavioral patterns, which is trained only with legitimate smartphone users' behavioral biometrics. Second, we present the relative attention layer to capture richer contextual semantic representation of users' behavioral patterns, which modifies the standard self-attention mechanism using convolution projection instead of linear projection to perform the attention maps. Experimental results demonstrate that we can achieve superior performance of 1.05% EER, 1.09% EER, and 1.08% EER with a high authentication frequency (0.7s) on three public datasets. △ Less

Submitted 1 November, 2022; v1 submitted 30 October, 2022; originally announced October 2022.

arXiv:2206.01929 [pdf]

doi 10.1063/5.0096701

Temperature and Velocity Characteristics of Rotating Turbulent Boundary Layers Under Non-Isothermal Conditions

Authors: Zhi Tao, Ruquan You, Yao Ma, Haiwang Li

Abstract: This paper describes an experimental investigation, by means of hot-wire anemometry, of the characteristics of velocity and temperature in a rotating turbulent boundary layer under isothermal and non-isothermal conditions. The ranges of experimental parameters are: Reynolds number from 10000 to 25000, rotational speed from 0 to 150 rpm, and y+ from 1.8 to 100. The relative temperature difference i… ▽ More This paper describes an experimental investigation, by means of hot-wire anemometry, of the characteristics of velocity and temperature in a rotating turbulent boundary layer under isothermal and non-isothermal conditions. The ranges of experimental parameters are: Reynolds number from 10000 to 25000, rotational speed from 0 to 150 rpm, and y+ from 1.8 to 100. The relative temperature difference is held constant at 0.1. Detailed velocity and temperature distributions in the boundary layer are measured in the rotating state, and a new criterion for boundary layer segmentation under rotation is proposed. The applicability of boundary layer theory under the rotating state is extended. The influence of Coriolis force and buoyancy on the velocity and temperature distributions in the turbulent boundary layers are analyzed. Coriolis force is found to play an important role in the behavior of the boundary layer under rotation, as it shifts the velocity and temperature boundary layers. Under isothermal conditions, such effects can be classified according to the dominant force: viscous, Coriolis, or inertial. Under non-isothermal conditions, buoyancy occurs. The buoyancy induced by the Coriolis force suppresses the effect of the Coriolis force, and the suppression effect increases with temperature difference. The variation of turbulent Prandtl number Prt under rotation is also obtained. △ Less

Submitted 4 June, 2022; originally announced June 2022.

arXiv:2112.13380 [pdf]

Construction method for general phenomenological RANS turbulence model

Authors: Shuming Zhang, Haiwang Li, Ruquan You, Tinglin Kong, Zhi Tao

Abstract: This paper proposes a phenomenological Reynolds Averaged Navier-Stokes (RANS) calculation model based on physical constraints. In this model part of the source terms in the e equation was replaced with the deep learning model, using the standard k-e model as a template. The simulation results of this model achieved a high error reduction of 51.7 % compared to the standard k-e model. To improve the… ▽ More This paper proposes a phenomenological Reynolds Averaged Navier-Stokes (RANS) calculation model based on physical constraints. In this model part of the source terms in the e equation was replaced with the deep learning model, using the standard k-e model as a template. The simulation results of this model achieved a high error reduction of 51.7 % compared to the standard k-e model. To improve the adaptability and accuracy compared to the convergence of the abnormal flow regime, the coordinate technology proposed in this study was used in the modelling process. For the training data, the k-field and e-field were automatically corrected using this approach when the flow state deviated from the theoretical assumption. Based on the coordinate technology, a deep learning model for the source term of the equation was built, and the simulation error was reduced by 6.2 % compared to the uncoordinated one. From the results, the proposed coordinate technology can effectively be adapted to the underdeveloped flow state and assist in the more accurately modelling of the phenomenological RANS calculation model under a complex flow state. △ Less

Submitted 26 December, 2021; originally announced December 2021.

arXiv:2101.12147 [pdf, other]

doi 10.1063/5.0039177

Two-color differential dynamic microscopy for capturing fast dynamics

Authors: Ruilin You, Ryan McGorty

Abstract: Differential dynamic microscopy (DDM) is increasingly used in the fields of soft matter physics and biophysics to extract the dynamics of microscopic objects across a range of wavevectors using optical microscopy. Standard DDM is limited to detecting dynamics no faster than the camera frame rate. We report on an extension to DDM where we sequentially illuminate the sample with spectrally-distinct… ▽ More Differential dynamic microscopy (DDM) is increasingly used in the fields of soft matter physics and biophysics to extract the dynamics of microscopic objects across a range of wavevectors using optical microscopy. Standard DDM is limited to detecting dynamics no faster than the camera frame rate. We report on an extension to DDM where we sequentially illuminate the sample with spectrally-distinct light and image with a color camera. By pulsing blue and then red light separated by a lag time much smaller than the camera's exposure time we are able to use this two-color DDM method to measure dynamics occurring much faster than the camera frame rate. △ Less

Submitted 28 January, 2021; originally announced January 2021.

Comments: The following article has been accepted by Review of Scientific Instruments

arXiv:1912.07872 [pdf, other]

Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification

Authors: Renchun You, Zhiyao Guo, Lei Cui, Xiang Long, Yingze Bao, Shilei Wen

Abstract: Multi-label image and video classification are fundamental yet challenging tasks in computer vision. The main challenges lie in capturing spatial or temporal dependencies between labels and discovering the locations of discriminative features for each class. In order to overcome these challenges, we propose to use cross-modality attention with semantic graph embedding for multi label classificatio… ▽ More Multi-label image and video classification are fundamental yet challenging tasks in computer vision. The main challenges lie in capturing spatial or temporal dependencies between labels and discovering the locations of discriminative features for each class. In order to overcome these challenges, we propose to use cross-modality attention with semantic graph embedding for multi label classification. Based on the constructed label graph, we propose an adjacency-based similarity graph embedding method to learn semantic label embeddings, which explicitly exploit label relationships. Then our novel cross-modality attention maps are generated with the guidance of learned label embeddings. Experiments on two multi-label image classification datasets (MS-COCO and NUS-WIDE) show our method outperforms other existing state-of-the-arts. In addition, we validate our method on a large multi-label video classification dataset (YouTube-8M Segments) and the evaluation results demonstrate the generalization capability of our method. △ Less

Submitted 27 March, 2020; v1 submitted 17 December, 2019; originally announced December 2019.

Comments: Accepted by AAAI2020

arXiv:1910.06148 [pdf]

Experimental investigation of turbulent flow in a two-pass channel with different U-turn

Authors: Runzhou Liu, Haiwang Li, Ruquan You, Zhi Tao

Abstract: In this paper, the TR-PIV method is used to study the internal flow field characteristics in U-shaped channels. The Reynolds number, based on the square cross section channel hydraulic diameter is 8888,13333 and 17777. Mean flow, Reynolds stress and POD are taken into consideration to investigate the flow characteristic with three different turning sections. Through analysis, a series of important… ▽ More In this paper, the TR-PIV method is used to study the internal flow field characteristics in U-shaped channels. The Reynolds number, based on the square cross section channel hydraulic diameter is 8888,13333 and 17777. Mean flow, Reynolds stress and POD are taken into consideration to investigate the flow characteristic with three different turning sections. Through analysis, a series of important conclusions have been drawn. For the main flow, the structure of turning sections has obvious influence on the characteristics of flow field. The size and number of vortices in the corner area are significantly reduced, because the increase in Reynolds number makes the influx impact stronger. It can be seen from the Reynolds stress distribution which is obviously different in different turning sections that the pulsation caused by the mixing of the main flow and the vortex is obviously stronger than that at the boundary. The flow at the turning section is complex, the distribution of the proportion of turbulent kinetic energy in the low-order mode is relatively gentle, and there is an obvious wavy structure at the turning section of the inner circle and outer circular passage, which matches the velocity field from the POD. △ Less

Submitted 14 October, 2019; originally announced October 2019.

Comments: 22 pages,18 figures

arXiv:1904.12578 [pdf, ps, other]

HAXMLNet: Hierarchical Attention Network for Extreme Multi-Label Text Classification

Authors: Ronghui You, Zihan Zhang, Suyang Dai, Shanfeng Zhu

Abstract: Extreme multi-label text classification (XMTC) addresses the problem of tagging each text with the most relevant labels from an extreme-scale label set. Traditional methods use bag-of-words (BOW) representations without context information as their features. The state-ot-the-art deep learning-based method, AttentionXML, which uses a recurrent neural network (RNN) and the multi-label attention, can… ▽ More Extreme multi-label text classification (XMTC) addresses the problem of tagging each text with the most relevant labels from an extreme-scale label set. Traditional methods use bag-of-words (BOW) representations without context information as their features. The state-ot-the-art deep learning-based method, AttentionXML, which uses a recurrent neural network (RNN) and the multi-label attention, can hardly deal with extreme-scale (hundreds of thousands labels) problem. To address this, we propose our HAXMLNet, which uses an efficient and effective hierarchical structure with the multi-label attention. Experimental results show that HAXMLNet reaches a competitive performance with other state-of-the-art methods. △ Less

Submitted 24 March, 2019; originally announced April 2019.

arXiv:1811.01727 [pdf, other]

AttentionXML: Label Tree-based Attention-Aware Deep Model for High-Performance Extreme Multi-Label Text Classification

Authors: Ronghui You, Zihan Zhang, Ziye Wang, Suyang Dai, Hiroshi Mamitsuka, Shanfeng Zhu

Abstract: Extreme multi-label text classification (XMTC) is an important problem in the era of big data, for tagging a given text with the most relevant multiple labels from an extremely large-scale label set. XMTC can be found in many applications, such as item categorization, web page tagging, and news annotation. Traditionally most methods used bag-of-words (BOW) as inputs, ignoring word context as well… ▽ More Extreme multi-label text classification (XMTC) is an important problem in the era of big data, for tagging a given text with the most relevant multiple labels from an extremely large-scale label set. XMTC can be found in many applications, such as item categorization, web page tagging, and news annotation. Traditionally most methods used bag-of-words (BOW) as inputs, ignoring word context as well as deep semantic information. Recent attempts to overcome the problems of BOW by deep learning still suffer from 1) failing to capture the important subtext for each label and 2) lack of scalability against the huge number of labels. We propose a new label tree-based deep learning model for XMTC, called AttentionXML, with two unique features: 1) a multi-label attention mechanism with raw text as input, which allows to capture the most relevant part of text to each label; and 2) a shallow and wide probabilistic label tree (PLT), which allows to handle millions of labels, especially for "tail labels". We empirically compared the performance of AttentionXML with those of eight state-of-the-art methods over six benchmark datasets, including Amazon-3M with around 3 million labels. AttentionXML outperformed all competing methods under all experimental settings. Experimental results also show that AttentionXML achieved the best performance against tail labels among label tree-based methods. The code and datasets are available at http://github.com/yourh/AttentionXML . △ Less

Submitted 4 November, 2019; v1 submitted 1 November, 2018; originally announced November 2018.

Comments: Accepted by NeurIPS 2019

Showing 1–14 of 14 results for author: You, R