-
Identifying Genetic Variants for Obesity Incorporating Prior Insights: Quantile Regression with Insight Fusion for Ultra-high Dimensional Data
Authors:
Jiantong Wang,
Heng Lian,
Yan Yu,
Heping Zhang
Abstract:
Obesity is widely recognized as a critical and pervasive health concern. We strive to identify important genetic risk factors from hundreds of thousands of single nucleotide polymorphisms (SNPs) for obesity. We propose and apply a novel Quantile Regression with Insight Fusion (QRIF) approach that can integrate insights from established studies or domain knowledge to simultaneously select variables…
▽ More
Obesity is widely recognized as a critical and pervasive health concern. We strive to identify important genetic risk factors from hundreds of thousands of single nucleotide polymorphisms (SNPs) for obesity. We propose and apply a novel Quantile Regression with Insight Fusion (QRIF) approach that can integrate insights from established studies or domain knowledge to simultaneously select variables and modeling for ultra-high dimensional genetic data, focusing on high conditional quantiles of body mass index (BMI) that are of most interest. We discover interesting new SNPs and shed new light on a comprehensive view of the underlying genetic risk factors for different levels of BMI. This may potentially pave the way for more precise and targeted treatment strategies. The QRIF approach intends to balance the trade-off between the prior insights and the observed data while being robust to potential false information. We further establish the desirable asymptotic properties under the challenging non-differentiable check loss functions via Huber loss approximation and nonconvex SCAD penalty via local linear approximation. Finally, we develop an efficient algorithm for the QRIF approach. Our simulation studies further demonstrate its effectiveness.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
Statistical inference for high-dimensional convoluted rank regression
Authors:
Leheng Cai,
Xu Guo,
Heng Lian,
Liping Zhu
Abstract:
High-dimensional penalized rank regression is a powerful tool for modeling high-dimensional data due to its robustness and estimation efficiency. However, the non-smoothness of the rank loss brings great challenges to the computation. To solve this critical issue, high-dimensional convoluted rank regression is recently proposed, and penalized convoluted rank regression estimators are introduced. H…
▽ More
High-dimensional penalized rank regression is a powerful tool for modeling high-dimensional data due to its robustness and estimation efficiency. However, the non-smoothness of the rank loss brings great challenges to the computation. To solve this critical issue, high-dimensional convoluted rank regression is recently proposed, and penalized convoluted rank regression estimators are introduced. However, these developed estimators cannot be directly used to make inference. In this paper, we investigate the inference problem of high-dimensional convoluted rank regression. We first establish estimation error bounds of penalized convoluted rank regression estimators under weaker conditions on the predictors. Based on the penalized convoluted rank regression estimators, we further introduce a debiased estimator. We then provide Bahadur representation for our proposed estimator. We further develop simultaneous inference procedures. A novel bootstrap procedure is proposed and its theoretical validity is also established. Finally, simulation and real data analysis are conducted to illustrate the merits of our proposed methods.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
HARIS: Human-Like Attention for Reference Image Segmentation
Authors:
Mengxi Zhang,
Heqing Lian,
Yiming Liu,
Jie Chen
Abstract:
Referring image segmentation (RIS) aims to locate the particular region corresponding to the language expression. Existing methods incorporate features from different modalities in a \emph{bottom-up} manner. This design may get some unnecessary image-text pairs, which leads to an inaccurate segmentation mask. In this paper, we propose a referring image segmentation method called HARIS, which intro…
▽ More
Referring image segmentation (RIS) aims to locate the particular region corresponding to the language expression. Existing methods incorporate features from different modalities in a \emph{bottom-up} manner. This design may get some unnecessary image-text pairs, which leads to an inaccurate segmentation mask. In this paper, we propose a referring image segmentation method called HARIS, which introduces the Human-Like Attention mechanism and uses the parameter-efficient fine-tuning (PEFT) framework. To be specific, the Human-Like Attention gets a \emph{feedback} signal from multi-modal features, which makes the network center on the specific objects and discard the irrelevant image-text pairs. Besides, we introduce the PEFT framework to preserve the zero-shot ability of pre-trained encoders. Extensive experiments on three widely used RIS benchmarks and the PhraseCut dataset demonstrate that our method achieves state-of-the-art performance and great zero-shot ability.
△ Less
Submitted 21 May, 2024; v1 submitted 17 May, 2024;
originally announced May 2024.
-
Distributed Iterative Hard Thresholding for Variable Selection in Tobit Models
Authors:
Changxin Yang,
Zhongyi Zhu,
Heng Lian
Abstract:
While extensive research has been conducted on high-dimensional data and on regression with left-censored responses, simultaneously addressing these complexities remains challenging, with only a few proposed methods available. In this paper, we utilize the Iterative Hard Thresholding (IHT) algorithm on the Tobit model in such a setting. Theoretical analysis demonstrates that our estimator converge…
▽ More
While extensive research has been conducted on high-dimensional data and on regression with left-censored responses, simultaneously addressing these complexities remains challenging, with only a few proposed methods available. In this paper, we utilize the Iterative Hard Thresholding (IHT) algorithm on the Tobit model in such a setting. Theoretical analysis demonstrates that our estimator converges with a near-optimal minimax rate. Additionally, we extend the method to a distributed setting, requiring only a few rounds of communication while retaining the estimation rate of the centralized version. Simulation results show that the IHT algorithm for the Tobit model achieves superior accuracy in predictions and subset selection, with the distributed estimator closely matching that of the centralized estimator. When applied to high-dimensional left-censored HIV viral load data, our method also exhibits similar superiority.
△ Less
Submitted 3 May, 2024;
originally announced May 2024.
-
Scaffold-BPE: Enhancing Byte Pair Encoding with Simple and Effective Scaffold Token Removal
Authors:
Haoran Lian,
Yizhe Xiong,
Jianwei Niu,
Shasha Mo,
Zhenpeng Su,
Zijia Lin,
Peng Liu,
Hui Chen,
Guiguang Ding
Abstract:
Byte Pair Encoding (BPE) serves as a foundation method for text tokenization in the Natural Language Processing (NLP) field. Despite its wide adoption, the original BPE algorithm harbors an inherent flaw: it inadvertently introduces a frequency imbalance for tokens in the text corpus. Since BPE iteratively merges the most frequent token pair in the text corpus while keeping all tokens that have be…
▽ More
Byte Pair Encoding (BPE) serves as a foundation method for text tokenization in the Natural Language Processing (NLP) field. Despite its wide adoption, the original BPE algorithm harbors an inherent flaw: it inadvertently introduces a frequency imbalance for tokens in the text corpus. Since BPE iteratively merges the most frequent token pair in the text corpus while keeping all tokens that have been merged in the vocabulary, it unavoidably holds tokens that primarily represent subwords of complete words and appear infrequently on their own in the text corpus. We term such tokens as Scaffold Tokens. Due to their infrequent appearance in the text corpus, Scaffold Tokens pose a learning imbalance issue for language models. To address that issue, we propose Scaffold-BPE, which incorporates a dynamic scaffold token removal mechanism by parameter-free, computation-light, and easy-to-implement modifications to the original BPE. This novel approach ensures the exclusion of low-frequency Scaffold Tokens from the token representations for the given texts, thereby mitigating the issue of frequency imbalance and facilitating model training. On extensive experiments across language modeling tasks and machine translation tasks, Scaffold-BPE consistently outperforms the original BPE, well demonstrating its effectiveness and superiority.
△ Less
Submitted 27 April, 2024;
originally announced April 2024.
-
Temporal Scaling Law for Large Language Models
Authors:
Yizhe Xiong,
Xiansheng Chen,
Xin Ye,
Hui Chen,
Zijia Lin,
Haoran Lian,
Zhenpeng Su,
Jianwei Niu,
Guiguang Ding
Abstract:
Recently, Large Language Models (LLMs) have been widely adopted in a wide range of tasks, leading to increasing attention towards the research on how scaling LLMs affects their performance. Existing works, termed Scaling Laws, have discovered that the final test loss of LLMs scales as power-laws with model size, computational budget, and dataset size. However, the temporal change of the test loss…
▽ More
Recently, Large Language Models (LLMs) have been widely adopted in a wide range of tasks, leading to increasing attention towards the research on how scaling LLMs affects their performance. Existing works, termed Scaling Laws, have discovered that the final test loss of LLMs scales as power-laws with model size, computational budget, and dataset size. However, the temporal change of the test loss of an LLM throughout its pre-training process remains unexplored, though it is valuable in many aspects, such as selecting better hyperparameters \textit{directly} on the target LLM. In this paper, we propose the novel concept of Temporal Scaling Law, studying how the test loss of an LLM evolves as the training steps scale up. In contrast to modeling the test loss as a whole in a coarse-grained manner, we break it down and dive into the fine-grained test loss of each token position, and further develop a dynamic hyperbolic-law. Afterwards, we derive the much more precise temporal scaling law by studying the temporal patterns of the parameters in the dynamic hyperbolic-law. Results on both in-distribution (ID) and out-of-distribution (OOD) validation datasets demonstrate that our temporal scaling law accurately predicts the test loss of LLMs across training steps. Our temporal scaling law has broad practical applications. First, it enables direct and efficient hyperparameter selection on the target LLM, such as data mixture proportions. Secondly, viewing the LLM pre-training dynamics from the token position granularity provides some insights to enhance the understanding of LLM pre-training.
△ Less
Submitted 16 June, 2024; v1 submitted 27 April, 2024;
originally announced April 2024.
-
RLEMMO: Evolutionary Multimodal Optimization Assisted By Deep Reinforcement Learning
Authors:
Hongqiao Lian,
Zeyuan Ma,
Hongshu Guo,
Ting Huang,
Yue-Jiao Gong
Abstract:
Solving multimodal optimization problems (MMOP) requires finding all optimal solutions, which is challenging in limited function evaluations. Although existing works strike the balance of exploration and exploitation through hand-crafted adaptive strategies, they require certain expert knowledge, hence inflexible to deal with MMOP with different properties. In this paper, we propose RLEMMO, a Meta…
▽ More
Solving multimodal optimization problems (MMOP) requires finding all optimal solutions, which is challenging in limited function evaluations. Although existing works strike the balance of exploration and exploitation through hand-crafted adaptive strategies, they require certain expert knowledge, hence inflexible to deal with MMOP with different properties. In this paper, we propose RLEMMO, a Meta-Black-Box Optimization framework, which maintains a population of solutions and incorporates a reinforcement learning agent for flexibly adjusting individual-level searching strategies to match the up-to-date optimization status, hence boosting the search performance on MMOP. Concretely, we encode landscape properties and evolution path information into each individual and then leverage attention networks to advance population information sharing. With a novel reward mechanism that encourages both quality and diversity, RLEMMO can be effectively trained using a policy gradient algorithm. The experimental results on the CEC2013 MMOP benchmark underscore the competitive optimization performance of RLEMMO against several strong baselines.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
PAVITS: Exploring Prosody-aware VITS for End-to-End Emotional Voice Conversion
Authors:
Tianhua Qi,
Wenming Zheng,
Cheng Lu,
Yuan Zong,
Hailun Lian
Abstract:
In this paper, we propose Prosody-aware VITS (PAVITS) for emotional voice conversion (EVC), aiming to achieve two major objectives of EVC: high content naturalness and high emotional naturalness, which are crucial for meeting the demands of human perception. To improve the content naturalness of converted audio, we have developed an end-to-end EVC architecture inspired by the high audio quality of…
▽ More
In this paper, we propose Prosody-aware VITS (PAVITS) for emotional voice conversion (EVC), aiming to achieve two major objectives of EVC: high content naturalness and high emotional naturalness, which are crucial for meeting the demands of human perception. To improve the content naturalness of converted audio, we have developed an end-to-end EVC architecture inspired by the high audio quality of VITS. By seamlessly integrating an acoustic converter and vocoder, we effectively address the common issue of mismatch between emotional prosody training and run-time conversion that is prevalent in existing EVC models. To further enhance the emotional naturalness, we introduce an emotion descriptor to model the subtle prosody variations of different speech emotions. Additionally, we propose a prosody predictor, which predicts prosody features from text based on the provided emotion label. Notably, we introduce a prosody alignment loss to establish a connection between latent prosody features from two distinct modalities, ensuring effective training. Experimental results show that the performance of PAVITS is superior to the state-of-the-art EVC methods. Speech Samples are available at https://jeremychee4.github.io/pavits4EVC/ .
△ Less
Submitted 3 March, 2024;
originally announced March 2024.
-
Speech Swin-Transformer: Exploring a Hierarchical Transformer with Shifted Windows for Speech Emotion Recognition
Authors:
Yong Wang,
Cheng Lu,
Hailun Lian,
Yan Zhao,
Björn Schuller,
Yuan Zong,
Wenming Zheng
Abstract:
Swin-Transformer has demonstrated remarkable success in computer vision by leveraging its hierarchical feature representation based on Transformer. In speech signals, emotional information is distributed across different scales of speech features, e.\,g., word, phrase, and utterance. Drawing above inspiration, this paper presents a hierarchical speech Transformer with shifted windows to aggregate…
▽ More
Swin-Transformer has demonstrated remarkable success in computer vision by leveraging its hierarchical feature representation based on Transformer. In speech signals, emotional information is distributed across different scales of speech features, e.\,g., word, phrase, and utterance. Drawing above inspiration, this paper presents a hierarchical speech Transformer with shifted windows to aggregate multi-scale emotion features for speech emotion recognition (SER), called Speech Swin-Transformer. Specifically, we first divide the speech spectrogram into segment-level patches in the time domain, composed of multiple frame patches. These segment-level patches are then encoded using a stack of Swin blocks, in which a local window Transformer is utilized to explore local inter-frame emotional information across frame patches of each segment patch. After that, we also design a shifted window Transformer to compensate for patch correlations near the boundaries of segment patches. Finally, we employ a patch merging operation to aggregate segment-level emotional features for hierarchical speech representation by expanding the receptive field of Transformer from frame-level to segment-level. Experimental results demonstrate that our proposed Speech Swin-Transformer outperforms the state-of-the-art methods.
△ Less
Submitted 19 January, 2024;
originally announced January 2024.
-
Improving Speaker-independent Speech Emotion Recognition Using Dynamic Joint Distribution Adaptation
Authors:
Cheng Lu,
Yuan Zong,
Hailun Lian,
Yan Zhao,
Björn Schuller,
Wenming Zheng
Abstract:
In speaker-independent speech emotion recognition, the training and testing samples are collected from diverse speakers, leading to a multi-domain shift challenge across the feature distributions of data from different speakers. Consequently, when the trained model is confronted with data from new speakers, its performance tends to degrade. To address the issue, we propose a Dynamic Joint Distribu…
▽ More
In speaker-independent speech emotion recognition, the training and testing samples are collected from diverse speakers, leading to a multi-domain shift challenge across the feature distributions of data from different speakers. Consequently, when the trained model is confronted with data from new speakers, its performance tends to degrade. To address the issue, we propose a Dynamic Joint Distribution Adaptation (DJDA) method under the framework of multi-source domain adaptation. DJDA firstly utilizes joint distribution adaptation (JDA), involving marginal distribution adaptation (MDA) and conditional distribution adaptation (CDA), to more precisely measure the multi-domain distribution shifts caused by different speakers. This helps eliminate speaker bias in emotion features, allowing for learning discriminative and speaker-invariant speech emotion features from coarse-level to fine-level. Furthermore, we quantify the adaptation contributions of MDA and CDA within JDA by using a dynamic balance factor based on $\mathcal{A}$-Distance, promoting to effectively handle the unknown distributions encountered in data from new speakers. Experimental results demonstrate the superior performance of our DJDA as compared to other state-of-the-art (SOTA) methods.
△ Less
Submitted 18 January, 2024;
originally announced January 2024.
-
Towards Domain-Specific Cross-Corpus Speech Emotion Recognition Approach
Authors:
Yan Zhao,
Yuan Zong,
Hailun Lian,
Cheng Lu,
Jingang Shi,
Wenming Zheng
Abstract:
Cross-corpus speech emotion recognition (SER) poses a challenge due to feature distribution mismatch, potentially degrading the performance of established SER methods. In this paper, we tackle this challenge by proposing a novel transfer subspace learning method called acoustic knowledgeguided transfer linear regression (AKTLR). Unlike existing approaches, which often overlook domain-specific know…
▽ More
Cross-corpus speech emotion recognition (SER) poses a challenge due to feature distribution mismatch, potentially degrading the performance of established SER methods. In this paper, we tackle this challenge by proposing a novel transfer subspace learning method called acoustic knowledgeguided transfer linear regression (AKTLR). Unlike existing approaches, which often overlook domain-specific knowledge related to SER and simply treat cross-corpus SER as a generic transfer learning task, our AKTLR method is built upon a well-designed acoustic knowledge-guided dual sparsity constraint mechanism. This mechanism emphasizes the potential of minimalistic acoustic parameter feature sets to alleviate classifier overadaptation, which is empirically validated acoustic knowledge in SER, enabling superior generalization in cross-corpus SER tasks compared to using large feature sets. Through this mechanism, we extend a simple transfer linear regression model to AKTLR. This extension harnesses its full capability to seek emotiondiscriminative and corpus-invariant features from established acoustic parameter feature sets used for describing speech signals across two scales: contributive acoustic parameter groups and constituent elements within each contributive group. Our proposed method is evaluated through extensive cross-corpus SER experiments on three widely-used speech emotion corpora: EmoDB, eNTERFACE, and CASIA. The results confirm the effectiveness and superior performance of our method, outperforming recent state-of-the-art transfer subspace learning and deep transfer learning-based cross-corpus SER methods. Furthermore, our work provides experimental evidence supporting the feasibility and superiority of incorporating domain-specific knowledge into the transfer learning model to address cross-corpus SER tasks.
△ Less
Submitted 11 December, 2023;
originally announced December 2023.
-
Observation of strong attenuation within the photonic band gap of multiconnected networks
Authors:
Pengbo Zhu,
Runkai Chen,
Xiangbo Yang,
Yanglong Fan,
Huada Lian,
Zhen-Yu Wang
Abstract:
We theoretically and experimentally study a photonic band gap (PBG) material made of coaxial cables. The coaxial cables are waveguides for the electromagnetic waves and provide paths for direct wave interference within the material. Using multiconnected coaxial cables to form a unit cell, we realize PBGs via (i) direct interference between the waveguides within each cell and (ii) scattering among…
▽ More
We theoretically and experimentally study a photonic band gap (PBG) material made of coaxial cables. The coaxial cables are waveguides for the electromagnetic waves and provide paths for direct wave interference within the material. Using multiconnected coaxial cables to form a unit cell, we realize PBGs via (i) direct interference between the waveguides within each cell and (ii) scattering among different cells. We systematically investigate the transmission of EM waves in our PBG materials and discuss the mechanism of band gap formation. We observe experimentally for the first time the wide band gap with strong attenuation caused by direct destructive interference.
△ Less
Submitted 28 September, 2023;
originally announced October 2023.
-
Layer-Adapted Implicit Distribution Alignment Networks for Cross-Corpus Speech Emotion Recognition
Authors:
Yan Zhao,
Yuan Zong,
Jincen Wang,
Hailun Lian,
Cheng Lu,
Li Zhao,
Wenming Zheng
Abstract:
In this paper, we propose a new unsupervised domain adaptation (DA) method called layer-adapted implicit distribution alignment networks (LIDAN) to address the challenge of cross-corpus speech emotion recognition (SER). LIDAN extends our previous ICASSP work, deep implicit distribution alignment networks (DIDAN), whose key contribution lies in the introduction of a novel regularization term called…
▽ More
In this paper, we propose a new unsupervised domain adaptation (DA) method called layer-adapted implicit distribution alignment networks (LIDAN) to address the challenge of cross-corpus speech emotion recognition (SER). LIDAN extends our previous ICASSP work, deep implicit distribution alignment networks (DIDAN), whose key contribution lies in the introduction of a novel regularization term called implicit distribution alignment (IDA). This term allows DIDAN trained on source (training) speech samples to remain applicable to predicting emotion labels for target (testing) speech samples, regardless of corpus variance in cross-corpus SER. To further enhance this method, we extend IDA to layer-adapted IDA (LIDA), resulting in LIDAN. This layer-adpated extention consists of three modified IDA terms that consider emotion labels at different levels of granularity. These terms are strategically arranged within different fully connected layers in LIDAN, aligning with the increasing emotion-discriminative abilities with respect to the layer depth. This arrangement enables LIDAN to more effectively learn emotion-discriminative and corpus-invariant features for SER across various corpora compared to DIDAN. It is also worthy to mention that unlike most existing methods that rely on estimating statistical moments to describe pre-assumed explicit distributions, both IDA and LIDA take a different approach. They utilize an idea of target sample reconstruction to directly bridge the feature distribution gap without making assumptions about their distribution type. As a result, DIDAN and LIDAN can be viewed as implicit cross-corpus SER methods. To evaluate LIDAN, we conducted extensive cross-corpus SER experiments on EmoDB, eNTERFACE, and CASIA corpora. The experimental results demonstrate that LIDAN surpasses recent state-of-the-art explicit unsupervised DA methods in tackling cross-corpus SER tasks.
△ Less
Submitted 5 October, 2023;
originally announced October 2023.
-
Time-Frequency Transformer: A Novel Time Frequency Joint Learning Method for Speech Emotion Recognition
Authors:
Yong Wang,
Cheng Lu,
Yuan Zong,
Hailun Lian,
Yan Zhao,
Sunan Li
Abstract:
In this paper, we propose a novel time-frequency joint learning method for speech emotion recognition, called Time-Frequency Transformer. Its advantage is that the Time-Frequency Transformer can excavate global emotion patterns in the time-frequency domain of speech signal while modeling the local emotional correlations in the time domain and frequency domain respectively. For the purpose, we firs…
▽ More
In this paper, we propose a novel time-frequency joint learning method for speech emotion recognition, called Time-Frequency Transformer. Its advantage is that the Time-Frequency Transformer can excavate global emotion patterns in the time-frequency domain of speech signal while modeling the local emotional correlations in the time domain and frequency domain respectively. For the purpose, we first design a Time Transformer and Frequency Transformer to capture the local emotion patterns between frames and inside frequency bands respectively, so as to ensure the integrity of the emotion information modeling in both time and frequency domains. Then, a Time-Frequency Transformer is proposed to mine the time-frequency emotional correlations through the local time-domain and frequency-domain emotion features for learning more discriminative global speech emotion representation. The whole process is a time-frequency joint learning process implemented by a series of Transformer models. Experiments on IEMOCAP and CASIA databases indicate that our proposed method outdoes the state-of-the-art methods.
△ Less
Submitted 28 August, 2023;
originally announced August 2023.
-
Non-commutative resolutions as mirrors of singular Calabi--Yau varieties
Authors:
Tsung-Ju Lee,
Bong H. Lian,
Mauricio Romo
Abstract:
It has been conjectured that the hemisphere partition function arXiv:1308.2217, arXiv:1308.2438 in a gauged linear sigma model (GLSM) computes the central charge arXiv:math/0212237 of an object in the bounded derived category of coherent sheaves for Calabi--Yau (CY) manifolds. There is also evidence in arXiv:alg-geom/ 9511001, arXiv:hep-th/0007071. On the other hand, non-commutative resolutions of…
▽ More
It has been conjectured that the hemisphere partition function arXiv:1308.2217, arXiv:1308.2438 in a gauged linear sigma model (GLSM) computes the central charge arXiv:math/0212237 of an object in the bounded derived category of coherent sheaves for Calabi--Yau (CY) manifolds. There is also evidence in arXiv:alg-geom/ 9511001, arXiv:hep-th/0007071. On the other hand, non-commutative resolutions of singular CY varieties have been studied in the context of abelian GLSMs arXiv:0709.3855. In this paper, we study an analogous construction of abelian GLSMs for non-commutative resolutions and propose they can be used to study a class of recently discovered mirror pairs of singular CY varieties. Our main result shows that the hemisphere partition functions (a.k.a.~$A$-periods) in the new GLSM are in fact period integrals (a.k.a.~$B$-periods) of the singular CY varieties. We conjecture that the two are completely equivalent: $B$-periods are the same as $A$-periods. We give some examples to support this conjecture and formulate some expected homological mirror symmetry (HMS) relation between the GLSM theory and the CY. As shown in arXiv:2003.07148, the $B$-periods in this case are precisely given by a certain fractional version of the $B$-series of arXiv:alg-geom/9511001. Since a hemisphere partition function is defined as a contour integral in a cone in the complexified secondary fan (or FI-theta parameter space) arXiv:1308.2438, it can be reduced to a sum of residues (by theorems of Passare-Tsikh-Zhdanov and Tsikh-Zhdanov). Our conjecture shows that this residue sum may now be amenable to computations in terms of the $B$-series.
△ Less
Submitted 5 July, 2023;
originally announced July 2023.
-
Learning Local to Global Feature Aggregation for Speech Emotion Recognition
Authors:
Cheng Lu,
Hailun Lian,
Wenming Zheng,
Yuan Zong,
Yan Zhao,
Sunan Li
Abstract:
Transformer has emerged in speech emotion recognition (SER) at present. However, its equal patch division not only damages frequency information but also ignores local emotion correlations across frames, which are key cues to represent emotion. To handle the issue, we propose a Local to Global Feature Aggregation learning (LGFA) for SER, which can aggregate longterm emotion correlations at differe…
▽ More
Transformer has emerged in speech emotion recognition (SER) at present. However, its equal patch division not only damages frequency information but also ignores local emotion correlations across frames, which are key cues to represent emotion. To handle the issue, we propose a Local to Global Feature Aggregation learning (LGFA) for SER, which can aggregate longterm emotion correlations at different scales both inside frames and segments with entire frequency information to enhance the emotion discrimination of utterance-level speech features. For this purpose, we nest a Frame Transformer inside a Segment Transformer. Firstly, Frame Transformer is designed to excavate local emotion correlations between frames for frame embeddings. Then, the frame embeddings and their corresponding segment features are aggregated as different-level complements to be fed into Segment Transformer for learning utterance-level global emotion features. Experimental results show that the performance of LGFA is superior to the state-of-the-art methods.
△ Less
Submitted 2 June, 2023;
originally announced June 2023.
-
Deep Implicit Distribution Alignment Networks for Cross-Corpus Speech Emotion Recognition
Authors:
Yan Zhao,
Jincen Wang,
Yuan Zong,
Wenming Zheng,
Hailun Lian,
Li Zhao
Abstract:
In this paper, we propose a novel deep transfer learning method called deep implicit distribution alignment networks (DIDAN) to deal with cross-corpus speech emotion recognition (SER) problem, in which the labeled training (source) and unlabeled testing (target) speech signals come from different corpora. Specifically, DIDAN first adopts a simple deep regression network consisting of a set of conv…
▽ More
In this paper, we propose a novel deep transfer learning method called deep implicit distribution alignment networks (DIDAN) to deal with cross-corpus speech emotion recognition (SER) problem, in which the labeled training (source) and unlabeled testing (target) speech signals come from different corpora. Specifically, DIDAN first adopts a simple deep regression network consisting of a set of convolutional and fully connected layers to directly regress the source speech spectrums into the emotional labels such that the proposed DIDAN can own the emotion discriminative ability. Then, such ability is transferred to be also applicable to the target speech samples regardless of corpus variance by resorting to a well-designed regularization term called implicit distribution alignment (IDA). Unlike widely-used maximum mean discrepancy (MMD) and its variants, the proposed IDA absorbs the idea of sample reconstruction to implicitly align the distribution gap, which enables DIDAN to learn both emotion discriminative and corpus invariant features from speech spectrums. To evaluate the proposed DIDAN, extensive cross-corpus SER experiments on widely-used speech emotion corpora are carried out. Experimental results show that the proposed DIDAN can outperform lots of recent state-of-the-art methods in coping with the cross-corpus SER tasks.
△ Less
Submitted 17 February, 2023;
originally announced February 2023.
-
Speech Emotion Recognition via an Attentive Time-Frequency Neural Network
Authors:
Cheng Lu,
Wenming Zheng,
Hailun Lian,
Yuan Zong,
Chuangao Tang,
Sunan Li,
Yan Zhao
Abstract:
Spectrogram is commonly used as the input feature of deep neural networks to learn the high(er)-level time-frequency pattern of speech signal for speech emotion recognition (SER). \textcolor{black}{Generally, different emotions correspond to specific energy activations both within frequency bands and time frames on spectrogram, which indicates the frequency and time domains are both essential to r…
▽ More
Spectrogram is commonly used as the input feature of deep neural networks to learn the high(er)-level time-frequency pattern of speech signal for speech emotion recognition (SER). \textcolor{black}{Generally, different emotions correspond to specific energy activations both within frequency bands and time frames on spectrogram, which indicates the frequency and time domains are both essential to represent the emotion for SER. However, recent spectrogram-based works mainly focus on modeling the long-term dependency in time domain, leading to these methods encountering the following two issues: (1) neglecting to model the emotion-related correlations within frequency domain during the time-frequency joint learning; (2) ignoring to capture the specific frequency bands associated with emotions.} To cope with the issues, we propose an attentive time-frequency neural network (ATFNN) for SER, including a time-frequency neural network (TFNN) and time-frequency attention. Specifically, aiming at the first issue, we design a TFNN with a frequency-domain encoder (F-Encoder) based on the Transformer encoder and a time-domain encoder (T-Encoder) based on the Bidirectional Long Short-Term Memory (Bi-LSTM). The F-Encoder and T-Encoder model the correlations within frequency bands and time frames, respectively, and they are embedded into a time-frequency joint learning strategy to obtain the time-frequency patterns for speech emotions. Moreover, to handle the second issue, we also adopt time-frequency attention with a frequency-attention network (F-Attention) and a time-attention network (T-Attention) to focus on the emotion-related frequency band ranges and time frame ranges, which can enhance the discriminability of speech emotion features.
△ Less
Submitted 22 October, 2022;
originally announced October 2022.
-
Flexible Alignment Super-Resolution Network for Multi-Contrast MRI
Authors:
Yiming Liu,
Mengxi Zhang,
Weiqin Zhang,
Bo Jiang,
Bo Hou,
Dan Liu,
Jie Chen,
Heqing Lian
Abstract:
Magnetic resonance imaging plays an essential role in clinical diagnosis by acquiring the structural information of biological tissue. Recently, many multi-contrast MRI super-resolution networks achieve good effects. However, most studies ignore the impact of the inappropriate foreground scale and patch size of multi-contrast MRI, which probably leads to inappropriate feature alignment. To tackle…
▽ More
Magnetic resonance imaging plays an essential role in clinical diagnosis by acquiring the structural information of biological tissue. Recently, many multi-contrast MRI super-resolution networks achieve good effects. However, most studies ignore the impact of the inappropriate foreground scale and patch size of multi-contrast MRI, which probably leads to inappropriate feature alignment. To tackle this problem, we propose the Flexible Alignment Super-Resolution Network (FASR-Net) for multi-contrast MRI Super-Resolution. The Flexible Alignment module of FASR-Net consists of two modules for feature alignment. (1) The Single-Multi Pyramid Alignment(S-A) module solves the situation where low-resolution (LR) images and reference (Ref) images have different scales. (2) The Multi-Multi Pyramid Alignment(M-A) module solves the situation where LR and Ref images have the same scale. Besides, we propose the Cross-Hierarchical Progressive Fusion (CHPF) module aiming at fusing the features effectively, further improving the image quality. Compared with other state-of-the-art methods, FASR-net achieves the most competitive results on FastMRI and IXI datasets. Our code will be available at \href{https://github.com/yimingliu123/FASR-Net}{https://github.com/yimingliu123/FASR-Net}.
△ Less
Submitted 8 January, 2023; v1 submitted 7 October, 2022;
originally announced October 2022.
-
Online Deep Learning from Doubly-Streaming Data
Authors:
Heng Lian,
John Scovil Atwood,
Bojian Hou,
Jian Wu,
Yi He
Abstract:
This paper investigates a new online learning problem with doubly-streaming data, where the data streams are described by feature spaces that constantly evolve, with new features emerging and old features fading away. The challenges of this problem are two folds: 1) Data samples ceaselessly flowing in may carry shifted patterns over time, requiring learners to update hence adapt on-the-fly. 2) New…
▽ More
This paper investigates a new online learning problem with doubly-streaming data, where the data streams are described by feature spaces that constantly evolve, with new features emerging and old features fading away. The challenges of this problem are two folds: 1) Data samples ceaselessly flowing in may carry shifted patterns over time, requiring learners to update hence adapt on-the-fly. 2) Newly emerging features are described by very few samples, resulting in weak learners that tend to make error predictions. A plausible idea to overcome the challenges is to establish relationship between the pre-and-post evolving feature spaces, so that an online learner can leverage the knowledge learned from the old features to better the learning performance on the new features. Unfortunately, this idea does not scale up to high-dimensional media streams with complex feature interplay, which suffers an tradeoff between onlineness (biasing shallow learners) and expressiveness(requiring deep learners). Motivated by this, we propose a novel OLD^3S paradigm, where a shared latent subspace is discovered to summarize information from the old and new feature spaces, building intermediate feature mapping relationship. A key trait of OLD^3S is to treat the model capacity as a learnable semantics, yields optimal model depth and parameters jointly, in accordance with the complexity and non-linearity of the input data streams in an online fashion. Both theoretical analyses and empirical studies substantiate the viability and effectiveness of our proposal.
△ Less
Submitted 14 September, 2022; v1 submitted 25 April, 2022;
originally announced April 2022.
-
Vertex Algebras and Commutative Algebras
Authors:
Bong H. Lian,
Andrew R. Linshaw
Abstract:
This paper begins with a brief survey of the period prior to and soon after the creation of the theory of vertex operator algebras (VOAs). This survey is intended to highlight some of the important developments leading to the creation of VOA theory. The paper then proceeds to describe progress made in the field of VOAs in the last 15 years which is based on fruitful analogies and connections betwe…
▽ More
This paper begins with a brief survey of the period prior to and soon after the creation of the theory of vertex operator algebras (VOAs). This survey is intended to highlight some of the important developments leading to the creation of VOA theory. The paper then proceeds to describe progress made in the field of VOAs in the last 15 years which is based on fruitful analogies and connections between VOAs and commutative algebras. First, there are several functors from VOAs to commutative algebras that allow methods from commutative algebra to be used to solve VOA problems. To illustrate this, we present a method for describing orbifolds and cosets using methods of classical invariant theory. This was essential in the recent solution of a conjecture of Gaiotto and Rapčák that is of current interest in physics. We also recast some old conjectures in the subject in terms of commutative algebra and give some generalizations of these conjectures. We also give an overview of the theory of topological VOAs (TVOAs), with applications to BRST cohomology theory and conformal string theory, based on work in the 90's. We construct a functor from TVOAs to Batalin-Vilkovisky algebras -- supercommutative algebras equipped with a certain odd Poisson structure realized by a second order differential operator -- and present a number of interesting applications. This paper is based in part on the lecture given by the first author at the Harvard CMSA Math-Science Literature Lecture Series on May 22, 2020.
△ Less
Submitted 7 July, 2021;
originally announced July 2021.
-
Nonparametric Quantile Regression for Homogeneity Pursuit in Panel Data Models
Authors:
Xiaoyu Zhang,
Di Wang,
Heng Lian,
Guodong Li
Abstract:
Many panel data have the latent subgroup effect on individuals, and it is important to correctly identify these groups since the efficiency of resulting estimators can be improved significantly by pooling the information of individuals within each group. However, the currently assumed parametric and semiparametric relationship between the response and predictors may be misspecified, which leads to…
▽ More
Many panel data have the latent subgroup effect on individuals, and it is important to correctly identify these groups since the efficiency of resulting estimators can be improved significantly by pooling the information of individuals within each group. However, the currently assumed parametric and semiparametric relationship between the response and predictors may be misspecified, which leads to a wrong grouping result, and the nonparametric approach hence can be considered to avoid such mistakes. Moreover, the response may depend on predictors in different ways at various quantile levels, and the corresponding grouping structure may also vary. To tackle these problems, this article proposes a nonparametric quantile regression method for homogeneity pursuit in panel data models with individual effects, and a pairwise fused penalty is used to automatically select the number of groups. The asymptotic properties are established, and an ADMM algorithm is also developed. The finite sample performance is evaluated by simulation experiments, and the usefulness of the proposed methodology is further illustrated by an empirical example.
△ Less
Submitted 22 August, 2022; v1 submitted 3 May, 2021;
originally announced May 2021.
-
Exact polarization energy for clusters of contacting dielectrics
Authors:
Huada Lian,
Jian Qin
Abstract:
The induced surface charges appear to diverge when dielectric particles form close contacts. Resolving this singularity numerically is prohibitively expensive because high spatial resolution is needed. We show that the strength of this singularity is logarithmic in both inter-particle separation and dielectric permittivity. A regularization scheme is proposed to isolate this singularity, and to ca…
▽ More
The induced surface charges appear to diverge when dielectric particles form close contacts. Resolving this singularity numerically is prohibitively expensive because high spatial resolution is needed. We show that the strength of this singularity is logarithmic in both inter-particle separation and dielectric permittivity. A regularization scheme is proposed to isolate this singularity, and to calculate the exact cohesive energy for clusters of contacting dielectric particles. The results indicate that polarization energy stabilizes clusters of open configurations when permittivity is high, in agreement with the behavior of conducting particles, but stabilizes the compact configurations when permittivity is low.
△ Less
Submitted 8 April, 2021;
originally announced April 2021.
-
On Calabi--Yau fractional complete intersections
Authors:
Tsung-Ju Lee,
Bong H. Lian,
Shing-Tung Yau
Abstract:
In this article, we study mirror symmetry for pairs of singular Calabi--Yau manifolds which are double covers of toric manifolds. Their period integrals can be seen as certain `fractional' analogues of those of ordinary complete intersections. This new structure can then be used to solve their Riemann--Hilbert problems. The latter can then be used to answer definitively questions about mirror symm…
▽ More
In this article, we study mirror symmetry for pairs of singular Calabi--Yau manifolds which are double covers of toric manifolds. Their period integrals can be seen as certain `fractional' analogues of those of ordinary complete intersections. This new structure can then be used to solve their Riemann--Hilbert problems. The latter can then be used to answer definitively questions about mirror symmetry for this class of Calabi--Yau manifolds.
△ Less
Submitted 15 February, 2022; v1 submitted 10 August, 2020;
originally announced August 2020.
-
Mirror symmetry for double cover Calabi--Yau varieties
Authors:
Shinobu Hosono,
Tsung-Ju Lee,
Bong H. Lian,
Shing-Tung Yau
Abstract:
The presented paper is a continuation of the series of papers arXiv:1810.00606 and arXiv:1903.09373. In this paper, utilizing Batyrev and Borisov's duality construction on nef-partitions, we generalize the recipe in arXiv:1810.00606 and arXiv:1903.09373 to construct a pair of singular double cover Calabi--Yau varieties $(Y,Y^{\vee})$ over toric manifolds and compute their topological Euler charact…
▽ More
The presented paper is a continuation of the series of papers arXiv:1810.00606 and arXiv:1903.09373. In this paper, utilizing Batyrev and Borisov's duality construction on nef-partitions, we generalize the recipe in arXiv:1810.00606 and arXiv:1903.09373 to construct a pair of singular double cover Calabi--Yau varieties $(Y,Y^{\vee})$ over toric manifolds and compute their topological Euler characteristics and Hodge numbers. In the $3$-dimensional cases, we show that $(Y,Y^{\vee})$ forms a topological mirror pair, i.e., $h^{p,q}(Y)=h^{3-p,q}(Y^{\vee})$ for all $p,q$.
△ Less
Submitted 30 November, 2020; v1 submitted 16 March, 2020;
originally announced March 2020.
-
Impurity-pinned incommensurate charge density wave and local phonon excitations in 2H-NbS2
Authors:
Chenhaoping Wen,
Yuan Xie,
Yueshen Wu,
Shiwei Shen,
Pengfei Kong,
Hailong Lian,
Jun Li,
Hui Xing,
Shichao Yan
Abstract:
Here we report a scanning tunneling microscopy (STM) and spectroscopy (STS) study in the superconducting state of 2H-NbS2. We directly visualize the existence of incommensurate charge density wave (CDW) that is pinned by atomic impurities. In strong tunneling conditions, the incommensurate CDW is de-pinned from impurities by the electric field from STM tip. We perform STM-based inelastic tunneling…
▽ More
Here we report a scanning tunneling microscopy (STM) and spectroscopy (STS) study in the superconducting state of 2H-NbS2. We directly visualize the existence of incommensurate charge density wave (CDW) that is pinned by atomic impurities. In strong tunneling conditions, the incommensurate CDW is de-pinned from impurities by the electric field from STM tip. We perform STM-based inelastic tunneling spectroscopy (IETS) to detect phonon excitations in 2H-NbS2 and measure the influence of atomic impurities on local phonon excitations. In comparison with the calculated vibrational density of states in 2H-NbS2, we find two branches of phonon excitations which correspond to the vibrations of Nb ions and S ions, and the strength of the local phonon excitations is insensitive to the atomic impurities. Our results demonstrate the coexistence of incommensurate CDW and superconductivity in 2H-NbS2, and open the way of detecting atomic-scale phonon excitations in transition metal dichalcogenides with STM-based IETS.
△ Less
Submitted 13 June, 2020; v1 submitted 25 February, 2020;
originally announced February 2020.
-
High-dimensional vector autoregressive time series modeling via tensor decomposition
Authors:
Di Wang,
Yao Zheng,
Heng Lian,
Guodong Li
Abstract:
The classical vector autoregressive model is a fundamental tool for multivariate time series analysis. However, it involves too many parameters when the number of time series and lag order are even moderately large. This paper proposes to rearrange the transition matrices of the model into a tensor form such that the parameter space can be restricted along three directions simultaneously via tenso…
▽ More
The classical vector autoregressive model is a fundamental tool for multivariate time series analysis. However, it involves too many parameters when the number of time series and lag order are even moderately large. This paper proposes to rearrange the transition matrices of the model into a tensor form such that the parameter space can be restricted along three directions simultaneously via tensor decomposition. In contrast, the reduced-rank regression method can restrict the parameter space in only one direction. Besides achieving substantial dimension reduction, the proposed model is interpretable from the factor modeling perspective. Moreover, to handle high-dimensional time series, this paper considers imposing sparsity on factor matrices to improve the model interpretability and estimation efficiency, which leads to a sparsity-inducing estimator. For the low-dimensional case, we derive asymptotic properties of the proposed least squares estimator and introduce an alternating least squares algorithm. For the high-dimensional case, we establish non-asymptotic properties of the sparsity-inducing estimator and propose an ADMM algorithm for regularized estimation. Simulation experiments and a real data example demonstrate the advantages of the proposed approach over various existing methods.
△ Less
Submitted 3 November, 2020; v1 submitted 14 September, 2019;
originally announced September 2019.
-
Lithium ion intercalation in thin crystals of hexagonal TaSe2 gated by a polymer electrolyte
Authors:
Yueshen Wu,
Hailong Lian,
Jiaming He,
Jinyu Liu,
Shun Wang,
Hui Xing,
Zhiqiang Mao,
Ying Liu
Abstract:
Ionic liquid gating has been used to modify properties of layered transition metal dichalcogenides (TMDCs), including two-dimensional (2D) crystals of TMDCs used extensively recently in the device work, which has led to observations of properties not seen in the bulk. The main effect comes from the electrostatic gating due to strong electric field at the interface. In addition, ionic liquid gating…
▽ More
Ionic liquid gating has been used to modify properties of layered transition metal dichalcogenides (TMDCs), including two-dimensional (2D) crystals of TMDCs used extensively recently in the device work, which has led to observations of properties not seen in the bulk. The main effect comes from the electrostatic gating due to strong electric field at the interface. In addition, ionic liquid gating also leads to ion intercalation when the ion size of gate electrolyte is small compared to the interlayer spacing of TMDCs. However, the microscopic processes of ion intercalation have rarely been explored in layered TMDCs. Here, we employed a technique combining photolithography device fabrication and electrical transport measurements on the thin crystals of hexagonal TaSe2 using multiple channel devices gated by a polymer electrolyte LiClO4/PEO. The gate voltage and time dependent source-drain resistances of these thin crystals were used to obtain information on the intercalation process, the effect of ion intercalation, and the correlation between the ion occupation of allowed interstitial sites and the device characteristics. We found a gate voltage controlled modulation of the charge density waves and scattering rate of charge carriers. Our work suggests that ion intercalation can be a useful tool for layered materials engineering and 2D crystal device design.
△ Less
Submitted 24 October, 2018;
originally announced October 2018.
-
K3 surfaces from configurations of six lines in $\mathbb{P}^2$ and mirror symmetry I
Authors:
Shinobu Hosono,
Bong H. Lian,
Hiromichi Takagi,
Shing-Tung Yau
Abstract:
From the viewpoint of mirror symmetry, we revisit the hypergeometric system $E(3,6)$ for a family of K3 surfaces. We construct a good resolution of the Baily-Borel-Satake compactification of its parameter space, which admits special boundary points (LCSLs) given by normal crossing divisors. We find local isomorphisms between the $E(3,6)$ systems and the associated GKZ systems defined locally on th…
▽ More
From the viewpoint of mirror symmetry, we revisit the hypergeometric system $E(3,6)$ for a family of K3 surfaces. We construct a good resolution of the Baily-Borel-Satake compactification of its parameter space, which admits special boundary points (LCSLs) given by normal crossing divisors. We find local isomorphisms between the $E(3,6)$ systems and the associated GKZ systems defined locally on the parameter space and cover the entire parameter space. Parallel structures are conjectured in general for hypergeometric system $E(n,m)$ on Grassmannians. Local solutions and mirror symmetry will be described in a companion paper \cite{HLTYpartII}, where we introduce a K3 analogue of the elliptic lambda function in terms of genus two theta functions.
△ Less
Submitted 22 March, 2019; v1 submitted 1 October, 2018;
originally announced October 2018.
-
A General Framework For Frequentist Model Averaging
Authors:
Priyam Mitra,
Heng Lian,
Ritwik Mitra,
Hua Liang,
Min-ge Xie
Abstract:
Model selection strategies have been routinely employed to determine a model for data analysis in statistics, and further study and inference then often proceed as though the selected model were the true model that were known a priori. This practice does not account for the uncertainty introduced by the selection process and the fact that the selected model can possibly be a wrong one. Model avera…
▽ More
Model selection strategies have been routinely employed to determine a model for data analysis in statistics, and further study and inference then often proceed as though the selected model were the true model that were known a priori. This practice does not account for the uncertainty introduced by the selection process and the fact that the selected model can possibly be a wrong one. Model averaging approaches try to remedy this issue by combining estimators for a set of candidate models. Specifically, instead of deciding which model is the 'right' one, a model averaging approach suggests to fit a set of candidate models and average over the estimators using certain data adaptive weights. In this paper we establish a general frequentist model averaging framework that does not set any restrictions on the set of candidate models. It greatly broadens the scope of the existing methodologies under the frequentist model averaging development. Assuming the data is from an unknown model, we derive the model averaging estimator and study its limiting distributions and related predictions while taking possible modeling biases into account. We propose a set of optimal weights to combine the individual estimators so that the expected mean squared error of the average estimator is minimized. Simulation studies are conducted to compare the performance of the estimator with that of the existing methods. The results show the benefits of the proposed approach over traditional model selection approaches as well as existing model averaging methods.
△ Less
Submitted 9 February, 2018;
originally announced February 2018.
-
Band dependence of charge density wave in quasi-one-dimensional Ta2NiSe7 probed by orbital magnetoresistance
Authors:
Jiaming He,
Yiran Zhang,
Libin Wen,
Yusen Yang,
Jinyu Liu,
Yueshen Wu,
Hailong Lian,
Hui Xing,
Shun Wang,
Zhiqiang Mao,
Ying Liu
Abstract:
Ta2NiSe7 is a quasi-one-dimensional (quasi-1D) transition-metal chalcogenide with Ta and Ni chain structure. An incommensurate charge-density wave (CDW) in this quasi-1D structure was well studied previously using tunnelling spectrum, X-ray and electron diffraction, whereas its transport property and the relation to the underlying electronic states remain to be explored. Here we report our results…
▽ More
Ta2NiSe7 is a quasi-one-dimensional (quasi-1D) transition-metal chalcogenide with Ta and Ni chain structure. An incommensurate charge-density wave (CDW) in this quasi-1D structure was well studied previously using tunnelling spectrum, X-ray and electron diffraction, whereas its transport property and the relation to the underlying electronic states remain to be explored. Here we report our results of magnetoresistance (MR) on Ta2NiSe7. A breakdown of the Kohler's rule is found upon entering the CDW state. Concomitantly, a clear change of curvature in the field dependence of MR is observed. We show that the curvature change is well described by two-band orbital MR, with the hole density being strongly suppressed in the CDW state, indicating that the $p$ orbitals from Se atoms dominate the change in transport through the CDW transition.
△ Less
Submitted 6 January, 2018;
originally announced January 2018.
-
Differential zeros of period integrals and generalized hypergeometric functions
Authors:
Jingyue Chen,
An Huang,
Bong H. Lian,
Shing-Tung Yau
Abstract:
In this paper, we study the zero loci of local systems of the form $δΠ$, where $Π$ is the period sheaf of the universal family of CY hypersurfaces in a suitable ambient space $X$, and $δ$ is a given differential operator on the space of sections $V^\vee=Γ(X,K_X^{-1})$. Using earlier results of three of the authors and their collaborators, we give several different descriptions of the zero locus of…
▽ More
In this paper, we study the zero loci of local systems of the form $δΠ$, where $Π$ is the period sheaf of the universal family of CY hypersurfaces in a suitable ambient space $X$, and $δ$ is a given differential operator on the space of sections $V^\vee=Γ(X,K_X^{-1})$. Using earlier results of three of the authors and their collaborators, we give several different descriptions of the zero locus of $δΠ$. As applications, we prove that the locus is algebraic and in some cases, non-empty. We also give an explicit way to compute the polynomial defining equations of the locus in some cases. This description gives rise to a natural stratification to the zero locus.
△ Less
Submitted 4 November, 2018; v1 submitted 3 September, 2017;
originally announced September 2017.
-
Debiased distributed learning for sparse partial linear models in high dimensions
Authors:
Shaogao Lv,
Heng Lian
Abstract:
Although various distributed machine learning schemes have been proposed recently for pure linear models and fully nonparametric models, little attention has been paid on distributed optimization for semi-paramemetric models with multiple-level structures (e.g. sparsity, linearity and nonlinearity). To address these issues, the current paper proposes a new communication-efficient distributed learn…
▽ More
Although various distributed machine learning schemes have been proposed recently for pure linear models and fully nonparametric models, little attention has been paid on distributed optimization for semi-paramemetric models with multiple-level structures (e.g. sparsity, linearity and nonlinearity). To address these issues, the current paper proposes a new communication-efficient distributed learning algorithm for partially sparse linear models with an increasing number of features. The proposed method is based on the classical divide and conquer strategy for handing big data and each sub-method defined on each subsample consists of a debiased estimation of the double-regularized least squares approach. With the proposed method, we theoretically prove that our global parametric estimator can achieve optimal parametric rate in our semi-parametric model given an appropriate partition on the total data. Specially, the choice of data partition relies on the underlying smoothness of the nonparametric component, but it is adaptive to the sparsity parameter. Even under the non-distributed setting, we develop a new and easily-read proof for optimal estimation of the parametric error in high dimensional partial linear model. Finally, several simulated experiments are implemented to indicate comparable empirical performance of our debiased technique under the distributed setting.
△ Less
Submitted 3 November, 2019; v1 submitted 17 August, 2017;
originally announced August 2017.
-
Homogeneity Pursuit in Single Index Models based Panel Data Analysis
Authors:
Heng Lian,
Xinghao Qiao,
Wenyang Zhang
Abstract:
Panel data analysis is an important topic in statistics and econometrics. Traditionally, in panel data analysis, all individuals are assumed to share the same unknown parameters, e.g. the same coefficients of covariates when the linear models are used, and the differences between the individuals are accounted for by cluster effects. This kind of modelling only makes sense if our main interest is o…
▽ More
Panel data analysis is an important topic in statistics and econometrics. Traditionally, in panel data analysis, all individuals are assumed to share the same unknown parameters, e.g. the same coefficients of covariates when the linear models are used, and the differences between the individuals are accounted for by cluster effects. This kind of modelling only makes sense if our main interest is on the global trend, this is because it would not be able to tell us anything about the individual attributes which are sometimes very important. In this paper, we proposed a modelling based on the single index models embedded with homogeneity for panel data analysis, which builds the individual attributes in the model and is parsimonious at the same time. We develop a data driven approach to identify the structure of homogeneity, and estimate the unknown parameters and functions based on the identified structure. Asymptotic properties of the resulting estimators are established. Intensive simulation studies conducted in this paper also show the resulting estimators work very well when sample size is finite. Finally, the proposed modelling is applied to a public financial dataset and a UK climate dataset, the results reveal some interesting findings.
△ Less
Submitted 7 June, 2017; v1 submitted 2 June, 2017;
originally announced June 2017.
-
Additive Partially Linear Models for Massive Heterogeneous Data
Authors:
Binhuan Wang,
Yixin Fang,
Heng Lian,
Hua Liang
Abstract:
We consider an additive partially linear framework for modelling massive heterogeneous data. The major goal is to extract multiple common features simultaneously across all sub-populations while exploring heterogeneity of each sub-population. We propose an aggregation type of estimators for the commonality parameters that possess the asymptotic optimal bounds and the asymptotic distributions as if…
▽ More
We consider an additive partially linear framework for modelling massive heterogeneous data. The major goal is to extract multiple common features simultaneously across all sub-populations while exploring heterogeneity of each sub-population. We propose an aggregation type of estimators for the commonality parameters that possess the asymptotic optimal bounds and the asymptotic distributions as if there were no heterogeneity. This oracle result holds when the number of sub-populations does not grow too fast and the tuning parameters are selected carefully. A plug-in estimator for the heterogeneity parameter is further constructed, and shown to possess the asymptotic distribution as if the commonality information were available. Furthermore, we develop a heterogeneity test for the linear components and a homogeneity test for the non-linear components accordingly. The performance of the proposed methods is evaluated via simulation studies and an application to the Medicare Provider Utilization and Payment data.
△ Less
Submitted 28 December, 2018; v1 submitted 13 January, 2017;
originally announced January 2017.
-
On the hyperplane conjecture for periods of Calabi-Yau hypersurfaces in $\mathbb P^n$
Authors:
Bong H. Lian,
Minxian Zhu
Abstract:
In [HLY1], Hosono, Lian, and Yau posed a conjecture characterizing the set of solutions to certain Gelfand-Kapranov-Zelevinsky hypergeometric equations which are realized as periods of Calabi-Yau hypersurfaces in a Gorenstein Fano toric variety $X$. We prove this conjecture in the case where $X$ is a complex projective space.
In [HLY1], Hosono, Lian, and Yau posed a conjecture characterizing the set of solutions to certain Gelfand-Kapranov-Zelevinsky hypergeometric equations which are realized as periods of Calabi-Yau hypersurfaces in a Gorenstein Fano toric variety $X$. We prove this conjecture in the case where $X$ is a complex projective space.
△ Less
Submitted 23 October, 2016;
originally announced October 2016.
-
Geometric quantum discord and non-Markovianity of structured reservoirs
Authors:
Ming-Liang Hu,
Han-Li Lian
Abstract:
The reservoir memory effects can lead to information backflow and recurrence of the previously lost quantum correlations. We establish connections between the direction of information flow and variation of the geometric quantum discords (GQDs) measured respectively by the trace distance, the Hellinger distance, and the Bures distance for two qubits subjecting to the bosonic structured reservoirs,…
▽ More
The reservoir memory effects can lead to information backflow and recurrence of the previously lost quantum correlations. We establish connections between the direction of information flow and variation of the geometric quantum discords (GQDs) measured respectively by the trace distance, the Hellinger distance, and the Bures distance for two qubits subjecting to the bosonic structured reservoirs, and unveil their dependence on a factor whose derivative signifies the (non-)Markovianity of the dynamics. By considering the reservoirs with Lorentzian and Ohmic-like spectra, we further demonstrated that the non-Markovianity induced by the backflow of information from the reservoirs to the system enhances the GQDs in most of the parameter regions. This highlights the potential of non-Markovianity as a resource for protecting the GQDs.
△ Less
Submitted 23 December, 2015;
originally announced December 2015.
-
Holonomic Systems for Period Mappings
Authors:
Jingyue Chen,
An Huang,
Bong H. Lian
Abstract:
Period mappings were introduced in the sixties [G] to study variation of complex structures of families of algebraic varieties. The theory of tautological systems was introduced recently [LSY,LY] to understand period integrals of algebraic manifolds. In this paper, we give an explicit construction of a tautological system for each component of a period mapping.
Period mappings were introduced in the sixties [G] to study variation of complex structures of families of algebraic varieties. The theory of tautological systems was introduced recently [LSY,LY] to understand period integrals of algebraic manifolds. In this paper, we give an explicit construction of a tautological system for each component of a period mapping.
△ Less
Submitted 3 September, 2017; v1 submitted 16 December, 2015;
originally announced December 2015.
-
Greedy Forward Regression for Variable Screening
Authors:
Ming-Yen Cheng,
Sanying Feng,
Gaorong Li,
Heng Lian
Abstract:
Two popular variable screening methods under the ultra-high dimensional setting with the desirable sure screening property are the sure independence screening (SIS) and the forward regression (FR). Both are classical variable screening methods and recently have attracted greater attention under the new light of high-dimensional data analysis. We consider a new and simple screening method that inco…
▽ More
Two popular variable screening methods under the ultra-high dimensional setting with the desirable sure screening property are the sure independence screening (SIS) and the forward regression (FR). Both are classical variable screening methods and recently have attracted greater attention under the new light of high-dimensional data analysis. We consider a new and simple screening method that incorporates multiple predictors in each step of forward regression, with decision on which variables to incorporate based on the same criterion. If only one step is carried out, it actually reduces to the SIS. Thus it can be regarded as a generalization and unification of the FR and the SIS. More importantly, it preserves the sure screening property and has similar computational complexity as FR in each step, yet it can discover the relevant covariates in fewer steps. Thus, it reduces the computational burden of FR drastically while retaining advantages of the latter over SIS. Furthermore, we show that it can find all the true variables if the number of steps taken is the same as the correct model size, even when using the original FR. An extensive simulation study and application to two real data examples demonstrate excellent performance of the proposed method.
△ Less
Submitted 3 November, 2015;
originally announced November 2015.
-
Chain Integral Solutions to Tautological Systems
Authors:
An Huang,
Bong H. Lian,
Shing-Tung Yau,
Xinwen Zhu
Abstract:
We give a new geometrical interpretation of the local analytic solutions to a differential system, which we call a tautological system $τ$, arising from the universal family of Calabi-Yau hypersurfaces $Y_a$ in a $G$-variety $X$ of dimension $n$. First, we construct a natural topological correspondence between relative cycles in $H_n(X-Y_a,\cup D-Y_a)$ bounded by the union of $G$-invariant divisor…
▽ More
We give a new geometrical interpretation of the local analytic solutions to a differential system, which we call a tautological system $τ$, arising from the universal family of Calabi-Yau hypersurfaces $Y_a$ in a $G$-variety $X$ of dimension $n$. First, we construct a natural topological correspondence between relative cycles in $H_n(X-Y_a,\cup D-Y_a)$ bounded by the union of $G$-invariant divisors $\cup D$ in $X$ to the solution sheaf of $τ$, in the form of chain integrals. Applying this to a toric variety with torus action, we show that in addition to the period integrals over cycles in $Y_a$, the new chain integrals generate the full solution sheaf of a GKZ system. This extends an earlier result for hypersurfaces in a projective homogeneous variety, whereby the chains are cycles. In light of this result, the mixed Hodge structure of the solution sheaf is now seen as the MHS of $H_n(X-Y_a,\cup D-Y_a)$. In addition, we generalize the result on chain integral solutions to the case of general type hypersurfaces. This chain integral correspondence can also be seen as the Riemann-Hilbert correspondence in one homological degree. Finally, we consider interesting cases in which the chain integral correspondence possibly fails to be bijective.
△ Less
Submitted 5 August, 2015; v1 submitted 3 August, 2015;
originally announced August 2015.
-
The matching energy of random graphs
Authors:
Xiaolin Chen,
Xueliang Li,
Huishu Lian
Abstract:
The matching energy of a graph was introduced by Gutman and Wagner, which is defined as the sum of the absolute values of the roots of the matching polynomial of the graph. For the random graph $G_{n,p}$ of order $n$ with fixed probability $p\in (0,1)$, Gutman and Wagner [I. Gutman, S. Wagner, The matching energy of a graph, Discrete Appl. Math. 160(2012), 2177--2187] proposed a conjecture that th…
▽ More
The matching energy of a graph was introduced by Gutman and Wagner, which is defined as the sum of the absolute values of the roots of the matching polynomial of the graph. For the random graph $G_{n,p}$ of order $n$ with fixed probability $p\in (0,1)$, Gutman and Wagner [I. Gutman, S. Wagner, The matching energy of a graph, Discrete Appl. Math. 160(2012), 2177--2187] proposed a conjecture that the matching energy of $G_{n,p}$ converges to $\frac{8\sqrt{p}}{3π}n^{\frac{3}{2}}$ almost surely. In this paper, using analysis method, we prove that the conjecture is true.
△ Less
Submitted 29 December, 2014; v1 submitted 22 December, 2014;
originally announced December 2014.
-
Solution to a conjecture on the maximum skew-spectral radius of odd-cycle graphs
Authors:
Xiaolin Chen,
Xueliang Li,
Huishu Lian
Abstract:
Let $G$ be a simple graph with no even cycle, called an odd-cycle graph. Cavers et al. [Cavers et al. Skew-adjacency matrices of graphs, Linear Algebra Appl. 436(2012), 4512--1829] showed that the spectral radius of $G^σ$ is the same for every orientation $σ$ of $G$, and equals the maximum matching root of $G$. They proposed a conjecture that the graphs which attain the maximum skew spectral radiu…
▽ More
Let $G$ be a simple graph with no even cycle, called an odd-cycle graph. Cavers et al. [Cavers et al. Skew-adjacency matrices of graphs, Linear Algebra Appl. 436(2012), 4512--1829] showed that the spectral radius of $G^σ$ is the same for every orientation $σ$ of $G$, and equals the maximum matching root of $G$. They proposed a conjecture that the graphs which attain the maximum skew spectral radius among the odd-cycle graphs $G$ of order $n$ are isomorphic to the odd-cycle graph with one vertex degree $n-1$ and size $m=\lfloor 3(n-1)/2\rfloor$. This paper, by using the Kelmans transformation, gives a proof of the conjecture. Moreover, sharp upper bounds of the maximum matching roots of the odd-cycle graphs with given order $n$ and size $m$ are given and extremal graphs are characterized.
△ Less
Submitted 18 December, 2014;
originally announced December 2014.
-
The mass-metallicity relation of Lyman-break analogues and its dependence on galaxy properties
Authors:
J. H. Lian,
J. R. Li,
W. Yan,
X. Kong
Abstract:
We investigate the mass-metallicity relation and its dependence on galaxy physical properties with a sample of 703 Lyman-break analogues (LBAs) in local Universe, which have similar properties to high redshift star-forming galaxies. The sample is selected according to $\ha$ luminosity, $L(\ha)>10^{41.8}\,{\rm erg\,s^{-1}}$, and surface brightness, $I(\ha)>10^{40.5}\,{\rm erg\,s^{-1}\,kpc^{-2}}$, c…
▽ More
We investigate the mass-metallicity relation and its dependence on galaxy physical properties with a sample of 703 Lyman-break analogues (LBAs) in local Universe, which have similar properties to high redshift star-forming galaxies. The sample is selected according to $\ha$ luminosity, $L(\ha)>10^{41.8}\,{\rm erg\,s^{-1}}$, and surface brightness, $I(\ha)>10^{40.5}\,{\rm erg\,s^{-1}\,kpc^{-2}}$, criteria. The mass-metallicity relation of LBAs harmoniously agrees with that of star-forming galaxies at $z \sim$ 1.4-1.7 in stellar mass range of $10^{8.5}M_{\odot}<M_{*}<10^{11}M_{\odot}$. The relation between stellar mass, metallicity and star formation rate of our sample is roughly consistent with the local fundamental metallicity relation. We find that the mass-metallicity relation shows a strong correlation with the 4000Å\, break; galaxies with higher 4000Å\, break typically have higher metallicity at a fixed mass, by 0.06 dex in average. This trend is independent of the methodology of metallicity. We also use the metallicity estimated by $T_{\rm e}$-method to confirm it. The scatter in mass-metallicity relation can be reduced from 0.091 to 0.077 dex by a three-dimensional relation between stellar mass, metallicity and 4000Å\, break. The reduction of scatter in mass-metallicity relation suggests that the galaxy stellar age plays an important role as the second parameter in the mass-metallicity relation of LBAs.
△ Less
Submitted 23 November, 2014;
originally announced November 2014.
-
CY Principal Bundles over Compact Kähler Manifolds
Authors:
Jingyue Chen,
Bong H. Lian
Abstract:
A CY bundle on a connected compact complex manifold $X$ was a crucial ingredient in constructing differential systems for period integrals in [LY], by lifting line bundles from the base $X$ to the total space. A question was therefore raised as to whether there exists such a bundle that supports the liftings of all line bundles from $X$, simultaneously. This was a key step for giving a uniform con…
▽ More
A CY bundle on a connected compact complex manifold $X$ was a crucial ingredient in constructing differential systems for period integrals in [LY], by lifting line bundles from the base $X$ to the total space. A question was therefore raised as to whether there exists such a bundle that supports the liftings of all line bundles from $X$, simultaneously. This was a key step for giving a uniform construction of differential systems for arbitrary complete intersections in $X$. In this paper, we answer the existence question in the affirmative if $X$ is assumed to be Kähler, and also in general if the Picard group of $X$ is assumed to be discrete. Furthermore, we prove a rigidity property of CY bundles if the principal group is an algebraic torus, showing that such a CY bundle is essentially determined by its character map.
△ Less
Submitted 11 November, 2016; v1 submitted 10 November, 2014;
originally announced November 2014.
-
Whittaker modules for the derivation Lie algebra of torus with two variables
Authors:
Haifeng Lian,
Xiufu Zhang
Abstract:
Let $\mathcal{L}$ be the derivation Lie algebra of ${\mathbb C}[t_1^{\pm 1},t_2^{\pm 1}]$. Given a triangle decomposition
$\mathcal{L} =\mathcal{L}^{+}\oplus\mathfrak{h}\oplus\mathcal{L}^{-}$, we define a nonsingular Lie algebra homomorphism $ψ:\mathcal{L}^{+}\rightarrow\mathbb{C}$ and the universal Whittaker $\mathcal{L}$-module $W_ψ$ of type $ψ$. We obtain all Whittaker vectors and submodules…
▽ More
Let $\mathcal{L}$ be the derivation Lie algebra of ${\mathbb C}[t_1^{\pm 1},t_2^{\pm 1}]$. Given a triangle decomposition
$\mathcal{L} =\mathcal{L}^{+}\oplus\mathfrak{h}\oplus\mathcal{L}^{-}$, we define a nonsingular Lie algebra homomorphism $ψ:\mathcal{L}^{+}\rightarrow\mathbb{C}$ and the universal Whittaker $\mathcal{L}$-module $W_ψ$ of type $ψ$. We obtain all Whittaker vectors and submodules of $W_ψ$, and all simple Whittaker $\mathcal{L}$-modules of type $ψ$.
△ Less
Submitted 18 September, 2014;
originally announced September 2014.
-
Lower bounds of the skew spectral radii and skew energy of oriented graphs
Authors:
Xiaolin Chen,
Xueliang Li,
Huishu Lian
Abstract:
Let $G$ be a graph with maximum degree $Δ$, and let $G^σ$ be an oriented graph of $G$ with skew adjacency matrix $S(G^σ)$. The skew spectral radius $ρ_s(G^σ)$ of $G^σ$ is defined as the spectral radius of $S(G^σ)$. The skew spectral radius has been studied, but only few results about its lower bound are known. This paper determines some lower bounds of the skew spectral radius, and then studies th…
▽ More
Let $G$ be a graph with maximum degree $Δ$, and let $G^σ$ be an oriented graph of $G$ with skew adjacency matrix $S(G^σ)$. The skew spectral radius $ρ_s(G^σ)$ of $G^σ$ is defined as the spectral radius of $S(G^σ)$. The skew spectral radius has been studied, but only few results about its lower bound are known. This paper determines some lower bounds of the skew spectral radius, and then studies the oriented graphs whose skew spectral radii attain the lower bound $\sqrtΔ$. Moreover, we apply the skew spectral radius to the skew energy of oriented graphs, which is defined as the sum of the norms of all the eigenvalues of $S(G^σ)$, and denoted by $\mathcal{E}_s(G^σ)$. As results, we obtain some lower bounds of the skew energy, which improve the known lower bound obtained by Adiga et al.
△ Less
Submitted 12 June, 2014; v1 submitted 20 May, 2014;
originally announced May 2014.
-
Variable Selection and Estimation for Partially Linear Single-index Models with Longitudinal Data
Authors:
Gaorong Li,
Peng Lai,
Heng Lian
Abstract:
In this paper, we consider the partially linear single-index models with longitudinal data. To deal with the variable selection problem in this context, we propose a penalized procedure combined with two bias correction methods, resulting in the bias-corrected generalized estimating equation (GEE) and the bias-corrected quadratic inference function (QIF), which can take into account the correlatio…
▽ More
In this paper, we consider the partially linear single-index models with longitudinal data. To deal with the variable selection problem in this context, we propose a penalized procedure combined with two bias correction methods, resulting in the bias-corrected generalized estimating equation (GEE) and the bias-corrected quadratic inference function (QIF), which can take into account the correlations. Asymptotic properties of these methods are demonstrated. We also evaluate the finite sample performance of the proposed methods via Monte Carlo simulation studies and a real data analysis.
△ Less
Submitted 7 February, 2014;
originally announced February 2014.
-
Letter to the Editor
Authors:
Yuao Hu,
Ye Tian,
Heng Lian
Abstract:
The paper by Alfons, Croux and Gelper (2013), Sparse least trimmed squares regression for analyzing high-dimensional large data sets, considered a combination of least trimmed squares (LTS) and lasso penalty for robust and sparse high-dimensional regression. In a recent paper [She and Owen (2011)], a method for outlier detection based on a sparsity penalty on the mean shift parameter was proposed…
▽ More
The paper by Alfons, Croux and Gelper (2013), Sparse least trimmed squares regression for analyzing high-dimensional large data sets, considered a combination of least trimmed squares (LTS) and lasso penalty for robust and sparse high-dimensional regression. In a recent paper [She and Owen (2011)], a method for outlier detection based on a sparsity penalty on the mean shift parameter was proposed (designated by "SO" in the following). This work is mentioned in Alfons et al. as being an "entirely different approach." Certainly the problem studied by Alfons et al. is novel and interesting.
△ Less
Submitted 9 December, 2013;
originally announced December 2013.
-
Reduced-rank Regression in Sparse Multivariate Varying-Coefficient Models with High-dimensional Covariates
Authors:
Heng Lian,
Shujie Ma
Abstract:
In genetic studies, not only can the number of predictors obtained from microarray measurements be extremely large, there can also be multiple response variables. Motivated by such a situation, we consider semiparametric dimension reduction methods in sparse multivariate regression models. Previous studies on joint variable and rank selection have focused on parametric models while here we conside…
▽ More
In genetic studies, not only can the number of predictors obtained from microarray measurements be extremely large, there can also be multiple response variables. Motivated by such a situation, we consider semiparametric dimension reduction methods in sparse multivariate regression models. Previous studies on joint variable and rank selection have focused on parametric models while here we consider the more challenging varying-coefficient models which make the investigation on nonlinear interactions of variables possible. Spline approximation, rank constraints and concave group penalties are utilized for model estimation. Asymptotic oracle properties of the estimators are presented. We also propose reduced-rank independent screening to deal with the situation when the dimension is so high that penalized estimation cannot be efficiently applied. In simulations, we show the advantages of simultaneously performing variable and rank selection. A real data set is analyzed to illustrate the good prediction performance when incorporating interactions between genetic variables and an index variable.
△ Less
Submitted 24 September, 2013;
originally announced September 2013.
-
Bayesian Quantile Regression for Partially Linear Additive Models
Authors:
Yuao Hu,
Kaifeng Zhao,
Heng Lian
Abstract:
In this article, we develop a semiparametric Bayesian estimation and model selection approach for partially linear additive models in conditional quantile regression. The asymmetric Laplace distribution provides a mechanism for Bayesian inferences of quantile regression models based on the check loss. The advantage of this new method is that nonlinear, linear and zero function components can be se…
▽ More
In this article, we develop a semiparametric Bayesian estimation and model selection approach for partially linear additive models in conditional quantile regression. The asymmetric Laplace distribution provides a mechanism for Bayesian inferences of quantile regression models based on the check loss. The advantage of this new method is that nonlinear, linear and zero function components can be separated automatically and simultaneously during model fitting without the need of pre-specification or parameter tuning. This is achieved by spike-and-slab priors using two sets of indicator variables. For posterior inferences, we design an effective partially collapsed Gibbs sampler. Simulation studies are used to illustrate our algorithm. The proposed approach is further illustrated by applications to two real data sets.
△ Less
Submitted 10 July, 2013;
originally announced July 2013.