-
LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs
Authors:
LLM-jp,
:,
Akiko Aizawa,
Eiji Aramaki,
Bowen Chen,
Fei Cheng,
Hiroyuki Deguchi,
Rintaro Enomoto,
Kazuki Fujii,
Kensuke Fukumoto,
Takuya Fukushima,
Namgi Han,
Yuto Harada,
Chikara Hashimoto,
Tatsuya Hiraoka,
Shohei Hisada,
Sosuke Hosokawa,
Lu Jie,
Keisuke Kamata,
Teruhito Kanazawa,
Hiroki Kanezashi,
Hiroshi Kataoka,
Satoru Katsumata,
Daisuke Kawahara,
Seiya Kawano
, et al. (57 additional authors not shown)
Abstract:
This paper introduces LLM-jp, a cross-organizational project for the research and development of Japanese large language models (LLMs). LLM-jp aims to develop open-source and strong Japanese LLMs, and as of this writing, more than 1,500 participants from academia and industry are working together for this purpose. This paper presents the background of the establishment of LLM-jp, summaries of its…
▽ More
This paper introduces LLM-jp, a cross-organizational project for the research and development of Japanese large language models (LLMs). LLM-jp aims to develop open-source and strong Japanese LLMs, and as of this writing, more than 1,500 participants from academia and industry are working together for this purpose. This paper presents the background of the establishment of LLM-jp, summaries of its activities, and technical reports on the LLMs developed by LLM-jp. For the latest activities, visit https://llm-jp.nii.ac.jp/en/.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
Rapport-Driven Virtual Agent: Rapport Building Dialogue Strategy for Improving User Experience at First Meeting
Authors:
Muhammad Yeza Baihaqi,
Angel GarcĂa Contreras,
Seiya Kawano,
Koichiro Yoshino
Abstract:
Rapport is known as a conversational aspect focusing on relationship building, which influences outcomes in collaborative tasks. This study aims to establish human-agent rapport through small talk by using a rapport-building strategy. We implemented this strategy for the virtual agents based on dialogue strategies by prompting a large language model (LLM). In particular, we utilized two dialogue s…
▽ More
Rapport is known as a conversational aspect focusing on relationship building, which influences outcomes in collaborative tasks. This study aims to establish human-agent rapport through small talk by using a rapport-building strategy. We implemented this strategy for the virtual agents based on dialogue strategies by prompting a large language model (LLM). In particular, we utilized two dialogue strategies-predefined sequence and free-form-to guide the dialogue generation framework. We conducted analyses based on human evaluations, examining correlations between total turn, utterance characters, rapport score, and user experience variables: naturalness, satisfaction, interest, engagement, and usability. We investigated correlations between rapport score and naturalness, satisfaction, engagement, and conversation flow. Our experimental results also indicated that using free-form to prompt the rapport-building strategy performed the best in subjective scores.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Spatially Selected and Dependent Random Effects for Small Area Estimation with Application to Rent Burden
Authors:
Sho Kawano,
Paul A. Parker,
Zehang Richard Li
Abstract:
Area-level models for small area estimation typically rely on areal random effects to shrink design-based direct estimates towards a model-based predictor. Incorporating the spatial dependence of the random effects into these models can further improve the estimates when there are not enough covariates to fully account for spatial dependence of the areal means. A number of recent works have invest…
▽ More
Area-level models for small area estimation typically rely on areal random effects to shrink design-based direct estimates towards a model-based predictor. Incorporating the spatial dependence of the random effects into these models can further improve the estimates when there are not enough covariates to fully account for spatial dependence of the areal means. A number of recent works have investigated models that include random effects for only a subset of areas, in order to improve the precision of estimates. However, such models do not readily handle spatial dependence. In this paper, we introduce a model that accounts for spatial dependence in both the random effects as well as the latent process that selects the effects. We show how this model can significantly improve predictive accuracy via an empirical simulation study based on data from the American Community Survey, and illustrate its properties via an application to estimate county-level median rent burden.
△ Less
Submitted 18 April, 2024;
originally announced April 2024.
-
Multi-task learning via robust regularized clustering with non-convex group penalties
Authors:
Akira Okazaki,
Shuichi Kawano
Abstract:
Multi-task learning (MTL) aims to improve estimation and prediction performance by sharing common information among related tasks. One natural assumption in MTL is that tasks are classified into clusters based on their characteristics. However, existing MTL methods based on this assumption often ignore outlier tasks that have large task-specific components or no relation to other tasks. To address…
▽ More
Multi-task learning (MTL) aims to improve estimation and prediction performance by sharing common information among related tasks. One natural assumption in MTL is that tasks are classified into clusters based on their characteristics. However, existing MTL methods based on this assumption often ignore outlier tasks that have large task-specific components or no relation to other tasks. To address this issue, we propose a novel MTL method called Multi-Task Learning via Robust Regularized Clustering (MTLRRC). MTLRRC incorporates robust regularization terms inspired by robust convex clustering, which is further extended to handle non-convex and group-sparse penalties. The extension allows MTLRRC to simultaneously perform robust task clustering and outlier task detection. The connection between the extended robust clustering and the multivariate M-estimator is also established. This provides an interpretation of the robustness of MTLRRC against outlier tasks. An efficient algorithm based on a modified alternating direction method of multipliers is developed for the estimation of the parameters. The effectiveness of MTLRRC is demonstrated through simulation studies and application to real data.
△ Less
Submitted 27 May, 2024; v1 submitted 4 April, 2024;
originally announced April 2024.
-
J-CRe3: A Japanese Conversation Dataset for Real-world Reference Resolution
Authors:
Nobuhiro Ueda,
Hideko Habe,
Yoko Matsui,
Akishige Yuguchi,
Seiya Kawano,
Yasutomo Kawanishi,
Sadao Kurohashi,
Koichiro Yoshino
Abstract:
Understanding expressions that refer to the physical world is crucial for such human-assisting systems in the real world, as robots that must perform actions that are expected by users. In real-world reference resolution, a system must ground the verbal information that appears in user interactions to the visual information observed in egocentric views. To this end, we propose a multimodal referen…
▽ More
Understanding expressions that refer to the physical world is crucial for such human-assisting systems in the real world, as robots that must perform actions that are expected by users. In real-world reference resolution, a system must ground the verbal information that appears in user interactions to the visual information observed in egocentric views. To this end, we propose a multimodal reference resolution task and construct a Japanese Conversation dataset for Real-world Reference Resolution (J-CRe3). Our dataset contains egocentric video and dialogue audio of real-world conversations between two people acting as a master and an assistant robot at home. The dataset is annotated with crossmodal tags between phrases in the utterances and the object bounding boxes in the video frames. These tags include indirect reference relations, such as predicate-argument structures and bridging references as well as direct reference relations. We also constructed an experimental model and clarified the challenges in multimodal reference resolution tasks.
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
A Gaze-grounded Visual Question Answering Dataset for Clarifying Ambiguous Japanese Questions
Authors:
Shun Inadumi,
Seiya Kawano,
Akishige Yuguchi,
Yasutomo Kawanishi,
Koichiro Yoshino
Abstract:
Situated conversations, which refer to visual information as visual question answering (VQA), often contain ambiguities caused by reliance on directive information. This problem is exacerbated because some languages, such as Japanese, often omit subjective or objective terms. Such ambiguities in questions are often clarified by the contexts in conversational situations, such as joint attention wit…
▽ More
Situated conversations, which refer to visual information as visual question answering (VQA), often contain ambiguities caused by reliance on directive information. This problem is exacerbated because some languages, such as Japanese, often omit subjective or objective terms. Such ambiguities in questions are often clarified by the contexts in conversational situations, such as joint attention with a user or user gaze information. In this study, we propose the Gaze-grounded VQA dataset (GazeVQA) that clarifies ambiguous questions using gaze information by focusing on a clarification process complemented by gaze information. We also propose a method that utilizes gaze target estimation results to improve the accuracy of GazeVQA tasks. Our experimental results showed that the proposed method improved the performance in some cases of a VQA system on GazeVQA and identified some typical problems of GazeVQA tasks that need to be improved.
△ Less
Submitted 26 March, 2024;
originally announced March 2024.
-
ROSE: Rotation-based Squeezing Robotic Gripper toward Universal Handling of Objects
Authors:
Son Tien Bui,
Shinya Kawano,
Van Anh Ho
Abstract:
Robotics hand/grippers nowadays are not limited to manufacturing lines; instead, they are widely utilized in cluttered environments, such as restaurants, farms, and warehouses. In such scenarios, they need to deal with high uncertainty of the grasped objects' shapes, postures, surfaces, and material properties, which requires complex integration of sensing and decision-making process. On the other…
▽ More
Robotics hand/grippers nowadays are not limited to manufacturing lines; instead, they are widely utilized in cluttered environments, such as restaurants, farms, and warehouses. In such scenarios, they need to deal with high uncertainty of the grasped objects' shapes, postures, surfaces, and material properties, which requires complex integration of sensing and decision-making process. On the other hand, integrating soft materials into the gripper's design may tolerate the above uncertainties and reduce complexity in control. In this paper, we introduce ROSE, a novel soft gripper that can embrace the object and squeeze it by buckling a funnel-liked thin-walled soft membrane around the object by simple rotation of the base. Thanks to this design, ROSE hand can adapt to a wide range of objects that can fit in the funnel and handle with gentle gripping force. Regardless of this, ROSE can generate a high lift force (up to 33kgf) while significantly reducing the normal pressure on the gripped objects. In our experiment, a 198g ROSE can be integrated into a robot arm with a single actuation and successfully lift various types of objects, even after 400,000 trials. The embracing mechanism helps reduce the dependence of friction between the object and the membrane, as ROSE could pick up a chicken egg submerged inside an olive oil tank. We also report a feasible design for equipping the ROSE hand with tactile sensing while appealing to the scalability of the design to fit a wide range of objects. Video: https://youtu.be/E1wAI09LaoY
△ Less
Submitted 10 February, 2024;
originally announced February 2024.
-
Bayesian Fused Lasso Modeling for Binary Data
Authors:
Yuko Kakikawa,
Shuichi Kawano
Abstract:
L1-norm regularized logistic regression models are widely used for analyzing data with binary response. In those analyses, fusing regression coefficients is useful for detecting groups of variables. This paper proposes a binomial logistic regression model with Bayesian fused lasso. Assuming a Laplace prior on regression coefficients and differences between adjacent regression coefficients enables…
▽ More
L1-norm regularized logistic regression models are widely used for analyzing data with binary response. In those analyses, fusing regression coefficients is useful for detecting groups of variables. This paper proposes a binomial logistic regression model with Bayesian fused lasso. Assuming a Laplace prior on regression coefficients and differences between adjacent regression coefficients enables us to perform variable selection and variable fusion simultaneously in the Bayesian framework. We also propose assuming a horseshoe prior on the differences to improve the flexibility of variable fusion. The Gibbs sampler is derived to estimate the parameters by a hierarchical expression of priors and a data-augmentation method. Using simulation studies and real data analysis, we compare the proposed methods with the existing method.
△ Less
Submitted 14 December, 2023;
originally announced December 2023.
-
Mid-infrared optical coherence tomography with MHz axial line rate for real-time non-destructive testing
Authors:
Satoko Yagi,
Takuma Nakamura,
Kazuki Hashimoto,
Shotaro Kawano,
Takuro Ideguchi
Abstract:
Non-destructive testing (NDT) is crucial for ensuring product quality and safety across various industries. Conventional methods such as ultrasonic, terahertz, and X-ray imaging have limitations in terms of probe-contact requirement, depth resolution, or radiation risks. Optical coherence tomography (OCT) is a promising alternative to solve these limitations, but it suffers from strong scattering,…
▽ More
Non-destructive testing (NDT) is crucial for ensuring product quality and safety across various industries. Conventional methods such as ultrasonic, terahertz, and X-ray imaging have limitations in terms of probe-contact requirement, depth resolution, or radiation risks. Optical coherence tomography (OCT) is a promising alternative to solve these limitations, but it suffers from strong scattering, limiting its penetration depth. Recently, OCT in the mid-infrared (MIR) spectral region has attracted attention with a significantly lower scattering rate than in the near-infrared region. However, the highest reported A-scan rate of MIR-OCT has been 3 kHz, which requires long data acquisition time to take an image, unsatisfying industrial demands for real-time diagnosis. Here, we present a high-speed MIR-OCT system operating in the 3-4 um region that employs the swept-source OCT technique based on time-stretch infrared spectroscopy. By integrating a broadband femtosecond MIR pulsed laser operating at a repetition rate of 50 MHz, we achieved an A-scan rate of 1 MHz with an axial resolution of 11.6 um and a sensitivity of 55 dB. As a proof-of-concept demonstration, we imaged the surface of substrates covered by highly scattering paint coatings. The demonstrated A-scan rate surpasses previous state-of-the-art by more than two orders of magnitude, paving the way for real-time NDT of industrial products, cultural assets, and structures.
△ Less
Submitted 4 December, 2023;
originally announced December 2023.
-
Anisotropic Optical Conductivity Accompanied by a Small Energy Gap in One-Dimensional Thermoelectric Telluride Ta4SiTe4
Authors:
Fumiya Matsunaga,
Yoshihiko Okamoto,
Yasunori Yokoyama,
Kanji Takehana,
Yasutaka Imanaka,
Yuto Nakamura,
Hideo Kishida,
Shoya Kawano,
Kazuyuki Matsuhira,
Koshi Takenaka
Abstract:
We investigated the optical properties of single crystals of one-dimensional telluride Ta4SiTe4, which shows high thermoelectric performance below room temperature. Optical conductivity estimated from reflectivity spectra indicates the presence of a small energy gap of 0.1-0.15 eV at the Fermi energy. At the lowest energy, optical conductivity along the Ta4SiTe4 chain is an order of magnitude high…
▽ More
We investigated the optical properties of single crystals of one-dimensional telluride Ta4SiTe4, which shows high thermoelectric performance below room temperature. Optical conductivity estimated from reflectivity spectra indicates the presence of a small energy gap of 0.1-0.15 eV at the Fermi energy. At the lowest energy, optical conductivity along the Ta4SiTe4 chain is an order of magnitude higher than that perpendicular to this direction, reflecting the anisotropic electron conduction in Ta4SiTe4. These results indicate that coexistence of a very small band gap and anisotropic electron conduction is a promising strategy to develop a high-performance thermoelectric material for low temperature applications.
△ Less
Submitted 21 November, 2023;
originally announced November 2023.
-
Simultaneous Modeling of Disease Screening and Severity Prediction: A Multi-task and Sparse Regularization Approach
Authors:
Kazuharu Harada,
Shuichi Kawano,
Masataka Taguri
Abstract:
The exploration of biomarkers, which are clinically useful biomolecules, and the development of prediction models using them are important problems in biomedical research. Biomarkers are widely used for disease screening, and some are related not only to the presence or absence of a disease but also to its severity. These biomarkers can be useful for prioritization of treatment and clinical decisi…
▽ More
The exploration of biomarkers, which are clinically useful biomolecules, and the development of prediction models using them are important problems in biomedical research. Biomarkers are widely used for disease screening, and some are related not only to the presence or absence of a disease but also to its severity. These biomarkers can be useful for prioritization of treatment and clinical decision-making. Considering a model helpful for both disease screening and severity prediction, this paper focuses on regression modeling for an ordinal response equipped with a hierarchical structure.
If the response variable is a combination of the presence of disease and severity such as \{{\it healthy, mild, intermediate, severe}\}, for example, the simplest method would be to apply the conventional ordinal regression model. However, the conventional model has flexibility issues and may not be suitable for the problems addressed in this paper, where the levels of the response variable might be heterogeneous. Therefore, this paper proposes a model assuming screening and severity prediction as different tasks, and an estimation method based on structural sparse regularization that leverages any common structure between the tasks when such commonality exists. In numerical experiments, the proposed method demonstrated stable performance across many scenarios compared to existing ordinal regression methods.
△ Less
Submitted 25 June, 2024; v1 submitted 9 September, 2023;
originally announced September 2023.
-
Multi-Task Learning Regression via Convex Clustering
Authors:
Akira Okazaki,
Shuichi Kawano
Abstract:
Multi-task learning (MTL) is a methodology that aims to improve the general performance of estimation and prediction by sharing common information among related tasks. In the MTL, there are several assumptions for the relationships and methods to incorporate them. One of the natural assumptions in the practical situation is that tasks are classified into some clusters with their characteristics. F…
▽ More
Multi-task learning (MTL) is a methodology that aims to improve the general performance of estimation and prediction by sharing common information among related tasks. In the MTL, there are several assumptions for the relationships and methods to incorporate them. One of the natural assumptions in the practical situation is that tasks are classified into some clusters with their characteristics. For this assumption, the group fused regularization approach performs clustering of the tasks by shrinking the difference among tasks. This enables us to transfer common information within the same cluster. However, this approach also transfers the information between different clusters, which worsens the estimation and prediction. To overcome this problem, we propose an MTL method with a centroid parameter representing a cluster center of the task. Because this model separates parameters into the parameters for regression and the parameters for clustering, we can improve estimation and prediction accuracy for regression coefficient vectors. We show the effectiveness of the proposed method through Monte Carlo simulations and applications to real data.
△ Less
Submitted 26 April, 2023;
originally announced April 2023.
-
Ab initio calculation for electronic structure and optical property of tungsten carbide in a TiCN-based cermet for solar thermal applications
Authors:
Shota Hayakawa,
Toshiharu Chono,
Kosuke Watanabe,
Shoya Kawano,
Kazuma Nakamura,
Koji Miyazaki
Abstract:
We present an ab initio calculation to understand electronic structures and optical properties of a tungsten carbide WC being a major component of a TiCN-based cermet. We found that the WC has a fairly low-energy plasma excitation $\sim$0.6 eV (2 $ÎĽ$m) and therefore can be a good constituent of a solar selective absorber. The evaluated figure of merit for photothermal conversion is prominently hig…
▽ More
We present an ab initio calculation to understand electronic structures and optical properties of a tungsten carbide WC being a major component of a TiCN-based cermet. We found that the WC has a fairly low-energy plasma excitation $\sim$0.6 eV (2 $ÎĽ$m) and therefore can be a good constituent of a solar selective absorber. The evaluated figure of merit for photothermal conversion is prominently high compared to those of the other materials included in the TiCN-based cermet. The imaginary part of the dielectric function is considerably small around the zero point of the real part of the dielectric function, corresponding to the plasma excitation energy. Therefore, a clear plasma edge appeared, ensuring the high performance of the WC as the solar absorber.
△ Less
Submitted 21 April, 2023;
originally announced April 2023.
-
Multivariate regression modeling in integrative analysis via sparse regularization
Authors:
Shuichi Kawano,
Toshikazu Fukushima,
Junichi Nakagawa,
Mamoru Oshiki
Abstract:
The multivariate regression model basically offers the analysis of a single dataset with multiple responses. However, such a single-dataset analysis often leads to unsatisfactory results. Integrative analysis is an effective method to pool useful information from multiple independent datasets and provides better performance than single-dataset analysis. In this study, we propose a multivariate reg…
▽ More
The multivariate regression model basically offers the analysis of a single dataset with multiple responses. However, such a single-dataset analysis often leads to unsatisfactory results. Integrative analysis is an effective method to pool useful information from multiple independent datasets and provides better performance than single-dataset analysis. In this study, we propose a multivariate regression modeling in integrative analysis. The integration is achieved by sparse estimation that performs variable and group selection. Based on the idea of alternating direction method of multipliers, we develop its computational algorithm that enjoys the convergence property. The performance of the proposed method is demonstrated through Monte Carlo simulation and analyzing wastewater treatment data with microbe measurements.
△ Less
Submitted 14 April, 2023;
originally announced April 2023.
-
Determination of Majorana type-phases from the time evolution of lepton numbers
Authors:
Nicholas J. Benoit,
Yuta Kawamura,
Saki Kawano,
Takuya Morozumi,
Yusuke Shimizu,
Kei Yamamoto
Abstract:
We have investigated an approach to determine the Majorana type-phases using the time evolution of lepton family numbers. The Majorana type-phases are related to the orientation of unitarity triangles for the Pontecorvo-Maki-Nakagawa-Sakata (PMNS) matrix, and the Majorana phases $α_{21}$ and $α_{31}$. After taking the second-order time derivative of the lepton family number expectation values, the…
▽ More
We have investigated an approach to determine the Majorana type-phases using the time evolution of lepton family numbers. The Majorana type-phases are related to the orientation of unitarity triangles for the Pontecorvo-Maki-Nakagawa-Sakata (PMNS) matrix, and the Majorana phases $α_{21}$ and $α_{31}$. After taking the second-order time derivative of the lepton family number expectation values, the dependencies on the summation of Majorana type-phases can be determined. Thus allowing for the extraction of the orientation of the unitarity triangles and the Majorana phases. We study how to extract the Majorana type-phases and the lightest neutrino mass for three massive neutrinos, and when a neutrino is massless, i.e., $m_{1,3}=0$. Our result can be complimentary to using neutrinoless double-beta decay for determining the orientation of PMNS unitarity triangles and the Majorana phases.
△ Less
Submitted 7 December, 2022; v1 submitted 30 November, 2022;
originally announced December 2022.
-
What Should the System Do Next?: Operative Action Captioning for Estimating System Actions
Authors:
Taiki Nakamura,
Seiya Kawano,
Akishige Yuguchi,
Yasutomo Kawanishi,
Koichiro Yoshino
Abstract:
Such human-assisting systems as robots need to correctly understand the surrounding situation based on observations and output the required support actions for humans. Language is one of the important channels to communicate with humans, and the robots are required to have the ability to express their understanding and action planning results. In this study, we propose a new task of operative acti…
▽ More
Such human-assisting systems as robots need to correctly understand the surrounding situation based on observations and output the required support actions for humans. Language is one of the important channels to communicate with humans, and the robots are required to have the ability to express their understanding and action planning results. In this study, we propose a new task of operative action captioning that estimates and verbalizes the actions to be taken by the system in a human-assisting domain. We constructed a system that outputs a verbal description of a possible operative action that changes the current state to the given target state. We collected a dataset consisting of two images as observations, which express the current state and the state changed by actions, and a caption that describes the actions that change the current state to the target state, by crowdsourcing in daily life situations. Then we constructed a system that estimates operative action by a caption. Since the operative action's caption is expected to contain some state-changing actions, we use scene-graph prediction as an auxiliary task because the events written in the scene graphs correspond to the state changes. Experimental results showed that our system successfully described the operative actions that should be conducted between the current and target states. The auxiliary tasks that predict the scene graphs improved the quality of the estimation results.
△ Less
Submitted 6 October, 2022;
originally announced October 2022.
-
Bayesian Fused Lasso Modeling via Horseshoe Prior
Authors:
Yuko Kakikawa,
Kaito Shimamura,
Shuichi Kawano
Abstract:
Bayesian fused lasso is one of the sparse Bayesian methods, which shrinks both regression coefficients and their successive differences simultaneously. In this paper, we propose a Bayesian fused lasso modeling via horseshoe prior. By assuming a horseshoe prior on the difference of successive regression coefficients, the proposed method enables us to prevent over-shrinkage of those differences. We…
▽ More
Bayesian fused lasso is one of the sparse Bayesian methods, which shrinks both regression coefficients and their successive differences simultaneously. In this paper, we propose a Bayesian fused lasso modeling via horseshoe prior. By assuming a horseshoe prior on the difference of successive regression coefficients, the proposed method enables us to prevent over-shrinkage of those differences. We also propose a Bayesian hexagonal operator for regression with shrinkage and equality selection (HORSES) with horseshoe prior, which imposes priors on all combinations of differences of regression coefficients. Simulation studies and an application to real data show that the proposed method gives better performance than existing methods.
△ Less
Submitted 20 January, 2022;
originally announced January 2022.
-
Multi-task Learning for Compositional Data via Sparse Network Lasso
Authors:
Akira Okazaki,
Shuichi Kawano
Abstract:
A network lasso enables us to construct a model for each sample, which is known as multi-task learning. Existing methods for multi-task learning cannot be applied to compositional data due to their intrinsic properties. In this paper, we propose a multi-task learning method for compositional data using a sparse network lasso. We focus on a symmetric form of the log-contrast model, which is a regre…
▽ More
A network lasso enables us to construct a model for each sample, which is known as multi-task learning. Existing methods for multi-task learning cannot be applied to compositional data due to their intrinsic properties. In this paper, we propose a multi-task learning method for compositional data using a sparse network lasso. We focus on a symmetric form of the log-contrast model, which is a regression model with compositional covariates. The effectiveness of the proposed method is shown through simulation studies and application to gut microbiome data.
△ Less
Submitted 17 November, 2021; v1 submitted 12 November, 2021;
originally announced November 2021.
-
A Bayesian approach to multi-task learning with network lasso
Authors:
Kaito Shimamura,
Shuichi Kawano
Abstract:
Network lasso is a method for solving a multi-task learning problem through the regularized maximum likelihood method. A characteristic of network lasso is setting a different model for each sample. The relationships among the models are represented by relational coefficients. A crucial issue in network lasso is to provide appropriate values for these relational coefficients. In this paper, we pro…
▽ More
Network lasso is a method for solving a multi-task learning problem through the regularized maximum likelihood method. A characteristic of network lasso is setting a different model for each sample. The relationships among the models are represented by relational coefficients. A crucial issue in network lasso is to provide appropriate values for these relational coefficients. In this paper, we propose a Bayesian approach to solve multi-task learning problems by network lasso. This approach allows us to objectively determine the relational coefficients by Bayesian estimation. The effectiveness of the proposed method is shown in a simulation study and a real data analysis.
△ Less
Submitted 18 October, 2021;
originally announced October 2021.
-
Proteomics Standards Initiatives ProForma 2.0 Unifying the encoding of Proteoforms and Peptidoforms
Authors:
Richard D. LeDuc,
Eric W. Deutsch,
Pierre-Alain Binz,
Ryan T. Fellers,
Anthony J. Cesnik,
Joshua A. Klein,
Tim Van Den Bossche,
Ralf Gabriels,
Arshika Yalavarthi,
Yasset Perez-Riverol,
Jeremy Carver,
Wout Bittremieux,
Shin Kawano,
Benjamin Pullman,
Nuno Bandeira,
Neil L. Kelleher,
Paul M. Thomas,
Juan Antonio VizcaĂno
Abstract:
There is the need to represent in a standard manner all the possible variations of a protein or peptide primary sequence, including both artefactual and post-translational modifications of peptides and proteins. With that overall aim, here, the Human Proteome Organization (HUPO) Proteomics Standards Initiative (PSI) has developed a notation, called ProForma 2.0, which is a substantial extension of…
▽ More
There is the need to represent in a standard manner all the possible variations of a protein or peptide primary sequence, including both artefactual and post-translational modifications of peptides and proteins. With that overall aim, here, the Human Proteome Organization (HUPO) Proteomics Standards Initiative (PSI) has developed a notation, called ProForma 2.0, which is a substantial extension of the original ProForma notation, developed by the Consortium for Top-Down Proteomics (CTDP). ProForma 2.0 aims to unify the representation of proteoforms and peptidoforms. Therefore, this notation supports use cases needed for bottom-up and middle/topdown proteomics approaches and allows the encoding of highly modified proteins and peptides using a human and machine-readable string. ProForma 2.0 covers encoding protein modification names and accessions, cross-linking reagents including disulfides, glycans, modifications encoded using mass shifts and/or via chemical formulas, labile and C or N-terminal modifications, ambiguity in the modification position and representation of atomic isotopes, among other use cases. Notational conventions are based on public controlled vocabularies and ontologies. Detailed information about the notation and existing implementations are available at http://www.psidev.info/proforma and at the corresponding GitHub repository (https://github.com/HUPO-PSI/proforma).
△ Less
Submitted 21 March, 2022; v1 submitted 23 September, 2021;
originally announced September 2021.
-
Smoothly varying ridge regularization
Authors:
Daeju Kim,
Shuichi Kawano,
Yoshiyuki Ninomiya
Abstract:
A basis expansion with regularization methods is much appealing to the flexible or robust nonlinear regression models for data with complex structures. When the underlying function has inhomogeneous smoothness, it is well known that conventional reguralization methods do not perform well. In this case, an adaptive procedure such as a free-knot spline or a local likelihood method is often introduce…
▽ More
A basis expansion with regularization methods is much appealing to the flexible or robust nonlinear regression models for data with complex structures. When the underlying function has inhomogeneous smoothness, it is well known that conventional reguralization methods do not perform well. In this case, an adaptive procedure such as a free-knot spline or a local likelihood method is often introduced as an effective method. However, both methods need intensive computational loads. In this study, we consider a new efficient basis expansion by proposing a smoothly varying regularization method which is constructed by some special penalties. We call them adaptive-type penalties. In our modeling, adaptive-type penalties play key rolls and it has been successful in giving good estimation for inhomogeneous smoothness functions. A crucial issue in the modeling process is the choice of a suitable model among candidates. To select the suitable model, we derive an approximated generalized information criterion (GIC). The proposed method is investigated through Monte Carlo simulations and real data analysis. Numerical results suggest that our method performs well in various situations.
△ Less
Submitted 29 January, 2021;
originally announced February 2021.
-
Classification of atomic environments via the Gromov-Wasserstein distance
Authors:
Sakura Kawano,
Jeremy K. Mason
Abstract:
Interpreting molecular dynamics simulations usually involves automated classification of local atomic environments to identify regions of interest. Existing approaches are generally limited to a small number of reference structures and only include limited information about the local chemical composition. This work proposes to use a variant of the Gromov-Wasserstein (GW) distance to quantify the d…
▽ More
Interpreting molecular dynamics simulations usually involves automated classification of local atomic environments to identify regions of interest. Existing approaches are generally limited to a small number of reference structures and only include limited information about the local chemical composition. This work proposes to use a variant of the Gromov-Wasserstein (GW) distance to quantify the difference between a local atomic environment and a set of arbitrary reference environments in a way that is sensitive to atomic displacements, missing atoms, and differences in chemical composition. This involves describing a local atomic environment as a finite metric measure space, which has the additional advantages of not requiring the local environment to be centered on an atom and of not making any assumptions about the material class. Numerical examples illustrate the efficacy and versatility of the algorithm.
△ Less
Submitted 2 November, 2020;
originally announced November 2020.
-
Multilinear Common Component Analysis via Kronecker Product Representation
Authors:
Kohei Yoshikawa,
Shuichi Kawano
Abstract:
We consider the problem of extracting a common structure from multiple tensor datasets. For this purpose, we propose multilinear common component analysis (MCCA) based on Kronecker products of mode-wise covariance matrices. MCCA constructs a common basis represented by linear combinations of the original variables which loses as little information of the multiple tensor datasets. We also develop a…
▽ More
We consider the problem of extracting a common structure from multiple tensor datasets. For this purpose, we propose multilinear common component analysis (MCCA) based on Kronecker products of mode-wise covariance matrices. MCCA constructs a common basis represented by linear combinations of the original variables which loses as little information of the multiple tensor datasets. We also develop an estimation algorithm for MCCA that guarantees mode-wise global convergence. Numerical studies are conducted to show the effectiveness of MCCA.
△ Less
Submitted 20 November, 2020; v1 submitted 6 September, 2020;
originally announced September 2020.
-
Relevance Vector Machine with Weakly Informative Hyperprior and Extended Predictive Information Criterion
Authors:
Kazuaki. Murayama,
Shuichi. Kawano
Abstract:
In the variational relevance vector machine, the gamma distribution is representative as a hyperprior over the noise precision of automatic relevance determination prior. Instead of the gamma hyperprior, we propose to use the inverse gamma hyperprior with a shape parameter close to zero and a scale parameter not necessary close to zero. This hyperprior is associated with the concept of a weakly in…
▽ More
In the variational relevance vector machine, the gamma distribution is representative as a hyperprior over the noise precision of automatic relevance determination prior. Instead of the gamma hyperprior, we propose to use the inverse gamma hyperprior with a shape parameter close to zero and a scale parameter not necessary close to zero. This hyperprior is associated with the concept of a weakly informative prior. The effect of this hyperprior is investigated through regression to non-homogeneous data. Because it is difficult to capture the structure of such data with a single kernel function, we apply the multiple kernel method, in which multiple kernel functions with different widths are arranged for input data. We confirm that the degrees of freedom in a model is controlled by adjusting the scale parameter and keeping the shape parameter close to zero. A candidate for selecting the scale parameter is the predictive information criterion. However the estimated model using this criterion seems to cause over-fitting. This is because the multiple kernel method makes the model a situation where the dimension of the model is larger than the data size. To select an appropriate scale parameter even in such a situation, we also propose an extended prediction information criterion. It is confirmed that a multiple kernel relevance vector regression model with good predictive accuracy can be obtained by selecting the scale parameter minimizing extended prediction information criterion.
△ Less
Submitted 7 May, 2020;
originally announced May 2020.
-
Variable fusion for Bayesian linear regression via spike-and-slab priors
Authors:
Shengyi Wu,
Kaito Shimamura,
Kohei Yoshikawa,
Kazuaki Murayama,
Shuichi Kawano
Abstract:
In linear regression models, fusion of coefficients is used to identify predictors having similar relationships with a response. This is called variable fusion. This paper presents a novel variable fusion method in terms of Bayesian linear regression models. We focus on hierarchical Bayesian models based on a spike-and-slab prior approach. A spike-and-slab prior is tailored to perform variable fus…
▽ More
In linear regression models, fusion of coefficients is used to identify predictors having similar relationships with a response. This is called variable fusion. This paper presents a novel variable fusion method in terms of Bayesian linear regression models. We focus on hierarchical Bayesian models based on a spike-and-slab prior approach. A spike-and-slab prior is tailored to perform variable fusion. To obtain estimates of the parameters, we develop a Gibbs sampler for the parameters. Simulation studies and a real data analysis show that our proposed method achieves better performance than previous methods.
△ Less
Submitted 2 December, 2020; v1 submitted 30 March, 2020;
originally announced March 2020.
-
Sparse principal component regression via singular value decomposition approach
Authors:
Shuichi Kawano
Abstract:
Principal component regression (PCR) is a two-stage procedure: the first stage performs principal component analysis (PCA) and the second stage constructs a regression model whose explanatory variables are replaced by principal components obtained by the first stage. Since PCA is performed by using only explanatory variables, the principal components have no information about the response variable…
▽ More
Principal component regression (PCR) is a two-stage procedure: the first stage performs principal component analysis (PCA) and the second stage constructs a regression model whose explanatory variables are replaced by principal components obtained by the first stage. Since PCA is performed by using only explanatory variables, the principal components have no information about the response variable. To address the problem, we propose a one-stage procedure for PCR in terms of singular value decomposition approach. Our approach is based upon two loss functions, a regression loss and a PCA loss, with sparse regularization. The proposed method enables us to obtain principal component loadings that possess information about both explanatory variables and a response variable. An estimation algorithm is developed by using alternating direction method of multipliers. We conduct numerical studies to show the effectiveness of the proposed method.
△ Less
Submitted 21 February, 2020;
originally announced February 2020.
-
Optical vortex-induced forward mass transfer: Manifestation of helical trajectory of optical vortex
Authors:
Ryosuke Nakamura,
Haruki Kawaguchi,
Muneaki Iwata,
Akihiro Kaneko,
Ryo Nagura,
Satoyuki Kawano,
Kohei Toyoda,
Katsuhiko Miyamoto,
Takashige Omatsu
Abstract:
The orbital angular momentum of an optical vortex field is found to twist high viscosity donor material to form a micron-scale 'spin jet'. This unique phenomenon manifests the helical trajectory of the optical vortex. Going beyond both the conventional ink jet and laser induced forward mass transfer (LIFT) patterning technologies, it also offers the formation and ejection of a micron-scale 'spin j…
▽ More
The orbital angular momentum of an optical vortex field is found to twist high viscosity donor material to form a micron-scale 'spin jet'. This unique phenomenon manifests the helical trajectory of the optical vortex. Going beyond both the conventional ink jet and laser induced forward mass transfer (LIFT) patterning technologies, it also offers the formation and ejection of a micron-scale 'spin jet' of the donor material even with an ultrahigh viscosity of 4 Pas. This optical vortex laser induced forward mass transfer (OV-LIFT) patterning technique will enable the development of next generation printed photonic/electric/spintronic circuits formed of ultrahigh viscosity donor dots containing functional nanoparticles, such as quantum dots, metallic particles and magnetic ferrite particles, with ultrahigh spatial resolution. It can also potentially explore a completely new needleless drug injection.
△ Less
Submitted 9 December, 2019;
originally announced December 2019.
-
Bayesian sparse convex clustering via global-local shrinkage priors
Authors:
Kaito Shimamura,
Shuichi Kawano
Abstract:
Sparse convex clustering is to cluster observations and conduct variable selection simultaneously in the framework of convex clustering. Although a weighted $L_1$ norm is usually employed for the regularization term in sparse convex clustering, its use increases the dependence on the data and reduces the estimation accuracy if the sample size is not sufficient. To tackle these problems, this paper…
▽ More
Sparse convex clustering is to cluster observations and conduct variable selection simultaneously in the framework of convex clustering. Although a weighted $L_1$ norm is usually employed for the regularization term in sparse convex clustering, its use increases the dependence on the data and reduces the estimation accuracy if the sample size is not sufficient. To tackle these problems, this paper proposes a Bayesian sparse convex clustering method based on the ideas of Bayesian lasso and global-local shrinkage priors. We introduce Gibbs sampling algorithms for our method using scale mixtures of normal distributions. The effectiveness of the proposed methods is shown in simulation studies and a real data analysis.
△ Less
Submitted 26 May, 2020; v1 submitted 19 November, 2019;
originally announced November 2019.
-
Sparse Reduced-Rank Regression for Simultaneous Rank and Variable Selection via Manifold Optimization
Authors:
Kohei Yoshikawa,
Shuichi Kawano
Abstract:
We consider the problem of constructing a reduced-rank regression model whose coefficient parameter is represented as a singular value decomposition with sparse singular vectors. The traditional estimation procedure for the coefficient parameter often fails when the true rank of the parameter is high. To overcome this issue, we develop an estimation algorithm with rank and variable selection via s…
▽ More
We consider the problem of constructing a reduced-rank regression model whose coefficient parameter is represented as a singular value decomposition with sparse singular vectors. The traditional estimation procedure for the coefficient parameter often fails when the true rank of the parameter is high. To overcome this issue, we develop an estimation algorithm with rank and variable selection via sparse regularization and manifold optimization, which enables us to obtain an accurate estimation of the coefficient parameter even if the true rank of the coefficient parameter is high. Using sparse regularization, we can also select an optimal value of the rank. We conduct Monte Carlo experiments and real data analysis to illustrate the effectiveness of our proposed method.
△ Less
Submitted 1 November, 2019; v1 submitted 11 October, 2019;
originally announced October 2019.
-
Sparse principal component regression for generalized linear models
Authors:
Shuichi Kawano,
Hironori Fujisawa,
Toyoyuki Takada,
Toshihiko Shiroishi
Abstract:
Principal component regression (PCR) is a widely used two-stage procedure: principal component analysis (PCA), followed by regression in which the selected principal components are regarded as new explanatory variables in the model. Note that PCA is based only on the explanatory variables, so the principal components are not selected using the information on the response variable. In this paper, w…
▽ More
Principal component regression (PCR) is a widely used two-stage procedure: principal component analysis (PCA), followed by regression in which the selected principal components are regarded as new explanatory variables in the model. Note that PCA is based only on the explanatory variables, so the principal components are not selected using the information on the response variable. In this paper, we propose a one-stage procedure for PCR in the framework of generalized linear models. The basic loss function is based on a combination of the regression loss and PCA loss. An estimate of the regression parameter is obtained as the minimizer of the basic loss function with a sparse penalty. We call the proposed method sparse principal component regression for generalized linear models (SPCR-glm). Taking the two loss function into consideration simultaneously, SPCR-glm enables us to obtain sparse principal component loadings that are related to a response variable. However, a combination of loss functions may cause a parameter identification problem, but this potential problem is avoided by virtue of the sparse penalty. Thus, the sparse penalty plays two roles in this method. The parameter estimation procedure is proposed using various update algorithms with the coordinate descent algorithm. We apply SPCR-glm to two real datasets, doctor visits data and mouse consomic strain data. SPCR-glm provides more easily interpretable principal component (PC) scores and clearer classification on PC plots than the usual PCA.
△ Less
Submitted 12 October, 2016; v1 submitted 28 September, 2016;
originally announced September 2016.
-
Bayesian generalized fused lasso modeling via NEG distribution
Authors:
Kaito Shimamura,
Masao Ueki,
Shuichi Kawano,
Sadanori Konishi
Abstract:
The fused lasso penalizes a loss function by the $L_1$ norm for both the regression coefficients and their successive differences to encourage sparsity of both. In this paper, we propose a Bayesian generalized fused lasso modeling based on a normal-exponential-gamma (NEG) prior distribution. The NEG prior is assumed into the difference of successive regression coefficients. The proposed method ena…
▽ More
The fused lasso penalizes a loss function by the $L_1$ norm for both the regression coefficients and their successive differences to encourage sparsity of both. In this paper, we propose a Bayesian generalized fused lasso modeling based on a normal-exponential-gamma (NEG) prior distribution. The NEG prior is assumed into the difference of successive regression coefficients. The proposed method enables us to construct a more versatile sparse model than the ordinary fused lasso by using a flexible regularization term. We also propose a sparse fused algorithm to produce exact sparse solutions. Simulation studies and real data analyses show that the proposed method has superior performance to the ordinary fused lasso.
△ Less
Submitted 16 February, 2016;
originally announced February 2016.
-
Sparse principal component regression with adaptive loading
Authors:
Shuichi Kawano,
Hironori Fujisawa,
Toyoyuki Takada,
Toshihiko Shiroishi
Abstract:
Principal component regression (PCR) is a two-stage procedure that selects some principal components and then constructs a regression model regarding them as new explanatory variables. Note that the principal components are obtained from only explanatory variables and not considered with the response variable. To address this problem, we propose the sparse principal component regression (SPCR) tha…
▽ More
Principal component regression (PCR) is a two-stage procedure that selects some principal components and then constructs a regression model regarding them as new explanatory variables. Note that the principal components are obtained from only explanatory variables and not considered with the response variable. To address this problem, we propose the sparse principal component regression (SPCR) that is a one-stage procedure for PCR. SPCR enables us to adaptively obtain sparse principal component loadings that are related to the response variable and select the number of principal components simultaneously. SPCR can be obtained by the convex optimization problem for each of parameters with the coordinate descent algorithm. Monte Carlo simulations and real data analyses are performed to illustrate the effectiveness of SPCR.
△ Less
Submitted 31 October, 2014; v1 submitted 26 February, 2014;
originally announced February 2014.
-
Electrochemical response of biased nanoelectrodes in solution
Authors:
Kentaro Doi,
Makusu Tsutsui,
Takahito Ohshiro,
Chih-Chun Chien,
Michael Zwolak,
Masateru Taniguchi,
Tomoji Kawai,
Satoyuki Kawano,
Massimiliano Di Ventra
Abstract:
Novel approaches to DNA sequencing and detection require the measurement of electrical currents between metal probes immersed in ionic solution. Here, we experimentally demonstrate that these systems maintain large background currents with a transient response that decays very slowly in time and noise that increases with ionic concentration. Using a non-equilibrium stochastic model, we obtain an a…
▽ More
Novel approaches to DNA sequencing and detection require the measurement of electrical currents between metal probes immersed in ionic solution. Here, we experimentally demonstrate that these systems maintain large background currents with a transient response that decays very slowly in time and noise that increases with ionic concentration. Using a non-equilibrium stochastic model, we obtain an analytical expression for the ionic current that shows these results are due to a fast electrochemical reaction at the electrode surface followed by the slow formation of a diffusion layer. During the latter, ions translocate in the weak electric field generated after the initial rapid screening of the strong fields near the electrode surfaces. Our theoretical results are in very good agreement with experimental findings.
△ Less
Submitted 13 February, 2013;
originally announced February 2013.
-
Adaptive bridge regression modeling with model selection criteria
Authors:
Shuichi Kawano
Abstract:
We consider the problem of constructing an adaptive bridge regression modeling, which is a penalized procedure by imposing different weights to different coefficients in the bridge penalty term. A crucial issue in the modeling process is the choices of adjusted parameters included in the models. We treat the selection of the adjusted parameters as model selection and evaluation problems. In order…
▽ More
We consider the problem of constructing an adaptive bridge regression modeling, which is a penalized procedure by imposing different weights to different coefficients in the bridge penalty term. A crucial issue in the modeling process is the choices of adjusted parameters included in the models. We treat the selection of the adjusted parameters as model selection and evaluation problems. In order to select the parameters, model selection criteria are derived from information-theoretic and Bayesian approach. We conduct some numerical studies to investigate the effectiveness of our proposed modeling strategy.
△ Less
Submitted 28 August, 2012; v1 submitted 13 April, 2012;
originally announced April 2012.
-
Selection of tuning parameters in bridge regression models via Bayesian information criterion
Authors:
Shuichi Kawano
Abstract:
We consider the bridge linear regression modeling, which can produce a sparse or non-sparse model. A crucial point in the model building process is the selection of adjusted parameters including a regularization parameter and a tuning parameter in bridge regression models. The choice of the adjusted parameters can be viewed as a model selection and evaluation problem. We propose a model selection…
▽ More
We consider the bridge linear regression modeling, which can produce a sparse or non-sparse model. A crucial point in the model building process is the selection of adjusted parameters including a regularization parameter and a tuning parameter in bridge regression models. The choice of the adjusted parameters can be viewed as a model selection and evaluation problem. We propose a model selection criterion for evaluating bridge regression models in terms of Bayesian approach. This selection criterion enables us to select the adjusted parameters objectively. We investigate the effectiveness of our proposed modeling strategy through some numerical examples.
△ Less
Submitted 13 April, 2012; v1 submitted 20 March, 2012;
originally announced March 2012.
-
Semi-supervised logistic discrimination via labeled data and unlabeled data from different sampling distributions
Authors:
Shuichi Kawano
Abstract:
This article addresses the problem of classification method based on both labeled and unlabeled data, where we assume that a density function for labeled data is different from that for unlabeled data. We propose a semi-supervised logistic regression model for classification problem along with the technique of covariate shift adaptation. Unknown parameters involved in proposed models are estimated…
▽ More
This article addresses the problem of classification method based on both labeled and unlabeled data, where we assume that a density function for labeled data is different from that for unlabeled data. We propose a semi-supervised logistic regression model for classification problem along with the technique of covariate shift adaptation. Unknown parameters involved in proposed models are estimated by regularization with EM algorithm. A crucial issue in the modeling process is the choices of tuning parameters in our semi-supervised logistic models. In order to select the parameters, a model selection criterion is derived from an information-theoretic approach. Some numerical studies show that our modeling procedure performs well in various cases.
△ Less
Submitted 13 October, 2012; v1 submitted 26 August, 2011;
originally announced August 2011.
-
Varying-coefficient modeling via regularized basis functions
Authors:
Hidetoshi Matsui,
Toshihiro Misumi,
Shuichi Kawano
Abstract:
We address the problem of constructing varying-coefficient models based on basis expansions along with the technique of regularization. A crucial point in our modeling procedure is the selection of smoothing parameters in the regularization method. In order to choose the parameters objectively, we derive model selection criteria from the viewpoints of information-theoretic and Bayesian approach. W…
▽ More
We address the problem of constructing varying-coefficient models based on basis expansions along with the technique of regularization. A crucial point in our modeling procedure is the selection of smoothing parameters in the regularization method. In order to choose the parameters objectively, we derive model selection criteria from the viewpoints of information-theoretic and Bayesian approach. We demonstrate the effectiveness of proposed modeling strategy through Monte Carlo simulations and analyzing a real data set.
△ Less
Submitted 18 July, 2011;
originally announced July 2011.
-
Semi-supervised logistic discrimination for functional data
Authors:
Shuichi Kawano,
Sadanori Konishi
Abstract:
Multi-class classification methods based on both labeled and unlabeled functional data sets are discussed. We present a semi-supervised logistic model for classification in the context of functional data analysis. Unknown parameters in our proposed model are estimated by regularization with the help of EM algorithm. A crucial point in the modeling procedure is the choice of a regularization parame…
▽ More
Multi-class classification methods based on both labeled and unlabeled functional data sets are discussed. We present a semi-supervised logistic model for classification in the context of functional data analysis. Unknown parameters in our proposed model are estimated by regularization with the help of EM algorithm. A crucial point in the modeling procedure is the choice of a regularization parameter involved in the semi-supervised functional logistic model. In order to select the adjusted parameter, we introduce model selection criteria from information-theoretic and Bayesian viewpoints. Monte Carlo simulations and a real data analysis are given to examine the effectiveness of our proposed modeling strategy.
△ Less
Submitted 28 May, 2012; v1 submitted 21 February, 2011;
originally announced February 2011.
-
On the maximum value of ground states for the scalar field equation with double power nonlinearity
Authors:
Shinji Kawano
Abstract:
We evaluate the maximum value of the unique positive solution to semilinear elliptic equations with double power nonlinearities. It is known that a positive solution to this problem exists under some condition.Moreover, Ouyang and Shi in 1998 found that the solution is unique under the same condition. In the present paper we investigate the maximum value of the solution. The key idea is to exami…
▽ More
We evaluate the maximum value of the unique positive solution to semilinear elliptic equations with double power nonlinearities. It is known that a positive solution to this problem exists under some condition.Moreover, Ouyang and Shi in 1998 found that the solution is unique under the same condition. In the present paper we investigate the maximum value of the solution. The key idea is to examine the function defined from the nonlinearity, which arises from the well-known Pohozaev identity.
△ Less
Submitted 30 January, 2009;
originally announced January 2009.
-
Uniqueness of positive solutions to semilinear elliptic equations with double power nonlinearities, revised eddition
Authors:
Shinji Kawano
Abstract:
We consider uniqueness of positive solutions to semilinear elliptic equations with double power nonlinearities. The condition to assure the existence of positive solutions to these types of equations has long been known. On the other hand for uniqueness, quite technical additional condition is proposed by Ouyang and Shi in 1998. In the present paper we remark that this additional condition is un…
▽ More
We consider uniqueness of positive solutions to semilinear elliptic equations with double power nonlinearities. The condition to assure the existence of positive solutions to these types of equations has long been known. On the other hand for uniqueness, quite technical additional condition is proposed by Ouyang and Shi in 1998. In the present paper we remark that this additional condition is unnecessary.
△ Less
Submitted 16 January, 2009;
originally announced January 2009.
-
Classification of double power nonlinear functions
Authors:
Shinji Kawano
Abstract:
In this article we investigate the nature of the functions, including important double power terms which arise naturally in considering typical nonlinear Schroedinger equations.
In this article we investigate the nature of the functions, including important double power terms which arise naturally in considering typical nonlinear Schroedinger equations.
△ Less
Submitted 6 November, 2008;
originally announced November 2008.
-
Existence and uniqueness conditions of positive solutions to semilinear elliptic equations with double power nonlinearities
Authors:
Shinji Kawano
Abstract:
In this article we find the equivalent conditions to assure the existence and uniqueness of positive solutions to semilinear elliptic equations wih double power nonlinearities. As a bonus, we give a simpler proof of our former result that the uniqueness condition comes from the existence condition.
In this article we find the equivalent conditions to assure the existence and uniqueness of positive solutions to semilinear elliptic equations wih double power nonlinearities. As a bonus, we give a simpler proof of our former result that the uniqueness condition comes from the existence condition.
△ Less
Submitted 6 November, 2008;
originally announced November 2008.
-
Uniqueness of positive solutions to semilinear elliptic equations with double power nonlinearities
Authors:
Shinji Kawano
Abstract:
We consider semilinear elliptic equations with double power nonlineaities. The condition to assure the existence of positive solutions is well-known. In the present paper, we remark that the additional condition to assure uniqueness proposed by Ouyang and Shi is unnecessary.
We consider semilinear elliptic equations with double power nonlineaities. The condition to assure the existence of positive solutions is well-known. In the present paper, we remark that the additional condition to assure uniqueness proposed by Ouyang and Shi is unnecessary.
△ Less
Submitted 6 November, 2008;
originally announced November 2008.
-
On semilinear elliptic equations with global coupling
Authors:
Shinji Kawano
Abstract:
We consider nonlinear elliptic equations which contains global coupling as a nonlinear term. We classify the existence of all possible positive solutions to this problem.
We consider nonlinear elliptic equations which contains global coupling as a nonlinear term. We classify the existence of all possible positive solutions to this problem.
△ Less
Submitted 31 October, 2008;
originally announced October 2008.
-
A remark on the uniqueness of positive solutions to semilinear elliptic equations with double power nonlinearities
Authors:
Shinji Kawano
Abstract:
We consider the uniqueness of positive solutions to semilinear elliptic equations with double power nonlinearities. We deduce the uniqueness from the argument in the classical paper by Peletier and Serrin, thereby recovering a part of the uniqueness result of Ouyang and Shi.
We consider the uniqueness of positive solutions to semilinear elliptic equations with double power nonlinearities. We deduce the uniqueness from the argument in the classical paper by Peletier and Serrin, thereby recovering a part of the uniqueness result of Ouyang and Shi.
△ Less
Submitted 31 October, 2008;
originally announced October 2008.
-
Separation of long DNA chains using non-uniform electric field: a numerical study
Authors:
Shin-ichiro Nagahiro,
Satoyuki Kawano,
Hidetoshi Kotera
Abstract:
We study migration of DNA molecules through a microchannel with a series of electric traps controlled by an ac electric field. We describe the motion of DNA based on Brownian dynamics simulations of a beads-spring chain. Our simulation demonstrates that the chain captured by an electrode escapes from the binding electric field due to thermal fluctuation. We find that the mobility of chain would…
▽ More
We study migration of DNA molecules through a microchannel with a series of electric traps controlled by an ac electric field. We describe the motion of DNA based on Brownian dynamics simulations of a beads-spring chain. Our simulation demonstrates that the chain captured by an electrode escapes from the binding electric field due to thermal fluctuation. We find that the mobility of chain would depend on the chain length; the mobility sharply increases when the length of a chain exceeds a critical value, which is strongly affected by the amplitude of the applied ac field. Thus we can adjust the length regime, in which this microchannel well separates DNA molecules, without changing the structure of the channel. We also present a theoretical insight into the relation between the critical chain length and the field amplitude.
△ Less
Submitted 24 July, 2006; v1 submitted 17 July, 2006;
originally announced July 2006.
-
Study of Field-Induced Magnetic Order in Singlet-Ground-State Magnet CsFeCl$_3$
Authors:
Mitsuru Toda,
Yutaka Fujii,
Shinji Kawano,
Takao Goto,
Meiro Chiba,
Shizumasa Ueda,
Kenji Nakajima,
Kazuhisa Kakurai,
Jens Klenke,
Ralf Feyerherm,
Matthias Meschke,
Hans Anton Graf,
Michael Steiner
Abstract:
The field-induced magnetic order in the singlet-ground-state system CsFeCl$_3$ has been studied by measuring magnetization and neutron diffraction. The field dependence of intensity for the neutron magnetic reflection has clearly demonstrated that the field-induced ordered phase is described by the order parameter $<S_x>$. A condensate growth of magnons is investigated through the temperature de…
▽ More
The field-induced magnetic order in the singlet-ground-state system CsFeCl$_3$ has been studied by measuring magnetization and neutron diffraction. The field dependence of intensity for the neutron magnetic reflection has clearly demonstrated that the field-induced ordered phase is described by the order parameter $<S_x>$. A condensate growth of magnons is investigated through the temperature dependence of $M_z$ and $M_{\perp}$, and this ordering is discussed in the context of a magnon Bose-Einstein condensation. Development of the coherent state and the static correlation length has been observed in the incommensurate phase in the field region of $5 < H < 6$ T at 1.8 K. At $H > H_{\rm c}$, a satellite peak was found in coexistence with the commensurate peak at the phase boundary around 10 T, which indicates that the tilt of the c-axis would be less than $\sim 0.5^{\circ}$ in the whole experiments.
△ Less
Submitted 9 July, 2004; v1 submitted 8 July, 2004;
originally announced July 2004.