Search | arXiv e-print repository

LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs

Authors: LLM-jp: Akiko Aizawa, Eiji Aramaki, Bowen Chen, Fei Cheng, Hiroyuki Deguchi, Rintaro Enomoto, Kazuki Fujii, Kensuke Fukumoto, Takuya Fukushima, Namgi Han, Yuto Harada, Chikara Hashimoto, Tatsuya Hiraoka, Shohei Hisada, Sosuke Hosokawa, Lu Jie, Keisuke Kamata, Teruhito Kanazawa, Hiroki Kanezashi, Hiroshi Kataoka, Satoru Katsumata, Daisuke Kawahara, Seiya Kawano, et al. (57 additional authors not shown)

Abstract: This paper introduces LLM-jp, a cross-organizational project for the research and development of Japanese large language models (LLMs). LLM-jp aims to develop open-source and strong Japanese LLMs, and as of this writing, more than 1,500 participants from academia and industry are working together for this purpose. This paper presents the background of the establishment of LLM-jp, summaries of its activities, and technical reports on the LLMs developed by LLM-jp. For the latest activities, visit https://llm-jp.nii.ac.jp/en/.

Submitted 4 July, 2024; originally announced July 2024.

arXiv:2406.09839 [pdf, other]

Rapport-Driven Virtual Agent: Rapport Building Dialogue Strategy for Improving User Experience at First Meeting

Authors: Muhammad Yeza Baihaqi, Angel García Contreras, Seiya Kawano, Koichiro Yoshino

Abstract: Rapport is known as a conversational aspect focusing on relationship building, which influences outcomes in collaborative tasks. This study aims to establish human-agent rapport through small talk by using a rapport-building strategy. We implemented this strategy for the virtual agents based on dialogue strategies by prompting a large language model (LLM). In particular, we utilized two dialogue strategies, predefined sequence and free-form, to guide the dialogue generation framework. We conducted analyses based on human evaluations, examining correlations between total turn, utterance characters, rapport score, and user experience variables: naturalness, satisfaction, interest, engagement, and usability. We investigated correlations between rapport score and naturalness, satisfaction, engagement, and conversation flow. Our experimental results also indicated that using free-form to prompt the rapport-building strategy performed the best in subjective scores.

Submitted 14 June, 2024; originally announced June 2024.

Comments: will be presented at INTERSPEECH 2024

arXiv:2404.12463 [pdf, other]

Spatially Selected and Dependent Random Effects for Small Area Estimation with Application to Rent Burden

Authors: Sho Kawano, Paul A. Parker, Zehang Richard Li

Abstract: Area-level models for small area estimation typically rely on areal random effects to shrink design-based direct estimates towards a model-based predictor. Incorporating the spatial dependence of the random effects into these models can further improve the estimates when there are not enough covariates to fully account for spatial dependence of the areal means. A number of recent works have investigated models that include random effects for only a subset of areas, in order to improve the precision of estimates. However, such models do not readily handle spatial dependence. In this paper, we introduce a model that accounts for spatial dependence in both the random effects as well as the latent process that selects the effects. We show how this model can significantly improve predictive accuracy via an empirical simulation study based on data from the American Community Survey, and illustrate its properties via an application to estimate county-level median rent burden.

Submitted 18 April, 2024; originally announced April 2024.

arXiv:2404.03250 [pdf, ps, other]

Multi-task learning via robust regularized clustering with non-convex group penalties

Authors: Akira Okazaki, Shuichi Kawano

Abstract: Multi-task learning (MTL) aims to improve estimation and prediction performance by sharing common information among related tasks. One natural assumption in MTL is that tasks are classified into clusters based on their characteristics. However, existing MTL methods based on this assumption often ignore outlier tasks that have large task-specific components or no relation to other tasks. To address this issue, we propose a novel MTL method called Multi-Task Learning via Robust Regularized Clustering (MTLRRC). MTLRRC incorporates robust regularization terms inspired by robust convex clustering, which is further extended to handle non-convex and group-sparse penalties. The extension allows MTLRRC to simultaneously perform robust task clustering and outlier task detection. The connection between the extended robust clustering and the multivariate M-estimator is also established. This provides an interpretation of the robustness of MTLRRC against outlier tasks. An efficient algorithm based on a modified alternating direction method of multipliers is developed for the estimation of the parameters. The effectiveness of MTLRRC is demonstrated through simulation studies and application to real data.

Submitted 27 May, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

Comments: 32 pages

arXiv:2403.19259 [pdf, other]

J-CRe3: A Japanese Conversation Dataset for Real-world Reference Resolution

Authors: Nobuhiro Ueda, Hideko Habe, Yoko Matsui, Akishige Yuguchi, Seiya Kawano, Yasutomo Kawanishi, Sadao Kurohashi, Koichiro Yoshino

Abstract: Understanding expressions that refer to the physical world is crucial for real-world human-assisting systems, such as robots that must perform the actions users expect. In real-world reference resolution, a system must ground the verbal information that appears in user interactions to the visual information observed in egocentric views. To this end, we propose a multimodal reference resolution task and construct a Japanese Conversation dataset for Real-world Reference Resolution (J-CRe3). Our dataset contains egocentric video and dialogue audio of real-world conversations between two people acting as a master and an assistant robot at home. The dataset is annotated with crossmodal tags between phrases in the utterances and the object bounding boxes in the video frames. These tags include indirect reference relations, such as predicate-argument structures and bridging references, as well as direct reference relations. We also constructed an experimental model and clarified the challenges in multimodal reference resolution tasks.

Submitted 28 March, 2024; originally announced March 2024.

Comments: LREC-COLING 2024

arXiv:2403.17545 [pdf, other]

A Gaze-grounded Visual Question Answering Dataset for Clarifying Ambiguous Japanese Questions

Authors: Shun Inadumi, Seiya Kawano, Akishige Yuguchi, Yasutomo Kawanishi, Koichiro Yoshino

Abstract: Situated conversations, which refer to visual information as visual question answering (VQA), often contain ambiguities caused by reliance on directive information. This problem is exacerbated because some languages, such as Japanese, often omit subjective or objective terms. Such ambiguities in questions are often clarified by the contexts in conversational situations, such as joint attention with a user or user gaze information. In this study, we propose the Gaze-grounded VQA dataset (GazeVQA) that clarifies ambiguous questions using gaze information by focusing on a clarification process complemented by gaze information. We also propose a method that utilizes gaze target estimation results to improve the accuracy of GazeVQA tasks. Our experimental results showed that the proposed method improved the performance in some cases of a VQA system on GazeVQA and identified some typical problems of GazeVQA tasks that need to be improved.

Submitted 26 March, 2024; originally announced March 2024.

Comments: LREC-COLING 2024

arXiv:2402.06906 [pdf, other]

doi 10.15607/RSS.2023.XIX.090

ROSE: Rotation-based Squeezing Robotic Gripper toward Universal Handling of Objects

Authors: Son Tien Bui, Shinya Kawano, Van Anh Ho

Abstract: Robotic hands/grippers nowadays are not limited to manufacturing lines; instead, they are widely utilized in cluttered environments, such as restaurants, farms, and warehouses. In such scenarios, they need to deal with high uncertainty of the grasped objects' shapes, postures, surfaces, and material properties, which requires complex integration of sensing and decision-making processes. On the other hand, integrating soft materials into the gripper's design may tolerate the above uncertainties and reduce complexity in control. In this paper, we introduce ROSE, a novel soft gripper that can embrace the object and squeeze it by buckling a funnel-like thin-walled soft membrane around the object by simple rotation of the base. Thanks to this design, the ROSE hand can adapt to a wide range of objects that can fit in the funnel and handle them with gentle gripping force. Regardless of this, ROSE can generate a high lift force (up to 33 kgf) while significantly reducing the normal pressure on the gripped objects. In our experiment, a 198 g ROSE can be integrated into a robot arm with a single actuation and successfully lift various types of objects, even after 400,000 trials. The embracing mechanism helps reduce the dependence on friction between the object and the membrane, as ROSE could pick up a chicken egg submerged inside an olive oil tank. We also report a feasible design for equipping the ROSE hand with tactile sensing while appealing to the scalability of the design to fit a wide range of objects. Video: https://youtu.be/E1wAI09LaoY

Submitted 10 February, 2024; originally announced February 2024.

Comments: 9 pages, 9 figures, RSS2023 conference

Journal ref: Robotics: Science and Systems 2023

arXiv:2312.08838 [pdf, ps, other]

Bayesian Fused Lasso Modeling for Binary Data

Authors: Yuko Kakikawa, Shuichi Kawano

Abstract: L1-norm regularized logistic regression models are widely used for analyzing data with binary response. In those analyses, fusing regression coefficients is useful for detecting groups of variables. This paper proposes a binomial logistic regression model with Bayesian fused lasso. Assuming a Laplace prior on regression coefficients and differences between adjacent regression coefficients enables us to perform variable selection and variable fusion simultaneously in the Bayesian framework. We also propose assuming a horseshoe prior on the differences to improve the flexibility of variable fusion. The Gibbs sampler is derived to estimate the parameters by a hierarchical expression of priors and a data-augmentation method. Using simulation studies and real data analysis, we compare the proposed methods with the existing method.

Submitted 14 December, 2023; originally announced December 2023.

MSC Class: 62F15 and 62J07 and 62J12

arXiv:2312.02408 [pdf]

Mid-infrared optical coherence tomography with MHz axial line rate for real-time non-destructive testing

Authors: Satoko Yagi, Takuma Nakamura, Kazuki Hashimoto, Shotaro Kawano, Takuro Ideguchi

Abstract: Non-destructive testing (NDT) is crucial for ensuring product quality and safety across various industries. Conventional methods such as ultrasonic, terahertz, and X-ray imaging have limitations in terms of probe-contact requirement, depth resolution, or radiation risks. Optical coherence tomography (OCT) is a promising alternative to solve these limitations, but it suffers from strong scattering, limiting its penetration depth. Recently, OCT in the mid-infrared (MIR) spectral region has attracted attention with a significantly lower scattering rate than in the near-infrared region. However, the highest reported A-scan rate of MIR-OCT has been 3 kHz, which requires a long data acquisition time to take an image and fails to satisfy industrial demands for real-time diagnosis. Here, we present a high-speed MIR-OCT system operating in the 3-4 μm region that employs the swept-source OCT technique based on time-stretch infrared spectroscopy. By integrating a broadband femtosecond MIR pulsed laser operating at a repetition rate of 50 MHz, we achieved an A-scan rate of 1 MHz with an axial resolution of 11.6 μm and a sensitivity of 55 dB. As a proof-of-concept demonstration, we imaged the surface of substrates covered by highly scattering paint coatings. The demonstrated A-scan rate surpasses previous state-of-the-art by more than two orders of magnitude, paving the way for real-time NDT of industrial products, cultural assets, and structures.

Submitted 4 December, 2023; originally announced December 2023.

arXiv:2311.13031 [pdf]

Anisotropic Optical Conductivity Accompanied by a Small Energy Gap in One-Dimensional Thermoelectric Telluride Ta4SiTe4

Authors: Fumiya Matsunaga, Yoshihiko Okamoto, Yasunori Yokoyama, Kanji Takehana, Yasutaka Imanaka, Yuto Nakamura, Hideo Kishida, Shoya Kawano, Kazuyuki Matsuhira, Koshi Takenaka

Abstract: We investigated the optical properties of single crystals of one-dimensional telluride Ta4SiTe4, which shows high thermoelectric performance below room temperature. Optical conductivity estimated from reflectivity spectra indicates the presence of a small energy gap of 0.1-0.15 eV at the Fermi energy. At the lowest energy, optical conductivity along the Ta4SiTe4 chain is an order of magnitude higher than that perpendicular to this direction, reflecting the anisotropic electron conduction in Ta4SiTe4. These results indicate that coexistence of a very small band gap and anisotropic electron conduction is a promising strategy to develop a high-performance thermoelectric material for low temperature applications.

Submitted 21 November, 2023; originally announced November 2023.

Comments: 6 pages, 4 figures

arXiv:2309.04685 [pdf, other]

Simultaneous Modeling of Disease Screening and Severity Prediction: A Multi-task and Sparse Regularization Approach

Authors: Kazuharu Harada, Shuichi Kawano, Masataka Taguri

Abstract: The exploration of biomarkers, which are clinically useful biomolecules, and the development of prediction models using them are important problems in biomedical research. Biomarkers are widely used for disease screening, and some are related not only to the presence or absence of a disease but also to its severity. These biomarkers can be useful for prioritization of treatment and clinical decision-making. Considering a model helpful for both disease screening and severity prediction, this paper focuses on regression modeling for an ordinal response equipped with a hierarchical structure. If the response variable is a combination of the presence of disease and severity such as {healthy, mild, intermediate, severe}, for example, the simplest method would be to apply the conventional ordinal regression model. However, the conventional model has flexibility issues and may not be suitable for the problems addressed in this paper, where the levels of the response variable might be heterogeneous. Therefore, this paper proposes a model assuming screening and severity prediction as different tasks, and an estimation method based on structural sparse regularization that leverages any common structure between the tasks when such commonality exists. In numerical experiments, the proposed method demonstrated stable performance across many scenarios compared to existing ordinal regression methods.

Submitted 25 June, 2024; v1 submitted 9 September, 2023; originally announced September 2023.

arXiv:2304.13342 [pdf, ps, other]

Multi-Task Learning Regression via Convex Clustering

Authors: Akira Okazaki, Shuichi Kawano

Abstract: Multi-task learning (MTL) is a methodology that aims to improve the general performance of estimation and prediction by sharing common information among related tasks. In MTL, there are several assumptions about the relationships among tasks and several methods to incorporate them. One natural assumption in practical situations is that tasks are classified into clusters according to their characteristics. Under this assumption, the group fused regularization approach performs clustering of the tasks by shrinking the differences among tasks. This enables us to transfer common information within the same cluster. However, this approach also transfers information between different clusters, which worsens the estimation and prediction. To overcome this problem, we propose an MTL method with a centroid parameter representing a cluster center of the task. Because this model separates parameters into the parameters for regression and the parameters for clustering, we can improve estimation and prediction accuracy for regression coefficient vectors. We show the effectiveness of the proposed method through Monte Carlo simulations and applications to real data.

Submitted 26 April, 2023; originally announced April 2023.

Comments: 18 pages, 4 tables

arXiv:2304.10855 [pdf, other]

Ab initio calculation for electronic structure and optical property of tungsten carbide in a TiCN-based cermet for solar thermal applications

Authors: Shota Hayakawa, Toshiharu Chono, Kosuke Watanabe, Shoya Kawano, Kazuma Nakamura, Koji Miyazaki

Abstract: We present an ab initio calculation to understand the electronic structure and optical properties of tungsten carbide (WC), a major component of a TiCN-based cermet. We found that WC has a fairly low-energy plasma excitation of $\sim$0.6 eV (2 $μ$m) and therefore can be a good constituent of a solar selective absorber. The evaluated figure of merit for photothermal conversion is prominently high compared to those of the other materials included in the TiCN-based cermet. The imaginary part of the dielectric function is considerably small around the zero point of the real part of the dielectric function, corresponding to the plasma excitation energy. Therefore, a clear plasma edge appeared, ensuring the high performance of WC as the solar absorber.

Submitted 21 April, 2023; originally announced April 2023.

Comments: 13 pages, 8 figures, 2 tables

arXiv:2304.07451 [pdf, ps, other]

Multivariate regression modeling in integrative analysis via sparse regularization

Authors: Shuichi Kawano, Toshikazu Fukushima, Junichi Nakagawa, Mamoru Oshiki

Abstract: The multivariate regression model basically offers the analysis of a single dataset with multiple responses. However, such a single-dataset analysis often leads to unsatisfactory results. Integrative analysis is an effective method to pool useful information from multiple independent datasets and provides better performance than single-dataset analysis. In this study, we propose a multivariate regression modeling in integrative analysis. The integration is achieved by sparse estimation that performs variable and group selection. Based on the idea of alternating direction method of multipliers, we develop its computational algorithm that enjoys the convergence property. The performance of the proposed method is demonstrated through Monte Carlo simulation and analyzing wastewater treatment data with microbe measurements.

Submitted 14 April, 2023; originally announced April 2023.

arXiv:2212.00142 [pdf, other]

Determination of Majorana type-phases from the time evolution of lepton numbers

Authors: Nicholas J. Benoit, Yuta Kawamura, Saki Kawano, Takuya Morozumi, Yusuke Shimizu, Kei Yamamoto

Abstract: We have investigated an approach to determine the Majorana type-phases using the time evolution of lepton family numbers. The Majorana type-phases are related to the orientation of unitarity triangles for the Pontecorvo-Maki-Nakagawa-Sakata (PMNS) matrix, and the Majorana phases $α_{21}$ and $α_{31}$. After taking the second-order time derivative of the lepton family number expectation values, the dependencies on the summation of Majorana type-phases can be determined, thus allowing for the extraction of the orientation of the unitarity triangles and the Majorana phases. We study how to extract the Majorana type-phases and the lightest neutrino mass for three massive neutrinos, and when a neutrino is massless, i.e., $m_{1,3}=0$. Our result can be complementary to using neutrinoless double-beta decay for determining the orientation of PMNS unitarity triangles and the Majorana phases.

Submitted 7 December, 2022; v1 submitted 30 November, 2022; originally announced December 2022.

Comments: 18 pages, 5 figures. In v2, one reference is cited in the correct place. The order of the authors' addresses is corrected

Report number: HUPD-2213

arXiv:2210.02735 [pdf, ps, other]

What Should the System Do Next?: Operative Action Captioning for Estimating System Actions

Authors: Taiki Nakamura, Seiya Kawano, Akishige Yuguchi, Yasutomo Kawanishi, Koichiro Yoshino

Abstract: Such human-assisting systems as robots need to correctly understand the surrounding situation based on observations and output the required support actions for humans. Language is one of the important channels to communicate with humans, and the robots are required to have the ability to express their understanding and action planning results. In this study, we propose a new task of operative action captioning that estimates and verbalizes the actions to be taken by the system in a human-assisting domain. We constructed a system that outputs a verbal description of a possible operative action that changes the current state to the given target state. We collected a dataset consisting of two images as observations, which express the current state and the state changed by actions, and a caption that describes the actions that change the current state to the target state, by crowdsourcing in daily life situations. Then we constructed a system that estimates operative action by a caption. Since the operative action's caption is expected to contain some state-changing actions, we use scene-graph prediction as an auxiliary task because the events written in the scene graphs correspond to the state changes. Experimental results showed that our system successfully described the operative actions that should be conducted between the current and target states. The auxiliary tasks that predict the scene graphs improved the quality of the estimation results.

Submitted 6 October, 2022; originally announced October 2022.

Comments: Under review in ICRA2023

arXiv:2201.08053 [pdf, ps, other]

Bayesian Fused Lasso Modeling via Horseshoe Prior

Authors: Yuko Kakikawa, Kaito Shimamura, Shuichi Kawano

Abstract: Bayesian fused lasso is one of the sparse Bayesian methods, which shrinks both regression coefficients and their successive differences simultaneously. In this paper, we propose a Bayesian fused lasso modeling via horseshoe prior. By assuming a horseshoe prior on the difference of successive regression coefficients, the proposed method enables us to prevent over-shrinkage of those differences. We also propose a Bayesian hexagonal operator for regression with shrinkage and equality selection (HORSES) with horseshoe prior, which imposes priors on all combinations of differences of regression coefficients. Simulation studies and an application to real data show that the proposed method gives better performance than existing methods.

Submitted 20 January, 2022; originally announced January 2022.

Comments: 17 pages

MSC Class: 62F15; 62J07 (Primary) 62J05 (Secondary)

arXiv:2111.06617 [pdf, ps, other]

doi 10.3390/e24121839

Multi-task Learning for Compositional Data via Sparse Network Lasso

Authors: Akira Okazaki, Shuichi Kawano

Abstract: A network lasso enables us to construct a model for each sample, which is known as multi-task learning. Existing methods for multi-task learning cannot be applied to compositional data due to their intrinsic properties. In this paper, we propose a multi-task learning method for compositional data using a sparse network lasso. We focus on a symmetric form of the log-contrast model, which is a regression model with compositional covariates. The effectiveness of the proposed method is shown through simulation studies and application to gut microbiome data.

Submitted 17 November, 2021; v1 submitted 12 November, 2021; originally announced November 2021.

Comments: 21 pages, 4 figures

arXiv:2110.09040 [pdf, ps, other]

A Bayesian approach to multi-task learning with network lasso

Authors: Kaito Shimamura, Shuichi Kawano

Abstract: Network lasso is a method for solving a multi-task learning problem through the regularized maximum likelihood method. A characteristic of network lasso is setting a different model for each sample. The relationships among the models are represented by relational coefficients. A crucial issue in network lasso is to provide appropriate values for these relational coefficients. In this paper, we propose a Bayesian approach to solve multi-task learning problems by network lasso. This approach allows us to objectively determine the relational coefficients by Bayesian estimation. The effectiveness of the proposed method is shown in a simulation study and a real data analysis.

Submitted 18 October, 2021; originally announced October 2021.

arXiv:2109.11352 [pdf]

Proteomics Standards Initiatives ProForma 2.0 Unifying the encoding of Proteoforms and Peptidoforms

Authors: Richard D. LeDuc, Eric W. Deutsch, Pierre-Alain Binz, Ryan T. Fellers, Anthony J. Cesnik, Joshua A. Klein, Tim Van Den Bossche, Ralf Gabriels, Arshika Yalavarthi, Yasset Perez-Riverol, Jeremy Carver, Wout Bittremieux, Shin Kawano, Benjamin Pullman, Nuno Bandeira, Neil L. Kelleher, Paul M. Thomas, Juan Antonio Vizcaíno

Abstract: There is the need to represent in a standard manner all the possible variations of a protein or peptide primary sequence, including both artefactual and post-translational modifications of peptides and proteins. With that overall aim, here, the Human Proteome Organization (HUPO) Proteomics Standards Initiative (PSI) has developed a notation, called ProForma 2.0, which is a substantial extension of the original ProForma notation, developed by the Consortium for Top-Down Proteomics (CTDP). ProForma 2.0 aims to unify the representation of proteoforms and peptidoforms. Therefore, this notation supports use cases needed for bottom-up and middle/topdown proteomics approaches and allows the encoding of highly modified proteins and peptides using a human and machine-readable string. ProForma 2.0 covers encoding protein modification names and accessions, cross-linking reagents including disulfides, glycans, modifications encoded using mass shifts and/or via chemical formulas, labile and C or N-terminal modifications, ambiguity in the modification position and representation of atomic isotopes, among other use cases. Notational conventions are based on public controlled vocabularies and ontologies. Detailed information about the notation and existing implementations are available at http://www.psidev.info/proforma and at the corresponding GitHub repository (https://github.com/HUPO-PSI/proforma).

Submitted 21 March, 2022; v1 submitted 23 September, 2021; originally announced September 2021.

arXiv:2102.00136 [pdf, ps, other]

Smoothly varying ridge regularization

Authors: Daeju Kim, Shuichi Kawano, Yoshiyuki Ninomiya

Abstract: A basis expansion with regularization methods is very appealing for flexible or robust nonlinear regression models for data with complex structures. When the underlying function has inhomogeneous smoothness, it is well known that conventional regularization methods do not perform well. In this case, an adaptive procedure such as a free-knot spline or a local likelihood method is often introduced as an effective method. However, both methods need intensive computational loads. In this study, we consider a new efficient basis expansion by proposing a smoothly varying regularization method which is constructed by some special penalties. We call them adaptive-type penalties. In our modeling, adaptive-type penalties play key roles and have been successful in giving good estimates for functions with inhomogeneous smoothness. A crucial issue in the modeling process is the choice of a suitable model among candidates. To select the suitable model, we derive an approximated generalized information criterion (GIC). The proposed method is investigated through Monte Carlo simulations and real data analysis. Numerical results suggest that our method performs well in various situations.

Submitted 29 January, 2021; originally announced February 2021.

Comments: 21 pages, 6 figures, 3 tables

arXiv:2011.01300 [pdf, other]

doi 10.1016/j.commatsci.2020.110144

Classification of atomic environments via the Gromov-Wasserstein distance

Authors: Sakura Kawano, Jeremy K. Mason

Abstract: Interpreting molecular dynamics simulations usually involves automated classification of local atomic environments to identify regions of interest. Existing approaches are generally limited to a small number of reference structures and only include limited information about the local chemical composition. This work proposes to use a variant of the Gromov-Wasserstein (GW) distance to quantify the difference between a local atomic environment and a set of arbitrary reference environments in a way that is sensitive to atomic displacements, missing atoms, and differences in chemical composition. This involves describing a local atomic environment as a finite metric measure space, which has the additional advantages of not requiring the local environment to be centered on an atom and of not making any assumptions about the material class. Numerical examples illustrate the efficacy and versatility of the algorithm.

Submitted 2 November, 2020; originally announced November 2020.

Journal ref: Comp. Mater. Sci. 188, 110144 (2021)

arXiv:2009.02695 [pdf, other]

Multilinear Common Component Analysis via Kronecker Product Representation

Authors: Kohei Yoshikawa, Shuichi Kawano

Abstract: We consider the problem of extracting a common structure from multiple tensor datasets. For this purpose, we propose multilinear common component analysis (MCCA) based on Kronecker products of mode-wise covariance matrices. MCCA constructs a common basis represented by linear combinations of the original variables which loses as little information of the multiple tensor datasets as possible. We also develop an estimation algorithm for MCCA that guarantees mode-wise global convergence. Numerical studies are conducted to show the effectiveness of MCCA.

Submitted 20 November, 2020; v1 submitted 6 September, 2020; originally announced September 2020.

Comments: 35 pages, 7 figures

arXiv:2005.03419 [pdf, other]

Relevance Vector Machine with Weakly Informative Hyperprior and Extended Predictive Information Criterion

Authors: Kazuaki Murayama, Shuichi Kawano

Abstract: In the variational relevance vector machine, the gamma distribution is representative as a hyperprior over the noise precision of automatic relevance determination prior. Instead of the gamma hyperprior, we propose to use the inverse gamma hyperprior with a shape parameter close to zero and a scale parameter not necessarily close to zero. This hyperprior is associated with the concept of a weakly informative prior. The effect of this hyperprior is investigated through regression to non-homogeneous data. Because it is difficult to capture the structure of such data with a single kernel function, we apply the multiple kernel method, in which multiple kernel functions with different widths are arranged for input data. We confirm that the degrees of freedom in a model are controlled by adjusting the scale parameter and keeping the shape parameter close to zero. A candidate for selecting the scale parameter is the predictive information criterion. However, the estimated model using this criterion seems to cause over-fitting. This is because the multiple kernel method puts the model in a situation where the dimension of the model is larger than the data size. To select an appropriate scale parameter even in such a situation, we also propose an extended prediction information criterion. It is confirmed that a multiple kernel relevance vector regression model with good predictive accuracy can be obtained by selecting the scale parameter that minimizes the extended prediction information criterion.

Submitted 7 May, 2020; originally announced May 2020.

Comments: 29 pages, 12 captioned figures, 23 files of non-captioned figures

arXiv:2003.13299 [pdf, other]

doi 10.1007/978-981-16-2765-1_41

Variable fusion for Bayesian linear regression via spike-and-slab priors

Authors: Shengyi Wu, Kaito Shimamura, Kohei Yoshikawa, Kazuaki Murayama, Shuichi Kawano

Abstract: In linear regression models, fusion of coefficients is used to identify predictors having similar relationships with a response. This is called variable fusion. This paper presents a novel variable fusion method in terms of Bayesian linear regression models. We focus on hierarchical Bayesian models based on a spike-and-slab prior approach. A spike-and-slab prior is tailored to perform variable fusion. To obtain estimates of the parameters, we develop a Gibbs sampler for the parameters. Simulation studies and a real data analysis show that our proposed method achieves better performance than previous methods.

Submitted 2 December, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

Comments: 19 pages

Journal ref: Proceedings in the 13th KES International Conference on Intelligent Decision Technologies 238 (2021) 491-501

arXiv:2002.09188 [pdf, ps, other]

doi 10.1007/s11634-020-00435-2

Sparse principal component regression via singular value decomposition approach

Authors: Shuichi Kawano

Abstract: Principal component regression (PCR) is a two-stage procedure: the first stage performs principal component analysis (PCA) and the second stage constructs a regression model whose explanatory variables are replaced by principal components obtained by the first stage. Since PCA is performed by using only explanatory variables, the principal components have no information about the response variable. To address the problem, we propose a one-stage procedure for PCR in terms of singular value decomposition approach. Our approach is based upon two loss functions, a regression loss and a PCA loss, with sparse regularization. The proposed method enables us to obtain principal component loadings that possess information about both explanatory variables and a response variable. An estimation algorithm is developed by using alternating direction method of multipliers. We conduct numerical studies to show the effectiveness of the proposed method.

Submitted 21 February, 2020; originally announced February 2020.

Comments: 30 pages

Journal ref: Advances in Data Analysis and Classification 15 (2021) 795-823

arXiv:1912.03907 [pdf]

doi 10.1364/OE.27.038019

Optical vortex-induced forward mass transfer: Manifestation of helical trajectory of optical vortex

Authors: Ryosuke Nakamura, Haruki Kawaguchi, Muneaki Iwata, Akihiro Kaneko, Ryo Nagura, Satoyuki Kawano, Kohei Toyoda, Katsuhiko Miyamoto, Takashige Omatsu

Abstract: The orbital angular momentum of an optical vortex field is found to twist high viscosity donor material to form a micron-scale 'spin jet'. This unique phenomenon manifests the helical trajectory of the optical vortex. Going beyond both the conventional ink jet and laser induced forward mass transfer (LIFT) patterning technologies, it also offers the formation and ejection of a micron-scale 'spin jet' of the donor material even with an ultrahigh viscosity of 4 Pa·s. This optical vortex laser induced forward mass transfer (OV-LIFT) patterning technique will enable the development of next generation printed photonic/electric/spintronic circuits formed of ultrahigh viscosity donor dots containing functional nanoparticles, such as quantum dots, metallic particles and magnetic ferrite particles, with ultrahigh spatial resolution. It can also potentially explore a completely new needleless drug injection.

Submitted 9 December, 2019; originally announced December 2019.

arXiv:1911.08703 [pdf, other]

Bayesian sparse convex clustering via global-local shrinkage priors

Authors: Kaito Shimamura, Shuichi Kawano

Abstract: Sparse convex clustering is to cluster observations and conduct variable selection simultaneously in the framework of convex clustering. Although a weighted $L_1$ norm is usually employed for the regularization term in sparse convex clustering, its use increases the dependence on the data and reduces the estimation accuracy if the sample size is not sufficient. To tackle these problems, this paper proposes a Bayesian sparse convex clustering method based on the ideas of Bayesian lasso and global-local shrinkage priors. We introduce Gibbs sampling algorithms for our method using scale mixtures of normal distributions. The effectiveness of the proposed methods is shown in simulation studies and a real data analysis.

Submitted 26 May, 2020; v1 submitted 19 November, 2019; originally announced November 2019.

arXiv:1910.05083 [pdf, other]

Sparse Reduced-Rank Regression for Simultaneous Rank and Variable Selection via Manifold Optimization

Authors: Kohei Yoshikawa, Shuichi Kawano

Abstract: We consider the problem of constructing a reduced-rank regression model whose coefficient parameter is represented as a singular value decomposition with sparse singular vectors. The traditional estimation procedure for the coefficient parameter often fails when the true rank of the parameter is high. To overcome this issue, we develop an estimation algorithm with rank and variable selection via sparse regularization and manifold optimization, which enables us to obtain an accurate estimation of the coefficient parameter even if the true rank of the coefficient parameter is high. Using sparse regularization, we can also select an optimal value of the rank. We conduct Monte Carlo experiments and real data analysis to illustrate the effectiveness of our proposed method.

Submitted 1 November, 2019; v1 submitted 11 October, 2019; originally announced October 2019.

Comments: 28 pages

arXiv:1609.08886 [pdf, ps, other]

doi 10.1016/j.csda.2018.03.008

Sparse principal component regression for generalized linear models

Authors: Shuichi Kawano, Hironori Fujisawa, Toyoyuki Takada, Toshihiko Shiroishi

Abstract: Principal component regression (PCR) is a widely used two-stage procedure: principal component analysis (PCA), followed by regression in which the selected principal components are regarded as new explanatory variables in the model. Note that PCA is based only on the explanatory variables, so the principal components are not selected using the information on the response variable. In this paper, we propose a one-stage procedure for PCR in the framework of generalized linear models. The basic loss function is based on a combination of the regression loss and PCA loss. An estimate of the regression parameter is obtained as the minimizer of the basic loss function with a sparse penalty. We call the proposed method sparse principal component regression for generalized linear models (SPCR-glm). Taking the two loss functions into consideration simultaneously, SPCR-glm enables us to obtain sparse principal component loadings that are related to a response variable. A combination of loss functions may cause a parameter identification problem, but this potential problem is avoided by virtue of the sparse penalty. Thus, the sparse penalty plays two roles in this method. The parameter estimation procedure is proposed using various update algorithms with the coordinate descent algorithm. We apply SPCR-glm to two real datasets, doctor visits data and mouse consomic strain data. SPCR-glm provides more easily interpretable principal component (PC) scores and clearer classification on PC plots than the usual PCA.

Submitted 12 October, 2016; v1 submitted 28 September, 2016; originally announced September 2016.

Comments: 29 pages

Journal ref: Computational Statistics & Data Analysis 124 (2018) 180-196

arXiv:1602.04910 [pdf, other]

doi 10.1080/03610926.2018.1489056

Bayesian generalized fused lasso modeling via NEG distribution

Authors: Kaito Shimamura, Masao Ueki, Shuichi Kawano, Sadanori Konishi

Abstract: The fused lasso penalizes a loss function by the $L_1$ norm for both the regression coefficients and their successive differences to encourage sparsity of both. In this paper, we propose a Bayesian generalized fused lasso modeling based on a normal-exponential-gamma (NEG) prior distribution. The NEG prior is assumed for the difference of successive regression coefficients. The proposed method enables us to construct a more versatile sparse model than the ordinary fused lasso by using a flexible regularization term. We also propose a sparse fused algorithm to produce exact sparse solutions. Simulation studies and real data analyses show that the proposed method has superior performance to the ordinary fused lasso.

Submitted 16 February, 2016; originally announced February 2016.

Comments: 26 pages

MSC Class: Primary 62F15; 62J07; Secondary 62J05

Journal ref: Communications in Statistics - Theory and Methods 48 (2019) 4132-4153

arXiv:1402.6455 [pdf, ps, other]

doi 10.1016/j.csda.2015.03.016

Sparse principal component regression with adaptive loading

Authors: Shuichi Kawano, Hironori Fujisawa, Toyoyuki Takada, Toshihiko Shiroishi

Abstract: Principal component regression (PCR) is a two-stage procedure that selects some principal components and then constructs a regression model regarding them as new explanatory variables. Note that the principal components are obtained from only explanatory variables and not considered with the response variable. To address this problem, we propose the sparse principal component regression (SPCR) that is a one-stage procedure for PCR. SPCR enables us to adaptively obtain sparse principal component loadings that are related to the response variable and select the number of principal components simultaneously. SPCR can be obtained by the convex optimization problem for each of the parameters with the coordinate descent algorithm. Monte Carlo simulations and real data analyses are performed to illustrate the effectiveness of SPCR.

Submitted 31 October, 2014; v1 submitted 26 February, 2014; originally announced February 2014.

Comments: 24 pages

MSC Class: 62H25; 62J07

Journal ref: Computational Statistics & Data Analysis 89 (2015) 192-203

arXiv:1302.3016 [pdf, ps, other]

Electrochemical response of biased nanoelectrodes in solution

Authors: Kentaro Doi, Makusu Tsutsui, Takahito Ohshiro, Chih-Chun Chien, Michael Zwolak, Masateru Taniguchi, Tomoji Kawai, Satoyuki Kawano, Massimiliano Di Ventra

Abstract: Novel approaches to DNA sequencing and detection require the measurement of electrical currents between metal probes immersed in ionic solution. Here, we experimentally demonstrate that these systems maintain large background currents with a transient response that decays very slowly in time and noise that increases with ionic concentration. Using a non-equilibrium stochastic model, we obtain an analytical expression for the ionic current that shows these results are due to a fast electrochemical reaction at the electrode surface followed by the slow formation of a diffusion layer. During the latter, ions translocate in the weak electric field generated after the initial rapid screening of the strong fields near the electrode surfaces. Our theoretical results are in very good agreement with experimental findings.

Submitted 13 February, 2013; originally announced February 2013.

Report number: LA-UR-13-20328

arXiv:1204.3130

Adaptive bridge regression modeling with model selection criteria

Authors: Shuichi Kawano

Abstract: We consider the problem of constructing an adaptive bridge regression modeling, which is a penalized procedure by imposing different weights to different coefficients in the bridge penalty term. A crucial issue in the modeling process is the choice of adjusted parameters included in the models. We treat the selection of the adjusted parameters as model selection and evaluation problems. In order to select the parameters, model selection criteria are derived from information-theoretic and Bayesian approaches. We conduct some numerical studies to investigate the effectiveness of our proposed modeling strategy.

Submitted 28 August, 2012; v1 submitted 13 April, 2012; originally announced April 2012.

Comments: 14 pages, 4 figures

Journal ref: Bulletin of Informatics and Cybernetics 44 (2012) 29-39

arXiv:1203.4326 [pdf, ps, other]

doi 10.1007/s00362-013-0561-7

Selection of tuning parameters in bridge regression models via Bayesian information criterion

Authors: Shuichi Kawano

Abstract: We consider the bridge linear regression modeling, which can produce a sparse or non-sparse model. A crucial point in the model building process is the selection of adjusted parameters including a regularization parameter and a tuning parameter in bridge regression models. The choice of the adjusted parameters can be viewed as a model selection and evaluation problem. We propose a model selection criterion for evaluating bridge regression models in terms of Bayesian approach. This selection criterion enables us to select the adjusted parameters objectively. We investigate the effectiveness of our proposed modeling strategy through some numerical examples.

Submitted 13 April, 2012; v1 submitted 20 March, 2012; originally announced March 2012.

Comments: 20 pages, 5 figures

MSC Class: 62J05; 62G05; 62F15

Journal ref: Statistical Papers 55 (2014) 1207-1223

arXiv:1108.5244 [pdf, ps, other]

doi 10.1002/sam.11204

Semi-supervised logistic discrimination via labeled data and unlabeled data from different sampling distributions

Authors: Shuichi Kawano

Abstract: This article addresses the problem of classification method based on both labeled and unlabeled data, where we assume that a density function for labeled data is different from that for unlabeled data. We propose a semi-supervised logistic regression model for the classification problem along with the technique of covariate shift adaptation. Unknown parameters involved in the proposed models are estimated by regularization with the EM algorithm. A crucial issue in the modeling process is the choice of tuning parameters in our semi-supervised logistic models. In order to select the parameters, a model selection criterion is derived from an information-theoretic approach. Some numerical studies show that our modeling procedure performs well in various cases.

Submitted 13 October, 2012; v1 submitted 26 August, 2011; originally announced August 2011.

Comments: 19 pages

Journal ref: Statistical Analysis and Data Mining 6 (2013) 472-481

arXiv:1107.3618 [pdf, ps, other]

doi 10.1080/00949655.2013.785548

Varying-coefficient modeling via regularized basis functions

Authors: Hidetoshi Matsui, Toshihiro Misumi, Shuichi Kawano

Abstract: We address the problem of constructing varying-coefficient models based on basis expansions along with the technique of regularization. A crucial point in our modeling procedure is the selection of smoothing parameters in the regularization method. In order to choose the parameters objectively, we derive model selection criteria from the viewpoints of information-theoretic and Bayesian approach. We demonstrate the effectiveness of proposed modeling strategy through Monte Carlo simulations and analyzing a real data set.

Submitted 18 July, 2011; originally announced July 2011.

Comments: 10 pages, 4 figures

MSC Class: 62G08

Journal ref: Journal of Statistical Computation and Simulation 84 (2014) 2156-2165

arXiv:1102.4399 [pdf, ps, other]

Semi-supervised logistic discrimination for functional data

Authors: Shuichi Kawano, Sadanori Konishi

Abstract: Multi-class classification methods based on both labeled and unlabeled functional data sets are discussed. We present a semi-supervised logistic model for classification in the context of functional data analysis. Unknown parameters in our proposed model are estimated by regularization with the help of EM algorithm. A crucial point in the modeling procedure is the choice of a regularization parameter involved in the semi-supervised functional logistic model. In order to select the adjusted parameter, we introduce model selection criteria from information-theoretic and Bayesian viewpoints. Monte Carlo simulations and a real data analysis are given to examine the effectiveness of our proposed modeling strategy.

Submitted 28 May, 2012; v1 submitted 21 February, 2011; originally announced February 2011.

Comments: 21 pages, 7 figures

MSC Class: 62H30; 62G05; 68T10

Journal ref: Bulletin of Informatics and Cybernetics 44 (2012) 1-15

arXiv:0901.4829 [pdf, ps, other]

On the maximum value of ground states for the scalar field equation with double power nonlinearity

Authors: Shinji Kawano

Abstract: We evaluate the maximum value of the unique positive solution to semilinear elliptic equations with double power nonlinearities. It is known that a positive solution to this problem exists under some condition. Moreover, Ouyang and Shi in 1998 found that the solution is unique under the same condition. In the present paper we investigate the maximum value of the solution. The key idea is to examine the function defined from the nonlinearity, which arises from the well-known Pohozaev identity.

Submitted 30 January, 2009; originally announced January 2009.

Comments: 8 pages

MSC Class: 35B45

arXiv:0901.2426 [pdf, ps, other]

Uniqueness of positive solutions to semilinear elliptic equations with double power nonlinearities, revised eddition

Authors: Shinji Kawano

Abstract: We consider uniqueness of positive solutions to semilinear elliptic equations with double power nonlinearities. The condition to assure the existence of positive solutions to these types of equations has long been known. For uniqueness, on the other hand, a quite technical additional condition was proposed by Ouyang and Shi in 1998. In the present paper we remark that this additional condition is unnecessary.

Submitted 16 January, 2009; originally announced January 2009.

MSC Class: 35J15

arXiv:0811.0951 [pdf, ps, other]

Classification of double power nonlinear functions

Authors: Shinji Kawano

Abstract: In this article we investigate the nature of the functions, including important double power terms which arise naturally in considering typical nonlinear Schroedinger equations.

Submitted 6 November, 2008; originally announced November 2008.

MSC Class: 35B05

arXiv:0811.0946 [pdf, ps, other]

Existence and uniqueness conditions of positive solutions to semilinear elliptic equations with double power nonlinearities

Authors: Shinji Kawano

Abstract: In this article we find the equivalent conditions to assure the existence and uniqueness of positive solutions to semilinear elliptic equations with double power nonlinearities. As a bonus, we give a simpler proof of our former result that the uniqueness condition comes from the existence condition.

Submitted 6 November, 2008; originally announced November 2008.

MSC Class: 35B05

arXiv:0811.0937 [pdf, ps, other]

Uniqueness of positive solutions to semilinear elliptic equations with double power nonlinearities

Authors: Shinji Kawano

Abstract: We consider semilinear elliptic equations with double power nonlinearities. The condition to assure the existence of positive solutions is well-known. In the present paper, we remark that the additional condition to assure uniqueness proposed by Ouyang and Shi is unnecessary.

Submitted 6 November, 2008; originally announced November 2008.

MSC Class: 35B05

arXiv:0810.5646 [pdf, ps, other]

On semilinear elliptic equations with global coupling

Authors: Shinji Kawano

Abstract: We consider nonlinear elliptic equations that contain global coupling as a nonlinear term. We classify the existence of all possible positive solutions to this problem.

Submitted 31 October, 2008; originally announced October 2008.

MSC Class: 35J60

arXiv:0810.5638 [pdf, ps, other]

A remark on the uniqueness of positive solutions to semilinear elliptic equations with double power nonlinearities

Authors: Shinji Kawano

Abstract: We consider the uniqueness of positive solutions to semilinear elliptic equations with double power nonlinearities. We deduce the uniqueness from the argument in the classical paper by Peletier and Serrin, thereby recovering a part of the uniqueness result of Ouyang and Shi.

Submitted 31 October, 2008; originally announced October 2008.

MSC Class: 35B05

arXiv:cond-mat/0607430 [pdf, ps, other]

doi 10.1103/PhysRevE.75.011902

Separation of long DNA chains using non-uniform electric field: a numerical study

Authors: Shin-ichiro Nagahiro, Satoyuki Kawano, Hidetoshi Kotera

Abstract: We study migration of DNA molecules through a microchannel with a series of electric traps controlled by an ac electric field. We describe the motion of DNA based on Brownian dynamics simulations of a beads-spring chain. Our simulation demonstrates that the chain captured by an electrode escapes from the binding electric field due to thermal fluctuation. We find that the mobility of chain would depend on the chain length; the mobility sharply increases when the length of a chain exceeds a critical value, which is strongly affected by the amplitude of the applied ac field. Thus we can adjust the length regime, in which this microchannel well separates DNA molecules, without changing the structure of the channel. We also present a theoretical insight into the relation between the critical chain length and the field amplitude.

Submitted 24 July, 2006; v1 submitted 17 July, 2006; originally announced July 2006.

Comments: 12 pages, 9 figures

arXiv:cond-mat/0407196 [pdf, ps, other]

doi 10.1103/PhysRevB.71.224426

Study of Field-Induced Magnetic Order in Singlet-Ground-State Magnet CsFeCl$_3$

Authors: Mitsuru Toda, Yutaka Fujii, Shinji Kawano, Takao Goto, Meiro Chiba, Shizumasa Ueda, Kenji Nakajima, Kazuhisa Kakurai, Jens Klenke, Ralf Feyerherm, Matthias Meschke, Hans Anton Graf, Michael Steiner

Abstract: The field-induced magnetic order in the singlet-ground-state system CsFeCl$_3$ has been studied by measuring magnetization and neutron diffraction. The field dependence of intensity for the neutron magnetic reflection has clearly demonstrated that the field-induced ordered phase is described by the order parameter $<S_x>$. A condensate growth of magnons is investigated through the temperature dependence of $M_z$ and $M_{\perp}$, and this ordering is discussed in the context of a magnon Bose-Einstein condensation. Development of the coherent state and the static correlation length has been observed in the incommensurate phase in the field region of $5 < H < 6$ T at 1.8 K. At $H > H_{\rm c}$, a satellite peak was found in coexistence with the commensurate peak at the phase boundary around 10 T, which indicates that the tilt of the c-axis would be less than $\sim 0.5^{\circ}$ in the whole experiments.

Submitted 9 July, 2004; v1 submitted 8 July, 2004; originally announced July 2004.

Comments: 5 pages, 5 figures

Showing 1–47 of 47 results for author: Kawano, S