
Showing 1–50 of 146 results for author: Komatsu, T

  1. arXiv:2407.13186  [pdf, other]

    cs.RO

    Nearest Neighbor Future Captioning: Generating Descriptions for Possible Collisions in Object Placement Tasks

    Authors: Takumi Komatsu, Motonari Kambara, Shumpei Hatanaka, Haruka Matsuo, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi, Komei Sugiura

    Abstract: Domestic service robots (DSRs) that support people in everyday environments have been widely investigated. However, their ability to predict and describe future risks resulting from their own actions remains insufficient. In this study, we focus on the linguistic explainability of DSRs. Most existing methods do not explicitly model the region of possible collisions; thus, they do not properly gene…

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: Accepted for presentation at Advanced Robotics 24

  2. arXiv:2406.13139  [pdf, other]

    eess.AS cs.SD

    Audio Fingerprinting with Holographic Reduced Representations

    Authors: Yusuke Fujita, Tatsuya Komatsu

    Abstract: This paper proposes an audio fingerprinting model with holographic reduced representation (HRR). The proposed method reduces the number of stored fingerprints, whereas conventional neural audio fingerprinting requires many fingerprints for each audio track to achieve high accuracy and time resolution. We utilize HRR to aggregate multiple fingerprints into a composite fingerprint via circular convo…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: accepted at Interspeech 2024

  3. arXiv:2406.12194  [pdf, other]

    eess.AS cs.SD

    Universal Score-based Speech Enhancement with High Content Preservation

    Authors: Robin Scheibler, Yusuke Fujita, Yuma Shirahata, Tatsuya Komatsu

    Abstract: We propose UNIVERSE++, a universal speech enhancement method based on score-based diffusion and adversarial training. Specifically, we improve the existing UNIVERSE model that decouples clean speech feature extraction and diffusion. Our contributions are three-fold. First, we make several modifications to the network architecture, improving training stability and final performance. Second, we intr…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 5 pages, 5 figures, accepted at Interspeech 2024

  4. arXiv:2403.13816  [pdf, other]

    cond-mat.mtrl-sci cond-mat.str-el

    Combined X-ray diffraction, electrical resistivity, and $ab$ $initio$ study of (TMTTF)$_2$PF$_6$ under pressure: implications to the unified phase diagram

    Authors: Miho Itoi, Kazuyoshi Yoshimi, Hanming Ma, Takahiro Misawa, Takao Tsumuraya, Dilip Bhoi, Tokutaro Komatsu, Hatsumi Mori, Yoshiya Uwatoko, Hitoshi Seo

    Abstract: We present a combined experimental and theoretical study on the quasi-one-dimensional organic conductor (TMTTF)$_2$PF$_6$, and elucidate the variation of its physical properties under pressure. We fully resolve the crystal structure by single crystal x-ray diffraction measurements using a diamond anvil cell up to 8 GPa, and based on the structural data, we perform first-principles density-function…

    Submitted 18 February, 2024; originally announced March 2024.

    Comments: 13 pages, 10 figures

  5. arXiv:2403.07534  [pdf, ps, other]

    math.NT math.CO

    Frobenius numbers associated with Diophantine triples of $x^2+y^2=z^r$ (extended version)

    Authors: Takao Komatsu, Neha Gupta, Manoj Upreti

    Abstract: We give an explicit formula for the $p$-Frobenius number of triples associated with Diophantine equations $x^2+y^2=z^r$, that is, the largest positive integer that can only be represented in $p$ ways by combining the three integers of the solutions of Diophantine equations $x^2+y^2=z^r$. When $r=2$, the Frobenius number has already been given.

    Submitted 12 March, 2024; originally announced March 2024.

  6. arXiv:2401.11700  [pdf, other]

    cs.CL cs.SD eess.AS

    Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech Recognisers

    Authors: Michael Hentschel, Yuta Nishikawa, Tatsuya Komatsu, Yusuke Fujita

    Abstract: This study presents a novel approach for knowledge distillation (KD) from a BERT teacher model to an automatic speech recognition (ASR) model using intermediate layers. To distil the teacher's knowledge, we use an attention decoder that learns from BERT's token probabilities. Our method shows that language model (LM) information can be more effectively distilled into an ASR model using both the in…

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: Accepted at ICASSP 2024

  7. Estimation of articulated angle in six-wheeled dump trucks using multiple GNSS receivers for autonomous driving

    Authors: Taro Suzuki, Kazunori Ohno, Syotaro Kojima, Naoto Miyamoto, Takahiro Suzuki, Tomohiro Komatsu, Yukinori Shibata, Kimitaka Asano, Keiji Nagatani

    Abstract: Due to the declining birthrate and aging population, the shortage of labor in the construction industry has become a serious problem, and increasing attention has been paid to automation of construction equipment. We focus on the automatic operation of articulated six-wheel dump trucks at construction sites. For the automatic operation of the dump trucks, it is important to estimate the position a…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: This is an electronic version of an article published in ADVANCED ROBOTICS, 35:23, 1376-1387, 2021. ADVANCED ROBOTICS is available online at: www.tandfonline.com/Article DOI; 10.1080/01691864.2019.1619622

    Journal ref: Advanced Robotics, 35:23, 1376-1387, 2021

  8. arXiv:2311.03997  [pdf, ps, other]

    math.NT

    On a conjecture of Ramírez Alfonsín and Skałba III

    Authors: Yuchen Ding, Takao Komatsu

    Abstract: Let $1<c<d$ be two relatively prime integers. For a non-negative integer $\ell$, let $g_\ell(c,d)$ be the largest integer $n$ such that $n=c x+d y$ has at most $\ell$ non-negative solutions $(x,y)$. In this paper we prove that $$ π_{\ell,c,d}\sim\frac{π\bigl(g_\ell(c,d)\bigr)}{2 \ell+2}\quad(\text{as}~ c\to\infty)\,, $$ where $π_{\ell,c,d}$ is the number of primes $n$ having more than $\ell$ disti…

    Submitted 7 November, 2023; originally announced November 2023.

  9. arXiv:2310.03273  [pdf, other]

    cs.CV cs.LG eess.IV

    Ablation Study to Clarify the Mechanism of Object Segmentation in Multi-Object Representation Learning

    Authors: Takayuki Komatsu, Yoshiyuki Ohmura, Yasuo Kuniyoshi

    Abstract: Multi-object representation learning aims to represent complex real-world visual input using the composition of multiple objects. Representation learning methods have often used unsupervised learning to segment an input image into individual objects and encode these objects into each latent vector. However, it is not clear how previous methods have achieved the appropriate segmentation of individu…

    Submitted 4 October, 2023; originally announced October 2023.

  10. arXiv:2309.09491  [pdf, ps, other]

    math.NT

    Polynomial identities and Fermat quotients

    Authors: Takao Komatsu, B. Sury

    Abstract: We prove some polynomial identities from which we deduce congruences modulo $p^2$ for the Fermat quotient $\frac{2^p-2}{p}$ for any odd prime $p$ (Proposition 1 and Theorem 1). These congruences are simpler than the one obtained by Jothilingam in 1985 which involves listing quadratic residues in some order. On the way, we also observe some more congruences for the Fermat quotient that generalize E…

    Submitted 18 September, 2023; originally announced September 2023.

  11. arXiv:2309.08141  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD eess.SP

    Audio Difference Learning for Audio Captioning

    Authors: Tatsuya Komatsu, Yusuke Fujita, Kazuya Takeda, Tomoki Toda

    Abstract: This study introduces a novel training paradigm, audio difference learning, for improving audio captioning. The fundamental concept of the proposed learning method is to create a feature representation space that preserves the relationship between audio, enabling the generation of captions that detail intricate audio information. This method employs a reference audio along with the input audio, bo…

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: submitted to ICASSP2024

  12. arXiv:2309.08140  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD

    PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions

    Authors: Reo Shimizu, Ryuichi Yamamoto, Masaya Kawamura, Yuma Shirahata, Hironori Doi, Tatsuya Komatsu, Kentaro Tachibana

    Abstract: We propose PromptTTS++, a prompt-based text-to-speech (TTS) synthesis system that allows control over speaker identity using natural language descriptions. To control speaker identity within the prompt-based TTS framework, we introduce the concept of speaker prompt, which describes voice characteristics (e.g., gender-neutral, young, old, and muffled) designed to be approximately independent of spe…

    Submitted 27 December, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: Accepted to ICASSP 2024

  13. arXiv:2307.08998  [pdf, ps, other]

    math.CO math.NT

    $p$-numerical semigroups of Pell triples

    Authors: Takao Komatsu, Jiaxin Mu

    Abstract: For a nonnegative integer $p$, the $p$-numerical semigroup $S_p$ is defined as the set of integers whose nonnegative integral linear combinations of given positive integers $a_1,a_2,\dots,a_κ$ with $\gcd(a_1,a_2,\dots,a_κ)=1$ are expressed in more than $p$ ways. When $p=0$, $S=S_0$ is the original numerical semigroup. The largest element and the cardinality of $\mathbb N_0\backslash S_p$ are calle…

    Submitted 3 January, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: J. Ramanujan Math. Soc. arXiv admin note: text overlap with arXiv:2304.00443

  14. arXiv:2304.00443  [pdf, ps, other]

    math.NT math.CO math.GR

    $p$-numerical semigroup of generalized Fibonacci triples

    Authors: Takao Komatsu, Shanta Laishram, Pooja Punyani

    Abstract: For a nonnegative integer $p$, we give explicit formulas for the $p$-Frobenius number and the $p$-genus of generalized Fibonacci numerical semigroups. Here, the $p$-numerical semigroup $S_p$ is defined as the set of integers whose nonnegative integral linear combinations of given positive integers $a_1,a_2,\dots,a_k$ are expressed more than $p$ ways. When $p=0$, $S_0$ with the $0$-Frobenius number…

    Submitted 2 April, 2023; originally announced April 2023.

    MSC Class: 11D07; 20M14; 05A17; 05A19; 11D04; 11B68; 11P81

  15. arXiv:2303.06806  [pdf, other]

    eess.AS cs.CL cs.SD

    Neural Diarization with Non-autoregressive Intermediate Attractors

    Authors: Yusuke Fujita, Tatsuya Komatsu, Robin Scheibler, Yusuke Kida, Tetsuji Ogawa

    Abstract: End-to-end neural diarization (EEND) with encoder-decoder-based attractors (EDA) is a promising method to handle the whole speaker diarization problem simultaneously with a single neural network. While the EEND model can produce all frame-level speaker labels simultaneously, it disregards output label dependency. In this work, we propose a novel EEND model that introduces the label dependency betw…

    Submitted 12 March, 2023; originally announced March 2023.

    Comments: ICASSP 2023

  16. arXiv:2302.09583  [pdf, ps, other]

    math.CO

    Alternating Walk/Zeta Correspondence

    Authors: Takashi Komatsu, Norio Konno, Iwao Sato

    Abstract: We consider the alternating zeta function and the alternating $L$-function of a graph $G$, and express them by using the Ihara zeta function of $G$. Next, we define a generalized alternating zeta function of a graph, and express the generalized alternating zeta function of a vertex-transitive regular graph by spectra of the transition probability matrix of the symmetric simple random walk on it an…

    Submitted 19 February, 2023; originally announced February 2023.

    Comments: 24 pages. arXiv admin note: text overlap with arXiv:2205.00457; text overlap with arXiv:1905.13182 by other authors

    MSC Class: 05C50; 15A15

  17. arXiv:2212.13704  [pdf, other]

    math-ph math.CO math.PR

    Ronkin/Zeta Correspondence

    Authors: Takashi Komatsu, Norio Konno, Iwao Sato, Kohei Sato

    Abstract: The Ronkin function was defined by Ronkin in the study of the zeros of almost periodic functions. Recently, this function has been used in various research fields, including mathematics and physics. In mathematics in particular, it has close connections with tropical geometry, amoebas, Newton polytopes and dimer models. On the other hand, we have been investigating a new class of zeta funct…

    Submitted 12 February, 2023; v1 submitted 28 December, 2022; originally announced December 2022.

    Comments: 19 pages. arXiv admin note: substantial text overlap with arXiv:2202.05966

  18. arXiv:2210.17019  [pdf, ps, other]

    math.NT math.AC math.CO

    Sylvester sums on the Frobenius set in arithmetic progression with initial gaps

    Authors: Takao Komatsu

    Abstract: Let $a_1,a_2,\dots,a_k$ be positive integers with $\gcd(a_1,a_2,\dots,a_k)=1$. The Frobenius number is the largest positive integer that is NOT representable in terms of $a_1,a_2,\dots,a_k$. When $k\ge 3$, there is no explicit formula in general, but some formulae may exist for special sequences $a_1,a_2,\dots,a_k$, including those forming arithmetic progressions and their modifications. In this pape…

    Submitted 30 October, 2022; originally announced October 2022.

  19. arXiv:2209.13118  [pdf, ps, other]

    math.NT math.CO

    The Frobenius number for shifted geometric sequences associated with the number of solutions

    Authors: Takao Komatsu

    Abstract: For a non-negative integer $p$, one of the generalized Frobenius numbers, that is called the $p$-Frobenius number, is the largest integer that is represented at most in $p$ ways as a linear combination with nonnegative integer coefficients of a given set of positive integers whose greatest common divisor is one. The famous so-called Frobenius number proposed by Frobenius is reduced to the $0$-Frob…

    Submitted 9 February, 2024; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: Publicationes Mathematicae Debrecen (in the second half of 2024 in volume 105)

  20. arXiv:2209.06674  [pdf, ps, other]

    math.CO

    Analytic aspects of $q,r$-analogue of poly-Stirling numbers of both kinds

    Authors: Takao Komatsu, Eli Bagno, David Garber

    Abstract: The Stirling numbers of type $B$ of the second kind count signed set partitions. In this paper we provide new combinatorial and analytical identities regarding these numbers as well as Broder's $r$-version of these numbers. Among these identities one can find recursions, explicit formulas based on the inclusion-exclusion principle, and also exponential generating functions. These Stirling number…

    Submitted 5 April, 2024; v1 submitted 14 September, 2022; originally announced September 2022.

    Comments: Revised version. 37 pages, no figures; submitted

    MSC Class: Primary: 05A15; Secondary: 05A18; 05A19; 05A30; 11B73

  21. arXiv:2207.08962  [pdf, ps, other]

    math.CO math.AC math.NT

    $p$-numerical semigroups with $p$-symmetric properties

    Authors: Takao Komatsu, Haotian Ying

    Abstract: The so-called Frobenius number in the famous linear Diophantine problem of Frobenius is the largest integer such that the linear equation $a_1 x_1+\cdots+a_k x_k=n$ ($a_1,\dots,a_k$ are given positive integers with $\gcd(a_1,\dots,a_k)=1$) does not have a non-negative integer solution $(x_1,\dots,x_k)$. The generalized Frobenius number (called the $p$-Frobenius number) is the largest integer such…

    Submitted 18 June, 2023; v1 submitted 18 July, 2022; originally announced July 2022.

    Comments: Journal of Algebra and its Applications (2024)

    MSC Class: 20M14; 11D07; 20M05; 05A15; 11B25

  22. The Frobenius number for sequences of triangular numbers associated with number of solutions

    Authors: Takao Komatsu

    Abstract: The famous linear diophantine problem of Frobenius is the problem to determine the largest integer (Frobenius number) whose number of representations in terms of $a_1,\dots,a_k$ is at most zero, that is not representable. In other words, all the integers greater than this number can be represented for at least one way. One of the natural generalizations of this problem is to find the largest integ…

    Submitted 1 July, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: Annals of Combinatorics

  23. arXiv:2206.13052  [pdf, ps, other]

    math.NT math.CO

    The $p$-numerical semigroup of the triple of arithmetic progressions

    Authors: Takao Komatsu, Haotian Ying

    Abstract: For given positive integers $a_1,a_2,\dots,a_k$ with $\gcd(a_1,a_2,\dots,a_k)=1$, the denumerant $d(n)=d(n;a_1,a_2,\dots,a_k)$ is the number of nonnegative solutions $(x_1,x_2,\dots,x_k)$ of the linear equation $a_1 x_1+a_2 x_2+\dots+a_k x_k=n$ for a positive integer $n$. For a given nonnegative integer $p$, let $S_p=S_p(a_1,a_2,\dots,a_k)$ be the set of all nonnegative integers $n$'s such that…

    Submitted 27 June, 2023; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: Symmetry Vol.15 (2023)

  24. arXiv:2206.05660  [pdf, ps, other]

    math.CO math.NT

    The $p$-Frobenius and $p$-Sylvester numbers for Fibonacci and Lucas triplets

    Authors: Takao Komatsu, Haotian Ying

    Abstract: In this paper we study a certain kind of generalized linear Diophantine problem of Frobenius. Let $a_1,a_2,\dots,a_l$ be positive integers such that their greatest common divisor is one. For a nonnegative integer $p$, denote the $p$-Frobenius number by $g_p(a_1,a_2,\dots,a_l)$, which is the largest integer that can be represented at most $p$ ways by a linear combination with nonnegative integer co…

    Submitted 3 December, 2022; v1 submitted 12 June, 2022; originally announced June 2022.

    Comments: Mathematical Biosciences and Engineering

    MSC Class: 11D07; 05A15; 05A17; 05A19; 11B68; 11D04; 11P81; 20M14

  25. arXiv:2205.00457  [pdf, ps, other]

    math.CO

    Metzler/Zeta Correspondence

    Authors: Yusuke Ide, Takashi Komatsu, Norio Konno, Iwao Sato

    Abstract: We present an explicit formula for the determinant on the Metzler matrix of a digraph $D$. Furthermore, we introduce a walk-type zeta function with respect to this Metzler matrix of the symmetric digraph of a finite torus, and express its limit formula by using the integral expression.

    Submitted 1 July, 2022; v1 submitted 1 May, 2022; originally announced May 2022.

    Comments: 16 pages

    MSC Class: 05C50; 15A15

  26. arXiv:2204.07325  [pdf, ps, other]

    math.NT math.CO

    Sylvester power and weighted sums on the Frobenius set in arithmetic progression

    Authors: Takao Komatsu

    Abstract: Let $a_1,a_2,\dots,a_k$ be positive integers with $\gcd(a_1,a_2,\dots,a_k)=1$. The Frobenius number is the largest positive integer that is NOT representable in terms of $a_1,a_2,\dots,a_k$. When $k\ge 3$, there is no explicit formula in general, but some formulae may exist for special sequences $a_1,a_2,\dots,a_k$, including those forming arithmetic progressions and their modifications. In this pape…

    Submitted 15 April, 2022; originally announced April 2022.

    Comments: arXiv admin note: text overlap with arXiv:2111.11021, arXiv:2203.12238, arXiv:2101.04298

    MSC Class: 11D07; 05A15; 05A17; 05A19; 11B68; 11D04; 11P81

    Journal ref: Discrete Applied Mathematics 315 (2022), 110-126

  27. arXiv:2204.02279  [pdf, ps, other]

    cs.SD eess.AS

    How Information on Acoustic Scenes and Sound Events Mutually Benefits Event Detection and Scene Classification Tasks

    Authors: Keisuke Imoto, Yuka Komatsu, Shunsuke Tsubaki, Tatsuya Komatsu

    Abstract: Acoustic scene classification (ASC) and sound event detection (SED) are fundamental tasks in environmental sound analysis, and many methods based on deep learning have been proposed. Considering that information on acoustic scenes and sound events helps SED and ASC mutually, some researchers have proposed a joint analysis of acoustic scenes and sound events by multitask learning (MTL). However, co…

    Submitted 5 April, 2022; originally announced April 2022.

    Comments: Submitted to INTERSPEECH 2022

  28. arXiv:2204.00176  [pdf, other]

    cs.CL cs.SD eess.AS

    Better Intermediates Improve CTC Inference

    Authors: Tatsuya Komatsu, Yusuke Fujita, Jaesong Lee, Lukas Lee, Shinji Watanabe, Yusuke Kida

    Abstract: This paper proposes a method for improved CTC inference with searched intermediates and multi-pass conditioning. The paper first formulates self-conditioned CTC as a probabilistic model with an intermediate prediction as a latent representation and provides a tractable conditioning framework. We then propose two new conditioning methods based on the new formulation: (1) Searched intermediate condi…

    Submitted 31 March, 2022; originally announced April 2022.

    Comments: 5 pages, submitted INTERSPEECH2022

  29. Alternate Intermediate Conditioning with Syllable-level and Character-level Targets for Japanese ASR

    Authors: Yusuke Fujita, Tatsuya Komatsu, Yusuke Kida

    Abstract: End-to-end automatic speech recognition directly maps input speech to characters. However, the mapping can be problematic when several different pronunciations should be mapped into one character or when one pronunciation is shared among many different characters. Japanese ASR suffers the most from such many-to-one and one-to-many mapping problems due to Japanese kanji characters. To alleviate the…

    Submitted 12 March, 2023; v1 submitted 31 March, 2022; originally announced April 2022.

    Comments: SLT 2022

  30. arXiv:2204.00174  [pdf, other]

    cs.CL cs.SD eess.AS

    InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR

    Authors: Yu Nakagome, Tatsuya Komatsu, Yusuke Fujita, Shuta Ichimura, Yusuke Kida

    Abstract: This paper proposes InterAug: a novel training method for CTC-based ASR using augmented intermediate representations for conditioning. The proposed method exploits the conditioning framework of self-conditioned CTC to train robust models by conditioning with "noisy" intermediate predictions. During the training, intermediate predictions are changed to incorrect intermediate predictions, and fed in…

    Submitted 31 March, 2022; originally announced April 2022.

    Comments: This paper was submitted to INTERSPEECH2022

  31. arXiv:2203.12238  [pdf, ps, other]

    math.NT math.CO

    Sylvester sums on the Frobenius set in arithmetic progression

    Authors: Takao Komatsu

    Abstract: Let $a_1,a_2,\dots,a_k$ be positive integers with $\gcd(a_1,a_2,\dots,a_k)=1$. The concept of the weighted sum $\sum_{n\in{\rm NR}}λ^{n}$ is introduced in \cite{KZ0,KZ}, where ${\rm NR}={\rm NR}(a_1,a_2,\dots,a_k)$ denotes the set of positive integers nonrepresentable in terms of $a_1,a_2,\dots,a_k$. When $λ=1$, such a sum is often called Sylvester sum. The main purpose of this paper is to give ex…

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: In: F. Yilmaz et al. (eds.), Mathematical Methods for Engineering Applications, Springer Proceedings in Mathematics & Statistics, vol. 384. Springer, Cham., 2022 May. (to appear)

    MSC Class: 11D07; 05A15; 05A17; 05A19; 11B68; 11D04; 11P81

  32. arXiv:2202.08474  [pdf, other]

    eess.AS cs.SD

    Non-Autoregressive ASR with Self-Conditioned Folded Encoders

    Authors: Tatsuya Komatsu

    Abstract: This paper proposes CTC-based non-autoregressive ASR with self-conditioned folded encoders. The proposed method realizes non-autoregressive ASR with fewer parameters by folding the conventional stack of encoders into only two blocks; base encoders and folded encoders. The base encoders convert the input audio features into a neural representation suitable for recognition. This is followed by the f…

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: 5 pages, accepted at ICASSP2022

  33. Acoustic Event Detection with Classifier Chains

    Authors: Tatsuya Komatsu, Shinji Watanabe, Koichi Miyazaki, Tomoki Hayashi

    Abstract: This paper proposes acoustic event detection (AED) with classifier chains, a new classifier based on the probabilistic chain rule. The proposed AED with classifier chains consists of a gated recurrent unit and performs iterative binary detection of each event one by one. In each iteration, the event's activity is estimated and used to condition the next output based on the probabilistic chain rule…

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: 5pages, presented at Interspeech2021

  34. arXiv:2202.08456  [pdf, other]

    eess.AS cs.LG cs.SD

    MLP-ASR: Sequence-length agnostic all-MLP architectures for speech recognition

    Authors: Jin Sakuma, Tatsuya Komatsu, Robin Scheibler

    Abstract: We propose multi-layer perceptron (MLP)-based architectures suitable for variable length input. MLP-based architectures, recently proposed for image classification, can only be used for inputs of a fixed, pre-defined size. However, many types of data are naturally variable in length, for example, acoustic signals. We propose three approaches to extend MLP-based architectures for use with sequences…

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: 8 pages, 4 figures

  35. arXiv:2202.05966  [pdf, ps, other]

    quant-ph math-ph math.CO math.NT math.PR

    Mahler/Zeta Correspondence

    Authors: Takashi Komatsu, Norio Konno, Iwao Sato, Shunya Tamura

    Abstract: The Mahler measure was introduced by Mahler in the study of number theory. It is known that the Mahler measure appears in different areas of mathematics and physics. On the other hand, we have been investigating a new class of zeta functions for various kinds of walks including quantum walks in a series of our previous work on "Zeta Correspondence". The quantum walk is a quantum counterpart of the…

    Submitted 11 February, 2022; originally announced February 2022.

    Comments: 27 pages. arXiv admin note: text overlap with arXiv:2109.07664, arXiv:2104.10287

  36. arXiv:2201.03973  [pdf, ps, other]

    math.CO

    A Generalized Grover/Zeta Correspondence

    Authors: Takashi Komatsu, Norio Konno, Iwao Sato, Shunya Tamura

    Abstract: We introduce a generalized Grover matrix of a graph and present an explicit formula for its characteristic polynomial. As a corollary, we give the spectra for the generalized Grover matrix of a regular graph. Next, we define a zeta function and a generalized zeta function of a graph $G$ with respect to its generalized Grover matrix as an analog of the Ihara zeta function and present explicit formu…

    Submitted 9 January, 2022; originally announced January 2022.

    Comments: 10 pages. arXiv admin note: text overlap with arXiv:2011.14162

    MSC Class: 60F05; 05C50; 15A15; 05C25

  37. arXiv:2112.15310  [pdf, ps, other]

    math.NT math.CO

    Cameron's operator in terms of determinants, and hypergeometric numbers

    Authors: Narakorn Rompurk Kanasri, Takao Komatsu, Vichian Laohakosol

    Abstract: By studying Cameron's operator in terms of determinants, two kinds of "integer" sequences of incomplete numbers were introduced. One was the sequence of restricted numbers, including $s$-step Fibonacci sequences. Another was the sequence of associated numbers, including Lamé sequences of higher order. By the classical Trudi's formula and the inverse relation, more expressions were able to be obtai…

    Submitted 31 December, 2021; originally announced December 2021.

    Journal ref: Boletín de la Sociedad Matemática Mexicana, Third Series 28 (2022), issue 1, Article 9, 23 pp

  38. arXiv:2111.11021  [pdf, ps, other]

    math.NT math.CO

    On the determination of $p$-Frobenius and related numbers using the $p$-Apéry set

    Authors: Takao Komatsu

    Abstract: In this paper, we give convenient formulas in order to obtain explicit expressions of a generalized Frobenius number called the $p$-Frobenius number as well as its related values. Here, for a non-negative integer $p$, the $p$-Frobenius number is the largest integer whose number of solutions of the linear diophantine equation in terms of positive integers $a_1,a_2,\dots,a_k$ with…

    Submitted 12 January, 2024; v1 submitted 22 November, 2021; originally announced November 2021.

    Comments: Rev. R. Acad. Cienc. Exactas Fis. Nat., Ser. A Mat., RACSAM (The title is changed from the original manuscript)

    MSC Class: 11D07; 05A15; 05A17; 05A19; 11B68; 11D04; 11P81; 20M14

  39. arXiv:2110.05249  [pdf, other]

    eess.AS cs.CL cs.SD

    A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation

    Authors: Yosuke Higuchi, Nanxin Chen, Yuya Fujita, Hirofumi Inaguma, Tatsuya Komatsu, Jaesong Lee, Jumon Nozaki, Tianzi Wang, Shinji Watanabe

    Abstract: Non-autoregressive (NAR) models simultaneously generate multiple outputs in a sequence, which significantly reduces the inference speed at the cost of accuracy drop compared to autoregressive baselines. Showing great potential for real-time applications, an increasing number of NAR models have been explored in different fields to mitigate the performance gap against AR models. In this work, we con…

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: Accepted to ASRU2021

  40. arXiv:2107.03590  [pdf, ps, other]

    quant-ph math-ph math.CO

    CTM/Zeta Correspondence

    Authors: Takashi Komatsu, Norio Konno, Iwao Sato

    Abstract: In our previous work, we investigated the relation between zeta functions and discrete-time models including random and quantum walks. In this paper, we introduce a zeta function for the continuous-time model (CTM) and consider CTMs including the corresponding random and quantum walks on the d-dimensional torus.

    Submitted 15 March, 2022; v1 submitted 8 July, 2021; originally announced July 2021.

    Comments: 9 pages, minor corrections, Quantum Studies: Mathematics and Foundations, Volume 9, pp.165-173 (2022)

  41. arXiv:2107.03300  [pdf, ps, other]

    math.CO

    Vertex-Face/Zeta correspondence

    Authors: Takashi Komatsu, Norio Konno, Iwao Sato

    Abstract: We present the characteristic polynomial for the transition matrix of a vertex-face walk on a graph, and obtain its spectra. Furthermore, we express the characteristic polynomial for the transition matrix of a vertex-face walk on the 2-dimensional torus by using its adjacency matrix, and obtain its spectra. As an application, we define a new walk-type zeta function with respect to the transition m…

    Submitted 4 July, 2021; originally announced July 2021.

    Comments: 14 pages. arXiv admin note: text overlap with arXiv:2103.12971, arXiv:2011.14162

    MSC Class: 60F05; 05C10; 05C50; 15A15

  42. Possibility of multi-step electroweak phase transition in the two Higgs doublet models

    Authors: Mayumi Aoki, Takatoshi Komatsu, Hiroto Shibuya

    Abstract: We discuss whether a multi-step electroweak phase transition (EWPT) occurs in two Higgs doublet models (2HDMs). The EWPT is related to interesting phenomena such as baryogenesis and a gravitational wave from it. We examine parameter regions in CP-conserving 2HDMs and find certain areas where the multi-step EWPTs occur. The parameter search shows the multi-step EWPT prefers the scalar potential wit…

    Submitted 15 June, 2022; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: 34 pages, 20 figures, 5 tables; Published version

    Report number: KANAZAWA-21-08

    Journal ref: Prog Theor Exp Phys (2022)

  43. arXiv:2105.14270  [pdf, ps, other

    math-ph

    A discontinuity of the energy of quantum walk in impurities

    Authors: Kenta Higuchi, Takashi Komatsu, Norio Konno, Hisashi Morioka, Etsuo Segawa

    Abstract: We consider the discrete-time quantum walk whose local dynamics is denoted by $C$ at the perturbed region $\{0,1,\dots,M-1\}$ and free at the other positions. We obtain the stationary state with a bounded initial state. The initial state is set so that the perturbed region receives the inflow $ω^n$ at time $n$ $(|ω|=1)$. From this expression, we compute the scattering on the surface of $-1$ and…

    Submitted 29 May, 2021; originally announced May 2021.

    Comments: 16 pages. arXiv admin note: text overlap with arXiv:1912.11555

  44. arXiv:2105.08277  [pdf, ps, other

    math.CO math.NT

    Asymmetric Circular Graph with Hosoya Index and Negative Continued Fractions

    Authors: Takao Komatsu

    Abstract: It has been known that the Hosoya index of a caterpillar graph can be calculated as the numerator of the simple continued fraction. Recently, the author \cite{Komatsu2020} introduced a more general graph called the caterpillar-bond graph and showed that its Hosoya index can be calculated as the numerator of the general continued fraction. In this paper, we show how the Hosoya index of the graph with no…

    Submitted 21 November, 2021; v1 submitted 18 May, 2021; originally announced May 2021.

    Journal ref: Carpathian Math. Publ. 10 (2021), No.3, 608--618

  45. arXiv:2105.08274  [pdf, ps, other

    math.NT math.CO

    Weighted Sylvester sums on the Frobenius set

    Authors: Takao Komatsu, Yuan Zhang

    Abstract: Let $a$ and $b$ be relatively prime positive integers. In this paper the weighted sum $\sum_{n\in{\rm NR}(a,b)}λ^{n-1}n^m$ is given explicitly or in terms of the Apostol-Bernoulli numbers, where $m$ is a nonnegative integer, and ${\rm NR}(a,b)$ denotes the set of positive integers nonrepresentable in terms of $a$ and $b$.

    Submitted 18 May, 2021; originally announced May 2021.

    Comments: Irish Math. Soc. Bull. (to appear)
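
    Illustrative note on entry 45: the weighted sum in the abstract above can be checked numerically for small $a$ and $b$. The following is a minimal brute-force sketch, not the closed form from the paper; the function names `nonrepresentable` and `weighted_sylvester_sum` are hypothetical and chosen only for this illustration. It assumes that "nonrepresentable" means not expressible as $ax+by$ with nonnegative integers $x, y$, and it uses the classical fact that the largest such integer is $ab-a-b$.

```python
from fractions import Fraction
from math import gcd


def nonrepresentable(a, b):
    """Return the sorted list NR(a, b) of positive integers that cannot be
    written as a*x + b*y with nonnegative integers x, y (a, b coprime)."""
    if a < 1 or b < 1 or gcd(a, b) != 1:
        raise ValueError("a and b must be relatively prime positive integers")
    # Classical result: every integer above a*b - a - b is representable.
    limit = a * b - a - b
    reachable = {
        a * x + b * y
        for x in range(limit // a + 1)
        for y in range((limit - a * x) // b + 1)
    }
    return [n for n in range(1, limit + 1) if n not in reachable]


def weighted_sylvester_sum(a, b, lam, m):
    """Brute-force value of sum_{n in NR(a, b)} lam**(n-1) * n**m."""
    return sum(lam ** (n - 1) * n ** m for n in nonrepresentable(a, b))


if __name__ == "__main__":
    print(nonrepresentable(3, 5))
    # lam = 1, m = 0 counts the nonrepresentable integers.
    print(weighted_sylvester_sum(3, 5, 1, 0))
    # Exact rational arithmetic for a non-trivial weight.
    print(weighted_sylvester_sum(3, 5, Fraction(1, 2), 0))
```

    With `lam = 1` and `m = 0` the sum reduces to the count of nonrepresentable integers, and with `lam = 1` and `m = 1` it reduces to their plain sum, which gives two easy sanity checks against the classical Sylvester formulas.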

  46. arXiv:2105.04056  [pdf, ps, other

    quant-ph math-ph math.CO math.PR

    IPS/Zeta Correspondence

    Authors: Takashi Komatsu, Norio Konno, Iwao Sato

    Abstract: Our previous works presented zeta functions by the Konno-Sato theorem or the Fourier analysis for one-particle models including random walks, correlated random walks, quantum walks, and open quantum random walks. This paper introduces a new zeta function for multi-particle models with probabilistic or quantum interactions, called the interacting particle system (IPS). We compute the zeta function…

    Submitted 6 February, 2022; v1 submitted 9 May, 2021; originally announced May 2021.

    Comments: 19 pages

    Journal ref: Quantum Information and Computation, Vol.22, No.3 & 4, pp.251-269 (2022)

  47. arXiv:2105.02678  [pdf, ps, other

    math.CO

    The trace formula with respect to the twisted Grover matrix of a mixed digraph

    Authors: Takashi Komatsu, Sho Kubota, Norio Konno, Iwao Sato

    Abstract: We define a zeta function with respect to the twisted Grover matrix of a mixed digraph, and present an exponential expression and a determinant expression of this zeta function. As an application, we give a trace formula with respect to the twisted Grover matrix of a mixed digraph.

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: 17 pages

    MSC Class: 05C50; 11M06

  48. arXiv:2105.02677  [pdf, ps, other

    math.CO

    The scattering matrix with respect to an Hermitian matrix of a graph

    Authors: Takashi Komatsu, Norio Konno, Iwao Sato

    Abstract: Recently, Gnutzmann and Smilansky presented a formula for the bond scattering matrix of a graph with respect to a Hermitian matrix. We present another proof of Gnutzmann and Smilansky's formula by a technique used in the zeta function of a graph. Furthermore, we generalize Gnutzmann and Smilansky's formula to a regular covering of a graph. Finally, we define an $L$-function of a graph, and pr…

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: 21 pages. arXiv admin note: substantial text overlap with arXiv:1211.4719

    MSC Class: 05C50; 15A15

  49. arXiv:2104.10328  [pdf, ps, other

    eess.AS cs.LG cs.SD

    Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers

    Authors: Yusuke Kida, Tatsuya Komatsu, Masahito Togami

    Abstract: This paper proposes a novel label-synchronous speech-to-text alignment technique for automatic speech recognition (ASR). The speech-to-text alignment is a problem of splitting long audio recordings with un-aligned transcripts into utterance-wise pairs of speech and text. Unlike conventional methods based on frame-synchronous prediction, the proposed method re-defines the speech-to-text alignment a…

    Submitted 20 April, 2021; originally announced April 2021.

    Comments: Submitted to INTERSPEECH 2021

  50. arXiv:2104.10287  [pdf, ps, other

    quant-ph math-ph math.CO math.PR

    Walk/Zeta Correspondence

    Authors: Takashi Komatsu, Norio Konno, Iwao Sato

    Abstract: Our previous work presented explicit formulas for the generalized zeta function and the generalized Ihara zeta function corresponding to the Grover walk and the positive-support version of the Grover walk on the regular graph via the Konno-Sato theorem, respectively. This paper extends these walks to a class of walks including random walks, correlated random walks, quantum walks, and open quantum…

    Submitted 20 December, 2022; v1 submitted 20 April, 2021; originally announced April 2021.

    Comments: 31 pages

    Journal ref: Journal of Statistical Physics, volume 190, Article number: 36 (2023)