State-space Models with Layer-wise Nonlinearity are Universal Approximators with Exponential Decaying Memory
Shida Wang and Beichen Xue. Advances in Neural Information Processing Systems (NeurIPS 2023).

This paper provides approximation theory for sequence-to-sequence models based on state-space layers, which have achieved state-of-the-art performance on a range of sequence modelling tasks. The findings demonstrate that the addition of layer-wise nonlinear activation enhances the model's capacity to learn complex sequence patterns, while the state-space models themselves are shown to have an exponentially decaying memory. (Figure 1 of the paper depicts the network structure of a two-layer state-space model.)
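As a concrete illustration of this architecture, here is a minimal NumPy sketch of a stacked state-space model with layer-wise nonlinearity: each layer is a purely linear recurrence along time, and the only nonlinear activation sits between layers. The function names, the diagonal transition parameterization, the tanh activation, and all dimensions are illustrative assumptions, not the paper's implementation.

import numpy as np

def ssm_layer(u, lam, U, C):
    """One linear state-space layer: x_{t+1} = lam * x_t + U @ u_t, y_t = C @ x_t.

    lam is a diagonal (elementwise) transition with |lam| < 1, so the
    layer's memory of past inputs decays exponentially.
    """
    T = u.shape[0]
    x = np.zeros(lam.shape[0])
    ys = np.empty((T, C.shape[0]))
    for t in range(T):
        x = lam * x + U @ u[t]   # linear recurrence along the temporal direction
        ys[t] = C @ x            # linear readout
    return ys

def stacked_ssm(u, layers):
    """Stack of linear SSM layers with a tanh applied between layers.

    No nonlinearity acts along time inside a layer; the activation is
    layer-wise only, matching the setting studied in the paper.
    """
    h = u
    for lam, U, C in layers:
        h = np.tanh(ssm_layer(h, lam, U, C))  # layer-wise nonlinear activation
    return h

# Hypothetical two-layer example (cf. the two-layer structure in Figure 1).
rng = np.random.default_rng(0)
d_in, d_state, d_hid, T = 3, 8, 4, 50
layers = [
    (rng.uniform(0.5, 0.99, d_state), rng.normal(size=(d_state, d_in)), rng.normal(size=(d_hid, d_state))),
    (rng.uniform(0.5, 0.99, d_state), rng.normal(size=(d_state, d_hid)), rng.normal(size=(d_hid, d_state))),
]
u = rng.normal(size=(T, d_in))
y = stacked_ssm(u, layers)
print(y.shape)  # (50, 4)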
The main theorem proves that stacking state-space models with layer-wise nonlinear activation is sufficient to approximate any continuous sequence-to-sequence relationship: layer-wise nonlinearity alone is enough to achieve universality once the state-space model is multi-layer. It is also shown that the stacked models nonetheless retain an exponentially decaying memory.
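To see where the exponential decay comes from, one can unroll a single stable linear state-space recurrence; in generic notation (not necessarily the paper's exact definitions), an input from k steps in the past enters the state only through the k-th power of the transition:

    x_{t} = \sum_{k=0}^{t-1} \Lambda^{k} U \, u_{t-1-k},
    \qquad
    \|\Lambda^{k} U \, u_{t-1-k}\| \le \rho^{k} \, \|U\| \, \sup_{s} \|u_{s}\|,
    \quad \rho := \max_{i} |\lambda_{i}| < 1,

for a diagonal transition \Lambda = \mathrm{diag}(\lambda_1, \dots, \lambda_m) and zero initial state. Since the bound shrinks geometrically in k, the influence of old inputs fades at rate \rho^{k}, and stacking such layers with pointwise nonlinearities does not, by itself, remove this decay.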