Supervised Level-Wise Pretraining for Sequential Data Classification

Conference paper
Neural Information Processing (ICONIP 2020)

Part of the book series: Communications in Computer and Information Science (CCIS, volume 1333)

Abstract

Recurrent Neural Networks (RNNs) can be seriously affected by their initial parameter assignment, which may result in poor generalization performance on new, unseen data. To tackle this crucial issue in the context of RNN-based classification, we propose a new supervised layer-wise pretraining strategy to initialize network parameters. The proposed approach leverages a data-aware strategy that automatically sets up a taxonomy of classification problems derived from the model's behavior. To the best of our knowledge, despite the great interest in RNN-based classification, this is the first data-aware strategy dealing with the initialization of such models. The proposed strategy has been tested on five benchmarks from three different domains, i.e., Text Classification, Speech Recognition and Remote Sensing. Results underline the benefit of our approach and show that data-aware strategies positively support the initialization of RNN-based classification models.
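One common form of the supervised layer-wise pretraining the abstract describes can be sketched as a greedy training schedule: each new recurrent layer is trained under supervision with a temporary classification head while previously trained layers stay frozen, and the full stack is fine-tuned end to end at the last stage. The sketch below is an illustrative assumption, not the authors' implementation; the function and field names (`layerwise_pretraining_schedule`, `trainable`, `frozen`) are hypothetical.

```python
def layerwise_pretraining_schedule(num_layers):
    """Return the training stages of a greedy supervised layer-wise
    pretraining run, each stage listing which layer indices receive
    gradient updates and which only provide frozen features."""
    stages = []
    frozen = []
    for k in range(num_layers):
        # Stage k: layer k (plus a temporary classification head, not
        # modeled here) is trained; earlier layers are kept frozen.
        stages.append({"trainable": [k], "frozen": list(frozen)})
        frozen.append(k)
    # Final stage: unfreeze everything and fine-tune the whole network.
    stages.append({"trainable": list(range(num_layers)), "frozen": []})
    return stages

if __name__ == "__main__":
    for stage in layerwise_pretraining_schedule(3):
        print(stage)
```

In a real implementation each stage would correspond to one supervised training loop, with freezing realized, e.g., by excluding the frozen layers' parameters from the optimizer; the schedule above only captures the control flow of the greedy procedure.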


Notes

  1. http://qwone.com/jason/20Newsgroups/
  2. https://nlp.stanford.edu/projects/glove/
  3. https://sentinel.esa.int/web/sentinel/missions/sentinel-2


Author information


Corresponding author

Correspondence to Dino Ienco.


Copyright information

© 2020 Springer Nature Switzerland AG

About this paper


Cite this paper

Ienco, D., Interdonato, R., Gaetano, R. (2020). Supervised Level-Wise Pretraining for Sequential Data Classification. In: Yang, H., Pasupa, K., Leung, A.C.S., Kwok, J.T., Chan, J.H., King, I. (eds) Neural Information Processing. ICONIP 2020. Communications in Computer and Information Science, vol 1333. Springer, Cham. https://doi.org/10.1007/978-3-030-63823-8_52


  • DOI: https://doi.org/10.1007/978-3-030-63823-8_52

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-63822-1

  • Online ISBN: 978-3-030-63823-8

  • eBook Packages: Computer Science, Computer Science (R0)
