Marine Mammal Species Classification Using Convolutional Neural Networks and a Novel Acoustic Representation

Thomas, Mark; Martin, Bruce; Kowarski, Katie; Gaudet, Briand; Matwin, Stan

doi:10.1007/978-3-030-46133-1_18

Mark Thomas¹⁴,
Bruce Martin¹⁵,
Katie Kowarski¹⁵,
Briand Gaudet¹⁵ &
…
Stan Matwin^14,16

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11908))

Included in the following conference series:

Joint European Conference on Machine Learning and Knowledge Discovery in Databases

2378 Accesses

Abstract

Research into automated systems for detecting and classifying marine mammals in acoustic recordings is expanding internationally due to the necessity to analyze large collections of data for conservation purposes. In this work, we present a Convolutional Neural Network that is capable of classifying the vocalizations of three species of whales, non-biological sources of noise, and a fifth class pertaining to ambient noise. In this way, the classifier is capable of detecting the presence and absence of whale vocalizations in an acoustic recording. Through transfer learning, we show that the classifier is capable of learning high-level representations and can generalize to additional species. We also propose a novel representation of acoustic signals that builds upon the commonly used spectrogram representation by way of interpolating and stacking multiple spectrograms produced using different Short-time Fourier Transform (STFT) parameters. The proposed representation is particularly effective for the task of marine mammal species classification where the acoustic events we are attempting to classify are sensitive to the parameters of the STFT.

Stan Matwin’s research is supported by the Natural Sciences and Engineering Research Council and by the Canada Research Chairs program.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Towards a Novel Data Representation for Classifying Acoustic Signals

Deep neural networks for automated detection of marine mammal species

Article Open access 17 January 2020

Using Neural Networks to Identify Bird Species from Birdsong Samples

References

Protecting north Atlantic right whales from collisions with ships in the Gulf of St. Lawrence. http://bit.ly/tc_whales
Abdel-Hamid, O., Mohamed, A.R., Jiang, H., Deng, L., Penn, G., Yu, D.: Convolutional neural networks for speech recognition. IEEE/ACM Trans. Audio Speech Lang. Process. 22(10), 1533–1545 (2014)
Article Google Scholar
Baumgartner, M.F., Mussoline, S.E.: A generalized baleen whale call detection and classification system. J. Acoust. Soc. Am. 129(5), 2889–2902 (2011)
Article Google Scholar
Choi, K., Fazekas, G., Sandler, M., Cho, K.: Convolutional recurrent neural networks for music classification. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2392–2396. IEEE (2017)
Google Scholar
Clark, C.W., Marler, P., Beeman, K.: Quantitative analysis of animal vocal phonology: an application to swamp sparrow song. Ethology 76(2), 101–115 (1987)
Article Google Scholar
Cooley, J.W., Tukey, J.W.: An algorithm for the machine calculation of complex Fourier series. Math. Comput. 19(90), 297–301 (1965)
Article MathSciNet Google Scholar
Deng, L., et al.: Recent advances in deep learning for speech research at Microsoft. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), vol. 26, p. 64. IEEE (2013)
Google Scholar
Dugan, P.J., Rice, A.N., Urazghildiiev, I.R., Clark, C.W.: North Atlantic right whale acoustic signal processing: Part i. comparison of machine learning recognition algorithms. In: IEEE Long Island Systems, Applications and Technology Conference, pp. 1–6. IEEE (2010)
Google Scholar
Gillespie, D., Caillat, M., Gordon, J., White, P.: Automatic detection and classification of odontocete whistles. J. Acous. Soc. Am. 134(3), 2427–2437 (2013)
Article Google Scholar
Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1440–1448 (2015)
Google Scholar
Halkias, X.C., Paris, S., Glotin, H.: Classification of mysticete sounds using machine learning techniques. J. Acous. Soc. Am. 134(5), 3496–3505 (2013)
Article Google Scholar
He, K., Gkioxari, G., Dollár, P., Girshick, R.: Mask R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2961–2969 (2017)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Google Scholar
Humphrey, E.J., Bello, J.P.: Rethinking automatic chord recognition with convolutional neural networks. In: 11th International Conference on Machine Learning and Applications (ICMLA), vol. 2, pp. 357–362. IEEE (2012)
Google Scholar
Karpathy, A., Fei-Fei, L.: Deep visual-semantic alignments for generating image descriptions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3128–3137 (2015)
Google Scholar
Liu, S., Liu, M., Wang, M., Ma, T., Qing, X.: Classification of cetacean whistles based on convolutional neural network. In: 10th International Conference on Wireless Communications and Signal Processing (WCSP), pp. 1–5. IEEE (2018)
Google Scholar
Luo, W., Yang, W., Zhang, Y.: Convolutional neural network for detecting odontocete echolocation clicks. J. Acous. Soc. Am. 145(1), EL7–EL12 (2019)
Article Google Scholar
van der Maaten, L., Hinton, G.: Visualizing data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008)
MATH Google Scholar
Mellinger, D.K., Martin, S.W., Morrissey, R.P., Thomas, L., Yosco, J.J.: A method for detecting whistles, moans, and other frequency contour sounds. J. Acous. Soc. Am. 129(6), 4055–4061 (2011)
Article Google Scholar
Paszke, A., et al.: Automatic differentiation in PyTorch. In: NIPS-W (2017)
Google Scholar
Piczak, K.J.: Environmental sound classification with convolutional neural networks. In: IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP), pp. 1–6. IEEE (2015)
Google Scholar
Roch, M.A., et al.: Classification of echolocation clicks from odontocetes in the southern California bight. J. Acous. Soc. Am. 129(1), 467–475 (2011)
Article Google Scholar
Salamon, J., Bello, J.P.: Deep convolutional neural networks and data augmentation for environmental sound classification. IEEE Signal Process. Lett. 24(3), 279–283 (2016)
Article Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
Skowronski, M.D., Harris, J.G.: Acoustic detection and classification of microchiroptera using machine learning: lessons learned from automatic speech recognition. J. Acous. Soc. Am. 119(3), 1817–1833 (2006)
Article Google Scholar
van Den Oord, A., et al.: Wavenet: a generative model for raw audio. SSW 125 (2016)
Google Scholar
Wang, D., Zhang, L., Lu, Z., Xu, K.: Large-scale whale call classification using deep convolutional neural network architectures. In: IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC), pp. 1–5. IEEE (2018)
Google Scholar
Zimmer, W.M.: Passive Acoustic Monitoring of Cetaceans. Cambridge University Press, New York (2011)
Book Google Scholar

Download references

Acknowledgements

Collaboration between researchers at JASCO Applied Sciences and Dalhousie University was made possible through a Natural Sciences and Engineering Research Council Engage Grant. The acoustic recordings described in this paper were collected by JASCO Applied Sciences under a contribution agreement with the Environmental Studies Research Fund.

Author information

Authors and Affiliations

Faculty of Computer Science, Dalhousie University, Halifax, Canada
Mark Thomas & Stan Matwin
JASCO Applied Sciences, Dartmouth, Canada
Bruce Martin, Katie Kowarski & Briand Gaudet
Institute of Computer Science Polish Academy of Sciences, Warsaw, Poland
Stan Matwin

Authors

Mark Thomas
View author publications
You can also search for this author in PubMed Google Scholar
Bruce Martin
View author publications
You can also search for this author in PubMed Google Scholar
Katie Kowarski
View author publications
You can also search for this author in PubMed Google Scholar
Briand Gaudet
View author publications
You can also search for this author in PubMed Google Scholar
Stan Matwin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mark Thomas .

Editor information

Editors and Affiliations

Leuphana University, Lüneburg, Germany
Ulf Brefeld
IRISA/Inria, Rennes, France
Elisa Fromont
University of Würzburg, Würzburg, Germany
Andreas Hotho
Leiden University, Leiden, The Netherlands
Arno Knobbe
ETH Zurich, Zurich, Switzerland
Marloes Maathuis
Institut National des Sciences Appliquées, Villeurbanne, France
Céline Robardet

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Thomas, M., Martin, B., Kowarski, K., Gaudet, B., Matwin, S. (2020). Marine Mammal Species Classification Using Convolutional Neural Networks and a Novel Acoustic Representation. In: Brefeld, U., Fromont, E., Hotho, A., Knobbe, A., Maathuis, M., Robardet, C. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2019. Lecture Notes in Computer Science(), vol 11908. Springer, Cham. https://doi.org/10.1007/978-3-030-46133-1_18

Download citation

DOI: https://doi.org/10.1007/978-3-030-46133-1_18
Published: 30 April 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-46132-4
Online ISBN: 978-3-030-46133-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

the ECML PKDD community (opens in a new tab)

Marine Mammal Species Classification Using Convolutional Neural Networks and a Novel Acoustic Representation

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Towards a Novel Data Representation for Classifying Acoustic Signals

Deep neural networks for automated detection of marine mammal species

Using Neural Networks to Identify Bird Species from Birdsong Samples

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Subscribe and save

Buy Now

Navigation

Marine Mammal Species Classification Using Convolutional Neural Networks and a Novel Acoustic Representation

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Towards a Novel Data Representation for Classifying Acoustic Signals

Deep neural networks for automated detection of marine mammal species

Using Neural Networks to Identify Bird Species from Birdsong Samples

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation