Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Multi-level Annotation in SpeeCon Polish Speech Database

  • Conference paper
Intelligent Media Technology for Communicative Intelligence (IMTCI 2004)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3490))

Included in the following conference series:

Abstract

SpeeCon Polish Speech Database was collected within the framework of the SpeeCon project partially sponsored by the EC (IST-1999-10003). The database contains two sets of data, which comprise 550 adults’ recording sessions and 50 sessions from children, respectively. The adult speakers were recorded in various environments: offices, living rooms, cars and public places. Recordings contain free spontaneous speech passages, elicited spontaneous speech, phonetically compact words and sentences, general-purpose words and phrases, specific application words and utterances. One of the most important problems in the construction of the database is to define bases for multi-level transcription composed of several tiers. They could be grouped into three classes – linguistic, symbolic and physical representation. The orthographic transcription is applied to the sentence, phrase and word tiers, symbolic transcription related to grammar and articulation – to part of speech, phoneme and syllabic tiers and mnemonics – to the description of some characteristic of the measurable physical data. The paper presents the rules applied to text, speech and noise transcriptions and remarks on pronunciation varieties found in the database. The final part of the paper discusses the problem of the lexicon creation, which is an alphabetically ordered list of distinct lexical items occurring in the recorded corpus. The Polish lexicon has been built up by various methods, including hand-annotation and generation by rule with subsequent manual check.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Gubrynowicz, R.: The Polish Database of Spoken Language. In: Proc. First Int. Conference on Language Resources and Evaluation, Granada, May 28–30, pp. 1031–1037 (1998)

    Google Scholar 

  2. Grocholewski, S.: First Polish Database. In: Proc. First Int. Conference on Language Resources and Evaluation, Granada, May 28–30, pp. 1059–1062 (1998)

    Google Scholar 

  3. Lamel, L.F., Kassel, R.H., Seneff, S.: Speech database development: Design and analysis of the acoustic-phonetic corpus. In: Proc. DARPA Speech Recognition Workshop, pp. 100–109 (1986)

    Google Scholar 

  4. Damhuis, M., Boogaart, T., Veld, C., Versteijlen, M., Schelvis, W., Bos, L., Boves, L.: Creation and analysis of the Dutch POLYPHONE corpus. In: Proc. Int. Congress on Speech and Language Processing, Yokohama, pp. 1803–1806 (1994)

    Google Scholar 

  5. Höge, H., Draxler, C., van den Heuvel, H., Johansen, F., Sanders, E., Tropf, H.: SpeechDat Multilingual Speech Databases for Teleservices: Across the Finish Line. In: Proceedings of Eurospeech 1999, Budapest, vol. 6, pp. 2699–2702 (1999)

    Google Scholar 

  6. http://www.speecon.com/

  7. Biedrzycki, L.: Phonology of English and Polish resonants (in Polish), PWN, Warszawa (1978)

    Google Scholar 

  8. http://www.phon.ucl.ac.uk/home/sampa/polish.htm

  9. http://www.phon.ucl.ac.uk/home/sampa/samprosa.htm

  10. http://www.speech.kth.se/wavesurfer/

  11. http://www.speecon.com/public_docs/D21.zip

  12. Marasek, K.: Large Vocabulary Continuous Speech Recognition System for Polish. Archives of Acoustics 28(4), 293–303

    Google Scholar 

  13. http://www.praat.org

  14. http://htk.ca.ed.uk

  15. Brill, E.: A Corpus-Based Approach to Language Learning. PhD Dissertation, University of Pennsylvania (1996)

    Google Scholar 

  16. Przepiórkowski, A.: The IPI Corpus, http://dach.ipipan.waw.pl/~adamp/Papers/2004-corpus/book_en.pdf

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Marasek, K., Gubrynowicz, R. (2005). Multi-level Annotation in SpeeCon Polish Speech Database. In: Bolc, L., Michalewicz, Z., Nishida, T. (eds) Intelligent Media Technology for Communicative Intelligence. IMTCI 2004. Lecture Notes in Computer Science(), vol 3490. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11558637_7

Download citation

  • DOI: https://doi.org/10.1007/11558637_7

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-29035-3

  • Online ISBN: 978-3-540-31738-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics