SpeechTyper: From Speech to Typographic Composition

Parente, Jéssica; Martins, Tiago; Bicker, João; Machado, Penousal

doi:10.1007/978-3-031-03789-4_14

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 13221)

Included in the following conference series:

International Conference on Computational Intelligence in Music, Sound, Art and Design (Part of EvoStar)

The original version of this chapter was revised: a reference was corrected to include a previously omitted author’s name. The correction to this chapter is available at https://doi.org/10.1007/978-3-031-03789-4_27

Abstract

Many authors regard typography as what language looks like. Over time, designers have explored connections between type design and sound, trying to bridge the gap between the two areas. This paper describes SpeechTyper, a system under ongoing development that generates typographic compositions from speech. Our goal is to create typographic representations that expressively convey aspects of oral communication. The system takes a pre-processed analysis of speech recordings and uses it to shape the glyph design of the recited words. The glyphs’ structure is generated using a system we developed previously that extracts skeletons from existing typefaces.
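
As a rough illustration of letting a speech analysis affect glyph design, the sketch below maps one hypothetical per-word feature (normalised loudness) to one hypothetical glyph parameter (stroke weight). The feature, the parameter, the value ranges, and the names `WordAnalysis`, `stroke_weight` and `compose` are all assumptions made for this example; the abstract does not state which speech features or glyph properties SpeechTyper actually uses.

```python
from dataclasses import dataclass


@dataclass
class WordAnalysis:
    """Hypothetical pre-processed analysis of one recited word."""
    text: str
    loudness: float  # assumed to be normalised to the range [0, 1]


def stroke_weight(loudness: float,
                  min_weight: float = 20.0,
                  max_weight: float = 120.0) -> float:
    """Linearly map a normalised loudness value to a stroke weight.

    Values outside [0, 1] are clamped so the weight stays in range.
    The weight range is an arbitrary choice for illustration.
    """
    clamped = max(0.0, min(1.0, loudness))
    return min_weight + clamped * (max_weight - min_weight)


def compose(words: list[WordAnalysis]) -> list[tuple[str, float]]:
    """Pair each word with the stroke weight derived from its loudness."""
    return [(w.text, stroke_weight(w.loudness)) for w in words]


if __name__ == "__main__":
    analysis = [WordAnalysis("hello", 0.25), WordAnalysis("WORLD", 1.0)]
    for text, weight in compose(analysis):
        print(f"{text}: stroke weight {weight:.1f}")
```

A linear mapping is only the simplest possible choice here; any monotonic curve, or a mapping driven by several features at once, would fit the same structure.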

Change history

15 April 2022
In an earlier version of this paper, reference no. 18 contained an error: the author names of the cited paper were published incorrectly. This has been corrected.

References

1. Baker, J.: Colloquy Type (2012). https://etapes.com/colloquy-type-un-caractere-generatif/. Accessed 9 Oct 2021
2. Bargues, C.: Dada optophonetic (2016). http://www.diptyqueparis-memento.com/en/dada-optophonetic/. Accessed 18 Nov 2020
3. Cephei, A.: Vosk (2019). https://alphacephei.com/vosk/. Accessed 10 Aug 2021
4. Cheng, K.: Designing Type, vol. 10. Yale University Press, New Haven (2005)
5. Cipriani, A., Giri, M.: Musica elettronica e sound design: teoria e pratica con max e msp, vol. 2. ConTempoNet (2013)
6. Fuller, R.: More consistent and systematic than any form of writing I know. Kurt Schwitters’s Systemschrift. Sch. J. Kurt Schwitters Soc. 5 (2014)
7. Golan et al.: Ursonography (2005). http://m.flong.com/archive/projects/ursonography/index.html. Accessed 5 Feb 2022
8. Gómez, R., et al.: Speech training for deaf and hearing-impaired people. In: Sixth European Conference on Speech Communication and Technology, EUROSPEECH 1999 (1999)
9. Krcadinac, U., Pasquier, P., Jovanovic, J., Devedzic, V.: Synesketch: an open source library for sentence-based emotion recognition. IEEE Trans. Affect. Comput. 4(3), 312–325 (2013)
10. Lupton, E.: Thinking with Type: A Critical Guide for Designers, Writers, Editors, & Students. Princeton Architectural Press, New York (2014)
11. Maçãs, C., Palma, D., Rebelo, A.: TypEm: a generative typeface that represents the emotion of the text. In: Proceedings of the 9th International Conference on Digital and Interactive Arts, pp. 1–10 (2019)
12. Mainz, G.M.: Gestalten mit Code (n.d.). http://generative-typografie.de/generativetypografie. Accessed 20 Nov 2020
13. Massin, R.: La lettre et l’image. Commun. et langages 6, 42–53 (1970)
14. McDonnell, M.: Visual music. In: Visual Music Marathon, Boston Cyberarts Festival Programme (2007)
15. McFee, B., et al.: Audio and music signal analysis in python. In: Proceedings of the 14th Python in Science Conference, pp. 18–25 (2015). https://librosa.org/doc/latest/index.html. Accessed 10 Aug 2021
16. Design, M., Müller, F., Meek, F.M.: Sculpt sound and glyphs simultaneously (2007). https://robmeek.com/project/meek-fm/. Accessed 10 Nov 2020
17. Parente, J., Martins, T., Bicker, J.: Generative type design: an approach focused on skeletons extraction and their anatomical deconstruction. In: Book of Proceedings of Typography Meeting (2018)
18. Parente, J., Martins, T., Bicker, J., Machado, P.: Which type is your type? In: Eleventh International Conference on Computational Creativity (2020)
19. Riechers, A.: What Does Your City Sound Like as a Font? (2018). https://eyeondesign.aiga.org/what-does-your-city-sound-like-as-a-font/. Accessed 28 Nov 2020
20. Silanteva, D.: Typographic Music (2011). http://www.ddina.com/index.php?/2011/typographic-music/2/. Accessed 1 Nov 2020
21. Sutela, J.: Experiments with Google nimiia cétiï (2018). https://experiments.withgoogle.com/nimiia-cetii. Accessed 8 Dec 2020
22. Typeroom: Ran Zheng Wants Us to Feel, Look and Hear Typography in Miraculous Ways (2017). https://www.typeroom.eu/article/ran-zheng-wants-us-feel-look-and-hear-typography-miraculous-ways. Accessed 7 Oct 2020
23. Wölfel, M., Schlippe, T., Stitz, A.: Voice driven type design. In: 2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD), pp. 1–9. IEEE (2015)
24. Zheng, R.: Look-Hear. Ph.D. thesis, Maryland Institute College of Art, Graphic Design (MFA) (2016)

Acknowledgements

This work is partially funded by national funds through the FCT - Foundation for Science and Technology, I.P., within the scope of the project CISUC - UID/CEC/00326/2020 and by European Social Fund, through the Regional Operational Program Centro 2020, and under the grant SFRH/BD/148706/2019.

Author information

Authors and Affiliations

University of Coimbra, CISUC, DEI, Coimbra, Portugal
Jéssica Parente, Tiago Martins, João Bicker & Penousal Machado

Corresponding author

Correspondence to Jéssica Parente.

Editor information

Editors and Affiliations

University of Coimbra, Coimbra, Portugal
Tiago Martins
University of A Coruña, A Coruña, Spain
Nereida Rodríguez-Fernández
University of Coimbra, Coimbra, Portugal
Sérgio M. Rebelo

About this paper

Cite this paper

Parente, J., Martins, T., Bicker, J., Machado, P. (2022). SpeechTyper: From Speech to Typographic Composition. In: Martins, T., Rodríguez-Fernández, N., Rebelo, S.M. (eds) Artificial Intelligence in Music, Sound, Art and Design. EvoMUSART 2022. Lecture Notes in Computer Science, vol 13221. Springer, Cham. https://doi.org/10.1007/978-3-031-03789-4_14

DOI: https://doi.org/10.1007/978-3-031-03789-4_14
Published: 15 April 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-03788-7
Online ISBN: 978-3-031-03789-4
eBook Packages: Computer Science, Computer Science (R0)
