Abstract
Many authors consider typography as what language looks like. Over time, designers explored connections between type design and sound, trying to bridge the gap between the two areas. This paper describes SpeechTyper, an ongoing system that generates typographic compositions based on speech. Our goal is to create typographic representations that convey aspects of oral communication expressively. The system takes a pre-processed analysis of speech recordings and uses it to affect the glyph design of the recited words. The glyphs’ structure is generated using a system we developed previously that extracts skeletons from existing typefaces.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Change history
15 April 2022
In an older version of this paper, there was an error in reference no. 18: the names of the cited paper were incorrectly published. This has been corrected.
References
Baker, J.: Colloquy Type (2012). https://etapes.com/colloquy-type-un-caractere-generatif/. Accessed 9 Oct 2021
Bargues, C.: Dada optophonetic (2016). http://www.diptyqueparis-memento.com/en/dada-optophonetic/. Accessed 18 Nov 2020
Cephei, A.: Vosk (2019). https://alphacephei.com/vosk/. Accessed 10 Aug 2021
Cheng, K.: Designing Type, vol. 10. Yale University Press, New Haven (2005)
Cipriani, A., Giri, M.: Musica elettronica e sound design: teoria epratica con max e msp, vol. 2. ConTempoNet (2013)
Fuller, R.: More consistent and systematic than any form of writing I know. Kurt Schwitters’s Systemschrift. Sch. J. Kurt Schwitters Soc. 5 (2014)
Golan et al.: Ursonography (2005). http://m.flong.com/archive/projects/ursonography/index.html. Accessed 5 Feb 2022
Gómez, R., et al.: Speech training for deaf and hearing-impaired people. In: Sixth European Conference on Speech Communication and Technology, EUROSPEECH 1999 (1999)
Krcadinac, U., Pasquier, P., Jovanovic, J., Devedzic, V.: Synesketch: an open source library for sentence-based emotion recognition. IEEE Trans. Affect. Comput. 4(3), 312–325 (2013)
Lupton, E.: Thinking with Type: A Critical Guide for Designers, Writers, Editors, & Students. Princeton Architectural Press, New York (2014)
Maçãs, C., Palma, D., Rebelo, A.: TypEm: a generative typeface that represents the emotion of the text. In: Proceedings of the 9th International Conference on Digital and Interactive Arts, pp. 1–10 (2019)
Mainz, G.M.: Gestalten mit Code (n.d.). http://generative-typografie.de/generativetypografie. Accessed 20 Nov 2020
Massin, R.: La lettre et l’image. Commun. et langages 6, 42–53 (1970)
McDonnell, M.: Visual music. In: Visual Music Marathon, Boston Cyberarts Festival Programme (2007)
McFee, B., et al.: Audio and music signal analysis in python. In: Proceedings of the 14th Python in Science Conference, pp. 18–25 (2015). https://librosa.org/doc/latest/index.html. Accessed 10 Aug 2021
Design, M., Müller, F., Meek, F.M.: Sculpt sound and glyphs simultaneously (2007). https://robmeek.com/project/meek-fm/. Accessed 10 Nov 2020
Parente, J., Martins, T., Bicker, J.: Generative type design: an approach focused on skeletons extraction and their anatomical deconstruction. In: Book of Proceedings of Typography Meeting (2018)
Parente, J., Martins, T., Bicker, J., Machado, P.: Which type is your type? In: Eleventh International Conference on Computational Creativity (2020)
Riechers, A.: What Does Your City Sound Like as a Font? (2018). https://eyeondesign.aiga.org/what-does-your-city-sound-like-as-a-font/. Accessed 28 Nov 2020
Silanteva, D.: Typographic Music (2011). http://www.ddina.com/index.php?/2011/typographic-music/2/. Accessed 1 Nov 2020
Sutela, J.: Experiments with Google nimiia cétiï (2018). https://experiments.withgoogle.com/nimiia-cetii. Accessed 8 Dec 2020
Typeroom. Ran Zheng Wants Us to Feel, Look and Hear Typography in Miraculous Ways (2017). https://www.typeroom.eu/article/ran-zheng-wants-us-feel-look-and-hear-typography-miraculous-ways. Accessed 7 Oct 2020
Wölfel, M., Schlippe, T., Stitz, A.: Voice driven type design. In: 2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD), pp. 1–9. IEEE (2015)
Zheng, R.: Look-Hear. Ph.D. thesis, Maryland Institute College of Art, Graphic Design (MFA) (2016)
Acknowledgements
This work is partially funded by national funds through the FCT - Foundation for Science and Technology, I.P., within the scope of the project CISUC - UID/CEC/00326/2020 and by European Social Fund, through the Regional Operational Program Centro 2020, and under the grant SFRH/BD/148706/2019.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Parente, J., Martins, T., Bicker, J., Machado, P. (2022). SpeechTyper: From Speech to Typographic Composition. In: Martins, T., Rodríguez-Fernández, N., Rebelo, S.M. (eds) Artificial Intelligence in Music, Sound, Art and Design. EvoMUSART 2022. Lecture Notes in Computer Science, vol 13221. Springer, Cham. https://doi.org/10.1007/978-3-031-03789-4_14
Download citation
DOI: https://doi.org/10.1007/978-3-031-03789-4_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-03788-7
Online ISBN: 978-3-031-03789-4
eBook Packages: Computer ScienceComputer Science (R0)