Unicode
Recent papers in Unicode
This paper describes the Linear A / Minoan digital corpus and the approaches we applied to develop it. We aim to set up a suitable study resource for Linear A and Minoan. We start by introducing Linear A and Minoan in order to... more
First draft of a comparison with the basic phonemes from the Linear B and Cypriot syllabaries, as of 2nd September 2011.
This paper is a review on terminology and usage of three horizontal dashes (-, – and —) in Croatian orthographies and orthographic papers. Considerable contradictions and inconsistencies have been spotted in both terminology and practical... more
This proposal will refer to one controversial decision of unification made during this inclusion and argue for disunification and adding one more character to the extension.
The Enciclopedia Informatica covers hundreds of topics simply and clearly, so that it can be consulted by both the novice and the professional. Acronyms, abbreviations, jargon: everything is explained without ever taking for granted that... more
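From the perspective of corpus planning for Chinese characters, this paper discusses the theoretical advantages of the "Harmonized Form" (和諧體) proposed by the Hong Kong Chinese Language Society (香港中國語文學會) and puts forward a future-oriented theory for applied research on Chinese characters.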
‘Avestan’ is the name of the ritual language of Zoroastrianism, which was the state religion of the Iranian empire in Achaemenid, Arsacid and Sasanid times, covering a time span of more than 1200 years. It is named after the ‘Avesta’,... more
Keynote presentation introducing Tamil Unicode, the standardization process, and the stability principle of Unicode, with a brief review of the Tamil Symbol/Fractions proposal as well as the Tamil Virtual Academy experts' recommendation to the... more
In the past decades, typography has transitioned from metal type to digital fonts. How well is digital technology serving the various writing systems around the world? How has the digital infrastructure of typography changed in the past 3... more
Arabic script remains one of the most widely employed writing systems in the world, for Arabic and non-Arabic languages alike. Focusing on naskh—the style most commonly used across the Middle East—Letters of Light traces the evolution of... more
In this paper we discuss the various ASCII-based scripts made for Indian languages and the problems associated with them. We then discuss the solution we suggest to overcome these problems, in the form of "... more
You never know where a story might lead you or what questions it may leave unanswered. Take for example a story we did in January about a campaign to make the dumpling emoji a reality. Jennifer 8. Lee told us that emojis have to be... more
In this study, I extended input methods for the Japanese language to Egyptian hieroglyphics. There are several systems that are capable of inputting Egyptian hieroglyphic writing. However, they do not allow us to directly input hieroglyphs,... more
Until recently, Arabic text representation was the exclusive domain of professional calligraphers and typographers. Today it revolves around elusive computer codes and ugly fonts. Yet, scholars are expected to be able to handle literary... more
The growth of information technology has played a great role in connecting the world. The exchange of information back and forth is now commonplace. Fonts play a major role in this communication process in the digital domain. Common encoding... more
A writing system is a set of visible signs used to represent units of language in a systematic way. Egyptian hieroglyphs were a formal writing system used by the ancient Egyptians that combined logographic and alphabetic elements. In serious... more
International Forum for Information Technology in Tamil (INFITT - http://www.infitt.org) through its Unicode Working Group has reviewed the various Grantha proposals and has commented on those in the enclosed documents. The Working Group... more
Khamti Shan, a Tai language spoken in Kachin State, Myanmar, is a northern dialect of Shan spoken in Shan State. Shan and Khamti Shan have adapted the Myanmar (Burmese) writing system for their own use. The Khamti Shan orthography was... more
A paper on synchronic aspects of writing and standardization of single and double quotation marks is the second and final part of the study on quotation characters in the Croatian language. Quotation marks are examined from three research... more
Notes published online (2005) on the electronic handling of texts in classical languages following the introduction of Unicode encoding.
The 'spidery kh' is one of the most puzzling characters in the Glagolitic alphabet. The paper presents evidence especially from variations of the sun symbol in the Caucasus to show its relationship to the Glagolitic glyph. Written and... more
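Points out the current confusion surrounding the concepts of kanji "character form" (字体) and "glyph shape" (字形), and exposes the unscientific nature of Unicode's "source code separation" (原規格分離) principle.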
- by Yixing Zhu
- Unicode, Chinese characters (漢字), character forms (字体), semiotics of Chinese characters (漢字記号論)
Linguistics is a highly advanced scientific practice today. Linguistic literature addresses the global linguistic community irrespective of the paradigm, family, and size of the languages. In the midst of the most alarming situation... more
The paper gives an update on the current state of Unicode (v. 13 and 14, 2020-2021) and outlines several areas of open problems.
Tamga or tamgha are emblematic symbols which were historically used by various Mongolic and Turkic tribes or clans in Central Asia. Over a hundred different Mongolian tamga are known. Certain tamga were adopted by individual medieval... more
The Khitan Small Script (Chinese Qìdān xiǎozì 契丹小字) and the Khitan Large Script (Chinese Qìdān dàzì 契丹大字) are two distinct and morphologically different scripts that were both used by the Khitan people of Northern China to write the... more
This is a proposal to encode six additional Tangut ideographs. Two of these characters are attested in a recently discovered Tangut manuscript translation of a Tibetan Tantric Buddhist text written in 1258 by Drogön Chögyal Phagpa... more
The paper deals with the structural premises of diachronic corpora that are meant to represent specimens of a given language throughout its historical stages and to provide a diachronic cross-century retrieval. On the basis of the... more