Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

Windows-1250 is a code page used under Microsoft Windows to represent texts in Central European and Eastern European languages that use the Latin script. It is primarily used by Czech.[1] It is also used for Polish (as can Windows-1257), Slovak, Hungarian, Slovene (as can Windows-1257), Serbo-Croatian (Latin script), Romanian (before a 1993 spelling reform) and Albanian (as can Windows-1252). It may also be used with the German language, though it's missing uppercase .[a] German-language texts encoded with Windows-1250 and Windows-1252 are identical.

Windows-1250
MIME / IANAwindows-1250
Alias(es)cp1250 (Code page 1250)
Language(s)Czech, Polish, Slovak, Hungarian, Slovene, Serbo-Croatian (Latin script), Montenegrin, Romanian (before 1993 spelling reform), Turkmen, Rotokas, Albanian, English, German, Irish, Luxembourgish, Dutch
Created byMicrosoft
StandardWHATWG Encoding Standard
Classificationextended ASCII, Windows-125x
Other related encoding(s)ISO-8859-2

This has been replaced by UTF-8 far more than Windows-1252 has. As of October 2022, less than 0.04% of all web pages use Windows-1250.[2][3][4]

Windows-1250 is similar to ISO-8859-2 and has all the printable characters it has and more. However a few of them are rearranged (unlike Windows-1252, which keeps all printable characters from ISO-8859-1 in the same place). Most of the rearrangements seem to have been done to keep characters shared with Windows-1252 in the same place but three of the characters moved (Ą, Ľ, ź) cannot be explained this way, since those do not occur in Windows-1252 and could have been put in the same positions as in ISO-8859-2 if ˇ had been put e.g. at 9F.

IBM uses code page 1250 (CCSID 1250 and euro sign extended CCSID 5346) for Windows-1250.[5][6][7][8][9][10][11]

Character set

edit

The following table shows Windows-1250. Each character is shown with its Unicode equivalent.

Windows-1250[12]
0 1 2 3 4 5 6 7 8 9 A B C D E F
0x NUL SOH STX ETX EOT ENQ ACK BEL BS HT LF VT FF CR SO SI
1x DLE DC1 DC2 DC3 DC4 NAK SYN ETB CAN EM SUB ESC FS GS RS US
2x  SP  ! " # $ % & ' ( ) * + , - . /
3x 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
4x @ A B C D E F G H I J K L M N O
5x P Q R S T U V W X Y Z [ \ ] ^ _
6x ` a b c d e f g h i j k l m n o
7x p q r s t u v w x y z { | } ~ DEL
8x Š Ś Ť Ž Ź
9x š ś ť ž ź
Ax NBSP ˇ ˘ Ł ¤ Ą ¦ § ¨ © Ş « ¬ SHY ® Ż
Bx ° ± ˛ ł ´ µ · ¸ ą ş » Ľ ˝ ľ ż
Cx Ŕ Á Â Ă Ä Ĺ Ć Ç Č É Ę Ë Ě Í Î Ď
Dx Đ Ń Ň Ó Ô Ő Ö × Ř Ů Ú Ű Ü Ý Ţ ß
Ex ŕ á â ă ä ĺ ć ç č é ę ë ě í î ď
Fx đ ń ň ó ô ő ö ÷ ř ů ú ű ü ý ţ ˙
  Different from ISO-8859-2
  Different from Windows-1252 to match ISO-8859-2
  Different from both Windows-1252 and ISO-8859-2

See also

edit

Notes

edit
  1. ^ In 2017, the Council for German Orthography officially adopted a capital, ⟨ẞ⟩, before support for German was complete. Fully compatible with ISO/IEC 8859-1 for German texts.

References

edit
  1. ^ "Distribution of Content Languages among websites that use Windows-1250". w3techs.com. Retrieved 2022-10-23.
  2. ^ "Historical trends in the usage of character encodings for websites, October 2022". w3techs.com.
  3. ^ "Frequently Asked Questions". w3techs.com.
  4. ^ "Distribution of Character Encodings among websites that use Czech". w3techs.com. Retrieved 2022-10-23.
  5. ^ "Code page 1250 information document". Archived from the original on 2016-03-03.
  6. ^ "CCSID 1250 information document". Archived from the original on 2016-03-27.
  7. ^ "CCSID 5346 information document". Archived from the original on 2014-11-29.
  8. ^ Code Page CPGID 01250 (pdf) (PDF), IBM
  9. ^ Code Page CPGID 01250 (txt), IBM
  10. ^ International Components for Unicode (ICU), ibm-1250_P100-1995.ucm, 2002-12-03
  11. ^ International Components for Unicode (ICU), ibm-5346_P100-1998.ucm, 2002-12-03
  12. ^ Steele, Shawn (1998), CP1250 to Unicode table, Unicode Consortium, CP1250.TXT
edit