fbpx
Wikipedia

ISO/IEC 8859-5

ISO/IEC 8859-5:1999, Information technology — 8-bit single-byte coded graphic character sets — Part 5: Latin/Cyrillic alphabet, is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1988. It is informally referred to as Latin/Cyrillic.

ISO-8859-5
Alias(es)ISO-IR-144, cyrillic, csISOLatinCyrillic[1]
Language(s)Russian, Bulgarian, Belarusian, Macedonian, Serbian, Ukrainian (partial)
StandardISO/IEC 8859-5,
ECMA-113 (since 1988 edition)
ClassificationExtended ASCII, ISO 8859
ExtendsUS-ASCII, ISO-IR-153
Based onMain code page[2]
ExtensionsIBM-915
Preceded byECMA-113:1986 (ISO-IR-111)
Other related encoding(s)IBM-1124

It was designed to cover languages using a Cyrillic alphabet such as Bulgarian, Belarusian, Russian, Serbian and Macedonian but was never widely used. The 8-bit encodings KOI8-R and KOI8-U, IBM-866, and also Windows-1251 are far more commonly used. In contrast to the relationship between Windows-1252 and ISO 8859-1, Windows-1251 is not closely related to ISO 8859-5. However, the main Cyrillic block in Unicode uses a layout based on ISO-8859-5.

ISO 8859-5 would also have been usable for Ukrainian in the Soviet Union from 1933 to 1990, but it is missing the Ukrainian letter ge, ґ, which is required in Ukrainian orthography before and since, and during that period outside Soviet Ukraine. As a result, IBM created Code page 1124.

ISO-8859-5 is the IANA preferred charset name for this standard when supplemented with the C0 and C1 control codes from ISO/IEC 6429. The Windows code page for ISO-8859-5 is code page 28595 a.k.a. Windows-28595.[3]

Codepage layout edit

Differences from ISO 8859-1 are shown with its Unicode equivalent code point.

ISO/IEC 8859-5
0 1 2 3 4 5 6 7 8 9 A B C D E F
0x
1x
2x  SP  ! " # $ % & ' ( ) * + , - . /
3x 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
4x @ A B C D E F G H I J K L M N O
5x P Q R S T U V W X Y Z [ \ ] ^ _
6x ` a b c d e f g h i j k l m n o
7x p q r s t u v w x y z { | } ~
8x
9x
Ax NBSP Ё
0401
Ђ
0402
Ѓ
0403
Є
0404
Ѕ
0405
І
0406
Ї
0407
Ј
0408
Љ
0409
Њ
040A
Ћ
040B
Ќ
040C
SHY Ў
040E
Џ
040F
Bx А
0410
Б
0411
В
0412
Г
0413
Д
0414
Е
0415
Ж
0416
З
0417
И
0418
Й
0419
К
041A
Л
041B
М
041C
Н
041D
О
041E
П
041F
Cx Р
0420
С
0421
Т
0422
У
0423
Ф
0424
Х
0425
Ц
0426
Ч
0427
Ш
0428
Щ
0429
Ъ
042A
Ы
042B
Ь
042C
Э
042D
Ю
042E
Я
042F
Dx а
0430
б
0431
в
0432
г
0433
д
0434
е
0435
ж
0436
з
0437
и
0438
й
0439
к
043A
л
043B
м
043C
н
043D
о
043E
п
043F
Ex р
0440
с
0441
т
0442
у
0443
ф
0444
х
0445
ц
0446
ч
0447
ш
0448
щ
0449
ъ
044A
ы
044B
ь
044C
э
044D
ю
044E
я
044F
Fx
2116
ё
0451
ђ
0452
ѓ
0453
є
0454
ѕ
0455
і
0456
ї
0457
ј
0458
љ
0459
њ
045A
ћ
045B
ќ
045C
§
00A7
ў
045E
џ
045F

History and related code pages edit

The ECMA-113 standard has been equivalent to ISO-8859-5 since its second edition,[4] its first edition (ISO-IR-111) having been an extension of the earlier KOI-8 (defined by GOST 19768-74), which lays out the Russian letters in the same way as their ASCII Roman equivalents where possible. The initial draft of ISO-8859-5 (DIS-8859-5:1987) followed ISO-IR-111, but was revised[4] after GOST 19768-74 was replaced[5] by the new ISO-IR-153 in 1987, which re-arranged the Russian letters into alphabetical order (except for Ё).[5][6] ISO-IR-153 contains the Russian letters, including Ё, and the non-breaking space and soft hyphen, whereas the full Cyrillic set of ISO-8859-5 is also called ISO-IR-144.[7]

Possibly as a consequence of this confusion, RFC 1345 erroneously lists yet another code page as "ISO-IR-111", combining the letter order and case order of ISO-8859-5 with the row order of ISO-IR-111 (and consequently compatible with neither in practice, but in practice partially compatible[2] with Windows-1251).[8][2]

IBM Code page 915 is an extension of ISO/IEC 8859-5, adding some semigraphic and other symbols in the C1 area. IBM Code page 1124 is mostly identical to ISO-8859-5, but replaces ѓ with ґ for Ukrainian use.

ISO-IR-200, "Uralic Supplementary Cyrillic Set",[9] was registered in 1998 by Everson Gunn Teoranta (which Michael Everson was a director of, prior to the founding of Evertype in 2001),[10] and changes several of the non-Russian letters in order to support the Kildin Sami, Komi and Nenets languages, not supported by ISO-8859-5 itself. Michael Everson also introduced Mac OS Barents Cyrillic for the same languages on classic Mac OS.

ISO-IR 200[9] (differences from ISO-8859-5)
0 1 2 3 4 5 6 7 8 9 A B C D E F
Ax NBSP Ё Ӈ
04C7
Ӓ
04D2
Ӭ
04EC
Ҍ
048C
І Ӧ
04E6
Ҋ
048A
Ӆ
04C5
Ӊ
04C9
«
00AB
Ӎ
04CD
SHY Ҏ
048E
ʼ
02BC
Fx ё ӈ
04C8
ӓ
04D3
ӭ
04ED
ҍ
048D
і ӧ
04E7
ҋ
048B
ӆ
04C6
ӊ
04CA
»
00BB
ӎ
04CE
§ ҏ
048F
ˮ
02EE

ISO-IR-201, "Volgaic Supplementary Cyrillic Set",[11] was similarly introduced by Everson Gunn Teoranta in order to support the Chuvash, Komi, Mari and Udmurt languages, spoken in the titular republics of Russia.

ISO-IR 201[11] (differences from ISO-8859-5)
0 1 2 3 4 5 6 7 8 9 A B C D E F
Ax NBSP Ё Ӑ
04D0
Ӓ
04D2
Ӗ
04D6
Ҫ
04AA
І Ӧ
04E6
Ӥ
04E4
Ӝ
04DC
Ҥ
04A4
Ӹ
04F8
Ӟ
04DE
SHY Ӱ
04F0
Ӵ
04F4
Fx ё ӑ
04D1
ӓ
04D3
ӗ
04D7
ҫ
04AB
і ӧ
04E7
ӥ
04E5
ӝ
04DD
ҥ
04A5
ӹ
04F9
ӟ
04DF
§ ӱ
04F1
ӵ
04F5

References edit

  1. ^ Character Sets, Internet Assigned Numbers Authority (IANA), 2018-12-12.
  2. ^ a b c Nechayev, Valentin (2013) [2001]. "Review of 8-bit Cyrillic encodings universe". from the original on 2016-12-05. Retrieved 2016-12-05.
  3. ^ "Code Page Identifiers".
  4. ^ a b "ECMA-113 - Ecma International" (PDF).
  5. ^ a b Czyborra, Roman (1998-11-30) [1998-05-25]. . Archived from the original on 2016-12-03. Retrieved 2016-12-03.
  6. ^ "gost19768-87 TXT.GZ file".
  7. ^ European Computer Manufacturers Association (1 May 1988). Cyrillic part of the Latin/Cyrillic alphabet (PDF). ITSCJ/IPSJ. ISO-IR-144.
  8. ^ Sokolov, Michael (2003-04-05). "ECMA-cyrillic alias iso-ir-111 sore". IETF Charsets Mailing List (Mailing list).
  9. ^ a b National Standards Authority of Ireland. Uralic Supplementary Cyrillic Set (PDF). ITSCJ/IPSJ. ISO-IR-200.
  10. ^ Gunn, Marion; Everson, Michael (2001-09-20). "Everson Gunn Teoranta (EGT) & Everson Typography". Unicode Mail List (Mailing list).
  11. ^ a b National Standards Authority of Ireland. Volgaic Supplementary Cyrillic Set (PDF). ITSCJ/IPSJ. ISO-IR-201.

External links edit

  • ISO-IR 144 Cyrillic part of the Latin/Cyrillic Alphabet (May 1, 1988, from ISO 8859-5 2nd version)
  • ISO/IEC 8859-5:1999
  • Standard ECMA-113: 8-Bit Single-Byte Coded Graphic Character Sets - Latin/Cyrillic Alphabet 3rd edition (December 1999)

8859, ecma, redirects, here, encoding, defined, first, 1986, edition, that, standard, 1999, information, technology, single, byte, coded, graphic, character, sets, part, latin, cyrillic, alphabet, part, 8859, series, ascii, based, standard, character, encoding. ECMA 113 redirects here For the encoding defined by the first 1986 edition of that standard see ISO IR 111 ISO IEC 8859 5 1999 Information technology 8 bit single byte coded graphic character sets Part 5 Latin Cyrillic alphabet is part of the ISO IEC 8859 series of ASCII based standard character encodings first edition published in 1988 It is informally referred to as Latin Cyrillic ISO 8859 5Alias es ISO IR 144 cyrillic csISOLatinCyrillic 1 Language s Russian Bulgarian Belarusian Macedonian Serbian Ukrainian partial StandardISO IEC 8859 5 ECMA 113 since 1988 edition ClassificationExtended ASCII ISO 8859ExtendsUS ASCII ISO IR 153Based onMain code page 2 ExtensionsIBM 915Preceded byECMA 113 1986 ISO IR 111 Other related encoding s IBM 1124vteIt was designed to cover languages using a Cyrillic alphabet such as Bulgarian Belarusian Russian Serbian and Macedonian but was never widely used The 8 bit encodings KOI8 R and KOI8 U IBM 866 and also Windows 1251 are far more commonly used In contrast to the relationship between Windows 1252 and ISO 8859 1 Windows 1251 is not closely related to ISO 8859 5 However the main Cyrillic block in Unicode uses a layout based on ISO 8859 5 ISO 8859 5 would also have been usable for Ukrainian in the Soviet Union from 1933 to 1990 but it is missing the Ukrainian letter ge g which is required in Ukrainian orthography before and since and during that period outside Soviet Ukraine As a result IBM created Code page 1124 ISO 8859 5 is the IANA preferred charset name for this standard when supplemented with the C0 and C1 control codes from ISO IEC 6429 The Windows code page for ISO 8859 5 is code page 28595 a k a Windows 28595 3 Contents 1 Codepage layout 2 History and related code pages 3 References 4 External linksCodepage layout editDifferences from ISO 8859 1 are shown with its Unicode equivalent code point ISO IEC 8859 5 0 1 2 3 4 5 6 7 8 9 A B C D E F0x1x2x SP amp 3x 0 1 2 3 4 5 6 7 8 9 lt gt 4x A B C D E F G H I J K L M N O5x P Q R S T U V W X Y Z 6x a b c d e f g h i j k l m n o7x p q r s t u v w x y z 8x9xAx NBSP Yo0401 Ђ0402 Ѓ0403 Ye0404 Ѕ0405 I0406 Yi0407 Ј0408 Љ0409 Њ040A Ћ040B Ќ040C SHY Ў040E Џ040FBx A0410 B0411 V0412 G0413 D0414 E0415 Zh0416 Z0417 I0418 J0419 K041A L041B M041C N041D O041E P041FCx R0420 S0421 T0422 U0423 F0424 H0425 C0426 Ch0427 Sh0428 Sh0429 042A Y042B 042C E042D Yu042E Ya042FDx a0430 b0431 v0432 g0433 d0434 e0435 zh0436 z0437 i0438 j0439 k043A l043B m043C n043D o043E p043FEx r0440 s0441 t0442 u0443 f0444 h0445 c0446 ch0447 sh0448 sh0449 044A y044B 044C e044D yu044E ya044FFx 2116 yo0451 ђ0452 ѓ0453 ye0454 ѕ0455 i0456 yi0457 ј0458 љ0459 њ045A ћ045B ќ045C 00A7 y045E џ045FHistory and related code pages editFurther information ISO IR 111 Naming confusion The ECMA 113 standard has been equivalent to ISO 8859 5 since its second edition 4 its first edition ISO IR 111 having been an extension of the earlier KOI 8 defined by GOST 19768 74 which lays out the Russian letters in the same way as their ASCII Roman equivalents where possible The initial draft of ISO 8859 5 DIS 8859 5 1987 followed ISO IR 111 but was revised 4 after GOST 19768 74 was replaced 5 by the new ISO IR 153 in 1987 which re arranged the Russian letters into alphabetical order except for Yo 5 6 ISO IR 153 contains the Russian letters including Yo and the non breaking space and soft hyphen whereas the full Cyrillic set of ISO 8859 5 is also called ISO IR 144 7 Possibly as a consequence of this confusion RFC 1345 erroneously lists yet another code page as ISO IR 111 combining the letter order and case order of ISO 8859 5 with the row order of ISO IR 111 and consequently compatible with neither in practice but in practice partially compatible 2 with Windows 1251 8 2 IBM Code page 915 is an extension of ISO IEC 8859 5 adding some semigraphic and other symbols in the C1 area IBM Code page 1124 is mostly identical to ISO 8859 5 but replaces ѓ with g for Ukrainian use ISO IR 200 Uralic Supplementary Cyrillic Set 9 was registered in 1998 by Everson Gunn Teoranta which Michael Everson was a director of prior to the founding of Evertype in 2001 10 and changes several of the non Russian letters in order to support the Kildin Sami Komi and Nenets languages not supported by ISO 8859 5 itself Michael Everson also introduced Mac OS Barents Cyrillic for the same languages on classic Mac OS ISO IR 200 9 differences from ISO 8859 5 0 1 2 3 4 5 6 7 8 9 A B C D E FAx NBSP Yo Ӈ04C7 Ӓ04D2 Ӭ04EC Ҍ048C I Ӧ04E6 Ҋ048A Ӆ04C5 Ӊ04C9 00AB Ӎ04CD SHY Ҏ048E ʼ02BCFx yo ӈ04C8 ӓ04D3 ӭ04ED ҍ048D i ӧ04E7 ҋ048B ӆ04C6 ӊ04CA 00BB ӎ04CE ҏ048F ˮ02EEISO IR 201 Volgaic Supplementary Cyrillic Set 11 was similarly introduced by Everson Gunn Teoranta in order to support the Chuvash Komi Mari and Udmurt languages spoken in the titular republics of Russia ISO IR 201 11 differences from ISO 8859 5 0 1 2 3 4 5 6 7 8 9 A B C D E FAx NBSP Yo Ӑ04D0 Ӓ04D2 Ӗ04D6 Ҫ04AA I Ӧ04E6 Ӥ04E4 Ӝ04DC Ҥ04A4 Ӹ04F8 Ӟ04DE SHY Ӱ04F0 Ӵ04F4Fx yo ӑ04D1 ӓ04D3 ӗ04D7 ҫ04AB i ӧ04E7 ӥ04E5 ӝ04DD ҥ04A5 ӹ04F9 ӟ04DF ӱ04F1 ӵ04F5References edit Character Sets Internet Assigned Numbers Authority IANA 2018 12 12 a b c Nechayev Valentin 2013 2001 Review of 8 bit Cyrillic encodings universe Archived from the original on 2016 12 05 Retrieved 2016 12 05 Code Page Identifiers a b ECMA 113 Ecma International PDF a b Czyborra Roman 1998 11 30 1998 05 25 The Cyrillic Charset Soup Archived from the original on 2016 12 03 Retrieved 2016 12 03 gost19768 87 TXT GZ file European Computer Manufacturers Association 1 May 1988 Cyrillic part of the Latin Cyrillic alphabet PDF ITSCJ IPSJ ISO IR 144 Sokolov Michael 2003 04 05 ECMA cyrillic alias iso ir 111 sore IETF Charsets Mailing List Mailing list a b National Standards Authority of Ireland Uralic Supplementary Cyrillic Set PDF ITSCJ IPSJ ISO IR 200 Gunn Marion Everson Michael 2001 09 20 Everson Gunn Teoranta EGT amp Everson Typography Unicode Mail List Mailing list a b National Standards Authority of Ireland Volgaic Supplementary Cyrillic Set PDF ITSCJ IPSJ ISO IR 201 External links editISO IR 144 Cyrillic part of the Latin Cyrillic Alphabet May 1 1988 from ISO 8859 5 2nd version ISO IEC 8859 5 1999 Standard ECMA 113 8 Bit Single Byte Coded Graphic Character Sets Latin Cyrillic Alphabet 3rd edition December 1999 Retrieved from https en wikipedia org w index php title ISO IEC 8859 5 amp oldid 1185461265, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.