ARIB STD B24 character set

Volume 1 of the Association of Radio Industries and Businesses (ARIB) STD-B24 standard for Broadcast Markup Language[2] specifies, amongst other details, a character encoding for use in Japanese-language broadcasting. It was introduced on 1999-10-26.[2] The latest revision is version 6.3 as of 2016-07-06.

ARIB STD-B24 encoding
Standard: ARIB STD-B24 Volume 1
Classification: ISO 2022 profile/extension
Transforms / Encodes: ARIB STD-B24 Kanji, Kana and mosaic sets; JIS X 0201

ARIB STD-B24 Kanji set
Image caption: Weather symbols, a few of the extended symbols included.
Language(s): Japanese, English, Russian. Partial support: Greek, Chinese
Standard: ARIB STD-B24 Volume 1
Classification: ISO-2022-structured CJK DBCS
Extends: JIS X 0208
Encoding formats:
  • ARIB STD-B24 encoding (ISO 2022 based)
  • Shift JIS (ARIB variant)[1]

It includes a number of ARIB extended characters (ARIB外字, ARIB gaiji) not found in the base standards (JIS X 0208 and JIS X 0201). It was the source standard for many symbol characters which were added to Unicode, including portions of the Miscellaneous Symbols, Enclosed Alphanumeric Supplement and Enclosed Ideographic Supplement blocks.[3] Its contributions partially overlap the Unicode emoji, but were added a year earlier, in Unicode 5.2.[4]

Fascicle 1 of the ARIB STD-B62 standard, published in 2014, defines Unicode mappings for a selection of the B24 extended characters (excluding, for example, those duplicated by JIS X 0213), as well as a few extended Kanji.[5] It also includes a mapping of utilised characters outside the Basic Multilingual Plane to the BMP's private use area.

Sets and codes

The ARIB STD B24 standard defines multiple character sets and a method of switching between them. These include a Kanji set (an extension of JIS X 0208), an Alphanumeric set, a Hiragana set, Katakana sets of two distinct layouts and four mosaic sets.[6] The sets are selected using ISO 2022 mechanisms for 94-sets, using the following codes (proportional sets use the same layout as the corresponding non-proportional ones):[7]

Set | Type | Code (column/line) | Code (hexadecimal) | Code (ASCII character) | Comments
Kanji | 2-byte | 4/2 | 42 | B | The escape code B used for the ARIB Kanji set[7] is used for the 1983 version of JIS C 6226 (JIS X 0208, of which the ARIB Kanji set is an extension) in ISO-2022-JP.[8][9]
Alphanumeric | 1-byte | 4/10 | 4A | J | JIS_C6220-ro (ISO646-JP, JIS X 0201 Roman set). Similar to ASCII, with two assignments differing. Escape code J matches usage in ISO-2022-JP.[9]
Proportional alphanumeric | 1-byte | 3/6 | 36 | 6 |
Hiragana | 1-byte | 3/0 | 30 | 0 | Hiragana themselves follow the same layout as row 4 of JIS X 0208, but without a lead byte. Also adds several additional assignments for punctuation.
Proportional Hiragana | 1-byte | 3/7 | 37 | 7 |
Katakana | 1-byte | 3/1 | 31 | 1 | Katakana themselves follow the same layout as row 5 of JIS X 0208, but without a lead byte. Also adds several additional assignments for punctuation.
Proportional Katakana | 1-byte | 3/8 | 38 | 8 |
JIS X 0201 Katakana | 1-byte | 4/9 | 49 | I | JIS_C6220-jp (JIS X 0201 Kana set). Escape code matches usage in ISO-2022-JP-3.
Mosaic A | 1-byte | 3/2 | 32 | 2 | Pseudographics
Mosaic B | 1-byte | 3/3 | 33 | 3 |
Mosaic C | 1-byte | 3/4 | 34 | 4 | Non-spacing pseudographics
Mosaic D | 1-byte | 3/5 | 35 | 5 |
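
The column/line notation in the table converts to the hexadecimal code as column × 16 + line. The following Python sketch shows that arithmetic with the table transcribed into a dictionary. The names `FINAL_BYTES` and `final_byte` are illustrative only and do not come from the standard; the sketch covers just the code column of the table, not the full ISO 2022 designation sequences.

```python
# Codes from the table above, keyed by set name, in column/line notation.
# FINAL_BYTES is a hypothetical name chosen for this sketch.
FINAL_BYTES = {
    "Kanji": (4, 2),
    "Alphanumeric": (4, 10),
    "Proportional alphanumeric": (3, 6),
    "Hiragana": (3, 0),
    "Proportional Hiragana": (3, 7),
    "Katakana": (3, 1),
    "Proportional Katakana": (3, 8),
    "JIS X 0201 Katakana": (4, 9),
    "Mosaic A": (3, 2),
    "Mosaic B": (3, 3),
    "Mosaic C": (3, 4),
    "Mosaic D": (3, 5),
}


def final_byte(column: int, line: int) -> int:
    """Convert column/line notation (each 0..15) to a single byte value."""
    if not (0 <= column <= 15 and 0 <= line <= 15):
        raise ValueError("column and line must each be in 0..15")
    return (column << 4) | line


for name, (column, line) in FINAL_BYTES.items():
    byte = final_byte(column, line)
    print(f"{name}: {column}/{line} = 0x{byte:02X} = {chr(byte)!r}")
```

For example, 4/10 becomes 0x4A, which is the ASCII letter J listed for the Alphanumeric set.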

Code charts

Kanji (double-byte) set

This is a double-byte character set extending JIS X 0208.

Lead byte

Each encoded byte is the row or cell number plus 0x20 (32 in decimal); see the chart below. Hence the lead byte 0x21 selects row 1, and cell 1 of that row has the continuation byte 0x21 (33), and so forth. Most of the rows correspond to JIS X 0208.
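
The offset rule above can be sketched in Python as follows. The function names `rowcell_to_bytes` and `bytes_to_rowcell` are illustrative, and the sketch assumes the bytes are used in their 7-bit form (0x21 to 0x7E) as shown in the chart.

```python
def rowcell_to_bytes(row: int, cell: int) -> bytes:
    """Encode a row-cell pair (each 1..94) as lead and continuation bytes."""
    if not (1 <= row <= 94 and 1 <= cell <= 94):
        raise ValueError("row and cell must each be in 1..94")
    return bytes([row + 0x20, cell + 0x20])


def bytes_to_rowcell(pair: bytes) -> tuple[int, int]:
    """Decode a two-byte code (each byte 0x21..0x7E) into its row-cell pair."""
    if len(pair) != 2 or not all(0x21 <= b <= 0x7E for b in pair):
        raise ValueError("expected two bytes in 0x21..0x7E")
    return pair[0] - 0x20, pair[1] - 0x20


print(rowcell_to_bytes(1, 1).hex())    # 2121
print(rowcell_to_bytes(90, 45).hex())  # 7a4d
print(bytes_to_rowcell(b"\x7e\x21"))   # (94, 1)
```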

ARIB STD-B24 Kanji (double-byte) set (lead bytes)
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x  SP  1-_ 2-_ 3-_ 4-_ 5-_ 6-_ 7-_ 8-_ 9-_ 10-_ 11-_ 12-_ 13-_ 14-_ 15-_
3x 16-_ 17-_ 18-_ 19-_ 20-_ 21-_ 22-_ 23-_ 24-_ 25-_ 26-_ 27-_ 28-_ 29-_ 30-_ 31-_
4x 32-_ 33-_ 34-_ 35-_ 36-_ 37-_ 38-_ 39-_ 40-_ 41-_ 42-_ 43-_ 44-_ 45-_ 46-_ 47-_
5x 48-_ 49-_ 50-_ 51-_ 52-_ 53-_ 54-_ 55-_ 56-_ 57-_ 58-_ 59-_ 60-_ 61-_ 62-_ 63-_
6x 64-_ 65-_ 66-_ 67-_ 68-_ 69-_ 70-_ 71-_ 72-_ 73-_ 74-_ 75-_ 76-_ 77-_ 78-_ 79-_
7x 80-_ 81-_ 82-_ 83-_ 84-_ 85-_ 86-_ 87-_ 88-_ 89-_ 90-_ 91-_ 92-_ 93-_ 94-_ DEL
  Unused lead byte
  Lead byte
  Differences from JIS X 0208

Character sets 0x21-0x74 (row numbers 1-84: punctuation, alphabets, numbers, Kana, Kanji)

Further information: JIS X 0208 code charts

Character set 0x7A (row number 90, traffic symbols)

Characters 90-45 through 90-63 and 90-66 through 90-84 (shown below shaded) are listed in the B24 standard only in table 7-10 (the list of extension characters), and are also the only characters in rows 90 through 91 which are not transport-related symbols; this is noted in the B24 standard in an endnote to table 7-10.[10] The remainder of the extensions are listed in both table 7-4 (the double-byte code chart) and table 7-10.[10]
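
As a worked illustration of the two row-cell ranges named above, the sketch below tests membership and prints the corresponding byte pairs using the plus-0x20 offset. The function name `only_in_table_7_10` is hypothetical and chosen for this example.

```python
def only_in_table_7_10(row: int, cell: int) -> bool:
    """True for the row 90 cells described as listed only in table 7-10."""
    return row == 90 and (45 <= cell <= 63 or 66 <= cell <= 84)


def to_hex(row: int, cell: int) -> str:
    """Two-byte code for a row-cell pair, each byte being the number plus 0x20."""
    return f"{row + 0x20:02X}{cell + 0x20:02X}"


# Endpoints of the two ranges, expressed as byte pairs.
for first, last in ((45, 63), (66, 84)):
    print(f"90-{first}..90-{last}: 0x{to_hex(90, first)}..0x{to_hex(90, last)}")
```

The two ranges therefore cover 0x7A4D to 0x7A5F and 0x7A62 to 0x7A74.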

ARIB STD-B24 Kanji (double-byte) set (prefixed with 0x7A)[5][11]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x
26CC

26CD
❗︎
2757

26CF

26D0

26D1

26D2

26D5

26D3
⛔︎
26D4
3x 🅿
1F17F
🆊
1F18A

26D6

26D7

26D8

26D9

26DA

26DB

26DC

26DD

26DE

26DF

26E0

26E1
4x ⭕︎
2B55

3248

3249

324A

324B

324C

324D

324E

324F

2491

2492

2493
5x 🅊
1F14A
🅌
1F14C
🄿
1F13F
🅆
1F146
🅋
1F14B
🈐
1F210
🈑
1F211
🈒
1F212
🈓
1F213
🅂
1F142
🈔
1F214
🈕
1F215
🈖
1F216
🅍
1F14D
🄱
1F131
🄽
1F13D
6x ⬛︎
2B1B

2B24
🈗
1F217
🈘
1F218
🈙
1F219
🈚︎
1F21A
🈛
1F21B

26BF
🈜
1F21C
🈝
1F21D
🈞
1F21E
🈟
1F21F
🈠
1F220
🈡
1F221
🈢
1F222
🈣
1F223
7x 🈤
1F224
🈥
1F225
🅎
1F14E

3299
🈀
1F200
  Additions from table 7-10 not in table 7-4.

Character set 0x7B (row number 91, map symbols)

Characters from ARIB STD-B24 which were not retained in ARIB STD-B62 are shown shaded.

ARIB STD-B24 Kanji (double-byte) set (prefixed with 0x7B)[5][11][12]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x
26E3

2B56

2B57

2B58

2B59

2613

328B

3012

26E8

3246

3245

26E9
[a]
0FD6
⛪︎
26EA

26EB
3x
26EC

2668

26ED

26EE

26EF
⚓︎
2693

2708

26F0

26F1
⛲︎
26F2
⛳︎
26F3

26F4
⛵︎
26F5
🅗
1F157

24B9

24C8
4x
26F6
🅟
1F15F
🆋
1F18B
🆍
1F18D
🆌
1F18C
🅹
1F179

26F7

26F8

26F9
⛺︎
26FA
🅻
1F17B

260E

26FB

26FC
⛽︎
26FD

26FE
5x 🅼
1F17C

26FF
6x
7x
  Not in ARIB STD-B62

Character set 0x7C (row number 92, units, enclosed forms, list markers, arrows)

Characters from ARIB STD-B24 which were not retained in ARIB STD-B62 are shown shaded.

ARIB STD-B24 Kanji (double-byte) set (prefixed with 0x7C)[5][11][12]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x
27A1

2B05

2B06

2B07

2B2F

2B2E

5E74

6708

65E5

5186

33A1

33A5

339D

33A0

33A4
3x 🄀
1F100

2488

2489

248A

248B

248C

248D

248E

248F

2490
氏[b] 副[b] 元[b] 故[b] 前[b] 新[b]
4x 🄁
1F101
🄂
1F102
🄃
1F103
🄄
1F104
🄅
1F105
🄆
1F106
🄇
1F107
🄈
1F108
🄉
1F109
🄊
1F10A

3233

3236

3232

3231

3239

3244
5x
25B6

25C0

3016

3017

27D0
²
00B2
³
00B3
🄭
1F12D
(vn)[c] (ob)[c] (cb)[c] (ce[c] mb)[c] (hp)[c] (br)[c] (p)[c]
6x (s)[c] (ms)[c] (t)[c] (bs)[c] (b)[c] (tb)[c] (tp)[c] (ds)[c] (ag)[c] (eg)[c] (vo)[c] (fl)[c] (ke[c] y)[c] (sa[c] x)[c]
7x (sy[c] n)[c] (or[c] g)[c] (pe[c] r)[c] 🄬
1F12C
🄫
1F12B

3247
🆐
1F190
🈦
1F226

213B
  Not in ARIB STD-B62

Character set 0x7D (row number 93, game and weather symbols, fractions, units, enclosed forms)

Characters from ARIB STD-B24 which were not retained in ARIB STD-B62 are shown shaded.

ARIB STD-B24 Kanji (double-byte) set (prefixed with 0x7D)[5][11][12]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x
322A

322B

322C

322D

322E

322F

3230

3237

337E

337D

337C

337B

2116

2121

3036
3x ⚾︎
26BE
🉀
1F240
🉁
1F241
🉂
1F242
🉃
1F243
🉄
1F244
🉅
1F245
🉆
1F246
🉇
1F247
🉈
1F248
🄪
1F12A
🈧
1F227
🈨
1F228
🈩
1F229
🈔
1F214
🈪
1F22A
4x 🈫
1F22B
🈬
1F22C
🈭
1F22D
🈮
1F22E
🈯︎
1F22F
🈰
1F230
🈱
1F231

2113

338F

3390

33CA

339E

33A2

3371
5x ½
00BD

2189

2153

2154
¼
00BC
¾
00BE

2155

2156

2157

2158

2159

215A

2150

215B

2151

2152
6x
2600

2601

2602
⛄︎
26C4

2616

2617

26C9

26CA

2666

2665

2663

2660

26CB

2A00

203C

2049
7x ⛅︎
26C5
☔︎
2614

26C6

2603

26C7
⚡︎
26A1

26C8

269E

269F

266C

260E
  Not in ARIB STD-B62

Character set 0x7E (row number 94, list markers)

Characters from ARIB STD-B24 which were not retained in ARIB STD-B62 are shown shaded.

ARIB STD-B24 Kanji (double-byte) set (prefixed with 0x7E)[5][11][12]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x
2160

2161

2162

2163

2164

2165

2166

2167

2168

2169

216A

216B

2470

2471

2472
3x
2473

2474

2475

2476

2477

2478

2479

247A

247B

247C

247D

247E

247F

3251

3252

3253
4x
3254
🄐
1F110
🄑
1F111
🄒
1F112
🄓
1F113
🄔
1F114
🄕
1F115
🄖
1F116
🄗
1F117
🄘
1F118
🄙
1F119
🄚
1F11A
🄛
1F11B
🄜
1F11C
🄝
1F11D
🄞
1F11E
5x 🄟
1F11F
🄠
1F120
🄡
1F121
🄢
1F122
🄣
1F123
🄤
1F124
🄥
1F125
🄦
1F126
🄧
1F127
🄨
1F128
🄩
1F129

3255

3256

3257

3258

3259
6x
325A

2460

2461

2462

2463

2464

2465

2466

2467

2468

2469

246A

246B

246C

246D

246E
7x
246F

2776

2777

2778

2779

277A

277B

277C

277D

277E

277F

24EB

24EC

325B
  Not in ARIB STD-B62

Single-byte sets

Alphanumeric set

ARIB STD-B24 Alphanumeric set[14]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x -- ! " # $ % & ' ( ) * + , - . /
3x 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
4x @ A B C D E F G H I J K L M N O
5x P Q R S T U V W X Y Z [ ¥ ] ^ _
6x ` a b c d e f g h i j k l m n o
7x p q r s t u v w x y z { | } ‾ --
  -- marks a position with no assignment in this set.
  Differences from US-ASCII: 0x5C is ¥ (U+00A5) and 0x7E is ‾ (U+203E). Every other position maps to the Unicode code point equal to its byte value (0x21 to U+0021, and so on).
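
A minimal Python sketch of decoding this set, assuming the mapping shown in the chart: every byte maps to the same-valued code point except the two positions that differ from US-ASCII. The names `ALNUM_OVERRIDES` and `decode_alphanumeric` are illustrative.

```python
# The two positions that differ from US-ASCII, with the code points from the chart.
ALNUM_OVERRIDES = {0x5C: "\u00A5", 0x7E: "\u203E"}


def decode_alphanumeric(data: bytes) -> str:
    """Decode bytes of the Alphanumeric set (0x21..0x7E) to a Unicode string."""
    out = []
    for b in data:
        if not 0x21 <= b <= 0x7E:
            raise ValueError(f"byte 0x{b:02X} is outside the 94-character set")
        out.append(ALNUM_OVERRIDES.get(b, chr(b)))
    return "".join(out)


# The input bytes are: A, 0x5C, 1, 0, 0, 0x7E.
print(decode_alphanumeric(b"A\\100~"))
```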

Hiragana set

ARIB STD-B24 Hiragana set[15]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x -- ぁ あ ぃ い ぅ う ぇ え ぉ お か が き ぎ く
3x ぐ け げ こ ご さ ざ し じ す ず せ ぜ そ ぞ た
4x だ ち ぢ っ つ づ て で と ど な に ぬ ね の は
5x ば ぱ ひ び ぴ ふ ぶ ぷ へ べ ぺ ほ ぼ ぽ ま み
6x む め も ゃ や ゅ ゆ ょ よ ら り る れ ろ ゎ わ
7x ゐ ゑ を ん -- -- -- ゝ ゞ ー 。 「 」 、 ・ --
  -- marks a position with no assignment in this set.
  Positions 0x21 to 0x73 map to U+3041 to U+3093 in order. Positions 0x77 to 0x7E map to U+309D, U+309E, U+30FC, U+3002, U+300C, U+300D, U+3001 and U+30FB.
  Character allocations not following row 4 of JIS X 0208: 0x77 to 0x7E.

Katakana set

ARIB STD-B24 Katakana set[16]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x -- ァ ア ィ イ ゥ ウ ェ エ ォ オ カ ガ キ ギ ク
3x グ ケ ゲ コ ゴ サ ザ シ ジ ス ズ セ ゼ ソ ゾ タ
4x ダ チ ヂ ッ ツ ヅ テ デ ト ド ナ ニ ヌ ネ ノ ハ
5x バ パ ヒ ビ ピ フ ブ プ ヘ ベ ペ ホ ボ ポ マ ミ
6x ム メ モ ャ ヤ ュ ユ ョ ヨ ラ リ ル レ ロ ヮ ワ
7x ヰ ヱ ヲ ン ヴ ヵ ヶ ヽ ヾ ー 。 「 」 、 ・ --
  -- marks a position with no assignment in this set.
  Positions 0x21 to 0x76 map to U+30A1 to U+30F6 in order. Positions 0x77 to 0x7E map to U+30FD, U+30FE, U+30FC, U+3002, U+300C, U+300D, U+3001 and U+30FB.
  Character allocations not following row 5 of JIS X 0208: 0x77 to 0x7E.
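
Because the kana in the Hiragana and Katakana sets are laid out in Unicode order, decoding them reduces to adding a constant offset, plus a small table for the trailing marks and punctuation. The Python sketch below assumes that the Hiragana marks and punctuation sit at 0x77 to 0x7E, the same positions as in the Katakana set, leaving 0x74 to 0x76 unassigned in the Hiragana set. All names are illustrative.

```python
# Punctuation at 0x79..0x7E in both sets (code points from the charts).
SHARED_PUNCTUATION = {
    0x79: "\u30FC", 0x7A: "\u3002", 0x7B: "\u300C",
    0x7C: "\u300D", 0x7D: "\u3001", 0x7E: "\u30FB",
}
# Iteration marks at 0x77..0x78; the Hiragana positions are an assumption.
HIRAGANA_MARKS = {0x77: "\u309D", 0x78: "\u309E"}
KATAKANA_MARKS = {0x77: "\u30FD", 0x78: "\u30FE"}


def decode_kana_byte(byte: int, katakana: bool = False) -> str:
    """Decode one byte of the single-byte Hiragana or Katakana set."""
    last_kana = 0x76 if katakana else 0x73   # last position in Unicode order
    offset = 0x3080 if katakana else 0x3020  # 0x21 -> U+30A1 or U+3041
    if 0x21 <= byte <= last_kana:
        return chr(byte + offset)
    marks = KATAKANA_MARKS if katakana else HIRAGANA_MARKS
    if byte in marks:
        return marks[byte]
    if byte in SHARED_PUNCTUATION:
        return SHARED_PUNCTUATION[byte]
    raise ValueError(f"byte 0x{byte:02X} has no assignment in this set")


print("".join(decode_kana_byte(b) for b in b"\x22\x6A\x2C\x48\x26"))        # ありがとう
print("".join(decode_kana_byte(b, katakana=True) for b in b"\x46\x6C\x53"))  # テレビ
```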

JIS X 0201 Katakana set

ARIB STD-B24 JIS X 0201 Katakana set[17]
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x -- ｡ ｢ ｣ ､ ･ ｦ ｧ ｨ ｩ ｪ ｫ ｬ ｭ ｮ ｯ
3x ｰ ｱ ｲ ｳ ｴ ｵ ｶ ｷ ｸ ｹ ｺ ｻ ｼ ｽ ｾ ｿ
4x ﾀ ﾁ ﾂ ﾃ ﾄ ﾅ ﾆ ﾇ ﾈ ﾉ ﾊ ﾋ ﾌ ﾍ ﾎ ﾏ
5x ﾐ ﾑ ﾒ ﾓ ﾔ ﾕ ﾖ ﾗ ﾘ ﾙ ﾚ ﾛ ﾜ ﾝ ﾞ ﾟ
6x (no assignments)
7x (no assignments)
  -- marks a position with no assignment in this set.
  Positions 0x21 to 0x5F map to U+FF61 to U+FF9F in order.
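
The chart above runs through U+FF61 to U+FF9F in order, so decoding this set is a single offset. This is a sketch under the assumption that bytes arrive in the 7-bit range 0x21 to 0x5F as charted; the function name is illustrative.

```python
def decode_jisx0201_katakana(data: bytes) -> str:
    """Decode bytes of the JIS X 0201 Katakana set (0x21..0x5F)."""
    out = []
    for b in data:
        if not 0x21 <= b <= 0x5F:
            raise ValueError(f"byte 0x{b:02X} has no assignment in this set")
        out.append(chr(b + 0xFF40))  # 0x21 -> U+FF61 ... 0x5F -> U+FF9F
    return "".join(out)


# 0x3F is the last cell of row 3x, listed in the chart as U+FF7F.
print(f"U+{ord(decode_jisx0201_katakana(b'?')):04X}")  # U+FF7F
```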

Mosaic sets

Shift_JIS variant

In addition to the modified ISO 2022 encoding, the B24 standard also specifies a Shift JIS encoding following JIS X 0208:1997, but with the addition of the extended characters in the kanji set.[1]

[Two colour-coded 16×16 byte maps, "First byte" and "Second byte", appeared here; the colour coding is not reproduced.]

Single-byte assignments shown in the first-byte map:
0 1 2 3 4 5 6 7 8 9 A B C D E F
2 -- ! " # $ % & ' ( ) * + , - . /
3 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
4 @ A B C D E F G H I J K L M N O
5 P Q R S T U V W X Y Z [ ¥ ] ^ _
6 ` a b c d e f g h i j k l m n o
7 p q r s t u v w x y z { | }
Rows A to D hold the single-byte half-width katakana (for example ソ in row B).

Categories distinguished by the legend:
  • Non-printable ASCII character
  • Unaltered ASCII character
  • Modified ASCII character
  • Single-byte half-width katakana
  • First byte of a double-byte character, used by JIS X 0208
  • First byte of an ARIB extended character
  • Not used as first byte, unallocated space in JIS X 0208
  • Not used as first byte
  • Second byte of a double-byte character whose first half of the JIS sequence was odd
  • Second byte of a double-byte character whose first half of the JIS sequence was even
  • Unused as second byte of a double-byte character
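
The odd/even distinction in the legend reflects how Shift JIS packs two JIS rows into each first byte. The Python sketch below uses the standard JIS X 0208 to Shift JIS transform. It is an assumption, not something stated above, that the ARIB variant applies the same arithmetic to the extended rows 90 to 94; the function name is illustrative.

```python
def rowcell_to_sjis(row: int, cell: int) -> bytes:
    """Convert a row-cell pair (each 1..94) to a two-byte Shift JIS code."""
    if not (1 <= row <= 94 and 1 <= cell <= 94):
        raise ValueError("row and cell must each be in 1..94")
    j1, j2 = row + 0x20, cell + 0x20  # 7-bit JIS lead and continuation bytes
    s1 = ((j1 - 0x21) >> 1) + 0x81    # two JIS rows share one first byte
    if s1 > 0x9F:
        s1 += 0x40                    # skip 0xA0..0xDF, the single-byte katakana
    if j1 & 1:                        # odd first JIS byte: lower half
        s2 = j2 + 0x1F
        if s2 >= 0x7F:
            s2 += 1                   # 0x7F is not used as a second byte
    else:                             # even first JIS byte: upper half
        s2 = j2 + 0x7E
    return bytes([s1, s2])


print(rowcell_to_sjis(1, 1).hex())   # 8140
print(rowcell_to_sjis(16, 1).hex())  # 889f
print(rowcell_to_sjis(90, 1).hex())  # ed9f
```

Under this assumption, rows 90 to 94 fall on the first bytes 0xED to 0xEF.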


Footnotes

  a. Glossed as "temple" (i.e. Buddhist temple) in B24 table 7-10 (the list of extension characters).
  b. Small form (70% size per code chart / table 7-10) of a kanji character. Shown here simulated. Private Use Area code points shown are those used by the Nishiki-teki font.[13]
  c. Musical abbreviation (or half thereof) not present in Unicode, simulated here with multiple characters. Private Use Area code points shown are those used by the Nishiki-teki font.

References

  1. ARIB (2008), p. 105, part 2, section 7.3
  2. ARIB (2008)
  3. Suignard, Michel (2008-03-11). "ISO/IEC JTC1/SC2/WG2 N 3397: Japanese TV Symbols" (PDF).
  4. "Unicode 5.2 Emoji List". Emojipedia.
  5. ARIB (2014), pp. 33–50, part 2, Table 5-2
  6. ARIB (2008), pp. 48–52
  7. ARIB (2008), p. 39, part 2, Table 7-3
  8. Japanese National Committee on ISO/TC97/SC2 (1984-07-01). Japanese Graphic Character Set for Information Interchange (PDF). ITSCJ/IPSJ. ISO-IR-87.
  9. RFC 1468 (IETF)
  10. ARIB (2008), p. 72
  11. ARIB (2008), pp. 54–72, part 2, Table 7-10
  12. ARIB (2008), pp. 46–47, part 2, Table 7-4
  13. "Nishiki-teki Version 3.82b (2021-07-23) - 6,416 characters in the Private Use Areas" (PDF).
  14. ARIB (2008), p. 48, part 2, Table 7-5
  15. ARIB (2008), p. 50, part 2, Table 7-7
  16. ARIB (2008), p. 49, part 2, Table 7-6
  17. ARIB (2008), p. 52, part 2, Table 7-9
  • Data Coding and Transmission Specification for Digital Broadcasting (PDF) (ARIB Standard). 5.2-E1. Vol. 1. Association of Radio Industries and Businesses (ARIB). 2008-06-06 [1999-10-26]. ARIB STD-B24. Archived (PDF) from the original on 2017-07-10. Retrieved 2017-07-10.
  • Multimedia Coding Specification for Digital Broadcasting (Second Generation) (PDF) (ARIB Standard). 1.0-E1. Vol. 1. Association of Radio Industries and Businesses (ARIB). 2014-07-31. ARIB STD-B62. Retrieved 2019-02-11.

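Further reading

  • Lunde, Ken Roger (December 2008). CJKV Information Processing (2 ed.). O'Reilly. ISBN 978-0-596-51447-1.
  • Lunde, Ken Roger (December 1998). CJKV Information Processing (1 ed.). O'Reilly. ISBN 1-56592-224-7. (NB. Translated into Japanese and Chinese in 2002.)

External links

  • Official changelog for ARIB STD-B24 (in Japanese)
  • STD-B24 and others, List of ARIB Standards in the Field of Broadcasting (ARIB)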
