fbpx
Wikipedia

ZX81 character set

The ZX81 character set is the character encoding used by the Sinclair Research ZX81 family of microcomputers including the Timex Sinclair 1000 and Timex Sinclair 1500. The encoding uses one byte per character for 256 code points. It has no relationship with previously established ones like ASCII or EBCDIC, but it is related though not identical to the character set of the predecessor ZX80.

The Sinclair ZX81 character set rendered in the system font.

Printable characters

 
Screenshot of a ZX81 8K BASIC program that demonstrates all code points including BASIC keywords and nonprintable characters, rendered as question marks.

The character set has 64 unique glyphs present at code points 0–63. With the most significant bit set the character is generated in inverse video; corresponding to code points 128–191. These 128 values are the only displayable ones allowed in the video memory (known as the display file). The remaining code points (64–127 and 192–255) are used as control characters such as 118 for newline or, uniquely to Sinclair BASIC, for keywords, while some are unused.

The small effective range of only 64 unique glyphs precludes support for Latin lower case letters, and many symbols used widely in computing such as the exclamation point and the at sign. The lack of an apostrophe led some software authors to use a comma instead.

There are 11 block graphics characters, counting code point 0 which also doubles as space. The first 8 of these together with their 8 inverse video versions (16 code points) provide every combination of the character cell divided into 2×2 black-and-white block pixels for low-resolution 64×48 pixel graphics. These 2×2 blocks are present in the Block Elements Unicode block. An additional 3 characters provide a cell divided into 1×2 black, white or dithered gray wide block pixels. These, in combination with their inverse video versions and some of the previous 2×2 blocks provides for a 32×48 resolution with 3 levels (white, dithered gray, black). The basic 11 characters plus their inverse video versions, makes for 22 block graphics characters in total. The dithered characters (of which there are 6) are also available in Unicode (mostly in the Symbols for Legacy Computing block), but only in Unicode versions 13.0 and newer, available from 2020 onwards.

Code point 11 is the double-quote (") symbol when used in the display file. The BASIC function CHR$ 192 prints as the same character but is shown as "" in BASIC source listings; it is used for including the literal " character in a string without conflict with the " string delimiter.[1]

Changes from the ZX80

The character set in the ZX81 was derived from the ZX80 character set. They have mostly the same code points, e.g. for A-Z and 0-9, but the code points are different for the block graphics characters, the symbols ", -, +, *, /, =, >, <, and the BASIC keyword tokens (with many new added). There are also changes to the control characters. Code point 1 is no longer an unprintable string terminator. The ZX81 8K BASIC ROM was also available as an upgrade for the ZX80, replacing its integer-only 4K BASIC ROM.[2]

The ZX81 system font uses an 8×8 pixel-per-character grid where most glyphs fit in 6×6 pixels leaving two pixels horizontal and vertical space between rows and columns. The ZX80's ROM which had slightly wider 7×6 pixel glyphs with only one pixel horizontal space between them. Some glyphs also received a different design in the ZX81 system font, noticeable on the *, the slashed and less rounded 0, and the less rounded $, C, G and J.

In the later ZX Spectrum the entire character encoding was replaced with the ZX Spectrum character set, which is a derivative of ASCII and includes lower case letters and more.

Character set

ZX81 character set[1][3]
0 1 2 3 4 5 6 7 8 9 A B C D E F
0x  SP                      "[a] £ $ : ?
1x ( ) > < = + - * / ; , . 0 1 2 3
2x 4 5 6 7 8 9 A B C D E F G H I J
3x K L M N O P Q R S T U V W X Y Z
4x RND INKEY$ PI
5x
6x
7x UP DOWN LEFT RIGHT graphics EDIT newline rubout K/L
MODE
function number cursor
8x                       " £ $ : ?
9x ( ) > < = + - * / ; , . 0 1 2 3
Ax 4 5 6 7 8 9 A B C D E F G H I J
Bx K L M N O P Q R S T U V W X Y Z
Cx ""[a] AT TAB CODE VAL LEN SIN COS TAN ASN ACS ATN LN EXP INT
Dx SQR SGN ABS PEEK USR STR$ CHR$ NOT **[b] OR AND <=[c] >=[d] <>[e] THEN TO
Ex STEP LPRINT LLIST STOP SLOW FAST NEW SCROLL CONT DIM REM FOR GOTO GOSUB INPUT LOAD
Fx LIST LET PAUSE NEXT POKE PRINT PLOT RUN SAVE RAND IF CLS UNPLOT CLEAR RETURN COPY
  Sinclair BASIC tokenizes keywords into single-byte code points.

See also

Notes

  1. ^ a b Code point 11 is the double-quote (") symbol when used in the video memory (called the display file). Code point 192, CHR$ 192, prints as the same character but shows as "" in BASIC listings and is used for including the literal " character in a string without conflict with the " string delimiter.[1]
  2. ^ The raise to a power multi-character operator tokenized into a single-byte code point.
  3. ^ The greater than or equal to multi-character operator tokenized into a single-byte code point.
  4. ^ The less than or equal to multi-character operator tokenized into a single-byte code point.
  5. ^ The not equal sign multi-character operator tokenized into a single-byte code point.

References

  1. ^ a b c Vickers, Steven (1981). Sinclair ZX81 BASIC Programming. Sinclair Research Ltd.
  2. ^ "8K BASIC ROM UPGRADE".
  3. ^ Wearmouth, Geoff. . Archived from the original on August 15, 2015.

zx81, character, character, encoding, used, sinclair, research, zx81, family, microcomputers, including, timex, sinclair, 1000, timex, sinclair, 1500, encoding, uses, byte, character, code, points, relationship, with, previously, established, ones, like, ascii. The ZX81 character set is the character encoding used by the Sinclair Research ZX81 family of microcomputers including the Timex Sinclair 1000 and Timex Sinclair 1500 The encoding uses one byte per character for 256 code points It has no relationship with previously established ones like ASCII or EBCDIC but it is related though not identical to the character set of the predecessor ZX80 The Sinclair ZX81 character set rendered in the system font Contents 1 Printable characters 2 Changes from the ZX80 3 Character set 4 See also 5 Notes 6 ReferencesPrintable characters Edit Screenshot of a ZX81 8K BASIC program that demonstrates all code points including BASIC keywords and nonprintable characters rendered as question marks The character set has 64 unique glyphs present at code points 0 63 With the most significant bit set the character is generated in inverse video corresponding to code points 128 191 These 128 values are the only displayable ones allowed in the video memory known as the display file The remaining code points 64 127 and 192 255 are used as control characters such as 118 for newline or uniquely to Sinclair BASIC for keywords while some are unused The small effective range of only 64 unique glyphs precludes support for Latin lower case letters and many symbols used widely in computing such as the exclamation point and the at sign The lack of an apostrophe led some software authors to use a comma instead There are 11 block graphics characters counting code point 0 which also doubles as space The first 8 of these together with their 8 inverse video versions 16 code points provide every combination of the character cell divided into 2 2 black and white block pixels for low resolution 64 48 pixel graphics These 2 2 blocks are present in the Block Elements Unicode block An additional 3 characters provide a cell divided into 1 2 black white or dithered gray wide block pixels These in combination with their inverse video versions and some of the previous 2 2 blocks provides for a 32 48 resolution with 3 levels white dithered gray black The basic 11 characters plus their inverse video versions makes for 22 block graphics characters in total The dithered characters of which there are 6 are also available in Unicode mostly in the Symbols for Legacy Computing block but only in Unicode versions 13 0 and newer available from 2020 onwards Code point 11 is the double quote symbol when used in the display file The BASIC function CHR 192 prints as the same character but is shown as in BASIC source listings it is used for including the literal character in a string without conflict with the string delimiter 1 Changes from the ZX80 EditThe character set in the ZX81 was derived from the ZX80 character set They have mostly the same code points e g for A Z and 0 9 but the code points are different for the block graphics characters the symbols gt lt and the BASIC keyword tokens with many new added There are also changes to the control characters Code point 1 is no longer an unprintable string terminator The ZX81 8K BASIC ROM was also available as an upgrade for the ZX80 replacing its integer only 4K BASIC ROM 2 The ZX81 system font uses an 8 8 pixel per character grid where most glyphs fit in 6 6 pixels leaving two pixels horizontal and vertical space between rows and columns The ZX80 s ROM which had slightly wider 7 6 pixel glyphs with only one pixel horizontal space between them Some glyphs also received a different design in the ZX81 system font noticeable on the the slashed and less rounded 0 and the less rounded C G and J In the later ZX Spectrum the entire character encoding was replaced with the ZX Spectrum character set which is a derivative of ASCII and includes lower case letters and more Character set EditZX81 character set 1 3 0 1 2 3 4 5 6 7 8 9 A B C D E F0x SP a 1x gt lt 0 1 2 32x 4 5 6 7 8 9 A B C D E F G H I J3x K L M N O P Q R S T U V W X Y Z4x RND INKEY PI5x6x7x UP DOWN LEFT RIGHT graphics EDIT newline rubout K L MODE function number cursor8x 9x gt lt 0 1 2 3Ax 4 5 6 7 8 9 A B C D E F G H I JBx K L M N O P Q R S T U V W X Y ZCx a AT TAB CODE VAL LEN SIN COS TAN ASN ACS ATN LN EXP INTDx SQR SGN ABS PEEK USR STR CHR NOT b OR AND lt c gt d lt gt e THEN TOEx STEP LPRINT LLIST STOP SLOW FAST NEW SCROLL CONT DIM REM FOR GOTO GOSUB INPUT LOADFx LIST LET PAUSE NEXT POKE PRINT PLOT RUN SAVE RAND IF CLS UNPLOT CLEAR RETURN COPY Sinclair BASIC tokenizes keywords into single byte code points See also EditZX80 character set ZX Spectrum character set ATASCII Atari ST character set PETSCII Extended ASCIINotes Edit a b Code point 11 is the double quote symbol when used in the video memory called the display file Code point 192 CHR 192 prints as the same character but shows as in BASIC listings and is used for including the literal character in a string without conflict with the string delimiter 1 The raise to a power multi character operator tokenized into a single byte code point The greater than or equal to multi character operator tokenized into a single byte code point The less than or equal to multi character operator tokenized into a single byte code point The not equal sign multi character operator tokenized into a single byte code point References Edit a b c Vickers Steven 1981 Sinclair ZX81 BASIC Programming Sinclair Research Ltd 8K BASIC ROM UPGRADE Wearmouth Geoff An Assembly Listing of the Operating System of the ZX81 ROM Archived from the original on August 15 2015 Retrieved from https en wikipedia org w index php title ZX81 character set amp oldid 1135234708, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.