Each language has at least one character set, the characters that are legally allowed in programs.
A source character set is the character set used for writing source code.
An execution character set is a character set used when a program is running and usually includes any characters that may appear in either input or putput.
table of contents
If you like the idea of this project,
then please donate some money.
more information on donating
Each language has at least one character set, the characters that are legally allowed in programs.
A source character set is the character set used for writing source code.
An execution character set is a character set used when a program is running and usually includes any characters that may appear in either input or output.
The first computer keyboards were usually teletypes or teletype-compaible. These early keyboards traced back to closely related keyboards used for telegraph networks.
The code used for these teletypes was based on a code invented by Frenchman Emile Baudot in 1870. Only the original code was Baudot Code, but in slang all subsequent codes were also called Baudot Code.
Baudot Code uses five bits, which allows 32 different character codes. Using two characters to indicate a shift between interpretations (letter shift and figures shift), it was possible to encode all of the capital letters, the decimal digits, a few punctuation marks, and special control codes.
The resulting character set explains the small character sets of several early programming languages, such as BASIC, COBOL, FORTRAN, LISP, and PL/I.
Baudot includes ABCDEFGHIJKLMNOPQRSTUVWXYZ and 012345678, as well as punctuation -$',!:(")#&./;+= and control codes: Space, Bell, Carriage Return (CR), and Line Feed (LF). Not all teletypes had += or ;"
| Binary | Decimal | Hex | Octal | Letter | U.S. Figures |
CCITT No. 2 Figures |
|---|---|---|---|---|---|---|
| 00000 | 0 | 0 | 0 | N/A | N/A | N/A |
| 00001 | 1 | 1 | 1 | E | 3 | 3 |
| 00010 | 2 | 2 | 2 | LF | LF | LF |
| 00011 | 3 | 3 | 3 | A | - | - |
| 00100 | 4 | 4 | 4 | Space | Space | Space |
| 00101 | 5 | 5 | 5 | S | BELL | ' |
| 00110 | 6 | 6 | 6 | I | 8 | 8 |
| 00111 | 7 | 7 | 7 | U | 7 | 7 |
| 01000 | 8 | 8 | 10 | CR | CR | CR |
| 01001 | 9 | 9 | 11 | D | $ | WRU |
| 01010 | 10 | A | 12 | R | 4 | 4 |
| 01011 | 11 | B | 13 | J | ' | Bell |
| 01100 | 12 | C | 14 | N | , | , |
| 01101 | 13 | D | 15 | F | ! | ! |
| 01110 | 14 | E | 16 | C | : | : |
| 01111 | 15 | F | 17 | K | ( | ( |
| 10000 | 16 | 10 | 20 | T | 5 | 5 |
| 10001 | 17 | 11 | 21 | Z | " | + |
| 10010 | 18 | 12 | 22 | L | ) | ) |
| 10011 | 19 | 13 | 23 | W | 2 | 2 |
| 10100 | 20 | 14 | 24 | H | # | £ |
| 10101 | 21 | 15 | 25 | Y | 6 | 6 |
| 10110 | 22 | 16 | 26 | P | 0 | 0 |
| 10111 | 23 | 17 | 27 | Q | 1 | 1 |
| 11000 | 24 | 18 | 30 | O | 9 | 9 |
| 11001 | 25 | 19 | 31 | B | ? | ? |
| 11010 | 26 | 1A | 32 | G | & | & |
| 11011 | 27 | 1B | 33 | Figures Shift | Figures Shift | Figures Shift |
| 11100 | 28 | 1C | 34 | M | . | . |
| 11101 | 29 | 1D | 35 | X | / | / |
| 11110 | 30 | 1E | 36 | V | ; | = |
| 11111 | 31 | 1F | 37 | Letters Shift | Letters Shift | Letters Shift |
| name | C | PL/I |
|---|---|---|
| A | A | |
| B | B | |
| C | C | |
| D | D | |
| E | E | |
| F | F | |
| G | G | |
| H | H | |
| I | I | |
| J | J | |
| K | K | |
| L | L | |
| M | M | |
| N | N | |
| O | O | |
| P | P | |
| Q | Q | |
| R | R | |
| S | S | |
| T | T | |
| U | U | |
| V | V | |
| W | W | |
| X | X | |
| Y | Y | |
| Z | Z | |
| name | C | PL/I |
| a | ||
| b | ||
| c | ||
| d | ||
| e | ||
| f | ||
| g | ||
| h | ||
| i | ||
| j | ||
| k | ||
| l | ||
| m | ||
| n | ||
| o | ||
| p | ||
| q | ||
| r | ||
| s | ||
| t | ||
| u | ||
| v | ||
| w | ||
| x | ||
| y | ||
| z | ||
| name | C | PL/I |
| 0 | 0 | |
| 1 | 1 | |
| 2 | 2 | |
| 3 | 3 | |
| 4 | 4 | |
| 5 | 5 | |
| 6 | 6 | |
| 7 | 7 | |
| 8 | 8 | |
| 9 | 9 | |
| name | C | PL/I |
| alert bell |
\a ANSI |
|
| backspace BS |
\b formatting character |
|
| horizontal tab HT |
\t formatting character |
|
| vertical tab VT |
\v formatting character |
|
| form feed FF |
\f< formatting character/td> | |
| carriage return CR |
\r formatting character |
|
| space blank |
||
| exclamation point | ! | |
| double quote | " \" |
|
| number sign | # | |
| dollar sign | $ extra graphic |
|
| percent sign | % | % |
| ampersand | & | & AND symbol |
| apostrophe single quotation mark |
' \' |
' |
| left parenthesis | ( | ( |
| right parenthesis | ) | ) |
| asterisk | * | * multiplication symbol |
| plus sign | + | + |
| comma | , | , |
| hyphen or minus sign |
- | - |
| period point |
. | . |
| slash forward slash |
/ | / divide symbol |
| colon | : | : |
| semicolon | : | ; |
| less than | < | < |
| equal | = | = also assignment symbol |
| greater than | > | > |
| question mark | ? \? ANSI |
? |
| at sign | @ extra graphic |
|
| left bracket | [ | |
| backslash | \ \\ |
|
| right bracket | ] | |
| circumflex | ^ | |
| underscore | _ | _ break character |
| accent grave | ` extra graphic |
|
| left brace | [ | |
| vertical bar stroke |
| | | OR symbol |
| right brace | ] | |
| tilde | ~ | |
| NOT symbol | ¬ | |
| null | must equal zero | |
| newline end-of-line marker |
\n |
The official character set for Python is the orginal ASCII character codes (0 to 127 inclusive).
You can use additional characters in both comments and string literals if you declare a character coding.
# _*_ coding: utf-8 _*_
# _*_ coding: iso-8859-1 _*_
The character set coding line must go at the beginning of the source file (or immediately after the she-bang or hash-bang line).
Coding example: I am making heavily documented and explained open source code for a method to play music for free almost any song, no subscription fees, no download costs, no advertisements, all completely legal. This is done by building a front-end to YouTube (which checks the copyright permissions for you).
View music player in action: www.musicinpublic.com/.
Create your own copy from the original source code/ (presented for learning programming).
return to table of contents
free downloadable college text book
Because I no longer have the computer and software to make PDFs, the book is available as an HTML file, which you can convert into a PDF.
Building a free downloadable text book on computer programming for university, college, community college, and high school classes in computer programming.
If you like the idea of this project,
then please donate some money.
send donations to:
Milo
PO Box 1361
Tustin, California 92781
Supporting the entire project:
If you have a business or organization that can support the entire cost of this project, please contact Pr Ntr Kmt (my church)
This web site handcrafted on Macintosh computers using Tom Benders Tex-Edit Plus and served using FreeBSD .
UNIX used as a generic term unless specifically used as a trademark (such as in the phrase UNIX certified). UNIX is a registered trademark in the United States and other countries, licensed exclusively through X/Open Company Ltd.
Names and logos of various OSs are trademarks of their respective owners.
Copyright © 2010, 2011 Milo
Created: November 10, 2010
Last Updated: March 21, 2011
return to table of contents
free downloadable college text book