|
Notes Spring 2009 |
The ASCII Character SetA text document is broken into individual characters. There are a finite number of characters to represent. So the approach is to list them all and assign each a binary string. To store a particular letter, we store the appropriate bit string. What characters do we worry about? Upper and lower case letters, punctuation, numeric digits, white space, special symbols. What symbols would we need if we were not using English? A character set is a list of characters and the codes used to represent them. Several codes have been used over the years. IBM created a code for its mainframe computers. It is the Extended Binary Coded Decimal Information Code (EBCDIC). It is an 8-bit code. Then we got American Standard Code for Information Interchange (ASCII), used by PC's. It began as an 7-bit code with the extra bit used for a variety of purposes including checking that the code was transmitted correctly. But 7 bits allow only 128 characters. This is fine if everyone communicates in English, but a few other languages exist. The 7-bit code was extended to be an 8-bit code which allows 256 different characters. Now other languages can be included: Spanish, German, French and most other European languages. The third system in the Unicode Character Set which uses 16 bits per character. Now we can express over 65.000 characters. More languages can be represented, including Russian, Thai, Greek, Cherokee, Braille, Chinese/Japanese/Korean, Mathematical symbols, and special symbols such as ₤ and ™. In future versions even Klingon will be supported. Below is the table of printable and nonprintable ASCII characters.
Computers can layer encodings to virtually any level of complexity. Numbers can be interpreted as characters, which can be interpreted in sets as Web pages, which can be interpreted to appear as multiple fonts and styles. But at the bottommost level, the computer only "knows" voltages, which we interpret as numbers. | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Syllabus | Notes |