ISO-8859-1 (ISO Latin 1) Character Encoding
Contents
- The characters at a glance
- Character codes and names
- Notes for html documents
- Other notes
- Additional references
The characters at a glance
Here are all the printable characters, in collating order:
! " # $ % & ' ( ) * + , - . /
0 1 2 3 4 5 6 7 8 9 : ; < = > ? @
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
[ ] ^ _ `
a b c d e f g h i j k l m n o p q r s t u v w x y z
{ | } ~
¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ ® ¯
° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿
À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï Ð Ñ Ò Ó Ô Õ Ö
×
Ø Ù Ú Û Ü Ý Þ
ß
à á â ã ä å æ ç è é ê ë ì í î ï ð ñ ò ó ô õ ö
÷
ø ù ú û ü ý þ
ÿ
The first six rows are the ASCII character set.
Note the ordinary ASCII space (before `!') and the ISO Latin-1 non-breaking space (before `¡')
Character codes and names
The columns show, in order:
- HTML: the HTML notation (decimal);
- OCTL: the C/Modula-3 octal notation;
- HEX: the hexadecimal code, as used e.g. in MIME quoted-printable encoding;
- CMP: the Sun/X11 "Compose" key combinations;
- CHR: the charater itself, when HTML-printable;
- MEANING: the ISO-8859-1 (and ASCII) definition.
HTML OCTL HEX CMP CHR MEANING
------ + ---- + --- + --- + --- + ------------------------------
� | 00 | =00 | | | Invalid (ASCII NUL)
 | 01 | =01 | | | Unused (ASCII SOH)
 | 02 | =02 | | | Unused (ASCII STX)
 | 03 | =03 | | | Unused (ASCII ETX)
 | 04 | =04 | | | Unused (ASCII EOT)
 | 05 | =05 | | | Unused (ASCII ENQ)
 | 06 | =06 | | | Unused (ASCII ACK)
 | 07 | =07 | | | Unused (ASCII BEL, audible bell)
 | 10 | =08 | | | Unused (ASCII BS, backspace)
	 | 11 | =09 | | | Horizontal tab (ASCII HT)

 | 12 | =0A | | | Line feed (ASCII NL, newline)
 | 13 | =0B | | | Unused (ASCII VT, vertical tab)
 | 14 | =0C | | | Unused (ASCII NP, new page)

 | 15 | =0D | | | Carriage Return (ASCII CR)
 | 16 | =0E | | | Unused (ASCII SO)
 | 17 | =0F | | | Unused (ASCII SI)
 | 20 | =10 | | | Unused (ASCII DLE)
 | 21 | =11 | | | Unused (ASCII DC1)
 | 22 | =12 | | | Unused (ASCII DC2)
 | 23 | =13 | | | Unused (ASCII DC3)
 | 24 | =14 | | | Unused (ASCII DC4)
 | 25 | =15 | | | Unused (ASCII NAK)
 | 26 | =16 | | | Unused (ASCII SYN)
 | 27 | =17 | | | Unused (ASCII ETB)
 | 30 | =18 | | | Unused (ASCII CAN)
 | 31 | =19 | | | Unused (ASCII EM)
 | 32 | =1A | | | Unused (ASCII SUB)
 | 33 | =1B | | | Unused (ASCII ESC, escape)
 | 34 | =1C | | | Unused (ASCII FS)
 | 35 | =1D | | | Unused (ASCII GS)
 | 36 | =1E | | | Unused (ASCII RS)
 | 37 | =1F | | | Unused (ASCII US)
  | 40 | =20 | | ( ) | Space (ASCII SP)
! | 41 | =21 | | (!) | Exclamation mark
" | 42 | =22 | | (") | Quotation mark (")
# | 43 | =23 | | (#) | Number sign
$ | 44 | =24 | | ($) | Dollar sign
% | 45 | =25 | | (%) | Percent sign
& | 46 | =26 | | (&) | Ampersand (&)
' | 47 | =27 | | (') | Apostrophe (right single quote)
( | 50 | =28 | | (() | Left parenthesis
) | 51 | =29 | | ()) | Right parenthesis
* | 52 | =2A | | (*) | Asterisk
+ | 53 | =2B | | (+) | Plus sign
, | 54 | =2C | | (,) | Comma
- | 55 | =2D | | (-) | Hyphen
. | 56 | =2E | | (.) | Period (fullstop)
/ | 57 | =2F | | (/) | Solidus (slash)
0 | 60 | =30 | | (0) | Digit 0
. . .
9 | 71 | =39 | | (9) | Digit 9
: | 72 | =3A | | (:) | Colon
; | 73 | =3B | | (;) | Semi-colon
< | 74 | =3C | | (<) | Less than (<)
= | 75 | =3D | | (=) | Equals sign
> | 76 | =3E | | (>) | Greater than (>)
? | 77 | =3F | | (?) | Question mark
@ | 100 | =40 | | (@) | Commercial at-sign
A | 101 | =41 | | (A) | Uppercase letter A
. . .
Z | 132 | =5A | | (Z) | Uppercase letter Z
[ | 133 | =5B | | ([) | Left square bracket
\ | 134 | =5C | | () | Reverse solidus (backslash)
] | 135 | =5D | | (]) | Right square bracket
^ | 136 | =5E | | (^) | Caret
_ | 137 | =5F | | (_) | Horizontal bar (underscore)
` | 140 | =60 | | (`) | Reverse apostrophe (left single quote)
a | 141 | =61 | | (a) | Lowercase letter a
. . .
z | 172 | =7A | | (z) | Lowercase letter z
{ | 173 | =7B | | ({) | Left curly brace
| | 174 | =7C | | (|) | Vertical bar
} | 175 | =7D | | (}) | Right curly brace
~ | 176 | =7E | | (~) | Tilde
 | 177 | =7F | | | Unused (ASCII DEL)
€ | 200 | =80 | | | Unused
 | 201 | =81 | | | Unused
‚ | 202 | =82 | | | Unused
ƒ | 203 | =83 | | | Unused
„ | 204 | =84 | | | Unused
… | 205 | =85 | | | Unused
† | 206 | =86 | | | Unused
‡ | 207 | =87 | | | Unused
ˆ | 210 | =88 | | | Unused
‰ | 211 | =89 | | | Unused
Š | 212 | =8A | | | Unused
‹ | 213 | =8B | | | Unused
Œ | 214 | =8C | | | Unused
 | 215 | =8D | | | Unused
Ž | 216 | =8E | | | Unused
 | 217 | =8F | | | Unused
 | 220 | =90 | | | Unused
‘ | 221 | =91 | | | Unused
’ | 222 | =92 | | | Unused
“ | 223 | =93 | | | Unused
” | 224 | =94 | | | Unused
• | 225 | =95 | | | Unused
– | 226 | =96 | | | Unused
— | 227 | =97 | | | Unused
˜ | 230 | =98 | | | Unused
™ | 231 | =99 | | | Unused
š | 232 | =9A | | | Unused
› | 233 | =9B | | | Unused
œ | 234 | =9C | | | Unused
 | 235 | =9D | | | Unused
ž | 236 | =9E | | | Unused
Ÿ | 237 | =9F | | | Unused
  | 240 | =A0 | | ( ) | Non-breaking space ( )
¡ | 241 | =A1 | ! ! | (¡) | Inverted exclamation
¢ | 242 | =A2 | c / | (¢) | Cent sign
£ | 243 | =A3 | l - | (£) | Pound sterling
¤ | 244 | =A4 | o x | (¤) | General currency sign
¥ | 245 | =A5 | y - | (¥) | Yen sign
¦ | 246 | =A6 | | | | (¦) | Broken vertical bar
§ | 247 | =A7 | s o | (§) | Section sign
¨ | 250 | =A8 | " " | (¨) | Umlaut (dieresis)
© | 251 | =A9 | c o | (©) | Copyright
ª | 252 | =AA | - a | (ª) | Feminine ordinal
« | 253 | =AB | < < | («) | Left angle quote, guillemotleft
¬ | 254 | =AC | - , | (¬) | Not sign
­ | 255 | =AD | - - | () | Soft hyphen
® | 256 | =AE | r o | (®) | Registered trademark
¯ | 257 | =AF | ^ - | (¯) | Macron accent
° | 260 | =B0 | ^ * | (°) | Degree sign
± | 261 | =B1 | + - | (±) | Plus or minus
² | 262 | =B2 | ^ 2 | (²) | Superscript two
³ | 263 | =B3 | ^ 3 | (³) | Superscript three
´ | 264 | =B4 | | (´) | Acute accent
µ | 265 | =B5 | / u | (µ) | Micro sign
¶ | 266 | =B6 | P ! | (¶) | Paragraph sign
· | 267 | =B7 | ^ . | (·) | Middle dot
¸ | 270 | =B8 | , , | (¸) | Cedilla
¹ | 271 | =B9 | ^ 1 | (¹) | Superscript one
º | 272 | =BA | _ o | (º) | Masculine ordinal
» | 273 | =BB | > > | (») | Right angle quote, guillemotright
¼ | 274 | =BC | 1 4 | (¼) | Fraction one-fourth
½ | 275 | =BD | 1 2 | (½) | Fraction one-half
¾ | 276 | =BE | 3 4 | (¾) | Fraction three-fourths
¿ | 277 | =BF | ? ? | (¿) | Inverted question mark
À | 300 | =C0 | A ` | (À) | Capital A, grave accent
Á | 301 | =C1 | A ' | (Á) | Capital A, acute accent
 | 302 | =C2 | A ^ | (Â) | Capital A, circumflex accent
à | 303 | =C3 | A ~ | (Ã) | Capital A, tilde
Ä | 304 | =C4 | A " | (Ä) | Capital A, dieresis or umlaut mark
Å | 305 | =C5 | A * | (Å) | Capital A, ring
Æ | 306 | =C6 | A E | (Æ) | Capital AE dipthong (ligature)
Ç | 307 | =C7 | C , | (Ç) | Capital C, cedilla
È | 310 | =C8 | E ` | (È) | Capital E, grave accent
É | 311 | =C9 | E ' | (É) | Capital E, acute accent
Ê | 312 | =CA | E ^ | (Ê) | Capital E, circumflex accent
Ë | 313 | =CB | E " | (Ë) | Capital E, dieresis or umlaut mark
Ì | 314 | =CC | I ` | (Ì) | Capital I, grave accent
Í | 315 | =CD | I ' | (Í) | Capital I, acute accent
Î | 316 | =CE | I ^ | (Î) | Capital I, circumflex accent
Ï | 317 | =CF | I " | (Ï) | Capital I, dieresis or umlaut mark
Ð | 320 | =D0 | D - | (Ð) | Capital Eth, Icelandic
Ñ | 321 | =D1 | N ~ | (Ñ) | Capital N, tilde
Ò | 322 | =D2 | O ` | (Ò) | Capital O, grave accent
Ó | 323 | =D3 | O ' | (Ó) | Capital O, acute accent
Ô | 324 | =D4 | O ^ | (Ô) | Capital O, circumflex accent
Õ | 325 | =D5 | O ~ | (Õ) | Capital O, tilde
Ö | 326 | =D6 | O " | (Ö) | Capital O, dieresis or umlaut mark
× | 327 | =D7 | x x | (×) | Multiply sign
Ø | 330 | =D8 | O / | (Ø) | Capital O, slash
Ù | 331 | =D9 | U ` | (Ù) | Capital U, grave accent
Ú | 332 | =DA | U ' | (Ú) | Capital U, acute accent
Û | 333 | =DB | U ^ | (Û) | Capital U, circumflex accent
Ü | 334 | =DC | U " | (Ü) | Capital U, dieresis or umlaut mark
Ý | 335 | =DD | Y ' | (Ý) | Capital Y, acute accent
Þ | 336 | =DE | P | | (Þ) | Capital THORN, Icelandic
ß | 337 | =DF | s s | (ß) | Small sharp s, German (sz ligature)
à | 340 | =E0 | a ` | (à) | Small a, grave accent
á | 341 | =E1 | a ' | (á) | Small a, acute accent
â | 342 | =E2 | a ^ | (â) | Small a, circumflex accent
ã | 343 | =E3 | a ~ | (ã) | Small a, tilde
ä | 344 | =E4 | a " | (ä) | Small a, dieresis or umlaut mark
å | 345 | =E5 | a * | (å) | Small a, ring
æ | 346 | =E6 | a e | (æ) | Small ae dipthong (ligature)
ç | 347 | =E7 | c , | (ç) | Small c, cedilla
è | 350 | =E8 | e ` | (è) | Small e, grave accent
é | 351 | =E9 | e ' | (é) | Small e, acute accent
ê | 352 | =EA | e ^ | (ê) | Small e, circumflex accent
ë | 353 | =EB | e " | (ë) | Small e, dieresis or umlaut mark
ì | 354 | =EC | i ` | (ì) | Small i, grave accent
í | 355 | =ED | i ' | (í) | Small i, acute accent
î | 356 | =EE | i ^ | (î) | Small i, circumflex accent
ï | 357 | =EF | i " | (ï) | Small i, dieresis or umlaut mark
ð | 360 | =F0 | d - | (ð) | Small eth, Icelandic
ñ | 361 | =F1 | n ~ | (ñ) | Small n, tilde
ò | 362 | =F2 | o ` | (ò) | Small o, grave accent
ó | 363 | =F3 | o ' | (ó) | Small o, acute accent
ô | 364 | =F4 | o ^ | (ô) | Small o, circumflex accent
õ | 365 | =F5 | o ~ | (õ) | Small o, tilde
ö | 366 | =F6 | o " | (ö) | Small o, dieresis or umlaut mark
÷ | 367 | =F7 | - : | (÷) | Division sign
ø | 370 | =F8 | o / | (ø) | Small o, slash
ù | 371 | =F9 | u ` | (ù) | Small u, grave accent
ú | 372 | =FA | u ' | (ú) | Small u, acute accent
û | 373 | =FB | u ^ | (û) | Small u, circumflex accent
ü | 374 | =FC | u " | (ü) | Small u, dieresis or umlaut mark
ý | 375 | =FD | y ' | (ý) | Small y, acute accent
þ | 376 | =FE | p | | (þ) | Small thorn, Icelandic
ÿ | 377 | =FF | y " | (ÿ) | Small y, dieresis or umlaut mark
Notes for HTML documents
-
HTML entity names are given in the "MEANING" column only for ampersand, quote, less than, and greater than, which are significant in HTML syntax; and for the non-breaking space, which may be confused with ordinary space. HTML entity names exist for many other characters, but they are superfluous: the ISO-8859-1 eight-bit codes will work, by definition, on any browser.
-
The characters carriage return (ASCII CR) and line feed (ASCII NL, newline) are equivalent; they are treated as whitespace, except in <pre> contexts, where they force a line break. (However, a line feed is ignored if it immediately follows a carriage return.)
-
The horizontal tab character (ASCII HT) skips to the next tabbing column in <pre> contexts, and is treated as whitespace elsewhere.
-
The non-breaking space ( ) is honored even in non-<pre> contexts, and can be used to insert extra space between words, images, etc., like this: | |.
Other notes
-
Alternative Sun/X11 "Compose" sequences for the Icelandic "thorn" are "t h" (þ, lowercase) and "T H" (Þ, uppercase).
-
Note that the Sun/X11 "Compose" sequence for masculine ordinal (º) uses an underscore, while the feminine ordinal (ª) uses a minus sign. It takes a lot of imagination to come up with such ideas...
Additional references
- Martin Ramsch's iso8859-1 table.
- The HTML 2.0 Standard [Character Entity Sets] [HTML Coded Character Set]
- The HTML 3.0 specification [Latin-1 Character Entities]
- The HTML+ Discussion Document [Appendix II]
- An exhaustive entity table including HTML 2.0, HTML3.0, HTML+, with Postscript equivalents.
Composed by J. Stolfi from several sources found throughout the net.
http://www.ic.unicamp.br/~stolfi/EXPORT/www/ISO-8859-1-Encoding.html