    HTML Character Sets

    Character sets determine how the bytes that represent the text of your HTML document are translated into readable characters. A Web browser interprets the bytes in your document according to the character set that applies to it. Numeric character references, whether decimal or hexadecimal ("&#12345;" or "&#x1234;"), are interpreted as ISO 10646 code points, consistent with the Unicode Standard, version 2.0, and independent of the chosen character set. Named entities ("&amp;") are likewise displayed independently of the chosen character set. Displaying an arbitrary numeric character reference requires a font on the user's system that contains that character. Accordingly, the content in the first column of the following tables may not render as expected on all systems.
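As a rough illustration of the point about character references, the sketch below uses Python's standard `html.unescape` to resolve a decimal reference, a hexadecimal reference, and a named entity after the same markup has passed through three different byte encodings. The choice of Python, the variable names, and the three encodings are assumptions made for this example only.

```python
import html

# Decimal, hexadecimal, and named references; all plain ASCII in the markup.
markup = "&#12345; &#x1234; &amp;"

results = {}
for encoding in ("utf-8", "windows-1251", "shift_jis"):
    raw = markup.encode(encoding)            # bytes as they might be served
    results[encoding] = html.unescape(raw.decode(encoding))

# The resolved code points do not depend on the encoding used above.
code_points = [f"U+{ord(ch):04X}" for ch in results["utf-8"] if ch != " "]
```

Because the references consist only of ASCII characters, every encoding in the loop produces the same decoded text.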

    ISO Latin-1 Character Set

    The following table contains the complete ISO Latin-1 character set, corresponding to the first 256 entries of the Unicode character repertoire. The table provides each character, its decimal code, its named entity reference for HTML, and also a brief description.
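The correspondence between ISO Latin-1 and the first 256 Unicode code points can be checked with a short sketch. It assumes Python's built-in `iso-8859-1` codec; `decimal_reference` is a hypothetical helper introduced only for this example.

```python
# Decode every possible byte value as ISO-8859-1 (ISO Latin-1).
all_bytes = bytes(range(256))
decoded = all_bytes.decode("iso-8859-1")

# Each byte value maps to the Unicode code point with the same number.
same = all(ord(ch) == value for ch, value in zip(decoded, all_bytes))


def decimal_reference(ch: str) -> str:
    """Build the decimal numeric character reference for one character."""
    return f"&#{ord(ch)};"
```

For instance, the byte 0xE9 decodes to "é", and `decimal_reference("é")` gives the same decimal code that the table lists for that character.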

    | Character | Decimal code | Named entity | Description |
    |---|---|---|---|
    | --- | &#0; | --- | Unused |
    | --- | &#1; | --- | Unused |
    | --- | &#2; | --- | Unused |
    | --- | &#3; | --- | Unused |
    | --- | &#4; | --- | Unused |
    | --- | &#5; | --- | Unused |
    | --- | &#6; | --- | Unused |
    | --- | &#7; | --- | Unused |
    | --- | &#8; | --- | Unused |
    | --- | &#9; | --- | Horizontal tab |
    | --- | &#10; | --- | Line feed |
    | --- | &#11; | --- | Unused |
    | --- | &#12; | --- | Unused |
    | --- | &#13; | --- | Carriage Return |
    | --- | &#14; | --- | Unused |
    | --- | &#15; | --- | Unused |
    | --- | &#16; | --- | Unused |
    | --- | &#17; | --- | Unused |
    | --- | &#18; | --- | Unused |
    | --- | &#19; | --- | Unused |
    | --- | &#20; | --- | Unused |
    | --- | &#21; | --- | Unused |
    | --- | &#22; | --- | Unused |
    | --- | &#23; | --- | Unused |
    | --- | &#24; | --- | Unused |
    | --- | &#25; | --- | Unused |
    | --- | &#26; | --- | Unused |
    | --- | &#27; | --- | Unused |
    | --- | &#28; | --- | Unused |
    | --- | &#29; | --- | Unused |
    | --- | &#30; | --- | Unused |
    | --- | &#31; | --- | Unused |
    |   | &#32; | --- | Space |
    | ! | &#33; | --- | Exclamation mark |
    | " | &#34; | &quot; | Quotation mark |
    | # | &#35; | --- | Number sign |
    | $ | &#36; | --- | Dollar sign |
    | % | &#37; | --- | Percent sign |
    | & | &#38; | &amp; | Ampersand |
    | ' | &#39; | --- | Apostrophe |
    | ( | &#40; | --- | Left parenthesis |
    | ) | &#41; | --- | Right parenthesis |
    | * | &#42; | --- | Asterisk |
    | + | &#43; | --- | Plus sign |
    | , | &#44; | --- | Comma |
    | - | &#45; | --- | Hyphen |
    | . | &#46; | --- | Period (full stop) |
    | / | &#47; | --- | Solidus (slash) |
    | 0 | &#48; | --- | Digit 0 |
    | 1 | &#49; | --- | Digit 1 |
    | 2 | &#50; | --- | Digit 2 |
    | 3 | &#51; | --- | Digit 3 |
    | 4 | &#52; | --- | Digit 4 |
    | 5 | &#53; | --- | Digit 5 |
    | 6 | &#54; | --- | Digit 6 |
    | 7 | &#55; | --- | Digit 7 |
    | 8 | &#56; | --- | Digit 8 |
    | 9 | &#57; | --- | Digit 9 |
    | : | &#58; | --- | Colon |
    | ; | &#59; | --- | Semicolon |
    | < | &#60; | &lt; | Less than |
    | = | &#61; | --- | Equals sign |
    | > | &#62; | &gt; | Greater than |
    | ? | &#63; | --- | Question mark |
    | @ | &#64; | --- | Commercial at |
    | A | &#65; | --- | Capital A |
    | B | &#66; | --- | Capital B |
    | C | &#67; | --- | Capital C |
    | D | &#68; | --- | Capital D |
    | E | &#69; | --- | Capital E |
    | F | &#70; | --- | Capital F |
    | G | &#71; | --- | Capital G |
    | H | &#72; | --- | Capital H |
    | I | &#73; | --- | Capital I |
    | J | &#74; | --- | Capital J |
    | K | &#75; | --- | Capital K |
    | L | &#76; | --- | Capital L |
    | M | &#77; | --- | Capital M |
    | N | &#78; | --- | Capital N |
    | O | &#79; | --- | Capital O |
    | P | &#80; | --- | Capital P |
    | Q | &#81; | --- | Capital Q |
    | R | &#82; | --- | Capital R |
    | S | &#83; | --- | Capital S |
    | T | &#84; | --- | Capital T |
    | U | &#85; | --- | Capital U |
    | V | &#86; | --- | Capital V |
    | W | &#87; | --- | Capital W |
    | X | &#88; | --- | Capital X |
    | Y | &#89; | --- | Capital Y |
    | Z | &#90; | --- | Capital Z |
    | [ | &#91; | --- | Left square bracket |
    | \ | &#92; | --- | Reverse solidus (backslash) |
    | ] | &#93; | --- | Right square bracket |
    | ^ | &#94; | --- | Caret |
    | _ | &#95; | --- | Horizontal bar (underscore) |
    | ` | &#96; | --- | Grave accent |
    | a | &#97; | --- | Small a |
    | b | &#98; | --- | Small b |
    | c | &#99; | --- | Small c |
    | d | &#100; | --- | Small d |
    | e | &#101; | --- | Small e |
    | f | &#102; | --- | Small f |
    | g | &#103; | --- | Small g |
    | h | &#104; | --- | Small h |
    | i | &#105; | --- | Small i |
    | j | &#106; | --- | Small j |
    | k | &#107; | --- | Small k |
    | l | &#108; | --- | Small l |
    | m | &#109; | --- | Small m |
    | n | &#110; | --- | Small n |
    | o | &#111; | --- | Small o |
    | p | &#112; | --- | Small p |
    | q | &#113; | --- | Small q |
    | r | &#114; | --- | Small r |
    | s | &#115; | --- | Small s |
    | t | &#116; | --- | Small t |
    | u | &#117; | --- | Small u |
    | v | &#118; | --- | Small v |
    | w | &#119; | --- | Small w |
    | x | &#120; | --- | Small x |
    | y | &#121; | --- | Small y |
    | z | &#122; | --- | Small z |
    | { | &#123; | --- | Left curly brace |
    | \| | &#124; | --- | Vertical bar |
    | } | &#125; | --- | Right curly brace |
    | ~ | &#126; | --- | Tilde |
    | --- | &#127; | --- | Unused |
    |   | &#160; | &nbsp; | Nonbreaking space |
    | ¡ | &#161; | &iexcl; | Inverted exclamation |
    | ¢ | &#162; | &cent; | Cent sign |
    | £ | &#163; | &pound; | Pound sterling |
    | ¤ | &#164; | &curren; | General currency sign |
    | ¥ | &#165; | &yen; | Yen sign |
    | ¦ | &#166; | &brvbar; or &brkbar; | Broken vertical bar |
    | § | &#167; | &sect; | Section sign |
    | ¨ | &#168; | &uml; or &die; | Diæresis / Umlaut |
    | © | &#169; | &copy; | Copyright |
    | ª | &#170; | &ordf; | Feminine ordinal |
    | « | &#171; | &laquo; | Left angle quote, guillemet left |
    | ¬ | &#172; | &not; | Not sign |
    | - | &#173; | &shy; | Soft hyphen |
    | ® | &#174; | &reg; | Registered trademark |
    | ¯ | &#175; | &macr; or &hibar; | Macron accent |
    | ° | &#176; | &deg; | Degree sign |
    | ± | &#177; | &plusmn; | Plus or minus |
    | ² | &#178; | &sup2; | Superscript two |
    | ³ | &#179; | &sup3; | Superscript three |
    | ´ | &#180; | &acute; | Acute accent |
    | µ | &#181; | &micro; | Micro sign |
    | ¶ | &#182; | &para; | Paragraph sign |
    | · | &#183; | &middot; | Middle dot |
    | ¸ | &#184; | &cedil; | Cedilla |
    | ¹ | &#185; | &sup1; | Superscript one |
    | º | &#186; | &ordm; | Masculine ordinal |
    | » | &#187; | &raquo; | Right angle quote, guillemet right |
    | ¼ | &#188; | &frac14; | Fraction one-fourth |
    | ½ | &#189; | &frac12; | Fraction one-half |
    | ¾ | &#190; | &frac34; | Fraction three-fourths |
    | ¿ | &#191; | &iquest; | Inverted question mark |
    | À | &#192; | &Agrave; | Capital A, grave accent |
    | Á | &#193; | &Aacute; | Capital A, acute accent |
    | Â | &#194; | &Acirc; | Capital A, circumflex |
    | Ã | &#195; | &Atilde; | Capital A, tilde |
    | Ä | &#196; | &Auml; | Capital A, diæresis / umlaut |
    | Å | &#197; | &Aring; | Capital A, ring |
    | Æ | &#198; | &AElig; | Capital AE ligature |
    | Ç | &#199; | &Ccedil; | Capital C, cedilla |
    | È | &#200; | &Egrave; | Capital E, grave accent |
    | É | &#201; | &Eacute; | Capital E, acute accent |
    | Ê | &#202; | &Ecirc; | Capital E, circumflex |
    | Ë | &#203; | &Euml; | Capital E, diæresis / umlaut |
    | Ì | &#204; | &Igrave; | Capital I, grave accent |
    | Í | &#205; | &Iacute; | Capital I, acute accent |
    | Î | &#206; | &Icirc; | Capital I, circumflex |
    | Ï | &#207; | &Iuml; | Capital I, diæresis / umlaut |
    | Ð | &#208; | &ETH; | Capital Eth, Icelandic |
    | Ñ | &#209; | &Ntilde; | Capital N, tilde |
    | Ò | &#210; | &Ograve; | Capital O, grave accent |
    | Ó | &#211; | &Oacute; | Capital O, acute accent |
    | Ô | &#212; | &Ocirc; | Capital O, circumflex |
    | Õ | &#213; | &Otilde; | Capital O, tilde |
    | Ö | &#214; | &Ouml; | Capital O, diæresis / umlaut |
    | × | &#215; | &times; | Multiply sign |
    | Ø | &#216; | &Oslash; | Capital O, slash |
    | Ù | &#217; | &Ugrave; | Capital U, grave accent |
    | Ú | &#218; | &Uacute; | Capital U, acute accent |
    | Û | &#219; | &Ucirc; | Capital U, circumflex |
    | Ü | &#220; | &Uuml; | Capital U, diæresis / umlaut |
    | Ý | &#221; | &Yacute; | Capital Y, acute accent |
    | Þ | &#222; | &THORN; | Capital Thorn, Icelandic |
    | ß | &#223; | &szlig; | Small sharp s, German sz |
    | à | &#224; | &agrave; | Small a, grave accent |
    | á | &#225; | &aacute; | Small a, acute accent |
    | â | &#226; | &acirc; | Small a, circumflex |
    | ã | &#227; | &atilde; | Small a, tilde |
    | ä | &#228; | &auml; | Small a, diæresis / umlaut |
    | å | &#229; | &aring; | Small a, ring |
    | æ | &#230; | &aelig; | Small ae ligature |
    | ç | &#231; | &ccedil; | Small c, cedilla |
    | è | &#232; | &egrave; | Small e, grave accent |
    | é | &#233; | &eacute; | Small e, acute accent |
    | ê | &#234; | &ecirc; | Small e, circumflex |
    | ë | &#235; | &euml; | Small e, diæresis / umlaut |
    | ì | &#236; | &igrave; | Small i, grave accent |
    | í | &#237; | &iacute; | Small i, acute accent |
    | î | &#238; | &icirc; | Small i, circumflex |
    | ï | &#239; | &iuml; | Small i, diæresis / umlaut |
    | ð | &#240; | &eth; | Small eth, Icelandic |
    | ñ | &#241; | &ntilde; | Small n, tilde |
    | ò | &#242; | &ograve; | Small o, grave accent |
    | ó | &#243; | &oacute; | Small o, acute accent |
    | ô | &#244; | &ocirc; | Small o, circumflex |
    | õ | &#245; | &otilde; | Small o, tilde |
    | ö | &#246; | &ouml; | Small o, diæresis / umlaut |
    | ÷ | &#247; | &divide; | Division sign |
    | ø | &#248; | &oslash; | Small o, slash |
    | ù | &#249; | &ugrave; | Small u, grave accent |
    | ú | &#250; | &uacute; | Small u, acute accent |
    | û | &#251; | &ucirc; | Small u, circumflex |
    | ü | &#252; | &uuml; | Small u, diæresis / umlaut |
    | ý | &#253; | &yacute; | Small y, acute accent |
    | þ | &#254; | &thorn; | Small thorn, Icelandic |
    | ÿ | &#255; | &yuml; | Small y, diæresis / umlaut |


    Additional Named Entities for HTML

    The following table contains additional named entities, their numeric character references, and a description of each. With the exception of the left- and right-pointing angle brackets (&#9001; and &#9002;), these entities are rendered using Lucida Sans Unicode.
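The relationship between a named entity and its numeric character reference can be reproduced programmatically. The sketch below relies on `html.entities.name2codepoint` from the Python standard library; `numeric_reference` and the sample names are hypothetical choices for illustration, not part of the original reference.

```python
from html.entities import name2codepoint


def numeric_reference(name: str) -> str:
    """Return the decimal numeric character reference for a named entity."""
    return f"&#{name2codepoint[name]};"


# A few entity names taken from the table in this section.
samples = {name: numeric_reference(name)
           for name in ("fnof", "Alpha", "hellip", "rarr", "spades")}
```

A lookup like this can serve as a cross-check when transcribing entity tables by hand.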

    | Character | Named entity | Numeric character reference | Description |
    |---|---|---|---|
    | Latin Extended-B | | | |
    | ƒ | &fnof; | &#402; | Latin small f with hook, =function, =florin, U0192 ISOtech |
    | Greek | | | |
    | Α | &Alpha; | &#913; | Greek capital letter alpha, U0391 |
    | Β | &Beta; | &#914; | Greek capital letter beta, U0392 |
    | Γ | &Gamma; | &#915; | Greek capital letter gamma, U0393 ISOgrk3 |
    | Δ | &Delta; | &#916; | Greek capital letter delta, U0394 ISOgrk3 |
    | Ε | &Epsilon; | &#917; | Greek capital letter epsilon, U0395 |
    | Ζ | &Zeta; | &#918; | Greek capital letter zeta, U0396 |
    | Η | &Eta; | &#919; | Greek capital letter eta, U0397 |
    | Θ | &Theta; | &#920; | Greek capital letter theta, U0398 ISOgrk3 |
    | Ι | &Iota; | &#921; | Greek capital letter iota, U0399 |
    | Κ | &Kappa; | &#922; | Greek capital letter kappa, U039A |
    | Λ | &Lambda; | &#923; | Greek capital letter lambda, U039B ISOgrk3 |
    | Μ | &Mu; | &#924; | Greek capital letter mu, U039C |
    | Ν | &Nu; | &#925; | Greek capital letter nu, U039D |
    | Ξ | &Xi; | &#926; | Greek capital letter xi, U039E ISOgrk3 |
    | Ο | &Omicron; | &#927; | Greek capital letter omicron, U039F |
    | Π | &Pi; | &#928; | Greek capital letter pi, U03A0 ISOgrk3 |
    | Ρ | &Rho; | &#929; | Greek capital letter rho, U03A1 |
    | Σ | &Sigma; | &#931; | Greek capital letter sigma, U03A3 ISOgrk3 |
    | Τ | &Tau; | &#932; | Greek capital letter tau, U03A4 |
    | Υ | &Upsilon; | &#933; | Greek capital letter upsilon, U03A5 ISOgrk3 |
    | Φ | &Phi; | &#934; | Greek capital letter phi, U03A6 ISOgrk3 |
    | Χ | &Chi; | &#935; | Greek capital letter chi, U03A7 |
    | Ψ | &Psi; | &#936; | Greek capital letter psi, U03A8 ISOgrk3 |
    | Ω | &Omega; | &#937; | Greek capital letter omega, U03A9 ISOgrk3 |
    | α | &alpha; | &#945; | Greek small letter alpha, U03B1 ISOgrk3 |
    | β | &beta; | &#946; | Greek small letter beta, U03B2 ISOgrk3 |
    | γ | &gamma; | &#947; | Greek small letter gamma, U03B3 ISOgrk3 |
    | δ | &delta; | &#948; | Greek small letter delta, U03B4 ISOgrk3 |
    | ε | &epsilon; | &#949; | Greek small letter epsilon, U03B5 ISOgrk3 |
    | ζ | &zeta; | &#950; | Greek small letter zeta, U03B6 ISOgrk3 |
    | η | &eta; | &#951; | Greek small letter eta, U03B7 ISOgrk3 |
    | θ | &theta; | &#952; | Greek small letter theta, U03B8 ISOgrk3 |
    | ι | &iota; | &#953; | Greek small letter iota, U03B9 ISOgrk3 |
    | κ | &kappa; | &#954; | Greek small letter kappa, U03BA ISOgrk3 |
    | λ | &lambda; | &#955; | Greek small letter lambda, U03BB ISOgrk3 |
    | μ | &mu; | &#956; | Greek small letter mu, U03BC ISOgrk3 |
    | ν | &nu; | &#957; | Greek small letter nu, U03BD ISOgrk3 |
    | ξ | &xi; | &#958; | Greek small letter xi, U03BE ISOgrk3 |
    | ο | &omicron; | &#959; | Greek small letter omicron, U03BF NEW |
    | π | &pi; | &#960; | Greek small letter pi, U03C0 ISOgrk3 |
    | ρ | &rho; | &#961; | Greek small letter rho, U03C1 ISOgrk3 |
    | ς | &sigmaf; | &#962; | Greek small letter final sigma, U03C2 ISOgrk3 |
    | σ | &sigma; | &#963; | Greek small letter sigma, U03C3 ISOgrk3 |
    | τ | &tau; | &#964; | Greek small letter tau, U03C4 ISOgrk3 |
    | υ | &upsilon; | &#965; | Greek small letter upsilon, U03C5 ISOgrk3 |
    | φ | &phi; | &#966; | Greek small letter phi, U03C6 ISOgrk3 |
    | χ | &chi; | &#967; | Greek small letter chi, U03C7 ISOgrk3 |
    | ψ | &psi; | &#968; | Greek small letter psi, U03C8 ISOgrk3 |
    | ω | &omega; | &#969; | Greek small letter omega, U03C9 ISOgrk3 |
    | ϑ | &thetasym; | &#977; | Greek small letter theta symbol, U03D1 NEW |
    | ϒ | &upsih; | &#978; | Greek upsilon with hook symbol, U03D2 NEW |
    | ϖ | &piv; | &#982; | Greek pi symbol, U03D6 ISOgrk3 |
    | General Punctuation | | | |
    | • | &bull; | &#8226; | bullet, =black small circle, U2022 ISOpub |
    | … | &hellip; | &#8230; | horizontal ellipsis, =three dot leader, U2026 ISOpub |
    | ′ | &prime; | &#8242; | prime, =minutes, =feet, U2032 ISOtech |
    | ″ | &Prime; | &#8243; | double prime, =seconds, =inches, U2033 ISOtech |
    | ‾ | &oline; | &#8254; | overline, =spacing overscore, U203E NEW |
    | ⁄ | &frasl; | &#8260; | fraction slash, U2044 NEW |
    | Letterlike Symbols | | | |
    | ℘ | &weierp; | &#8472; | script capital P, =power set, =Weierstrass p, U2118 ISOamso |
    | ℑ | &image; | &#8465; | blackletter capital I, =imaginary part, U2111 ISOamso |
    | ℜ | &real; | &#8476; | blackletter capital R, =real part symbol, U211C ISOamso |
    | ™ | &trade; | &#8482; | trade mark sign, U2122 ISOnum |
    | ℵ | &alefsym; | &#8501; | alef symbol, =first transfinite cardinal, U2135 NEW |
    | Arrows | | | |
    | ← | &larr; | &#8592; | leftward arrow, U2190 ISOnum |
    | ↑ | &uarr; | &#8593; | upward arrow, U2191 ISOnum |
    | → | &rarr; | &#8594; | rightward arrow, U2192 ISOnum |
    | ↓ | &darr; | &#8595; | downward arrow, U2193 ISOnum |
    | ↔ | &harr; | &#8596; | left right arrow, U2194 ISOamsa |
    | ↵ | &crarr; | &#8629; | downward arrow with corner leftward, =carriage return, U21B5 NEW |
    | ⇐ | &lArr; | &#8656; | leftward double arrow, U21D0 ISOtech |
    | ⇑ | &uArr; | &#8657; | upward double arrow, U21D1 ISOamsa |
    | ⇒ | &rArr; | &#8658; | rightward double arrow, U21D2 ISOtech |
    | ⇓ | &dArr; | &#8659; | downward double arrow, U21D3 ISOamsa |
    | ⇔ | &hArr; | &#8660; | left right double arrow, U21D4 ISOamsa |
    | Mathematical Operators | | | |
    | ∀ | &forall; | &#8704; | for all, U2200 ISOtech |
    | ∂ | &part; | &#8706; | partial differential, U2202 ISOtech |
    | ∃ | &exist; | &#8707; | there exists, U2203 ISOtech |
    | ∅ | &empty; | &#8709; | empty set, =null set, =diameter, U2205 ISOamso |
    | ∇ | &nabla; | &#8711; | nabla, =backward difference, U2207 ISOtech |
    | ∈ | &isin; | &#8712; | element of, U2208 ISOtech |
    | ∉ | &notin; | &#8713; | not an element of, U2209 ISOtech |
    | ∋ | &ni; | &#8715; | contains as member, U220B ISOtech |
    | ∏ | &prod; | &#8719; | n-ary product, =product sign, U220F ISOamsb |
    | ∑ | &sum; | &#8721; | n-ary summation, U2211 ISOamsb |
    | − | &minus; | &#8722; | minus sign, U2212 ISOtech |
    | ∗ | &lowast; | &#8727; | asterisk operator, U2217 ISOtech |
    | √ | &radic; | &#8730; | square root, =radical sign, U221A ISOtech |
    | ∝ | &prop; | &#8733; | proportional to, U221D ISOtech |
    | ∞ | &infin; | &#8734; | infinity, U221E ISOtech |
    | ∠ | &ang; | &#8736; | angle, U2220 ISOamso |
    | ∧ | &and; | &#8743; | logical and, =wedge, U2227 ISOtech |
    | ∨ | &or; | &#8744; | logical or, =vee, U2228 ISOtech |
    | ∩ | &cap; | &#8745; | intersection, =cap, U2229 ISOtech |
    | ∪ | &cup; | &#8746; | union, =cup, U222A ISOtech |
    | ∫ | &int; | &#8747; | integral, U222B ISOtech |
    | ∴ | &there4; | &#8756; | therefore, U2234 ISOtech |
    | ∼ | &sim; | &#8764; | tilde operator, =varies with, =similar to, U223C ISOtech |
    | ≅ | &cong; | &#8773; | approximately equal to, U2245 ISOtech |
    | ≈ | &asymp; | &#8776; | almost equal to, =asymptotic to, U2248 ISOamsr |
    | ≠ | &ne; | &#8800; | not equal to, U2260 ISOtech |
    | ≡ | &equiv; | &#8801; | identical to, U2261 ISOtech |
    | ≤ | &le; | &#8804; | less-than or equal to, U2264 ISOtech |
    | ≥ | &ge; | &#8805; | greater-than or equal to, U2265 ISOtech |
    | ⊂ | &sub; | &#8834; | subset of, U2282 ISOtech |
    | ⊃ | &sup; | &#8835; | superset of, U2283 ISOtech |
    | ⊄ | &nsub; | &#8836; | not a subset of, U2284 ISOamsn |
    | ⊆ | &sube; | &#8838; | subset of or equal to, U2286 ISOtech |
    | ⊇ | &supe; | &#8839; | superset of or equal to, U2287 ISOtech |
    | ⊕ | &oplus; | &#8853; | circled plus, =direct sum, U2295 ISOamsb |
    | ⊗ | &otimes; | &#8855; | circled times, =vector product, U2297 ISOamsb |
    | ⊥ | &perp; | &#8869; | up tack, =orthogonal to, =perpendicular, U22A5 ISOtech |
    | ⋅ | &sdot; | &#8901; | dot operator, U22C5 ISOamsb |
    | Miscellaneous Technical | | | |
    | ⌈ | &lceil; | &#8968; | left ceiling, =apl upstile, U2308, ISOamsc |
    | ⌉ | &rceil; | &#8969; | right ceiling, U2309, ISOamsc |
    | ⌊ | &lfloor; | &#8970; | left floor, =apl downstile, U230A, ISOamsc |
    | ⌋ | &rfloor; | &#8971; | right floor, U230B, ISOamsc |
    | 〈 | &lang; | &#9001; | left-pointing angle bracket, =bra, U2329 ISOtech |
    | 〉 | &rang; | &#9002; | right-pointing angle bracket, =ket, U232A ISOtech |
    | Geometric Shapes | | | |
    | ◊ | &loz; | &#9674; | lozenge, U25CA ISOpub |
    | Miscellaneous Symbols | | | |
    | ♠ | &spades; | &#9824; | black spade suit, U2660 ISOpub |
    | ♣ | &clubs; | &#9827; | black club suit, =shamrock, U2663 ISOpub |
    | ♥ | &hearts; | &#9829; | black heart suit, =valentine, U2665 ISOpub |
    | ♦ | &diams; | &#9830; | black diamond suit, U2666 ISOpub |


    Character Entities for Special Symbols and BIDI Text

    | Character | Named entity (NE) | Numeric character reference (NCR) | Description |
    |---|---|---|---|
    | C0 Controls and Basic Latin | | | |
    | " | &quot; | &#34; | quotation mark, =apl quote, U0022 ISOnum |
    | & | &amp; | &#38; | ampersand, U0026 ISOnum |
    | < | &lt; | &#60; | less-than sign, U003C ISOnum |
    | > | &gt; | &#62; | greater-than sign, U003E ISOnum |
    | Latin Extended-A | | | |
    | Œ | &OElig; | &#338; | Latin capital ligature oe, U0152 ISOlat2 |
    | œ | &oelig; | &#339; | Latin small ligature oe, U0153 ISOlat2 |
    | Š | &Scaron; | &#352; | Latin capital letter s with caron, U0160 ISOlat2 |
    | š | &scaron; | &#353; | Latin small letter s with caron, U0161 ISOlat2 |
    | Ÿ | &Yuml; | &#376; | Latin capital letter y with diaeresis, U0178 ISOlat2 |
    | Spacing Modifier Letters | | | |
    | ˆ | &circ; | &#710; | modifier letter circumflex accent, U02C6 ISOpub |
    | ˜ | &tilde; | &#732; | small tilde, U02DC ISOdia |
    | General Punctuation | | | |
    |   | &ensp; | &#8194; | en space, U2002 ISOpub |
    |   | &emsp; | &#8195; | em space, U2003 ISOpub |
    |   | &thinsp; | &#8201; | thin space, U2009 ISOpub |
    |   | &zwnj; | &#8204; | zero width non-joiner, U200C NEW RFC 2070 |
    |   | &zwj; | &#8205; | zero width joiner, U200D NEW RFC 2070 |
    |   | &lrm; | &#8206; | left-to-right mark, U200E NEW RFC 2070 |
    |   | &rlm; | &#8207; | right-to-left mark, U200F NEW RFC 2070 |
    | – | &ndash; | &#8211; | en dash, U2013 ISOpub |
    | — | &mdash; | &#8212; | em dash, U2014 ISOpub |
    | ‘ | &lsquo; | &#8216; | left single quotation mark, U2018 ISOnum |
    | ’ | &rsquo; | &#8217; | right single quotation mark, U2019 ISOnum |
    | ‚ | &sbquo; | &#8218; | single low-9 quotation mark, U201A NEW |
    | “ | &ldquo; | &#8220; | left double quotation mark, U201C ISOnum |
    | ” | &rdquo; | &#8221; | right double quotation mark, U201D ISOnum |
    | „ | &bdquo; | &#8222; | double low-9 quotation mark, U201E NEW |
    | † | &dagger; | &#8224; | dagger, U2020 ISOpub |
    | ‡ | &Dagger; | &#8225; | double dagger, U2021 ISOpub |
    | ‰ | &permil; | &#8240; | per mille sign, U2030 ISOtech |
    | ‹ | &lsaquo; | &#8249; | single left-pointing angle quotation mark, U2039 ISO proposed |
    | › | &rsaquo; | &#8250; | single right-pointing angle quotation mark, U203A ISO proposed |
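
When generating HTML, the markup-significant characters and any characters outside ASCII can be written as character references automatically. The following is a minimal sketch, assuming Python's standard `html.escape` and the built-in `xmlcharrefreplace` error handler; the two function names are hypothetical.

```python
import html


def escape_markup(text: str) -> str:
    """Replace &, <, > and quotation marks with character references."""
    return html.escape(text, quote=True)


def to_numeric_references(text: str) -> str:
    """Write every non-ASCII character as a decimal numeric reference."""
    return text.encode("ascii", "xmlcharrefreplace").decode("ascii")
```

For example, `to_numeric_references("Œuvre")` yields `&#338;uvre`, which a browser displays correctly regardless of the character set declared for the page.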

    Character Set Recognition

    Microsoft Internet Explorer uses the character set specified for a document to determine how to translate the bytes in the document into characters on the screen or on paper. By default, Internet Explorer takes the character set from the charset parameter of the HTTP content type returned by the server. If that parameter is absent, it uses the character set specified by the META element in the document. If there is no META element either, it falls back to the user's preferences.
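The order of precedence described above can be sketched as a small function. This is only an illustration of the stated order, not Internet Explorer's actual implementation; the function name, its parameters, and the lowercasing of the result are assumptions made for this example.

```python
from typing import Optional


def choose_charset(http_charset: Optional[str],
                   meta_charset: Optional[str],
                   user_default: str) -> str:
    """Pick a character set: HTTP header first, then META, then user preference."""
    for candidate in (http_charset, meta_charset):
        if candidate:                      # skip None and empty strings
            return candidate.strip().lower()
    return user_default.strip().lower()
```

Passing `None` for a source that supplied no character set makes the function fall through to the next source in the list.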

    You can use the META element to explicitly set the character set for a document. In this case, you set the HTTP-EQUIV= attribute to "Content-Type" and specify a character set identifier in the CONTENT= attribute. For example, the following META element identifies windows-1251 as the character set for the document.

    <META HTTP-EQUIV="Content-Type"
          CONTENT="text/html; CHARSET=windows-1251">

    As long as you place the META element before the BODY element, it affects the whole document, including the TITLE element. For clarity, it should appear as the first element after HEAD so that all readers know the encoding before the first element that can be displayed is parsed. Note that the META element applies to the document containing it. This means, for example, that a compound document (a document consisting of two or more documents in a set of frames) can use different character sets in different frames.
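To show how a tool might read such a declaration, the sketch below extracts the character set from a META element using Python's standard `html.parser` and `email.message` modules. The class and function names are hypothetical, and the code only handles the HTTP-EQUIV form described in this section.

```python
from email.message import Message
from html.parser import HTMLParser
from typing import Optional


class MetaCharsetParser(HTMLParser):
    """Remember the charset of the first Content-Type META element."""

    def __init__(self) -> None:
        super().__init__()
        self.charset: Optional[str] = None

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag != "meta" or self.charset is not None:
            return
        attributes = {name: (value or "") for name, value in attrs}
        if attributes.get("http-equiv", "").lower() != "content-type":
            return
        # Reuse the standard MIME parameter parser for the CONTENT value.
        message = Message()
        message["Content-Type"] = attributes.get("content", "")
        self.charset = message.get_param("charset")


def sniff_meta_charset(markup: str) -> Optional[str]:
    parser = MetaCharsetParser()
    parser.feed(markup)
    return parser.charset
```

Called on the META example from this section, `sniff_meta_charset` returns `windows-1251`; it returns `None` when no such element is present.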

    The following table lists the character sets supported by Internet Explorer 5. For each character set it gives:

    1. Display Name: the name used to refer to the character set.

    2. Preferred Charset ID: the most common identifier used to set the character set in Internet Explorer. For example, in the META example above, windows-1251 is the Charset ID.

    3. Additional Aliases: other identifiers that may be used to set the character set.

    4. MLang Code Page: the numeric value of the code page used by the Internet Explorer MLang API.

    5. Supported by Versions: the versions of Internet Explorer that support the character set.

    Note: CS indicates that the version of Internet Explorer must support complex scripts such as Arabic, Hebrew, or Thai.

    Charsets in Microsoft Internet Explorer 5

    | Display Name | Preferred Charset ID | Additional Aliases | MLang Code Page | Supported by Versions |
    |---|---|---|---|---|
    | Arabic ASMO-708 | ASMO-708 | | 708 | 4CS, 5 |
    | Arabic (DOS) | DOS-720 | | 720 | 4CS, 5 |
    | Arabic (ISO) | iso-8859-6 | ISO_8859-6:1987, iso-ir-127, ISO_8859-6, ECMA-114, arabic, csISOLatinArabic | 28596 | 4CS, 5 |
    | Arabic (Windows) | windows-1256 | | 1256 | 4CS, 5 |
    | Baltic (ISO) | iso-8859-4 | csISOLatin4, iso-ir-110, ISO_8859-4, ISO_8859-4:1988, l4, latin4 | 28594 | 4, 5 |
    | Baltic (Windows) | Windows-1257 | | 1257 | 4, 5 |
    | Central European (DOS) | ibm852 | cp852 | 852 | 4, 5 |
    | Central European (ISO) | iso-8859-2 | csISOLatin2, iso-ir-101, iso8859-2, iso_8859-2, iso_8859-2:1987, l2, latin2 | 28592 | 3, 4, 5 |
    | Central European (Windows) | windows-1250 | x-cp1250 | 1250 | 3, 4, 5 |
    | Chinese Simplified (GB2312) | gb2312 | chinese, csGB2312, csISO58GB231280, GB2312, GBK, GB_2312-80, iso-ir-58 | 936 | 3, 4, 5 |
    | Chinese Simplified (HZ) | hz-gb-2312 | | 52936 | 4, 5 |
    | Chinese Traditional | big5 | csbig5, x-x-big5 | 950 | 3, 4, 5 |
    | Cyrillic (DOS) | cp866 | ibm866 | 866 | 4, 5 |
    | Cyrillic (ISO) | iso-8859-5 | csISOLatinCyrillic, cyrillic, iso-ir-144, ISO_8859-5, ISO_8859-5:1988 | 28595 | 4, 5 |
    | Cyrillic (KOI8-R) | koi8-r | csKOI8R, koi | 20866 | 3, 4, 5 |
    | Cyrillic (Windows) | windows-1251 | x-cp1251 | 1251 | 3, 4, 5 |
    | Greek (ISO) | iso-8859-7 | csISOLatinGreek, ECMA-118, ELOT_928, greek, greek8, iso-ir-126, ISO_8859-7, ISO_8859-7:1987 | 28597 | 3, 4, 5 |
    | Greek (Windows) | Windows-1253 | windows-1253 | 1253 | 5 |
    | Hebrew (DOS) | DOS-862 | | 862 | 4CS, 5 |
    | Hebrew (ISO) | iso-8859-8 | csISOLatinHebrew, hebrew, iso-ir-138, ISO_8859-8, visual, ISO-8859-8 Visual | 28598 | 4CS, 5 |
    | Hebrew (Windows) | windows-1255 | logical, ISO_8859-8:1988, iso-ir-138 | 1255 | 3CS, 4CS, 5 |
    | Japanese (JIS) | iso-2022-jp | csISO2022JP | 50220 | 4, 5 |
    | Japanese (JIS-Allow 1-byte Kana) | csISO2022JP | iso-2022-jp | 50221 | 4, 5 |
    | Japanese (JIS-Allow 1-byte Kana - SO/SI) | iso-2022-jp | csISO2022JP | 50222 | 3, 4, 5 |
    | Japanese (EUC) | euc-jp | csEUCPkdFmtJapanese, Extended_UNIX_Code_Packed_Format_for_Japanese, x-euc, x-euc-jp | 51932 | 3, 4, 5 |
    | Japanese (Shift-JIS) | shift_jis | csShiftJIS, csWindows31J, ms_Kanji, shift-jis, x-ms-cp932, x-sjis | 932 | 3, 4, 5 |
    | Korean | ks_c_5601-1987 | csKSC56011987, euc-kr, korean, ks_c_5601 | 949 | 3, 4, 5 |
    | Korean (ISO) | iso-2022-kr | csISO2022KR | 50225 | 3, 4, 5 |
    | Latin 3 (ISO) | iso-8859-3 | | 28593 | 4, 5 |
    | Thai (Windows) | iso-8859-11 | windows-874 | 874 | 3, 4, 5 |
    | Turkish (Windows) | Windows-1254 | windows-1254 | 1254 | 3, 4, 5 |
    | Turkish (ISO) | iso-8859-9 | csISOLatin5, ISO_8859-9, ISO_8859-9:1989, iso-ir-148, l5, latin5 | 28599 | 3, 4, 5 |
    | Ukrainian (KOI8-U) | koi8-u | | 21866 | 4, 5 |
    | Unicode (UTF-7) | utf-7 | csUnicode11UTF7, unicode-1-1-utf-7, x-unicode-2-0-utf-7 | 65000 | 4, 5 |
    | Unicode (UTF-8) | utf-8 | unicode-1-1-utf-8, unicode-2-0-utf-8, x-unicode-2-0-utf-8 | 65001 | 4, 5 |
    | Vietnamese (Windows) | windows-1258 | | 1258 | 3, 4, 5 |
    | Western European (Windows) | Windows-1252 | | 1252 | 5 |
    | Western European (ISO) | iso-8859-1 | ANSI_X3.4-1968, ANSI_X3.4-1986, ascii, cp367, cp819, csASCII, IBM367, ibm819, iso-ir-100, iso-ir-6, ISO646-US, iso8859-1, ISO_646.irv:1991, iso_8859-1, iso_8859-1:1987, latin1, us, us-ascii, x-ansi | 1252 | 3, 4, 5 |
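
A table of this kind lends itself to a simple alias lookup. The sketch below copies a handful of rows from the table into a Python structure and resolves any identifier to its preferred Charset ID and MLang code page. The type and function names are hypothetical, and matching identifiers case-insensitively is an assumption of this example rather than something the table states.

```python
from typing import NamedTuple, Optional, Tuple


class Charset(NamedTuple):
    preferred_id: str
    aliases: Tuple[str, ...]
    code_page: int


# A few rows copied from the table; a full lookup would load all of them.
CHARSETS = (
    Charset("windows-1251", ("x-cp1251",), 1251),
    Charset("koi8-r", ("csKOI8R", "koi"), 20866),
    Charset("big5", ("csbig5", "x-x-big5"), 950),
    Charset("shift_jis", ("csShiftJIS", "csWindows31J", "ms_Kanji",
                          "shift-jis", "x-ms-cp932", "x-sjis"), 932),
)

# Index every identifier in lowercase (assumed case-insensitive matching).
_INDEX = {}
for entry in CHARSETS:
    for label in (entry.preferred_id, *entry.aliases):
        _INDEX[label.lower()] = entry


def resolve(label: str) -> Optional[Charset]:
    """Return the table row for a charset identifier, or None if unknown."""
    return _INDEX.get(label.strip().lower())
```

With this structure, `resolve("x-sjis")` and `resolve("shift_jis")` return the same row, so callers can work with a single preferred identifier.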

    Nonstandard Charsets with Special Meaning Inside Internet Explorer and MLang

    These character sets are not to be used for labeling documents.

    | Display Name | Preferred Charset ID | Additional Aliases | MLang Code Page | Supported by Versions |
    |---|---|---|---|---|
    | Japanese (Auto Select) | _autodetect | | 50932 | 3, 4, 5 |
    | Korean (Auto Select) | _autodetect_kr | | 50949 | 4, 5 |
    | Unicode | unicode | | 1200 | 4, 5 |
    | Unicode (BigEndian) | unicodeFEFF | | 1201 | 4, 5 |
    | User Defined | x-user-defined | | 50000 | 4, 5 |

    ©2003 Microsoft Corporation. All rights reserved.

  • Original article: https://www.cnblogs.com/rock_chen/p/2014427.html