zoukankan      html  css  js  c++  java
  • python,shell,locale,charset

    windows下,python控制台,编码 为gbk

    用户输入为gbk,python内部用unicode

    zi = u'字'

    gbk输入 -> unicode内部表示

    >>> zi
    u'u5b57'  内部表示

    >>> print zi

    字     内部表示->gbk输出

    把内部表示转换为另一种编码存储

    输出时,系统认为仍是unicode表示,因此输出乱码

    查看windows控制台编码:

    右键属性

    python控制台编码:

    http://docs.python.org/2/library/locale.html

    locale.getdefaultlocale([envvars])

    Tries to determine the default locale settings and returns them as a tuple of the form (language code, encoding).

    According to POSIX, a program which has not called setlocale(LC_ALL, '') runs using the portable 'C' locale. Callingsetlocale(LC_ALL, '') lets it use the default locale as defined by the LANG variable. Since we do not want to interfere with the current locale setting we thus emulate the behavior in the way described above.

    To maintain compatibility with other platforms, not only the LANG variable is tested, but a list of variables given as envvars parameter. The first found to be defined will be used. envvars defaults to the search path used in GNU gettext; it must always contain the variable name LANG. The GNU gettext search path contains 'LANGUAGE''LC_ALL''LC_CTYPE', and 'LANG', in that order.

    Except for the code 'C', the language code corresponds to RFC 1766language code and encoding may be None if their values cannot be determined.

    >>> locale.getdefaultlocale()
    ('zh_CN', 'cp936')
    >>>

     wiki cp936约等于gbk

  • 相关阅读:
    JAVA 8学习笔记-第五章
    JAVA 8学习笔记-第一章
    JAVA 8学习笔记-第二章
    MySQL应用
    Mac给iTerm2终端配色
    masOS支持NTFS读写,无需第三方软件
    macOS Apache配置用于支持Python CGI编程
    Vim
    Thrift
    Netflix Hystrix
  • 原文地址:https://www.cnblogs.com/threef/p/3316333.html
Copyright © 2011-2022 走看看