  • Python 3, Unicode, UTF-8, and encoding

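    # In Python 3 every str is Unicode, so the u prefix is optional.
    text = u'你好,今天天气不错'
    text
    print(text)
    
    # \uXXXX escapes in a normal literal are resolved when the source is parsed.
    text = '\u4f60\u597d\uff0c\u4eca\u5929\u5929\u6c14\u4e0d\u9519'
    text
    print(text)
    
    text = u'\u4f60\u597d\uff0c\u4eca\u5929\u5929\u6c14\u4e0d\u9519'
    text
    print(text)
    
    # Doubled backslashes: the string holds the literal characters \uXXXX
    # and has to be decoded explicitly.
    text = '\\u4f60\\u597d\\uff0c\\u4eca\\u5929\\u5929\\u6c14\\u4e0d\\u9519'
    text
    print(text)
    text = text.encode('utf-8').decode('unicode_escape')
    text
    print(text)
    
    # Literal escapes mixed with real Chinese characters: decode only the
    # \uXXXX matches, because unicode_escape on the whole string garbles the rest.
    text = '\\u4f60\\u597d\\uff0c今天天气不错'
    text
    print(text)
    import re
    text = re.sub(r'(\\u[0-9a-fA-F]{4})', lambda matched: matched.group(1).encode('utf-8').decode('unicode_escape'), text)
    text
    print(text)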

    That is the code that was run. The results are as follows:

    >>> text = u'你好,今天天气不错'
    >>> text
    '你好,今天天气不错'
    >>> print(text)
    你好,今天天气不错
    
    >>> text = '\u4f60\u597d\uff0c\u4eca\u5929\u5929\u6c14\u4e0d\u9519'
    >>> text
    '你好,今天天气不错'
    >>> print(text)
    你好,今天天气不错
    
    >>> text = u'\u4f60\u597d\uff0c\u4eca\u5929\u5929\u6c14\u4e0d\u9519'
    >>> text
    '你好,今天天气不错'
    >>> print(text)
    你好,今天天气不错
    
    >>> text = '\\u4f60\\u597d\\uff0c\\u4eca\\u5929\\u5929\\u6c14\\u4e0d\\u9519'
    >>> text
    '\\u4f60\\u597d\\uff0c\\u4eca\\u5929\\u5929\\u6c14\\u4e0d\\u9519'
    >>> print(text)
    \u4f60\u597d\uff0c\u4eca\u5929\u5929\u6c14\u4e0d\u9519
    >>> text = text.encode('utf-8').decode('unicode_escape')
    >>> text
    '你好,今天天气不错'
    >>> print(text)
    你好,今天天气不错
    
    >>> text = '\\u4f60\\u597d\\uff0c今天天气不错'
    >>> text
    '\\u4f60\\u597d\\uff0c今天天气不错'
    >>> print(text)
    \u4f60\u597d\uff0c今天天气不错
    >>> import re
    >>> text = re.sub(r'(\\u[0-9a-fA-F]{4})', lambda matched: matched.group(1).encode('utf-8').decode('unicode_escape'), text)
    >>> text
    '你好,今天天气不错'
    >>> print(text)
    你好,今天天气不错
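
    The last case can also be wrapped in a small helper. The sketch below is only one possible way to do it and is not from the original post: the name decode_u_escapes is hypothetical, and it converts each match with chr(int(..., 16)) instead of the encode/decode round trip. It also shows why decoding the whole mixed string with unicode_escape does not work.

```python
import re

# Hypothetical helper (not from the original post): replace each literal
# \uXXXX sequence with the character it names and leave other text untouched.
_U_ESCAPE = re.compile(r'\\u([0-9a-fA-F]{4})')

def decode_u_escapes(text):
    return _U_ESCAPE.sub(lambda m: chr(int(m.group(1), 16)), text)

# Literal \uXXXX sequences followed by real Chinese characters.
mixed = '\\u4f60\\u597d\\uff0c今天天气不错'

# Whole-string round trip: the escapes decode correctly, but the UTF-8 bytes
# of the real Chinese characters are read back one by one as Latin-1.
garbled = mixed.encode('utf-8').decode('unicode_escape')

print(decode_u_escapes(mixed))
print(garbled == decode_u_escapes(mixed))  # False
```

    This sketch handles only four-digit escapes. A character outside the Basic Multilingual Plane written as a surrogate pair would come out as two separate surrogates.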

  • Original post: https://www.cnblogs.com/jacen789/p/9401877.html