python中的字符串编码问题——3.各操作系统下的不同编码方式

2023-05-12 11:25:46

各操作系统下的不同编码方式

先看一下

linux,python2.7

>>> B = b'\xc3\x84\xc3\xa8'

>>> B.decode('utf-8')

u'\xc4\xe8'

>>> type(B)

>>>

windows,python2.7,python shell

>>> B = b'\xc3\x84\xc3\xa8'

>>> B.decode('utf-8')

u'\xc4\xe8'

>>> print B.decode('utf-8')

Äè

>>>

windows,python2.7,python cmd控制台

>>> B = b'\xc3\x84\xc3\xa8'

>>> B.decode('utf-8')

u'\xc4\xe8'

>>> print B.decode('utf-8')

Traceback (most recent call last):

File "<stdin>", line 1, in <module>

UnicodeEncodeError: 'gbk' codec can't encode character u'\xc4' in position 0: il

legal multibyte sequence

>>>

三种环境下不同输出的原因：

windows控制台默认采用GBK编码，liunx默认采用UTF-8编码

------------------------------------------------------

查看linux默认编码：

[[email protected] ~]# env |grep LANG

LANG=zh_CN.UTF-8

------------------------------------------------------

查看windows控制台默认编码：

cmd打开控制台---->属性---->查看编码为936（简体中文GBK）

（进一步在linux和windows下新建文本文件查看编码方式果然没错，证实。）

转载于:https://www.cnblogs.com/Micang/p/9733028.html

python中的字符串编码问题——3.各操作系统下的不同编码方式

继续阅读

Shell编程——sort排序、uniq忽略重复、tr替换压缩删除、cut指定删除字段、正则表达式元字符sort 命令uniq 命令tr 命令cut 命令正则表达式

Zeppelin 配置访问 REST APIApache Zeppelin Configuration REST API

【Torch】最简洁logging使用指南

Linxu常用命令技巧汇总

27. Remove Element(列表)题目代码

《Linux命令行与Shell脚本编程大全第2版.布卢姆》pdf

ACS基本配置-权限等级管理

传说FreeBSD等比Linux更稳定，更“健壮”

无人机--飞控科普

27 Best Free Eclipse Plug-ins for Java Developer to be ProductiveCode Quality PluginsText Editor PluginsDependency ManagementVersion Control Integration PluginsFramework Development Continuous Integration Related PluginsOther Utility Plugins

Cloud Studio初体验

使用 ctypes 进行 Python 和 C 的混合编程

【python】【数据处理】画多维数据分布图

【python】netconf协议对接管理设备

「Python 网络自动化」NETCONF —— Python 使用 NETCONF 管理配置 H3C 网络设备

在python中创建excel并写入