Pandas详解十二之排序和排名排序和排名

2023-04-12 09:55:07

约定：

import pandas as pd
import numpy as np

排序和排名

根据条件对Series对象或DataFrame对象的值排序（sorting）和排名(ranking)是一种重要的内置运算。

接下来为大家介绍如何使用pandas对象的：sort_index() / sort_values() / rank() 方法。

一、对Series排序

对Series对象排序是最常用的，可以根据Series对象的索引、值排序。

根据索引排序

se1=pd.Series(np.arange(,),index=[,,])
se1.sort_index()

代码结果：

1    10
2    12
3    11
dtype: int32

还能对字符索引排序

se2=pd.Series(np.arange(,),index=['c','d','a'])
se2.sort_index()

代码结果：

a    2
c    0
d    1
dtype: int32

按降序排序

se2.sort_index(ascending=False)

代码结果：

d    1
c    0
a    2
dtype: int32

按值排序

se3=pd.Series([,-,])
se3.sort_values()

代码结果：

1   -5
0    3
2    7
dtype: int64

NaN值会放在Series末尾

se4=pd.Series([,np.nan,-,np.nan,])
se4.sort_values()

代码结果：

2   -7.0
0    3.0
4    5.0
1    NaN
3    NaN
dtype: float64

二、对DataFrame排序

通过axis参数可以对任意轴排序

df1=pd.DataFrame(np.arange().reshape(,),index=list("bac"),columns=list("yzx"))
df1

代码结果：

y	z	x
b	1	2
a	3	4	5
c	6	7	8

df1.sort_index()

代码结果：

y	z	x
a	3	4	5
b	1	2
c	6	7	8

df1.sort_index(axis=)

代码结果：

x	y	z
b	2	1
a	5	3	4
c	8	6	7

根据一个列的值来排序

df2=pd.DataFrame({'a':[,,],'b':[,-,]})
df2.sort_values(by='b')

代码结果：

a	b
1	3	-6
20	1
2	3	18

对多个列来排序

df2.sort_values(by=['a','b'])

代码结果：

a	b
1	3	-6
2	3	18
20	1

三、排名

排名是根据Series对象或DataFrame的某几列的值进行排名，.rank(method=，ascending=,…)返回对值的排名。但需要十分注意如何处理出现相同的值。

平均排名

为相同的值分配一个平均排名

se5=pd.Series([,,,,,])
se5.rank()

代码结果：

0    1.0
1    2.5
2    5.5
3    4.0
4    2.5
5    5.5
dtype: float64

顺序排名

对于相同的值按照出现的顺序排名

se5.rank(method="first")

代码结果：

0    1.0
1    2.0
2    5.0
3    4.0
4    3.0
5    6.0
dtype: float64

最小值排名

对于相同的值都取小的排名

se5.rank(method="min",ascending=False)

代码结果：

0    6.0
1    4.0
2    1.0
3    3.0
4    4.0
5    1.0
dtype: float64

最大值排名

对于相同的值都取大的排名

se5.rank(method="max",ascending=False)

代码结果：

0    6.0
1    5.0
2    2.0
3    3.0
4    5.0
5    2.0
dtype: float64

降序排名

se5.rank(method="first",ascending=False)

代码结果：

0    6.0
1    4.0
2    1.0
3    3.0
4    5.0
5    2.0
dtype: float64

谢谢大家的浏览，

希望我的努力能帮助到您，

共勉！

Pandas详解十二之排序和排名排序和排名

排序和排名

一、对Series排序

二、对DataFrame排序

三、排名

继续阅读

来自python的【条件控制/语句循环/break/continue/else/pass】一、条件控制二、语句循环

无法解析的外部符号 wmain，该符号在函数 "void cdecl mainCRTStartupHelper(struct HINSTANCE *,unsigned short con......

TestLink导出用例转换工具(XML2Excel)

YAML简介和PyYAML安全操作YAML支持的类型YAML的优点：yaml的基本语法python操作

Small tricks

libsvm for python 安装

学习软件测试基础测试第七天

Zeppelin 配置访问 REST APIApache Zeppelin Configuration REST API

【Torch】最简洁logging使用指南

27. Remove Element(列表)题目代码

Cloud Studio初体验

使用 ctypes 进行 Python 和 C 的混合编程

【python】【数据处理】画多维数据分布图

【python】netconf协议对接管理设备

「Python 网络自动化」NETCONF —— Python 使用 NETCONF 管理配置 H3C 网络设备

在python中创建excel并写入