Selecting elements in pandas with loc and iloc

2023-04-12 09:51:24

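pandas lets you select elements with loc and iloc. The older ix indexer is deprecated (it was removed in pandas 1.0) and should not be used.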

import pandas as pd
data = {'AAA': [4, 5, 6, 7], 'BBB': [10, 20, 30, 40], 'CCC': [100, 50, -30, -50]}
df = pd.DataFrame(data=data, index=['foo', 'bar', 'boo', 'kar'])
df

pandas.DataFrame.iloc

Purely integer-location based indexing for selection by position.

.iloc[] is primarily integer position based (from 0 to length-1 of the axis), but may also be used with a boolean array.

# Select the rows at positions 1 and 2 (0-based); the slice is half-open, so position 3 is excluded
df.iloc[1:3]
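As a quick check of the half-open slice, the sketch below rebuilds the same df and compares positional slicing with a boolean array. The names by_slice and by_mask are illustrative only and do not come from the original post.

```python
import pandas as pd

data = {'AAA': [4, 5, 6, 7], 'BBB': [10, 20, 30, 40], 'CCC': [100, 50, -30, -50]}
df = pd.DataFrame(data=data, index=['foo', 'bar', 'boo', 'kar'])

# Positions 1 and 2 are returned; position 3 is excluded.
by_slice = df.iloc[1:3]
print(list(by_slice.index))  # ['bar', 'boo']

# A boolean list as long as the axis selects the True positions.
by_mask = df.iloc[[False, True, True, False]]
print(by_mask.equals(by_slice))  # True

# A (row position, column position) pair returns a single value.
print(df.iloc[0, 1])  # 10
```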

pandas.DataFrame.loc

Access a group of rows and columns by label(s) or a boolean array.

# Select the rows labelled 'bar' through 'kar'; unlike iloc, a label slice includes both endpoints
df.loc['bar':'kar']
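To make the difference from positional slicing concrete, here is a minimal sketch on the same df. The name by_label is illustrative only.

```python
import pandas as pd

data = {'AAA': [4, 5, 6, 7], 'BBB': [10, 20, 30, 40], 'CCC': [100, 50, -30, -50]}
df = pd.DataFrame(data=data, index=['foo', 'bar', 'boo', 'kar'])

# Label slices include both endpoints: 'bar', 'boo' and 'kar' are all returned.
by_label = df.loc['bar':'kar']
print(list(by_label.index))  # ['bar', 'boo', 'kar']

# The equivalent positional slice has to go one past the last position.
print(by_label.equals(df.iloc[1:4]))  # True

# Rows and columns can be selected together by label.
print(df.loc['foo', 'BBB'])  # 10

# A boolean Series aligned on the index also works with loc.
print(list(df.loc[df['CCC'] < 0].index))  # ['boo', 'kar']
```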

More usage

Using callable indexing

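data = {'AAA': [4, 5, 6, 7], 'BBB': [10, 20, 30, 40], 'CCC': [100, 50, -30, -50]}
df2 = pd.DataFrame(data=data)  # default integer index 0..3
df2
# Boolean mask: drop the rows where AAA <= 6 and the index label is in [0, 2, 4]
df2[~((df2.AAA <= 6) & (df2.index.isin([0, 2, 4])))]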
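The combined mask above can be hard to read in one go, so the sketch below breaks it into steps on the same data. The names mask and result are illustrative only.

```python
import pandas as pd

data = {'AAA': [4, 5, 6, 7], 'BBB': [10, 20, 30, 40], 'CCC': [100, 50, -30, -50]}
df2 = pd.DataFrame(data=data)

# AAA <= 6 holds for rows 0, 1, 2; the index is in [0, 2, 4] for rows 0 and 2.
mask = (df2.AAA <= 6) & (df2.index.isin([0, 2, 4]))
print(mask.tolist())  # [True, False, True, False]

# ~ inverts the mask, so rows 1 and 3 are kept.
result = df2[~mask]
print(list(result.index))  # [1, 3]
```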

import numpy as np
df1 = pd.DataFrame(np.random.randn(6, 4),
                   index=list('abcdef'),
                   columns=list('ABCD'))
df1

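# Select the rows where column A is greater than 0
df1.loc[lambda df: df.A > 0, :]

# Select columns A and B
df1.loc[:, lambda df: ['A', 'B']]

# Select the first and second columns by position
df1.iloc[:, lambda df: [0, 1]]

# Select the first column
df1[lambda df: df.columns[0]]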
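Because df1 is filled with random numbers, the results above change from run to run. The sketch below repeats the four selections on a small frame with fixed, made-up values (an assumption for illustration, not data from the post) so the outcome is predictable.

```python
import pandas as pd

# Fixed illustrative values instead of random ones.
df1 = pd.DataFrame(
    {'A': [1.0, -2.0, 3.0], 'B': [0.5, 0.1, -0.4], 'C': [7.0, 8.0, 9.0]},
    index=list('abc'),
)

# The callable receives the DataFrame and returns a boolean Series.
positive_a = df1.loc[lambda df: df.A > 0, :]
print(list(positive_a.index))  # ['a', 'c']

# The callable may also return a list of labels or positions.
ab = df1.loc[:, lambda df: ['A', 'B']]
first_two = df1.iloc[:, lambda df: [0, 1]]
print(first_two.equals(ab))  # True

# With plain [] the callable's return value is used as the key.
first_col = df1[lambda df: df.columns[0]]
print(first_col.name)  # A
```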

Using callable indexing on a Series

# Select the values of column A that are greater than 0
df1.A.loc[lambda s: s > 0]
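A minimal sketch of the same idea on a standalone Series with fixed, made-up values. It also shows that the callable receives whatever object is being indexed, including an intermediate result that has no name of its own.

```python
import pandas as pd

# Illustrative values only.
s = pd.Series([1.5, -0.5, 2.0, -3.0], index=list('abcd'), name='A')

positive = s.loc[lambda x: x > 0]
print(list(positive.index))  # ['a', 'c']

# (s + 1) is never assigned to a variable, yet the lambda can still refer to it.
shifted = (s + 1).loc[lambda x: x > 0]
print(list(shifted.index))  # ['a', 'b', 'c']
```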

With these methods you can chain selections without introducing temporary variables.

bb = pd.read_csv('data/baseball.csv', index_col='id')
# numeric_only=True keeps non-numeric columns out of the sum
(bb.groupby(['year', 'team']).sum(numeric_only=True).loc[lambda df: df.r > 100])
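The chain above needs data/baseball.csv on disk. As a rough, self-contained sketch, the version below swaps the file for a tiny in-memory frame. Only the column names year, team and r (and the id index) come from the snippet; every value is a made-up assumption.

```python
import pandas as pd

# Hypothetical stand-in for data/baseball.csv.
bb = pd.DataFrame({
    'id': [1, 2, 3, 4, 5],
    'year': [2006, 2006, 2007, 2007, 2007],
    'team': ['CHN', 'CHN', 'BOS', 'BOS', 'NYA'],
    'r': [60, 55, 30, 40, 120],
}).set_index('id')

# Group, aggregate and filter in one expression; the lambda sees the
# aggregated frame, which never gets a variable name of its own.
totals = (
    bb.groupby(['year', 'team'])
      .sum(numeric_only=True)
      .loc[lambda df: df.r > 100]
)
print(totals.index.get_level_values('team').tolist())  # ['CHN', 'NYA']
print(totals['r'].tolist())  # [115, 120]
```

Without the callable, the aggregated result would have to be stored in a temporary variable before it could be filtered on its own r column.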
