Python程式設計：pandas入門

2021-11-21 23:50:00

總算找到時間學習了下pandas，先學習一部分，後面繼續補充

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

# 建立序列, 讓pandas建立預設整數索引
s = pd.Series([1, 3, 5, np.nan, 6, 8])
print(s)
"""
0    1.0
1    3.0
2    5.0
3    NaN
4    6.0
5    8.0
dtype: float64
"""
# 資料讀取

# csv檔案讀取
# 預設第一行列名，資料從第二行起
# (資料從第一行起header=None)
data = pd.read_csv("data.csv")
print(data)
"""
   id name  age  score
0   1  tom   12     98
1   2  tom   12     98
2   3  tom   12     98
3   4  tom   12     98
4   5  tom   12     98
5   6  tom   12     98
6   7  tom   12     98
7   8  tom   12     98
8   9  tom   12     98
9  10  tom   12     98
"""

# 顯示前面幾行資料（預設前5行）
print(data.head())
"""
   id name  age  score
0   1  tom   12     98
1   2  tom   12     98
2   3  tom   12     98
3   4  tom   12     98
4   5  tom   12     98
"""
# 讀取尾部資料
print(data.tail())
"""
 id name  age  score
5   6  tom   12     98
6   7  tom   12     98
7   8  tom   12     98
8   9  tom   12     98
9  10  tom   12     98
"""

# 顯示列名
print(data.columns)
# Index(['id', 'name', 'age', 'score'], dtype='object')

# 顯示行号
print(data.index)
# RangeIndex(start=0, stop=10, step=1)

# 顯示大小
print(data.shape)
# (10, 4)

# 索引和計算

# 列名預設為字元串(Object)。行的index預設為整數
# 提取行
print(data.loc[3:6])
"""
   id name  age  score
3   4  tom   12     98
4   5  tom   12     98
5   6  tom   12     98
6   7  tom   12     98
"""

# 使用list提取行
print(data[3:6])
"""
   id name  age  score
3   4  tom   12     98
4   5  tom   12     98
5   6  tom   12     98
"""

# 提取列
columns = ["name", "age"]
print(data[columns])
"""
   name  age
0  tom   12
1  tom   12
2  tom   12
3  tom   12
4  tom   12
5  tom   12
6  tom   12
7  tom   12
8  tom   12
9  tom   12
"""

# 列名轉為list
print(data.columns.tolist())
# ['id', 'name', 'age', 'score']

print(data.columns.values)
# ['id' 'name' 'age' 'score']

print(data.columns.values.dtype)
# object

# 切片
print(data.loc[3:6][["name", "age"]])
"""
  name  age
3  tom   12
4  tom   12
5  tom   12
6  tom   12
"""

print(data["name"])
"""
0    tom
1    tom
2    tom
3    tom
4    tom
5    tom
6    tom
7    tom
8    tom
9    tom
Name: name, dtype: object
"""

# 取極值
print(data["id"].max())
# 10

# 運算
num = data.shape[0]
age_average = data["id"]/num
print(age_average.head())
"""
0    0.1
1    0.2
2    0.3
3    0.4
4    0.5
Name: id, dtype: float64
"""

# 排序
data.sort_values("id", inplace=False, ascending=False)
print(data.head())

Python程式設計：pandas入門

繼續閱讀

來自python的【條件控制/語句循環/break/continue/else/pass】一、條件控制二、語句循環

無法解析的外部符号 wmain，該符号在函數 "void cdecl mainCRTStartupHelper(struct HINSTANCE *,unsigned short con......

TestLink導出用例轉換工具(XML2Excel)

YAML簡介和PyYAML安全操作YAML支援的類型YAML的優點：yaml的基本文法python操作

Small tricks

libsvm for python 安裝

學習軟體測試基礎測試第七天

Zeppelin 配置通路 REST APIApache Zeppelin Configuration REST API

【Torch】最簡潔logging使用指南

27. Remove Element(清單)題目代碼

Cloud Studio初體驗

使用 ctypes 進行 Python 和 C 的混合程式設計

【python】【資料處理】畫多元資料分布圖

【python】netconf協定對接管理裝置

「Python 網絡自動化」NETCONF —— Python 使用 NETCONF 管理配置 H3C 網絡裝置

在python中建立excel并寫入