天天看點

Python資料分析之讀取檔案

Python的資料分析,大部分的教程都是想講numpy,再講Dataframe,再講讀取檔案。但我看書的時候,前面二章看的實在頭暈,是以,我們還是通過讀取檔案來開始我們的Python資料分析吧。

讀取CSV

  • 讀取csv通過read_csv讀取
import pandas as pd
zhuanti = pd.read_csv(open('C:/Users/luopan/Desktop/xiaozhu.csv',encoding='utf-8'))
zhuanti      
Python資料分析之讀取檔案
  • 設定第一列為索引
import pandas as pd
zhuanti1 = pd.read_csv(open('C:/Users/luopan/Desktop/xiaozhu.csv',encoding='utf-8'),index_col=0)
zhuanti1      
Python資料分析之讀取檔案
  • 設定header,這裡把header去掉
import pandas as pd
zhuanti2 = pd.read_csv(open('C:/Users/luopan/Desktop/xiaozhu.csv',encoding='utf-8'),index_col=0,header=None)
zhuanti2      
Python資料分析之讀取檔案
  • 跳過前2行
import pandas as pd
zhuanti3 = pd.read_csv(open('C:/Users/luopan/Desktop/xiaozhu.csv',encoding='utf-8'),skiprows=[1,2],index_col=0)
zhuanti3      
Python資料分析之讀取檔案

讀取Excel

  • 利用read_excel讀取excel檔案
import pandas as pd
test = pd.read_excel('C:/Users/luopan/Desktop/test.xlsx',sheetname='Sheet2',header=None)
test      
Python資料分析之讀取檔案

讀取MySQL

import pandas as pd
import pymysql
conn = pymysql.connect(host='localhost', user='root', passwd='123456', db='test', port=3306, charset='utf8')
jianshu = pd.read_sql('select * from jianshu1',conn)
jianshu      
Python資料分析之讀取檔案

讀取MongoDB

import pandas as pd
import pymongo
client = pymongo.MongoClient('localhost',port = 27017)
test = client['test']
tieba = test['tieba']
data = pd.DataFrame(list(tieba.find()))
data      

繼續閱讀