Dataframe获取指定列并将其单独保存
- 问题描述
- 解决办法
问题描述
根据项目需要,要把表格拆分成两部分,一部分做训练一部分做测试。
解决办法
在jupyter notebook实现。
import pandas as pd
import os
from sklearn import datasets
from pandas import DataFrame
import numpy as np
import seaborn as sns
import matplotlib.pyplot as plt
data = pd.read_csv('./6.csv', sep=';',index_col='datetime', parse_dates=True).drop(['changepoint'], axis=1)
data
df_train = data.iloc[:550].drop(['anomaly'], axis=1)
df_test = data.iloc[550:]
df_train.to_csv('./generated_' + '/train.csv')
df_train
df_test.to_csv('./generated_' + '/test.csv')
df_test
拆分保存的表格如下。
完整代码:
import pandas as pd
import os
from sklearn import datasets
from pandas import DataFrame
import numpy as np
import seaborn as sns
import matplotlib.pyplot as plt
data = pd.read_csv('./6.csv', sep=';',index_col='datetime', parse_dates=True).drop(['changepoint'], axis=1)
df_train = data.iloc[:550].drop(['anomaly'], axis=1)
df_test = data.iloc[550:]
df_train.to_csv('./generated_' + '/train.csv')
df_test.to_csv('./generated_' + '/test.csv')