pandas處理資料

pd.read_csv('path/to/file.txt',header=0,names='ab',index=0)

names columns這個可以不寫，制定索引列是第一列，這樣就沒有序號

np.tofile("d:/python/np.txt")

類似於二維列表

充分利用map函式

df.a=df.a.map(function)

修改一列的值

new_dataframe = df[df.a!='']

這種辦法可以篩選得到新的df

自動迭代

def getphonenumfromfile(fobj):

regex = re.compile(r'1\d', re.ignorecase)

phonenums = re.findall(regex, fobj)

#print phonenums

if len(phonenums)==1:

return phonenums[0]

else:

return ''

pandas 資料處理

pandas中資料可以分為series，dataframe，panel分別表示一維至三維資料。其中在構造時，index表示行名，columns表示列名構造方式 s pd.series data index index s pd series np random randn 5 index a b ...

Pandas 處理缺失資料

import numpy as np import pandas as pd from pandas import series,dataframes series a b np.nan,c d pd.isnull s 0 false 1 false 2 true 3 false 4 false d...

pandas處理丟失資料

有兩種丟失資料的方式 none np.nan nan 1,none none是python當中自帶的，型別為python object，所以，none是不能參與到任何的計算當中的 2,np.nan np.nan是浮點型別，能參與到計算當中，但是計算的結果為nan pandas中none與np.nan...

pandas處理資料

pandas 資料處理

Pandas 處理缺失資料

pandas處理丟失資料

相關推薦