pandas知識點總結

pandas學習筆記

1.一維資料結構 series 物件

b=pd.series(data=[1,2,3])  #利用陣列建立series物件
b>>>
0    1
1    2
2    3
dtype: int64
type(b)
>>>pandas.core.series.series
a =   #利用字典dict建立series物件
c = pd.series(data=a)
c>>>
1    whj
2    xhl
3     xj
dtype: object
type(c)
>>> pandas.core.series.series
c.index			#獲取下標
>>> index(['1', '2', '3'], dtype='object')
c.values			#獲取值
>>> array(['whj', 'xhl', 'xj'], dtype=object)
c[0]			#利用下標進行索引
>>>'whj'
c[1]
>>>'xhl'
c + '   nihao!'			#進行拼接
>>> 
1    whj   nihao!
2    xhl   nihao!
3     xj   nihao!
dtype: object
c>>>
1    whj
2    xhl
3     xj
dtype: object

2.二維資料結構 dataframe 物件

columns=['語文','數學','英語'],

index=['熊寒露','王韓健','熊健'])

a>>>

語文數學英語

熊寒露 85 90 95

王韓健 82 86 90

熊健 90 60 75

a['語文']

>>>

熊寒露 85

王韓健 82

熊健 90

name: 語文, dtype: int64

del a['英語'] #刪除列

a>>>

語文數學

熊寒露 85 90

王韓健 82 86

熊健 90 60

#層次化索引，復合索引

data = pd.dataframe(data=,)

data

>>>

month year salary

0 12 2013 55

1 3 2014 46

2 6 2014 73

3 9 2014 89

data.set_index(['year','month'])

>>>

salary

year month

2013 12 55

2014 3 46

6 73

9 89

data = pd.dataframe(data=,)

data

>>>

year month salary

0 2020 1 4k

1 2020 5 5k

2 2019 6 4k

3 2020 9 5k

data.set_index(['year','month'])

>>>

salary

year month

2020 1 4k

5 5k

2019 6 4k

2020 9 5k

3.重置索引 reindex()

#dataframe直接索引只支援先列後行，不支援先行後列的索引方式

#但是支援對行的切片

data

>>>

year month salary

0 2020 1 4k

1 2020 5 5k

2 2019 6 4k

3 2020 9 5k

data[:3]

>>>

year month salary

0 2020 1 4k

1 2020 5 5k

2 2019 6 4k

data[1] #直接對行進行索引會報錯

traceback (most recent call last):

file "pandas\_libs\hashtable_class_helper.pxi", line 1619, in pandas._libs.hashtable.pyobjecthashtable.get_item

file "pandas\_libs\hashtable_class_helper.pxi", line 1627, in pandas._libs.hashtable.pyobjecthashtable.get_item

keyerror: 1

4.算術運算與資料對齊

a = pd.series(data=range(3))     #series一維陣列
b = pd.series(data=range(6))
a>>>
0    0
1    1
2    2
dtype: int64
b>>>
0    0
1    1
2    2
3    3
4    4
5    5
dtype: int64
a+b>>>
0    0.0
1    2.0
2    4.0
3    nan
4    nan
5    nan
dtype: float64
a.add(b,fill_value=0)     #對a裡面沒有的陣列進行填充fill_value=0
>>>
0    0.0
1    2.0
2    4.0
3    3.0
4    4.0
5    5.0
dtype: float64

Pandas知識點總結

import pandas as pd s pd.series 1 2,3 4,5 6,np.nan np.nan相當於什麼都沒有 dates pd.date range 20200101 periods 6 初始化六個日期 df pd.dataframe np.random.randn 6,4 i...

Pandas常用知識點總結

pandas是python中高效能的資料分析庫。為資料的統計分析帶來了極大的便利。本文以pandas中最常用的資料結構dataframe為主，總結常用知識點，如增刪改查，分組統計等等。df pd.dataframe df.emptydf.drop duplicates df.replace rege...

Pandas知識點梳理（一）

一 pandas簡介 pandas 是基於numpy 的乙個資料分析包，該工具是為了解決資料分析任務而建立的。pandas 納入了大量庫和一些標準的資料模型，提供了高效地操作大型資料集所需的工具。pandas提供了大量能使我們快速便捷地處理資料的函式和方法。pandas的資料結構 series及da...

pandas知識點總結

Pandas知識點總結

Pandas常用知識點總結

Pandas知識點梳理（一）

相關推薦