Data Processing
Introduction
> pip install pandas
Creating
一維陣列
import pandas
my_data1 = pandas.Series([56, 32, 548], index=['2021', '2022', '2023'], name='count')
二維數據 可以想像成 excel
import pandas
my_data2 = pandas.DataFrame({'number': [5, 7, 13], 'score': [131, 2, 8]})
import pandas
print(my_data1)
print(my_data2)
Writing
import pandas
my_data1.to_csv("data.csv")
Reading
import pandas
input_data = pandas.read_csv("flie.csv")
Index
import pandas
my_data3 = pandas.DataFrame({"score1": [78, 94, 68, 97, 68], "score2": [3, 4, 7, 6, 4])
print(my_data3.iloc[4])
print(my_data3.iloc[1:4])
print(my_data3.loc[1:4])
print(my_data3.iloc[1:4, 0])
Insightes
import pandas
melbourne_data = pandas.read_csv("melb_data.csv")
print(melbourne_data.describe())
count | 有幾行值 |
mean | 平均 |
std | 標準差 |
min | 最小值 |
25% | PR25 |
50% | PR50 |
75% | PR75 |
max | 最大值 |
import pandas
reviews.rooms.map(lambda p: p - 1)