Howie Chen
建中資訊社 副社長兼學術 陳泓宇
Data Processing
Introduction
> pip install pandasCreating
一維陣列
import pandas
my_data1 = pandas.Series([56, 32, 548], index=['2021', '2022', '2023'], name='count')二維數據 可以想像成 excel
import pandas
my_data2 = pandas.DataFrame({'number': [5, 7, 13], 'score': [131, 2, 8]})import pandas
print(my_data1)
print(my_data2)Writing
import pandas
my_data1.to_csv("data.csv")Reading
import pandas
input_data = pandas.read_csv("flie.csv")Index
import pandas
my_data3 = pandas.DataFrame({"score1": [78, 94, 68, 97, 68], "score2": [3, 4, 7, 6, 4])
print(my_data3.iloc[4])
print(my_data3.iloc[1:4])
print(my_data3.loc[1:4])
print(my_data3.iloc[1:4, 0])Insightes
import pandas
melbourne_data = pandas.read_csv("melb_data.csv")
print(melbourne_data.describe())| count | 有幾行值 |
| mean | 平均 |
| std | 標準差 |
| min | 最小值 |
| 25% | PR25 |
| 50% | PR50 |
| 75% | PR75 |
| max | 最大值 |
import pandas
reviews.rooms.map(lambda p: p - 1)By Howie Chen