2017-03-06 29 views
1
import pandas as pd 
output=pd.read_csv('output.csv',encoding='big5') 

output['airplane'].sum() 

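I am trying to sum the numbers in a DataFrame column, but when I call sum() it just prints the numbers I want to add, run together as one long string. Why aren't the numbers added up? Why doesn't my DataFrame column have a total?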

'5,0514,7834,3734,7925,4624,9404,8344,9045,1964,0213,9405,0515,1894,9805,0556,0595,7194,6255,4985,3305,3175,3785,6494,5514,3335,1605,3175,3175,3435,8994,6164,3145,2915,2445,2905,5055,7344,5074,1965,4815,6215,6135,6756,0035,1004,0065,5815,2963,1683,5623,8734,1104,2144,6275,3745,2025,8545,3684,5614,4245,3405,0525,0985,0945,6404,6874,3575,0355,0665,2985,1125,5954,9374,4015,2595,1505,2215,1755,9214,6694,3595,2935,2185,2695,3356,1814,9284,5755,1725,3925,8546,4215,5494,5423,7324,63 

Answers

1

Replace ',' with an empty string using str.replace or replace, then convert to int with astype, because ',' is a thousands separator here:

output = pd.DataFrame({'airplane':['5,051','4,783','4,373']}) 
print (output) 
    airplane 
0 5,051 
1 4,783 
2 4,373 

print (output['airplane'].sum()) 
5,0514,7834,373 

print (output['airplane'].str.replace(',','').astype(int).sum()) 
14207 

print (output['airplane'].replace(',','', regex=True).astype(int).sum()) 
14207 
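
As a minimal sketch of why sum() behaves this way: when the column has dtype object and holds strings, adding the values with + concatenates them instead of adding numbers. The sample values are the three from the answer above; the dtype=object argument is an assumption made here to mirror the dtype reported in the comments below.

```python
import pandas as pd

# Sample values from the answer; dtype=object is set explicitly (an assumption)
# so the column behaves like the one read from the CSV in the question.
output = pd.DataFrame({'airplane': ['5,051', '4,783', '4,373']}, dtype=object)

# Each cell is a Python str, so summing joins the strings end to end.
concatenated = output['airplane'].sum()
print(concatenated)

# Strip the thousands separator literally (regex=False), then cast to int.
numeric = output['airplane'].str.replace(',', '', regex=False).astype(int)
total = numeric.sum()
print(total)
```

Passing regex=False makes the comma a literal character, which avoids any dependence on the pandas version's default for that argument.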

But you can also try adding the thousands parameter to read_csv:

output=pd.read_csv('output.csv',encoding='big5', thousands=',') 

print (output['airplane'].sum()) 
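
The thousands parameter can be tried without the original file. The sketch below reads a small in-memory CSV instead of output.csv; the CSV text is an assumption (a single quoted column holding the three sample values from the answer), and the encoding argument is left out because the data is already a Python string.

```python
import io
import pandas as pd

# Hypothetical stand-in for output.csv: values are quoted because they
# contain the same character that separates the fields.
csv_text = 'airplane\n"5,051"\n"4,783"\n"4,373"\n'

# thousands=',' tells the parser to treat ',' inside a number as a separator.
output = pd.read_csv(io.StringIO(csv_text), thousands=',')

print(output['airplane'].dtype)
total = output['airplane'].sum()
print(total)
```

If the column still comes back as object after this, some values probably contain something other than digits and commas.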
+0

But my numbers contain ',' and I can't convert them. – ben

+0

OK, what does print(output['airplane'].head()) show? – jezrael

+0

0 5,051 1 4,783 2 4,373 3 4,792 4 5,462 Name: airplane, dtype: object – ben
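
The thread does not say why the conversion failed for ben. One way to investigate, sketched here under assumptions, is to strip the commas and let pd.to_numeric with errors='coerce' turn anything unparseable into NaN, then look at the rows that failed. The last two sample values (' 4,373 ' with spaces and 'n/a') are invented for illustration and do not come from the original data.

```python
import pandas as pd

# Hypothetical sample: the first two values are from the thread, the last
# two are made up to show what a problematic cell might look like.
output = pd.DataFrame({'airplane': ['5,051', '4,783', ' 4,373 ', 'n/a']}, dtype=object)

# Remove the thousands separator and surrounding whitespace.
cleaned = output['airplane'].str.replace(',', '', regex=False).str.strip()

# errors='coerce' converts values that are not numbers to NaN instead of raising.
numeric = pd.to_numeric(cleaned, errors='coerce')

# Show the original text of every row that could not be converted.
bad_rows = output.loc[numeric.isna(), 'airplane']
print(bad_rows)

# sum() skips NaN by default, so this totals only the rows that parsed.
total = numeric.sum()
print(total)
```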