pandas - How to calculate group by cumulative sum for multiple columns in python
I have a data set like this:
data = pd.DataFrame({'id': pd.Series([1,1,1,2,2,3,3,3]), 'var1': pd.Series([1,2,3,4,5,6,7,8]), 'var2': pd.Series([11,12,13,14,15,16,17,18]), 'var3': pd.Series([21,22,23,24,25,26,27,28])})
Here I need to calculate the group-wise cumulative sum of the columns (var1, var2, var3) based on id. How can I write Python code to create the output as per my requirement?
Thanks in advance.
If I have understood you right, you can use DataFrame.groupby to calculate the cumulative sum across the columns, grouped by your 'id' column. Something like:
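import pandas as pd

data = pd.DataFrame({'id': [1,1,1,2,2,3,3,3], 'var1': [1,2,3,4,5,6,7,8], 'var2': [11,12,13,14,15,16,17,18], 'var3': [21,22,23,24,25,26,27,28]})

# Select the value columns explicitly instead of dropping 'id' inside the lambda,
# so the result does not depend on whether apply() passes the grouping column through.
# For each id: cumulative sum across the columns of every row, then sum those rows.
data.groupby('id')[['var1', 'var2', 'var3']].apply(lambda x: x.cumsum(axis=1).sum())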
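Note that the snippet above accumulates across the columns and then collapses each group to a single row. If what is wanted is instead a running total down the rows within each id, one running total per column, a minimal sketch could look like the following. It assumes that reading of the question; the names cols, running and result and the cum_ prefix are only illustrative.

```python
import pandas as pd

data = pd.DataFrame({
    'id':   [1, 1, 1, 2, 2, 3, 3, 3],
    'var1': [1, 2, 3, 4, 5, 6, 7, 8],
    'var2': [11, 12, 13, 14, 15, 16, 17, 18],
    'var3': [21, 22, 23, 24, 25, 26, 27, 28],
})

# Columns to accumulate (illustrative name)
cols = ['var1', 'var2', 'var3']

# Running total of each column, restarting whenever 'id' changes.
# The result keeps the original row index, so it lines up with 'data'.
running = data.groupby('id')[cols].cumsum()

# Put 'id' back next to the running totals; the 'cum_' prefix is an arbitrary choice.
result = data[['id']].join(running.add_prefix('cum_'))
print(result)
```

Because groupby(...).cumsum() returns one row per input row, the result can be joined straight back onto the original frame without resetting or re-sorting the index.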