pandas - How to calculate group by cumulative sum for multiple columns in python -


i have data set like,

data=pd.dataframe({'id':pd.series([1,1,1,2,2,3,3,3]),'var1':pd.series([1,2,3,4,5,6,7,8]),'var2':pd.series([11,12,13,14,15,16,17,18]), 'var3':pd.series([21,22,23,24,25,26,27,28])}) 

here need calculate groupwise cumulative sum columns(var1,var2,var3) based on id. how can write python code crate output per requirement?

thanks in advance.

if have understood right, can use dataframe.groupby calculate cumulative sum across columns grouped 'id'-column. like:

import pandas pd data=pd.dataframe({'id':[1,1,1,2,2,3,3,3],'var1':[1,2,3,4,5,6,7,8],'var2':[11,12,13,14,15,16,17,18], 'var3':[21,22,23,24,25,26,27,28]}) data.groupby('id').apply(lambda x: x.drop('id', axis=1).cumsum(axis=1).sum()) 

Comments

Popular posts from this blog

c# - ItextSharp font color issue in ver 5.5.4+ -

jquery - Multiple issues with pushstate: history, loading, calling functions -

opengl es - Android GLES 2 draw line flicker and strange effects -