1
这里充满价值的NAS原题: Group by min and fill NAs with value from another column集团通过分和另一列第2部分
我有这样的数据帧:
mydf = pd.DataFrame (data = {'uid': [1,1,1,2,2,3,4,4,4,4], 'pagename':
['home', 'blah',
'blah', 'home', 'blah', 'blah','blah','home','blah','blah'], 'startpage':
[np.nan, np.nan, np.nan, 'home',
'home', 'blah',np.nan,np.nan,np.nan,np.nan], 'date_time':
[0,1,2,5,9,1,1,2,3,4], 'page_event': [0,0,0,0,0,0,10,0,0,10]})
我想这个数据帧:
endingdf = pd.DataFrame (data = {'uid': [1,1,1,2,2,3,4,4,4,4], 'pagename':
['home', 'blah', 'blah', 'home', 'blah','blah','blah','home','blah','blah'],
'startpage': [np.nan, np.nan, np.nan, 'home',
'home','blah',np.nan,np.nan,np.nan,np.nan],
'date_time': [0,1,2,5,9,1,1,2,3,4], 'page_event': [0,0,0,0,0,0,10,0,0,10],
'new_start_page':['home', 'home', 'home', 'home', 'home', 'blah', 'home',
'home', 'home', 'home']})
我想要做的是按UID
分组,如果startpage
为NULL
,则使用fir st pagename
的访问(min_ date_time)但只有当page_event = 0
。所以如果第一个pagename
有page_event = 10
那就跳过那个,直到page_event = 0
。