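Using the example below, I want to group the data frame by CaseWorker, then by Client, and then determine for each Client group whether the list of tasks in "Task" is the same as the list in "Task2". The idea is to create the groups with dplyr's "group_by" and then use stringr to find the differences between the groups.
I'd be happy with a simple TRUE or FALSE per group, or better still, with every task that is in "Task2" but not in "Task" extracted and shown in a new column or data frame.
So basically I need to make sure that "Task" and "Task2" contain the same entries for each client.
If possible I'd like to stick with dplyr and stringr, or at least stay within the tidyverse. I suspect there is an elegant way to do this using "group_by" and "str_detect" or some other stringr function.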
CaseWorker<-c("John","John","John","John","John","John","Melanie","Melanie","Melanie","Melanie","Melanie","Melanie")
Client<-c("Chris","Chris","Chris","Tom","Tom","Tom","Valerie","Valerie","Valerie","Tim","Tim","Tim")
Task<-c("Feed cat","Make dinner","Iron shirt","Make dinner","Do homework","Make lunch","Make dinner","Feed cat","Buy groceries","Do homework","Iron shirt","Make lunch")
Task2<-c("Feed cat","Make dinner","Iron shirt","Make dinner","Do homework","Feed cat","Make dinner","Feed cat","Iron shirt","Do homework","Iron shirt","Make lunch")
Df<-data.frame(CaseWorker,Client,Task,Task2)
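For illustration, here is a minimal sketch of one way the per-client comparison could be done. It is an assumption on my part rather than the stringr approach asked about: it swaps in the set operations "setequal" and "setdiff" inside dplyr's "summarise" instead of "str_detect". The names "result", "same_tasks" and "only_in_Task2" are hypothetical, and "stringsAsFactors = FALSE" is added so the tasks are compared as plain strings on older R versions too.

```r
library(dplyr)

# Rebuild the example data from the question so this block runs on its own.
CaseWorker <- c("John","John","John","John","John","John","Melanie","Melanie","Melanie","Melanie","Melanie","Melanie")
Client <- c("Chris","Chris","Chris","Tom","Tom","Tom","Valerie","Valerie","Valerie","Tim","Tim","Tim")
Task <- c("Feed cat","Make dinner","Iron shirt","Make dinner","Do homework","Make lunch","Make dinner","Feed cat","Buy groceries","Do homework","Iron shirt","Make lunch")
Task2 <- c("Feed cat","Make dinner","Iron shirt","Make dinner","Do homework","Feed cat","Make dinner","Feed cat","Iron shirt","Do homework","Iron shirt","Make lunch")
Df <- data.frame(CaseWorker, Client, Task, Task2, stringsAsFactors = FALSE)

# One row per CaseWorker/Client group.
result <- Df %>%
  group_by(CaseWorker, Client) %>%
  summarise(
    # TRUE when both columns hold the same set of tasks (order and repeats ignored)
    same_tasks = setequal(Task, Task2),
    # tasks that appear in Task2 but not in Task, joined into one string
    only_in_Task2 = paste(setdiff(Task2, Task), collapse = ", "),
    .groups = "drop"
  )

print(result)
```

Because the comparison is set-based, it ignores row order and repeated tasks within a client; if duplicates matter, a different check (for example comparing sorted vectors) would be needed.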
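Thanks for your answer! I've posted a "Part 2" of this question, a more complex but similar problem, in case you're interested in answering that one too. It's posted under the same question title, but with "Part 2" at the start. – Mike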