Keeping rows in a data frame that satisfy multiple constraints
I have a data frame as follows:
> sampledput
V1 V2 V3
1 GSM1010983 adipose Bisulfite-Seq
2 GSM1120330 adipose Bisulfite-Seq
3 GSM1120331 adipose Bisulfite-Seq
4 GSM1282348 adipose Bisulfite-Seq
5 GSM1282357 adipose Bisulfite-Seq
6 GSM906416 adipose ChIP-Seq input
7 GSM906394 adipose H3K27ac
8 GSM1010958 adipose mRNA-Seq
9 GSM1120304 adipose mRNA-Seq
10 GSM1120305 adipose mRNA-Seq
11 GSM621443 adipose derived mesenchymal stem cells ChIP-Seq input
12 GSM621420 adipose derived mesenchymal stem cells H3K27me3
13 GSM621446 adipose derived mesenchymal stem cells H3K36me3
14 GSM621418 adipose derived mesenchymal stem cells H3K4me1
15 GSM621458 adipose derived mesenchymal stem cells H3K4me3
16 GSM670020 adipose derived mesenchymal stem cells H3K9ac
17 GSM621398 adipose derived mesenchymal stem cells H3K9me3
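I want to keep the rows where the value in column V2 stays the same (for example, adipose) and the values in column V3 include Bisulfite-Seq, H3K27ac, ChIP-Seq input and mRNA-Seq. If a value in V3 is duplicated, only one of those rows should be kept; as you can see, I picked just one row each for mRNA-Seq and Bisulfite-Seq. So the output I would get is: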
5 GSM1282357 adipose Bisulfite-Seq
6 GSM906416 adipose ChIP-Seq input
7 GSM906394 adipose H3K27ac
8 GSM1010958 adipose mRNA-Seq
Here is the dput:
structure(list(V1 = structure(c(2L, 5L, 6L, 7L, 8L, 17L, 16L,
1L, 3L, 4L, 12L, 11L, 13L, 10L, 14L, 15L, 9L), .Label = c("GSM1010958",
"GSM1010983", "GSM1120304", "GSM1120305", "GSM1120330", "GSM1120331",
"GSM1282348", "GSM1282357", "GSM621398", "GSM621418", "GSM621420",
"GSM621443", "GSM621446", "GSM621458", "GSM670020", "GSM906394",
"GSM906416"), class = "factor"), V2 = structure(c(1L, 1L, 1L,
1L, 1L, 1L, 1L, 1L, 1L, 1L, 2L, 2L, 2L, 2L, 2L, 2L, 2L), .Label = c("adipose",
"adipose derived mesenchymal stem cells"), class = "factor"),
V3 = structure(c(1L, 1L, 1L, 1L, 1L, 2L, 3L, 10L, 10L, 10L,
2L, 4L, 5L, 6L, 7L, 8L, 9L), .Label = c("Bisulfite-Seq",
"ChIP-Seq input", "H3K27ac", "H3K27me3", "H3K36me3", "H3K4me1",
"H3K4me3", "H3K9ac", "H3K9me3", "mRNA-Seq"), class = "factor")), .Names = c("V1",
"V2", "V3"), class = "data.frame", row.names = c(NA, -17L))
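Why don't the first four rows satisfy your constraints? The value in V2 is 'adipose' and the value in 'V3' contains 'Bisulfite-Seq'. – ZachTurn
@ZachTurn Yes, you are right, they would also qualify for the output. – Newbie
@ZachTurn What I actually want here is to remove the duplicates and keep only one row per category. – Newbie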
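For reference, the selection described in the question could be sketched in base R roughly as follows. This is only one reading of the requirement. It assumes that a V2 group is kept only when it contains all four wanted V3 values, which is why the ChIP-Seq input row of the stem cell group is dropped, and that "only take one" means keeping the first row per V3 value. The data frame below is a shortened copy of the example data so that the snippet runs on its own; `wanted`, `hits`, `n_found`, `complete` and `result` are names made up for this illustration.

```r
# Shortened copy of the example data (same columns as in the question).
sampledput <- data.frame(
  V1 = c("GSM1010983", "GSM1120330", "GSM906416", "GSM906394",
         "GSM1010958", "GSM1120304", "GSM621443", "GSM621420"),
  V2 = c(rep("adipose", 6),
         rep("adipose derived mesenchymal stem cells", 2)),
  V3 = c("Bisulfite-Seq", "Bisulfite-Seq", "ChIP-Seq input", "H3K27ac",
         "mRNA-Seq", "mRNA-Seq", "ChIP-Seq input", "H3K27me3"),
  stringsAsFactors = FALSE
)

# The four V3 values named in the question.
wanted <- c("Bisulfite-Seq", "H3K27ac", "ChIP-Seq input", "mRNA-Seq")

# Step 1: keep rows whose V3 is one of the wanted values.
hits <- sampledput[sampledput$V3 %in% wanted, ]

# Step 2: within each V2 group, keep only the first row per V3 value.
hits <- hits[!duplicated(hits[, c("V2", "V3")]), ]

# Step 3 (assumption): keep only V2 groups that have all wanted values.
n_found <- tapply(as.character(hits$V3), as.character(hits$V2),
                  function(x) length(unique(x)))
complete <- names(n_found)[n_found == length(wanted)]
result <- hits[hits$V2 %in% complete, ]

print(result)
```

Because `duplicated()` flags every occurrence after the first, this keeps GSM1010983 for Bisulfite-Seq rather than GSM1282357 as in the desired output above. If any single row per category is acceptable, as the comments suggest, the difference should not matter.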