2016-11-16 204 views
-1

我有一个列表。它是随机的。我想要重复删除列表中的项目。从列表中删除略有重复的项目Python

我只想要这些网址(项目)在我的列表中。

/watch?v=iOpJywrdCuQ&lc=z13ohzibgwq3yjw4u04cglzhxmioy5phrn00k 

我只会从我的列表中删除这些URL(项目)。

/watch?v=iOpJywrdCuQ&lc=z13ohzibgwq3yjw4u04cglzhxmioy5phrn00k.1479303517063659 

urlList是保存它的变量。

/watch?v=iOpJywrdCuQ&lc=z12ntbeg5nvptj2pv04chz5roqfpyxpp1to0k 
/watch?v=iOpJywrdCuQ&lc=z13ohzibgwq3yjw4u04cglzhxmioy5phrn00k 
/watch?v=iOpJywrdCuQ&lc=z13ohzibgwq3yjw4u04cglzhxmioy5phrn00k.1479303517063659 
/watch?v=iOpJywrdCuQ&lc=z12zt3fagtymh3nta23dhpsohmjocfwqd04 
/watch?v=iOpJywrdCuQ&lc=z12mv3rgatfcf3jvq22tuzgzcmfadbtvi 
/watch?v=iOpJywrdCuQ&lc=z13ohrfy3mb5xphyh04cd3shfq2bzhr5up00k 
/watch?v=iOpJywrdCuQ&lc=z13ohrfy3mb5xphyh04cd3shfq2bzhr5up00k.1475086029972480 
/watch?v=iOpJywrdCuQ&lc=z13ohrfy3mb5xphyh04cd3shfq2bzhr5up00k.1475086070389366 
/watch?v=iOpJywrdCuQ&lc=z13qulw4jvafgxo1322hdfnitwv5u35d0 
/watch?v=iOpJywrdCuQ&lc=z13qulw4jvafgxo1322hdfnitwv5u35d0.1478197028336117 
/watch?v=iOpJywrdCuQ&lc=z12xxv3r3py3jv5en23vjft4gxb3jhnkx 
/watch?v=iOpJywrdCuQ&lc=z12xxv3r3py3jv5en23vjft4gxb3jhnkx.1478703511076217 
/watch?v=iOpJywrdCuQ&lc=z12fzxwqwlr4xdkao04cflhbawivd1gq2jc 
/watch?v=iOpJywrdCuQ&lc=z12oczcbksudef3cq23psfd4cqjycro5e04 
/watch?v=iOpJywrdCuQ&lc=z12oczcbksudef3cq23psfd4cqjycro5e04.1478790261949273 
/watch?v=iOpJywrdCuQ&lc=z135h1sxqprthvfdl23hsh2qdrmggpzsc 

我想要什么:

/watch?v=iOpJywrdCuQ&lc=z12ntbeg5nvptj2pv04chz5roqfpyxpp1to0k 
/watch?v=iOpJywrdCuQ&lc=z13ohzibgwq3yjw4u04cglzhxmioy5phrn00k 
/watch?v=iOpJywrdCuQ&lc=z12zt3fagtymh3nta23dhpsohmjocfwqd04 
/watch?v=iOpJywrdCuQ&lc=z12mv3rgatfcf3jvq22tuzgzcmfadbtvi 
/watch?v=iOpJywrdCuQ&lc=z13ohrfy3mb5xphyh04cd3shfq2bzhr5up00k 
/watch?v=iOpJywrdCuQ&lc=z13qulw4jvafgxo1322hdfnitwv5u35d0 
/watch?v=iOpJywrdCuQ&lc=z12xxv3r3py3jv5en23vjft4gxb3jhnkx 
/watch?v=iOpJywrdCuQ&lc=z12fzxwqwlr4xdkao04cflhbawivd1gq2jc 
/watch?v=iOpJywrdCuQ&lc=z12oczcbksudef3cq23psfd4cqjycro5e04 
/watch?v=iOpJywrdCuQ&lc=z135h1sxqprthvfdl23hsh2qdrmggpzsc 

我会如何呢?

+4

舍弃'.'的字符串,然后把它放在一个集。 –

回答

0

试验用于字符.匹配正则表达式(因为.\.表示被保留用于any char):

import re 
without_dot = [] 
for x in open('file.txt').read().split('\n'): 
    if not re.match('\.', x): 
     without_dot.append(x)