0
我想只提取在的Python IP地址和URL
153.12.123.123 - - [13/Nov/2014:15:06:43 -0700] "GET /icons/AHPS/0.06.png HTTP/1.1" 123 1234 "http://198.123.123.123/index.html" "Mozilla/5.0 (Windows NT 6.1; WOW64; rv:33.0) Gecko/1234567 Firefox/33.0"
153.12.123.123 - - [13/Nov/2014:15:06:43 -0700] "GET /icons/AHPS/0.06.png HTTP/1.1" 123 1234 "http://abc.weatherabc.org/?Center=38.123456789" "Mozilla/5.0 (Windows NT 6.1; WOW64; rv:33.0) Gecko/1234556 Firefox/33.0"
我目前使用命令行上此表达式的格式包含数据的记录的IP地址和URL部分正则表达式:
[^\"]*\"[^\"]*\"[^\"]*\"([^\"]*)\"
和它产生的这些作为结果:
http://198.123.123.123/index.html
http://abc.weatherabc.org/?Center=38.123456789
但是我想REGUL AR表达产生仅这些部分:
http://198.123.123.123/
http://abc.weatherabc.org/
或
http://198.123.123.123
http://abc.weatherabc.org
请帮助。提前致谢!
所以,你的问题其实是关于从URL中取出部件? –
你不能用这个吗? 'http:\/\/[0-9a-zA-z \。] +' – David
'(?<= \“)http [^ \”] +(?= /)'? [链接](https://regex101.com/r/YsqivJ/1) – Gurman