Getting rid of HTML tags using a regular expression

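I have a regular expression that successfully finds a given keyword in the text of some HTML code, but I can't get rid of the < or > symbols around the matched text.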

This is my regular expression:

[><][^><=]*climate[> - <][^<|//]*

And this is the result:

>Formation of coastal sea ice in North Pacific drives ocean circulation and climate<

Any suggestions?

2016-11-24 jim

What program are you using to parse this? What flavor of regex? Also see this question: http://stackoverflow.com/questions/181095/regular-expression-to-extract-text-from-html – sakurashinken

Try '<.*>(.*)' –

http://stackoverflow.com/a/1732454/2864740 – user2864740

Add lookarounds (a lookbehind and a lookahead) instead of matching the < and > characters themselves:

(?<=[><])[^><=]*climate(?=[> - <])[^<|/]* 
^^^^^^^^^              ^^^^^^^^^^^
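
To see the difference, here is a minimal sketch in Python. The regex flavor is not stated in the question, so Python's re module is an assumption, and the <a> tag around the headline, the variable names and the first_match helper are all made up for illustration. Both patterns are copied from the thread unchanged.

```python
import re

# Hypothetical HTML fragment; only the headline text comes from the question.
html = ('<a href="/news/1">Formation of coastal sea ice in North Pacific '
        'drives ocean circulation and climate</a>')

# Pattern from the question: the leading [><] and trailing [> - <]
# consume the angle brackets, so they end up in the match.
original = re.compile(r'[><][^><=]*climate[> - <][^<|//]*')

# Pattern from this answer: the lookbehind and lookahead only assert that
# the brackets are there, without making them part of the match.
lookaround = re.compile(r'(?<=[><])[^><=]*climate(?=[> - <])[^<|/]*')


def first_match(pattern, text):
    """Return the text of the first match, or None if nothing matches."""
    m = pattern.search(text)
    return m.group(0) if m else None


print(first_match(original, html))    # brackets included
print(first_match(lookaround, html))  # brackets excluded
```

The lookbehind here is a single character wide, which matters because some engines (Python's re among them) only accept fixed-width lookbehinds.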

2016-11-24 06:23:58

Works perfectly, thank you soo much! And thanks for the explanation! – jim

@jim Glad to help :) –

Just use [^<>]*climate as the regex; that should do it. Or did I misunderstand your question?

A nice helper for testing and building regular expressions: http://regexr.com/
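
A quick check of that pattern, sketched in Python (an assumption, since the question does not name a regex flavor). The sample markup and variable names are hypothetical. The second case is a caveat I noticed while trying it: because the pattern does not require a preceding >, it can also match inside a tag when the keyword appears in an attribute.

```python
import re

# The simpler pattern: any run of non-bracket characters ending in "climate".
simple = re.compile(r'[^<>]*climate')

# Hypothetical fragment; only the headline text comes from the question.
html = ('<a href="/news/1">Formation of coastal sea ice in North Pacific '
        'drives ocean circulation and climate</a>')
text_match = simple.search(html).group(0)
print(text_match)

# Caveat: the keyword inside an attribute value is matched as well.
attr_html = '<a class="climate">text</a>'
attr_match = simple.search(attr_html).group(0)
print(attr_match)
```

If the keyword can show up in attributes, the lookaround version or a real HTML parser is probably the safer choice.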

2016-11-24 06:23:05
