Regex: How can I find the html files whose links are not the same in different places?

2024-9-16

I have this link at the start of my html page:

<link rel="canonical" href="https://neculaifantanaru.com/en/experiencing-the-faith-shed-in-an-astral-moment.html" />

There is also another link in the middle of the file:

<a href="https://neculaifantanaru.com/en/experiencing-the-faith-shed-in-an-astral-moment.html"><img src="index_files/flag_lang_en.jpg" title="en" alt="en" width="28" height="19" /></a>

As you can see, it is the same link, but in a different context and place. Compare the two.

But how can I find the html files in which these two links are not the same?

Suppose the first link is: <link rel="canonical" href="https://neculaifantanaru.com/en/eyes-accustomed-to-seeing-the-unseen.html" /> In that case the two links are not the same, so the regex should find the files that contain different links.

How can I do this with Regex?

Answer 1

@Toto：

Ctrl+F
Find what: <link .*?href=("[^"]+").*?(?:href=(?!\1).)+
Check Wrap around
Check Regular expression
Check . matches newline
Find All in Current Document
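
For checking many files at once outside the editor, the same comparison can be sketched in Python. This is only a minimal sketch under stated assumptions, not part of the answer above: it assumes the canonical tag and the English flag link are written exactly as in the question (same attribute order and quoting), and the names links_differ and find_mismatched_files are hypothetical. Unlike the Find what pattern, which compares the first href after "<link " with any later href in the file, this sketch compares only the two tags the question mentions.

```python
import re
from pathlib import Path

# Assumption: both tags use the attribute order shown in the question.
CANONICAL_RE = re.compile(r'<link\s+rel="canonical"\s+href="([^"]+)"')
FLAG_EN_RE = re.compile(
    r'<a\s+href="([^"]+)"><img\s+src="index_files/flag_lang_en\.jpg"'
)


def links_differ(html: str) -> bool:
    """Return True when both links are present and point to different URLs."""
    canonical = CANONICAL_RE.search(html)
    flag = FLAG_EN_RE.search(html)
    if canonical is None or flag is None:
        # One of the two tags is missing, so there is nothing to compare.
        return False
    return canonical.group(1) != flag.group(1)


def find_mismatched_files(folder: str) -> list:
    """List the .html files directly inside folder whose two links differ."""
    mismatched = []
    for path in sorted(Path(folder).glob("*.html")):
        text = path.read_text(encoding="utf-8", errors="ignore")
        if links_differ(text):
            mismatched.append(path)
    return mismatched
```

Calling find_mismatched_files on the folder that holds the pages returns the paths of the files to fix. Files that lack either tag are skipped rather than reported.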
