replace all links in file

2018-06-04 01:04:33

This question already has an answer here:

Regular expression to match a line that doesn't contain a word? 25 answers

Try ~<a href="(.(?!remedy))*?".*?>(.*?)</a>~i

To the question, what you are doing wrong: Regexes match ever if anyhow possible and for each url (even that containing remedy ) it is possible to match '~<a href=".*?(?!remedy).*?".*?>(.*?)</a>~i' because you did not specify remedy may not be contained anywhere in the attribute but you specified there must be anything/nothing ( .*? ) that is not followed by remedy and that is the case for any url except those that begin with exactly <a href="remedy" . Hope one can understand that...

I would probably use this:

<a href="(?:(?!remedy)[^"])*"[^>]*>([^<]*)</a>

The most interesting part is this:

"(?:(?!remedy)[^"])*"

Each time the [^"] is about to consume another character, it yields to the lookahead so it confirm that it's not the first character of the word remedy . Using [^"] instead of . prevents it from looking at anything beyond the closing quote. I also took the liberty of replacing your .*? s with negated character classes. This serves the same purpose, keeping the match "corralled" in the area where you want it to match. It's also more efficient and more robust.

Of course, I'm assuming the <a> element's content is plain text, with no more elements nested inside it. In fact, that's just one of many simplifying assumptions I've made. You can't match HTML with regexes without them.

链接地址: http://www.djcxy.com/p/13386.html

上一篇: 如何替换vim中包含模式的所有单词？

下一篇: 替换文件中的所有链接