RegEx - Searching for specific content in quotes




I know RegEx is NOT the ideal tool for searching within HTML, but it's what I've been given to work with. Note: I'm not looking for something that will be robust across websites. For example, I'm only considering quotation marks, and I'm not worried about apostrophe characters.



Suppose I have the following text:


The quick brown "fox.jpg" jumps "google.com" over the "lazy.png" dog.



I want to search for specific image links, matching "fox.jpg" and "lazy.png" while ignoring "google.com". I could theoretically use a search pattern like


".*?"



that would find all quotes, from which I could simply parse each match to determine whether or not it's an image.
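As a rough illustration of that two-step approach, here is a minimal Python sketch. The language choice, the variable names, and the `IMAGE_EXTENSIONS` tuple are assumptions made for the example; the question does not say which regex engine is in use.

```python
import re

text = 'The quick brown "fox.jpg" jumps "google.com" over the "lazy.png" dog.'

# Step 1: grab every double-quoted span, capturing only the text between the quotes.
quoted = re.findall(r'"(.*?)"', text)

# Step 2: keep only the values that end in an image extension.
# IMAGE_EXTENSIONS is a hypothetical list chosen for this example.
IMAGE_EXTENSIONS = (".jpg", ".png")
images = [q for q in quoted if q.lower().endswith(IMAGE_EXTENSIONS)]

print(quoted)  # ['fox.jpg', 'google.com', 'lazy.png']
print(images)  # ['fox.jpg', 'lazy.png']
```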



But something like


".*?(jpg|png)"



doesn't work because it returns "fox.jpg" (good) and "google.com" over the "lazy.png" (bad).
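The behaviour can be reproduced with a short Python sketch (Python is an assumption here; other backtracking engines should behave the same way for this pattern). The second match starts at the opening quote of "google.com" and the lazy `.*?` keeps expanding, across other quotes, until it reaches the first `png"`.

```python
import re

text = 'The quick brown "fox.jpg" jumps "google.com" over the "lazy.png" dog.'

# The dot also matches quotation marks, so a match may span several quoted strings.
matches = [m.group(0) for m in re.finditer(r'".*?(jpg|png)"', text)]

for match in matches:
    print(match)
# "fox.jpg"
# "google.com" over the "lazy.png"
```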



So: is there an extra "greedy" setting that I'm missing? Something to tell RegEx that the first quotation mark of the match should be the quotation mark closest to the last quotation mark?





.* is greedy; .*? is not. If it were greedy, your only match would be "fox.jpg" jumps "google.com" over the "lazy.png". Starting from the quote before google.com, "google.com" over the "lazy.png" does match the fewest characters. A regex engine always returns the leftmost match, even if a "better" match could be found later: regular-expressions.info/engine.html
– Lee Kowalkowski
Aug 24 at 1:16







1 Answer



After the first ", try repeating anything but a " by using a negated character set instead of ., which will (undesirably) match a ":


"[^"]*(jpg|png)"



https://regex101.com/r/PKZLp5/1



It no longer matters whether the repetition is lazy or greedy, though when the filename is longer than the file extension, greedy repetition will find a match slightly faster.
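A minimal Python sketch of the negated character set, run against the sample text from the question (Python and the variable names are assumptions for illustration). It also checks that the lazy variant returns the same matches as the greedy one.

```python
import re

text = 'The quick brown "fox.jpg" jumps "google.com" over the "lazy.png" dog.'

# [^"] cannot match a quotation mark, so a match can never cross a quote.
greedy = re.compile(r'"[^"]*(jpg|png)"')
lazy = re.compile(r'"[^"]*?(jpg|png)"')

greedy_matches = [m.group(0) for m in greedy.finditer(text)]
lazy_matches = [m.group(0) for m in lazy.finditer(text)]

print(greedy_matches)  # ['"fox.jpg"', '"lazy.png"']
print(lazy_matches)    # ['"fox.jpg"', '"lazy.png"']
```

Note that the pattern as written does not require a dot before the extension, so a quoted string such as "notajpg" would also match.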





This is perfect! I was playing with [^"] but I was still using .*?, which I think broke it. Thank you so much!!
– Matthew
Aug 24 at 1:16





