Greedy versus reluctant (lazy) matching using quantifiers

So far, we have discussed all the quantifiers available to us in a regular expression to match fixed-size or variable-length text. These quantifiers are, by default, greedy in nature. Greediness is in terms of their matching. In a regex, quantifiers attempt to match the longest possible text, from left to right. Only when the regex engine fails to complete a match, it moves back in the input text one character at a time, as required, to complete the match. Sometimes, the regex engine moves back and forth multiple times in attempts to complete a match in a complex (nested) regular expression.

So, for example, if the input is pqrstmprt and our regular expression is p.+r, then our match will be from p at the start to the last r, that is, pqrstmpr, not pqr. It is due to the same greediness that was stated earlier, where the regex engine attempts to match the longest possible match when using quantifiers.

Regular expressions also provide a way to change this greedy behavior of the regex engine. If we place a ? (called the lazy or reluctant quantifier) in front of any of the quantifiers, then the behavior of the regex engine changes from greedy to lazy. With the lazy quantifier in place, the regex engine attempts to match the shortest match, only expanding further as required to complete the match with the remainder of the regex pattern next to the lazy quantifier.

So, in the preceding example, if we use the regex as p.+?r, then our matched text will be pqr, because pqr is the smallest possible match between p and r.

Here is a list of all the greedy quantifiers and their corresponding lazy quantifiers:

Greedy Quantifier Lazy Quantifier
m* m*?
m+ m+?
m? m??
m{X} m{X}?
m{X,} m{X,}?
m{X,Y} m{X,Y}?
..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.118.200.154