天天看點

python正規表達式隻比對第一個,Python正規表達式:僅擷取一個比對的表達式

python正規表達式隻比對第一個,Python正規表達式:僅擷取一個比對的表達式

So I'm wrestling with a program that matches multiple regular expressions against one statement:

import re

line = "Remind me to pick coffee up at Autostrada at 4:00 PM"

matchObj = re.match( r'Remind me to (.*) at (.*?) at (.*?) .*', line, re.M|re.I|re.M)

matchObj2 = re.match( r'Remind me to (.*) at (.*?) .*', line, re.M|re.I)

if matchObj:

print("matchObj.group() : ", matchObj.group())

print("matchObj.group(1) : ", matchObj.group(1))

print("matchObj.group(2) : ", matchObj.group(2))

print("matchObj.group(3) :", matchObj.group(3))

else:

print("No match!!")

if matchObj2:

print("matchObj2.group() : ", matchObj2.group())

print("matchObj2.group(1) : ", matchObj2.group(1))

print("matchObj2.group(2) : ", matchObj2.group(2))

else:

print("No match!!")

Now, I want only one regex to match at a time, like this:

matchObj.group() : Remind me to pick coffee up at Autostrada at 4:00 PM

matchObj.group(1) : pick coffee up

matchObj.group(2) : Autostrada

matchObj.group(3) : 4:00

Instead, both the regexes match up to the statement, like this:

matchObj.group() : Remind me to pick coffee up at Autostrada at 4:00 PM

matchObj.group(1) : pick coffee up

matchObj.group(2) : Autostrada

matchObj.group(3) : 4:00

matchObj2.group() : Remind me to pick coffee up at Autostrada at 4:00 PM

matchObj2.group(1) : pick coffee up at Autostrada

matchObj2.group(2) : 4:00

Only matchObj should be a proper match here, so how do I stop the other regexes from reporting a match?

解決方案

The problem is that every string matching the first regex also matches the second one (anything that matches at (.*?) .* also matches .*. So matchObj2 is in fact a proper match.

If you want to distinguish these two situations, you need to apply the second regex if and only if the first one produces no match.

import re

line = "Remind me to pick coffee up at Autostrada at 4:00 PM"

matchObj = re.match( r'Remind me to (.*) at (.*?) at (.*?) .*', line, re.M|re.I|re.M)

matchObj2 = re.match( r'Remind me to (.*) at (.*?) .*', line, re.M|re.I)

if matchObj:

print("matchObj.group() : ", matchObj.group())

print("matchObj.group(1) : ", matchObj.group(1))

print("matchObj.group(2) : ", matchObj.group(2))

print("matchObj.group(3) :", matchObj.group(3))

elif matchObj2:

print("matchObj2.group() : ", matchObj2.group())

print("matchObj2.group(1) : ", matchObj2.group(1))

print("matchObj2.group(2) : ", matchObj2.group(2))

else:

print("No match!!")