This is an archived version of the course. Please find the latest version of the course on the main webpage.

Chapter 2: Regular expression basics

Match any character

face Josiah Wang

Time to spice things up!

Let’s say instead of just "bop", you want to match any three letter words that start with a b and ends with a p.

So you want to be able to match the following strings: bop, bap, bmp, bbp, b2p, b#p, b&p, b!p, and b@p.

How would you write a regular expression for this?

You will use a special . character. The . represents any character except a new line.

>>> re.match("b.p", "bop")
<re.Match object; span=(0, 3), match='bop'>
>>> re.match("b.p", "b2p")
<re.Match object; span=(0, 3), match='b2p'>
>>> re.match("b.p", "b!p")
<re.Match object; span=(0, 3), match='b!p'>

If you need to match a full stop (.), you should escape it with a backslash (\).

>>> re.match("b.p\.", "bop.")
<re.Match object; span=(0, 4), match='bop.'>

Quick task

Write a regular expression to match all strings that have an e as its second and fourth letters.

Example valid strings: hehe, were, beke, tepe, eeee aeae.

Example invalid strings: eeaa, abcd, hello, care.

Try this - this should be a quick one!

>>> pattern = ".e.e"
>>> re.match(pattern, "were")
<re.Match object; span=(0, 4), match='were'>
>>> re.match(pattern, "care") # None
>>>