天天看點

python 中正規表達式用法 re.findall()print (help(re.findall))print (dir(re.findall))

參考部落格原址:https://blog.csdn.net/YZXnuaa/article/details/79346963

<link rel="stylesheet" href="https://csdnimg.cn/release/phoenix/template/css/ck_htmledit_views-5edb848729.css" target="_blank" rel="external nofollow" >

					<div class="htmledit_views" id="content_views">
            <pre>正則 re.findall  的簡單用法(傳回string中所有與pattern相比對的全部字串,傳回形式為數組)
           

文法:

1

findall(pattern, string, flags

=

)

import re      

Python 正規表達式 re findall 方法能夠以清單的形式傳回能比對的子串

print (help(re.findall))

print (dir(re.findall))

findall查找全部r辨別代表後面是正則的語句

1 2 3

regular_v1

=

re.findall(r

"docs"

,

“https://docs.python.org/3/whatsnew/3.6.html”

)

print

(regular_v1)

# [‘docs’]

符号^表示比對以https開頭的的字元串傳回,      
1 2 3

regular_v2

=

re.findall(r

"^https"

,

“https://docs.python.org/3/whatsnew/3.6.html”

)

print

(regular_v2)

# [‘https’]

  

用
    
     
      
       
        符
       
       
        号
       
       
        表
       
       
        示
       
       
        以
       
       
        h
       
       
        t
       
       
        m
       
       
        l
       
       
        結
       
       
        尾
       
       
        的
       
       
        字
       
       
        符
       
       
        串
       
       
        返
       
       
        回
       
       
        ,
       
       
        判
       
       
        斷
       
       
        是
       
       
        否
       
       
        字
       
       
        符
       
       
        串
       
       
        結
       
       
        束
       
       
        的
       
       
        字
       
       
        符
       
       
        串
       
       
        &lt;
       
       
        /
       
       
        p
       
       
        r
       
       
        e
       
       
        &gt;
       
       
        &lt;
       
       
        d
       
       
        i
       
       
        v
       
       
        c
       
       
        l
       
       
        a
       
       
        s
       
       
        s
       
       
        =
       
       
        &quot;
       
       
        c
       
       
        n
       
       
        b
       
       
        l
       
       
        o
       
       
        g
       
       
        
         s
        
        
         H
        
       
       
        i
       
       
        g
       
       
        h
       
       
        l
       
       
        i
       
       
        g
       
       
        h
       
       
        t
       
       
        e
       
       
        r
       
       
        s
       
       
        h
       
       
        −
       
       
        g
       
       
        u
       
       
        t
       
       
        t
       
       
        e
       
       
        r
       
       
        &quot;
       
       
        &gt;
       
       
        &lt;
       
       
        d
       
       
        i
       
       
        v
       
       
        &gt;
       
       
        &lt;
       
       
        d
       
       
        i
       
       
        v
       
       
        c
       
       
        l
       
       
        a
       
       
        s
       
       
        s
       
       
        =
       
       
        &quot;
       
       
        s
       
       
        y
       
       
        n
       
       
        t
       
       
        a
       
       
        x
       
       
        h
       
       
        i
       
       
        g
       
       
        h
       
       
        l
       
       
        i
       
       
        g
       
       
        h
       
       
        t
       
       
        e
       
       
        r
       
       
        p
       
       
        y
       
       
        t
       
       
        h
       
       
        o
       
       
        n
       
       
        &quot;
       
       
        &gt;
       
       
        &lt;
       
       
        d
       
       
        i
       
       
        v
       
       
        c
       
       
        l
       
       
        a
       
       
        s
       
       
        s
       
       
        =
       
       
        &quot;
       
       
        t
       
       
        a
       
       
        b
       
       
        l
       
       
        e
       
       
        −
       
       
        b
       
       
        o
       
       
        x
       
       
        &quot;
       
       
        &gt;
       
       
        &lt;
       
       
        t
       
       
        a
       
       
        b
       
       
        l
       
       
        e
       
       
        c
       
       
        e
       
       
        l
       
       
        l
       
       
        s
       
       
        p
       
       
        a
       
       
        c
       
       
        i
       
       
        n
       
       
        g
       
       
        =
       
       
        &quot;
       
       
        0
       
       
        &quot;
       
       
        c
       
       
        e
       
       
        l
       
       
        l
       
       
        p
       
       
        a
       
       
        d
       
       
        d
       
       
        i
       
       
        n
       
       
        g
       
       
        =
       
       
        &quot;
       
       
        0
       
       
        &quot;
       
       
        b
       
       
        o
       
       
        r
       
       
        d
       
       
        e
       
       
        r
       
       
        =
       
       
        &quot;
       
       
        0
       
       
        &quot;
       
       
        &gt;
       
       
        &lt;
       
       
        t
       
       
        b
       
       
        o
       
       
        d
       
       
        y
       
       
        &gt;
       
       
        &lt;
       
       
        t
       
       
        r
       
       
        &gt;
       
       
        &lt;
       
       
        t
       
       
        d
       
       
        c
       
       
        l
       
       
        a
       
       
        s
       
       
        s
       
       
        =
       
       
        &quot;
       
       
        g
       
       
        u
       
       
        t
       
       
        t
       
       
        e
       
       
        r
       
       
        &quot;
       
       
        &gt;
       
       
        &lt;
       
       
        d
       
       
        i
       
       
        v
       
       
        c
       
       
        l
       
       
        a
       
       
        s
       
       
        s
       
       
        =
       
       
        &quot;
       
       
        l
       
       
        i
       
       
        n
       
       
        e
       
       
        n
       
       
        u
       
       
        m
       
       
        b
       
       
        e
       
       
        r
       
       
        1
       
       
        i
       
       
        n
       
       
        d
       
       
        e
       
       
        x
       
       
        0
       
       
        a
       
       
        l
       
       
        t
       
       
        2
       
       
        &quot;
       
       
        &gt;
       
       
        1
       
       
        &lt;
       
       
        /
       
       
        d
       
       
        i
       
       
        v
       
       
        &gt;
       
       
        &lt;
       
       
        d
       
       
        i
       
       
        v
       
       
        c
       
       
        l
       
       
        a
       
       
        s
       
       
        s
       
       
        =
       
       
        &quot;
       
       
        l
       
       
        i
       
       
        n
       
       
        e
       
       
        n
       
       
        u
       
       
        m
       
       
        b
       
       
        e
       
       
        r
       
       
        2
       
       
        i
       
       
        n
       
       
        d
       
       
        e
       
       
        x
       
       
        1
       
       
        a
       
       
        l
       
       
        t
       
       
        1
       
       
        &quot;
       
       
        &gt;
       
       
        2
       
       
        &lt;
       
       
        /
       
       
        d
       
       
        i
       
       
        v
       
       
        &gt;
       
       
        &lt;
       
       
        d
       
       
        i
       
       
        v
       
       
        c
       
       
        l
       
       
        a
       
       
        s
       
       
        s
       
       
        =
       
       
        &quot;
       
       
        l
       
       
        i
       
       
        n
       
       
        e
       
       
        n
       
       
        u
       
       
        m
       
       
        b
       
       
        e
       
       
        r
       
       
        3
       
       
        i
       
       
        n
       
       
        d
       
       
        e
       
       
        x
       
       
        2
       
       
        a
       
       
        l
       
       
        t
       
       
        2
       
       
        &quot;
       
       
        &gt;
       
       
        3
       
       
        &lt;
       
       
        /
       
       
        d
       
       
        i
       
       
        v
       
       
        &gt;
       
       
        &lt;
       
       
        /
       
       
        t
       
       
        d
       
       
        &gt;
       
       
        &lt;
       
       
        t
       
       
        d
       
       
        c
       
       
        l
       
       
        a
       
       
        s
       
       
        s
       
       
        =
       
       
        &quot;
       
       
        c
       
       
        o
       
       
        d
       
       
        e
       
       
        &quot;
       
       
        &gt;
       
       
        &lt;
       
       
        d
       
       
        i
       
       
        v
       
       
        &gt;
       
       
        &lt;
       
       
        d
       
       
        i
       
       
        v
       
       
        c
       
       
        l
       
       
        a
       
       
        s
       
       
        s
       
       
        =
       
       
        &quot;
       
       
        l
       
       
        i
       
       
        n
       
       
        e
       
       
        n
       
       
        u
       
       
        m
       
       
        b
       
       
        e
       
       
        r
       
       
        1
       
       
        i
       
       
        n
       
       
        d
       
       
        e
       
       
        x
       
       
        0
       
       
        a
       
       
        l
       
       
        t
       
       
        2
       
       
        &quot;
       
       
        &gt;
       
       
        &lt;
       
       
        c
       
       
        o
       
       
        d
       
       
        e
       
       
        c
       
       
        l
       
       
        a
       
       
        s
       
       
        s
       
       
        =
       
       
        &quot;
       
       
        p
       
       
        y
       
       
        t
       
       
        h
       
       
        o
       
       
        n
       
       
        p
       
       
        l
       
       
        a
       
       
        i
       
       
        n
       
       
        &quot;
       
       
        &gt;
       
       
        r
       
       
        e
       
       
        g
       
       
        u
       
       
        l
       
       
        a
       
       
        
         r
        
        
         v
        
       
       
        3
       
       
        &lt;
       
       
        /
       
       
        c
       
       
        o
       
       
        d
       
       
        e
       
       
        &gt;
       
       
        &lt;
       
       
        c
       
       
        o
       
       
        d
       
       
        e
       
       
        c
       
       
        l
       
       
        a
       
       
        s
       
       
        s
       
       
        =
       
       
        &quot;
       
       
        p
       
       
        y
       
       
        t
       
       
        h
       
       
        o
       
       
        n
       
       
        k
       
       
        e
       
       
        y
       
       
        w
       
       
        o
       
       
        r
       
       
        d
       
       
        &quot;
       
       
        &gt;
       
       
        =
       
       
        &lt;
       
       
        /
       
       
        c
       
       
        o
       
       
        d
       
       
        e
       
       
        &gt;
       
       
        &lt;
       
       
        c
       
       
        o
       
       
        d
       
       
        e
       
       
        c
       
       
        l
       
       
        a
       
       
        s
       
       
        s
       
       
        =
       
       
        &quot;
       
       
        p
       
       
        y
       
       
        t
       
       
        h
       
       
        o
       
       
        n
       
       
        p
       
       
        l
       
       
        a
       
       
        i
       
       
        n
       
       
        &quot;
       
       
        &gt;
       
       
        r
       
       
        e
       
       
        .
       
       
        f
       
       
        i
       
       
        n
       
       
        d
       
       
        a
       
       
        l
       
       
        l
       
       
        (
       
       
        r
       
       
        &lt;
       
       
        /
       
       
        c
       
       
        o
       
       
        d
       
       
        e
       
       
        &gt;
       
       
        &lt;
       
       
        c
       
       
        o
       
       
        d
       
       
        e
       
       
        c
       
       
        l
       
       
        a
       
       
        s
       
       
        s
       
       
        =
       
       
        &quot;
       
       
        p
       
       
        y
       
       
        t
       
       
        h
       
       
        o
       
       
        n
       
       
        s
       
       
        t
       
       
        r
       
       
        i
       
       
        n
       
       
        g
       
       
        &quot;
       
       
        &gt;
       
       
        &quot;
       
       
        h
       
       
        t
       
       
        m
       
       
        l
       
      
      
       符号表示以html結尾的字元串傳回,判斷是否字元串結束的字元串&lt;/pre&gt;&lt;div class=&quot;cnblogs_Highlighter sh-gutter&quot;&gt;&lt;div&gt;&lt;div class=&quot;syntaxhighlighter python&quot;&gt;&lt;div class=&quot;table-box&quot;&gt;&lt;table cellspacing=&quot;0&quot; cellpadding=&quot;0&quot; border=&quot;0&quot;&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td class=&quot;gutter&quot;&gt;&lt;div class=&quot;line number1 index0 alt2&quot;&gt;1&lt;/div&gt;&lt;div class=&quot;line number2 index1 alt1&quot;&gt;2&lt;/div&gt;&lt;div class=&quot;line number3 index2 alt2&quot;&gt;3&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;code&quot;&gt;&lt;div&gt;&lt;div class=&quot;line number1 index0 alt2&quot;&gt;&lt;code class=&quot;python plain&quot;&gt;regular_v3 &lt;/code&gt;&lt;code class=&quot;python keyword&quot;&gt;=&lt;/code&gt; &lt;code class=&quot;python plain&quot;&gt;re.findall(r&lt;/code&gt;&lt;code class=&quot;python string&quot;&gt;&quot;html
      
     
    符号表示以html結尾的字元串傳回,判斷是否字元串結束的字元串</pre><divclass="cnblogsH​ighlightersh−gutter"><div><divclass="syntaxhighlighterpython"><divclass="table−box"><tablecellspacing="0"cellpadding="0"border="0"><tbody><tr><tdclass="gutter"><divclass="linenumber1index0alt2">1</div><divclass="linenumber2index1alt1">2</div><divclass="linenumber3index2alt2">3</div></td><tdclass="code"><div><divclass="linenumber1index0alt2"><codeclass="pythonplain">regularv​3</code><codeclass="pythonkeyword">=</code><codeclass="pythonplain">re.findall(r</code><codeclass="pythonstring">"html"         ,              “https://docs.python.org/3/whatsnew/3.6.html”              )              print              (regular_v3)              # [‘html’]            
# […]比對括号中的其中一個字元      
1 2 3

regular_v4

=

re.findall(r

"[t,w]h"

,

“https://docs.python.org/3/whatsnew/3.6.html”

)

print

(regular_v4)

# [‘th’, ‘wh’]

“d”是正則文法規則用來比對0到9之間的數傳回清單      
1 2 3 4 5 6

regular_v5

=

re.findall(r

"\d"

,

“https://docs.python.org/3/whatsnew/3.6.html”

)

regular_v6

=

re.findall(r

"\d\d\d"

,

“https://docs.python.org/3/whatsnew/3.6.html/1234”

)

print

(regular_v5)

# [‘3’, ‘3’, ‘6’]

print

(regular_v6)

# [‘123’]

小d表示取數字0-9,大D表示不要數字,也就是出了數字以外的内容傳回      
1 2 3

regular_v7

=

re.findall(r

"\D"

,

“https://docs.python.org/3/whatsnew/3.6.html”

)

print

(regular_v7)

# [‘h’, ‘t’, ‘t’, ‘p’, ‘s’, ‘:’, ‘/’, ‘/’, ‘d’, ‘o’, ‘c’, ‘s’, ‘.’, ‘p’, ‘y’, ‘t’, ‘h’, ‘o’, ‘n’, ‘.’, ‘o’, ‘r’, ‘g’, ‘/’, ‘/’, ‘w’, ‘h’, ‘a’, ‘t’, ‘s’, ‘n’, ‘e’, ‘w’, ‘/’, ‘.’, ‘.’, ‘h’, ‘t’, ‘m’, ‘l’]

“w”在正則裡面代表比對從小寫a到z,大寫A到Z,數字0到9      
1 2 3

regular_v8

=

re.findall(r

"\w"

,

“https://docs.python.org/3/whatsnew/3.6.html”

)

print

(regular_v8)

#[‘h’, ‘t’, ‘t’, ‘p’, ‘s’, ‘d’, ‘o’, ‘c’, ‘s’, ‘p’, ‘y’, ‘t’, ‘h’, ‘o’, ‘n’, ‘o’, ‘r’, ‘g’, ‘3’, ‘w’, ‘h’, ‘a’, ‘t’, ‘s’, ‘n’, ‘e’, ‘w’, ‘3’, ‘6’, ‘h’, ‘t’, ‘m’, ‘l’]

“W”在正則裡面代表比對除了字母與數字以外的特殊符号      
1 2 3

regular_v9

=

re.findall(r

"\W"

,

“https://docs.python.org/3/whatsnew/3.6.html”

)

print

(regular_v9)

# [’:’, ‘/’, ‘/’, ‘.’, ‘.’, ‘/’, ‘/’, ‘/’, ‘.’, ‘.’]