Python模組學習： re 正規表示式

發表於2015-05-30

今天學習了Python中有關正規表示式的知識。關於正規表示式的語法，不作過多解釋，網上有許多學習的資料。這裡主要介紹Python中常用的正規表示式處理函式。

re.match

re.match 嘗試從字串的開始匹配一個模式，如：下面的例子匹配第一個單詞。

import re

text = "JGood is a handsome boy, he is cool, clever, and so on..."
m = re.match(r"(/w+)/s", text)
if m:
    print m.group(0), '/n', m.group(1)
else:
    print 'not match'

import re

text = "JGood is a handsome boy, he is cool, clever, and so on..."

m = re.match(r"(/w+)/s", text)

if m:

print m.group(0), '/n', m.group(1)

else:

print 'not match'

re.match的函式原型為：re.match(pattern, string, flags)

第一個引數是正規表示式，這裡為”(/w+)/s”，如果匹配成功，則返回一個Match，否則返回一個None；

第二個參數列示要匹配的字串；

第三個引數是標緻位，用於控制正規表示式的匹配方式，如：是否區分大小寫，多行匹配等等。

re.search

re.search函式會在字串內查詢模式匹配,只到找到第一個匹配然後返回，如果字串沒有匹配，則返回None。

import re

text = "JGood is a handsome boy, he is cool, clever, and so on..."
m = re.search(r'/shan(ds)ome/s', text)
if m:
    print m.group(0), m.group(1)
else:
    print 'not search'

import re

text = "JGood is a handsome boy, he is cool, clever, and so on..."

m = re.search(r'/shan(ds)ome/s', text)

if m:

print m.group(0), m.group(1)

else:

print 'not search'

re.search的函式原型為： re.search(pattern, string, flags)

每個引數的含意與re.match一樣。

re.match與re.search的區別：re.match只匹配字串的開始，如果字串開始不符合正規表示式，則匹配失敗，函式返回None；而re.search匹配整個字串，直到找到一個匹配。

re.sub

re.sub用於替換字串中的匹配項。下面一個例子將字串中的空格 ‘ ‘ 替換成 ‘-‘ :

import re

text = "JGood is a handsome boy, he is cool, clever, and so on..."
print re.sub(r'/s+', '-', text)

import re

text = "JGood is a handsome boy, he is cool, clever, and so on..."

print re.sub(r'/s+', '-', text)

re.sub的函式原型為：re.sub(pattern, repl, string, count)

其中第二個函式是替換後的字串；本例中為’-‘

第四個引數指替換個數。預設為0，表示每個匹配項都替換。

re.sub還允許使用函式對匹配項的替換進行復雜的處理。如：re.sub(r’/s’, lambda m: ‘[‘ + m.group(0) + ‘]’, text, 0)；將字串中的空格’ ‘替換為'[ ]’。

re.split

可以使用re.split來分割字串，如：re.split(r’/s+’, text)；將字串按空格分割成一個單詞列表。

re.findall

re.findall可以獲取字串中所有匹配的字串。如：re.findall(r’/w*oo/w*’, text)；獲取字串中，包含’oo’的所有單詞。

re.compile

可以把正規表示式編譯成一個正規表示式物件。可以把那些經常使用的正規表示式編譯成正規表示式物件，這樣可以提高一定的效率。下面是一個正規表示式物件的一個例子：

import re

text = "JGood is a handsome boy, he is cool, clever, and so on..."
regex = re.compile(r'/w*oo/w*')
print regex.findall(text)   #查詢所有包含'oo'的單詞
print regex.sub(lambda m: '[' + m.group(0) + ']', text) #將字串中含有'oo'的單詞用[]括起來。

import re

text = "JGood is a handsome boy, he is cool, clever, and so on..."

regex = re.compile(r'/w*oo/w*')

print regex.findall(text) #查詢所有包含'oo'的單詞

print regex.sub(lambda m: '[' + m.group(0) + ']', text) #將字串中含有'oo'的單詞用[]括起來。

更詳細的內容，可以參考Python手冊。

Python 正規表示式 re 模組
2018-10-12
Python
python re模組正規表示式
2018-09-12
Python
python正規表示式(re模組)
2020-08-08
Python
python中re模組的使用（正規表示式）
2021-01-17
Python
python基礎之正規表示式和re模組
2020-03-12
Python
正規表示式re.compile的學習
2018-08-01
Compile
Python 之 RE（正規表示式）常用
2020-03-16
Python
python 關於正規表示式re
2020-04-21
Python
Python 正規表示式模組詳解
2018-11-02
Python
Python正規表示式簡記和re庫
2019-02-16
Python
Python爬蟲— 1.4 正規表示式：re庫
2019-02-28
Python爬蟲
Python學習筆記 - 正規表示式
2019-01-16
Python筆記
正規表示式（三）：pythonre模組
2018-07-10
Python
Python學習筆記|Python之正規表示式
2018-12-18
Python筆記
LeetCode-10. 正規表示式匹配（Python-re包）
2018-05-03
LeetCodePython
python就業班----正規表示式及re應用
2020-10-05
Python就業
python 正規表示式re常用操作符使用方法怎麼用re正規表示式表示一個IP地址：0-255
2018-11-22
Python
Go 正規表示式學習
2024-04-06
Go
python 中的正規表示式學習筆記
2021-05-05
Python筆記
超詳細Python正規表示式操作指南(re使用)，一
2018-05-26
Python
正規表示式學習和練習
2019-04-09
Python——正規表示式
2019-08-05
Python
python正規表示式
2024-06-15
Python
Python 正規表示式
2021-09-09
Python
Python：正規表示式
2021-04-22
Python
正規表示式學習筆記
2019-03-02
筆記
如何快速學習正規表示式
2022-03-21
正規表示式入門學習
2020-12-14
Python爬蟲教程-19-資料提取-正規表示式(re)
2018-09-06
Python爬蟲
正規表示式例項蒐集，通過例項來學習正規表示式。
2021-11-19
通過js正規表示式例項學習正規表示式基本語法
2021-02-10
JS
java 正規表示式語法學習
2018-06-21
Java
正規表示式學習（2）---字元特性
2024-06-05
字元
學習正規表示式（js、C#）
2022-03-22
JSC#
小豬的Python學習之旅 —— 3.正規表示式
2019-03-02
Python
python爬蟲學習筆記4-正規表示式
2020-12-12
Python爬蟲筆記
python之正規表示式
2018-08-11
Python
python 正規表示式匹配
2024-04-19
Python
Python正規表示式手稿
2020-04-04
Python

Python模組學習： re 正規表示式

re.match

re.search

re.sub

re.split

re.findall

re.compile

相關文章