正則匹配方法及示例

南橋經不起秋發表於2020-09-25

原文網址 : https://blog.csdn.net/dianxinlaozong/article/details/108803281

正則匹配

match進行匹配

import re

# match方法進行匹配，從頭開始匹配。match這個函式如果成功匹配，返回的就是一個物件，如果匹配不到資料，返回的就是None
result = re.match('python','python is good')
print(result)

# group這個方法用來提取匹配到的資料
print(result.group())

注意點：
1、match從頭匹配

字元

.單個任意字元

result = re.match('.','bcd')
print(result.group())

[]匹配列舉字元的任何一位 [ ]

result = re.match('[pP]','Python')
print(result.group())

\d 匹配數字

# re1 = re.match('[0123456789]','123python')
# re1 = re.match('[0-9]','923python')
re1 = re.match('\d','923python')

在這裡插入圖片描述

\D 匹配的是非數字

re1 = re.match('\D','@923python')

\w 匹配的是a-z A-Z 0-9_

re1 = re.match('[a-zA-Z_0-9]','123_aASD923python')
re1 = re.match('\w','asd_123_aASD923python')

\W 匹配的非單詞字元

匹配指定的字元個數

* 代表的是可有可無，代表的是0次或者無數次（沒有不報錯）

str1 = 'Qwe'
re1 = re.match('[A-Z][a-z]*',str1)

+ 代表的是至少有一位 (必須有，沒有就報錯)

# 匹配變數名是否生效
str1 = '_name123_asd'
re1 =re.match('[a-zA-Z_]+[\w]*',str1)

？（寫在[]後面）後面匹配的字元出現一次或者0次

#匹配0-99之間的數字
re1 = re.match('[1-9]?[0-9]','988')

{}的匹配具體的位數

型別	說明
{m}	匹配一個字元出現m次
{m,}	匹配的是一個字元至少出現m次
{m,n}	匹配字元出現的次數從m次到n次

# 匹配的8-20位密碼，可以是數字，字母下劃線
re1 = re.match('[\w]{8,20}','asd123asd123')

print(re1.group())

表示字串的邊界

$表示以。。。。結尾

# 匹配的是郵箱
re1 = re.match(r'[\w]{4,20}@163\.com$','asd123@163.comasd123')

^ 表示以。。。開頭

re1 = re.match(r'^[\w]{4,20}@163\.com$','@asd123@163.com')

分組

|表示的是任何一個表示式都可以

re1 = re.match(r'[1-9]?\d$|100','100')

（）分組

#匹配電話號碼，要求取出要求取出區號或者電話號碼。
re1=re.match(r'(\d{2,4})-(\d{7})$','0571-8922222')
print(re1.group())
print(re1.group(1))
print(re1.group(2))

\number關於引用分組匹配到的字串

# 匹配的是<div>this is div</div>
# 匹配DIV。匹配尖括號DIV。尖括號。這是DIV。尖括號。
str1 = '<a>this is div</a>'
re1 = re.match(r'<[a-zA-Z]+>.*</[a-zA-Z]+>',str1)
re1 = re.match(r'<([a-zA-Z]+)>.*</\1>',str1)
print(re1.group())

注意點：
1、第一個分組的結果，作為第二個人分組額依據
2、適用於成對出現的資料

(?P) 和(?P=name)

str2 = '<div><a>這是包著的超連結</a></div>'
re1 = re.match('<(?P<name1>\w+)><(?P<name2>\w+)>.*</(?P=name2)></(?P=name1)>',str2)
print(re1.group())

re的高階用法

match（匹配）開頭

從開頭進行匹配
def match(pattern, string, flags=0):
    """Try to apply the pattern at the start of the string, returning
    a match object, or None if no match was found."""
    return _compile(pattern, flags).match(string)

search（搜尋）全部，在一個字串中查詢

def search(pattern, string, flags=0):
    """Scan through string looking for a match to the pattern, returning
    a match object, or None if no match was found."""
    return _compile(pattern, flags).search(string)

注意點：
1、查詢整個字串

findall （查詢所有符合條件的，返回一個列表）

def findall(pattern, string, flags=0):
    """Return a list of all non-overlapping matches in the string.

    If one or more capturing groups are present in the pattern, return
    a list of groups; this will be a list of tuples if the pattern
    has more than one group.

    Empty matches are included in the result."""
    
 注意點：
 1、將字串中所有符合要求的字元返回
 2、返回的結果是一個列表型別
 3、如果沒有符合條件的，返回的是一個空列表

sub（將字串中匹配正規表示式的部分替換為其他值）

def sub(pattern, repl, string, count=0, flags=0):
    """Return the string obtained by replacing the leftmost
    non-overlapping occurrences of the pattern in string by the
    replacement repl.  repl can be either a string or a callable;
    if a string, backslash escapes in it are processed.  If it is
    a callable, it's passed the match object and must return
    a replacement string to be used."""
    return _compile(pattern, flags).sub(repl, string, count)

ret = re.sub('\d{2}','19','age is 18,phone is 123',1)
print(ret)

split （根據匹配分割字串，返回分割字串組成的列表）

ret = re.split(' |,','age is 18,phone is 123')
print(ret)

貪婪匹配與非貪婪匹配

print('######################################')
re1 = re.match('\d*?','123456789')
re1 = re.match('\d+?','123456789')
print(re1.group())

注意點：
1、預設的狀態是貪婪匹配，儘可能多的去匹配字元個數
2、在多個匹配符號後面，比如：*  +  {m,n}


import re
a='python'
b=re.match('p(.*?)n',a).group()
#只匹配開頭
print(b)

a='asdasd656165dsadas56166516'
#只匹配一次
b=re.search('\d+',a).group()  #匹配第一次數字串
print(b)

a='sdad316564dadas56665465asdas5661616'
##匹配所有。並返回一個列表
res =re.compile("\d+")#編譯正規表示式，適用於正規表示式較長以及多次使用
b=re.findall('\d+',a)#查詢全部數字串\D+是所有非數字
print(b)


a='sdad316564dadas56665465asdas5661616'
##匹配所有。並返回一個列表
res =re.compile("\d+")#編譯正規表示式，適用於正規表示式較長以及多次使用
b=re.sub(res,'+',a)#把匹配到的所有數字串都變成＋號
print(b)


a='sdad316564dadas56665465asdas5661616'
##匹配所有。並返回一個列表
res =re.compile("\d+")#編譯正規表示式，適用於正規表示式較長以及多次使用
b=re.split(res,a)#以數字進行分割，返回一個列表
print(b)

正則匹配規則2
2024-04-19
正則匹配規則記錄
2018-07-08
學習筆記——正則匹配方法整理
2019-03-04
筆記
探究js正則匹配方法：match和exec
2019-04-22
JS
正則匹配數字
2018-10-30
Python正則匹配中文
2018-07-30
Python
grep 多行正則匹配
2018-06-13
PHP 正則匹配中文
2020-09-24
PHP
Logstash之Grok正則匹配，讓正則進階！
2022-11-17
python的re正則匹配
2024-03-25
Python
Laravel redis 正則匹配keys
2021-03-09
LaravelRedis
Python-網頁轉義字元及正則全文匹配
2018-07-13
Python網頁字元
Java處理正則匹配卡死（正則回溯問題）
2023-03-01
Java
VS Code 正則匹配（冗餘程式碼批量清理方法）
2018-04-18
python爬蟲中使用正則match( )方法匹配目標
2021-09-11
Python爬蟲
java中url正則regex匹配
2020-04-06
Java
正則匹配的捕獲組
2020-02-28
apisix~路由字首的正則匹配
2024-12-03
API路由
shell正則匹配捕獲引用進行IP匹配
2023-05-02
正則匹配之零寬斷言
2018-12-01
正則匹配指定字元之前的字串
2018-05-07
字元字串
正則匹配開頭和結尾
2020-04-05
php正則匹配所有違規字元
2021-03-27
PHP字元
VIM-灰常有用的正則匹配
2024-03-12
python正則一些簡單匹配
2021-09-09
Python
小技巧系列：正則匹配img標籤
2021-02-01
nginx location匹配及rewrite規則
2019-12-27
Nginx
正則匹配身份證有bug你知道麼？
2019-01-15
關於正則位置匹配（斷言）的技巧
2018-07-25
正則表示匹配手機IMEI機身碼
2020-01-30
js正則全域性匹配引發的血案
2020-09-27
JS
MySQL全面瓦解8：查詢的正則匹配
2020-11-10
MySql
js中split之正則運用(模式匹配)
2019-04-29
JS模式
隨手查閱的正則匹配筆記
2019-01-28
筆記
js Abba逆向前瞻正則匹配例項
2022-03-18
JS
php 正則如何匹配手機號碼呢？
2021-04-06
PHP
在一串字串中Java使用正則匹配電話號碼的方法
2024-08-08
字串Java
正規表示式實現字元的模糊匹配功能示例
2022-03-14
字元

正則匹配方法及示例

正則匹配

match進行匹配

字元

.單個任意字元

[]匹配列舉字元的任何一位 [ ]

\d 匹配數字

\D 匹配的是非數字

\w 匹配的是a-z A-Z 0-9_

\W 匹配的非單詞字元

匹配指定的字元個數

* 代表的是可有可無，代表的是0次或者無數次（沒有不報錯）

+ 代表的是至少有一位 (必須有，沒有就報錯)

？（寫在[]後面）後面匹配的字元出現一次或者0次

{}的匹配具體的位數

表示字串的邊界

$表示以。。。。結尾

^ 表示以。。。開頭

分組

|表示的是任何一個表示式都可以

（）分組

\number關於引用分組匹配到的字串

(?P) 和(?P=name)

re的高階用法

match（匹配）開頭

search（搜尋）全部，在一個字串中查詢

findall （查詢所有符合條件的，返回一個列表）

sub（將字串中匹配正規表示式的部分替換為其他值）

split （根據匹配分割字串，返回分割字串組成的列表）

貪婪匹配與非貪婪匹配

相關文章