溫馨提示×

溫馨提示×

您好，登錄后才能下訂單哦！

密碼登錄×

忘記密碼？

登錄注冊(cè)×

獲取短信驗(yàn)證碼

其他方式登錄

點(diǎn)擊登錄注冊(cè) 即表示同意《億速云用戶服務(wù)條款》

用戶登錄×

賬戶密碼登錄

請(qǐng)使用微信掃描上方二維碼

使用幫助

請(qǐng)求超時(shí)！

請(qǐng)點(diǎn)擊重新獲取二維碼

re.split，re.finditer怎么在Python3中使用

發(fā)布時(shí)間：2021-05-11 15:39:07 來(lái)源：億速云閱讀：131 作者：Leah 欄目：開(kāi)發(fā)技術(shù)

本篇文章給大家分享的是有關(guān)re.split，re.finditer怎么在Python3中使用，小編覺(jué)得挺實(shí)用的，因此分享給大家學(xué)習(xí)，希望大家閱讀完這篇文章后可以有所收獲，話不多說(shuō)，跟著小編一起來(lái)看看吧。

re.split re.finditer re.findall

@(python3)

官方 re 模塊說(shuō)明文檔

re.compile() 函數(shù)

編譯正則表達(dá)式模式，返回一個(gè)對(duì)象?？梢园殉Ｓ玫恼齽t表達(dá)式編譯成正則表達(dá)式對(duì)象，方便后續(xù)調(diào)用及提高效率。

re 模塊最離不開(kāi)的就是 re.compile 函數(shù)。其他函數(shù)都依賴于 compile 創(chuàng)建的正則表達(dá)式對(duì)象

re.compile(pattern, flags=0)

pattern 指定編譯時(shí)的表達(dá)式字符串
flags 編譯標(biāo)志位，用來(lái)修改正則表達(dá)式的匹配方式。支持 re.L|re.M 同時(shí)匹配

flags 標(biāo)志位參數(shù)

re.I(re.IGNORECASE)
使匹配對(duì)大小寫(xiě)不敏感

re.L(re.LOCAL)
做本地化識(shí)別（locale-aware）匹配

re.M(re.MULTILINE)
多行匹配，影響 ^ 和 $

re.S(re.DOTALL)
使 . 匹配包括換行在內(nèi)的所有字符

re.U(re.UNICODE)
根據(jù)Unicode字符集解析字符。這個(gè)標(biāo)志影響 \w, \W, \b, \B.

re.X(re.VERBOSE)
該標(biāo)志通過(guò)給予你更靈活的格式以便你將正則表達(dá)式寫(xiě)得更易于理解。

示例：

import re
content = 'Citizen wang , always fall in love with neighbour，WANG'
rr = re.compile(r'wan\w', re.I) # 不區(qū)分大小寫(xiě)
print(type(rr))
a = rr.findall(content)
print(type(a))
print(a)

findall 返回的是一個(gè) list 對(duì)象

<class '_sre.SRE_Pattern'>
<class 'list'>
['wang', 'WANG']

re.split 函數(shù)

按照指定的 pattern 格式，分割 string 字符串，返回一個(gè)分割后的列表。

re.split(pattern, string, maxsplit=0, flags=0)

pattern compile 生成的正則表達(dá)式對(duì)象，或者自定義也可
string 要匹配的字符串
maxsplit 指定最大分割次數(shù)，不指定將全部分割

import re
str = 'say hello world! hello python'
str_nm = 'one1two2three3four4'
pattern = re.compile(r'(?P<space>\s)') # 創(chuàng)建一個(gè)匹配空格的正則表達(dá)式對(duì)象
pattern_nm = re.compile(r'(?P<space>\d+)') # 創(chuàng)建一個(gè)匹配空格的正則表達(dá)式對(duì)象
match = re.split(pattern, str)
match_nm = re.split(pattern_nm, str_nm, maxsplit=1)
print(match)
print(match_nm)

結(jié)果：

['say', ' ', 'hello', ' ', 'world!', ' ', 'hello', ' ', 'python']
['one', '1', 'two2three3four4']

re.findall() 方法

返回一個(gè)包含所有匹配到的字符串的列表。

pattern 匹配模式，由 re.compile 獲得
string 需要匹配的字符串

import re
str = 'say hello world! hello python'
pattern = re.compile(r'(?P<first>h\w)(?P<symbol>l+)(?P<last>o\s)') # 分組，0 組是整個(gè) world!, 1組 or，2組 ld!
match = re.findall(pattern, str)
print(match)

結(jié)果

[('he', 'll', 'o '), ('he', 'll', 'o ')]

re.finditer 、re.findall

re.finditer(pattern, string[, flags=0]) re.findall(pattern, string[, flags=0])

pattern compile 生成的正則表達(dá)式對(duì)象，或者自定義也可
string 要匹配的字符串

findall 返回一個(gè)包含所有匹配到的字符的列表，列表類以元組的形式存在。

finditer 返回一個(gè)可迭代對(duì)象。

示例一：

pattern = re.compile(r'\d+@\w+.com') #通過(guò) re.compile 獲得一個(gè)正則表達(dá)式對(duì)象
result_finditer = re.finditer(pattern, content)
print(type(result_finditer))
print(result_finditer) # finditer 得到的結(jié)果是個(gè)可迭代對(duì)象
for i in result_finditer: # i 本身也是可迭代對(duì)象，所以下面要使用 i.group()
 print(i.group())
result_findall = re.findall(pattern, content)
print(type(result_findall)) # findall 得到的是一個(gè)列表
print(result_findall)
for p in result_finditer:
 print(p)

輸出結(jié)果：

<class 'callable_iterator'>
<callable_iterator object at 0x10545ec88>
123456@163.com
234567@163.com
345678@163.com
<class 'list'>
['123456@163.com', '234567@163.com', '345678@163.com']

由結(jié)果可知：finditer 得到的是可迭代對(duì)象，finfdall 得到的是一個(gè)列表。

示例二：

import re
content = '''email:123456@163.com
email:234567@163.com
email:345678@163.com
'''
pattern = re.compile(r'(?P<number>\d+)@(?P<mail_type>\w+).com')
result_finditer = re.finditer(pattern, content)
print(type(result_finditer))
print(result_finditer)
iter_dict = {} # 把最后得到的結(jié)果
for i in result_finditer:
 print('郵箱號(hào)碼是：', i.group(1),'郵箱類型是：',i.group(2))
 number = i.group(1)
 mail_type = i.group(2)
 iter_dict.setdefault(number, mail_type) # 使用 dict.setdefault 創(chuàng)建了一個(gè)字典
print(iter_dict)
print('+++++++++++++++++++++++++++++++')
result_findall = re.findall(pattern, content)
print(result_findall)
print(type(result_findall))

輸出結(jié)果：

<class 'callable_iterator'>
<callable_iterator object at 0x104c5cbe0>
郵箱號(hào)碼是： 123456 郵箱類型是： 163
郵箱號(hào)碼是： 234567 郵箱類型是： 163
郵箱號(hào)碼是： 345678 郵箱類型是： 163
{'123456': '163', '234567': '163', '345678': '163'}
+++++++++++++++++++++++++++++++
[('123456', '163'), ('234567', '163'), ('345678', '163')]
<class 'list'>

finditer 得到的可迭代對(duì)象 i，也可以使用 lastindex，lastgroup 方法。

print('lastgroup 最后一個(gè)被捕獲的分組的名字',i.lastgroup)

findall 當(dāng)正則沒(méi)有分組，返回就是正則匹配。

re.findall(r"\d+@\w+.com", content)
['2345678@163.com', '2345678@163.com', '345678@163.com']

有一個(gè)分組返回的是分組的匹配

re.findall(r"(\d+)@\w+.com", content)
['2345678', '2345678', '345678']

多個(gè)分組時(shí)，將結(jié)果作為元組，一并存入到列表中。

re.findall(r"(\d+)@(\w+).com", content)
[('2345678', '163'), ('2345678', '163'), ('345678', '163')]

以上就是re.split，re.finditer怎么在Python3中使用，小編相信有部分知識(shí)點(diǎn)可能是我們?nèi)粘９ぷ鲿?huì)見(jiàn)到或用到的。希望你能通過(guò)這篇文章學(xué)到更多知識(shí)。更多詳情敬請(qǐng)關(guān)注億速云行業(yè)資訊頻道。

向AI問(wèn)一下細(xì)節(jié)

推薦閱讀：

免責(zé)聲明：本站發(fā)布的內(nèi)容（圖片、視頻和文字）以原創(chuàng)、轉(zhuǎn)載和分享為主，文章觀點(diǎn)不代表本網(wǎng)站立場(chǎng)，如果涉及侵權(quán)請(qǐng)聯(lián)系站長(zhǎng)郵箱：is@yisu.com進(jìn)行舉報(bào)，并提供相關(guān)證據(jù)，一經(jīng)查實(shí)，將立刻刪除涉嫌侵權(quán)內(nèi)容。

上一篇新聞：
re.compile、re.match怎么在Python3中使用
下一篇新聞：
如何在Python中使用協(xié)程

猜你喜歡

AI
助
手

產(chǎn)品服務(wù)

地區(qū)劃分

專題活動(dòng)

幫助支持

關(guān)于我們

售后咨詢

7*24小時(shí)在線電話：400-100-2938

7*24小時(shí)在線 QQ：800811969

關(guān)注億速云

億速云公眾號(hào)

手機(jī)網(wǎng)站二維碼