Matching Across Multiple Lines with Regular Expressions

2023-05-19 01:21:39

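A regular expression can be used to check whether a text contains a string that matches a given pattern. Matching is usually done line by line (the POSIX convention), and the `.` operator normally does not match a newline character. So how do you match across multiple lines? This article shows several ways to do it.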
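As a quick illustration of that default behaviour, the sketch below uses Python's `re` module on a made-up sample string. It suggests why a pattern written with `.` finds nothing when the two markers sit on different lines:

```python
import re

# Hypothetical sample text: "start" and "end" are on different lines.
text = "start\nmiddle\nend"

# By default "." matches any character except a newline,
# so ".*" cannot cross from the "start" line to the "end" line.
same_line = re.findall(r"start.*", text)
cross_line = re.findall(r"start.*end", text)

print(same_line)   # ['start']
print(cross_line)  # []
```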

1. Deleting multiple lines with sed

The test file test.txt contains:

start
test1
test2
end

Delete the content between `start` and `end`:

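# Delete the range, including the `start` and `end` lines
sed -i '/start/,/end/d' test.txt

# Delete the range but keep the `start` and `end` lines
# (the empty pattern // reuses the last regex that was tested)
sed -i '/start/,/end/{//!d;}' test.txt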
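For readers who prefer to see the range logic spelled out, here is a rough Python sketch of what the two sed commands do. The function name `delete_range` and its parameters are hypothetical, and it uses plain substring checks instead of sed's regular expressions, so treat it as an approximation rather than a drop-in replacement:

```python
def delete_range(lines, start="start", end="end", inclusive=True):
    """Drop lines from one containing `start` through the next one containing `end`."""
    kept = []
    inside = False
    for line in lines:
        if not inside and start in line:
            inside = True
            if not inclusive:
                kept.append(line)  # keep the opening marker line
            continue
        if inside:
            if end in line:
                inside = False
                if not inclusive:
                    kept.append(line)  # keep the closing marker line
            continue  # everything inside the range is dropped
        kept.append(line)
    return kept


sample = ["start", "test1", "test2", "end"]
print(delete_range(sample))                   # []
print(delete_range(sample, inclusive=False))  # ['start', 'end']
```

As with sed, the closing marker is only looked for on lines after the opening one, and an unclosed range runs to the end of the input.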

2. Matching multiple lines with Python regular expressions

Python offers several ways to match across lines:

① The `re.DOTALL` (or `re.S`) flag

import re

data = "1\nstart\ntest1\ntest2\nend\n2"

reg1 = r"start.*end"
reg2 = r"start(.*)end"
res1 = re.findall(reg1, data, flags=re.S)
print(res1)
res2 = re.findall(reg2, data, flags=re.DOTALL)
print(res2)

Output:

['start\ntest1\ntest2\nend']
['\ntest1\ntest2\n']
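One thing to watch once `.` can cross newlines: `.*` is greedy and will run to the last `end` in the text. The sketch below uses a made-up input with two blocks (the name `data2` is only for illustration) to compare the greedy form with the non-greedy `.*?`:

```python
import re

# Hypothetical input containing two start/end blocks.
data2 = "start\na\nend\nstart\nb\nend"

# Greedy: ".*" extends to the last "end" in the whole string.
greedy = re.findall(r"start(.*)end", data2, flags=re.S)

# Non-greedy: ".*?" stops at the first "end" after each "start".
lazy = re.findall(r"start(.*?)end", data2, flags=re.S)

print(greedy)  # ['\na\nend\nstart\nb\n']
print(lazy)    # ['\na\n', '\nb\n']
```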

② The expression `(.|\n|\r)*`

import re
data = "1\nstart\ntest1\ntest2\nend\n2"

reg3 = r"start((.|\n|\r)*)end"
res = re.findall(reg3, data)
print(res)

Output:

[('\ntest1\ntest2\n', '\n')]
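The inner parentheses in `(.|\n|\r)*` form a second capturing group, so `re.findall` returns a tuple per match instead of a plain string. If only the text between the markers is wanted, one option is a non-capturing group `(?:...)`, sketched here on the same `data`:

```python
import re

data = "1\nstart\ntest1\ntest2\nend\n2"

# Two capturing groups: findall returns (outer, inner) tuples,
# where the inner group holds the last character it matched.
with_inner_group = re.findall(r"start((.|\n|\r)*)end", data)

# (?:...) groups without capturing, so only the outer group is returned.
without_inner_group = re.findall(r"start((?:.|\n|\r)*)end", data)

print(with_inner_group)     # [('\ntest1\ntest2\n', '\n')]
print(without_inner_group)  # ['\ntest1\ntest2\n']
```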

③ The expression `[\s\S]*`

import re
data = "1\nstart\ntest1\ntest2\nend\n2"

reg4 = r"start([\s\S]*)end"
res = re.findall(reg4, data)
print(res)

Output:

['\ntest1\ntest2\n']
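The reason `[\s\S]` works without any flag is that `\s` covers whitespace (including `\n`) and `\S` covers everything else, so the class matches every character. A small check, with variable names chosen only for this sketch:

```python
import re

# "\s" matches whitespace such as "\n"; "\S" matches any non-whitespace character.
class_matches_newline = re.fullmatch(r"[\s\S]", "\n") is not None
dot_matches_newline = re.fullmatch(r".", "\n") is not None
print(class_matches_newline)  # True
print(dot_matches_newline)    # False

# On the article's sample data, the flag-free class gives the same
# capture as "." combined with re.DOTALL.
data = "1\nstart\ntest1\ntest2\nend\n2"
no_flag = re.findall(r"start([\s\S]*)end", data)
with_flag = re.findall(r"start(.*)end", data, flags=re.DOTALL)
print(no_flag == with_flag)  # True
```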

④ The inline flag `(?s)`

import re
data = "1\nstart\ntest1\ntest2\nend\n2"

reg5 = r"(?s)start(.*)end"
res = re.findall(reg5, data)
print(res)
reg5 = r"(?s)start.*end"
res = re.findall(reg5, data)
print(res)

Output:

['\ntest1\ntest2\n']
['start\ntest1\ntest2\nend']
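A global `(?s)` applies to the whole pattern, and in recent Python versions (3.11 and later, as far as I know) it has to appear at the very start of the expression. If only part of the pattern should cross newlines, the scoped form `(?s:...)`, available since Python 3.6, may be a better fit. A minimal sketch on the same `data`:

```python
import re

data = "1\nstart\ntest1\ntest2\nend\n2"

# Global flag: every "." can match a newline, so the trailing ".*"
# also swallows "\n2" after "end".
global_flag = re.findall(r"(?s)start.*end.*", data)

# Scoped flag: DOTALL is active only inside (?s:...). The trailing ".*"
# is outside the group, so it stops at the newline after "end".
scoped_flag = re.findall(r"start(?s:.*)end.*", data)

print(global_flag)  # ['start\ntest1\ntest2\nend\n2']
print(scoped_flag)  # ['start\ntest1\ntest2\nend']
```

The `(?s:...)` group does not capture, so `re.findall` returns the whole match in both cases.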

Reference:

https://stackoverflow.com/questions/159118/how-do-i-match-any-character-across-multiple-lines-in-a-regular-expression
