Linux常用指令---grep(搜尋過濾)

Linux系統中grep命令是一種強大的文字搜尋工具，它能使用正規表示式搜尋文字，並把匹配的行列印出來。grep全稱是Global Regular Expression Print，表示全域性正規表示式版本，它的使用許可權是所有使用者。

grep的工作方式是這樣的，它在一個或多個檔案中搜尋字串模板。如果模板包括空格，則必須被引用，模板後的所有字串被看作檔名。搜尋的結果被送到標準輸出，不影響原檔案內容。

grep可用於shell指令碼，因為grep透過返回一個狀態值來說明搜尋的狀態，如果模板搜尋成功，則返回0，如果搜尋不成功，則返回1，如果搜尋的檔案不存在，則返回2。我們利用這些返回值就可進行一些自動化的文字處理工作。

1．命令格式：

grep [option] pattern file

2．命令功能：

用於過濾/搜尋的特定字元。可使用正規表示式能多種命令配合使用，使用上十分靈活。

3．命令引數：

-a --text #不要忽略二進位制的資料。

-A<顯示行數> --after-context=<顯示行數> #除了顯示符合範本樣式的那一列之外，並顯示該行之後的內容。

-b --byte-offset #在顯示符合樣式的那一行之前，標示出該行第一個字元的編號。

-B<顯示行數> --before-context=<顯示行數> #除了顯示符合樣式的那一行之外，並顯示該行之前的內容。

-c --count #計算符合樣式的列數。

-C<顯示行數> --context=<顯示行數>或-<顯示行數> #除了顯示符合樣式的那一行之外，並顯示該行之前後的內容。

-d <動作> --directories=<動作> #當指定要查詢的是目錄而非檔案時，必須使用這項引數，否則grep指令將回報資訊並停止動作。

-e<範本樣式> --regexp=<範本樣式> #指定字串做為查詢檔案內容的樣式。

-E --extended-regexp #將樣式為延伸的普通表示法來使用。

-f<規則檔案> --file=<規則檔案> #指定規則檔案，其內容含有一個或多個規則樣式，讓grep查詢符合規則條件的檔案內容，格式為每行一個規則樣式。

-F --fixed-regexp #將樣式視為固定字串的列表。

-G --basic-regexp #將樣式視為普通的表示法來使用。

-h --no-filename #在顯示符合樣式的那一行之前，不標示該行所屬的檔名稱。

-H --with-filename #在顯示符合樣式的那一行之前，表示該行所屬的檔名稱。

-i --ignore-case #忽略字元大小寫的差別。

-l --file-with-matches #列出檔案內容符合指定的樣式的檔名稱。

-L --files-without-match #列出檔案內容不符合指定的樣式的檔名稱。

-n --line-number #在顯示符合樣式的那一行之前，標示出該行的列數編號。

-q --quiet或--silent #不顯示任何資訊。

-r --recursive #此引數的效果和指定“-d recurse”引數相同。

-s --no-messages #不顯示錯誤資訊。

-v --revert-match #顯示不包含匹配文字的所有行。

-V --version #顯示版本資訊。

-w --word-regexp #只顯示全字元合的列。

-x --line-regexp #只顯示全列符合的列。

-y #此引數的效果和指定“-i”引數相同。

4．規則表示式：

grep的規則表示式:

^ #錨定行的開始如：'^grep'匹配所有以grep開頭的行。

$ #錨定行的結束如：'grep$'匹配所有以grep結尾的行。

. #匹配一個非換行符的字元如：'gr.p'匹配gr後接一個任意字元，然後是p。

* #匹配零個或多個先前字元如：'*grep'匹配所有一個或多個空格後緊跟grep的行。

.* #一起用代表任意字元。

[] #匹配一個指定範圍內的字元，如'[Gg]rep'匹配Grep和grep。

[^] #匹配一個不在指定範圍內的字元，如：'[^A-FH-Z]rep'匹配不包含A-R和T-Z的一個字母開頭，緊跟rep的行。

$..$ #標記匹配字元，如'$love$'，love被標記為1。

\< #錨定單詞的開始，如:'\<grep'匹配包含以grep開頭的單詞的行。 </grep'匹配包含以grep開頭的單詞的行。 <>

\> #錨定單詞的結束，如'grep\>'匹配包含以grep結尾的單詞的行。

x\{m\} #重複字元x，m次，如：'0\{5\}'匹配包含5個o的行。

x\{m,\} #重複字元x,至少m次，如：'o\{5,\}'匹配至少有5個o的行。

x\{m,n\} #重複字元x，至少m次，不多於n次，如：'o\{5,10\}'匹配5--10個o的行。

\w #匹配文字和數字字元，也就是[A-Za-z0-9]，如：'G\w*p'匹配以G後跟零個或多個文字或數字字元，然後是p。

\W #\w的反置形式，匹配一個或多個非單詞字元，如點號句號等。

\b #單詞鎖定符，如: '\bgrep\b'只匹配grep。

POSIX字元:

為了在不同國家的字元編碼中保持一至，POSIX(The Portable Operating System Interface)增加了特殊的字元類，如[:alnum:]是[A-Za-z0-9]的另一個寫法。要把它們放到[]號內才能成為正規表示式，如[A- Za-z0-9]或[[:alnum:]]。在linux下的grep除fgrep外，都支援POSIX的字元類。

[:alnum:] #文字數字字元

[:alpha:] #文字字元

[:digit:] #數字字元

[:graph:] #非空字元（非空格、控制字元）

[:lower:] #小寫字元

[:cntrl:] #控制字元

[:print:] #非空字元（包括空格）

[:punct:] #標點符號

[:space:] #所有空白字元（新行，空格，製表符）

[:upper:] #大寫字元

[:xdigit:] #十六進位制數字（0-9，a-f，A-F）

5．使用例項：

例項1：查詢指定程式

命令：

ps -ef|grep svn

輸出：

[root@localhost ~]# ps -ef|grep svn

root 4943 1 0 Dec05 ? 00:00:00 svnserve -d -r /opt/svndata/grape/

root 16867 16838 0 19:53 pts/0 00:00:00 grep svn

[root@localhost ~]#

說明：

第一條記錄是查詢出的程式；第二條結果是grep程式本身，並非真正要找的程式。

例項2：查詢指定程式個數

命令：

ps -ef|grep svn -c

ps -ef|grep -c svn

輸出：

[root@localhost ~]# ps -ef|grep svn -c

2

[root@localhost ~]# ps -ef|grep -c svn

2

[root@localhost ~]#

說明：

例項3：從檔案中讀取關鍵詞進行搜尋

命令：

cat test.txt | grep -f test2.txt

輸出：

[root@localhost test]# cat test.txt

hnlinux

peida.cnblogs.com

ubuntu

ubuntu linux

redhat

Redhat

linuxmint

[root@localhost test]# cat test2.txt

linux

Redhat

[root@localhost test]# cat test.txt | grep -f test2.txt

hnlinux

ubuntu linux

Redhat

linuxmint

[root@localhost test]#

說明：

輸出test.txt檔案中含有從test2.txt檔案中讀取出的關鍵詞的內容行

例項4：從檔案中讀取關鍵詞進行搜尋且顯示行號

命令：

cat test.txt | grep -nf test2.txt

輸出：

[root@localhost test]# cat test.txt

hnlinux

peida.cnblogs.com

ubuntu

ubuntu linux

redhat

Redhat

linuxmint

[root@localhost test]# cat test2.txt

linux

Redhat

[root@localhost test]# cat test.txt | grep -nf test2.txt

1:hnlinux

4:ubuntu linux

6:Redhat

7:linuxmint

[root@localhost test]#

說明：

輸出test.txt檔案中含有從test2.txt檔案中讀取出的關鍵詞的內容行，並顯示每一行的行號

例項5：從檔案中查詢關鍵詞

命令：

grep 'linux' test.txt

輸出：

[root@localhost test]# grep 'linux' test.txt

hnlinux

ubuntu linux

linuxmint

[root@localhost test]# grep -n 'linux' test.txt

1:hnlinux

4:ubuntu linux

7:linuxmint

[root@localhost test]#

說明：

例項6：從多個檔案中查詢關鍵詞

命令：

grep 'linux' test.txt test2.txt

輸出：

[root@localhost test]# grep -n 'linux' test.txt test2.txt

test.txt:1:hnlinux

test.txt:4:ubuntu linux

test.txt:7:linuxmint

test2.txt:1:linux

[root@localhost test]# grep 'linux' test.txt test2.txt

test.txt:hnlinux

test.txt:ubuntu linux

test.txt:linuxmint

test2.txt:linux

[root@localhost test]#

說明：

多檔案時，輸出查詢到的資訊內容行時，會把檔案的命名在行最前面輸出並且加上":"作為標示符

例項7：grep不顯示本身程式

命令：

ps aux|grep \[s]sh

ps aux | grep ssh | grep -v "grep"

輸出：

[root@localhost test]# ps aux|grep ssh

root 2720 0.0 0.0 62656 1212 ? Ss Nov02 0:00 /usr/sbin/sshd

root 16834 0.0 0.0 88088 3288 ? Ss 19:53 0:00 sshd: root@pts/0

root 16901 0.0 0.0 61180 764 pts/0 S+ 20:31 0:00 grep ssh

[root@localhost test]# ps aux|grep \[s]sh

root 2720 0.0 0.0 62656 1212 ? Ss Nov02 0:00 /usr/sbin/sshd

root 16834 0.0 0.0 88088 3288 ? Ss 19:53 0:00 sshd: root@pts/0

[root@localhost test]# ps aux | grep ssh | grep -v "grep"

root 2720 0.0 0.0 62656 1212 ? Ss Nov02 0:00 /usr/sbin/sshd

root 16834 0.0 0.0 88088 3288 ? Ss 19:53 0:00 sshd: root@pts/0

說明：

例項8：找出以u開頭的行內容

命令：

cat test.txt |grep ^u

輸出：

[root@localhost test]# cat test.txt |grep ^u

ubuntu

ubuntu linux

[root@localhost test]#

說明：

例項9：輸出非u開頭的行內容

命令：

cat test.txt |grep ^[^u]

輸出：

[root@localhost test]# cat test.txt |grep ^[^u]

hnlinux

peida.cnblogs.com

redhat

Redhat

linuxmint

[root@localhost test]#

說明：

例項10：輸出以hat結尾的行內容

命令：

cat test.txt |grep hat$

輸出：

[root@localhost test]# cat test.txt |grep hat$

redhat

Redhat

[root@localhost test]#

說明：

例項11：輸出ip地址

命令：

ifconfig eth0|grep -E "([0-9]{1,3}\.){3}[0-9]"

輸出：

[root@localhost test]# ifconfig eth0|grep "[0-9]\{1,3\}\.[0-9]\{1,3\}\.[0-9]\{1,3\}\.[0-9]\{1,3\}"

inet addr:192.168.120.204 Bcast:192.168.120.255 Mask:255.255.255.0

[root@localhost test]# ifconfig eth0|grep -E "([0-9]{1,3}\.){3}[0-9]"

inet addr:192.168.120.204 Bcast:192.168.120.255 Mask:255.255.255.0

[root@localhost test]#

說明：

例項12：顯示包含ed或者at字元的內容行

命令：

cat test.txt |grep -E "ed|at"

輸出：

[root@localhost test]# cat test.txt |grep -E "peida|com"

peida.cnblogs.com

[root@localhost test]# cat test.txt |grep -E "ed|at"

redhat

Redhat

[root@localhost test]#

說明：

例項13：顯示當前目錄下面以.txt 結尾的檔案中的所有包含每個字串至少有7個連續小寫字元的字串的行

命令：

grep '[a-z]\{7\}' *.txt

輸出：

[root@localhost test]# grep '[a-z]\{7\}' *.txt

test.txt:hnlinux

test.txt:peida.cnblogs.com

test.txt:linuxmint

[root@localhost test]#

例項14:日誌檔案過大，不好檢視，我們要從中檢視自己想要的內容，或者得到同一類資料，比如說沒有404日誌資訊的

命令：

grep '.' access1.log|grep -Ev '404' > access2.log

grep '.' access1.log|grep -Ev '(404|/photo/|/css/)' > access2.log

grep '.' access1.log|grep -E '404' > access2.log

輸出：

[root@localhost test]# grep “.”access1.log|grep -Ev “404” > access2.log

說明：上面3句命令前面兩句是在當前目錄下對access1.log檔案進行查詢，找到那些不包含404的行，把它們放到access2.log中,後面去掉’v’,即是把有404的行放入access2.log

Linux常用指令---grep(搜尋過濾)

Linux常用指令---grep(搜尋過濾)

相關文章