scrapy處理post請求的傳參和日誌等級

Bound_w發表於2019-03-04

原文網址 : https://www.cnblogs.com/wqzn/p/10471321.html

一.Scrapy的日誌等級

　　- 在使用scrapy crawl spiderFileName執行程式時，在終端裡列印輸出的就是scrapy的日誌資訊。

　　- 日誌資訊的種類：

　　　　　　　　ERROR ： 一般錯誤

　　　　　　　　WARNING : 警告

　　　　　　　　INFO : 一般的資訊

　　　　　　　　DEBUG ： 除錯資訊

　　- 設定日誌資訊指定輸出：

　　　　在settings.py配置檔案中，加入

LOG_LEVEL = ‘指定日誌資訊種類’即可。

LOG_FILE = 'log.txt'則表示將日誌資訊寫入到指定檔案中進行儲存。

二.請求傳參

　　- 在某些情況下，我們爬取的資料不在同一個頁面中，例如，我們爬取一個電影網站，電影的名稱，評分在一級頁面，而要爬取的其他電影詳情在其二級子頁面中。這時我們就需要用到請求傳參。

處理post請求的引數：

建立專案：

程式碼:

import scrapy


class PostSpider(scrapy.Spider):
    name = 'post'
    # allowed_domains = ['www.xxx.com']
    start_urls = ['https://fanyi.baidu.com/sug']

    def start_requests(self):
        data = {
            'kw':'dog'
        }
        for url in self.start_urls:
            yield scrapy.FormRequest(url=url,formdata=data,callback=self.parse)

    def parse(self, response):
        print(response.text)

settings.py

USER_AGENT = 'Mozilla/5.0 (Windows NT 6.1; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/72.0.3626.119 Safari/537.36'
# Obey robots.txt rules
ROBOTSTXT_OBEY = False

檢視請求的資料：　

案例二：

# -*- coding: utf-8 -*-
import scrapy
from moviePro.items import MovieproItem

class MovieSpider(scrapy.Spider):
    name = 'movie'
    # allowed_domains = ['www.xxx.com']
    start_urls = ['https://www.4567tv.tv/frim/index1.html']
    #解析詳情頁中的資料
    def parse_detail(self,response):
        #response.meta返回接收到的meta字典
        item = response.meta['item']
        actor = response.xpath('/html/body/div[1]/div/div/div/div[2]/p[3]/a/text()').extract_first()
        item['actor'] = actor

        yield item

    def parse(self, response):
        li_list = response.xpath('//li[@class="col-md-6 col-sm-4 col-xs-3"]')
        for li in li_list:
            item = MovieproItem()
            name = li.xpath('./div/a/@title').extract_first()
            detail_url = 'https://www.4567tv.tv'+li.xpath('./div/a/@href').extract_first()
            item['name'] = name
            #meta引數:請求傳參.meta字典就會傳遞給回撥函式的response引數
            yield scrapy.Request(url=detail_url,callback=self.parse_detail,meta={'item':item})

settings.py
LOG_LEVEL = "ERROE"
LOG_FILE = './log.txt'    #輸出日誌

items.py

# -*- coding: utf-8 -*-

# Define here the models for your scraped items
#
# See documentation in:
# https://doc.scrapy.org/en/latest/topics/items.html

import scrapy


class MoveproItem(scrapy.Item):
    # define the fields for your item here like:
    # name = scrapy.Field()
    name = scrapy.Field()
    actor = scrapy.Field()

Scrapy的日誌等級和請求傳參
2019-01-15
vue2.0 axios post請求傳參問題（ajax請求）
2018-07-24
VueiOS
scrapy-redis原始碼解讀之傳送POST請求
2019-05-15
Redis原始碼
java傳送GET和post請求
2020-04-05
Java
Node中POST請求的正確處理方式
2019-06-19
axios中POST請求變成OPTIONS處理
2018-07-19
iOS
Python開發技巧：scrapy-redis爬蟲如何傳送POST請求
2021-03-24
PythonRedis爬蟲
Postman傳送Post請求
2019-04-24
Postman
Java傳送Post請求
2020-11-26
Java
SpringMVC原始碼分析：POST請求中的檔案處理
2022-05-22
SpringMVC原始碼
python傳送HTTP POST請求
2018-06-03
PythonHTTP
Angular Universal Application 應該處理 HTTP POST 請求嗎？
2023-02-18
AngularAPPHTTP
SpringMVC中如何傳送GET請求、POST請求、PUT請求、DELETE請求。
2020-05-13
SpringMVCdelete
使用Postman傳送POST請求的指南
2024-06-12
Postman
以Raw的方式傳送POST請求
2021-08-07
cURL實現傳送Get和Post請求(PHP)
2018-08-01
PHP
Scrapy原始碼閱讀分析_4_請求處理流程
2019-02-19
原始碼
自定義Egg.js的請求級別日誌
2018-11-03
JS
介面請求 (get、post、head 等) 詳解
2020-11-25
介面請求（get、post、head等）詳解
2020-11-25
linux用curl傳送post請求
2019-02-21
Linux
處理nginx訪問日誌，篩選時間大於1秒的請求
2018-11-15
Nginx
curl 傳送 POST 請求的四種方式
2020-11-18
httprequest- post- get -傳送請求
2018-12-30
HTTP
file_get_contents傳送post請求
2022-01-28
FastAPI中請求URL傳參
2024-06-26
ASTAPI
jmeter之傳送json資料的post請求
2018-05-31
JMeterJSON
logstash 收集 http POST請求中的json日誌時，欄位衝突問題
2024-07-02
HTTPJSON
java apache commons HttpClient傳送get和post請求的學習整理
2018-03-02
JavaApacheHTTPclient
SpringBoot使用Axios傳送請求，引數處理
2019-03-22
Spring BootiOS
Kettle通過Http post請求webservice介面以及結果解析處理
2021-06-11
HTTPWeb
處理請求(AFURLRequestSerialization)和響應(AFURLResponseSerialization)
2018-05-21
java post 請求
2019-03-25
Java
postman(二)：使用postman傳送get or post請求
2018-12-20
Postman
RestTemplate exchange GET POST請求傳引數DEMO
2024-11-27
REST
【Postman】6 Postman 傳送post請求-Json格式
2021-05-24
PostmanJSON
Vue中通過Axios向SpringBoot傳送get和post請求
2020-07-30
VueiOSSpring Boot
Golang：使用go-resty/resty傳送http請求get和post
2024-05-25
GolangRESTHTTP

scrapy處理post請求的傳參和日誌等級

相關文章