01、部落格爬蟲

三角形發表於2019-04-11

原文網址 : https://www.cnblogs.com/www1707/p/10692298.html

你需要爬取的是部落格【人人都是蜘蛛俠】中，《未來已來（四）——Python學習進階圖譜》的所有文章評論，並且列印。

文章URL:https://wordpress-edu-3autumn.localprod.forc.work/all-about-the-future_04/

 1 #1、部落格爬蟲
 2 #    你需要爬取的是部落格【人人都是蜘蛛俠】中，《未來已來（四）——Python學習進階圖譜》的所有文章評論，並且列印。
 3 #    文章URL:https://wordpress-edu-3autumn.localprod.forc.work/all-about-the-future_04/
 4 import requests
 5 from bs4 import BeautifulSoup
 6 res = requests.get('https://wordpress-edu-3autumn.localprod.forc.work/all-about-the-future_04/')
 7 html = res.text
 8 soup = BeautifulSoup(html,'html.parser')
 9 items = soup.find_all('div',class_='comment-content')
10 for item in items:
11     print(item.find('p').text)
12 
13 '''
14 執行結果如下：
15     測試評論
16     我們就是
17     minu
18     kpi
19 '''
20 
21 '''
22 #   下面是老師的程式碼
23 
24 #   呼叫requests庫
25 import requests
26 #   呼叫BeautifulSoup庫
27 from bs4 import BeautifulSoup
28 #   把網址複製給變數destnation_url
29 url_destnation = 'https://wordpress-edu-3autumn.localprod.forc.work/all-about-the-future_04/'
30 #   返回一個response物件，賦值給destnation
31 res_comment = requests.get (url_destnation)
32 #   把網頁解析為BeautifulSoup物件
33 bs_comment = BeautifulSoup(res_comment.text,'html.parser')
34 #   通過匹配屬性提取出我們想要的元素
35 list_comments = bs_comment.find_all('div',class_= 'comment-content')
36 #   遍歷列表，取出列表中的每一個值
37 for tag_comment in list_comments:
38 #   列印評論的文字
39     print(tag_comment.text)
40 '''

items中每個Tag的內容如下

1 <div class="comment-content">
2 <p>第1個蜘蛛俠</p>
3 </div>

[雪峰磁針石部落格]python爬蟲cookbook1爬蟲入門
2018-09-10
Python爬蟲
部落格園記錄：汽車引數爬蟲
2024-11-06
爬蟲
分享5個爬蟲專業部落格網站
2021-10-12
爬蟲網站
Python爬蟲教程-01-爬蟲介紹
2018-09-06
Python爬蟲
【爬蟲工具】下載部落格轉成Markdown的形式
2019-02-16
爬蟲
每天一個爬蟲-learnku我的部落格列表
2021-06-17
爬蟲
【爬蟲】利用Python爬蟲爬取小麥苗itpub部落格的所有文章的連線地址（1）
2018-12-26
爬蟲Python
python爬蟲日記01
2021-05-11
Python爬蟲
Python爬蟲-部落格園首頁推薦部落格排行(整合詞雲+郵件傳送)
2019-05-14
Python爬蟲
實用爬蟲-01-檢測爬蟲的 IP
2018-09-08
爬蟲
我的第一篇部落格（從爬蟲開始）
2020-09-29
爬蟲
Python爬蟲入門教程 40-100 部落格園Python相關40W部落格抓取 scrapy
2019-02-25
Python爬蟲
爬取部落格園文章
2020-07-31
【爬蟲】利用Python爬蟲爬取小麥苗itpub部落格的所有文章的連線地址並寫入Excel中（2）
2018-12-27
爬蟲PythonExcel
Python爬蟲基礎-01-帶有請求引數的爬蟲
2018-06-06
Python爬蟲
golang-spider-從單任務版爬蟲到併發爬蟲01
2018-04-05
GolangIDE爬蟲
python爬蟲學習01--電子書爬取
2020-07-13
Python爬蟲
Python爬蟲實戰系列1：部落格園cnblogs熱門新聞採集
2024-03-13
Python爬蟲
爬蟲01:爬取豆瓣電影TOP 250基本資訊
2020-12-29
爬蟲
【Python學習】爬蟲爬蟲爬蟲爬蟲~
2018-05-03
Python爬蟲
個人部落格專案筆記_01
2024-04-08
筆記
你的部落格可能被爬了
2019-07-25
Python爬取CSDN部落格資料
2019-01-03
Python
Python 實用爬蟲-04-使用 BeautifulSoup 去水印下載 CSDN 部落格圖片
2019-06-16
Python爬蟲
Flutter 即學即用系列部落格——01 環境搭建
2019-02-12
Flutter
爬蟲：多程式爬蟲
2021-05-19
爬蟲
Go秒爬部落格園100頁新聞
2018-08-01
Go
python爬蟲---網頁爬蟲，圖片爬蟲，文章爬蟲，Python爬蟲爬取新聞網站新聞
2019-01-04
Python爬蟲網頁網站
頁面資料採集——網路爬蟲實戰（ASP.NET Web 部落格園為例）
2020-12-25
爬蟲ASP.NETWeb
通用爬蟲與聚焦爬蟲
2023-04-18
爬蟲
爬蟲--Scrapy簡易爬蟲
2020-10-07
爬蟲
爬蟲進階：反反爬蟲技巧
2018-06-28
爬蟲
反爬蟲之字型反爬蟲
2019-06-27
爬蟲
爬蟲
2024-11-16
爬蟲
01-個人部落格筆記-專案初始化
2018-07-09
筆記
2020-10-01 第一次發部落格helloword
2020-10-01
小豬的Python學習之旅 —— 8.爬蟲實戰：刷某部落格站點的訪問量
2021-09-09
Python爬蟲
增補部落格第十九篇 python 爬樓梯
2024-06-14
Python

01、部落格爬蟲

相關文章