scrapy中使用正確的xpath出現空列表問題

今天在使用scrapy框架爬取網頁時，使用正確的xpath來爬取時，爬取的缺失空列表，**如下：

# -
*- coding: utf-8-
*-import scrapy
class
haodfspider
(scrapy.spider)
:    name =
'haodf'
start_urls =
['']    def parse
(self, response)
:for item in response.
xpath
(r'//table[@summary="forum_6"]/tbody[not(contains(@id,"separatorline"))]'):
url_s = item.
xpath
('./tr/th/a[3]/@href').
get(
)            yield scrapy.
request
(url_s, callback=self.parse_s)
def parse_s
(self, response)
:print
(response.
xpath
('//div/table[@class="plhin"]/tbody/tr/td//div[@class="t_fsz"]//td').
extract()
)

後來發現刪除tbody就行了，原因是瀏覽器會規範這個html文件

記憶碎片之scrapy中使用logging模快

settings.py log level warning 設定日誌顯示的等級 log file a.log 設定日誌儲存的位置，設定後介面不會顯示日誌內容 spider.py import logging import logging logging.basicconfig函式各引數 filena...

詳解Python安裝scrapy的正確姿勢

執行平台 windows python版本 python3.x ide sublime text3 一 scrapy簡介 scrapy是乙個為了爬取資料提取結構性資料而編寫的應用框架，可以應用於資料探勘，資訊處理或儲存歷史資料等一些列的程式中。scrapy最初就是為了網路爬取而設計的。現在，scr...

vue專案中使用less正確姿勢

本人在vue專案中引入less外掛程式，但是在使用webpack打包專案的時候發生了報錯 typeerror this.getoptions is not a function，接下來給大家提供乙個解決辦法因為我們安裝的less 版本過高導致的這個問題，具體原因是 less 8.x版本更新了配置...

scrapy中使用正確的xpath出現空列表問題

記憶碎片之scrapy中使用logging模快

詳解Python安裝scrapy的正確姿勢

vue專案中使用less正確姿勢

相關推薦