乙個簡單的Python爬蟲

乙個簡單測試爬蟲的網頁：

1.獲取**的html**

import urllib.request
request=urllib.request.urlopen("")
str=request.read()
print(str)

2.返回網頁標題

輸出目標網頁內title標籤內的文字：，gettitle()函式返回網頁的標題，如果獲取網頁遇到問題就返回乙個 none 物件。如果伺服器不存在， html 就是乙個 none 物件，html.read() 就會丟擲 attributeerror 。

則執行輸出結果為：

d:\test\python>python reptile.py

3.查詢html

#!/usr/bin/env python #**來自:
#coding=utf-8
import requests
from bs4 import beautifulsoup
url = ''
r = requests.get(url)
soup = beautifulsoup(r.content, 'lxml')
bbs_nes = soup.find_all(name='span',attrs=)
for news in bbs_nes:
print (news.string)

規律是查詢：

在這個html源**中一共有：

now with super-colorful bell peppers! 8 entire dolls per set! octuple the presents! also hand-painted by trained monkeys! or maybe he's only resting?

keep your friends guessing!

所以最後的輸出結果為：

d:\test\python>python reptile.py now with super-colorful bell peppers! 8 entire dolls per set! octuple the presents! also hand-painted by trained monkeys! or maybe he's only resting?

keep your friends guessing!

乙個簡單的Python爬蟲

python 乙個簡單的貪婪爬蟲

python 乙個簡單的貪婪爬蟲

Python爬蟲入門筆記乙個簡單的爬蟲架構

乙個簡單的Python爬蟲

python 乙個簡單的貪婪爬蟲

python 乙個簡單的貪婪爬蟲

Python爬蟲入門筆記 乙個簡單的爬蟲架構

相關推薦

Python爬蟲入門筆記乙個簡單的爬蟲架構