Python Crawler (5): Example (1)

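# -*- coding: utf-8 -*-
# Import the third-party packages and modules
import os  # os ships with Python; used here to create the folder the images are saved in

import requests
from bs4 import BeautifulSoup

folder = 'test_img'
if not os.path.exists(folder):
    os.makedirs(folder)


def download(url):
    response = requests.get(url)
    # Split on '/' and keep the last segment; it serves only as the saved file's name
    name = url.split('/')[-1]
    # Write the downloaded bytes into the folder under that name
    with open(os.path.join(folder, name), 'wb') as f:
        f.write(response.content)
    print('ok')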

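The technical difficulties in this example are:

1. Using a for loop to generate the 19 URLs of this multi-page thread;

2. range() produces an ordered sequence of integers;

3. str() converts an integer into a string;

4. Reviewing the BeautifulSoup module's find_all('img') together with get('src') on each returned tag, to collect every src value.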
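The four points above can be sketched roughly as follows. This is only a minimal illustration, not the original post's code: the base URL, the helper names build_page_urls and extract_img_srcs, and the inline sample HTML are all assumptions made for the example, since the post does not show the real thread address.

```python
from bs4 import BeautifulSoup

# Hypothetical base URL; the original post does not show the real one.
BASE_URL = "https://example.com/thread?page="


def build_page_urls(base_url, pages=19):
    """Return one URL per page, numbering pages from 1 to `pages`."""
    urls = []
    # range(1, pages + 1) yields 1, 2, ..., pages in order
    for page in range(1, pages + 1):
        # str() converts the integer page number so it can be concatenated
        urls.append(base_url + str(page))
    return urls


def extract_img_srcs(html):
    """Return the src attribute of every <img> tag that has one."""
    soup = BeautifulSoup(html, "html.parser")
    srcs = []
    for img in soup.find_all("img"):
        src = img.get("src")  # None when the tag has no src attribute
        if src:
            srcs.append(src)
    return srcs


page_urls = build_page_urls(BASE_URL)

# Inline sample HTML (an assumption), so the sketch runs without network access.
SAMPLE_HTML = """
<div>
  <img src="https://example.com/img/a.jpg">
  <img alt="no source">
  <img src="https://example.com/img/b.png">
</div>
"""
sample_srcs = extract_img_srcs(SAMPLE_HTML)

print(len(page_urls), page_urls[0], page_urls[-1])
print(sample_srcs)
```

In a real run, each page URL would be fetched with requests.get, its HTML passed to extract_img_srcs, and every resulting src handed to download(). Checking that src is not empty avoids passing None to download() when an img tag has no src attribute.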
