（一）BeautifulSoup的安裝

確保必要的工作(已經安裝好python和pip)

執行cmd，在命令列中輸入一下命令即可安裝成功。

pip install beautifulsoup4

由於本人使用的是ubuntu，即主要講解的是ubuntu下的安裝，其實其他發行版本的安裝都是差不多的。

安裝python

由於在ubuntu的發行版本中已經自帶有python，即可以略過這一步

安裝pip

預設是沒有安裝pip的，開啟shell，輸入一下命令即可安裝pip，此方法比較方便快捷。

sudo apt-get install python-pip

beautifulsoup安裝

sudo apt-get install python-bs4

pip install beautifulsoup4

由於beautifulsoup4支援多種解析器，但其中，lxml的解析速度比較快的，所以，必須安裝lxml解析器。當然，如果覺得不需要lxml解析器的話，也可以不安裝，而是使用內建的html.parser解析器。

pip install lxml

beautifulsoup確實是乙個不錯的文件樹分析工具，比較有利於對網頁資訊的提取。當然，也可以使用正則對網頁資訊進行提取，不過得水平比較好才行。

BeautifulSoup選擇屬性（一）

soup.select class class 名可以提取出class類同名的 soup.select div div名可以提取出div類同名的如下 import requests 帶入requests模組 from bs4 import beautifulsoup 從bs4中帶入reques...

Python的BeautifulSoup庫的使用

python爬蟲的beautifulsoup庫的使用以下為爬去985高校的例子 import requests from bs4 import beautifulsoup 從bs4中引入beautifulsoup庫 import re url r requests.get url soup bea...

beautifulsoup爬蟲快速入門一基礎知識

主要涉及到的知識點這裡針對的是資料以html返回的形式 beautifulsoup lxml的使用首先這裡需要請求到乙個網頁位址，之後用beautifulsoup解析網頁 requestsapi request.get url bs beautifulsoup requestsapi.conte...

（一）BeautifulSoup的安裝

BeautifulSoup選擇屬性（一）

Python的BeautifulSoup庫的使用

beautifulsoup爬蟲快速入門一基礎知識

相關推薦