獲取標籤及內容詳解

from bs4 import beautifulsoup

html ='''

the dormouse's story

aaaaa

the dormouse's storya

lacie

tillie

good

'''soup = beautifulsoup(html,'lxml')

print(soup.find_all('p',attrs=))

(1)獲取標籤物件

print(soup.h1)

（2）獲取標籤內的文字字串:

print(soup.h1.text)

print(soup.h1.get_text())

tit = soup.find('h1').get_text()

print(tit)

（3）獲取soup內的所有p標籤，返回乙個列表

print(soup.find_all('p'))

(4)多層查詢

find_all查詢返回的是列表，使用下標尋找想要的內容

print(soup.find_all('ul')[0].find_all('li'))

（5）獲取標籤的屬性

print(soup.a.attrs['href'])

tag.get('attr')

可以得到tag標籤中attr屬性的value,

for link in soup.find_all('a'):

print(link.get('href'))

(6)通過指定的屬性，獲取物件

print(soup.find('ul',id='ulone'))

print(soup.find_all('ul',id='ulone'))結果是列表

print(soup.find_all('p',attrs=))

python 根據標籤名獲取標籤內容

import re import json import requests from bs4 import beautifulsoup import lxml.html from lxml import etree result requests.get with open 123.html wb ...

js獲取外層標籤（排除標籤裡面的內容）

前些天做乙個htmldom樹，樹的每個節點對應乙個html標籤。在獲取頁面標籤的時候，dom.innerhtml獲取的是物件的內容，又嘗試了dom.outerhtml得到的是外層標籤和裡面的內容。找了好久都沒有找到只獲取外層標籤本身的方法。後來突然發現clonenode include all 方法...

JQ和Js獲取span標籤的內容

源 jq和js獲取span標籤的內容 1 我是span標籤的內容 1 var cont document.getelementbyid content 2 console.log innertext cont cont.innertext 3 console.log innerhtml cont c...

獲取標籤及內容詳解

python 根據標籤名獲取標籤內容

js獲取外層標籤（排除標籤裡面的內容）

JQ和Js獲取span標籤的內容

相關推薦