HTML標籤的get方法

1. 獲取標籤屬性的時候，可以先獲取整個標籤，再利用get()方法獲取標籤的屬性

import requests
from lxml import etree
html=requests.get('').text
html=etree.html(html)
res=html.xpath('//li//a')    #獲得li標籤
for re in res:
print(re.get('href'))  #get()方法獲得各li標籤的href屬性

這樣寫的好處是:如果一次需要提取多個屬性的時候，多次提取的時候比較方便。

當get()獲取的標籤屬性在標籤中並不存的時候，返回none,並不丟擲異常。這還挺好的。。

etree.tostring(標籤)函式:列印每個標籤的具體內容

html = etree.html(res)
result = etree.tostring(html)

2.在selenium webdriver 中，

from selenium import webdriver
driver=webdriver.chrome()
driver.get('')
driver.switch_to_frame('contentframe')
htmllist=driver.find_elements_by_xpath('//p[@class="dec"]//a')
songlist_list=
songlist_list_name=
for item in htmllist:
songlist_list.insert(0,item.get_attribute('href'))  #用get_attribute獲取標籤屬性
songlist_list_name.insert(0,item.get_attribute('textcontent').strip())  #獲取標籤的文字
內容

在selenium webdriver 中獲取標籤的屬性需要get_attribute()函式來獲取，不可再用get()方法了。

php 處理html標籤的方法

清除空格和換行 str trim str 清除字串兩邊的空格 str strip tags str,利用php自帶的函式清除html格式 str preg replace t str 使用正規表示式替換內容，如空格，換行，並將替換為空。str preg replace r n str str pr...

HTML中使用struts標籤的方法

在jsp中使用struts標籤要先對標籤檔案進行引用,這裡用struts2作為例子,如下 taglib prefix s uri struts tags 而在html中根本無法識別上面所以在html無法使用struts標籤.解決方法在tomcat目錄 conf web.xml 檔案中,找到 se...

python去除所有html標籤的方法

這段可以用於去除文字裡的字串標籤，不包括標籤裡面的內容 import re 程式設計客棧html 程式設計客棧a 程式設計客棧href 我們,python學習！dr re.fezczocmclcompile r re.s dd dr.sub html print dd 執行結果如下我們,pyth...

HTML標籤的get方法

php 處理html標籤的方法

HTML中使用struts標籤的方法

python去除所有html標籤的方法

相關推薦