Python lxml库使用etree.HTML解析网页报错:'function' object has no attribute 'HTML' 的解决方法
应该将 etree.HTML 修改为 etree.HTMLParser,即将 tree = etree.HTML(html) 修改为 tree = etree.HTMLParser().parse(html),并且将 tree.xpath 修改为 tree.xpath()。修改后的代码如下:
import requests
from lxml import etree
import pandas as pd
url = "http://search.dangdang.com/?key=%C9%F1%BE%AD%CD%F8%C2%E7&act=input"
headers = {
"User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/114.0.0.0 Safari/537.36 Edg/114.0.1823.67"
}
response = requests.get(url, headers=headers)
html = response.text
tree = etree.HTMLParser().parse(html)
原文地址: https://www.cveoy.top/t/topic/pqEN 著作权归作者所有。请勿转载和采集!