Python lxml: fixing the error 'function' object has no attribute 'HTML' when parsing a web page with etree.HTML

Date: 2028-08-13

Tags: General

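This error means that, at the point where etree.HTML(html) runs, the name etree refers to a function rather than the lxml.etree module. Replacing the call with etree.HTMLParser().parse(html) does not help: any attribute lookup on that function fails the same way, and HTMLParser objects have no parse method in any case. The usual cause is that the name was rebound somewhere in the script, for example by a def etree(...), by an assignment such as etree = some_function, or by a later import that brings in a different object under the same name. Rename the conflicting object, keep from lxml import etree as the only binding for that name, and leave tree = etree.HTML(html) unchanged. Also note that xpath is a method and must be called with an expression, for example tree.xpath("//a/@href"), not merely referenced as tree.xpath. The corrected code is as follows:

import requests
from lxml import etree  # make sure nothing else in the script is named "etree"
import pandas as pd

url = "http://search.dangdang.com/?key=%C9%F1%BE%AD%CD%F8%C2%E7&act=input"
headers = {
    "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/114.0.0.0 Safari/537.36 Edg/114.0.1823.67"
}
response = requests.get(url, headers=headers)
html = response.text
# etree.HTML parses an HTML string and returns the root element
tree = etree.HTML(html)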
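To see where the message comes from, the following minimal sketch reproduces it without any network access. It assumes the cause is a function that shadows the name etree; the helper names reproduce_shadowing_error, is_module, and parse_title are hypothetical and are not part of lxml or of the original script.

```python
import inspect


def reproduce_shadowing_error():
    """Bind the name `etree` to a function and trigger the same AttributeError."""
    def etree():  # hypothetical stand-in for whatever function shadowed the module
        pass

    try:
        etree.HTML("<p>hi</p>")  # attribute lookup on a function object fails
    except AttributeError as exc:
        return str(exc)
    return None


def is_module(obj):
    """Return True if obj is a module, which is what `etree` should be bound to."""
    return inspect.ismodule(obj)


def parse_title(html_text):
    """Parse an HTML string with lxml and return the text of its <title> elements."""
    # Imported locally so that module-level code cannot shadow the name here.
    from lxml import etree

    tree = etree.HTML(html_text)         # root element of the parsed document
    return tree.xpath("//title/text()")  # xpath is called with an expression
```

In a failing script, printing type(etree) or calling is_module(etree) just before the etree.HTML call is a quick way to check whether the name still points at the lxml.etree module.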


Original source: https://www.cveoy.top/t/topic/pqEN Copyright belongs to the author. Please do not repost or scrape.