Bilibili 子域名访问量爬虫 Python 代码
import requests from bs4 import BeautifulSoup
定义一个空的字典用来存储子域名和访问数量
subdomains = {}
定义一个函数用来获取每个子域名的访问数量
def get_subdomain_views(subdomain): url = 'https://{}.bilibili.com/'.format(subdomain) response = requests.get(url) soup = BeautifulSoup(response.text, 'html.parser') view_count = soup.select_one('.n-data .n-data-v').text return int(view_count)
定义一个列表存储需要爬取的子域名
subdomain_list = ['www', 'space', 'bangumi', 'live', 'vc', 'account', 'message', 'passport', 'pay', 'app', 'search', 's']
循环遍历每个子域名,调用函数获取访问数量并存储到字典中
for subdomain in subdomain_list: view_count = get_subdomain_views(subdomain) subdomains[subdomain] = view_count
输出结果
print(subdomains)
原文地址: https://www.cveoy.top/t/topic/g55D 著作权归作者所有。请勿转载和采集!