DMM作品信息查询工具:识别高清/标清、月额/非月作品
DMM作品信息查询工具:识别高清/标清、月额/非月作品/n/n该脚本可以帮助用户查询DMM作品信息,自动识别高清/标清、月额/非月作品,并展示作品标题、分辨率、码率、大小等信息。/n/n使用方法:/n1. 运行脚本;/n2. 请输入要查询的CID,多个CID用空格或逗号分隔;/n3. 脚本将自动查询并输出对应信息。/n/n代码示例:/n/npython/nimport re/nimport requests/nfrom bs4 import BeautifulSoup/n/nheaders = {/n 'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.36',/n}/n/ndef get_monthly_product_info(product_id):/n url = 'https://www.dmm.co.jp/monthly/premium/-/detail/=/cid={}/'.format(product_id)/n response = requests.get(url, headers=headers)/n soup = BeautifulSoup(response.text, 'html.parser')/n/n title_tag = soup.find('h1', {'class': 'ttl-sectDetail ttl-product'})/n title = title_tag.text.strip() if title_tag else None/n/n quality_select = soup.find('select', {'id': 'download_bitrate'})/n selected_option = quality_select.find('option', {'selected': True})/n quality = selected_option.text.split('(')[-1].replace(')', '').strip() if selected_option else None/n quality_value = selected_option['value'] if selected_option else None/n/n size_spans = soup.select('#download_filesize > span')/n sizes = {}/n for span in size_spans:/n size_text = span.text.strip()/n if '容量(目安)' in size_text:/n size_value = int(size_text.split(':')[-1].replace('MB', ''))/n sizes[size_value] = span/n max_size = max(sizes.keys())/n max_size_tag = sizes[max_size]/n/n return {/n 'title': title,/n 'quality': quality,/n 'quality_value': quality_value,/n 'size': max_size_tag.text.strip()/n }/n/ndef format_product_id(product_id):/n if '-' in product_id:/n product_id = product_id.replace('-', '00')/n return product_id/n/ncid_str = input('请输入要查询的cid, 多个CID用空格或逗号分隔:')/ncids = [format_product_id(cid.strip()) for cid in cid_str.replace(',', ' ').split()]/n/nhigh_definition_cids = []/nstandard_definition_cids = []/ncid_list = []/ndown_cid_list = []/n/nfor cid in cids:/n # 月额作品/n search_url = 'https://www.dmm.co.jp/monthly/-/list/search/=/?searchstr={}'.format(cid)/n response = requests.get(search_url, headers=headers)/n soup = BeautifulSoup(response.text, 'html.parser')/n/n link_tags = soup.find_all('a', href=re.compile(r'https://www/.dmm/.co/.jp/monthly/premium/-/detail/=/cid=/w+'))/n/n if not link_tags:/n # 非月额作品/n search_url = 'https://www.dmm.co.jp/search/=/searchstr={}'.format(cid)/n response = requests.get(search_url, headers=headers)/n soup = BeautifulSoup(response.text, 'html.parser')/n/n link_tags = soup.find_all('a', href=re.compile(r'https://www/.dmm/.co/.jp/digital/videoa/-/detail/=/cid=/w+'))/n/n if not link_tags:/n # 没有找到作品, 则跳过该cid的查询/n down_cid_list.append(cid)/n continue/n/n link = link_tags[0]['href'].split('?')[0]/n product_id_match = re.search(r'cid=([/w/d]+)', link)/n product_id = product_id_match.group(1)/n /n # 如果cid存在于非月作品中,不要对cid进行任何操作/n if link.startswith('https://www.dmm.co.jp/digital/videoa/'):/n cid_list.append(cid)/n else:/n product_info = get_monthly_product_info(product_id)/n print(f'{product_id} {product_info['title']}')/n print(f'分辨率:{product_info['quality']}, 码率:{product_info['quality_value']}, {product_info['size']}/n') /n if product_info['quality_value'] and int(product_info['quality_value']) > 3000:/n high_definition_cids.append(product_id)/n else:/n standard_definition_cids.append(product_id)/n/nhigh_definition_count = len(high_definition_cids)/nstandard_definition_count = len(standard_definition_cids)/n/nif high_definition_count > 0:/n print(f'高清月额({high_definition_count}):{'|'.join(high_definition_cids)}/n')/nif standard_definition_count > 0:/n print(f'标清月额({standard_definition_count}):{'|'.join(standard_definition_cids)}/n')/n/ncid_str = '|'.join(cid_list)/nif cid_str:/n print(f'非月作品({len(cid_list)}):{cid_str}/n')/n/ndown_cid_str = '|'.join(down_cid_list)/nif down_cid_str:/n print(f'已下架或错误cid({len(down_cid_list)}):{down_cid_str}')/n/n/n代码说明:/n/n* 该脚本首先使用requests库获取DMM网页内容,并使用BeautifulSoup库解析网页结构。/n* get_monthly_product_info函数用于获取月额作品的信息,包括标题、分辨率、码率、大小等。/n* 脚本会根据CID分别查询月额作品和非月作品,并自动识别高清/标清。/n* 最后,脚本会输出查询结果,包括高清月额、标清月额、非月作品、已下架或错误CID等。/n/n注意:/n* 该脚本仅供参考,可能需要根据实际情况进行调整。/n* 使用该脚本进行查询时,请遵守DMM网站的使用规则。/n* 由于DMM网站可能会进行改版,该脚本可能需要进行更新。/n
原文地址: https://www.cveoy.top/t/topic/nH6h 著作权归作者所有。请勿转载和采集!