Counting word frequencies in a text file with Python and printing the top ten

Here is example Python code:

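import collections
import string

# Open the file and read its entire contents.
# UTF-8 is assumed here; change the encoding to match your file.
with open('input.txt', 'r', encoding='utf-8') as file:
    content = file.read()

# Lowercase the text, strip all ASCII punctuation (not only ',' and '.'),
# then split on whitespace into a list of words
table = str.maketrans('', '', string.punctuation)
words = content.lower().translate(table).split()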

# Count how often each word occurs with the Counter class from the collections module
word_counts = collections.Counter(words)

# Get the ten most frequent words as (word, count) pairs
top_words = word_counts.most_common(10)

# Print the results
for word, count in top_words:
    print(f'{word}: {count}')

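The code assumes the input file is named input.txt. It opens that file and reads its contents, converts the text to lowercase, removes punctuation, and splits the result into a list of words. It then counts how often each word occurs with collections.Counter, takes the ten most frequent words, and prints each one with its count.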
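If the file is large, or the same logic is needed in more than one place, the steps above can be wrapped in a function that counts line by line instead of reading everything at once. The following is only a minimal sketch: the function name top_words, the WORD_RE pattern, and the choice to treat a word as a run of ASCII letters, digits, and apostrophes are assumptions made for illustration and are not part of the original code.

```python
import collections
import re

# Assumption: a "word" is a run of lowercase ASCII letters, digits, or apostrophes.
WORD_RE = re.compile(r"[a-z0-9']+")

def top_words(lines, n=10):
    """Return the n most common words found in an iterable of text lines."""
    counts = collections.Counter()
    for line in lines:
        # Lowercase each line, extract its words, and add them to the running totals
        counts.update(WORD_RE.findall(line.lower()))
    return counts.most_common(n)

# Small in-memory sample so the sketch runs without a file
sample = ["The cat. The dog,", "the CAT!"]
for word, count in top_words(sample, 2):
    print(f'{word}: {count}')
# prints:
# the: 3
# cat: 2
```

Because an open text file is itself an iterable of lines, the file object from open('input.txt', encoding='utf-8') can be passed directly as the lines argument, so the whole file never has to be held in memory at once.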