我们将设计另一个名为word_stats()的函数,该函数将单词频率字典(count_words_fast()/ count_words()的输出)作为参数。函数将返回唯一单词的总数(总和) /频率字典中的总密钥)和dict_values将它们的总数统计在一起,作为元组。
def word_stats(word_counts): # word_counts = count_words_fast(text)
num_unique = len(word_counts)
counts = word_counts.values()
return (num_unique, counts)
调用函数:所以,最后我们将阅读一本书,例如 - 罗密欧与朱丽叶的英文版,并从函数中收集关于词频,独特词,独特词总数等的信息。
text = read_book("./Books / English / shakespeare / Romeo and Juliet.txt")
word_counts = count_words_fast(text)
(num_unique, counts) = word_stats(word_counts)
print(num_unique, sum(counts))
brightness_4text =
read_book("./Books / English / shakespeare / Romeo and Juliet.txt")
word_counts =
count_words_fast(text)
(num_unique, counts) =
word_stats(word_counts)
print(num_unique, sum(counts))
Output: 5118 40776








暂无数据