Word frequency count

Posted by 不是南小北 on 2020-11-19

import string

# Path to the text file to analyse
path = 'C://Users/Yang/Desktop/Walden.txt'
with open(path, 'r') as text:
    # Split on whitespace, strip surrounding punctuation, lowercase each word
    words = [raw_word.strip(string.punctuation).lower() for raw_word in text.read().split()]

# Count every distinct word once
words_index = set(words)
counts_dict = {index: words.count(index) for index in words_index}

# Print the words from most to least frequent
for word in sorted(counts_dict, key=lambda x: counts_dict[x], reverse=True):
    print('{} - {} times'.format(word, counts_dict[word]))
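
The script above calls words.count() once per distinct word, so it rescans the whole word list each time. As a possible alternative, here is a minimal sketch that counts in a single pass with collections.Counter from the standard library. The count_words helper and the sample string are hypothetical, introduced only for illustration; the sketch also drops tokens that become empty after punctuation is stripped, which the original script does not do.

```python
import string
from collections import Counter


def count_words(text):
    """Return a Counter mapping each lowercased word to its frequency.

    Hypothetical helper, not part of the original post.
    """
    words = (raw.strip(string.punctuation).lower() for raw in text.split())
    # Skip tokens that were pure punctuation and are now empty strings.
    return Counter(word for word in words if word)


# Hypothetical sample text standing in for the contents of Walden.txt.
sample = "The pond, the woods; the POND."
counts = count_words(sample)

# most_common() yields (word, count) pairs from most to least frequent.
for word, n in counts.most_common():
    print('{} - {} times'.format(word, n))
```

To run it on the real file, pass text.read() to count_words inside the with block instead of the sample string.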

