IDF stands for inverse document frequency
TF:it’ll convert the raw count of a word in the document into some weight
that reflects our belief about how important this word in the document.
|d1|: the document length of the total counts of words
b: this is a parameter to control length normalization