Term frequency-inverse document frequency (TF-IDF) vectorization is a mouthful
to say, but it's also a simple and convenient way to characterize bodies of
text. Due to its simplicity, this method scales better than some other topic
modeling techniques (latent dirichlet allocation, probabilistic latent semantic
indexing) when dealing with