Executing TF-IDF in Python

The following are the steps for executing TF-IDF in Python:

from sklearn.feature_extraction.text import TfidfVectorizer

corpus = ['First document', 'Second document','Third document','First and second document' ]

vectorizer = TfidfVectorizer()

X = vectorizer.fit_transform(corpus)
print(vectorizer.get_feature_names())
print(X.shape)

The output is as follows:

X.toarray()

We get the following output:

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

3.145.17.140

Table of Contents for Executing TF-IDF in Python