文本特征过程:
- 特征抽取对文本等数据进行特征值化
- 是为了让计算机更好的理解数据
from sklearn.feature_extraction.text import CountVectorizer
# 实例化CountVectorizer
vector = CountVectorizer()
# 调用fit_transform输入并转换数据
res = vector.fit_transform(["Life is short, i like python","Life is too long, i dislike python"])
# 打印结果
print(vector.get_feature_names())
print(res.toarray())