在python3中,filter处理之后,变成了一个可迭代对象,解决办法有2中:
① 切成python2
② 在filter外面套一层list
df = df.dropna() lines=df.content.values.tolist() sentences=[] for line in lines: try: segs=jieba.lcut(line) segs = filter(lambda x:len(x)>1, segs) segs = filter(lambda x:x not in stopwords, segs) sentences.append(segs) except Exception,e: print line continue
改成
df = df.dropna() lines=df.content.values.tolist() sentences=[] for line in lines: try: segs=jieba.lcut(line) segs = list(filter(lambda x:len(x)>1, segs)) segs = list(filter(lambda x:x not in stopwords, segs)) sentences.append(segs) except Exception: print(line) continue