- (1)pandas包含了分类数据在DataFrame里
df = pd.DataFrame({"id":[1,2,3,4],"name":['张珊','李时','网二','马志']})
df
- (2)转换原生数据到分类数据类型
df["grade"]=df["name"].astype("category")
df["grade"]
df["grade"].cat.categories = ["kok", "galarxy", "L","poa"]
df["grade"] = df["grade"].cat.set_categories(["very bad", "bad", "medium","good"])
df["grade"]
(3)每个规则的排列是分类,不是根据词法规则
df.sort_values(by="grade")
(4)基于一个分类的列进行分组也会显示空的分类