2021-05-20
阅读量:
519
提问:sklearn怎么填充缺失的数据
# 使用SimpleImputer
from sklearn.impute import SimpleImputer
# 使用中位数填补Age
num = ['Age']
data_num = data.loc[:, num]
SI_num = SimpleImputer(strategy = 'median')
SI_num = SI_num.fit(data_num)
data.loc[:, num] = SI_num.transform(data_num)
# 使用众数填补Embarked
cat = ['Sex', 'Embarked']
data_cat = data.loc[:, cat]
SI_cat = SimpleImputer(strategy = 'most_frequent')
SI_cat = SI_cat.fit(data_cat)
data.loc[:, cat] = SI_cat.transform(data_cat)
# 对Y值进行空缺值填补
lab = ['Survived']
data_lab = data.loc[:, lab]
SI_lab = SimpleImputer(strategy = 'most_frequent',
missing_values = 'Unknown')
SI_lab = SI_lab.fit(data_lab)
data.loc[:, lab] = SI_lab.transform(data_lab)






评论(0)


暂无数据
推荐帖子
0条评论
0条评论
0条评论