简明理解AdaBoost算法

简明理解AdaBoost算法

从案例理解AdaBoost

(转载自李航《统计学习方法》,有删减)，使用AdaBoost算法学习如下训练数据，弱分类器采用sign()函数。

x	1	2	3	4	5	6	7	8	9
y	1	1	1	-1	-1	-1	1	1	1

解：
初始化数据权值分布，即 $m=0$
$D_1=(w_{11},w_{12},...,w_{110})$ $w_{1i}=0.1, \quad i=1,2,...,10$
对 $m=1$ :
(1) 基本分类器： $G_1(x)=-sign(x-2.5)$
(2) 误差率 $e_1=P(G_1(x_i) \neq y_i)=0.3$
(3) 计算 $G_1(x)$ 的系数： $\alpha_1 = \frac {1}{2}log{\frac{1-e_1}{e_1}}=0.4236$
(4) 更新权值分布：
$D_2(w_{21},w_{22},...,w_{210})$ $w_{2i}=\frac{w_{1i}}{Z_1}exp(-\alpha_1y_iG_1(x_i)), \quad i=1,2,...,10$ $f_1(x)=0.4236G_1(x)$ 分类器 $sign[f_1(x)]$ 在训练数据集上有3个误分类点。
对 $m=2,m=3$ 分别执行以上步骤： $G(x)=sign[0.4236G_1(x)+0.6496G_2(x)+0.7514G_3(x)]$

import numpy as np 
import matplotlib.pyplot as plt

x0 = [0,1,2,3,4,5,6,7,8,9]
y0 = [1,1,1,-1,-1,-1,1,1,1,-1]

x1 = np.linspace(-1,10,100)
G1 = -np.sign(x1-2.5) 
G2 = -np.sign(x1-8.5) 
G3 = np.sign(x1-5.5)

fig, axs = plt.subplots(2,2)

axs[0,0].scatter(x0,y0)
axs[0,0].plot(x1,G1)
axs[0, 0].set_title('f1(x)', fontsize=10)

axs[0,1].scatter(x0,y0)
axs[0,1].plot(x1,np.sign(0.4236*G1+G2))
axs[0,1].set_title('f2(x)', fontsize=10)

axs[1,0].scatter(x0,y0)
axs[1,0].plot(x1,np.sign(0.4236*G1+0.6496*G2+G3))
axs[1,0].set_title('f3(x)', fontsize=10)

axs[1,1].scatter(x0,y0)
axs[1,1].plot(x1,np.sign(0.4236*G1+0.6496*G2+0.7514*G3))
axs[1,1].set_title('f3(x)', fontsize=10)
fig.tight_layout()

plt.show()

在这里插入图片描述
最终通过sign分类的线性组合，逐步将训练误差减少到0

案例分析

针对二分类的提升（boosting）方法有两个步骤：
1、改变训练数据的权值或概率分布
2、将弱分类器组合为强分类器

涉及计算内容为：

计算$G_m(x)的系数：
$\alpha_m=\frac{1}{2}log{\frac{1-e_m}{e_m}}$
更新权重：
$D_{m+1}(w_{m+1,1},w_{m+1,2},...,w_{m+1,N})$ $w_{m+1,i}=\frac{w_{m,i}}{Z_m}exp(-\alpha_my_mG_m(x_i)), \quad i=1,2,...,N$ 其中， $Z_m$ 是规范化因子，有 $Z_m=\sum_{i=1}^{N}exp(-\alpha_my_mG_m(x_i))$
当正确分类时： $w_{m+1}=\frac{w_{mi}}{Z_m}e^{-\alpha_m}$ ，权值缩小
当错误分类时： $w_{m+1}=\frac{w_{mi}}{Z_m}e^{\alpha_m}$ ，权值放大