Decision Tree
Impurity functions play an important role in decision tree branching. For binary classification problems, let \(\mu_{+}\) be the fraction of positive examples in a data subset, and \(\mu_{-}=1-\mu_+\) be the fraction of negative examples in the subset.
1. The Gini index is \(1-\mu_+^2-\mu_-^2\). What is the maximum value of the Gini index among all \(\mu_+\in[0,1]\)? Prove your answer.
Solution:
Since \(\mu_-=1-\mu_+\), the Gini index can be written as \(1-\mu_+^2-(1-\mu_+)^2=2\mu_+(1-\mu_+)\). Its derivative with respect to \(\mu_+\) is \(2-4\mu_+\), which vanishes at \(\mu_+=\frac{1}{2}\), and the second derivative is \(-4<0\), so this critical point is a maximum. Therefore the Gini index attains its maximum value \(1-\mu_+^2-\mu_-^2=\frac{1}{2}\) at \(\mu_+=\mu_-=\frac{1}{2}\).
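The calculus argument above can be sanity-checked numerically. This is a minimal sketch in plain Python (the 1001-point grid is an arbitrary choice for illustration): it scans \(\mu_+\in[0,1]\) and confirms the Gini index peaks at \(\mu_+=\frac{1}{2}\) with value \(\frac{1}{2}\).

```python
def gini(mu_pos):
    """Gini index 1 - mu_+^2 - mu_-^2 with mu_- = 1 - mu_+."""
    mu_neg = 1.0 - mu_pos
    return 1.0 - mu_pos**2 - mu_neg**2

# Evaluate on a fine grid over [0, 1] and locate the maximizer.
grid = [i / 1000 for i in range(1001)]
best_mu = max(grid, key=gini)
print(best_mu, gini(best_mu))  # -> 0.5 0.5
```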
2. Following Question 1, we can normalize each impurity function by dividing it by its maximum value among all \(\mu_+\in[0,1]\). For instance, the normalized classification error is \(2\min(\mu_+,\mu_-)\). After normalization, prove or disprove that the normalized Gini index is equivalent to the normalized squared regression error (used for branching in classification data sets), where the squared error is by definition \(\mu_+(1-(\mu_+-\mu_-))^2+\mu_-(-1-(\mu_+-\mu_-))^2\).
Solution:
By Question 1, the maximum of the Gini index is \(\frac{1}{2}\), so the normalized Gini index is \(2(1-\mu_+^2-\mu_-^2)=2((\mu_++\mu_-)^2-\mu_+^2-\mu_-^2)\)
$$=4\mu_+\mu_-$$
The squared regression error attains its maximum value \(1\) at \(\mu_+=\mu_-=\frac{1}{2}\) (the derivation below shows it equals \(1-(\mu_+-\mu_-)^2\)), so the normalized squared regression error equals the squared error itself:
$$\mu_+(1-(\mu_+-\mu_-))^2+\mu_-(-1-(\mu_+-\mu_-))^2$$
$$=\mu_+((\mu_+-\mu_-)-1)^2+\mu_-((\mu_+-\mu_-)+1)^2$$
$$=(\mu_++\mu_-)(\mu_+-\mu_-)^2-2(\mu_+-\mu_-)^2+(\mu_++\mu_-)$$
Since \(\mu_++\mu_-=1\), we have:
$$=(\mu_+-\mu_-)^2-2(\mu_+-\mu_-)^2+1$$
$$=1-(\mu_+-\mu_-)^2$$
$$=(\mu_++\mu_-)^2-(\mu_+-\mu_-)^2$$
$$=4\mu_+\mu_-$$
Both normalized impurity functions equal \(4\mu_+\mu_-\), so the normalized Gini index is equivalent to the normalized squared regression error.
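The algebraic equivalence can also be verified numerically. A minimal sketch in plain Python (the 101-point grid is an arbitrary choice): both the normalized Gini index and the squared regression error are checked against the closed form \(4\mu_+\mu_-\) across \(\mu_+\in[0,1]\).

```python
def normalized_gini(mu):
    """Normalized Gini index: 2 * (1 - mu_+^2 - mu_-^2)."""
    return 2.0 * (1.0 - mu**2 - (1.0 - mu)**2)

def squared_error(mu):
    """Squared regression error mu_+(1-d)^2 + mu_-(-1-d)^2 with d = mu_+ - mu_-."""
    d = mu - (1.0 - mu)
    return mu * (1.0 - d)**2 + (1.0 - mu) * (-1.0 - d)**2

# Both impurities should coincide with 4 * mu_+ * mu_- everywhere on [0, 1].
for i in range(101):
    mu = i / 100
    target = 4.0 * mu * (1.0 - mu)
    assert abs(normalized_gini(mu) - target) < 1e-9
    assert abs(squared_error(mu) - target) < 1e-9
```

The assertions pass for every grid point, matching the derivation: both curves are the same parabola \(4\mu_+\mu_-\), which equals \(1\) at \(\mu_+=\frac{1}{2}\) and \(0\) at the endpoints.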