Machine Learning (7) — Decision Trees
model 4 — decision tree
1 decision tree
1. components
usage: classification
- root node
- decision node
- leaf node
2. choosing a feature at each node
maximize purity (minimize impurity)
3. stopping criteria (see the sketch after this list)
- a node is 100% one class
- splitting a node would result in the tree exceeding a maximum depth
- the improvement in purity score is below a threshold
- the number of examples in a node is below a threshold
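a minimal sketch of these checks in Python (the helper name and default thresholds are assumptions for illustration, not from the original notes):

```python
import numpy as np

def should_stop(labels, depth, best_gain,
                max_depth=5, min_gain=1e-3, min_samples=10):
    """Hypothetical helper: decide whether to stop splitting a node.

    labels:    class labels of the examples that reached this node
    depth:     current depth of the node in the tree
    best_gain: purity improvement of the best candidate split
    """
    if len(np.unique(labels)) == 1:   # node is 100% one class
        return True
    if depth >= max_depth:            # splitting would exceed the max depth
        return True
    if best_gain < min_gain:          # purity improvement below threshold
        return True
    if len(labels) < min_samples:     # too few examples in the node
        return True
    return False
```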
2 measure of impurity
use entropy (H) as a measure of impurity, where p is the fraction of examples in the positive class:

H(p) = -p\log_2(p) - (1-p)\log_2(1-p)

note: 0\log_2(0) = 0
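a minimal sketch of this entropy function in Python (NumPy assumed; the function name is illustrative):

```python
import numpy as np

def entropy(p):
    """Binary entropy H(p), with the convention 0*log2(0) = 0."""
    if p == 0 or p == 1:
        return 0.0  # pure node: 0*log2(0) is taken to be 0
    return -p * np.log2(p) - (1 - p) * np.log2(1 - p)

print(entropy(0.5))  # 1.0  -> maximum impurity (50/50 split)
print(entropy(1.0))  # 0.0  -> a pure node
print(entropy(0.8))  # ~0.722
```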

3 information gain
1. definition
information\_gain = H(p^{root}) - (w^{left}H(p^{left}) + w^{right}H(p^{right}))

where w^{left} and w^{right} are the fractions of the parent node's examples that go to the left and right branches, and p^{node} is the fraction of positive examples at that node
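the formula maps directly to a short Python sketch (binary 0/1 labels assumed; entropy is repeated here so the block runs on its own):

```python
import numpy as np

def entropy(p):
    """Binary entropy with the convention 0*log2(0) = 0."""
    return 0.0 if p in (0, 1) else -p * np.log2(p) - (1 - p) * np.log2(1 - p)

def information_gain(y_root, y_left, y_right):
    """Entropy reduction from splitting y_root into y_left / y_right (0/1 labels)."""
    w_left = len(y_left) / len(y_root)    # fraction of examples sent left
    w_right = len(y_right) / len(y_root)  # fraction of examples sent right
    return entropy(y_root.mean()) - (w_left * entropy(y_left.mean())
                                     + w_right * entropy(y_right.mean()))

y_root = np.array([1, 1, 1, 1, 0, 0, 0, 0, 1, 0])
y_left, y_right = np.array([1, 1, 1, 1, 0]), np.array([0, 0, 0, 1, 0])
print(information_gain(y_root, y_left, y_right))  # ~0.278
```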
2. usage
- measures the reduction in entropy achieved by a split
- serves as a stopping signal: if the best split's gain is below a threshold, stop splitting
3. continuous features
find the threshold that yields the highest information gain, as shown in the sketch below
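a minimal sketch of the threshold scan, reusing information_gain from the sketch above (taking candidate thresholds at midpoints between consecutive sorted values is a common convention, not specified in the original notes):

```python
import numpy as np

def best_threshold(x, y):
    """Try every midpoint between consecutive distinct values of a continuous
    feature x and return the threshold with the highest information gain.
    Reuses information_gain() from the sketch above."""
    order = np.argsort(x)
    x_sorted, y_sorted = x[order], y[order]
    best_gain, best_t = -1.0, None
    for i in range(1, len(x_sorted)):
        if x_sorted[i] == x_sorted[i - 1]:
            continue  # identical values cannot be separated
        t = (x_sorted[i] + x_sorted[i - 1]) / 2  # midpoint candidate
        gain = information_gain(y_sorted, y_sorted[:i], y_sorted[i:])
        if gain > best_gain:
            best_gain, best_t = gain, t
    return best_t, best_gain

x = np.array([1.0, 2.0, 3.5, 4.2, 5.1, 6.0])  # e.g., a continuous feature
y = np.array([1, 1, 1, 0, 0, 0])
print(best_threshold(x, y))  # threshold 3.85 gives gain 1.0 (a perfect split)
```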

4 random forest
- generating tree samples (see the sketch after this list): given a training set of size m, for b = 1 to B:
  - use sampling with replacement to create a new training set of size m
  - train a decision tree on the new training set
- randomizing the feature choice: at each node, when choosing a feature to split on, if n features are available, pick a random subset of k < n features (usually k = \sqrt{n}) and allow the algorithm to choose only from that subset
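a minimal sketch of both ideas using scikit-learn's DecisionTreeClassifier (its max_features="sqrt" option implements the k = \sqrt{n} subset at each split; the wrapper functions are illustrative assumptions):

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def train_random_forest(X, y, B=100, seed=0):
    """Train B trees, each on a bootstrap sample of size m, considering a
    random subset of sqrt(n) features at each split."""
    rng = np.random.default_rng(seed)
    m = X.shape[0]
    trees = []
    for _ in range(B):
        idx = rng.integers(0, m, size=m)  # sampling with replacement, size m
        tree = DecisionTreeClassifier(max_features="sqrt",  # k = sqrt(n)
                                      random_state=int(rng.integers(1 << 31)))
        tree.fit(X[idx], y[idx])
        trees.append(tree)
    return trees

def forest_predict(trees, X):
    """Predict by majority vote over all trees (binary 0/1 labels assumed)."""
    votes = np.stack([t.predict(X) for t in trees])
    return (votes.mean(axis=0) >= 0.5).astype(int)
```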
