weka up-sampling & down-sampling - 代码天地

weka up-sampling & down-sampling

其他 2018-05-10 08:06:47 阅读次数: 1

up-sampling:

SMOTE algorithm，over-sampled by creating ``synthetic'' examples rather than by over-sampling with replacement.

Weka supervised SMOTE filter
两个参数：

nearestNeighbors:how many nearest neighbor instances (surrounding the currently considered instance) are used to build an inbetween synthetic instance. 默认取值5.
percentage.how many synthetic instances are created based on the number of the class with less instances. 默认值100，假设minority class有25个样本，则25个新样本将会根据nearest Neighbors来合成，此时minority class的样本数变成了50.

down-sampling
The majority class is under-sampled by randomly removing samples from the majority class population until the minority class becomes some specified percentage of the majority class.

Weka supervised SpreadSubsample filter
maxCount:可以取minority class的样本数量 n。
如果 maxCount < n: 则正负例的样本数量都减少到maxCount
如果 maxCount > n: 则minority class的样本数量 n不变，majority class的样本数量减少到maxCount

		Instances train = DataSource
				.read(path);
		train.setClassIndex(rawins.numAttributes() - 1);
		weka.filters.supervised.instance.SpreadSubsample sps = new SpreadSubsample();
		sps.setMaxCount(n); //minority class的样本数量 n
		sps.setInputFormat(train);
		Instances ins = sps.useFilter(train, sps);

猜你喜欢

转载自fenglei.iteye.com/blog/2221758

weka up-sampling & down-sampling

图像的上采样（up-sampling）和下采样(down-sampling)

Up-sampling with Transposed Convolution

Adaptively Up-Sampling Point-Sampled Models

上采样和反卷积 Up-sampling and Transposed Convolution (Deconvolution)

GPU Down Sampling For Point Based Rendering

理解deconvolution（反卷积、转置卷积）概念原理和计算公式、up-sampling（上采样）的几种方式、dilated convolution（空洞卷积）的原理理解和公式计算

Weka

Gibbs Sampling

Sampling Theorem

sampling method

scheduled sampling

Sampling Matrix

importance sampling

Reservoir Sampling

Survey sampling

SRS|Stratified sampling|系统抽样|Cluster sampling|multistage sampling|

水塘抽样（Reservoir sampling）

Sampling a Signal in Matlab

MCMC sampling for dummies

(Excerpt) Reservoir Sampling

Gibbs Sampling for Ising model

水塘抽样 Reservoir sampling

Topic model and Gibbs Sampling

漫谈“采样”（sampling）

IELTS SPEAKING(SAMPLING)

Superpixel Sampling Networks

Sampling and quantization 翻译

抽样方法（Sampling Method）

[学习笔记] Gibbs Sampling

今日推荐

周排行

四大线程池详解

如何高效使用Vim

Mogodb的常用操作总结

Spyder默认页面布局调整

SAR日志分析

OAuth是一个关于授权（authorization）的开放网络标准，在全世界得到广泛应用，目前的版本是2.0版。本文对OAuth 2.0的设计思路和运行流程，做一个简明通俗的解释，主要参考材料为R

WebService中注解开发，CXF，Spring整合，Rest风格

2019考研英语一 Text1分析

windows下安装docker详细步骤

CentOS 7/6系统升级内核版本到5.2.2

每日归档

更多

2024-08-05(0)

2024-08-04(0)

2024-08-03(0)

2024-08-02(0)

2024-08-01(0)

2024-07-31(0)

2024-07-30(0)

2024-07-29(0)

2024-07-28(0)

2024-07-27(0)