1123 - 代码天地

1123

其他 2018-11-23 20:34:35 阅读次数: 0

VGGish

　　通过阅读帮助文档，知道可以VGGish是产生128维音频数据集的工具，原文的描述是这样的： VGGish， as well as supporting code to extract input features for the model from audio wavaforms and post-process the model enmbedding output int the same fomat.\

　输入：音频特征

　1.所有的音频都被重采样为16KHz的单声道形式。

　 2.使用 25ms 的帧长、10ms 的帧移，以及周期性的 Hann 窗口对语音进行分帧，对每一帧做短时傅里叶变换，然后利用信号幅值计算声谱图。

　 3.通过将声谱映射到 64 阶 mel 滤波器组(covering the range 125-7500 Hz.)中计算 mel 声谱.

　 4.计算 log(mel-spectrum + 0.01)，得到稳定的 mel 声谱，所加的 0.01 的偏置是为了避免对 0 取对数。

　 5.然后这些特征被以 0.96s 的时长被组帧，并且没有帧的重叠，每一帧都包含 64 个 mel 频带，时长 10ms（即总共 96 帧）。

　　

　　

猜你喜欢

转载自www.cnblogs.com/ChenKe-cheng/p/10009526.html

1123

1123. Salary

1123C练习

bzoj 1123

【ACWing】1123. 铲雪车

PAT(A) 1123. Is It a Complete AVL Tree (30)

1123. Lowest Common Ancestor of Deepest Leaves

1123.火力网(normal)

1123：图像相似度

Trail Maintenance LightOJ - 1123

Blockade(Bzoj1123)

bzoj1123 Blockade

[bzoj1123]BLO

【bzoj1123】BLO

test2_1123

1123: 最佳校友

error LINK1123

BLO（bzoj1123）

SDNU 1123.Encoding

1123 Is It a Complete AVL Tree

PAT甲级1123

luogu1123

ZUULIOJ 1123: 最佳校友

1123 atonement math

PAT1123

AcWing 1123 铲雪车

bzoj 1123 BLO

1123: [POI2008]BLO

PAT 1123 Is It a Complete AVL Tree

fatal error LNK1123

今日推荐

周排行

LRU cache算法

windows10, 自带的OpenSSH, key权限问题, 文件权限问题

测试用例书写方法

HIVE-默认分隔符的（linux系统的特殊字符）查看，输入和修改

最贵的AMD 7nm显卡来了！这设计够狂野

java多线程简单demo

[ 转载 ]在Android系统上使用busybox——最简单的方法

QT connect学习

BFSIFT算法分析

Xcode10：library not found for -lstdc++.6.0.9 临时解决

每日归档

更多

2024-08-06(0)

2024-08-05(0)

2024-08-04(0)

2024-08-03(0)

2024-08-02(0)

2024-08-01(0)

2024-07-31(0)

2024-07-30(0)

2024-07-29(0)

2024-07-28(0)