Problem: training works fine on a single GPU (non-distributed), but multi-GPU distributed training fails with the error below.
Error message:
RuntimeError: Expected to have finished reduction in the prior iteration before starting a new one. This error indicates that your module has parameters that were not used in producing loss. You can enable unused parameter detection by (1) passing the keyword argument find_unused_parameters=True to torch.nn.parallel.DistributedDataParallel; (2) making sure all forward function outputs participate in calculating loss. If you already have done the above two steps, then the distributed data parallel module wasn't able to locate the output tensors in the return value of your module's forward function. Please include the loss function and the structure of the return value of forward of your module when reporting this issue (e.g. list, dict, iterable).
Cause:
The module's __init__ method defines some submodules that carry parameters, but those submodules are never called in forward. Their parameters therefore never contribute to the loss, and DDP's gradient reducer waits forever for gradients that will never arrive.
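The mismatch can be reproduced in a single process, without launching DDP at all. The sketch below (module and layer names are made up for illustration) defines a submodule that forward never touches; after backward, its parameters have no gradients, which is exactly the condition DDP's reducer complains about:

```python
import torch
import torch.nn as nn

class BuggyNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc1 = nn.Linear(4, 4)
        self.fc2 = nn.Linear(4, 2)  # defined but never used in forward

    def forward(self, x):
        return self.fc1(x)          # fc2 never participates in the loss

model = BuggyNet()
loss = model(torch.randn(3, 4)).sum()
loss.backward()

# fc1's parameters receive gradients; fc2's stay None. Under DDP this
# mismatch triggers the "Expected to have finished reduction" error.
unused = [n for n, p in model.named_parameters() if p.grad is None]
print(unused)  # ['fc2.weight', 'fc2.bias']
```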
Solution:
Comment out (or delete) any submodule defined in __init__ but not used in forward. In other words, only define the modules you actually use; do not define modules and then leave them unused. Alternatively, as the error message suggests, you can pass find_unused_parameters=True to DistributedDataParallel, but that adds per-iteration overhead, so removing the unused modules is the cleaner fix.
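Applying the fix to the same illustrative module: comment out the unused layer so that every parameter the model owns participates in the loss, and DDP's reducer has nothing left to wait for.

```python
import torch
import torch.nn as nn

class FixedNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc1 = nn.Linear(4, 4)
        # self.fc2 = nn.Linear(4, 2)  # commented out: not used in forward

    def forward(self, x):
        return self.fc1(x)

model = FixedNet()
model(torch.randn(3, 4)).sum().backward()

# Every registered parameter now receives a gradient, so DDP's
# all-reduce can complete on each iteration.
assert all(p.grad is not None for p in model.parameters())

# If unused parameters are genuinely unavoidable (e.g. branches that
# are only active on some iterations), wrap the model with:
#   nn.parallel.DistributedDataParallel(model, find_unused_parameters=True)
```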