Machine learning Caffe停留在迭代0

Machine learning Caffe停留在迭代0,machine-learning,neural-network,caffe,Machine Learning,Neural Network,Caffe,我正在用仅CPU的Caffe实现一个域自适应项目。在训练过程中,它停留在迭代0处。这就是我得到的: I0326 18:14:51.217656 9257 net.cpp:693] Ignoring source layer concat_data I0326 18:14:51.218354 9257 net.cpp:693] Ignoring source layer slice_features_fc7 I0326 18:14:51.218359 9257 net.cpp:693] Ig

我正在用仅CPU的Caffe实现一个域自适应项目。在训练过程中,它停留在迭代0处。这就是我得到的:

I0326 18:14:51.217656  9257 net.cpp:693] Ignoring source layer concat_data
I0326 18:14:51.218354  9257 net.cpp:693] Ignoring source layer slice_features_fc7
I0326 18:14:51.218359  9257 net.cpp:693] Ignoring source layer source_features_fc7_slice_features_fc7_0_split
I0326 18:14:51.218361  9257 net.cpp:693] Ignoring source layer target_features_fc7_slice_features_fc7_1_split
I0326 18:14:51.218364  9257 net.cpp:693] Ignoring source layer source_features_fc8_fc8_source_0_split
I0326 18:14:51.218365  9257 net.cpp:693] Ignoring source layer softmax_loss
I0326 18:14:51.218366  9257 net.cpp:693] Ignoring source layer fc8_target
I0326 18:14:51.218369  9257 net.cpp:693] Ignoring source layer mmd_loss_fc7
I0326 18:14:51.218369  9257 net.cpp:693] Ignoring source layer mmd_loss_fc8
I0326 18:17:06.733678  9257 solver.cpp:407]     Test net output #0: lp_accuracy = 0.0301887
I0326 18:17:34.953090  9257 solver.cpp:231] Iteration 0, loss = 4.42734
I0326 18:17:34.953160  9257 solver.cpp:247]     Train net output #0: fc7_mmd_loss = 0 (* 1 = 0 loss)
I0326 18:17:34.953181  9257 solver.cpp:247]     Train net output #1: fc8_mmd_loss = 0 (* 1 = 0 loss)
I0326 18:17:34.953202  9257 solver.cpp:247]     Train net output #2: softmax_loss = 4.42734 (* 1 = 4.42734 loss)
I0326 18:17:34.953223  9257 sgd_solver.cpp:106] Iteration 0, lr = 0.0003
系统:Ubuntu 16.04
命令行:

./build/tools/caffe train -solver models/DAN/amazon_to_webcam/solver.prototxt -weights models/bvlc_reference_caffenet/bvlc_reference_caffenet.caffemodel
Solver.prototxt:

net: "./models/DAN/amazon_to_webcam/train_val.prototxt"
test_iter: 795
test_interval: 300
base_lr: 0.0003
momentum: 0.9
lr_policy: "inv"
gamma: 0.002
power: 0.75
display: 100
max_iter: 50000
snapshot: 60000
snapshot_prefix: "./models/RTN/amazon_to_webcam/trained_model"
solver_mode: CPU
snapshot_after_train: false

停留在迭代0表示培训正在等待成功打开的通道上的输入。(未能打开通道将产生错误消息,至少是超时问题。)


您需要调试输入流。如果没有其他问题,请设置一些调试器断点(甚至打印语句)以检查是否到达流的关键部分。

最终,我的问题与输入流无关。只是CPU模式太慢,无法训练网络。所以,如果你有同样的问题,那就试试GPU版本的Caffe吧。问题解决了

检查输入层。它是否按预期生产批次?您使用的是什么输入层?我是Caffe的新手,所以您能更具体地说明如何调试我的输入流吗?谢谢不,我不能,因为:(1)这个答案对于堆栈溢出格式来说太长了;(2) 答案进入辅导级讨论,这超出了SO指南的范围;(3) 我从来不用自己动手:我有一个同事是发现这些问题的向导。