Keras中CNN回归任务中奇怪的均方根误差行为

Keras中CNN回归任务中奇怪的均方根误差行为,keras,regression,Keras,Regression,我正在使用类似于alexnet的CNN进行图像相关回归任务。我为损失函数定义了rmse。然而,在第一个时代的训练中,损失带来了巨大的价值。但在第二个时代之后,它下降到了一个有意义的值。这是: 1/51[……]-预计到达时间:847s-损失:104.1821- acc:0.2500-均方根误差:104.1821 2/51 预计到达时间:470s损失:5277326.0910- acc:0.5938-均方根误差:5277326.0910 3/51 预计到达时间:345s损失:3518246.7337

我正在使用类似于alexnet的CNN进行图像相关回归任务。我为损失函数定义了rmse。然而,在第一个时代的训练中,损失带来了巨大的价值。但在第二个时代之后,它下降到了一个有意义的值。这是:

1/51[……]-预计到达时间:847s-损失:104.1821- acc:0.2500-均方根误差:104.1821 2/51 预计到达时间:470s损失:5277326.0910- acc:0.5938-均方根误差:5277326.0910 3/51 预计到达时间:345s损失:3518246.7337- acc:0.5000-均方根误差:3518246.7337 4/51 预计到达时间:281s-损失:2640801.3379- acc:0.6094-均方根误差:2640801.3379 5/51 预计到达时间:241秒-损失:2112661.3062- acc:0.5000-均方根误差:2112661.3062 6/51 预计到达时间:214s损失:1760566.4758- acc:0.4375-均方根误差:1760566.4758 7/51 预计到达时间:194s-损失:1509067.6495- acc:0.4464-均方根误差:1509067.6495 8/51 预计到达时间:178秒-损失:1320442.6319- acc:0.4570-均方根误差:1320442.6319 9/51 预计到达时间:165s-损失:1173734.9212- acc:0.4792-均方根误差:1173734.9212 10/51 预计到达时间:155s损失:1056369.3193- acc:0.4875-均方根误差:1056369.3193 11/51 预计到达时间:146s-损失:960343.5998- acc:0.4943-均方根误差:960343.5998 12/51 [==========>……]-预计到达时间:139s-损失:880320.3762- acc:0.5052-均方根误差:880320.3762 13/51 预计到达时间:131s-损失时间:812608.7112- acc:0.5216-均方根误差:812608.7112 14/51 预计到达时间:125s损失:754570.1939- acc:0.5402-均方根误差:754570.1939 15/51 [==========>预计到达时间:120秒-损失:704269.2443- acc:0.5479-均方根误差:704269.2443 16/51 [==========>…]预计到达时间:114秒-损失:660256.3035- acc:0.5508-均方根误差:660256.3035 17/51 预计到达时间:109s-损失时间:621420.7248- acc:0.5607-均方根误差:621420.7248 18/51 [=============>…]预计到达时间:104秒-损失:586900.8398- acc:0.5712-均方根误差:586900.8398 19/51 预计到达时间:100秒-损失时间:556014.6719- acc:0.5806-均方根误差:556014.6719 20/51 [=============>…]预计到达时间:95s-损失:528216.9077-附件: 0.5875-均方根误差:528216.9077 21/51[================>…]-预计到达时间:91s-损失:503065.7743-附件: 0.5967-均方根误差:503065.7743 22/51[================>…]-预计到达时间:87s-损失:480206.3521-附件: 0.6094-均方根误差:480206.3521 23/51[===================>…]-预计到达时间:83s-损失:459331.8636-附件: 0.6114-均方根误差:459331.8636 24/51[===================>…]预计到达时间:80-损失:440196.2991-附件: 0.6159-均方根误差:440196.2991 25/51[===================>…]预计到达时间:76s-损失:422590.8381-附件: 0.6162-均方根误差:422590.8381 26/51[=========================>]ETA:73s-损耗:406339.5179-acc: 0.6178-均方根误差:406339.5179 27/51[=========================>…]-预计到达时间:69s-损失:391292.6992-附件: 0.6238-均方根误差:391292.6992 28/51[================>…]预计到达时间:66s-损失:377319.9851-附件: 0.6306-均方根误差:377319.9851 29/51[===================>…]预计到达时间:63s-损失:364310.7557-附件: 0.6336-均方根误差:364310.7557 30/51[===================>…..]-预计到达时间:60s-损失:352169.1059-附件: 0.6385-均方根误差:352169.1059 31/51[================>…]-预计到达时间:57s-损失:340810.8854-附件: 0.6401-均方根误差:340810.8854 32/51[===================>…]-预计到达时间:53s-损失:330162.1334-附件: 0.6455-均方根误差:330162.1334 33/51[======================>……]-预计到达时间:50s-损耗:320158.7622-acc: 0.6553-均方根误差:320158.7622 34/51[===================>……]-预计到达时间:47s-损失:310744.0080-acc: 0.6645-均方根误差:310744.0080 35/51[===================>…]预计到达时间:44s-损失:301866.8259-附件: 0.6714-均方根误差:301866.8259 36/51[======================>…]-预计到达时间:41s-损失:293483.0129-附件: 0.6762-均方根误差:293483.0129 37/51[======================>…]-预计到达时间:39s-损失:285552.8197-附件: 0.6757-均方根误差:285552.8197 38/51[===================>…]-预计到达时间:36s-损失:278039.4488-acc: 0.6752-均方根误差:278039.4488 39/51[======================>…]-预计到达时间:33s-损失:270911.4670-附件: 0.6795-均方根误差:270911.4670 40/51[======================>…]-预计到达时间:30s-损失:264140.2391-附件: 0.6820-均方根误差:264140.239141/51[=========================>…]-预计到达时间:27s-损失:257699.1895-acc: 0.6852-均方根误差:257699.1895 42/51[=========================>…]-预计到达时间:25s-损失:251564.6846-acc: 0.6890-均方根误差:251564.6846 43/51[=========================>…]-预计到达时间:22s-损失:245715.4124-附件: 0.6933-均方根误差:245715.4124 44/51[======================>…]-预计到达时间:19s-损失:240131.9916-acc: 0.6960-均方根误差:240131.9916 45/51[=========================>…]-预计到达时间:16s-损失:234796.694
from keras import backend as K
def root_mean_squared_error(y_true, y_pred):
   return K.sqrt(K.mean(K.square(y_pred - y_true), axis=-1))
model.compile(optimizer="rmsprop", loss=root_mean_squared_error, metrics=['accuracy', root_mean_squared_error])
estimator = alexmodel()
datagen = ImageDataGenerator()
datagen.fit(x_train)
start = time.time()
history = estimator.fit_generator(datagen.flow(x_train, x_train,batch_size=batch_size, shuffle=True),
           epochs=epochs,
           steps_per_epoch=x_train.shape[0]/batch_size,
           validation_data=(x_test, y_test))
end = time.time()