Python 按顺序获取keras中间层输出_Python_Tensorflow_Keras

Python 按顺序获取keras中间层输出

python tensorflow keras

Python 按顺序获取keras中间层输出,python,tensorflow,keras,Python,Tensorflow,Keras,代码是这样的。变量“bneck”是按顺序的。我想得到中间层的输出 ... x = bneck(x) x = CBNModule(960, 1, 1, activation=HSwish())(x) # 7 * 960 s32 = CBNModule(320, 1, 1, activation=HSwish())(x) # 7 * 960 -> 7 * 320 s32 = CBNModule(24, 1, 1, activation=HSwish())(s32) # 7 * 320 -

代码是这样的。变量“bneck”是按顺序的。我想得到中间层的输出

...
x = bneck(x)
x = CBNModule(960, 1, 1, activation=HSwish())(x)  # 7 * 960
s32 = CBNModule(320, 1, 1, activation=HSwish())(x)  # 7 * 960 -> 7 * 320
s32 = CBNModule(24, 1, 1, activation=HSwish())(s32)  # 7 * 320 -> 7 * 24
s16 = k.layers.Add()([
    CBNModule(24, 1, 1, activation=HSwish())(bneck.layers[12].output),
    UpModule(24, 2)(s32)
])  # (14 * 160 -> 14 * 24) + (7 * 24 -> 14 * 24)
...
return keras.Model(inputs=[...], outputs=[...])

当我运行

model.summary（）

时，我得到如下错误

错误发生在第6行[code>bCheck.layers[12]。输出。但是当我用代码替换第1行时

for layer in bneck.layers:
    x = layers(x)

没有错误。为什么呢？与它们有什么区别。

首先，您必须根据您的期望输出层创建一个功能提取器。您的图形在此处断开连接
b选中.layers[12]。输出。假设您有型号A 和型号B 。您需要从模型A 中获得某个层的输出（比如2层），并在模型B 中使用它们来完成其架构。为此，首先从模型A 创建2特征提取器，如下所示 extractor_one = Model(modelA.input, expected_layer_1.output) extractor_two = Model(modelA.input, expected_layer_2.output) model = tf.keras.models.Model( inputs=[seq_model.input], outputs=[seq_model.get_layer('conv3').output, seq_model.get_layer('conv5').output] ) # check for i in check_model(tf.keras.Input((32, 32, 3))): print(i.name, i.shape) model_13/conv3/Relu:0 (None, 26, 26, 64) model_13/conv5/Relu:0 (None, 22, 22, 256) 在这里，我将引导您完成一个简单的代码示例。有一种更灵活、更聪明的方法可以做到这一点，但这里有一种。我将构建一个顺序模型，并在CIFAR10 上对其进行训练，接下来，我将尝试构建一个功能模型，其中我将利用一些顺序模型层（仅2个），并在CIFAR100 上训练整个模型 import tensorflow as tf seq_model = tf.keras.Sequential( [ tf.keras.Input(shape=(32, 32, 3)), tf.keras.layers.Conv2D(16, 3, activation="relu"), tf.keras.layers.Conv2D(32, 3, activation="relu"), tf.keras.layers.Conv2D(64, 3, activation="relu"), tf.keras.layers.Conv2D(128, 3, activation="relu"), tf.keras.layers.Conv2D(256, 3, activation="relu"), tf.keras.layers.GlobalAveragePooling2D(), tf.keras.layers.Dense(10, activation='softmax') ] ) seq_model.summary() print('last layer output') # just freezing first 2 layer for layer in last_layer_outputs.layers[:2]: layer.trainable = False # checking for l in last_layer_outputs.layers: print(l.name, l.trainable) print('\nmid layer output') # freeze all layers mid_layer_outputs.trainable = False # checking for l in mid_layer_outputs.layers: print(l.name, l.trainable) last layer output input_11 False conv2d_81 False conv2d_82 False conv2d_83 False conv2d_84 True conv2d_85 True mid layer output input_11 False conv2d_81 False conv2d_82 False conv2d_83 False 在CIFAR10 数据集上测试 (x_train, y_train), (x_test, y_test) = tf.keras.datasets.cifar10.load_data() # train set / data x_train = x_train.astype('float32') / 255 y_train = tf.keras.utils.to_categorical(y_train , num_classes=10) print(x_train.shape, y_train.shape) seq_model.compile( loss = tf.keras.losses.CategoricalCrossentropy(), metrics = tf.keras.metrics.CategoricalAccuracy(), optimizer = tf.keras.optimizers.Adam()) # fit seq_model.fit(x_train, y_train, batch_size=128, epochs=5, verbose = 2) # ------------------------------------------------------------------- (50000, 32, 32, 3) (50000, 10) Epoch 1/5 27s 66ms/step - loss: 1.2229 - categorical_accuracy: 0.5647 Epoch 2/5 26s 67ms/step - loss: 1.1389 - categorical_accuracy: 0.5950 Epoch 3/5 26s 67ms/step - loss: 1.0890 - categorical_accuracy: 0.6127 Epoch 4/5 26s 67ms/step - loss: 1.0475 - categorical_accuracy: 0.6272 Epoch 5/5 26s 67ms/step - loss: 1.0176 - categorical_accuracy: 0.6409 现在，假设我们希望从这个顺序模型中得到一些输出，比如说下面两层 tf.keras.layers.Conv2D(64, 3, activation="relu") # (None, 26, 26, 64) tf.keras.layers.Conv2D(256, 3, activation="relu") # (None, 22, 22, 256) 为了得到它们，我们首先从序列模型中创建两个特征提取器 last_layer_outputs = tf.keras.Model(seq_model.input, seq_model.layers[-3].output) last_layer_outputs.summary() # (None, 22, 22, 256) mid_layer_outputs = tf.keras.Model(seq_model.input, seq_model.layers[2].output) mid_layer_outputs.summary() # (None, 26, 26, 64) import tensorflow as tf seq_model = tf.keras.Sequential( [ tf.keras.Input(shape=(32, 32, 3)), tf.keras.layers.Conv2D(16, 3, activation="relu", name='conv1'), tf.keras.layers.Conv2D(32, 3, activation="relu", name='conv2'), tf.keras.layers.Conv2D(64, 3, activation="relu", name='conv3'), tf.keras.layers.Conv2D(128, 3, activation="relu", name='conv4'), tf.keras.layers.Conv2D(256, 3, activation="relu", name='conv5'), tf.keras.layers.GlobalAveragePooling2D(), tf.keras.layers.Dense(10, activation='softmax') ] ) for l in seq_model.layers: print(l.name, l.output_shape) conv1 (None, 30, 30, 16) conv2 (None, 28, 28, 32) conv3 (None, 26, 26, 64) conv4 (None, 24, 24, 128) conv5 (None, 22, 22, 256) global_average_pooling2d_3 (None, 256) dense_3 (None, 10) 或者，如果我们想冻结它们，我们现在也可以这样做。冻结，因为我们在这里选择了相同类型的数据集。（CIFAR 10-100 ）现在，让我们使用函数式API创建一个新模型，并使用上述两个功能提取器 encoder_input = tf.keras.Input(shape=(32, 32, 3), name="img") x = tf.keras.layers.Conv2D(16, 3, activation="relu")(encoder_input) last_x = last_layer_outputs(encoder_input) print(last_x.shape) # (None, 22, 22, 256) mid_x = mid_layer_outputs(encoder_input) mid_x = tf.keras.layers.Conv2D(32, kernel_size=3, strides=1)(mid_x) print(mid_x.shape) # (None, 24, 24, 32) last_x = tf.keras.layers.GlobalMaxPooling2D()(last_x) mid_x = tf.keras.layers.GlobalMaxPooling2D()(mid_x) print(last_x.shape, mid_x.shape) # (None, 256) (None, 32) encoder_output = tf.keras.layers.Concatenate()([last_x, mid_x]) print(encoder_output.shape) # (None, 288) encoder_output = tf.keras.layers.Dense(100, activation='softmax')(encoder_output) print(encoder_output.shape) # (None, 100) encoder = tf.keras.Model(encoder_input, encoder_output, name="encoder") encoder.summary() 在CIFAR100上运行 (x_train, y_train), (x_test, y_test) = tf.keras.datasets.cifar100.load_data() # train set / data x_train = x_train.astype('float32') / 255 y_train = tf.keras.utils.to_categorical(y_train , num_classes=100) print(x_train.shape, y_train.shape) encoder.compile( loss = tf.keras.losses.CategoricalCrossentropy(), metrics = tf.keras.metrics.CategoricalAccuracy(), optimizer = tf.keras.optimizers.Adam()) # fit encoder.fit(x_train, y_train, batch_size=128, epochs=5, verbose = 1) 参考资料：根据您对我的第一篇文章的第一次评论，我正在添加一篇新文章，而不是编辑我现有的答案，因为这篇文章已经太长了。无论如何，你的担心是合理的。就连我也在努力解决子类API的一些问题。但我似乎在我的问题中写得不太好，因为人们并不觉得这是一个值得关注的问题无论如何，这里有另一个更简洁和精确的答案，因为我们构建了一个带有期望输出的单一模型。一个单独的提取器，而不是以前的两个单独的提取器，这会带来额外的计算开销。比如说，我们的顺序模型 last_layer_outputs = tf.keras.Model(seq_model.input, seq_model.layers[-3].output) last_layer_outputs.summary() # (None, 22, 22, 256) mid_layer_outputs = tf.keras.Model(seq_model.input, seq_model.layers[2].output) mid_layer_outputs.summary() # (None, 26, 26, 64) import tensorflow as tf seq_model = tf.keras.Sequential( [ tf.keras.Input(shape=(32, 32, 3)), tf.keras.layers.Conv2D(16, 3, activation="relu", name='conv1'), tf.keras.layers.Conv2D(32, 3, activation="relu", name='conv2'), tf.keras.layers.Conv2D(64, 3, activation="relu", name='conv3'), tf.keras.layers.Conv2D(128, 3, activation="relu", name='conv4'), tf.keras.layers.Conv2D(256, 3, activation="relu", name='conv5'), tf.keras.layers.GlobalAveragePooling2D(), tf.keras.layers.Dense(10, activation='softmax') ] ) for l in seq_model.layers: print(l.name, l.output_shape) conv1 (None, 30, 30, 16) conv2 (None, 28, 28, 32) conv3 (None, 26, 26, 64) conv4 (None, 24, 24, 128) conv5 (None, 22, 22, 256) global_average_pooling2d_3 (None, 256) dense_3 (None, 10) 我们希望从单个模型中获得conv3 和conv5 。我们可以很容易地做到这一点，如下所示 extractor_one = Model(modelA.input, expected_layer_1.output) extractor_two = Model(modelA.input, expected_layer_2.output) model = tf.keras.models.Model( inputs=[seq_model.input], outputs=[seq_model.get_layer('conv3').output, seq_model.get_layer('conv5').output] ) # check for i in check_model(tf.keras.Input((32, 32, 3))): print(i.name, i.shape) model_13/conv3/Relu:0 (None, 26, 26, 64) model_13/conv5/Relu:0 (None, 22, 22, 256) 很好，预期层的两个特征输出。现在，让我们使用这两个层（就像我的第一篇文章）来构建一个功能API模型 encoder_input = tf.keras.Input(shape=(32, 32, 3), name="img") x = tf.keras.layers.Conv2D(16, 3, activation="relu")(encoder_input) last_x = check_model(encoder_input)[0] print(last_x.shape) # (None, 26, 26, 64) - model_13/conv3/Relu:0 (None, 26, 26, 64) mid_x = check_model(encoder_input)[1] # model_13/conv5/Relu:0 (None, 22, 22, 256) mid_x = tf.keras.layers.Conv2D(32, kernel_size=3, strides=1)(mid_x) print(mid_x.shape) # (None, 20, 20, 32) last_x = tf.keras.layers.GlobalMaxPooling2D()(last_x) mid_x = tf.keras.layers.GlobalMaxPooling2D()(mid_x) print(last_x.shape, mid_x.shape) # (None, 64) (None, 32) encoder_output = tf.keras.layers.Concatenate()([last_x, mid_x]) print(encoder_output.shape) # (None, 96) encoder_output = tf.keras.layers.Dense(100, activation='softmax')(encoder_output) print(encoder_output.shape) # (None, 100) encoder = tf.keras.Model(encoder_input, encoder_output, name="encoder") tf.keras.utils.plot_model( encoder, show_shapes=True, show_layer_names=True ) 您想将顺序模型输出连接到函数式API模型，对吗？是的。除了最终的输出，我还想按顺序得到中间层的输出。我有一些新问题。在这种情况下，这是否意味着两个提取器分别运行？事实上，对于同一个输入图像，我希望运行一次模型以提取不同的特征。我的其他答案是否解决了这个问题？如果您在任何部分不理解，请随时提问。是的，您的答案解决了我的问题。我根据您的评论给出了另一个答案，如果它是您想要的，请将此标记为正确答案，以便将来的读者能够理解。