Python Tensorflow数字网络不工作_Python_Machine Learning_Tensorflow

Python Tensorflow数字网络不工作

python machine-learning tensorflow

Python Tensorflow数字网络不工作,python,machine-learning,tensorflow,Python,Machine Learning,Tensorflow,谢谢你考虑回答我的问题。我在使用TensorFlow时遇到问题，我在其中输入数据并不断获得输出： ('Epoch ', 0, ' completed out of ', 10, 'loss:', nan) ('Epoch ', 1, ' completed out of ', 10, 'loss:', nan) ('Epoch ', 2, ' completed out of ', 10, 'loss:', nan) ('Epoch ', 3, ' completed out of ', 10,

谢谢你考虑回答我的问题。我在使用TensorFlow时遇到问题，我在其中输入数据并不断获得输出：

('Epoch ', 0, ' completed out of ', 10, 'loss:', nan)
('Epoch ', 1, ' completed out of ', 10, 'loss:', nan)
('Epoch ', 2, ' completed out of ', 10, 'loss:', nan)
('Epoch ', 3, ' completed out of ', 10, 'loss:', nan)
('Epoch ', 4, ' completed out of ', 10, 'loss:', nan)
('Epoch ', 5, ' completed out of ', 10, 'loss:', nan)
('Epoch ', 6, ' completed out of ', 10, 'loss:', nan)
('Epoch ', 7, ' completed out of ', 10, 'loss:', nan)
('Epoch ', 8, ' completed out of ', 10, 'loss:', nan)
('Epoch ', 9, ' completed out of ', 10, 'loss:', nan)
('Accuracy:', 1.0)

我的X_列车数据是一个500 X 1000的矩阵，其中每行由数字组成，例如：

-0.38484444, 1.4542222222 ...

我希望你能明白。。。我的Y_序列数据由二进制分类（0，1）组成。 len（X_列[0]）返回1000，即样本量（列）

我不太确定我还需要澄清什么问题；我将包括我的简单TensorFlow代码，如果您需要更多关于我的代码或问题的说明，请告诉我

谢谢你抽出时间

import tensorflow as tf
import pandas as pd
import numpy as np

da = pd.read_csv("data.csv", header=None)
ta = pd.read_csv("BMI.csv")

X_data = da.iloc[:, :1000]
Y_data = np.expand_dims(ta.iloc[:, -1], axis = 1)

X_train = X_data.iloc[:500 :,]
X_test = X_data.iloc[500:,:]

Y_train = Y_data[:500 :,]
Y_test = Y_data[735:,:]


X_train = np.array(X_train)
X_test = np.array(X_test)

n_nodes_hl1 = 500
n_nodes_hl2 = 500
n_nodes_hl3 = 500

n_classes = 1
batch_size = 10

x = tf.placeholder('float', [None, len(X_train[0])])
y = tf.placeholder('float')

def neural_network_model(data):
    hidden_1_layer = {'weights': tf.Variable(tf.random_normal([len(X_train[0]), n_nodes_hl1])),
                      'biases': tf.Variable(tf.random_normal([n_nodes_hl1]))}
    hidden_2_layer = {'weights': tf.Variable(tf.random_normal([n_nodes_hl1, n_nodes_hl2])),
                      'biases': tf.Variable(tf.random_normal([n_nodes_hl2]))}
    hidden_3_layer = {'weights': tf.Variable(tf.random_normal([n_nodes_hl2, n_nodes_hl3])),
                      'biases': tf.Variable(tf.random_normal([n_nodes_hl3]))}
    output_layer = {'weights': tf.Variable(tf.random_normal([n_nodes_hl3, n_classes])),
                      'biases': tf.Variable(tf.random_normal([n_classes]))}

    l1 = tf.add(tf.matmul(data, hidden_1_layer['weights']), hidden_1_layer['biases'])
    l1 = tf.nn.relu(l1)

    l2 = tf.add(tf.matmul(l1, hidden_2_layer['weights']), hidden_2_layer['biases'])
    l2 = tf.nn.relu(l2)

    l3 = tf.add(tf.matmul(l2, hidden_3_layer['weights']), hidden_3_layer['biases'])
    l3 = tf.nn.relu(l3)

    output = tf.matmul(l3, output_layer['weights']) + output_layer['biases']

    return output


def train_nueral_network(x):
    prediction = neural_network_model(x)
    cost = tf.reduce_mean( tf.nn.softmax_cross_entropy_with_logits(logits=prediction, labels=y) )
    optimizer = tf.train.AdamOptimizer().minimize(cost)

    hm_epochs = 10

    with tf.Session() as sess:
        sess.run(tf.global_variables_initializer())

        for epoch in range(hm_epochs):
            epoch_loss = 0

            i = 0
            while i < len(X_train[0]):
                start = i
                end = i + batch_size

                batch_x = np.array(X_train[start:end])
                batch_y = np.array(Y_train[start:end])

                _, c = sess.run([optimizer, cost], feed_dict= {x: batch_x, y: batch_y})
                epoch_loss += c
                i += batch_size


            print('Epoch ', epoch, ' completed out of ', hm_epochs, 'loss:', epoch_loss)

        correct = tf.equal(tf.argmax(prediction, 1), tf.argmax(y, 1))
        accuracy = tf.reduce_mean(tf.cast(correct, 'float'))
        print('Accuracy:', accuracy.eval({x:X_test, y:Y_test}))


train_nueral_network(x)

将tensorflow导入为tf
作为pd进口熊猫
将numpy作为np导入
da=pd.read\u csv（“data.csv”，header=None）
ta=pd.read\U csv（“BMI.csv”）
X_data=da.iloc[：，：1000]
Y_data=np.expand_dims（ta.iloc[：，-1]，轴=1）
X_列=X_数据.iloc[：500:，]
X_test=X_data.iloc[500:，：]
Y_列=Y_数据[：500:，]
Y_测试=Y_数据[735:，：]
X_列=np.数组（X_列）
X_测试=np.数组（X_测试）
n_节点_hl1=500
n_节点_hl2=500
n_节点\u hl3=500
n_类=1
批量大小=10
x=tf.占位符（'float'，[None，len（x_列[0]））
y=tf.占位符（'float'）
def神经网络模型（数据）：
隐藏的_1_层={'weights'：tf.变量（tf.random_normal（[len（X_train[0]），n_节点_hl1]），
“偏差”：tf.Variable（tf.random_normal（[n_nodes_hl1]））
隐藏的_2_层={'weights'：tf.Variable（tf.random_normal（[n_nodes_hl1，n_nodes_hl2]），
“偏差”：tf.Variable（tf.random_normal（[n_nodes_hl2]））
隐藏的_3_层={'weights'：tf.Variable（tf.random_normal（[n_nodes_hl2，n_nodes_hl3]），
“偏差”：tf.Variable（tf.random_normal（[n_nodes_hl3]））
输出_层={'weights'：tf.Variable（tf.random_normal（[n_节点_hl3，n_类]），
“偏差”：tf.Variable（tf.random_normal（[n_类]））
l1=tf.add（tf.matmul（数据，隐藏层['weights'）），隐藏层['biases'））
l1=tf.nn.relu（l1）
l2=tf.add（tf.matmul（l1，隐藏的\u 2\u层['权重]），隐藏的\u 2\u层['偏差]）
l2=tf.nn.relu（l2）
l3=tf.add（tf.matmul（l2，隐藏层['weights'）），隐藏层['biases'））
l3=tf.nn.relu（l3）
输出=tf.matmul（l3，输出层['weights']）+输出层['biases']
返回输出
def系列通用网络（x）：
预测=神经网络模型（x）
成本=tf.reduce\u平均值（tf.nn.softmax\u交叉\u熵\u与logits（logits=prediction，labels=y））
优化器=tf.train.AdamOptimizer（）。最小化（成本）
hm_时代=10
使用tf.Session（）作为sess：
sess.run（tf.global\u variables\u initializer（））
对于范围内的历元（hm_历元）：
历元损失=0
i=0
当我

您可以用tf.nn.tanh替换tf.nn.relu并再次训练它，然后检查是否会得到相同的结果。有时，ReLU会导致消失梯度问题

您的

n_classes=1

。因此，在单个神经元上应用softmax，该神经元的计算值应始终为1。您应该设置

n\u classes=2

此外，在当前设置中，您使用的精度评估将始终100%正确：

correct = tf.equal(tf.argmax(prediction, 1), tf.argmax(y, 1))

这是因为

预测

和

都是形状

（批次大小，1）

，因此所有样本的argmax始终为0

我建议用一个热的表示法来表示

。一旦你这样做了，剩下的代码应该可以工作。

你的

n_class=1