TensorFlow模型受损失0

import tensorflow as tf 
import numpy as np 
def weight(shape): 
return tf.Variable(tf.truncated_normal(shape, stddev=0.1)) 
def bias(shape): 
return tf.Variable(tf.constant(0.1, shape=shape)) 
def output(input,w,b): 
return tf.matmul(input,w)+b 
x_columns = 33 
y_columns = 1 
layer1_num = 7 
layer2_num = 7 
epoch_num = 10 
train_num = 1000 
batch_size = 100 
display_size = 1 
x = tf.placeholder(tf.float32,[None,x_columns]) 
y = tf.placeholder(tf.float32,[None,y_columns]) 

layer1 = 
tf.nn.relu(output(x,weight([x_columns,layer1_num]),bias([layer1_num]))) 
layer2=tf.nn.relu 
(output(layer1,weight([layer1_num,layer2_num]),bias([layer2_num]))) 
prediction = output(layer2,weight([layer2_num,y_columns]),bias([y_columns])) 

loss=tf.reduce_mean 
(tf.nn.softmax_cross_entropy_with_logits(labels=y,logits=prediction)) 
train_step = tf.train.AdamOptimizer().minimize(loss) 

sess = tf.InteractiveSession() 
sess.run(tf.global_variables_initializer()) 
for epoch in range(epoch_num): 
    avg_loss = 0. 
    for i in range(train_num): 
     index = np.random.choice(len(x_train),batch_size) 
     x_train_batch = x_train[index] 
     y_train_batch = y_train[index] 
     _,c = sess.run([train_step,loss],feed_dict= 
{x:x_train_batch,y:y_train_batch}) 
     avg_loss += c/train_num 
    if epoch % display_size == 0: 
     print("Epoch:{0},Loss:{1}".format(epoch+1,avg_loss)) 
print("Training Finished")

我的模型得到时期：2，损耗：0.0 时期：3，损耗：0.0 时期：4，损耗：0.0 历元：5，损耗：0.0 时期：6 ，损失：0.0 时期：7，损失：0.0 时代：8，损失：0.0 时期：9，损失：0.0 时期：10，损失：0.0 培训之后TensorFlow模型受损失0

我该如何处理这个问题？

来源

2017-05-04 yoshi

softmax_cross_entropy_with_logits需要单热形式的标签，即形状为[batch_size, num_classes]。在这里，你有y_columns = 1，这意味着只有1个类，它必然总是预测的和“基础事实”（从你的网络的角度来看），所以无论权重是多少，你的输出总是正确的。因此，loss=0。

我猜你确实有不同的类别，并且y_train包含标签的ID。然后predictions应该是形状[batch_size, num_classes]，而不是softmax_cross_entropy_with_logits您应该使用tf.nn.sparse_softmax_cross_entropy_with_logits

来源

2017-05-04 12:19:45 gdelab

非常感谢！你的回答告诉我，我对输入类错误了。我已经能够预测！ – yoshi

TensorFlow模型受损失0

回答

相关问题