我正在使用tensorflow来微调模型,它使用线程排队输入队列。该代码运行良好,进行了几次迭代,但在几次迭代后仍然会出现以下错误。但是,代码通常会在几次迭代后退出,并且必须手动重新启动它。你能帮我理解错误吗?获取数据时入流和线程错误与tensorflow
Exception in thread Thread-29:
Traceback (most recent call last):
File "/usr/lib/python2.7/threading.py", line 810, in __bootstrap_inner
self.run()
File "/usr/lib/python2.7/threading.py", line 763, in run
self.__target(*self.__args, **self.__kwargs)
File "data_feeder.py", line 212, in enqueue_op_online_val
model_vars['labels_val']: labels_minibatch})
File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/client/session.py", line 717, in run
run_metadata_ptr)
File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/client/session.py", line 915, in _run
feed_dict_string, options, run_metadata)
File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/client/session.py", line 965, in _do_run
target_list, options, run_metadata)
File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/client/session.py", line 985, in _do_call
raise type(e)(node_def, op, message)
NotFoundError: PruneForTargets: Some target nodes not found: fifo_queue_enqueue/fifo_queue_enqueue_4/fifo_queue_enqueue_10/fifo_queue_enqueue_10/fifo_queue_enqueue_8/fifo_queue_enqueue_2/fifo_queue_enqueue_4/fifo_queue_enqueue/fifo_queue_enqueue_3/fifo_queue_enqueue_3/fifo_queue_enqueue/fifo_queue_enqueue_1/fifo_queue_enqueue_5/fifo_queue_enqueue/fifo_queue_enqueue_2/fifo_queue_enqueue/fifo_queue_enqueue_4/fifo_queue_enqueue_6/fifo_queue_enqueue_20/fifo_queue_enqueue_29/fifo_queue_enqueue/fifo_queue_enqueue_7/fifo_queue_enqueue_8/fifo_queue_enqueue_9/fifo_queue_enqueue_7/fifo_queue_enqueue/fifo_queue_enqueue_9/fifo_queue_enqueue_8/fifo_queue_enqueue_11/fifo_queue_enqueue_5/fifo_queue_enqueue_7/fifo_queue_enqueue_8/fifo_queue_enqueue_12/fifo_queue_enqueue_3/fifo_queue_enqueue_2/fifo_queue_enqueue_5/fifo_queue_enqueue_4/fifo_queue_enqueue_7/fifo_queue_enqueue_3/fifo_queue_enqueue/fifo_queue_enqueue_9/fifo_queue_enqueue_1/fifo_queue_enqueue_25/fifo_queue_enqueue_23/fifo_queue_enqueue_30/fifo_queue_enqueue_5/fifo_queue_enqueue_3/fifo_queue_enqueue_7/fifo_queue_enqueue_3/fifo_queue_enqueue/fifo_queue_enqueue_2/fifo_queue_enqueue_8/fifo_queue_enqueue_11/fifo_queue_enqueue_11/fifo_queue_enqueue/fifo_queue_enqueue_4
这里的排队运算
def enqueue_op(self, sess, model_vars, coord):
queue = model_vars['queue_train']
random.shuffle(self.keys)
img_minibatch = np.zeros((self.batch_size, 224, 224, 3))
#t = time.time()
for indx in xrange(self.num_batches):
labels_minibatch = []
# Load all files in this batch
for i,k in\
enumerate(self.keys[indx*self.batch_size:(indx+1)*self.batch_size]):
img = misc.imread(self.img_path + self.data[k]['filename'])
img_minibatch[i, :, :, :] = img_proc(img)
labels_minibatch.append(self.data[k]['labels'])
labels_minibatch =\
self.mlb.fit_transform(labels_minibatch).astype(float)
sess.run([queue.enqueue([model_vars['input'],
model_vars['labels']])],\
feed_dict={model_vars['input']: img_minibatch,
model_vars['labels']: labels_minibatch})
if coord.should_stop():
break
下面是创建线程片断
thr_train = []
for i in xrange(12):
thr_train.append(threading.Thread(target=primary_mb_feeder.enqueue_op,
args=(sess, model_vars, coord)))
thr_train[-1].setDaemon(True)
thr_train[-1].start()
你可以分享你用来创建队列操作的代码吗?缺少的节点名称非常奇怪,我不知道这个问题是否由多个线程同时创建enqueue操作引起的,因为创建一个可以在线程中重用的单个操作会更好。 (但是它可能仍然是一个bug) – mrry
我已经粘贴了我正在使用的enqueue操作的代码和创建线程来运行此操作的代码片段。 – ksikka
'primary_mb_feeder'是什么类型? enqueue_op是它的一个方法,它是如何实现的? – mrry