2016-12-27 78 views
0

关卡片断产生:Keras:加载检查点的权重由HDF5在多重GPU

checkpointer = ModelCheckpoint(filepath=os.path.join(savedir, "mid/weights.{epoch:02d}.hd5"), monitor='val_loss', verbose=1, save_best_only=False, save_weights_only=False) 
hist = model.fit_generator(
    gen.generate(batch_size = batch_size, nb_classes=nb_classes), samples_per_epoch=593920, nb_epoch=nb_epoch, verbose=1, callbacks=[checkpointer], validation_data = gen.vld_generate(VLD_PATH, batch_size = 64, nb_classes=nb_classes), nb_val_samples=10000 
) 

我训练的转储在HDF5格式mid文件多GPU主机上我的模型。当我与keras.load_weights('mid')他们装单GPU机器上,有人提出了一个错误:

Using TensorFlow backend. 
Traceback (most recent call last): 
    File "server.py", line 171, in <module> 
    model = load_model_and_weights('zhch.yml', '7_weights.52.hd5') 
    File "server.py", line 16, in load_model_and_weights 
    model.load_weights(os.path.join('model', weights_name)) 
    File "/home/lz/code/ProjectGo/meta/project/libpolicy-server/.virtualenv/lib/python3.5/site-packages/keras/engine/topology.py", line 2701, in load_weights 
    self.load_weights_from_hdf5_group(f) 
    File "/home/lz/code/ProjectGo/meta/project/libpolicy-server/.virtualenv/lib/python3.5/site-packages/keras/engine/topology.py", line 2753, in load_weights_from_hdf5_group 
    str(len(flattened_layers)) + ' layers.') 
ValueError: You are trying to load a weight file containing 1 layers into a model with 21 layers. 

有什么办法可以载入多GPU的单GPU计算机上生成检查点的权重?似乎没有问题的凯拉斯讨论这个问题,因此任何帮助将不胜感激。

+0

你能否在同一个多GPU机器中加载?错误消息说有一些层不匹配。模型有多少层? –

+0

@YaoZhang Weights可以在源机器上加载'model.load_weights()'。 multi-GPU机器上的'model.yml'和model都有21层。 – lz96

回答

2

您可以在单个GPU这样加载模型:

from keras.models import load_model 

multi_gpus_model = load_model('mid') 
origin_model = multi_gpus_model.layers[-2] # you can use multi_gpus_model.summary() to see the layer of the original model 
origin_model.save_weights('single_gpu_model.hdf5') 

“single_gpu_model.hdf5”是,你可以加载到单GPU机型的文件。