I would like to make a deep copy of a keras model (called model1
) of mine in order to be able to use it in a for a loop and then re-initialize for each for-loop iteration and perform fit
with one additional sample to the model. I would like to be able to initialize the model after each iteration since after performing the fit
(my model is modified however, I want it keep it as it is when i am loading from the path using the load_weights).
My code looks like:
model1= create_Model()
model1.compile(optimizer='rmsprop', loss='categorical_crossentropy')
model1.load_weights('my_weights')
model_copy= create_Model()
model_copy.compile(optimizer='rmsprop', loss='categorical_crossentropy')
model_copy= keras.models.clone_model(model1)
for j in range(0, image_size):
model_copy.fit(sample[j], sample_lbl[j])
prediction= model_copy.predict(sample[j])
Also, it is not really efficient for me to load the model each time in the for-loop since that is time-consuming. How can I do properly the deep copy in my case? The code I posted give the following error that concerns the function .fit and my reference model model_copy:
RuntimeError: You must compile a model before training/testing. Use
model.compile(optimizer, loss)
.
The issue is that model_copy is probably not compiled after cloning. There are in fact a few issues:
Apparently cloning doesn't copy over the loss function, optimizer info, etc.
Before compiling you need to also build the model.
Moreover, cloning doesn't copy weight over
So you need a couple extra lines after cloning. For example, for 10 input variables:
model_copy= keras.models.clone_model(model1)
model_copy.build((None, 10)) # replace 10 with number of variables in input layer
model_copy.compile(optimizer='rmsprop', loss='categorical_crossentropy')
model_copy.set_weights(model.get_weights())
If I understand your question correctly, there is an easier way to do this. You don't need to clone the model, just need to save the old_weights and set the weights at beginning of the loop. You can simply load weights from file as you are doing.
for _ in range(10):
model1= create_Model()
model1.compile(optimizer='rmsprop', loss='categorical_crossentropy')
model1.load_weights('my_weights')
for j in range(0, image_size):
model1.fit(sample[j], sample_lbl[j])
prediction= model1.predict(sample[j])
Or if you prefer not to load from file:
model1= create_Model()
model1.compile(optimizer='rmsprop', loss='categorical_crossentropy')
model1.load_weights('my_weights')
old_weights = model1.get_weights()
for _ in range(10):
model1.set_weights(old_weights)
for j in range(0, image_size):
model1.fit(sample[j], sample_lbl[j])
prediction= model1.predict(sample[j])