ModelCheckpoint
- Original Link : https://keras.io/api/callbacks/model_checkpoint/
- Last Checked at : 2024-11-25
ModelCheckpoint
class
keras.callbacks.ModelCheckpoint(
filepath,
monitor="val_loss",
verbose=0,
save_best_only=False,
save_weights_only=False,
mode="auto",
save_freq="epoch",
initial_value_threshold=None,
)
Callback to save the Keras model or model weights at some frequency.
ModelCheckpoint
callback is used in conjunction with training using
model.fit()
to save a model or weights (in a checkpoint file) at some
interval, so the model or weights can be loaded later to continue the
training from the state saved.
A few options this callback provides include:
- Whether to only keep the model that has achieved the “best performance” so far, or whether to save the model at the end of every epoch regardless of performance.
- Definition of “best”; which quantity to monitor and whether it should be maximized or minimized.
- The frequency it should save at. Currently, the callback supports saving at the end of every epoch, or after a fixed number of training batches.
- Whether only weights are saved, or the whole model is saved.
Example
model.compile(loss=..., optimizer=...,
metrics=['accuracy'])
EPOCHS = 10
checkpoint_filepath = '/tmp/ckpt/checkpoint.model.keras'
model_checkpoint_callback = keras.callbacks.ModelCheckpoint(
filepath=checkpoint_filepath,
monitor='val_accuracy',
mode='max',
save_best_only=True)
model.fit(epochs=EPOCHS, callbacks=[model_checkpoint_callback])
keras.models.load_model(checkpoint_filepath)
checkpoint_filepath = '/tmp/ckpt/checkpoint.weights.h5'
model_checkpoint_callback = keras.callbacks.ModelCheckpoint(
filepath=checkpoint_filepath,
save_weights_only=True,
monitor='val_accuracy',
mode='max',
save_best_only=True)
model.fit(epochs=EPOCHS, callbacks=[model_checkpoint_callback])
model.load_weights(checkpoint_filepath)
Arguments
- filepath: string or
PathLike
, path to save the model file.filepath
can contain named formatting options, which will be filled the value ofepoch
and keys inlogs
(passed inon_epoch_end
). Thefilepath
name needs to end with".weights.h5"
whensave_weights_only=True
or should end with".keras"
when checkpoint saving the whole model (default). For example: iffilepath
is"{epoch:02d}-{val_loss:.2f}.keras"
, then the model checkpoints will be saved with the epoch number and the validation loss in the filename. The directory of the filepath should not be reused by any other callbacks to avoid conflicts. - monitor: The metric name to monitor. Typically the metrics are set by
the
Model.compile
method. Note:- Prefix the name with
"val_"
to monitor validation metrics. - Use
"loss"
or"val_loss"
to monitor the model’s total loss. - If you specify metrics as strings, like
"accuracy"
, pass the same string (with or without the"val_"
prefix). - If you pass
metrics.Metric
objects,monitor
should be set tometric.name
- If you’re not sure about the metric names you can check the
contents of the
history.history
dictionary returned byhistory = model.fit()
- Multi-output models set additional prefixes on the metric names.
- Prefix the name with
- verbose: Verbosity mode, 0 or 1. Mode 0 is silent, and mode 1 displays messages when the callback takes an action.
- save_best_only: if
save_best_only=True
, it only saves when the model is considered the “best” and the latest best model according to the quantity monitored will not be overwritten. Iffilepath
doesn’t contain formatting options like{epoch}
thenfilepath
will be overwritten by each new better model. - mode: one of {
"auto"
,"min"
,"max"
}. Ifsave_best_only=True
, the decision to overwrite the current save file is made based on either the maximization or the minimization of the monitored quantity. Forval_acc
, this should be"max"
, forval_loss
this should be"min"
, etc. In"auto"
mode, the mode is set to"max"
if the quantities monitored are"acc"
or start with"fmeasure"
and are set to"min"
for the rest of the quantities. - save_weights_only: if
True
, then only the model’s weights will be saved (model.save_weights(filepath)
), else the full model is saved (model.save(filepath)
). - save_freq:
"epoch"
or integer. When using"epoch"
, the callback saves the model after each epoch. When using integer, the callback saves the model at end of this many batches. If theModel
is compiled withsteps_per_execution=N
, then the saving criteria will be checked every Nth batch. Note that if the saving isn’t aligned to epochs, the monitored metric may potentially be less reliable (it could reflect as little as 1 batch, since the metrics get reset every epoch). Defaults to"epoch"
. - initial_value_threshold: Floating point initial “best” value of the
metric to be monitored. Only applies if
save_best_value=True
. Only overwrites the model weights already saved if the performance of current model is better than this value.