CausalLM
- 원본 링크 : https://keras.io/api/keras_hub/base_classes/causal_lm/
- 최종 확인 : 2024-11-26
CausalLM
class
keras_hub.models.CausalLM()
Base class for generative language modeling tasks.
CausalLM
tasks wrap a keras_hub.models.Backbone
and
a keras_hub.models.Preprocessor
to create a model that can be used for
generation and generative fine-tuning.
CausalLM
tasks provide an additional, high-level generate()
function
which can be used to auto-regressively sample a model token by token with a
string in, string out signature. The compile()
method of all CausalLM
classes contains an additional sampler
argument, which can be used to pass
a keras_hub.samplers.Sampler
to control how the predicted distribution
will be sampled.
When calling fit()
, the tokenized input will be predicted token-by-token
with a causal mask applied, which gives both a pre-training and supervised
fine-tuning setup for controlling inference-time generation.
All CausalLM
tasks include a from_preset()
constructor which can be used
to load a pre-trained config and weights.
Example
# Load a GPT2 backbone with pre-trained weights.
causal_lm = keras_hub.models.CausalLM.from_preset(
"gpt2_base_en",
)
causal_lm.compile(sampler="top_k")
causal_lm.generate("Keras is a", max_length=64)
# Load a Mistral instruction tuned checkpoint at bfloat16 precision.
causal_lm = keras_hub.models.CausalLM.from_preset(
"mistral_instruct_7b_en",
dtype="bfloat16",
)
causal_lm.compile(sampler="greedy")
causal_lm.generate("Keras is a", max_length=64)
from_preset
method
CausalLM.from_preset(preset, load_weights=True, **kwargs)
Instantiate a keras_hub.models.Task
from a model preset.
A preset is a directory of configs, weights and other file assets used
to save and load a pre-trained model. The preset
can be passed as
one of:
- a built-in preset identifier like
'bert_base_en'
- a Kaggle Models handle like
'kaggle://user/bert/keras/bert_base_en'
- a Hugging Face handle like
'hf://user/bert_base_en'
- a path to a local preset directory like
'./bert_base_en'
For any Task
subclass, you can run cls.presets.keys()
to list all
built-in presets available on the class.
This constructor can be called in one of two ways. Either from a task
specific base class like keras_hub.models.CausalLM.from_preset()
, or
from a model class like keras_hub.models.BertTextClassifier.from_preset()
.
If calling from the a base class, the subclass of the returning object
will be inferred from the config in the preset directory.
Arguments
- preset: string. A built-in preset identifier, a Kaggle Models handle, a Hugging Face handle, or a path to a local directory.
- load_weights: bool. If
True
, saved weights will be loaded into the model architecture. IfFalse
, all weights will be randomly initialized.
Examples
# Load a Gemma generative task.
causal_lm = keras_hub.models.CausalLM.from_preset(
"gemma_2b_en",
)
# Load a Bert classification task.
model = keras_hub.models.TextClassifier.from_preset(
"bert_base_en",
num_classes=2,
)
compile
method
CausalLM.compile(
optimizer="auto", loss="auto", weighted_metrics="auto", sampler="top_k", **kwargs
)
Configures the CausalLM
task for training and generation.
The CausalLM
task extends the default compilation signature of
keras.Model.compile
with defaults for optimizer
, loss
, and
weighted_metrics
. To override these defaults, pass any value
to these arguments during compilation.
The CausalLM
task adds a new sampler
to compile
, which can be used
to control the sampling strategy used with the generate
function.
Note that because training inputs include padded tokens which are
excluded from the loss, it is almost always a good idea to compile with
weighted_metrics
and not metrics
.
Arguments
- optimizer:
"auto"
, an optimizer name, or akeras.Optimizer
instance. Defaults to"auto"
, which uses the default optimizer for the given model and task. Seekeras.Model.compile
andkeras.optimizers
for more info on possibleoptimizer
values. - loss:
"auto"
, a loss name, or akeras.losses.Loss
instance. Defaults to"auto"
, where akeras.losses.SparseCategoricalCrossentropy
loss will be applied for the token classificationCausalLM
task. Seekeras.Model.compile
andkeras.losses
for more info on possibleloss
values. - weighted_metrics:
"auto"
, or a list of metrics to be evaluated by the model during training and testing. Defaults to"auto"
, where akeras.metrics.SparseCategoricalAccuracy
will be applied to track the accuracy of the model at guessing masked token values. Seekeras.Model.compile
andkeras.metrics
for more info on possibleweighted_metrics
values. - sampler: A sampler name, or a
keras_hub.samplers.Sampler
instance. Configures the sampling method used duringgenerate()
calls. Seekeras_hub.samplers
for a full list of built-in sampling strategies. - **kwargs: See
keras.Model.compile
for a full list of arguments supported by the compile method.
generate
method
CausalLM.generate(inputs, max_length=None, stop_token_ids="auto", strip_prompt=False)
Generate text given prompt inputs
.
This method generates text based on given inputs
. The sampling method
used for generation can be set via the compile()
method.
If inputs
are a tf.data.Dataset
, outputs will be generated
“batch-by-batch” and concatenated. Otherwise, all inputs will be handled
as a single batch.
If a preprocessor
is attached to the model, inputs
will be
preprocessed inside the generate()
function and should match the
structure expected by the preprocessor
layer (usually raw strings).
If a preprocessor
is not attached, inputs should match the structure
expected by the backbone
. See the example usage above for a
demonstration of each.
Arguments
- inputs: python data, tensor data, or a
tf.data.Dataset
. If apreprocessor
is attached to the model,inputs
should match the structure expected by thepreprocessor
layer. If apreprocessor
is not attached,inputs
should match the structure expected thebackbone
model. - max_length: Optional. int. The max length of the generated sequence.
Will default to the max configured
sequence_length
of thepreprocessor
. Ifpreprocessor
isNone
,inputs
should be should be padded to the desired maximum length and this argument will be ignored. - stop_token_ids: Optional.
None
, “auto”, or tuple of token ids. Defaults to “auto” which uses thepreprocessor.tokenizer.end_token_id
. Not specifying a processor will produce an error. None stops generation after generatingmax_length
tokens. You may also specify a list of token id’s the model should stop on. Note that sequences of tokens will each be interpreted as a stop token, multi-token stop sequences are not supported. - strip_prompt: Optional. By default, generate() returns the full prompt followed by its completion generated by the model. If this option is set to True, only the newly generated text is returned.
save_to_preset
method
CausalLM.save_to_preset(preset_dir)
Save task to a preset directory.
Arguments
- preset_dir: The path to the local model preset directory.
preprocessor
property
keras_hub.models.CausalLM.preprocessor
A keras_hub.models.Preprocessor
layer used to preprocess input.
backbone
property
keras_hub.models.CausalLM.backbone
A keras_hub.models.Backbone
model with the core architecture.