Default service settings for InstructLab
Review the default settings for InstructLab. These settings cannot be modified.
Training settings
- Training strategy
- Specifies the training strategy that is used.
-
--strategy=lab-multiphase
- Base Model to be trained
- Specifies the base model to be trained
-
- Version 1.4:
--model-path=granite-3.1-8b-starter-v1
- Version 1.4:
-
- Version 1.3:
--model-path=granite-8b-starter
- Version 1.3:
-
- Version 1.2:
--model-path=granite-7b-starter
- Version 1.2:
IBM Granite is provided under the Apache License 2.0. For more information, see the Apache License documentation
- MT-Bench judge
- Specifies the MT-Bench judge model. This parameter is the absolute path to the local judge model directory. If necessary, you can download the model by running
ilab model download
. -
- Default judge model:
prometheus-8x7b-v2-0
- Default judge model:
-
- Parameter (includes directory path):
--phased-mt-bench-judge=/instructlab/models/prometheus-8x7b-v2-0
; config:train.phased_mt_bench_judge
- Parameter (includes directory path):
- Training batch size per phase
- Specifies the total size of a training batch over all GPUs, per phase.
-
- Phase 1:
--phased-phase1-effective-batch-size=128
; config:train.phased_phase1_effective_batch_size
,
- Phase 1:
-
- Phase 2:
--phased-phase2-effective-batch-size=3840
; config:train.phased_phase2_effective_batch_size
- Phase 2:
- Epochs per phase
- Specifies the number of epochs to run for each phase of end-to-end training.
-
- Knowledge: Phase 1:
--phased-phase1-num-epochs=6
; config:train.phased_phase1_num_epochs
- Knowledge: Phase 1:
-
- Skills: Phase 2:
--phased-phase2-num-epochs=10
; config:train.phased_phase2_num_epochs
- Skills: Phase 2:
- Padding-free transformer
- Specifies whether training is performed on a padding-free transformer.
-
--is-padding-free=true
; config:train.is_padding_free
Synthetic data generation (SDG) settings
- Teacher Model
- Specifies the model used during synthetic data generation.
-
- Default teacher model:
mixtral-8x7b-instruct-v0-1
- Default teacher model:
-
- Parameter (includes directory path):
--model=/instructlab/models/mixtral-8x7b-instruct-v0-1
; config:generate.pipeline
- Parameter (includes directory path):
- Instructions generated per seed example
- Specifies the number of instructions to generate for each seed example. Each example maps to sample Q&A pairs for new skills. Examples are generated with both the same Q&A pairs and chunks of the knowledge document, so that the resulting data set is typically larger for a knowledge addition for the same value.
--sdg-scale-factor=30
- Data generation pipeline
- Specifies the data generation pipeline to use.
- Pipeline:
agentic
- Parameter (includes directory path):
--pipeline=/instructlab/sdg/pipelines/agentic
; config:generate.teacher.model_path
Model settings
- Context window
- Specifies the maximum amount of bytes that be can sent in a prompt. The content window size supported is 4096 bytes.
- To find this setting, open
config.json
undertrained_models/$TRAINING_JOB_ID/model/
and locate themax_position_embeddings
field, for example,"max_position_embeddings": 4096
.