IBM Cloud Docs
Default service settings for InstructLab

Default service settings for InstructLab

Review the default settings for InstructLab. These settings cannot be modified.

Training settings

Training strategy
Specifies the training strategy that is used.
  • --strategy=lab-multiphase
Base Model to be trained
Specifies the base model to be trained
  • Version 1.4: --model-path=granite-3.1-8b-starter-v1
  • Version 1.3: --model-path=granite-8b-starter
  • Version 1.2: --model-path=granite-7b-starter

IBM Granite is provided under the Apache License 2.0. For more information, see the Apache License documentation

MT-Bench judge
Specifies the MT-Bench judge model. This parameter is the absolute path to the local judge model directory. If necessary, you can download the model by running ilab model download.
  • Default judge model: prometheus-8x7b-v2-0
  • Parameter (includes directory path): --phased-mt-bench-judge=/instructlab/models/prometheus-8x7b-v2-0; config: train.phased_mt_bench_judge
Training batch size per phase
Specifies the total size of a training batch over all GPUs, per phase.
  • Phase 1: --phased-phase1-effective-batch-size=128; config:train.phased_phase1_effective_batch_size,
  • Phase 2: --phased-phase2-effective-batch-size=3840; config:train.phased_phase2_effective_batch_size
Epochs per phase
Specifies the number of epochs to run for each phase of end-to-end training.
  • Knowledge: Phase 1: --phased-phase1-num-epochs=6; config: train.phased_phase1_num_epochs
  • Skills: Phase 2: --phased-phase2-num-epochs=10; config: train.phased_phase2_num_epochs
Padding-free transformer
Specifies whether training is performed on a padding-free transformer.
  • --is-padding-free=true; config: train.is_padding_free

Synthetic data generation (SDG) settings

Teacher Model
Specifies the model used during synthetic data generation.
  • Default teacher model: mixtral-8x7b-instruct-v0-1
  • Parameter (includes directory path): --model=/instructlab/models/mixtral-8x7b-instruct-v0-1; config: generate.pipeline
Instructions generated per seed example
Specifies the number of instructions to generate for each seed example. Each example maps to sample Q&A pairs for new skills. Examples are generated with both the same Q&A pairs and chunks of the knowledge document, so that the resulting data set is typically larger for a knowledge addition for the same value.
--sdg-scale-factor=30
Data generation pipeline
Specifies the data generation pipeline to use.
Pipeline: agentic
Parameter (includes directory path): --pipeline=/instructlab/sdg/pipelines/agentic; config: generate.teacher.model_path

Model settings

Context window
Specifies the maximum amount of bytes that be can sent in a prompt. The content window size supported is 4096 bytes.
To find this setting, open config.json under trained_models/$TRAINING_JOB_ID/model/ and locate the max_position_embeddings field, for example, "max_position_embeddings": 4096.