merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the Passthrough merge method using PleIAs/Baguettotron as a base.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

merge_method: passthrough
dtype: bfloat16
out_dtype: float32
base_model: PleIAs/Baguettotron
slices:
  - sources:
    - model: PleIAs/Baguettotron
      layer_range: [0,30]
  - sources:
    - model: PleIAs/Baguettotron
      layer_range: [20,40]
  - sources:
    - model: PleIAs/Baguettotron
      layer_range: [30,66]
  - sources:
    - model: PleIAs/Baguettotron
      layer_range: [40,76]
  - sources:
    - model: PleIAs/Baguettotron
      layer_range: [50,80]
  - sources:
    - model: PleIAs/Baguettotron
      layer_range: [10,38]
  - sources:
    - model: PleIAs/Baguettotron
      layer_range: [0,22]
  - sources:
    - model: PleIAs/Baguettotron
      layer_range: [14,70]
tokenizer:
  source: base
parameters:
  normalize: true
Downloads last month
10
Safetensors
Model size
1.0B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for mstyslavity/boulango_random

Finetuned
(4)
this model
Finetunes
1 model