π§ Model
Yi-34B-Chat
by 01-ai
Yi-34B-Chat is an open-source AI model by 01-ai
π Updated 12/31/2025
Technical Specifications
Parameters34.39
ArchitectureLlamaForCausalLM
View Config (3 entries)
{
"architectures": [
"LlamaForCausalLM"
],
"model_type": "llama",
"tokenizer_config": {
"bos_token": "<|startoftext|>",
"chat_template": "{% if not add_generation_prompt is defined %}{% set add_generation_prompt = false %}{% endif %}{% for message in messages %}{{'<|im_start|>' + message['role'] + '\n' + message['content'] + '<|im_end|>' + '\n'}}{% endfor %}{% if add_generation_prompt %}{{ '<|im_start|>assistant\n' }}{% endif %}",
"eos_token": "<|endoftext|>",
"pad_token": "<unk>",
"unk_token": "<unk>",
"use_default_system_prompt": true
}
}
πΎ
Est. VRAM Required
~23 GB
Estimation Formula
VRAM = params Γ 0.6 + 2 GB
Based on FP16 precision.
β οΈ Does not account for KV cache or parallel overhead.
π Estimate only. Actual requirements may vary.
π Daily sync (11:00 Beijing)
Based on open-source metadata snapshot. Last synced: Dec 31, 2025
π§ Architecture Explorer
Neural network architecture
1 Input Layer
2 Hidden Layers
3 Attention
4 Output Layer
Parameters 34.39B
Technical Specifications
Parameters34.39
ArchitectureLlamaForCausalLM
0View Config (3 entries)
{
"architectures": [
"LlamaForCausalLM"
],
"model_type": "llama",
"tokenizer_config": {
"bos_token": "<|startoftext|>",
"chat_template": "{% if not add_generation_prompt is defined %}{% set add_generation_prompt = false %}{% endif %}{% for message in messages %}{{'<|im_start|>' + message['role'] + '\n' + message['content'] + '<|im_end|>' + '\n'}}{% endfor %}{% if add_generation_prompt %}{{ '<|im_start|>assistant\n' }}{% endif %}",
"eos_token": "<|endoftext|>",
"pad_token": "<unk>",
"unk_token": "<unk>",
"use_default_system_prompt": true
}
}
π Limitations & Considerations
- β’ Benchmark scores may vary based on evaluation methodology and hardware configuration.
- β’ VRAM requirements are estimates; actual usage depends on quantization and batch size.
- β’ FNI scores are relative rankings and may change as new models are added.
- β License Unknown: Verify licensing terms before commercial use.
- β’ Source: Huggingface
π Related Resources
π Related Papers
No related papers linked yet. Check the model's official documentation for research papers.
π Training Datasets
Training data information not available. Refer to the original model card for details.
π Related Models
Data unavailable