DeepSeek V4 Flash W4A16 FP8 model by Canada Quant
β 57.1
Technical Deep Dive
Full Specifications
Updated daily
Source summary: Based on Hugging Face metadata. Not a recommendation.
Model Transparency Report
Technical metadata sourced from upstream repositories.
Open Metadata
Identity & Source
- id: hf-model--canada-quant--deepseek-v4-flash-w4a16-fp8
- slug: canada-quant--deepseek-v4-flash-w4a16-fp8
- source: huggingface
- author: Canada Quant
- license: MIT
- tags: vllm, safetensors, deepseek_v4, deepseek, compressed-tensors, w4a16, gptq, fp8, mixture-of-experts, moe, text-generation, en, zh, base_model:deepseek-ai/deepseek-v4-flash, license:mit, region:us
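The identity fields above follow a regular pattern, so they can be unpacked mechanically. The sketch below is illustrative only: the function names are hypothetical, and it assumes the id has the layout `<prefix>--<owner>--<model>` and that tags containing a colon are `key:value` pairs. The derived `owner/model` string is a guess at the upstream repository name and has not been checked against Hugging Face.

```python
def repo_id_from_index_id(index_id: str) -> str:
    """Derive an 'owner/model' string from the index id.

    Assumes the layout '<prefix>--<owner>--<model>' (an assumption based on
    the id and slug shown on this page, not a documented format).
    """
    _prefix, owner, model = index_id.split("--", 2)
    return f"{owner}/{model}"


def split_tags(tag_string: str) -> tuple[list[str], dict[str, str]]:
    """Split a comma-separated tag string into plain tags and key:value tags."""
    plain: list[str] = []
    keyed: dict[str, str] = {}
    for tag in (t.strip() for t in tag_string.split(",")):
        if not tag:
            continue
        if ":" in tag:
            # Split on the first colon only, so values may contain colons.
            key, value = tag.split(":", 1)
            keyed[key] = value
        else:
            plain.append(tag)
    return plain, keyed


INDEX_ID = "hf-model--canada-quant--deepseek-v4-flash-w4a16-fp8"
TAGS = (
    "vllm, safetensors, deepseek_v4, deepseek, compressed-tensors, w4a16, "
    "gptq, fp8, mixture-of-experts, moe, text-generation, en, zh, "
    "base_model:deepseek-ai/deepseek-v4-flash, license:mit, region:us"
)

repo_id = repo_id_from_index_id(INDEX_ID)
plain_tags, keyed_tags = split_tags(TAGS)
print(repo_id)
print(keyed_tags)
```

Splitting on the first colon keeps values such as `deepseek-ai/deepseek-v4-flash` intact, and the keyed tags give the base model and license without further parsing.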
Technical Specs
- architecture: DeepseekV4ForCausalLM
- params (billions): 44.1
- context length: 4,096
- pipeline tag: text-generation
- vram (GB): 35.6
- vram is estimated: true
- vram formula: VRAM ≈ (params * 0.75) + 2 GB (KV) + 0.5 GB (OS)
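As a check on the figures above, the VRAM formula can be evaluated directly. This is a minimal sketch: the function and parameter names are hypothetical, and it assumes `params` is the parameter count in billions, so that the 0.75 factor reads as gigabytes per billion parameters. The page does not explain how that factor was chosen.

```python
def estimate_vram_gb(
    params_billions: float,
    gb_per_billion_params: float = 0.75,  # weight factor from the listed formula
    kv_cache_gb: float = 2.0,             # fixed KV-cache allowance from the formula
    overhead_gb: float = 0.5,             # fixed "OS" allowance from the formula
) -> float:
    """Evaluate VRAM ≈ (params * 0.75) + 2 GB (KV) + 0.5 GB (OS)."""
    return params_billions * gb_per_billion_params + kv_cache_gb + overhead_gb


estimate = estimate_vram_gb(44.1)
print(f"Estimated VRAM: {estimate:.1f} GB")
```

With 44.1 billion parameters the weights term is 33.075 GB, and adding the two fixed allowances gives 35.575 GB, which rounds to the listed 35.6 GB. The KV-cache term is a constant here, so the estimate does not scale with context length or batch size.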
Engagement & Metrics
- downloads: 7,380
- stars: null
- forks: null
Data indexed from public sources. Updated daily.