π§
Llama 3.3 70b Instruct Quantized.w4a16 model by RedHatAI
β 54.6
π¬Technical Deep Dive
Full Specifications [+]
π Updated daily
Source summary: Based on Hugging Face metadata. Not a recommendation.
π‘οΈ Model Transparency Report
Technical metadata sourced from upstream repositories.
Open Metadata
π Identity & Source
- id
- hf-model--redhatai--llama-3.3-70b-instruct-quantized.w4a16
- slug
- redhatai--llama-3.3-70b-instruct-quantized.w4a16
- source
- huggingface
- author
- RedHatAI
- license
- llama3.3
- tags
- safetensors, llama, facebook, meta, llama-3, int4, vllm, chat, neuralmagic, llmcompressor, conversational, 4-bit precision, compressed-tensors, text-generation, en, de, fr, it, pt, hi, es, th, arxiv:2210.17323, base_model:meta-llama/llama-3.3-70b-instruct, license:llama3.3, region:us
βοΈ Technical Specs
- architecture
- LlamaForCausalLM
- params billions
- 71.14
- context length
- 8,192
- pipeline tag
- text-generation
- vram gb
- 55.9
- vram is estimated
- true
- vram formula
- VRAM β (params * 0.75) + 2GB (KV) + 0.5GB (OS)
π Engagement & Metrics
- downloads
- 5,994
- stars
- null
- forks
- null
Data indexed from public sources. Updated daily.