π§
Llama 3.3 70b Instruct Quantized.w8a8 model by RedHatAI
β 57.6
π¬Technical Deep Dive
Full Specifications [+]
π Updated daily
Source summary: Based on Hugging Face metadata. Not a recommendation.
π‘οΈ Model Transparency Report
Technical metadata sourced from upstream repositories.
Open Metadata
π Identity & Source
- id
- hf-model--redhatai--llama-3.3-70b-instruct-quantized.w8a8
- slug
- redhatai--llama-3.3-70b-instruct-quantized.w8a8
- source
- huggingface
- author
- RedHatAI
- license
- llama3.3
- tags
- safetensors, llama, facebook, meta, llama-3, int8, vllm, chat, neuralmagic, llmcompressor, conversational, 8-bit precision, compressed-tensors, text-generation, en, de, fr, it, pt, hi, es, th, base_model:meta-llama/llama-3.3-70b-instruct, license:llama3.3, 8-bit, region:us
βοΈ Technical Specs
- architecture
- LlamaForCausalLM
- params billions
- 70.56
- context length
- 8,192
- pipeline tag
- text-generation
- vram gb
- 55.4
- vram is estimated
- true
- vram formula
- VRAM β (params * 0.75) + 2GB (KV) + 0.5GB (OS)
π Engagement & Metrics
- downloads
- 17,022
- stars
- null
- forks
- null
Data indexed from public sources. Updated daily.