🧠 Model

deepseek-coder-33b-instruct

Name: deepseek-coder-33b-instruct
Author: deepseek-ai

by deepseek-ai

deepseek-coder-33b-instruct is an open-source AI model by deepseek-ai

🕐 Updated 12/31/2025

Compare This Model

Technical Specifications

Parameters33.34

ArchitectureLlamaForCausalLM

View Config (3 entries)


{
  "architectures": [
    "LlamaForCausalLM"
  ],
  "model_type": "llama",
  "tokenizer_config": {
    "bos_token": {
      "__type": "AddedToken",
      "content": "<｜begin▁of▁sentence｜>",
      "lstrip": false,
      "normalized": true,
      "rstrip": false,
      "single_word": false
    },
    "eos_token": {
      "__type": "AddedToken",
      "content": "<|EOT|>",
      "lstrip": false,
      "normalized": true,
      "rstrip": false,
      "single_word": false
    },
    "pad_token": {
      "__type": "AddedToken",
      "content": "<｜end▁of▁sentence｜>",
      "lstrip": false,
      "normalized": true,
      "rstrip": false,
      "single_word": false
    },
    "unk_token": null,
    "chat_template": "{% if not add_generation_prompt is defined %}\n{% set add_generation_prompt = false %}\n{% endif %}\n{%- set ns = namespace(found=false) -%}\n{%- for message in messages -%}\n    {%- if message['role'] == 'system' -%}\n        {%- set ns.found = true -%}\n    {%- endif -%}\n{%- endfor -%}\n{{bos_token}}{%- if not ns.found -%}\n{{'You are an AI programming assistant, utilizing the Deepseek Coder model, developed by Deepseek Company, and you only answer questions related to computer science. For politically sensitive questions, security and privacy issues, and other non-computer science questions, you will refuse to answer\\n'}}\n{%- endif %}\n{%- for message in messages %}\n    {%- if message['role'] == 'system' %}\n{{ message['content'] }}\n    {%- else %}\n        {%- if message['role'] == 'user' %}\n{{'### Instruction:\\n' + message['content'] + '\\n'}}\n        {%- else %}\n{{'### Response:\\n' + message['content'] + '\\n<|EOT|>\\n'}}\n        {%- endif %}\n    {%- endif %}\n{%- endfor %}\n{% if add_generation_prompt %}\n{{'### Response:'}}\n{% endif %}"
  }
}

💾

Est. VRAM Required

~23 GB

Estimation Formula


VRAM = params × 0.6 + 2 GB

Based on FP16 precision.

⚠️ Does not account for KV cache or parallel overhead.

📋 Estimate only. Actual requirements may vary.

🤗 Data Source: Hugging Face ↗

🔄 Daily sync (11:00 Beijing)

Based on open-source metadata snapshot. Last synced: Dec 31, 2025

📊 FNI Methodology 📚 Knowledge Baseℹ️ Verify with original source

🧠 Architecture Explorer

Neural network architecture

1 Input Layer

2 Hidden Layers

3 Attention

4 Output Layer

Parameters 33.34B

Learn about Transformers →

Technical Specifications

Parameters33.34

ArchitectureLlamaForCausalLM

View Config (3 entries)


{
  "architectures": [
    "LlamaForCausalLM"
  ],
  "model_type": "llama",
  "tokenizer_config": {
    "bos_token": {
      "__type": "AddedToken",
      "content": "<｜begin▁of▁sentence｜>",
      "lstrip": false,
      "normalized": true,
      "rstrip": false,
      "single_word": false
    },
    "eos_token": {
      "__type": "AddedToken",
      "content": "<|EOT|>",
      "lstrip": false,
      "normalized": true,
      "rstrip": false,
      "single_word": false
    },
    "pad_token": {
      "__type": "AddedToken",
      "content": "<｜end▁of▁sentence｜>",
      "lstrip": false,
      "normalized": true,
      "rstrip": false,
      "single_word": false
    },
    "unk_token": null,
    "chat_template": "{% if not add_generation_prompt is defined %}\n{% set add_generation_prompt = false %}\n{% endif %}\n{%- set ns = namespace(found=false) -%}\n{%- for message in messages -%}\n    {%- if message['role'] == 'system' -%}\n        {%- set ns.found = true -%}\n    {%- endif -%}\n{%- endfor -%}\n{{bos_token}}{%- if not ns.found -%}\n{{'You are an AI programming assistant, utilizing the Deepseek Coder model, developed by Deepseek Company, and you only answer questions related to computer science. For politically sensitive questions, security and privacy issues, and other non-computer science questions, you will refuse to answer\\n'}}\n{%- endif %}\n{%- for message in messages %}\n    {%- if message['role'] == 'system' %}\n{{ message['content'] }}\n    {%- else %}\n        {%- if message['role'] == 'user' %}\n{{'### Instruction:\\n' + message['content'] + '\\n'}}\n        {%- else %}\n{{'### Response:\\n' + message['content'] + '\\n<|EOT|>\\n'}}\n        {%- endif %}\n    {%- endif %}\n{%- endfor %}\n{% if add_generation_prompt %}\n{{'### Response:'}}\n{% endif %}"
  }
}

📝 Limitations & Considerations

• Benchmark scores may vary based on evaluation methodology and hardware configuration.
• VRAM requirements are estimates; actual usage depends on quantization and batch size.
• FNI scores are relative rankings and may change as new models are added.
⚠ License Unknown: Verify licensing terms before commercial use.
• Source: Huggingface

📚 Related Resources

📄 Related Papers

No related papers linked yet. Check the model's official documentation for research papers.

📊 Training Datasets

Training data information not available. Refer to the original model card for details.

🔗 Related Models

Data unavailable

Model Specifications

Parameters 33.34B

Architecture LlamaForCausalLM

Deploy Score 0%

🚀 Deployment Info

Difficulty

💎Expert

VRAM Required

~80 GB

Recommended Hardware

☁️ Multi-GPU or cloud A100/H100

Model Information Summary
Model Name	deepseek-coder-33b-instruct
Author	deepseek-ai
Type	Not specified
Downloads	0
Likes	556
Source	Hugging Face
Last Updated	December 31, 2025

Graph Overview

200 Models

460 Connections

Explore Full Graph →

🚀 What's Next?

📊

Find Training Datasets

Discover datasets compatible with this model

📈

Compare Benchmarks

See how this model ranks on standard tests

⚡

Learn About Deployment

Understand deployment options

Welcome to Free2AI Tools!

Smart Search

FNI Score

You're All Set!

Technical Specifications

🧠 Architecture Explorer

Technical Specifications

📝 Limitations & Considerations

📚 Related Resources

📄 Related Papers

📊 Training Datasets

🔗 Related Models

🚀 What's Next?

Find Training Datasets

Compare Benchmarks

Learn About Deployment