Qwen2
Qwen2 is a series of large language models from Alibaba Cloud. The models are Transformer-based and use SwiGLU activation, attention QKV bias, and grouped query attention. They perform strongly in language understanding, generation, multilingual tasks, coding, mathematics, and reasoning.
qwen2-1.5b
Pull this model
Use the following command with the HoML CLI:
homl pull qwen2:1.5b
Resource Requirements
Quantization | Disk Space | GPU Memory |
---|---|---|
BF16 | 3 GB | 3 GB |
qwen2-1.5b-instruct
Pull this model
Use the following command with the HoML CLI:
homl pull qwen2:1.5b-instruct
Resource Requirements
Quantization | Disk Space | GPU Memory |
---|---|---|
BF16 | 3 GB | 3 GB |
qwen2-1.5b-instruct-8bit
Pull this model
Use the following command with the HoML CLI:
homl pull qwen2:1.5b-instruct-8bit
Resource Requirements
Quantization | Disk Space | GPU Memory |
---|---|---|
8-bit | 1.5 GB | 1.5 GB |
qwen2-1.5b-instruct-4bit
Pull this model
Use the following command with the HoML CLI:
homl pull qwen2:1.5b-instruct-4bit
Resource Requirements
Quantization | Disk Space | GPU Memory |
---|---|---|
4-bit | 0.75 GB | 0.75 GB |
qwen2-7b
Pull this model
Use the following command with the HoML CLI:
homl pull qwen2:7b
Resource Requirements
Quantization | Disk Space | GPU Memory |
---|---|---|
BF16 | 14 GB | 14 GB |
qwen2-7b-instruct
Pull this model
Use the following command with the HoML CLI:
homl pull qwen2:7b-instruct
Resource Requirements
Quantization | Disk Space | GPU Memory |
---|---|---|
BF16 | 14 GB | 14 GB |
qwen2-7b-instruct-8bit
Pull this model
Use the following command with the HoML CLI:
homl pull qwen2:7b-instruct-8bit
Resource Requirements
Quantization | Disk Space | GPU Memory |
---|---|---|
8-bit | 7 GB | 7 GB |
qwen2-7b-instruct-4bit
Pull this model
Use the following command with the HoML CLI:
homl pull qwen2:7b-instruct-4bit
Resource Requirements
Quantization | Disk Space | GPU Memory |
---|---|---|
4-bit | 3.5 GB | 3.5 GB |
qwen2-72b
Pull this model
Use the following command with the HoML CLI:
homl pull qwen2:72b
Resource Requirements
Quantization | Disk Space | GPU Memory |
---|---|---|
BF16 | 144 GB | 144 GB |
qwen2-72b-instruct
Pull this model
Use the following command with the HoML CLI:
homl pull qwen2:72b-instruct
Resource Requirements
Quantization | Disk Space | GPU Memory |
---|---|---|
BF16 | 144 GB | 144 GB |
qwen2-72b-instruct-8bit
Pull this model
Use the following command with the HoML CLI:
homl pull qwen2:72b-instruct-8bit
Resource Requirements
Quantization | Disk Space | GPU Memory |
---|---|---|
8-bit | 72 GB | 72 GB |
qwen2-72b-instruct-4bit
Pull this model
Use the following command with the HoML CLI:
homl pull qwen2:72b-instruct-4bit
Resource Requirements
Quantization | Disk Space | GPU Memory |
---|---|---|
4-bit | 36 GB | 36 GB |
qwen2-57b-a14b
Pull this model
Use the following command with the HoML CLI:
homl pull qwen2:57b-a14b
Resource Requirements
Quantization | Disk Space | GPU Memory |
---|---|---|
BF16 | 114 GB | 114 GB |
qwen2-57b-a14b-instruct
Pull this model
Use the following command with the HoML CLI:
homl pull qwen2:57b-a14b-instruct
Resource Requirements
Quantization | Disk Space | GPU Memory |
---|---|---|
BF16 | 114 GB | 114 GB |
qwen2-57b-a14b-instruct-8bit
Pull this model
Use the following command with the HoML CLI:
homl pull qwen2:57b-a14b-instruct-8bit
Resource Requirements
Quantization | Disk Space | GPU Memory |
---|---|---|
8-bit | 57 GB | 57 GB |
qwen2-57b-a14b-instruct-4bit
Pull this model
Use the following command with the HoML CLI:
homl pull qwen2:57b-a14b-instruct-4bit
Resource Requirements
Quantization | Disk Space | GPU Memory |
---|---|---|
4-bit | 28.5 GB | 28.5 GB |
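
Estimating resource requirements
The figures in the tables above match a simple rule of thumb: nominal parameter count multiplied by the bytes stored per weight (2 for BF16, 1 for 8-bit, 0.5 for 4-bit). The sketch below reproduces that arithmetic. The function name `estimate_weights_gb` and the `BITS_PER_WEIGHT` mapping are illustrative and not part of HoML, and the result covers the weights only. Actual GPU usage at inference time is likely to be higher once the KV cache and activations are included.

```python
# Bits stored per weight for each quantization listed in the tables above.
BITS_PER_WEIGHT = {"BF16": 16, "8-bit": 8, "4-bit": 4}


def estimate_weights_gb(params_billions: float, quantization: str) -> float:
    """Approximate weight size in GB (1 GB = 10^9 bytes).

    Computed as parameters x bytes per weight. This is a weights-only
    estimate; it ignores KV cache, activations, and runtime overhead.
    """
    bits = BITS_PER_WEIGHT[quantization]
    return params_billions * bits / 8


if __name__ == "__main__":
    # Nominal sizes in billions of parameters. The 57b-a14b model is a
    # mixture-of-experts model, so the full 57B is used here because all
    # experts must be stored, even though fewer are active per token.
    for size in (1.5, 7, 72, 57):
        for quant in BITS_PER_WEIGHT:
            print(f"{size}B {quant}: {estimate_weights_gb(size, quant):g} GB")
```

Treat the result as a lower bound when choosing a GPU, and leave headroom for long contexts or large batch sizes.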