← Back to Models
Hunyuan-OCR
HunyuanOCR is an end-to-end OCR expert VLM developed by Tencent, featuring a lightweight 1B parameter design. It achieves state-of-the-art benchmarks in multilingual document parsing and excels in practical applications like text spotting, information extraction, video subtitle extraction, and photo translation.
Hunyuan-OCR
1B parameters
Base
Pull this model
Use the following command with the HoML CLI:
homl pull hunyuanocr:base
Resource Requirements
| Quantization | Disk Space | GPU Memory |
|---|---|---|
| BF16 | 2 GB | 2 GB |