HoML Logo

HoML

← Back to Models

Hunyuan-OCR

HunyuanOCR is an end-to-end OCR expert VLM developed by Tencent, featuring a lightweight 1B parameter design. It achieves state-of-the-art benchmarks in multilingual document parsing and excels in practical applications like text spotting, information extraction, video subtitle extraction, and photo translation.

Hunyuan-OCR

1B parameters Base

Pull this model

Use the following command with the HoML CLI:

homl pull hunyuanocr:base

Resource Requirements

Quantization Disk Space GPU Memory
BF16 2 GB 2 GB