## Key Features

- **Ollama-like Experience:** A simple, intuitive CLI that just works, inspired by Ollama.
- **High-Performance Inference:** Powered by vLLM for maximum speed and throughput.
- **Automatic GPU Memory Management:** Models load on demand and unload automatically after a configurable idle timeout (10 minutes by default), freeing your GPU for other tasks.
- **OpenAI-Compatible API:** Integrate seamlessly with your existing tools and workflows.
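Because the API is OpenAI-compatible, any OpenAI client can talk to a running model. Below is a minimal sketch that builds a standard chat-completions request using only the Python standard library; the host and port in `BASE_URL` are placeholders, not homl defaults, so use the address your homl server actually reports.

```python
import json
import urllib.request

# Placeholder local address: homl exposes an OpenAI-compatible API, but the
# host/port here are assumptions -- use the address your homl server reports.
BASE_URL = "http://localhost:8000/v1"

# A standard OpenAI chat-completions request body.
payload = {
    "model": "qwen3:0.6b",
    "messages": [{"role": "user", "content": "Say hello in one sentence."}],
}

req = urllib.request.Request(
    f"{BASE_URL}/chat/completions",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)

# Uncomment once a model is running (e.g. after `homl run qwen3:0.6b`):
# with urllib.request.urlopen(req) as resp:
#     reply = json.load(resp)
#     print(reply["choices"][0]["message"]["content"])
```

The same endpoint works with existing OpenAI SDKs by pointing their base URL at the local server, so tools built against the OpenAI API need no code changes.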
## Frequently Used Commands

### Pull a model from Hugging Face Hub

Download a model to your local machine. You can use a shorthand alias for curated models.

```shell
homl pull qwen3:0.6b
```

### Run a model

Run a downloaded model. This starts the model and makes it available for chat and API access.

```shell
homl run qwen3:0.6b
```

### Run a model in interactive chat mode

Start a conversation with a model.

```shell
homl chat qwen3:0.6b
```