Skip to content

Installation Notes

A quick run down on the options available for whisper-asr-webservice.

The main service runs a Interactive Swagger API documentation is available at <http://truenas.local:19900/docs>


The ASR Model has the following values:

ModelRequired VRAMRelative speed
tiny~1 GB~32x
base~1 GB~16x
small~2 GB~6x
medium~5 GB~2x
large~10 GB~1x

Default is Base.


The ASR Engine is default to Faster Whisper, explained here.

A list of Engines available.

Faster Whisper
OpenAI Whisper


The ASR model is downloaded each time you start the container, using the large model this can take some time. If you want to decrease the time it takes to start your container by skipping the download, you can store the cache directory (/root/.cache/whisper) to an persistent storage. Next time you start your container the ASR Model will be taken from the cache instead of being downloaded again. Important this will prevent you from receiving any updates to the models.

You can set the pre-persisted mount whisper as emptyDir and set it as default or memory if you have the ram for it.