Lisper ZeroGPU

Server-side Gemma 4 audio analysis for users whose browser cannot comfortably run the WebGPU model.

The currently validated fine-tuned Lisper model is Gemma 4 E2B. E4B and 31B are future model targets and should be deployed as separate revisions after training/eval.

Browser recorder

No browser recording ready. This path bypasses Gradio's microphone recorder.

Upload fallback

No uploaded clip ready. Use the browser recorder above, or upload an audio file here.

Configured model: thomasjvu/lisper-gemma4-e2b-audio-full

Configured adapter: none

Adapter 4-bit load: False

Acoustic hint: True

Audio token alignment: True

ZeroGPU size: large

If this Space errors on private or gated models, add HF_TOKEN as a Space secret. For local development without downloading the model, set LISPER_ZERO_GPU_EAGER_LOAD=0.