Runtime Speech Recognizer
Documentation for the Runtime Speech Recognizer plugin.
- Get it on Fab
- Product website
- Download Demo (Windows)
- Video tutorial
- Plugin Support & Custom Development: solutions@georgy.dev (tailored solutions for teams & organizations)
Overview
Runtime Speech Recognizer Documentation
How to use the plugin
The Runtime Speech Recognizer plugin is designed to recognize words from incoming audio data. It uses a slightly modified version of whisper.cpp to work with the engine. To use the plugin, follow these steps:
How to use language models
Selecting, downloading and packaging models
Recognition parameter list
These parameters can only be set while the recognizer is not running.
Supported languages
This is the complete list of languages supported by the available language models.
Commands recognition
Compute Levenshtein Similarity
Minimizing freezes
This guide addresses two common sources of freezing in the RuntimeSpeechRecognizer plugin and provides practical solutions to mitigate the performance impact.
Troubleshooting
Language Model Staging Issues
Demo project
Packaged Demo Project for Windows.
How to improve performance
The plugin uses different GPU acceleration methods depending on the platform