Manual Installation
The following instructions assume an Ubuntu installation. The specific commands may differ on other distributions.
Requirements
- Java 25: required to run the application.
- GStreamer: required for audio recording (usually preinstalled; see the GStreamer installation docs otherwise).
- gdbus: required to trigger the keyboard shortcut (usually preinstalled, install libglib2.0-bin otherwise).
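To see which of these requirements are already present, a quick check along the following lines may help. This is a minimal sketch: `check_tool` is a hypothetical helper name, and it only tests whether a command is on the `PATH`. GStreamer is not covered because it is primarily a set of libraries rather than a single command.

```shell
#!/bin/sh
# check_tool (hypothetical helper): report whether a command is on PATH.
check_tool() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "ok: $1"
    else
        echo "missing: $1"
        return 1
    fi
}

# Count how many of the command-line requirements are absent.
missing=0
for tool in java gdbus; do
    check_tool "$tool" || missing=$((missing + 1))
done
echo "$missing required command(s) missing"
```

A tool reported as missing can be installed as described in the list above.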
On Ubuntu LTS, you can install Java 25 (current LTS) and make it the default with:
sudo apt install openjdk-25-jdk
sudo update-java-alternatives -s java-1.25.0-openjdk-amd64
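After switching the default, it can be worth confirming that the `java` on your `PATH` is really version 25 or newer. The sketch below assumes that `java -version` prints a line containing `version "<number>..."`, which is the usual OpenJDK format. `java_major` is a hypothetical helper name.

```shell
#!/bin/sh
# java_major (hypothetical helper): extract the major version number from
# `java -version` output, e.g. a line like: openjdk version "25.0.1" ...
java_major() {
    printf '%s\n' "$1" | sed -n 's/.*version "\([0-9][0-9]*\).*/\1/p' | head -n 1
}

if command -v java >/dev/null 2>&1; then
    # `java -version` writes to stderr, so redirect it to stdout.
    major=$(java_major "$(java -version 2>&1)")
    if [ "${major:-0}" -ge 25 ]; then
        echo "Java $major is the default"
    else
        echo "default Java is ${major:-unknown}; Java 25 is required"
    fi
else
    echo "java not found on PATH"
fi
```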
Download the app
Go to the GitHub releases page and download two files from the latest release:
- speedofsound.jar: the application
- trigger.sh: the script that triggers a recording from a keyboard shortcut (uses gdbus under the hood)
For example, download both files into $HOME/speedofsound/. Then make the trigger script executable:
chmod +x $HOME/speedofsound/trigger.sh
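Before moving on, you may want to confirm that both files are in place and that the trigger script is executable. The following is a sketch only: `check_install` is a hypothetical helper, and the directory is the example location used above.

```shell
#!/bin/sh
# check_install (hypothetical helper): verify that both release files exist
# in the given directory and that trigger.sh is executable.
check_install() {
    dir=$1
    status=0
    for f in speedofsound.jar trigger.sh; do
        if [ ! -f "$dir/$f" ]; then
            echo "missing: $dir/$f"
            status=1
        fi
    done
    if [ -f "$dir/trigger.sh" ] && [ ! -x "$dir/trigger.sh" ]; then
        echo "not executable: $dir/trigger.sh"
        status=1
    fi
    return $status
}

if check_install "$HOME/speedofsound"; then
    echo "installation looks complete"
fi
```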
Run the app
Launch the application like any other Java application:
java --enable-native-access=ALL-UNNAMED -jar $HOME/speedofsound/speedofsound.jar
(The --enable-native-access=ALL-UNNAMED flag lets Java GI access the GTK/GNOME libraries without warnings.)
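If you would rather not type the full command each time, it could be wrapped in a small shell function, for example in your shell profile. This is a sketch rather than something shipped with the app: `run_speedofsound` and the `APP_DIR` variable are hypothetical names, and the default path is the example download location.

```shell
# run_speedofsound (hypothetical helper): launch the app with the flag
# described above. Set APP_DIR to override the example install location.
run_speedofsound() {
    dir="${APP_DIR:-$HOME/speedofsound}"
    java --enable-native-access=ALL-UNNAMED -jar "$dir/speedofsound.jar"
}
```

With the function defined, running `run_speedofsound` starts the app from the default location, and `APP_DIR=/some/other/dir run_speedofsound` starts it from a different one.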
On the first launch, two things will happen:
- Model setup: The built-in Whisper Tiny model is unpacked into your user data folder. This happens automatically, and subsequent launches will start faster.
- Permissions prompt: The app will ask you to grant permission to type on your behalf. To support both X11 and Wayland desktops without requiring root access, Speed of Sound uses XDG Desktop Portals for keyboard input. You must approve this prompt for dictation to work.
Once permissions are granted, press your shortcut to start dictating and press it again to stop. The transcribed text will be typed into whatever application is currently focused.