Tutorial - Whisper
Let's run OpenAI's Whisper, pre-trained model for automatic speech recognition on Jetson!
What you need
One of the following Jetson:
Jetson AGX Orin 64GB Jetson AGX Orin (32GB) Jetson Orin Nano Orin (8GB)
Running one of the following JetPack.5x
JetPack 5.1.2 (L4T r35.4.1) JetPack 5.1.1 (L4T r35.3.1) JetPack 5.1 (L4T r35.2.1)
Sufficient storage space (preferably with NVMe SSD).
- Space for checkpoints
Clone and set up
git clone https://github.com/dusty-nv/jetson-containers cd jetson-containers sudo apt update; sudo apt install -y python3-pip pip3 install -r requirements.txt
How to start
autotag script to automatically pull or build a compatible container image.
cd jetson-containers ./run.sh $(./autotag whisper)
The container has a default run command (
CMD) that will automatically start the Jupyter Lab server, with SSL enabled.
Open your browser and access
Note it is
HTTPS (SSL) connection is needed to allow
ipywebrtc widget to have access to your microphone (for
You will see a warning message like this.
Press "Advanced" button and then click on "Proceed to
The default password for Jupyter Lab is
Run Jupyter notebooks
Whisper repo comes with demo Jupyter notebooks, which you can find under
jetson-containers also adds one convenient notebook (
record-and-transcribe.ipynb) to record your audio sample on Jupyter notebook in order to run transcribe on your recorded audio.
This notebook is to let you record your own audio sample using your PC's microphone and apply Whisper's
medium model to transcribe the audio sample.
It uses Jupyter notebook/lab's
ipywebrtc extension to record an audio sample on your web browser.
When you click the ⏺ botton, your web browser may show a pop-up to ask you to allow it to use your microphone. Be sure to allow the access.
Once done, if you click on the "⚠ Not secure" part in the URL bar, you should see something like this.
Once you go through all the steps, you should see the transcribe result in text like this.