This app demonstrates running speech recognition models on Android using ExecuTorch. It supports both Whisper and Parakeet model families.
| Model | Type | Details |
|---|---|---|
| Whisper Tiny/Small/Medium (INT8/INT4) | Streaming | Requires model, tokenizer, and preprocessor |
| Whisper Tiny/Small/Medium (FP32) | Streaming | Requires model, tokenizer, and preprocessor |
| Parakeet TDT 0.6B (INT4) | Synchronous | Requires model and tokenizer |
- Whisper: Follow the instructions at https://github.com/pytorch/executorch/tree/main/examples/models/whisper
- Parakeet: Follow the instructions at https://github.com/pytorch/executorch/tree/main/examples/models/parakeet
- Open AsrApp in Android Studio
- Copy the
executorch.aarlibrary (with ASR and Parakeet JNI bindings) intoapp/libs/ - Build and run on device
The app includes a built-in download screen to fetch models from HuggingFace. Alternatively, push files manually:
adb push model.pte /data/local/tmp/asr/
adb push tokenizer.json /data/local/tmp/asr/
adb push whisper_preprocessor.pte /data/local/tmp/asr/ # Whisper only- Whisper: Click to start recording; automatically stops after 30 seconds
- Parakeet: Click to start recording; click again to stop (no time limit)