Voice KKuTu - Team Project for Embedded Software Lecture
rye is required for run this project easily.
Required:
# 1. Fetch KKuTu Repository
git module init
git module update
# 2. Install dependencies
rye sync
# 3. Convert word database from KKuTu DB*
rye run convert
# 4. Download and convert whisper model**
rye run make-model* <Convert word data from KKuTu DB> Section.
** <Download and convert Whisper model or just download from Hugging Face> Section.
(Optional) When you clone for developing:
# Install pre commit script for linting
rye run pre-commitIf below commands are not executed sucessfully, Try rye sync first.
rye run app
rye run convert
rye run clean
rye run black
rye run pre-commit
rye run make-model
rye run sttThis repository contains KKuTu repository as submodule.
Word data for KKuTu server can be migrated for this project with converting script.
git submodule init
git submodule update
rye run convertrye run make-model
rye run make-model --model-size=modelThere are converted model at Hugging Face. You can run application just downloading models from there.
git lfs install
git clone https://huggingface.co/ShapeLayer/embedded-voice-kkutu-models modelsNote
If you use a model other than the default model, you must pass parameters while running the app.
rye run app --model=model| Model | model value |
|---|---|
| Tiny | tiny |
| (en) Tiny | tiny.en |
| Base | base |
| (en) Base | base.en |
| Small | small |
| (en) Small | small.en |
| Medium | medium |
| (en) Medium | medium.en |
| Large-v1 | large-v1 |
| Large-v2 | large-v2 |
| Large-v3 | large-v3 |
| Large-v3-turbo | large-v3-turbo |
| Turbo | turbo |
| Distil-large-v2 | distil-large-v2 |
| Distil-large-v3 | distil-large-v3 |
| (en) Distil-medium | distil-medium.en |
| (en) Distil-small | distil-small.en |
[stderr]
src/pyaudio/device_api.c:9:10: fatal error: 'portaudio.h' file not found
9 | #include "portaudio.h"
| ^~~~~~~~~~~~~
1 error generated.
error: command '/usr/bin/clang' failed with exit code 1
hint: This error likely indicates that you need to install a library that provides "portaudio.h" for `pyaudio@0.2.14`
Install portaudio
sudo apt-get install portaudio19-dev To install PyTorch on a Raspberry Pi, you need to install the CPU-only version. You can do so using the following command:
pip install torch torchvision torchaudio --index-url <https://download.pytorch.org/whl/cpu>
# Enable auido (loads snd_bcm2835)
dtparam=audio=onRemove comments from boot/config.txt
sudo /etc/init.d/alsa-utils reset
sudo rebootsudo raspi-configSystem Options > Audio > [choose the audio output]
Run TTS
sudo apt-get install espeak
espeak "hello"