You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* WIP response format implementation for audio transcriptions
(cherry picked from commit e271dd7)
Signed-off-by: Andres Smith <andressmithdev@pm.me>
* Rework transcript response_format and add more formats
(cherry picked from commit 6a93a8f)
Signed-off-by: Andres Smith <andressmithdev@pm.me>
* Add test and replace go-openai package with official openai go client
(cherry picked from commit f25d1a0)
Signed-off-by: Andres Smith <andressmithdev@pm.me>
* Fix faster-whisper backend and refactor transcription formatting to also work on CLI
Signed-off-by: Andres Smith <andressmithdev@pm.me>
(cherry picked from commit 69a9397)
Signed-off-by: Andres Smith <andressmithdev@pm.me>
---------
Signed-off-by: Andres Smith <andressmithdev@pm.me>
Co-authored-by: nanoandrew4 <nanoandrew4@gmail.com>
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
Filenamestring`arg:"" name:"file" help:"Audio file to transcribe" type:"path"`
18
23
19
-
Backendstring`short:"b" default:"whisper" help:"Backend to run the transcription model"`
20
-
Modelstring`short:"m" required:"" help:"Model name to run the TTS"`
21
-
Languagestring`short:"l" help:"Language of the audio file"`
22
-
Translatebool`short:"c" help:"Translate the transcription to english"`
23
-
Diarizebool`short:"d" help:"Mark speaker turns"`
24
-
Threadsint`short:"t" default:"1" help:"Number of threads used for parallel computation"`
25
-
ModelsPathstring`env:"LOCALAI_MODELS_PATH,MODELS_PATH" type:"path" default:"${basepath}/models" help:"Path containing models used for inferencing" group:"storage"`
26
-
Promptstring`short:"p" help:"Previous transcribed text or words that hint at what the model should expect"`
24
+
Backendstring`short:"b" default:"whisper" help:"Backend to run the transcription model"`
25
+
Modelstring`short:"m" required:"" help:"Model name to run the TTS"`
26
+
Languagestring`short:"l" help:"Language of the audio file"`
27
+
Translatebool`short:"c" help:"Translate the transcription to English"`
28
+
Diarizebool`short:"d" help:"Mark speaker turns"`
29
+
Threadsint`short:"t" default:"1" help:"Number of threads used for parallel computation"`
30
+
BackendsPathstring`env:"LOCALAI_BACKENDS_PATH,BACKENDS_PATH" type:"path" default:"${basepath}/backends" help:"Path containing backends used for inferencing" group:"storage"`
31
+
ModelsPathstring`env:"LOCALAI_MODELS_PATH,MODELS_PATH" type:"path" default:"${basepath}/models" help:"Path containing models used for inferencing" group:"storage"`
32
+
BackendGalleriesstring`env:"LOCALAI_BACKEND_GALLERIES,BACKEND_GALLERIES" help:"JSON list of backend galleries" group:"backends" default:"${backends}"`
33
+
Promptstring`short:"p" help:"Previous transcribed text or words that hint at what the model should expect"`
34
+
ResponseFormat schema.TranscriptionResponseFormatType`short:"f" default:"" help:"Response format for Whisper models, can be one of (txt, lrc, srt, vtt, json, json_verbose)"`
35
+
PrettyPrintbool`help:"Used with response_format json or json_verbose for pretty printing"`
0 commit comments