Skip to content

What is the proper way to download the Whisper CoreML models and the tokenizer model? #339

@Kavi-Gupta

Description

@Kavi-Gupta

Hi I am trying to use WhisperKit completely offline, I would like to bundle models with the app, instead of relying on a network connection. I have tried downloading the model files from the HuggingFace repository. But, regardless of which model I download, I always get the error:

MelSpectrogram.mlmodelc/coremldata.bin is not a valid .mlmodelc file because the first word (1936876918) is not recognizable.
Unable to load model: ... Compile the model with Xcode or `MLModel.compileModel(at:)`.

How do I go about fixing this? Also, does each model have a separate tokenizer? And, where do I get the tokenizer model(s) from?

Thanks.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions