MusicGPT

🎵 Advanced AI Music Generation with Melody Conditioning, Infinite Streams, and Premium Features

Generate music based on natural language prompts using LLMs running locally with GPU acceleration.

MusicGPT.demo.mp4

☝️ Turn up the volume!

Overview

MusicGPT is a premium music generation application that runs the latest AI music models locally with exceptional performance. No Python, no heavy ML frameworks—just pure, optimized music generation on any platform.

Powered by MusicGen by Meta, MusicGPT brings professional-grade music generation to your fingertips with groundbreaking features like melody conditioning, infinite audio streams, and advanced tuning controls.

✨ Key Features

✅ Text-conditioned music generation - Natural language prompts
✅ Melody-conditioned generation - Upload audio files (WAV/MP3) to influence the output
✅ Infinite music streams - Continuous, never-ending audio generation
✅ Microphone recording - Record melodies directly in the browser
✅ Advanced tuning controls - Temperature, Top-K, and Guidance Scale parameters
✅ Multiple output formats - Export as WAV or MP3
✅ GPU acceleration - CUDA support for blazing-fast inference
✅ Chat-based UI - Premium web interface with history and playback
✅ Multiple models - Small, Medium, Large, and Melody variants

Install

Mac and Linux

MusicGPT can be installed on Mac and Linux using brew:

brew install gabotechs/taps/musicgpt

Windows

Download and install MusicGPT's executable file following this link.

All platforms

Precompiled binaries are available for the following platforms:

Just downloading them and executing them should be enough.

Docker (Recommended for CUDA)

If you want to run MusicGPT with a CUDA-enabled GPU, Docker is the best option. You only need basic NVIDIA drivers installed.

docker pull gabotechs/musicgpt

Once the image is downloaded, run it with:

docker run -it --gpus all -p 8642:8642 -v ~/.musicgpt:/root/.local/share/musicgpt gabotechs/musicgpt --gpu --ui-expose

With cargo

If you have the Rust toolchain installed:

cargo install musicgpt --features cuda

Usage

MusicGPT offers two interaction modes: UI mode and CLI mode.

UI Mode (Recommended)

Launch the premium web interface for the best experience:

musicgpt

The UI provides:

🎨 Premium chat interface with modern design
📝 Persistent chat history across sessions
🎵 In-browser audio playback with controls
🎤 Microphone recording for melody conditioning
🎛️ Advanced tuning controls (Temperature, Top-K, Guidance Scale)
📦 Multiple export formats (WAV, MP3)
♾️ Infinite stream mode for continuous generation
🚀 Background processing for smooth UX

Advanced Options

Choose models and enable GPU:

musicgpt --gpu --model melody

For infinite streaming with melody conditioning:

musicgpt --gpu --model melody --ui-expose

Tip

Use --ui-expose to access the UI from other devices on your network

With Docker (CUDA)

docker run -it --gpus all -p 8642:8642 -v ~/.musicgpt:/root/.local/share/musicgpt gabotechs/musicgpt --ui-expose --gpu --model melody

CLI Mode

Generate music directly in the terminal:

Basic Generation

musicgpt "Create a relaxing LoFi song"

With Melody Conditioning

musicgpt "Epic orchestral music" --melody my_melody.mp3

Infinite Streaming

musicgpt "Ambient soundscape" --secs 0

Advanced Tuning

musicgpt "Jazz fusion" --secs 30 --model medium --temperature 1.5 --top-k 300

All CLI Options

musicgpt --help

Available options:

--model - Choose model size (small, medium, large, melody)
--melody - Path to melody audio file (WAV/MP3)
--secs - Duration in seconds (0 for infinite)
--temperature - Sampling temperature (0.1-2.0)
--top-k - Top-K sampling (1-500)
--guidance-scale - Classifier-free guidance (1.0-10.0)
--output-format - Export format (wav, mp3)
--gpu - Enable CUDA acceleration

Warning

Larger models require significant RAM and GPU memory

New Features

🎼 Melody Conditioning

Upload any audio file (WAV or MP3) to condition the generation:

In UI:

Click the upload icon
Select your melody file
Generate music that follows your melody's structure

In CLI:

musicgpt "Energetic rock song" --melody guitar_riff.mp3

♾️ Infinite Streams

Generate continuous, never-ending music:

In UI:

Toggle the "Inf" checkbox
Start generation for unlimited duration

In CLI:

musicgpt "Continuous ambient music" --secs 0

🎤 Microphone Recording

Record melodies directly in your browser:

Click the microphone icon
Record your melody
Stop recording (it auto-attaches)
Generate music based on your recording

🎛️ Advanced Tuning

Fine-tune generation with professional controls:

Temperature (0.1-2.0): Controls randomness
- Lower = More consistent
- Higher = More creative
Top-K (1-500): Sampling diversity
- Lower = More focused
- Higher = More varied
Guidance Scale (1.0-10.0): Prompt adherence
- Lower = More freedom
- Higher = Stricter prompt following

📦 Multiple Formats

Export your compositions as:

WAV - Lossless quality
MP3 - Compressed for sharing

Benchmarks

The following graph shows inference time for generating 10 seconds of audio using different models on a Mac M1 Pro, compared to Python/transformers:

Command used:

musicgpt '80s pop track with bassy drums and synth'

Storage

MusicGPT stores models, generated audio, and metadata locally:

Windows: C:\Users\foo\AppData\Roaming\gabotechs\musicgpt
MacOS: /Users/foo/Library/Application Support/com.gabotechs.musicgpt
Linux: /home/foo/.config/musicgpt

Technical Details

Architecture

Backend: Rust with ONNX Runtime for optimal performance
Frontend: React + TypeScript with premium UI components
Audio Processing: Symphonia (decoding) + Lame (MP3 encoding)
GPU: CUDA support via ONNX Runtime

Model Support

Model	Size	Quality	Speed	Melody
Small	~1.5GB	Good	Fast	❌
Medium	~3GB	Better	Moderate	❌
Large	~6GB	Best	Slow	❌
Melody	~1.5GB	Good	Fast	✅

Contributing

Contributions are welcome! This fork includes:

Melody conditioning with MP3 support
Infinite streaming with sliding window
Microphone input
Advanced tuning parameters
MP3 export
Modern UI with premium features

License

The code is licensed under a MIT License, but the AI model weights are licensed under CC-BY-NC-4.0 License.

Model sources:

Made with ❤️ by Emmanuel Djagbley

Name		Name	Last commit message	Last commit date
Latest commit History 317 Commits
.cargo		.cargo
.github		.github
assets		assets
src		src
web		web
.dockerignore		.dockerignore
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
RELEASE_NOTES.md		RELEASE_NOTES.md
WEIGHT_STORAGE_GUIDE.md		WEIGHT_STORAGE_GUIDE.md
build.rs		build.rs
setup_cuda_compat.sh		setup_cuda_compat.sh

Folders and files

Latest commit

History

Repository files navigation

MusicGPT

Overview

✨ Key Features

Install

Mac and Linux

Windows

All platforms

Docker (Recommended for CUDA)

With cargo

Usage

UI Mode (Recommended)

Advanced Options

With Docker (CUDA)

CLI Mode

Basic Generation

With Melody Conditioning

Infinite Streaming

Advanced Tuning

All CLI Options

New Features

🎼 Melody Conditioning

♾️ Infinite Streams

🎤 Microphone Recording

🎛️ Advanced Tuning

📦 Multiple Formats

Benchmarks

Storage

Technical Details

Architecture

Model Support

Contributing

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages