[audio.cpp] The Sound of GGML — C++/GGML native ACE-Step, Stable Audio, HeartMuLa, RoFormer, HTDemucs released. 10-Minute Music in 60 Seconds!
A major update to audio.cpp adds native support for ACE-Step, Stable Audio, HeartMuLa, RoFormer, and HTDemucs models for music generation, SFX creation, and audio source separation.
![[audio.cpp] The Sound of GGML — C++/GGML native ACE-Step, Stable Audio, HeartMuLa, RoFormer, HTDemucs released. 10-Minute Music in 60 Seconds!](https://images.weserv.nl/?url=preview.redd.it%2Fyxa9dlzquxah1.png%3Fwidth%3D140%26height%3D64%26auto%3Dwebp%26s%3Ddc8fd781446c0ff28129cb015349bd508fc464fe&w=1200&fit=inside&q=72&output=webp&dpr=2&we=1&il=1)
- ›ACE-Step 1.5 Turbo/Base, HeartMuLa, Stable Audio 3 Small/Medium, RoFormer, and HTDemucs are now natively supported in audio.cpp.
- ›The update enables local music generation, SFX creation, and audio source separation using GGML for efficient inference.
- ›A 10-minute music generation task can be completed in 60 seconds on compatible hardware.
- ›Developers can now run audio AI models entirely offline in C++ without external API dependencies.
The audio.cpp framework has received a significant expansion with native support for multiple audio and music generation models under the GGML ecosystem. This update introduces ACE-Step 1.5 Turbo and Base versions, HeartMuLa, Stable Audio 3 Small and Medium variants, RoFormer, and HTDemucs, enabling users to generate music, sound effects, and perform audio source separation directly within the framework.
The integration leverages GGML's efficient inference capabilities, allowing for faster processing and reduced computational overhead. Users can now run these models locally in C++ without relying on external APIs, making it ideal for developers focused on audio AI applications. The release also includes optimizations for real-time audio tasks, including a 10-minute music generation demo that completes in just 60 seconds on supported hardware.
This expansion broadens audio.cpp's utility beyond traditional LLM workloads, positioning it as a versatile tool for audio AI research and production.
Source: [audio.cpp] The Sound of GGML — C++/GGML native ACE-Step, Stable Audio, HeartMuLa, RoFormer, HTDemucs released. 10-Minute Music in 60 Seconds!. Read the full piece at the source.
Enables local, efficient audio AI model inference in C++ with GGML-native support.
Reduces dependency on cloud APIs for audio generation and processing tasks.
Provides an accessible framework for experimenting with audio AI models locally.
Expands the capabilities of GGML-based tools into the audio domain.
- GGML
- An open-source framework for efficient inference of large language and audio models.
- SFX
- Sound effects used in media production.
- Source separation
- The process of isolating individual audio sources from a mixed audio signal.
Summary and analysis generated by AI (mistral). Always verify against the original sources.

Meet WebBrain: An Open-Source, Local-First AI Browser Agent That Reads Pages and Automates Tasks in Chrome and Firefox

Meet Alibaba’s Page Agent: A JavaScript In-Page GUI Agent That Controls Web Interfaces With Natural Language Through the DOM
