โ All stories
AI Audio Scene Generation Advances
Researchers propose ScenA, a method to generate multi-speaker audio scenes from reference voices and natural language prompts, leveraging a text-to-audio flow-matching model trained on in-the-wild data.
One continuously updated timeline instead of dozens of separate articles. New developments are appended as the story evolves.
- BenchmarkJun 17, 2026, 05:51 PM 95%
Researchers propose ScenA, a flow-matching model for generating realistic multi-speaker audio scenes from reference voices and natural language prompts.
Researchers propose ScenA, a method to generate multi-speaker audio scenes from reference voices and natural language prompts, leveraging a text-to-audio flow-matching model trained on in-the-wild data.
Read the full story โ