Qwen2-Audio: Chat with Your Voice!

qwenlm.github.io ReleasesMultimodalResearch Qwen 1 min read

DEMO PAPER GITHUB HUGGING FACE MODELSCOPE DISCORD To achieve the objective of building an AGI system, the model should be capable of understanding information from different modalities. Thanks to the rapid development of large language models, LLMs are now capable of understanding language and reasoning. Previously we have taken a step forward to extend our LLM, i.e., Qwen, to more modalities, including vision and audio, and built Qwen-VL and Qwen-Audio. Today, we release Qwen2-Audio, the next v

Read the original on qwenlm.github.io

AI News Hub links to primary sources. This page shows the publisher's own title and excerpt with a link to the full article. We point you at the news; we don't rewrite it.