Stack Explorer

Qwen Audio

multimodal llm

Alibaba's multimodal model for audio processing

Official site

Pros and Cons

Ventajas

  • + Advanced audio capabilities
  • + Multilingual
  • + Open source

Desventajas

  • - Documentation mainly in Chinese
  • - Smaller community outside Asia

Casos de Uso

  • Audio transcription
  • Audio analysis
  • Voice assistants