return to table of content

Moshi: A speech-text foundation model for real time dialogue