AudioCraft favicon

AudioCraft
A Single-Stop Code Base for Generative Audio Needs

What is AudioCraft?

Developed by Meta AI, AudioCraft simplifies the overall design of generative models for audio. It provides a comprehensive solution for music, sound effects, and compression after training on raw audio signals. The framework includes MusicGen and AudioGen, which consist of a single autoregressive Language Model (LM) operating over streams of compressed discrete music representation (tokens).

AudioCraft leverages the EnCodec neural audio codec. This codec learns discrete audio tokens from the raw waveform, mapping the audio signal to parallel streams of discrete tokens. A single autoregressive language model then recursively models these audio tokens. Finally, generated tokens are fed back to the EnCodec decoder to reconstruct the output waveform. Different conditioning models, like pretrained text encoders, are used for text-to-audio control.

Features

  • MusicGen: Produces diverse and long music samples from user-provided text inputs.
  • AudioGen: Generates audio from environmental sounds based on text inputs.
  • EnCodec: Neural audio codec that learns discrete audio tokens from raw waveforms.
  • Autoregressive Language Model (LM): Recursively models audio tokens from EnCodec for efficient audio sequence modeling.
  • Token Interleaving Pattern: Models audio sequences while capturing long-term dependencies to generate high-quality audio.

Use Cases

  • Text-to-music generation
  • Text-to-sound generation
  • Audio compression
  • Audio research

Related Tools:

Blogs:

  • Top AI tools for Teachers

    Top AI tools for Teachers

    Explore the top AI tools designed for teachers, revolutionizing the education landscape. These innovative tools leverage artificial intelligence to enhance teaching efficiency, personalize learning experiences, automate administrative tasks, and provide valuable insights, empowering educators to create engaging and effective educational environments.

  • Long Videos into Viral Shorts

    Long Videos into Viral Shorts

    Klap.app is an AI-powered video editing tool that transforms long-form videos into engaging short clips optimized for platforms like TikTok, Instagram Reels, and YouTube Shorts

  • Boost Engagement in Ads with AI

    Boost Engagement in Ads with AI

    Discover how AI music and AI SDR agents are reshaping modern advertising. Learn how emotional resonance through AI-generated soundtracks combined with smart, automated sales outreach can turn viewers into loyal customers faster, cheaper, and more personally than ever before.

  • Top 6 AI note-taking tools for 2026: in-person, online, and hybrid use cases

    Top 6 AI note-taking tools for 2026: in-person, online, and hybrid use cases

    Most AI note-taking lists are really lists of meeting bots, which join your video call and transcribe it. That's useful, but it's half the picture. Decisions happen in hallway conversations, client dinners, on-site visits, and hybrid rooms where nobody is on a video link. This guide covers different parts of the note-taking workflow: hardware capture for in-person settings, platform-native tools for online calls, and AI layers for organizing and synthesizing what you've captured. It compares six tools by capture context, workflow fit, pricing, and limitations.

Didn't find tool you were looking for?

Be as detailed as possible for better results