Close Menu
  • Home
  • AI News
  • AI Startups
  • Deep Learning
  • Interviews
  • Machine-Learning
  • Robotics

Subscribe to Updates

Get the latest creative news from FooBar about art, design and business.

What's Hot

Chaos Audio Launches Nimbus, an AI-Powered Open-Platform Amp for Whole Artistic Freedom

October 17, 2025

AGII Provides Actual-Time Studying Methods to Enhance Blockchain Intelligence and Reliability

October 17, 2025

Colle AI Integrates Clever Automation Engines to Enhance NFT Manufacturing Effectivity

October 17, 2025
Facebook X (Twitter) Instagram
Smart Homez™
Facebook X (Twitter) Instagram Pinterest YouTube LinkedIn TikTok
SUBSCRIBE
  • Home
  • AI News
  • AI Startups
  • Deep Learning
  • Interviews
  • Machine-Learning
  • Robotics
Smart Homez™
Home»Deep Learning»MAGNeT: A Masked Generative Sequence AI Modeling Technique that Operates Instantly Over A number of Streams of Audio Tokens and 7x Quicker than the Autoregressive Baseline
Deep Learning

MAGNeT: A Masked Generative Sequence AI Modeling Technique that Operates Instantly Over A number of Streams of Audio Tokens and 7x Quicker than the Autoregressive Baseline

By January 13, 2024Updated:January 13, 2024No Comments3 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Reddit WhatsApp Email
MAGNeT: A Masked Generative Sequence AI Modeling Technique that Operates Instantly Over A number of Streams of Audio Tokens and 7x Quicker than the Autoregressive Baseline
Share
Facebook Twitter LinkedIn Pinterest WhatsApp Email


In audio expertise, researchers have made vital strides in creating fashions for audio technology. Nonetheless, the problem lies in creating fashions that may effectively and precisely generate audio from numerous inputs, together with textual descriptions. Earlier approaches have centered on autoregressive and diffusion-based fashions. Whereas these approaches yield spectacular outcomes, they’ve drawbacks, resembling excessive inference instances and struggles with producing long-form sequences.

Researchers from FAIR Group Meta, Kyutai, and The Hebrew College of Jerusalem have developed MAGNET (Masked Audio Technology utilizing Non-autoregressive Transformers) in response to those challenges. This novel method operates on a number of streams of audio tokens utilizing a single transformer mannequin. In contrast to earlier strategies, MAGNET is non-autoregressive, predicting spans of masked tokens obtained from a masking scheduler throughout coaching. It steadily constructs the output audio sequence throughout inference by a number of decoding steps. This method considerably hastens the technology course of, making it extra appropriate for interactive functions resembling music technology and modifying.

https://arxiv.org/abs/2401.04577

MAGNET additionally introduces a singular rescoring technique to boost audio high quality. This technique leverages an exterior pre-trained mannequin to rescore and rank predictions from MAGNET, that are then utilized in later decoding steps. A hybrid model of MAGNET, which mixes autoregressive and non-autoregressive fashions to generate the primary few seconds of audio in an autoregressive method, has been explored. On the identical time, the remainder of the sequence is decoded in parallel.

The effectivity of MAGNET has been demonstrated within the context of text-to-music and text-to-audio technology. Via in depth empirical analysis, together with each goal metrics and human research, MAGNET has proven comparable efficiency to present baselines whereas being considerably quicker. This velocity is especially notable in comparison with autoregressive fashions, with MAGNET being seven instances quicker.

The analysis delves into the significance of every element of MAGNET, highlighting the trade-offs between autoregressive and non-autoregressive modeling when it comes to latency, throughput, and technology high quality. By conducting ablation research and evaluation, the analysis workforce has illuminated the importance of varied facets of MAGNET, contributing to a extra profound understanding of audio technology applied sciences.

https://arxiv.org/abs/2401.04577

In conclusion, the event of MAGNET marks a considerable development within the realm of audio expertise:

  • Introduces a novel, environment friendly method for audio technology, considerably decreasing latency in comparison with conventional strategies.
  • Combines autoregressive and non-autoregressive components to optimize technology high quality and velocity.
  • Demonstrates the potential for real-time, high-quality audio technology from textual explanations, opening up new potentialities in interactive audio functions.

Take a look at the Paper and Venture Web page. All credit score for this analysis goes to the researchers of this undertaking. Additionally, don’t neglect to comply with us on Twitter. Be part of our 36k+ ML SubReddit, 41k+ Fb Group, Discord Channel, and LinkedIn Group.

If you happen to like our work, you’ll love our publication..



Hey, My title is Adnan Hassan. I’m a consulting intern at Marktechpost and shortly to be a administration trainee at American Specific. I’m at the moment pursuing a twin diploma on the Indian Institute of Know-how, Kharagpur. I’m obsessed with expertise and wish to create new merchandise that make a distinction.


[Free AI Event] 🐝 ‘Meet SingleStore Professional Max, the Powerhouse Version’ (Jan 24 2024, 10 am PST)



Related Posts

Microsoft Analysis Releases Skala: a Deep-Studying Alternate–Correlation Practical Focusing on Hybrid-Stage Accuracy at Semi-Native Value

October 10, 2025

Deep Studying Framework Showdown: PyTorch vs TensorFlow in 2025

August 20, 2025

Google AI Releases DeepPolisher: A New Deep Studying Software that Improves the Accuracy of Genome Assemblies by Exactly Correcting Base-Degree Errors

August 7, 2025
Misa
Trending
Machine-Learning

Chaos Audio Launches Nimbus, an AI-Powered Open-Platform Amp for Whole Artistic Freedom

By Editorial TeamOctober 17, 20250

Dwell on Kickstarter, Nimbus is the Smartest Amp Ever Made. Nimbus, the world’s smartest open-platform…

AGII Provides Actual-Time Studying Methods to Enhance Blockchain Intelligence and Reliability

October 17, 2025

Colle AI Integrates Clever Automation Engines to Enhance NFT Manufacturing Effectivity

October 17, 2025

Wrap Launches Subsequent-Technology Drone First Responder Interdiction Answer with a Concentrate on Non-Deadly Response

October 17, 2025
Stay In Touch
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo
Our Picks

Chaos Audio Launches Nimbus, an AI-Powered Open-Platform Amp for Whole Artistic Freedom

October 17, 2025

AGII Provides Actual-Time Studying Methods to Enhance Blockchain Intelligence and Reliability

October 17, 2025

Colle AI Integrates Clever Automation Engines to Enhance NFT Manufacturing Effectivity

October 17, 2025

Wrap Launches Subsequent-Technology Drone First Responder Interdiction Answer with a Concentrate on Non-Deadly Response

October 17, 2025

Subscribe to Updates

Get the latest creative news from SmartMag about art & design.

The Ai Today™ Magazine is the first in the middle east that gives the latest developments and innovations in the field of AI. We provide in-depth articles and analysis on the latest research and technologies in AI, as well as interviews with experts and thought leaders in the field. In addition, The Ai Today™ Magazine provides a platform for researchers and practitioners to share their work and ideas with a wider audience, help readers stay informed and engaged with the latest developments in the field, and provide valuable insights and perspectives on the future of AI.

Our Picks

Chaos Audio Launches Nimbus, an AI-Powered Open-Platform Amp for Whole Artistic Freedom

October 17, 2025

AGII Provides Actual-Time Studying Methods to Enhance Blockchain Intelligence and Reliability

October 17, 2025

Colle AI Integrates Clever Automation Engines to Enhance NFT Manufacturing Effectivity

October 17, 2025
Trending

Wrap Launches Subsequent-Technology Drone First Responder Interdiction Answer with a Concentrate on Non-Deadly Response

October 17, 2025

Artemis, the Solely AI-Powered Photo voltaic Design Instrument, Authorized by Power Belief of Oregon for Incentive Qualification

October 17, 2025

Martensen IP Affords Essential Steerage on AI Mental Property Dangers, Examples of Copyright Points, and FAQs

October 17, 2025
Facebook X (Twitter) Instagram YouTube LinkedIn TikTok
  • About Us
  • Advertising Solutions
  • Privacy Policy
  • Terms
  • Podcast
Copyright © The Ai Today™ , All right reserved.

Type above and press Enter to search. Press Esc to cancel.