Published in 2025 in "IEEE Access"
DOI: 10.1109/access.2025.3617184
Abstract: Modeling musical audio requires capturing hierarchical relationships between harmonic textures, rhythmic motifs, and long-range structural repetitions. Convolutional networks extract local features efficiently, while transformers provide global modeling, yet both face mismatches with musical structure. In…
Keywords: dual axis; attention; mamba; structmamba ...