Adding Audio-MAE to Transformers

Hello @Ahmed-Telili!

Thank you for sharing your work and bringing this to the community’s attention! Pretraining an Audio-MAE model on 30 million underwater sound samples is an impressive achievement, and adding the model to the Hugging Face Transformers library would be a valuable contribution.

Here’s how you could move forward:

  1. Open a Feature Request:

    • Create a feature request on the Hugging Face Transformers GitHub repository (Transformers Issues).
    • In the request, include details about Audio-MAE, your training setup, dataset (Orcasound), and potential applications, emphasizing its value for underwater acoustics and other domains.
  2. Prepare a Pretrained Model for Sharing:

    • If you’re comfortable, consider uploading your pretrained model to the Hugging Face Model Hub; this would make it accessible to the community and encourage adoption (see the upload sketch after this list).
    • Write a descriptive model card (the repository’s README.md), including details like:
      • Training dataset and methodology.
      • Potential use cases (e.g., marine research, underwater sound monitoring).
      • Limitations or biases in the dataset.
  3. Contribute an Implementation:

    • If you’re open to contributing code, you can fork the Transformers repository and create an implementation for Audio-MAE, taking inspiration from the existing Vision Transformer MAE (ViTMAE) implementation (a rough skeleton follows this list).
    • Add relevant documentation and tests to make the integration seamless.
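
For step 2, here’s a minimal sketch of what the upload could look like with the `huggingface_hub` library (the repo id and local folder path are placeholders, not your actual setup):

```python
# Minimal sketch: push a local Audio-MAE checkpoint to the Hugging Face Hub.
# "your-username/audio-mae-orcasound" and "./audio_mae_checkpoint" are
# placeholder names -- substitute your own repo id and local directory.
from huggingface_hub import HfApi

api = HfApi()

# Create the repository first (no-op if it already exists).
api.create_repo("your-username/audio-mae-orcasound", exist_ok=True)

# Upload the whole checkpoint folder: weights, config, and the README.md
# that serves as the model card described above.
api.upload_folder(
    folder_path="./audio_mae_checkpoint",
    repo_id="your-username/audio-mae-orcasound",
)
```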
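
And for step 3, a very rough skeleton of what a Transformers-style port might look like, assuming the ViTMAE layout as a template. The class names, config fields, and the plain `nn.TransformerEncoder` body below are illustrative guesses, not a settled design; a real port would mirror ViTMAE’s patch embedding, random masking, and decoder:

```python
# Illustrative skeleton only -- names and defaults are hypothetical.
import torch.nn as nn
from transformers import PretrainedConfig, PreTrainedModel


class AudioMAEConfig(PretrainedConfig):
    model_type = "audio-mae"

    def __init__(self, hidden_size=768, num_hidden_layers=12,
                 num_attention_heads=12, mask_ratio=0.8, **kwargs):
        super().__init__(**kwargs)
        self.hidden_size = hidden_size
        self.num_hidden_layers = num_hidden_layers
        self.num_attention_heads = num_attention_heads
        self.mask_ratio = mask_ratio  # MAE-style pretraining masks most patches


class AudioMAEModel(PreTrainedModel):
    config_class = AudioMAEConfig

    def __init__(self, config):
        super().__init__(config)
        layer = nn.TransformerEncoderLayer(
            d_model=config.hidden_size,
            nhead=config.num_attention_heads,
            batch_first=True,
        )
        self.encoder = nn.TransformerEncoder(layer, config.num_hidden_layers)
        self.post_init()  # standard Transformers weight-init hook

    def forward(self, input_values):
        # input_values: (batch, num_patches, hidden_size) spectrogram
        # patch embeddings; the real model would embed and mask them first.
        return self.encoder(input_values)
```

Following that pattern keeps the model compatible with `from_pretrained`/`save_pretrained` out of the box, which is most of what the library’s integration tests check for.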

This initiative could significantly benefit the audio research community, especially in niche domains like underwater acoustics. Kudos to you for leading the way, and I’m excited to see where this goes!

Best regards,
Alan.