minguinho26/Prefix_AAC_ICASSP2023

Official Implementation of "Prefix tuning for Automated Audio Captioning(ICASSP 2023)"

21
/ 100
Experimental

This tool helps researchers and audio content creators automatically generate descriptive text captions for audio recordings. You input an audio file, and it outputs a human-readable sentence or phrase describing the sounds within. It's designed for anyone working with large collections of audio who needs to quickly understand or catalog their content without manually listening to every file.

No commits in the last 6 months.

Use this if you need to automatically create textual descriptions for sound events or environmental audio recordings.

Not ideal if you need to caption spoken dialogue or music compositions, as this focuses on general sound events.

audio-analysis sound-recognition content-tagging audio-metadata media-asset-management
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 8 / 25
Community 6 / 25

How are scores calculated?

Stars

31

Forks

2

Language

Jupyter Notebook

License

Last pushed

Dec 06, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/minguinho26/Prefix_AAC_ICASSP2023"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.