lars76/swift-f0
Fast and accurate fundamental frequency (F0) detector using convolutional neural networks
This tool helps musicians, musicologists, and speech scientists automatically extract the fundamental pitch from audio recordings. You feed it a WAV, MP3, or other audio file, and it quickly outputs a detailed pitch contour, confidence scores, and even segmented musical notes with MIDI information. It's designed for anyone needing fast and precise pitch analysis for music or speech research.
141 stars. No commits in the last 6 months. Available on PyPI.
Use this if you need to quickly and accurately determine the exact pitch (F0) of a single voice or instrument in an audio recording, or automatically convert that pitch into musical notes and MIDI.
Not ideal if you need to analyze multiple instruments or voices simultaneously (polyphonic audio), as it's designed for monophonic sources.
Stars
141
Forks
18
Language
Python
License
MIT
Category
Last pushed
Sep 02, 2025
Commits (30d)
0
Dependencies
2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/lars76/swift-f0"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
pytorch/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
deezer/spleeter
Deezer source separation library including pretrained models.
audeering/opensmile
The Munich Open-Source Large-Scale Multimedia Feature Extractor
audeering/opensmile-python
Python package for openSMILE