lars76/swift-f0

Fast and accurate fundamental frequency (F0) detector using convolutional neural networks


Established

This tool helps musicians, musicologists, and speech scientists automatically extract the fundamental frequency (F0, the perceived pitch) from audio recordings. Given a WAV, MP3, or other audio file, it outputs a detailed pitch contour with confidence scores, and can also segment that contour into musical notes with MIDI information. It is designed for anyone who needs fast and precise pitch analysis for music or speech research.
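To make the idea of "segmented musical notes with MIDI information" concrete, here is a minimal sketch in plain Python. It does not call the swift-f0 package and is not its implementation; the function names, the 0.9 confidence threshold, and the sample data are all hypothetical. It only illustrates the standard Hz-to-MIDI conversion (A4 = 440 Hz = MIDI note 69) and one simple way to group confident frames into notes.

```python
import math

A4_HZ = 440.0   # reference tuning (assumption: standard concert pitch)
A4_MIDI = 69    # MIDI note number of A4


def hz_to_midi(f0_hz):
    """Convert a frequency in Hz to a (possibly fractional) MIDI note number."""
    return A4_MIDI + 12.0 * math.log2(f0_hz / A4_HZ)


def group_frames_into_notes(times, f0_hz, confidence, threshold=0.9):
    """Group consecutive confident frames with the same rounded MIDI pitch.

    Hypothetical helper for illustration only. Returns a list of
    (start_time, end_time, midi_pitch) tuples, where end_time is the
    timestamp of the last frame belonging to the note.
    """
    notes = []
    current = None  # [start_time, end_time, midi_pitch] of the open note
    for t, f, c in zip(times, f0_hz, confidence):
        # Frames below the confidence threshold are treated as unvoiced.
        pitch = round(hz_to_midi(f)) if (c >= threshold and f > 0) else None
        if current is not None and pitch == current[2]:
            current[1] = t  # same pitch: extend the open note
        else:
            if current is not None:
                notes.append(tuple(current))  # close the previous note
            current = [t, t, pitch] if pitch is not None else None
    if current is not None:
        notes.append(tuple(current))
    return notes


# Made-up pitch contour: three frames near A4, one unvoiced frame, two at C5.
times = [0.00, 0.01, 0.02, 0.03, 0.04, 0.05]
f0 = [440.0, 441.0, 439.0, 0.0, 523.25, 523.25]
conf = [0.95, 0.97, 0.96, 0.10, 0.99, 0.98]

notes = group_frames_into_notes(times, f0, conf)
print(notes)  # [(0.0, 0.02, 69), (0.04, 0.05, 72)]
```

A real note segmenter would typically also enforce a minimum note duration and tolerate brief pitch fluctuations such as vibrato; this sketch omits both for brevity.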

141 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to quickly and accurately determine the exact pitch (F0) of a single voice or instrument in an audio recording, or automatically convert that pitch into musical notes and MIDI.

Not ideal if you need to analyze multiple instruments or voices simultaneously (polyphonic audio), as it's designed for monophonic sources.

music-analysis speech-science audio-transcription musical-notation sound-engineering


Maintenance 2 / 25

Adoption 10 / 25

Maturity 24 / 25

Community 15 / 25


Stars 141

Forks

Language Python

License MIT

Related frameworks

pytorch/audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

asteroid-team/asteroid

The PyTorch-based audio source separation toolkit for researchers

deezer/spleeter

Deezer source separation library including pretrained models.

audeering/opensmile

The Munich Open-Source Large-Scale Multimedia Feature Extractor

audeering/opensmile-python

Python package for openSMILE
