carat.features.accentuation_feature¶
-
carat.features.
accentuation_feature
(signal, fs, sum_flag=True, log_flag=False, mel_flag=True, alpha=1000, maxfilt_flag=False, maxbins=3, **kwargs)[source]¶ Compute accentuation feature from audio signal.
Based on the log-power Mel spectrogram [1].
- [1] Böck, Sebastian, and Gerhard Widmer.
- “Maximum filter vibrato suppression for onset detection.” 16th International Conference on Digital Audio Effects, Maynooth, Ireland. 2013.
This performs the following calculations to the input signal:
input->STFT->(Mel scale)->(Log)->(Max filtering)->Diff->HWR->(Sum)
Parenthesis denote optional steps.
- Args:
- input: signal
- fs: sampling rate
- sum_flag (bool): true if the features are to be summed for each frame.
- log_flag (bool): true if the features energy are to be converted to dB.
- mel_flag (bool): true if the features are to be mapped in the Mel scale.
- alpha (int): compression parameter for dB conversion - log10(alpha*abs(S)+1).
- maxfilt_flag (bool): true if a maximum filtering is applied to the feature.
- maxbins (int): number of frequency bins for maximum filter size
**kw
: these keyword arguments are passed down to each of the functions used
- Returns:
- feature (numpy array): feature values
- time (numpy array): time values