Time-Frequency Audio Features for Speech-Music Classification

11/03/2018
by Mrinmoy Bhattacharjee, et al.

Distinct striation patterns are observed in the spectrograms of speech and music. This motivated us to propose three novel time-frequency features for speech-music classification. These features are extracted in two stages. First, a preset number of prominent spectral peak locations are identified from the spectrum of each frame. The peak locations obtained from each frame are used to form spectral peak sequences (SPS) for an audio interval. In the second stage, these SPS are treated as time series of frequency locations. The proposed features are extracted as the periodicity, average frequency and statistical attributes of these spectral peak sequences. Speech-music categorization is performed by training binary classifiers on these features. We have experimented with Gaussian mixture models, support vector machines and random forest classifiers. Our proposal is validated on four datasets and benchmarked against three baseline approaches. Experimental results establish the validity of our proposal.
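
The two-stage extraction described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the frame length, hop size, number of peaks, or the exact definitions of the three features, so `n_peaks`, `frame_len`, `hop`, the Hann window, and the use of normalised autocorrelation as a periodicity measure are all assumptions made for illustration, as are the function names.

```python
import numpy as np


def spectral_peak_sequences(signal, sr, n_peaks=5, frame_len=512, hop=256):
    """Stage 1 (sketch): return an (n_frames, n_peaks) array of peak frequencies in Hz.

    Column k is one spectral peak sequence: the k-th lowest of the
    n_peaks strongest spectral peaks, tracked frame by frame.
    All parameter values are illustrative assumptions.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / sr)
    sps = np.zeros((n_frames, n_peaks))
    for t in range(n_frames):
        frame = signal[t * hop : t * hop + frame_len] * window
        mag = np.abs(np.fft.rfft(frame))
        # Local maxima: bins strictly larger than both neighbours.
        is_peak = (mag[1:-1] > mag[:-2]) & (mag[1:-1] > mag[2:])
        peak_bins = np.flatnonzero(is_peak) + 1
        # Keep the n_peaks strongest peaks, then order them by frequency.
        strongest = peak_bins[np.argsort(mag[peak_bins])[::-1][:n_peaks]]
        strongest = np.sort(strongest)
        # Frames with fewer than n_peaks peaks leave the remaining slots at 0.
        sps[t, : len(strongest)] = freqs[strongest]
    return sps


def sps_features(sps):
    """Stage 2 (sketch): treat each sequence as a time series and summarise it.

    Returns, per sequence, an assumed periodicity measure (largest
    normalised autocorrelation at a non-zero lag), the mean frequency,
    and the standard deviation as one example statistical attribute.
    """
    mean_freq = sps.mean(axis=0)
    std_freq = sps.std(axis=0)
    periodicity = np.zeros(sps.shape[1])
    for k in range(sps.shape[1]):
        x = sps[:, k] - mean_freq[k]
        energy = np.dot(x, x)
        if energy > 0:
            # Lags 1..N-1 of the autocorrelation, normalised by lag-0 energy.
            ac = np.correlate(x, x, mode="full")[len(x):] / energy
            periodicity[k] = ac.max()
    return np.concatenate([periodicity, mean_freq, std_freq])
```

Calling `sps_features(spectral_peak_sequences(audio, sr))` on a mono audio interval yields one fixed-length feature vector, which could then be passed to any of the binary classifiers mentioned above.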

research
03/13/2018

Music Genre Classification Using Spectral Analysis and Sparse Representation of the Signals

In this paper, we proposed a robust music genre classification method ba...
research
10/18/2021

SpecTNT: a Time-Frequency Transformer for Music Audio

Transformers have drawn attention in the MIR field for their remarkable ...
research
01/15/2019

Classical Music Generation in Distinct Dastgahs with AlimNet ACGAN

In this paper AlimNet (With respect to great musician, Alim Qasimov) an ...
research
12/17/2018

Instrument-Independent Dastgah Recognition of Iranian Classical Music Using AzarNet

In this paper, AzarNet, a deep neural network (DNN), is proposed to reco...
research
11/12/2018

Identification of Internal Faults in Indirect Symmetrical Phase Shift Transformers Using Ensemble Learning

This paper proposes methods to identify 40 different types of internal f...
research
11/02/2022

SpectroMap: Peak detection algorithm for audio fingerprinting

We present SpectroMap, an open source GitHub repository for audio finger...
