APSIPA Transactions on Signal and Information Processing > Vol 3 > Issue 1

Environmental sound recognition: a survey

Sachin Chachada, University of Southern California, USA, chachada@usc.edu , C.-C. Jay Kuo, University of Southern California, USA
 
Suggested Citation
Sachin Chachada and C.-C. Jay Kuo (2014), "Environmental sound recognition: a survey", APSIPA Transactions on Signal and Information Processing: Vol. 3: No. 1, e14. http://dx.doi.org/10.1017/ATSIP.2014.12

Publication Date: 15 Dec 2014
© 2014 Sachin Chachada and C.-C. Jay Kuo
 
Subjects
 
Keywords
environmental sound recognitionaudio signal processingfeature extractionnonstationary ESR techniquesenvironmental sound processing schemessignal spectral characteristicssignal temporal characteristics
 

Share

Open Access

This is published under the terms of the Creative Commons Attribution licence.

Downloaded: 7672 times

In this article:
I. INTRODUCTION 
II. ENVIRONMENTAL SOUND-PROCESSING SCHEMES 
III. STATIONARY ESR TECHNIQUES 
IV. NON-STATIONARY ESR TECHNIQUES 
V. Database and Performance Evaluation 
VI. CONCLUSIONS AND FUTURE WORK 

Abstract

Although research in audio recognition has traditionally focused on speech and music signals, the problem of environmental sound recognition (ESR) has received more attention in recent years. Research on ESR has significantly increased in the past decade. Recent work has focused on the appraisal of non-stationary aspects of environmental sounds, and several new features predicated on non-stationary characteristics have been proposed. These features strive to maximize their information content pertaining to signal's temporal and spectral characteristics. Furthermore, sequential learning methods have been used to capture the long-term variation of environmental sounds. In this survey, we will offer a qualitative and elucidatory survey on recent developments. It includes four parts: (i) basic environmental sound-processing schemes, (ii) stationary ESR techniques, (iii) non-stationary ESR techniques, and (iv) performance comparison of selected methods. Finally, concluding remarks and future research and development trends in the ESR field will be given.

DOI:10.1017/ATSIP.2014.12