Bimonthly    Since 1986
ISSN 1004-9037
Publication Details
Edited by: Editorial Board of Journal of Data Acquisition and Processing
P.O. Box 2704, Beijing 100190, P.R. China
Sponsored by: Institute of Computing Technology, CAS & China Computer Federation
Undertaken by: Institute of Computing Technology, CAS
Published by: SCIENCE PRESS, BEIJING, CHINA
Distributed by:
China: All Local Post Offices
 
   
      07 April 2023, Volume 38 Issue 2   
    Article

    AUTOMATIC SPEECH RECOGNITION USING MODIFIED PRINCIPAL COMPONENT ANALYSIS AND ENHANCED CONVOLUTION NEURAL NETWORK
    Dr.M. Kathiresh1, Dr. R. SankaraSubramanian2
    Journal of Data Acquisition and Processing, 2023, 38 (2): 4347-4363 . 

    Abstract

    Research on speech recognitions were initiated by notions of HMIs (human machine interactions). ASR (Automatic voice Recognition) is method that employs implementable algorithms on computers to translate voice signals as strings of words. Systems can understand human speech inputs. Speech signals transmit two crucial forms of information, including speech contents and identities of speakers. Existing system have issues with speech recognition accuracies and feature extractions. To overcome these problems, in this work, ECNN (Enhanced Convolution Neural Networks) is proposed. The main modules are pre-processing, feature extraction and speech recognition. In pre-processing, noises are removed by the application of Wiener filters for obtaining cleaner speeches. Subsequently, MPCA (Modified Principal Component Analysis) is used for feature extractions where most informative features are extracted. Noise corrupt speech feature matrices are the focus of MPCA and it is demonstrated that the generated sparse partitions reveal speech dominant properties. The ECNN algorithm is subsequently used for speech recognitions and thus enhancing speech recognitions with reduced error rates. The experimental results demonstrate in the conclusion that the proposed MPCA+ECNN algorithm provides better values in comparison with other methods in terms of MSE (Mean Square Error) rates, accuracy, specificity and execution times.

    Keyword

    Speech recognition, Enhanced Convolution Neural Network (ECNN), Modified Principal Component Analysis (MPCA)


    PDF Download (click here)

SCImago Journal & Country Rank

ISSN 1004-9037

         

Home
Editorial Board
Author Guidelines
Subscription
Journal of Data Acquisition and Processing
Institute of Computing Technology, Chinese Academy of Sciences
P.O. Box 2704, Beijing 100190 P.R. China
E-mail: info@sjcjycl.cn
 
  Copyright ©2015 JCST, All Rights Reserved