Using Mel-Mapped Best Tree Encoding for Baseline-Context-Independent-Mono-Phone Automatic Speech Recognition | ||||
The Egyptian Journal of Language Engineering | ||||
Article 2, Volume 2, Issue 1, April 2015, Page 10-24 PDF (1.09 MB) | ||||
Document Type: Original Article | ||||
DOI: 10.21608/ejle.2015.60254 | ||||
View on SCiNiTO | ||||
Authors | ||||
Amr Gody ; Rania Abul Seoud; Mai Ezz El-Din | ||||
Electronics and Communications Engineering Department, Faculty of Engineering, Fayoum University, Egypt | ||||
Abstract | ||||
Best-Tree Encoding (BTE) is first introduced by Amr M. Gody [1] as new features for Automatic Speech Recognition (ASR) problem. BTE is basically acting as spectrum analyzer. It relies on Wavelet packets to get projection of signal power into predefined filter banks. The feature components are encoded into digital form using certain entropy method and certain digital encoding procedure. In this research BTE is further developed by including two more key factors into the BTE process. The key factors are Mel-scale (MS) and baseband Bandwidth mapping (BM).This Research provides a baseline performance evaluation for Context-independent mono-phone recognition (Without Grammar) of English by using Vid-TIMIT database. Vid-TIMIT consists of 43 speakers (19 female and 24 male), reciting short sentences. The recording of this database was done in a noisy environment (mostly computer fan noise) and also it is not hand verified. Total of 15643 phone segments are used for testing and evaluating the newly proposed features. HMM is used as recognition engine via HTK toolkit for its popularity in ASR. Comparison to MFCC on the same database is considered to evaluate the system results. Although it gives the same recognition efficiency as MFCC on the same testing database, the proposed model saves almost 66% of the required storage than the feature vector of MFCC. | ||||
Keywords | ||||
Automatic Speech recognition (ASR); Arabic Phone Recognition; Wavelet packets; Mel-Scale; WPBTE; MFCC; HTK and BTE | ||||
Statistics Article View: 128 PDF Download: 324 |
||||