Date of Award

12-2014

Access Type

Thesis - Open Access

Degree Name

Master of Science in Mechanical Engineering

Department

Mechanical Engineering

Committee Chair

Eric J. Coyle

First Committee Member

Patrick Currier

Second Committee Member

Yan Tang

Abstract

Extensive Research has been conducted on speech recognition and Speaker Recognition over the past few decades. Speaker recognition deals with identifying the speaker from multiple speakers and the ability to filter out the voice of an individual from the background for computational understanding. The more commonly researched method, speech recognition, deals only with computational linguistics. This thesis deals with speaker recognition and natural language processing. The most common speaker recognition systems are Text-Dependent and identify the speaker after a key word/phrase is uttered. This thesis presents Text-Independent Speaker recognition systems that incorporate the collaborative effort and research of noise-filtering, Speech Segmentation, Feature extraction, speaker verification and finally, Partial Language Modelling. The filtering process was accomplished using 4th order Butterworth Band-pass filters to dampen ambient noise outside normal speech frequencies of 300Hzto3000Hz. Speech segmentation utilizes Hamming windows to segment the speech, after which speech detection occurs by calculating the Short time Energy and Zero-crossing rates over a particular time period and identifying voiced from unvoiced using a threshold. Audio data collected from different people is run consecutively through a Speaker Training and Recognition Algorithm which uses neural networks to create a training group and target group for the recognition process. The output of the segmentation module is then processed by the neural network to recognize the speaker. Though not implemented here due to database and computational requirements, the last module suggests a new model for the Part of Speech tagging process that involves a combination of Artificial Neural Networks (ANN) and Hidden Markov Models (HMM) in a series configuration to achieve higher accuracy. This differs from existing research by diverging from the usual single model approach or the creation of hybrid ANN and HMM models.

Scholarly Commons Citation

Ernst, Theodore Philip King, "Speaker Dependent Voice Recognition with Word-Tense Association and Part-of-Speech Tagging" (2014). Doctoral Dissertations and Master's Theses. 272.
https://commons.erau.edu/edt/272

Download

Included in

Linguistics Commons, Mechanical Engineering Commons

COinS

Doctoral Dissertations and Master's Theses

Speaker Dependent Voice Recognition with Word-Tense Association and Part-of-Speech Tagging

Date of Award

Access Type

Degree Name

Department

Committee Chair

First Committee Member

Second Committee Member

Abstract

Scholarly Commons Citation

Included in

Search

Browse

Author Corner

Links

Doctoral Dissertations and Master's Theses

Speaker Dependent Voice Recognition with Word-Tense Association and Part-of-Speech Tagging

Author

Date of Award

Access Type

Degree Name

Department

Committee Chair

First Committee Member

Second Committee Member

Abstract

Scholarly Commons Citation

Included in

Share

Search

Browse

Author Corner

Links