You are on page 1of 2

SUBJECT NAME: Transform Techniques for Signals SUBJECT CODE: EEE205 PROJECT TITLE: DESIGN OF MATLAB-BASED SPEAKER RECOGNITION

SYSTEM GROUP MEMEBERS: (2nd year EEE)


Avishek Banerjee 11BEE1010 Bikash Chakraborty 11BEE1016 Rahul Choubisa 11BEE1035 Shashwat Shukla 11BEE1053 Vikas Gupta 11BEE1078

ABSTRACT:
This project entails the design of a speaker recognition code using MATLAB. Signal processing in the time and frequency domain yields a powerful method for analysis.

In the current design project a basic speaker identification algorithm has been written to sort through a list of files and choose the 12 most likely matches based on the average pitch of the speech utterance as well as the location of the formants in the frequency domain representation.

This multi-faceted design project can be categorized into different sections: speech editing, speech degradation, speech enhancement, pitch analysis, formant analysis and waveform comparison.

Speech editing will be performed as well as degradation of signals by the application of Gaussian noise. Background noise will also be successfully removed from a signal by the application of a 3rd order Butterworth filter.

ROAD MAP:
Before CAT 1 what is your expectation on intermediate result?
Before CAT1 we will analyse the different aspects of the problem to be solved and try to understand all the details about it. We will chalk out the plan about how to go ahead with the project, and go through the theoretical concepts related to the topic. We intend to develop the algorithms for the code then.

What result are you expecting before CAT 2?


After CAT1, we plan to start the coding in MATLAB. Before CAT2, we will try to finish most of the coding needed for our project. We will get the solutions for speech editing, speech degradation and speech enhancement initially. Then we will move onto the other parts of our project like pitch analysis, formant analysis and waveform comparison. In order to create a speech recognition algorithm, criteria to compare speech files must be established. This section of the project compares four different methods of comparing the data. The wav files are compared to a reference file and sorted based on the average pitch of the file in one method. The files are also compared and sorted based on the location of the formants present in the Power Spectrum Density (PSD) of the signal. Finally, mixed comparisons are done based on both of these criteria.

What will be the final results before last review?


After CAT2, we will finish any coding work that might be remaining and test them for various inputs. Debugging may also be required. Then we will plot the results of the programs and analyse them in order to understand the behaviour. Finally, proper conclusions will be reached based on all the different results.