next up previous
Next: EEL6586: List of Student Up: EEL6586: Projects Previous: EEL6586: Projects

EEL6586: Project Handout




Final report due Wednesday, April 21 at 3pm. Late reports will suffer the usual late fees and penalties.

Your final project consists of a significant portion of the grade in this class.

Everyone must have a project idea by Wednesday, March 17. Important dates are as follows:

Your final grade for the project will be based on the on-time completion and quality of each of the above items.

Project Presentation

You will not be graded on how good a speaker you are, but on the work you have done and how well you prepared for the talk. Powerpoint presentations with audio demonstrations are strongly encouraged. (Be prepared to run your code on a surprise voice file that you have not seen before). Presentations should get better with each day since later students have had more time to prepare. Everyone will attend class each of the five days of student presentations. Please let the instructor know in advance if you cannot attend.

Project Report

Your final project report will be a web page-you do not need to print it out. Just email the address to the instructor. Most word processors are capable of outputing html code so this should not be a big hassle. A big advantage of using a webpage for your report is that you can include color figures and examples of sounds that you use or produce. If you have never designed a webpage before, this is your opportunity to learn. The report should be written as if it were to be submitted to a conference and therefore should contain the following components:

1.
A concise description of the problem.
2.
A summary of previous solutions to the problem. You should include at least one reference to a paper you have read (not a textbook).
3.
A detailed description of your solution to the problem.
4.
Matlab simulation results.
5.
A discussion of the significance of these results and how your solution differs from previous attempts.
6.
The appendix should contain complete MATLAB codes, messy derivations and any other information too detailed to keep in the main body.

Project Topics

To help you think about project topics, you should realize that you will have two more homework assignments in this course. HW3 will be on speech coding; for the matlab portion you will implement a LPC10E-like coder. For HW4, you will implement a Dynamic Time Warping algorithm to recognize words from a very small database.

You are strongly encouraged to come up with your own idea for a project based on your own experience. Extra points for novelty and creativeness. Projects should roughly fall under one of the four major topics in this course: 1) Speech synthesis, 2) short-term speech processing, 3) speech coding, and 4) speech recognition but your instructor is willing to consider all proposals.

You are welcome to work on two-person projects. Two-person teams need only turn in one project report and send one email per week, but remember that a two-person project is expected to be twice as much work as a one-person project.

Just a few of the possible topics include:

1.
Implement a speech synthesis algorithm more sophisticated than we implemented in HW#1.
2.
Accurate pitch detection or formant estimation procedures.
3.
In matlab, Implement a speech coder more sophisticated than LPC-10E.
4.
Implement a simpler coder to work in real-time between two PCs.
5.
Voice warping: to change the pitch or other quality of a speech signal. For example, change a voice signal to sound like Mickey Mouse.
6.
A DSP implementation of any of the algorithms related to this class.
7.
Noise filtering: clean the noise from a speech signal assuming only a single speaker.
8.
Audio-based video playback for internet teaching: produce a video of a speaker's face given only audio. This required a training set video.
9.
HMM based speech recognition.
10.
Investigate various feature detection techniques for phoneme recognition (cepstrum, LPC, mel-scale, reflection coefficients etc.)
11.
Speech recognition applied to any particular application (controlling a robot, telephone number lookup, stock report, etc)
12.
Combine adaptive filters or neural networks with any of the themes in the course.
13.
An unacceptable project would be to demo a commercial speech recognition program. A better project would be to preprocess the speech to improve the recognition performance of this program.
14.
Speaker, gender, or language detection from speech.
15.
Produce speech based on the magnitude spectrum only (Talk to Agustin Roca goose@cnel.ufl.edu about this).
You are welcome (as always) to talk to the instuctor about your project ideas.


next up previous
Next: EEL6586: List of Student Up: EEL6586: Projects Previous: EEL6586: Projects
Dr John Harris
1999-04-29