next up previous
Next: EEL6825: HW#5 Up: EEL6825: Homework Assignments Previous: EEL6825: HW#3

EEL6825: HW#4

tex2html_wrap253 EEL 6825  -  Fall 1997

Due Friday, October 31, 1997 at 3pm. Do not be late to class. Those of you taking our friendly MS/PHD exam, contact me for an alternate due date.

PART A: Non-computer questions

A1
Problem 4-4 in D&H
A2
Problem 4-5 in D&H
A3
A one-dimensional distribution is parameterized by tex2html_wrap_inline229 and is given by:

displaymath233

(Note: the probability distribution is only valid for tex2html_wrap_inline235 ). The probability distribution for tex2html_wrap_inline149 ( tex2html_wrap_inline233 ) is uniform in [0,1]. Answer the following questions assuming that tex2html_wrap_inline143 and that an infinite number of samples is available:

a)
Compute the Bayes error as a function of tex2html_wrap_inline229 .
b)
Compute the expected probability of error as a function of tex2html_wrap_inline229 for the 1-NN leave-one-out procedure.
c)
Compute the expected probability of error for the 2-NN leave-one-out procedure as a function of tex2html_wrap_inline229 . Do not include the sample being classified and assume that ties are rejected.
d)
Is the the 2-NN error greater or smaller than the 1-NN and the Bayes error? Explain why.

A4
Extra Credit: Derive the maximum liklihood estimate for tex2html_wrap_inline229 given N data points.

PART B: Computer questions

The programming part of this assignment uses the data set developed by Gorman and Sejnowski in their study of the classification of sonar signals using a neural network. The task is to train a network to discriminate between sonar signals bounced off a metal cylinder and those bounced off a roughly cylindrical rock.

The file ``mines.asc'' (http://www.cnel.ufl.edu/analog/courses/EEL6825/mines.asc) contains 111 patterns obtained by bouncing sonar signals off a metal cylinder at various angles and under various conditions. The file ``rocks.asc''
(http://www.cnel.ufl.edu/analog/courses/EEL6825/rocks.asc) contains 97 patterns obtained from rocks under similar conditions. The transmitted sonar signal is a frequency-modulated chirp, rising in amplitude. The data set contains signals obtained from a variety of different aspect angles, spanning 90 degrees for the cylinder and 180 degrees for the rock. Each pattern is a set of 60 numbers in the range 0.0 to 1.0. Each number represents the energy within a particular frequency band, integrated over a certain period of time. The integration aperture for higher frequencies occur later in time, since these frequencies are transmitted later during the chirp. A README file in the directory contains a longer description of the data and past experiments.

B1
Compute the sample mean for each class. Design a linear classifier that chooses the class with the nearest sample mean. Use the Euclidian distance, don't worry about covariance matrices. Compute the resubstitution and the leave-one-out errors. Clearly indicate these results in your answers. As usual turn in all code that you write.

B2
Design a nearest-neighbor classifier that chooses the class of the nearest-neighbor for each X. Compute the resubstitution and the leave-one-out errors. Clearly indicate these results in your answers. Programming hint: Do not use all of the data points when you are developing your code. When you are confident that your program is correct, run with the full number of points. Also, write your code with efficiency in mind. If the full number of points still takes too long to run, use as many points as you think reasonable but explain what you have done.

B3
Plot a graph that shows the leave-one-out performance of your classifier that is similar to the tex2html_wrap_inline245 display we discussed in class. The Y-axis represents the distance between each point in the data set and its nearest neighbor in the mines class. If the data point happens to come from the mines class, leave it out of the minimum distance computation. Similarly, the X-axis is the distance between each data point and its nearest neighbor in the rocks class. (None of the distances should be exactly zero since you are using the leave-one-out method). Plot a line on the plot that shows your solution to problem B2. Change the offset and slope of this line the best you can to minimize the error. What is the best error that you can get?

B4
Extra credit (optional). Compute the same errors as problem B2 for 3-NN. Does 3-NN perform better or worse than 1-NN?


next up previous
Next: EEL6825: HW#5 Up: EEL6825: Homework Assignments Previous: EEL6825: HW#3

Dr John Harris
Mon Nov 10 01:03:10 EST 1997