$30
Homework 9
In this homework, you are required to include in your report the code that you
implemented. If you use some code from the web or package, also mention in your
report the origin of the code.
1. Implement the EM algorithm for clustering with multivariate Gaussian models discussed in the LearningGM section. Initialize the algorithm with the K-means result
(for example Matlab provides a kmeans function). Assume there are only two clusters. Report the parameters πk, µk, Σk, k = 1, 2 for the following datasets containing
500-600 observations from R
2
:
a) The dataset xeasy. (1 point)
b) The dataset x1. (1 point)
c) The dataset x2. (2 points)
Pay attention to the dimensions of the mean centers µk, k = 1, 2 and the covariance
matrices Σk, k = 1, 2.
2. Now implement the two-step EM algorithm (also known as provable EM) and repeat
the questions from problem 1. (4 points)
1