Homework 6 PCA via Successive Deflation

Your shopping cart is empty.

Machine Learning
Homework 6
This homework contains 3 questions. The last two questions require programming. The maximum
number of points is 100.
1 PCA via Successive Deflation [30 points]
Suppose we have a set of n data points x1, . . . , xn, where each xi
is represented as a d-dimensional column
vector. Assume that the data has been centerlized, i.e., having zero mean: 1
n
Pn
i=1 xi = 0. Let X =
[x1; . . . ; xn] be the (d × n) matrix where column i is equal to xi
. Define C =
1
nXXT
to be the covariance
matrix of X, where cij =
1
n
Pn
l=1 xilxjl = covar(i, j).
Next, order the eigenvectors of C by their eigenvalues (largest first), and let v1, v2, . . . , vk be the first k
eigenvectors. These satisfy
v
T
i vj =
(
0 if i 6= j
1 if i = j
v1 is the first principal eigenvector of C (the eigenvector with the largest eigenvalue), and as such satisfies
Cv1 = λ1v1. Now define x˜i as the orthogonal projection of xi onto the space orthogonal to v1:
x˜i = (I − v1v
T
1
)xi
Finally, define X˜ = [x˜1; . . . ; x˜n] as the deflated matrix of rank d − 1, which is obtained by removing
from the d-dimensional data the component that lies in the direction of the first principal eigenvector:
X˜ = (I − v1v
T
1
)X
1. [7 points] Show that the covariance of the deflated matrix,
C˜ =
1
n
X˜ X˜ T
is given by
C˜ =
1
n
XXT − λ1v1v
T
1
(Hint: Some useful facts: (I − v1v
T
1
) is symmetric, XXT v1 = nλ1v1, and v
T
1 v1 = 1. Also, for any
matrices A and B, (AB)
T = BT AT
.)
2. [7 points] Show that for j 6= 1, if vj is a principal eigenvector of C with corresponding eigenvalue λj
(that is, Cvj = λjvj ), then vj is also a principal eigenvector of C˜ with the same eigenvalue λj .
3. [8 points] Let u be the first principal eigenvector of C˜ . Explain why u = v2. (You may assume u is
unit norm.)
4. [8 points] Suppose we have a simple method f for finding the leading eigenvector and eigenvalue of
a positive-definite matrix, denoted by [λ, u] = f(C). Write some pseudocode for finding the first k
principal basis vectors of X that only uses the special f function and simple vector arithmetic.
(Hint: This should be a simple iterative routine that takes only a few lines to write. The input is C, k,
and the function f, the output should be vj and λj for j ∈ 1, · · · , k)
1
2 Action recognition with CNN (35 points + 10 bonus)
In this question, you will train a convolutional neural network (CNN) to classify images and videos using
Pytorch. We use the UCF101 data (see http://crcv.ucf.edu/data/UCF101.php). There are also 10 classes of
data in this homework but the data and the number of classes are different from those of Homework 4. Each
clip has 3 frames and each frame is 64 ∗ 64 pixels. The labels of train and validation clips are provided in
hw6 data.mat inside the directory Question2.
You will first train a CNN for action classification for each image. You will then improve the network
architecture and submit the classification results on the test data to Kaggle. Then, you will train a CNN
using 3D convolution for a set of video frames (rather than for individual frames), and submit your results to
Kaggle.
The detail instructions and questions are in the jupyter notebook Action CNN.ipynb. In this file, there
are 8 ‘ToDos’ spots for you to fill. The score of each ToDo is specified at the spot. For the 5
th and 8
th
ToDos, you need to submit CSV result files to Kaggle. The results would be evaluated by Categorization
Accuracy.For the 5
th ToDo, submit to https://www.kaggle.com/c/cse512hw6image. For the
8
th ToDo, submit to https://www.kaggle.com/c/cse512hw6video.
We will maintain a leader board for each Kaggle competition, and the top three entries at the end of
the competition (official assignment due date) will receive 10 bonus points. Any submission that rises to
top three after the assignment deadline is not eligible for bonus points. The ranking will be based on the
Categorization Accuracy. Marks for these questions will be scaled according to the ranking on the Private
Leaderboard. To prevent exploiting test data, you are allowed to make a maximum of 2 submissions per 24
hours. Your submission will be evaluated immediately and the leader board will be updated.
Environment setting
Please make a ./data folder under the same directory with the Action CNN.ipynb file. Put data trainClips,
valClips, testClips and hw6 data.mat from the Question2 directory under ./data.
We recommend using virtual environment for the project. If you choose not to use a virtual environment,
it is up to you to make sure that all dependencies for the code are installed globally on your machine. To set
up a virtual environment, run the following in the command-line interface:
cd your_hw6_folder
sudo pip install virtualenv # This may already be installed
virtualenv .env # Create a virtual environment
source .env/bin/activate # Activate the virtual environment
pip install -r requirements.txt # Install dependencies
# Note that this does NOT install TensorFlow or PyTorch,
# which you need to do yourself.
# Work (hard) on the assignment
# ... and when you’re done:
deactivate # Exit the virtual environment
Note that every time you want to work on the assignment, you should run ‘source .env/bin/activate’ (from
within your hw6 folder) to re-activate the virtual environment, and deactivate again whenever you are done.
3 Action Classification Using RNN (35 points)
In this section, you will train recurrent neural networks (RNNs) to classify human actions. RNNs are designed handle sequential data.
For human action recognition, you will be using skeleton data that encodes the 3D locations of 25 body
joints. The data is collected by Kinect v2. There are 10 different action classes. There are 4000 training
2
sequences, 800 validation sequences, and 1000 test sequences. Each sequence has 15 frames, each frame is
a 75-dimension vector (the xyz positions of 25 joints). Data and the Jupyter notebook for this question are
provided inside the directory Question3.
You will first train an LSTM for action classification. Then try to improve the network architecture and
attach your results with the jupyter notebook. Also add the hyper-parameters explored.
The detailed instructions and questions are in the jupyter notebook RNN ActionClassify.ipynb. In this
file, there are 4 ToDo locations for you to fill. The score of each ToDo is specified at the spot.
You will need to install the following extra packages:
pip install h5py
pip install git+https://github.com/pytorch/tnt.git@master
4 What to submit?
4.1 Blackboard submission
You will need to submit both your code and your answers to questions on Blackboard. Put the answer file and
your code in a folder named: SBUID FirstName LastName (e.g., 10947XXXX lionel messi). Zip this folder
and submit the zip file on Blackboard. Your submission must be a zip file, i.e, SBUID FirstName LastName.zip.
The answer file should be named: answers.pdf. The first page of the answers.pdf should be the filled cover
page at the end of this homework. The remaining of the answer file should contain:
1. Answers to Question 1.
2. Completed Ipython notebooks ActionCNN.ipynb and RNN ActionClassify.ipynb
3. For Question 2, also submit Prediction csv files which you submitted to Kaggle.
4. For Question 3, mention the best validation performance in the report, and submit the prediction files
on the test and validation set. For the validation set, submit a single prediction file which results in the
best performance.
4.2 Report
You can use Latex or MS Word for preparing the report; submit report in pdf format. Report should be typed.
Don’t submit scans/pics of handwritten answers.
4.3 Kaggle submission
No Kaggle submission required for this homework.

Shopping cart

US$0

Homework 6 PCA via Successive Deflation

More products