We will carry out a project to evaluate the accuracy of the nearest-means and naive Bayes classifiers for protein sequence classification.
1. We will first write a Python program that converts protein sequences into feature vectors using the spectrum kernel.
2. We will then use the training and test data from the SCOP database, posted on the course website, to evaluate the mean error of both classifiers.
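
As a rough illustration of the two steps, the sketch below maps a sequence to its k-mer count vector (the feature map behind the spectrum kernel) and classifies it by the nearest class mean. It is only a sketch under stated assumptions: the function names, the choice k=2, and the 20-letter amino acid alphabet are all hypothetical and not prescribed by the project, and "mean error" is taken here to mean the fraction of misclassified test sequences. Loading the SCOP files is left out because their format is not described here.

```python
from itertools import product

# Assumption: the 20 standard amino acid letters; other symbols are ignored.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def spectrum_features(sequence, k=2, alphabet=AMINO_ACIDS):
    """Return the count of every possible length-k substring in `sequence`."""
    # Fixed ordering of all len(alphabet)**k k-mers, so vectors are comparable.
    index = {"".join(p): i for i, p in enumerate(product(alphabet, repeat=k))}
    vector = [0] * len(index)
    for start in range(len(sequence) - k + 1):
        position = index.get(sequence[start:start + k])
        if position is not None:  # skip k-mers containing non-standard letters
            vector[position] += 1
    return vector


def class_means(vectors, labels):
    """Average the feature vectors of each class."""
    sums, counts = {}, {}
    for vec, label in zip(vectors, labels):
        if label not in sums:
            sums[label] = [0.0] * len(vec)
            counts[label] = 0
        sums[label] = [s + v for s, v in zip(sums[label], vec)]
        counts[label] += 1
    return {label: [s / counts[label] for s in sums[label]] for label in sums}


def nearest_mean_predict(means, vec):
    """Predict the label whose class mean is closest in squared Euclidean distance."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(means, key=lambda label: sq_dist(means[label], vec))


def error_rate(predictions, labels):
    """Fraction of predictions that differ from the true labels."""
    wrong = sum(p != y for p, y in zip(predictions, labels))
    return wrong / len(labels)
```

With k=2 each sequence becomes a vector of 400 counts. A naive Bayes classifier could be trained on the same vectors and scored with the same `error_rate` helper, which keeps the comparison between the two classifiers on equal footing.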