Evaluation plan

We will use Accuracy and F1 for ranking of the results. We also calculated and present other measures.