I guess the genomes are on the NCBI server. I just read about CGR and it looks very interesting; I suspect it will pick up patterns that edit distance (which ClusteringTree uses) doesn't see. I will play around with that idea and post the results.
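
For anyone curious what I mean, here is a rough sketch of the idea, assuming CGR means chaos game representation here. The corner assignment, the function names (`cgr_points`, `fcgr`, `fcgr_distance`) and the grid size `k` are my own illustrative choices, not anything taken from ClusteringTree. Each base pulls the current point halfway toward its corner of the unit square, and counting points per grid cell gives a k-mer-like frequency picture that can be compared between sequences without aligning them.

```python
from math import sqrt

# One common corner convention (an assumption; other layouts are used too).
CORNERS = {"A": (0.0, 0.0), "C": (0.0, 1.0), "G": (1.0, 1.0), "T": (1.0, 0.0)}


def cgr_points(seq):
    """Return the chaos game trajectory of a DNA sequence as (x, y) pairs."""
    x, y = 0.5, 0.5  # start in the centre of the unit square
    points = []
    for base in seq.upper():
        if base not in CORNERS:
            continue  # skip N and other ambiguity codes
        cx, cy = CORNERS[base]
        # Move halfway from the current point toward the base's corner.
        x, y = (x + cx) / 2.0, (y + cy) / 2.0
        points.append((x, y))
    return points


def fcgr(seq, k=3):
    """Count CGR points in a 2^k x 2^k grid (a frequency CGR)."""
    n = 2 ** k
    grid = [[0] * n for _ in range(n)]
    # The first k-1 points still depend on the start point, so drop them.
    for x, y in cgr_points(seq)[k - 1:]:
        # Clamp in case floating-point rounding pushes a coordinate to 1.0.
        col = min(int(x * n), n - 1)
        row = min(int(y * n), n - 1)
        grid[row][col] += 1
    return grid


def fcgr_distance(a, b, k=3):
    """Euclidean distance between the normalised FCGR grids of two sequences."""
    ga, gb = fcgr(a, k), fcgr(b, k)
    total_a = sum(map(sum, ga)) or 1  # avoid division by zero on short input
    total_b = sum(map(sum, gb)) or 1
    n = 2 ** k
    return sqrt(sum((ga[r][c] / total_a - gb[r][c] / total_b) ** 2
                    for r in range(n) for c in range(n)))


if __name__ == "__main__":
    print(fcgr_distance("ACGTACGTACGT", "ACGTTGCAACGT"))
```

This is only a toy: floats lose precision on very long runs of similar bases, so for real genomes counting k-mers directly is probably the more robust way to fill the same grid.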