Thanks a lot for the feedback! I like your blog post about metaprogramming and will look more into it.
I guess the genomes are in the NCBI server, I just read about CGR and it looks very interesting, I guess it will take into account patterns that edit distance (which ClusteringTree uses) doesn't see. I will play round with that idea and post the...
Last update: I just realized I was using the wrong set of data, haha. If it helps to explain, in case anyone has a similar problem, I was using sequences of different lengths when my custom function only worked with sequences of equal lengths. ...