As part of the Studia Stemmatalogica project (led by Tuomas Heikkilä, Teemu Roos and Petri Myllymäk of the University of Helsinki), I have prepared a page giving access to five full sets of data prepared for phylogenetic analysis: four for sections of the Canterbury Tales, one for the Old Norse Solarljod. These datasets have been produced with exceptional care, to give the most accurate and complete portrayal of the variation in each tradition. For each dataset, we also present an expert scholarly analysis.
Our hope, in releasing this data, is to encourage researchers interested in the possibilities and challenges of the application of phylogenetic methods to stemmatics to experiment with different methods of analysis on 'real' datasets. We would be glad to hear of any and all uses made of this data.
Best wishes