Please refer to the table below for all of the texts we started with.

We were unable to chunk the whole of The Moonstone because it kept crashing the tokenizer. As a result, we broke The Moonstone up into seven different pieces. We based the organization schema on Project Gutenberg for ease. The only difference is that we combined everything from after the 6th narrative through the epilogue into one grouping.

After tokenizing the text, we moved on to chunking, merging chunksets, and treeview.

Step 2: Overcoming Failures with Dendrograms

We first chunked by # of chunks, into 1000 chunks for each text, and tried to merge the chunksets 4 times, and it crashed, which was our first failure. Then we rechunked and relabeled (please see the provided table) and used the advanced option to chunk by chunk size rather than by # of chunks. This successfully chunked everything. We merged the chunksets and called the result WholeCorpus, which was useless because it was too difficult to read.

We tried it three times in treeview. We clicked get dendro and first used PDF clustering, which never uploaded. Then we did dendro phyloxml and got an OOPS message. The 3rd was download XML; however, this never successfully downloaded.
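To make the difference between chunking by # of chunks and chunking by chunk size concrete, here is a minimal Python sketch. It is only an illustration and not how the tool itself works: the function names `chunk_by_count` and `chunk_by_size` are hypothetical, the sample sentence is made up, and a plain whitespace split stands in for the tokenizer.

```python
def chunk_by_size(tokens, size):
    """Split tokens into consecutive chunks of at most `size` tokens each."""
    if size <= 0:
        raise ValueError("size must be positive")
    return [tokens[i:i + size] for i in range(0, len(tokens), size)]


def chunk_by_count(tokens, count):
    """Split tokens into exactly `count` chunks of nearly equal length."""
    if count <= 0:
        raise ValueError("count must be positive")
    base, extra = divmod(len(tokens), count)
    chunks, start = [], 0
    for i in range(count):
        # The first `extra` chunks take one additional token each.
        end = start + base + (1 if i < extra else 0)
        chunks.append(tokens[start:end])
        start = end
    return chunks


# Hypothetical sample text; whitespace split is a stand-in for real tokenizing.
tokens = "the quick brown fox jumps over the lazy dog again".split()

by_size = chunk_by_size(tokens, 4)    # chunk lengths: 4, 4, 2
by_count = chunk_by_count(tokens, 3)  # chunk lengths: 4, 3, 3
```

With a fixed # of chunks, every text yields the same number of chunks no matter how long it is, so the chunk length changes from text to text. With a fixed chunk size, the chunk length stays the same and longer texts simply produce more chunks.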