This page links to the data presented in our publications. The source code can also be freely downloaded, distributed, and modified.
The Snakefile, Python scripts, and README needed to reproduce the benchmark are available in this Git repository.
PeerJ Computer Science, 2018, doi:10.7717/peerj-cs.148
Genomic dataset of labeled DNA sequences with V(D)J recombinations
The dataset is a FASTA file. Labels are encoded in the FASTA header in the format label:start_position-end_position.
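As an illustration of the header format described above, here is a minimal Python sketch that extracts the labels from such a file. Only the label:start_position-end_position pattern comes from the dataset description. The rest is assumed for the example: that several labels may appear in one header separated by whitespace, that positions are integers, and the names parse_header and read_labeled_fasta, which are hypothetical helpers and not part of the published scripts.

```python
import re
from typing import Iterable, Iterator, List, Tuple

# Assumed pattern: label:start-end with integer positions.
# The label is taken to be any run of characters without whitespace or ':'.
LABEL_RE = re.compile(r"([^\s:]+):(\d+)-(\d+)")

Label = Tuple[str, int, int]


def parse_header(header: str) -> List[Label]:
    """Return the (label, start, end) triples found in one FASTA header."""
    return [
        (m.group(1), int(m.group(2)), int(m.group(3)))
        for m in LABEL_RE.finditer(header)
    ]


def read_labeled_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, List[Label], str]]:
    """Yield (header, labels, sequence) for each record in a FASTA stream."""
    header = None
    chunks: List[str] = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                yield header, parse_header(header), "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)  # sequence may span several lines
    if header is not None:
        yield header, parse_header(header), "".join(chunks)


if __name__ == "__main__":
    # Made-up record, for illustration only; not taken from the dataset.
    example = [">read1 V:1-5 J:8-12", "ACGTAC", "GTACGT"]
    for name, labels, seq in read_labeled_fasta(example):
        print(name, labels, len(seq))
```

To read the real dataset, pass an open file handle to read_labeled_fasta. Whether the positions are 0-based or 1-based, and whether the end position is inclusive, should be checked against the dataset's README before slicing sequences.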
PLOS ONE, 2016, doi:10.1371/journal.pone.0166126
Leukemia Research, 2017, doi:10.1016/j.leukres.2016.11.009
British Journal of Haematology, 2016, doi:10.1111/bjh.13981