Abstract. The rapid development of high-throughput sequencing technologies means that hundreds of gigabytes of sequencing data can be produced in a single