Please use this identifier to cite or link to this item: http://hdl.handle.net/10773/37053
Full metadata record
DC Field | Value | Language
dc.contributor.author | Matos, Luís M. O. | pt_PT
dc.contributor.author | Neves, António J. R. | pt_PT
dc.contributor.author | Pratas, Diogo | pt_PT
dc.contributor.author | Pinho, Armando J. | pt_PT
dc.date.accessioned | 2023-04-14T14:06:18Z | -
dc.date.available | 2023-04-14T14:06:18Z | -
dc.date.issued | 2015 | -
dc.identifier.uri | http://hdl.handle.net/10773/37053 | -
dc.description.abstract | In the last decade, the cost of genomic sequencing has been decreasing so much that researchers all over the world accumulate huge amounts of data for present and future use. These genomic data need to be efficiently stored, because storage cost is not decreasing as fast as the cost of sequencing. In order to overcome this problem, the most popular general-purpose compression tool, gzip, is usually used. However, these tools were not specifically designed to compress this kind of data, and often fall short when the intention is to reduce the data size as much as possible. There are several compression algorithms available, even for genomic data, but very few have been designed to deal with Whole Genome Alignments, containing alignments between entire genomes of several species. In this paper, we present a lossless compression tool, MAFCO, specifically designed to compress MAF (Multiple Alignment Format) files. Compared to gzip, the proposed tool attains a compression gain from 34% to 57%, depending on the data set. When compared to a recent dedicated method, which is not compatible with some data sets, the compression gain of MAFCO is about 9%. Both source-code and binaries for several operating systems are freely available for non-commercial use at: http://bioinformatics.ua.pt/software/mafco. | pt_PT
dc.language.iso | eng | pt_PT
dc.publisher | PLoS | pt_PT
dc.relation | info:eu-repo/grantAgreement/FCT/FARH/SFRH%2FBD%2F86531%2F2012/PT | pt_PT
dc.relation | info:eu-repo/grantAgreement/FCT/6820 - DCRRNI ID/PEst-C%2FEEI%2FUI0127%2F2011/PT | pt_PT
dc.rights | openAccess | pt_PT
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | pt_PT
dc.subject | Data compression | pt_PT
dc.subject | Genomics | pt_PT
dc.subject | Time factors | pt_PT
dc.subject | Sequence alignment | pt_PT
dc.title | MAFCO: a compression tool for MAF files | pt_PT
dc.type | article | pt_PT
dc.description.version | published | pt_PT
dc.peerreviewed | yes | pt_PT
degois.publication.issue | 3 | pt_PT
degois.publication.title | PLoS ONE | pt_PT
degois.publication.volume | 10 | pt_PT
dc.identifier.doi | 10.1371/journal.pone.0116082 | pt_PT
dc.identifier.essn | 1932-6203 | pt_PT
dc.identifier.articlenumber | e0116082 | pt_PT
Appears in Collections: IEETA - Artigos

Files in This Item:
File | Description | Size | Format
file.pdf | | 637.62 kB | Adobe PDF


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.