Please use this identifier to cite or link to this item: http://hdl.handle.net/10773/6913
Full metadata record
DC FieldValueLanguage
dc.contributor.authorFreitas, A.pt
dc.contributor.authorAfreixo, V.pt
dc.contributor.authorPinheiro, M.pt
dc.contributor.authorOliveira, J.L.pt
dc.contributor.authorMoura, G.pt
dc.contributor.authorSantos, M.pt
dc.date.accessioned2012-02-27T11:52:44Z-
dc.date.issued2011-
dc.identifier.issn1932-1864pt
dc.identifier.urihttp://hdl.handle.net/10773/6913-
dc.description.abstractThe iterative signature algorithm (ISA) has become very attractive to detect co-regulated genes from microarray data matrices and can be a useful tool for the identification of similar patterns in many other kinds of numerical data matrices. Nevertheless, its algorithmic strategy exhibits some limitations since it is based on statistical behavior of the average and considers averages weighted by scores not necessarily positive. Hence, we propose to take the median instead of the average and to use absolutes scores in ISA's structure. Furthermore, a generalized function is also introduced in the algorithm in order to improve its algorithmic strategy for detecting high value or low value biclusters. The effects of these simple modifications on the performance of the biclustering algorithm are evaluated through an experimental comparative study involving synthetic data sets and real data from the organism Saccharomyces cerevisiae. The experimental results show that the proposed variations of ISA outperform the original version in many situations. Absolute scores in ISA are shown to be essential for the correct interpretation of the biclusters found by the algorithm. The median instead of the average turns the biclustering algorithm more resilient to outliers in the data sets. Copyright © 2011 Wiley Periodicals, Inc.pt
dc.language.isoengpt
dc.publisherWileypt
dc.relationPTDC/MAT/72974/2006pt
dc.relationCenter of Research and Development in Mathematics and Applicationspt
dc.relation.urihttp://www.scopus.com/inward/record.url?eid=2-s2.0-79551712212&partnerID=40&md5=00237dd2f9c03663e39acac7b0f85dbd-
dc.rightsopenAccesspor
dc.subjectBiclusteringpt
dc.subjectCodonpt
dc.subjectIterative signature algorithmpt
dc.subjectMedianpt
dc.subjectMicroarraypt
dc.titleImproving the performance of the iterative signature algorithm for the identification of relevant patternspt
dc.typearticlept
dc.peerreviewedyespt
ua.distributioninternationalpt
degois.publication.firstPage71pt
degois.publication.issue1-
degois.publication.lastPage83pt
degois.publication.titleStatistical Analysis and Data Miningpt
degois.publication.volume4pt
dc.date.embargo10000-01-01-
dc.identifier.doi10.1002/sam.10104*
Appears in Collections:DMat - Artigos

Files in This Item:
File Description SizeFormat 
FreitasEtAl2011.pdf888.92 kBAdobe PDFView/Open


FacebookTwitterLinkedIn
Formato BibTex MendeleyEndnote Degois 

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.