Sequence length variation, indel costs, and congruence in sensitivity analysis

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

Sequence length variation, indel costs, and congruence in sensitivity analysis. / Aagesen, Lone; Petersen, Gitte; Seberg, Ole.

I: Cladistics, Bind 21, Nr. 1, 2005, s. 15-20.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Aagesen, L, Petersen, G & Seberg, O 2005, 'Sequence length variation, indel costs, and congruence in sensitivity analysis', Cladistics, bind 21, nr. 1, s. 15-20. <http://www3.interscience.wiley.com/cgi-bin/fulltext/118656656/HTMLSTART>

APA

Aagesen, L., Petersen, G., & Seberg, O. (2005). Sequence length variation, indel costs, and congruence in sensitivity analysis. Cladistics, 21(1), 15-20. http://www3.interscience.wiley.com/cgi-bin/fulltext/118656656/HTMLSTART

Vancouver

Aagesen L, Petersen G, Seberg O. Sequence length variation, indel costs, and congruence in sensitivity analysis. Cladistics. 2005;21(1):15-20.

Author

Aagesen, Lone ; Petersen, Gitte ; Seberg, Ole. / Sequence length variation, indel costs, and congruence in sensitivity analysis. I: Cladistics. 2005 ; Bind 21, Nr. 1. s. 15-20.

Bibtex

@article{5c2c8d7074c311dbbee902004c4f4f50,
title = "Sequence length variation, indel costs, and congruence in sensitivity analysis",
abstract = "The behavior of two topological and four character-based congruence measures was explored using different indel treatments in three empirical data sets, each with different alignment difficulties. The analyses were done using direct optimization within a sensitivity analysis framework in which the cost of indels was varied. Indels were treated either as a fifth character state, or strings of contiguous gaps were considered single events by using linear affine gap cost. Congruence consistently improved when indels were treated as single events, but no congruence measure appeared as the obviously preferable one. However, when combining enough data, all congruence measures clearly tended to select the same alignment cost set as the optimal one. Disagreement among congruence measures was mostly caused by a dominant fragment or a data partition that included all or most of the length variation in the data set. Dominance was easily detected, as the character-based congruence measures approached their optimal value when indel costs were incremented. Dominance of a fragment or data partition was overwhelmed when new sequence length-variable fragments or data partitions were added.",
author = "Lone Aagesen and Gitte Petersen and Ole Seberg",
year = "2005",
language = "English",
volume = "21",
pages = "15--20",
journal = "Cladistics",
issn = "0748-3007",
publisher = "Wiley-Blackwell",
number = "1",

}

RIS

TY - JOUR

T1 - Sequence length variation, indel costs, and congruence in sensitivity analysis

AU - Aagesen, Lone

AU - Petersen, Gitte

AU - Seberg, Ole

PY - 2005

Y1 - 2005

N2 - The behavior of two topological and four character-based congruence measures was explored using different indel treatments in three empirical data sets, each with different alignment difficulties. The analyses were done using direct optimization within a sensitivity analysis framework in which the cost of indels was varied. Indels were treated either as a fifth character state, or strings of contiguous gaps were considered single events by using linear affine gap cost. Congruence consistently improved when indels were treated as single events, but no congruence measure appeared as the obviously preferable one. However, when combining enough data, all congruence measures clearly tended to select the same alignment cost set as the optimal one. Disagreement among congruence measures was mostly caused by a dominant fragment or a data partition that included all or most of the length variation in the data set. Dominance was easily detected, as the character-based congruence measures approached their optimal value when indel costs were incremented. Dominance of a fragment or data partition was overwhelmed when new sequence length-variable fragments or data partitions were added.

AB - The behavior of two topological and four character-based congruence measures was explored using different indel treatments in three empirical data sets, each with different alignment difficulties. The analyses were done using direct optimization within a sensitivity analysis framework in which the cost of indels was varied. Indels were treated either as a fifth character state, or strings of contiguous gaps were considered single events by using linear affine gap cost. Congruence consistently improved when indels were treated as single events, but no congruence measure appeared as the obviously preferable one. However, when combining enough data, all congruence measures clearly tended to select the same alignment cost set as the optimal one. Disagreement among congruence measures was mostly caused by a dominant fragment or a data partition that included all or most of the length variation in the data set. Dominance was easily detected, as the character-based congruence measures approached their optimal value when indel costs were incremented. Dominance of a fragment or data partition was overwhelmed when new sequence length-variable fragments or data partitions were added.

M3 - Journal article

VL - 21

SP - 15

EP - 20

JO - Cladistics

JF - Cladistics

SN - 0748-3007

IS - 1

ER -

ID: 92029