- Title
- Experimental investigation of three machine learning algorithms for ITS dataset
- Creator
- Yearwood, John; Kang, Byeongho; Kelarev, Andrei
- Date
- 2009
- Type
- Text; Conference paper
- Identifier
- http://researchonline.federation.edu.au/vital/access/HandleResolver/1959.17/39901
- Identifier
- vital:3581
- Identifier
-
https://doi.org/10.1007/978-3-642-10509-8_34
- Identifier
- ISBN:9783642105081
- Abstract
- The present article is devoted to experimental investigation of the performance of three machine learning algorithms for ITS dataset in their ability to achieve agreement with classes published in the biologi cal literature before. The ITS dataset consists of nuclear ribosomal DNA sequences, where rather sophisticated alignment scores have to be used as a measure of distance. These scores do not form a Minkowski metric and the sequences cannot be regarded as points in a finite dimensional space. This is why it is necessary to develop novel machine learning ap proaches to the analysis of datasets of this sort. This paper introduces a k-committees classifier and compares it with the discrete k-means and Nearest Neighbour classifiers. It turns out that all three machine learning algorithms are efficient and can be used to automate future biologically significant classifications for datasets of this kind. A simplified version of a synthetic dataset, where the k-committees classifier outperforms k-means and Nearest Neighbour classifiers, is also presented.
- Publisher
- Jeju Island, Korea : Springer
- Relation
- Paper presented at First International Conference, FGIT 2009, Future Generation Information Technology, Jeju Island, Korea : 10th-12th December 2009 Vol. 5899, p. 308-316
- Rights
- Copyright Springer
- Rights
- Open Access
- Rights
- This metadata is freely available under a CCO license
- Subject
- Classification; Learning algorithms; ITS dataset
- Full Text
- Hits: 1827
- Visitors: 2221
- Downloads: 422
Thumbnail | File | Description | Size | Format | |||
---|---|---|---|---|---|---|---|
View Details Download | SOURCE1 | Accepted version | 140 KB | Adobe Acrobat PDF | View Details Download |