- Title
- Experimental investigation of clasification algorithms for ITS dataset
- Creator
- Yearwood, John; Kang, Byeongho; Kelarev, Andrei
- Date
- 2008
- Type
- Text; Conference paper
- Identifier
- http://researchonline.federation.edu.au/vital/access/HandleResolver/1959.17/31707
- Identifier
- vital:5184
- Abstract
- This article is devoted to experimental investigation of classification algorithms for analysis of ITS dataset. We introduce and consider a novel k-committees alogorithm for classification and compare it with the discrete k- means and nearest neighbour algorithms. The ITS dataset consists of nuclear ribosomal DNA sequences, where rather sophisticated alignment scores have to be used as a measure of distance. These scores do not form Minkowski metric and the sequences cannot be regarded as points in a finite dimensional space. This is why it is necessary to develop novel algorithms and adjust familiar ones. We present the results of experiments comparing the efficiency of three classification methods in their ability to achieve agreement with classes published in the biological literature before. It turns out that our algorithms are efficient and can be used to obtain biologically significant classifications. A simplified version of a synthetic dataset, where the k-committees classifier out performs k-means and Nearest Neighbour classifiers, is also presented.; E1
- Publisher
- Hanoi, Vietnam Pacific Rim International Conferences on Artificial Intelligence (PRICAI) & University of Tasmania
- Relation
- PKAW-08, Pacific Rim Knowledge Acquisition Workshop 2008, as part of PRICAI 2008, Tenth Pacific Rim p. 262-272
- Rights
- Copyright unknown
- Rights
- This metadata is freely available under a CCO license
- Subject
- Classification; Data mining
- Reviewed
- Hits: 1240
- Visitors: 1238
- Downloads: 3
Thumbnail | File | Description | Size | Format |
---|