Visual tools for analysing evolution, emergence, and error in data streams
- Hart, Sol, Yearwood, John, Bagirov, Adil
- Authors: Hart, Sol , Yearwood, John , Bagirov, Adil
- Date: 2007
- Type: Text , Conference paper
- Relation: Paper presented at 6th IEEE/ACIS International Conference on Computer and Information Science, ICIS 2007, Melbourne, Victoria : 11th-13th July 2007 p. 987-992
- Full Text:
- Description: The relatively new field of stream mining has necessitated the development of robust drift-aware algorithms that provide accurate, real time, data handling capabilities. Tools are needed to assess and diagnose important trends and investigate drift evolution parameters. In this paper, we present two new and novel visualisation techniques, Pixie and Luna graphs, which incorporate salient group statistics coupled with intuitive visual representations of multidimensional groupings over time. Through the novel representations presented here, spatial interactions between temporal divisions can be diagnosed and overall distribution patterns identified. It provides a means of evaluating in non-constrained capacity, commonly constrained evolutionary problems.
- Description: 2003005432
- Authors: Hart, Sol , Yearwood, John , Bagirov, Adil
- Date: 2007
- Type: Text , Conference paper
- Relation: Paper presented at 6th IEEE/ACIS International Conference on Computer and Information Science, ICIS 2007, Melbourne, Victoria : 11th-13th July 2007 p. 987-992
- Full Text:
- Description: The relatively new field of stream mining has necessitated the development of robust drift-aware algorithms that provide accurate, real time, data handling capabilities. Tools are needed to assess and diagnose important trends and investigate drift evolution parameters. In this paper, we present two new and novel visualisation techniques, Pixie and Luna graphs, which incorporate salient group statistics coupled with intuitive visual representations of multidimensional groupings over time. Through the novel representations presented here, spatial interactions between temporal divisions can be diagnosed and overall distribution patterns identified. It provides a means of evaluating in non-constrained capacity, commonly constrained evolutionary problems.
- Description: 2003005432
Predicting Australian stock market index using neural networks exploiting dynamical swings and intermarket influences
- Pan, Heping, Tilakaratne, Chandima, Yearwood, John
- Authors: Pan, Heping , Tilakaratne, Chandima , Yearwood, John
- Date: 2005
- Type: Text , Journal article
- Relation: Journal of Research and Practice in Information Technology Vol. 37, no. 1 (2005), p. 43-55
- Full Text:
- Reviewed:
- Description: This paper presents a computational approach for predicting the Australian stock market index AORD using multi-layer feed-forward neural networks front the time series data of AORD and various interrelated markets. This effort aims to discover an effective neural network, or a set of adaptive neural networks for this prediction purpose, which can exploit or model various dynamical swings and inter-market influences discovered from professional technical analysis and quantitative analysis. Within a limited range defined by our empirical knowledge, three aspects of effectiveness on data selection are considered: effective inputs from the target market (AORD) itself, a sufficient set of interrelated markets,. and effective inputs from the interrelated markets. Two traditional dimensions of the neural network architecture are also considered: the optimal number of hidden layers, and the optimal number of hidden neurons for each hidden layer. Three important results were obtained: A 6-day cycle was discovered in the Australian stock market during the studied period; the time signature used as additional inputs provides useful information; and a basic neural network using six daily returns of AORD and one daily, returns of SP500 plus the day of the week as inputs exhibits up to 80% directional prediction correctness.
- Description: C1
- Description: 2003001440
- Authors: Pan, Heping , Tilakaratne, Chandima , Yearwood, John
- Date: 2005
- Type: Text , Journal article
- Relation: Journal of Research and Practice in Information Technology Vol. 37, no. 1 (2005), p. 43-55
- Full Text:
- Reviewed:
- Description: This paper presents a computational approach for predicting the Australian stock market index AORD using multi-layer feed-forward neural networks front the time series data of AORD and various interrelated markets. This effort aims to discover an effective neural network, or a set of adaptive neural networks for this prediction purpose, which can exploit or model various dynamical swings and inter-market influences discovered from professional technical analysis and quantitative analysis. Within a limited range defined by our empirical knowledge, three aspects of effectiveness on data selection are considered: effective inputs from the target market (AORD) itself, a sufficient set of interrelated markets,. and effective inputs from the interrelated markets. Two traditional dimensions of the neural network architecture are also considered: the optimal number of hidden layers, and the optimal number of hidden neurons for each hidden layer. Three important results were obtained: A 6-day cycle was discovered in the Australian stock market during the studied period; the time signature used as additional inputs provides useful information; and a basic neural network using six daily returns of AORD and one daily, returns of SP500 plus the day of the week as inputs exhibits up to 80% directional prediction correctness.
- Description: C1
- Description: 2003001440
Using links to aid web classification
- Xie, Wei, Mammadov, Musa, Yearwood, John
- Authors: Xie, Wei , Mammadov, Musa , Yearwood, John
- Date: 2007
- Type: Text , Conference paper
- Relation: Paper presented at 6th IEEE/ACIS International Conference on Computer and Information Science, ICIS 2007, Melbourne, Victoria : 11th-13th July 2007 p. 981-986
- Full Text:
- Description: In this paper, we will present a new approach of using link information to improve the accuracy and efficiency of web classification. However, different from others, we only use the mappings between linked documents and their own class or classes. In this case, we only need to add a few features called linked-class features into the datasets. We apply SVM and BoosTexter for classification. We show that the classification accuracy can be improved based on mixtures of ordinary word features and out-linked-class features. We analyze and discuss the reason of this improvement.
- Description: 2003005438
- Authors: Xie, Wei , Mammadov, Musa , Yearwood, John
- Date: 2007
- Type: Text , Conference paper
- Relation: Paper presented at 6th IEEE/ACIS International Conference on Computer and Information Science, ICIS 2007, Melbourne, Victoria : 11th-13th July 2007 p. 981-986
- Full Text:
- Description: In this paper, we will present a new approach of using link information to improve the accuracy and efficiency of web classification. However, different from others, we only use the mappings between linked documents and their own class or classes. In this case, we only need to add a few features called linked-class features into the datasets. We apply SVM and BoosTexter for classification. We show that the classification accuracy can be improved based on mixtures of ordinary word features and out-linked-class features. We analyze and discuss the reason of this improvement.
- Description: 2003005438
A new scoring system in Cystic Fibrosis : Statistical tools for database analysis - A preliminary report
- Hafen, Gaudenz, Hurst, Cameron, Yearwood, John, Smith, Julie, Dzalilov, Zari, Robinson, P. J.
- Authors: Hafen, Gaudenz , Hurst, Cameron , Yearwood, John , Smith, Julie , Dzalilov, Zari , Robinson, P. J.
- Date: 2008
- Type: Text , Journal article
- Relation: BMC Medical Informatics and Decision Making Vol. 8, no. 44 (2008), p.1-11
- Full Text:
- Reviewed:
- Description: Background. Cystic fibrosis is the most common fatal genetic disorder in the Caucasian population. Scoring systems for assessment of Cystic fibrosis disease severity have been used for almost 50 years, without being adapted to the milder phenotype of the disease in the 21st century. The aim of this current project is to develop a new scoring system using a database and employing various statistical tools. This study protocol reports the development of the statistical tools in order to create such a scoring system. Methods. The evaluation is based on the Cystic Fibrosis database from the cohort at the Royal Children's Hospital in Melbourne. Initially, unsupervised clustering of the all data records was performed using a range of clustering algorithms. In particular incremental clustering algorithms were used. The clusters obtained were characterised using rules from decision trees and the results examined by clinicians. In order to obtain a clearer definition of classes expert opinion of each individual's clinical severity was sought. After data preparation including expert-opinion of an individual's clinical severity on a 3 point-scale (mild, moderate and severe disease), two multivariate techniques were used throughout the analysis to establish a method that would have a better success in feature selection and model derivation: 'Canonical Analysis of Principal Coordinates' and 'Linear Discriminant Analysis'. A 3-step procedure was performed with (1) selection of features, (2) extracting 5 severity classes out of a 3 severity class as defined per expert-opinion and (3) establishment of calibration datasets. Results. (1) Feature selection: CAP has a more effective "modelling" focus than DA. (2) Extraction of 5 severity classes: after variables were identified as important in discriminating contiguous CF severity groups on the 3-point scale as mild/moderate and moderate/severe, Discriminant Function (DF) was used to determine the new groups mild, intermediate moderate, moderate, intermediate severe and severe disease. (3) Generated confusion tables showed a misclassification rate of 19.1% for males and 16.5% for females, with a majority of misallocations into adjacent severity classes particularly for males. Conclusion. Our preliminary data show that using CAP for detection of selection features and Linear DA to derive the actual model in a CF database might be helpful in developing a scoring system. However, there are several limitations, particularly more data entry points are needed to finalize a score and the statistical tools have further to be refined and validated, with re-running the statistical methods in the larger dataset. © 2008 Hafen et al; licensee BioMed Central Ltd.
- Authors: Hafen, Gaudenz , Hurst, Cameron , Yearwood, John , Smith, Julie , Dzalilov, Zari , Robinson, P. J.
- Date: 2008
- Type: Text , Journal article
- Relation: BMC Medical Informatics and Decision Making Vol. 8, no. 44 (2008), p.1-11
- Full Text:
- Reviewed:
- Description: Background. Cystic fibrosis is the most common fatal genetic disorder in the Caucasian population. Scoring systems for assessment of Cystic fibrosis disease severity have been used for almost 50 years, without being adapted to the milder phenotype of the disease in the 21st century. The aim of this current project is to develop a new scoring system using a database and employing various statistical tools. This study protocol reports the development of the statistical tools in order to create such a scoring system. Methods. The evaluation is based on the Cystic Fibrosis database from the cohort at the Royal Children's Hospital in Melbourne. Initially, unsupervised clustering of the all data records was performed using a range of clustering algorithms. In particular incremental clustering algorithms were used. The clusters obtained were characterised using rules from decision trees and the results examined by clinicians. In order to obtain a clearer definition of classes expert opinion of each individual's clinical severity was sought. After data preparation including expert-opinion of an individual's clinical severity on a 3 point-scale (mild, moderate and severe disease), two multivariate techniques were used throughout the analysis to establish a method that would have a better success in feature selection and model derivation: 'Canonical Analysis of Principal Coordinates' and 'Linear Discriminant Analysis'. A 3-step procedure was performed with (1) selection of features, (2) extracting 5 severity classes out of a 3 severity class as defined per expert-opinion and (3) establishment of calibration datasets. Results. (1) Feature selection: CAP has a more effective "modelling" focus than DA. (2) Extraction of 5 severity classes: after variables were identified as important in discriminating contiguous CF severity groups on the 3-point scale as mild/moderate and moderate/severe, Discriminant Function (DF) was used to determine the new groups mild, intermediate moderate, moderate, intermediate severe and severe disease. (3) Generated confusion tables showed a misclassification rate of 19.1% for males and 16.5% for females, with a majority of misallocations into adjacent severity classes particularly for males. Conclusion. Our preliminary data show that using CAP for detection of selection features and Linear DA to derive the actual model in a CF database might be helpful in developing a scoring system. However, there are several limitations, particularly more data entry points are needed to finalize a score and the statistical tools have further to be refined and validated, with re-running the statistical methods in the larger dataset. © 2008 Hafen et al; licensee BioMed Central Ltd.
A fully automated CAD system using multi-category feature selection with restricted recombination
- Ghosh, Ranadhir, Ghosh, Moumita, Yearwood, John, Mukherjee, Subhasis
- Authors: Ghosh, Ranadhir , Ghosh, Moumita , Yearwood, John , Mukherjee, Subhasis
- Date: 2007
- Type: Text , Conference paper
- Relation: Paper presented at 6th IEEE/ACIS International Conference on Computer and Information Science, ICIS 2007, Melbourne, Victoria : 11th-13th July 2007 p. 106-111
- Full Text:
- Description: In pattern recognition problems features plays an important role for classification results. It is very important which features are used and how many features are used for the classification process. Most of the real life classification problem uses different category of features. It is desirable to find the optimal combination of features that improves the performance of the classifier. There exists different selection framework that selects the features. Mostly do not incorporate the impact of one category of features on another. Even if they incorporate, they produce conflict between the categories. In this paper we proposed a restricted crossover selection framework which incorporate the impact of different categories on each other, as well as it restricts the search within the category which searching in the global region of the search space. The results obtained by the proposed framework are promising.
- Description: 2003005429
- Authors: Ghosh, Ranadhir , Ghosh, Moumita , Yearwood, John , Mukherjee, Subhasis
- Date: 2007
- Type: Text , Conference paper
- Relation: Paper presented at 6th IEEE/ACIS International Conference on Computer and Information Science, ICIS 2007, Melbourne, Victoria : 11th-13th July 2007 p. 106-111
- Full Text:
- Description: In pattern recognition problems features plays an important role for classification results. It is very important which features are used and how many features are used for the classification process. Most of the real life classification problem uses different category of features. It is desirable to find the optimal combination of features that improves the performance of the classifier. There exists different selection framework that selects the features. Mostly do not incorporate the impact of one category of features on another. Even if they incorporate, they produce conflict between the categories. In this paper we proposed a restricted crossover selection framework which incorporate the impact of different categories on each other, as well as it restricts the search within the category which searching in the global region of the search space. The results obtained by the proposed framework are promising.
- Description: 2003005429
Decisions surrounding adverse drug reaction prescribing : Insights from consumers and implications for decision support
- O'Brien, Michelle, Yearwood, John
- Authors: O'Brien, Michelle , Yearwood, John
- Date: 2005
- Type: Text , Journal article
- Relation: Journal of Research and Practice in Information Technology Vol. 37, no. 1 (2005), p. 57-71
- Full Text:
- Reviewed:
- Description: This paper presents findings from case studies of health consumers who each suspect they may have experienced an adverse drug reaction (ADR). These case studies are part of a larger study involving consumer/doctor decisions surrounding suspected adverse drug reactions and prescribing. Decision support to assist with the diagnosis and management of ADRs has, to date, primarily focused on providing in-time information to prescribers about factors that pertain to the consumer and the medications they are taking. Decision support that includes consumers usually targets treatment decisions. The results of this paper indicate the prescriber is only one decision contributor in a rich tapestry of decision contributors and decision types, and consumer decision types are significantly broader than treatment decisions. The results provide guidance for the development of decision support within this domain.
- Description: C1
- Description: 2003001435
- Authors: O'Brien, Michelle , Yearwood, John
- Date: 2005
- Type: Text , Journal article
- Relation: Journal of Research and Practice in Information Technology Vol. 37, no. 1 (2005), p. 57-71
- Full Text:
- Reviewed:
- Description: This paper presents findings from case studies of health consumers who each suspect they may have experienced an adverse drug reaction (ADR). These case studies are part of a larger study involving consumer/doctor decisions surrounding suspected adverse drug reactions and prescribing. Decision support to assist with the diagnosis and management of ADRs has, to date, primarily focused on providing in-time information to prescribers about factors that pertain to the consumer and the medications they are taking. Decision support that includes consumers usually targets treatment decisions. The results of this paper indicate the prescriber is only one decision contributor in a rich tapestry of decision contributors and decision types, and consumer decision types are significantly broader than treatment decisions. The results provide guidance for the development of decision support within this domain.
- Description: C1
- Description: 2003001435
Narrative-based interactive learning environments from modelling reasoning
- Yearwood, John, Stranieri, Andrew
- Authors: Yearwood, John , Stranieri, Andrew
- Date: 2007
- Type: Text , Journal article
- Relation: Educational Technology and Society Vol. 10, no. 3 (2007), p. 192-208
- Full Text:
- Reviewed:
- Description: Narrative and story telling has a long history of use in structuring, organising and communicating human experience. This paper describes a narrative based interactive intelligent learning environment which aims to elucidate practical reasoning using interactive emergent narratives that can be used in training novices in decision making. Its design is based on an approach to generating narrative from knowledge that has been modelled in specific decision/reasoning domains. The approach uses a narrative model that is guided partially by inference and contextual information contained in the particular knowledge representation used, the Generic/Actual argument model of structured reasoning. The approach is described with examples in the area of critical care nursing training and positive learning outcomes are reported. © International Forum of Educational Technology & Society (IFETS).
- Description: C1
- Description: 2003002522
- Authors: Yearwood, John , Stranieri, Andrew
- Date: 2007
- Type: Text , Journal article
- Relation: Educational Technology and Society Vol. 10, no. 3 (2007), p. 192-208
- Full Text:
- Reviewed:
- Description: Narrative and story telling has a long history of use in structuring, organising and communicating human experience. This paper describes a narrative based interactive intelligent learning environment which aims to elucidate practical reasoning using interactive emergent narratives that can be used in training novices in decision making. Its design is based on an approach to generating narrative from knowledge that has been modelled in specific decision/reasoning domains. The approach uses a narrative model that is guided partially by inference and contextual information contained in the particular knowledge representation used, the Generic/Actual argument model of structured reasoning. The approach is described with examples in the area of critical care nursing training and positive learning outcomes are reported. © International Forum of Educational Technology & Society (IFETS).
- Description: C1
- Description: 2003002522
- «
- ‹
- 1
- ›
- »