Evaluating authorship distance methods using the positive Silhouette coefficient
- Authors: Layton, Robert , Watters, Paul , Dazeley, Richard
- Date: 2013
- Type: Text , Journal article
- Relation: Natural Language Engineering Vol. 19, no. 4 (2013), p. 517-535
- Full Text:
- Reviewed:
- Description: Unsupervised Authorship Analysis (UAA) aims to cluster documents by authorship without knowing the authorship of any documents. An important factor in UAA is the method for calculating the distance between documents. This choice of the authorship distance method is considered more critical to the end result than the choice of cluster analysis algorithm. One method for measuring the correlation between a distance metric and a labelling (such as class values or clusters) is the Silhouette Coefficient (SC). The SC can be leveraged by measuring the correlation between the authorship distance method and the true authorship, evaluating the quality of the distance method. However, we show that the SC can be severely affected by outliers. To address this issue, we introduce the Positive Silhouette Coefficient, given as the proportion of instances with a positive SC value. This metric is not easily altered by outliers and produces a more robust metric. A large number of authorship distance methods are then compared using the PSC, and the findings are presented. This research provides an insight into the efficacy of methods for UAA and presents a framework for testing authorship distance methods.
- Description: C1
Recentred local profiles for authorship attribution
- Authors: Layton, Robert , Watters, Paul , Dazeley, Richard
- Date: 2012
- Type: Text , Journal article
- Relation: Natural Language Engineering Vol. 18, no. 3 (2012), p. 293-312
- Full Text:
- Reviewed:
- Description: Authorship attribution methods aim to determine the author of a document, by using information gathered from a set of documents with known authors. One method of performing this task is to create profiles containing distinctive features known to be used by each author. In this paper, a new method of creating an author or document profile is presented that detects features considered distinctive, compared to normal language usage. This recentreing approach creates more accurate profiles than previous methods, as demonstrated empirically using a known corpus of authorship problems. This method, named recentred local profiles, determines authorship accurately using a simple 'best matching author' approach to classification, compared to other methods in the literature. The proposed method is shown to be more stable than related methods as parameter values change. Using a weighted voting scheme, recentred local profiles is shown to outperform other methods in authorship attribution, with an overall accuracy of 69.9% on the ad-hoc authorship attribution competition corpus, representing a significant improvement over related methods. Copyright © Cambridge University Press 2011.
- Description: 2003010688
Automated unsupervised authorship analysis using evidence accumulation clustering
- Authors: Layton, Robert , Watters, Paul , Dazeley, Richard
- Date: 2013
- Type: Text , Journal article
- Relation: Natural Language Engineering Vol. 19, no. 1 (2013), p. 95-120
- Full Text:
- Reviewed:
- Description: Authorship Analysis aims to extract information about the authorship of documents from features within those documents. Typically, this is performed as a classification task with the aim of identifying the author of a document, given a set of documents of known authorship. Alternatively, unsupervised methods have been developed primarily as visualisation tools to assist the manual discovery of clusters of authorship within a corpus by analysts. However, there is a need in many fields for more sophisticated unsupervised methods to automate the discovery, profiling and organisation of related information through clustering of documents by authorship. An automated and unsupervised methodology for clustering documents by authorship is proposed in this paper. The methodology is named NUANCE, for n-gram Unsupervised Automated Natural Cluster Ensemble. Testing indicates that the derived clusters have a strong correlation to the true authorship of unseen documents. © 2011 Cambridge University Press.
- Description: 2003010584
Stuttering, disability and the higher education sector in Australia
- Authors: Meredith, Grant , Packman, Ann , Marks, Genee
- Date: 2012
- Type: Text , Journal article
- Relation: International Journal of Speech-Language Pathology Vol. 14, no. 4 (2012), p. 370-376
- Full Text:
- Reviewed:
- Description: The aim of this study was to ascertain the extent to which Australian public universities and their associated disability liaison services offer web-based information for current or prospective students who stutter. The disability pages of the websites of all 39 public universities in Australia were visited and the information about disability services assessed according to 12 criteria developed by the authors. Results indicate that there is a dearth of information on Australian university websites available for students or prospective students who stutter. Only 13% of the sites reported any form of alternative teaching and assessment procedures for speech-impaired students and only 51% of 39 disability liaison officers responded when contacted by email. Such a student could not make an informed choice to enrol in a university based upon the information on disability services available on public Australian university websites. © 2012 The Speech Pathology Association of Australia Limited.
A student-centred approach : the english language support service for international students
- Authors: Pantelich, Melania
- Date: 2021
- Type: Text , Journal article
- Relation: Journal of Academic Language and Learning Vol. 15, no. 1 (2021), p. 72-84
- Full Text:
- Reviewed:
- Description: This article outlines the purpose, development and delivery of the English Language Support Service (ELSS), which is offered to international students in their first year of study at a medium-sized university in regional Victoria, Australia. Additionally, this article explains how the support provided is contextualised, timely and appropriate to student needs, allowing students to take on new concepts with meaning and immediate application, in conjunction with their degree coursework. ELSS has been specifically designed to aid international students with their initial exposure and transition to studying in an Australian context. It aims to help international students become more assured in their place at university, and acclimatise to the Australian academic language, culture and landscape enough in order to subsequently engage confidently with their assignments and the remainder of their degree.
The critical discourse analysis paradox : a brief research reflection
- Authors: Terry, Daniel
- Date: 2013
- Type: Text , Journal article
- Relation: Internet Journal of Language, Culture and Society Vol. 38, no. (2013), p. 42-44
- Full Text:
- Reviewed:
- Description: Critical Discourse Analysis (CDA) is a means of criticising or critiquing the social order of power, inequality and hegemony in language. Within a doctoral study CDA was used to determine if social power, dominance, and inequality are enacted and reproduced through the text and talk of key participants. A reflection of the researcher experiences is provided as the results were analysed and prepared for publication. The discussion highlights there are other also discourses of power and hegemony which may impact researchers and authors themselves as they report and discuss discourse which marginalises those individuals and groups for whom the research is being conducted. As researchers and academics attempt to articulate and discuss discourse which marginalise and stigmatise, they need to acknowledge and recognise the discourse, which impacts their own ability to advocate for change, adjustment and empowerment.
Focus groups and ELICOS evaluation
- Authors: Zeegers, Margaret
- Date: 2002
- Type: Text , Journal article
- Relation: English Australia Journal Vol. 20, no. 1 (2002), p. 17-23
- Full Text:
- Reviewed:
- Description: C1
- Description: 2003000125