Evaluating authorship distance methods using the positive Silhouette coefficient
- Authors: Layton, Robert , Watters, Paul , Dazeley, Richard
- Date: 2013
- Type: Text , Journal article
- Relation: Natural Language Engineering Vol. 19, no. 4 (2013), p. 517-535
- Full Text:
- Reviewed:
- Description: Unsupervised Authorship Analysis (UAA) aims to cluster documents by authorship without knowing the authorship of any documents. An important factor in UAA is the method for calculating the distance between documents. This choice of the authorship distance method is considered more critical to the end result than the choice of cluster analysis algorithm. One method for measuring the correlation between a distance metric and a labelling (such as class values or clusters) is the Silhouette Coefficient (SC). The SC can be leveraged by measuring the correlation between the authorship distance method and the true authorship, evaluating the quality of the distance method. However, we show that the SC can be severely affected by outliers. To address this issue, we introduce the Positive Silhouette Coefficient, given as the proportion of instances with a positive SC value. This metric is not easily altered by outliers and produces a more robust metric. A large number of authorship distance methods are then compared using the PSC, and the findings are presented. This research provides an insight into the efficacy of methods for UAA and presents a framework for testing authorship distance methods.
- Description: C1
Recentred local profiles for authorship attribution
- Authors: Layton, Robert , Watters, Paul , Dazeley, Richard
- Date: 2012
- Type: Text , Journal article
- Relation: Natural Language Engineering Vol. 18, no. 3 (2012), p. 293-312
- Full Text:
- Reviewed:
- Description: Authorship attribution methods aim to determine the author of a document, by using information gathered from a set of documents with known authors. One method of performing this task is to create profiles containing distinctive features known to be used by each author. In this paper, a new method of creating an author or document profile is presented that detects features considered distinctive, compared to normal language usage. This recentreing approach creates more accurate profiles than previous methods, as demonstrated empirically using a known corpus of authorship problems. This method, named recentred local profiles, determines authorship accurately using a simple 'best matching author' approach to classification, compared to other methods in the literature. The proposed method is shown to be more stable than related methods as parameter values change. Using a weighted voting scheme, recentred local profiles is shown to outperform other methods in authorship attribution, with an overall accuracy of 69.9% on the ad-hoc authorship attribution competition corpus, representing a significant improvement over related methods. Copyright © Cambridge University Press 2011.
- Description: 2003010688
Automated unsupervised authorship analysis using evidence accumulation clustering
- Authors: Layton, Robert , Watters, Paul , Dazeley, Richard
- Date: 2013
- Type: Text , Journal article
- Relation: Natural Language Engineering Vol. 19, no. 1 (2013), p. 95-120
- Full Text:
- Reviewed:
- Description: Authorship Analysis aims to extract information about the authorship of documents from features within those documents. Typically, this is performed as a classification task with the aim of identifying the author of a document, given a set of documents of known authorship. Alternatively, unsupervised methods have been developed primarily as visualisation tools to assist the manual discovery of clusters of authorship within a corpus by analysts. However, there is a need in many fields for more sophisticated unsupervised methods to automate the discovery, profiling and organisation of related information through clustering of documents by authorship. An automated and unsupervised methodology for clustering documents by authorship is proposed in this paper. The methodology is named NUANCE, for n-gram Unsupervised Automated Natural Cluster Ensemble. Testing indicates that the derived clusters have a strong correlation to the true authorship of unseen documents. © 2011 Cambridge University Press.
- Description: 2003010584
Stuttering, disability and the higher education sector in Australia
- Authors: Meredith, Grant , Packman, Ann , Marks, Genee
- Date: 2012
- Type: Text , Journal article
- Relation: International Journal of Speech-Language Pathology Vol. 14, no. 4 (2012), p. 370-376
- Full Text:
- Reviewed:
- Description: The aim of this study was to ascertain the extent to which Australian public universities and their associated disability liaison services offer web-based information for current or prospective students who stutter. The disability pages of the websites of all 39 public universities in Australia were visited and the information about disability services assessed according to 12 criteria developed by the authors. Results indicate that there is a dearth of information on Australian university websites available for students or prospective students who stutter. Only 13% of the sites reported any form of alternative teaching and assessment procedures for speech-impaired students and only 51% of 39 disability liaison officers responded when contacted by email. Such a student could not make an informed choice to enrol in a university based upon the information on disability services available on public Australian university websites. © 2012 The Speech Pathology Association of Australia Limited.
Technology and the evolution of clinical methods for stuttering
- Authors: Packman, Ann , Meredith, Grant
- Date: 2011
- Type: Text , Journal article
- Relation: Journal of Fluency Disorders Vol. 36, no. 2 (2011), p. 75-85
- Full Text: false
- Reviewed:
- Description: The World Wide Web (WWW) was 20 years old last year. Enormous amounts of information about stuttering are now available to anyone who can access the Internet. Compared to 20 years ago, people who stutter and their families can now make more informed choices about speech-language interventions, from a distance. Blogs and chat rooms provide opportunities for people who stutter to share their experiences from a distance and to support one another. New technologies are also being adopted into speech-language pathology practice and service delivery. Telehealth is an exciting development as it means that treatment can now be made available to many rural and remotely located people who previously did not have access to it. Possible future technological developments for speech-language pathology practice include Internet based treatments and the use of Virtual Reality. Having speech and CBT treatments for stuttering available on the Internet would greatly increase their accessibility. Second Life also has exciting possibilities for people who stutter.Educational objectives: The reader will (1) explain how people who stutter and their families can get information about stuttering from the World Wide Web, (2) discuss how new technologies have been applied in speech-language pathology practice, and (3) summarize the principles and practice of telehealth delivery of services for people who stutter and their families. © 2011 Elsevier Inc.
Speech-language pathology intervention in a youth justice setting : Benefits perceived by staff extend beyond communication
- Authors: Snow, Pamela , Bagley, Kerryn , White, Donna
- Date: 2018
- Type: Text , Journal article
- Relation: International Journal of Speech-Language Pathology Vol. 20, no. 4 (2018), p. 458-467
- Full Text: false
- Reviewed:
- Description: Purpose: Young people in youth justice (YJ) settings face high-risk for unidentified language disorder, however, speech-language pathology (SLP) services are not routinely offered in such settings. The aim of this study was to explore and describe the perceptions and experiences of YJ staff in a custodial centre of the utility of having a speech-language pathologist working with young offenders. Method: Following a SLP intervention trial, two staff focus groups were conducted by an independent SLP. Interview probes were developed through review of the literature and consultation with the practitioner who conducted the clinical intervention. Focus groups were digitally recorded for thematic analysis, which was carried out by the three authors independently. Result: YJ staff expressed consistently positive views about the SLP intervention trial in their centre. Staff indicated that they learnt a great deal about the complexity of communication difficulties in this population, and that this information informed and guided their own practices. They expressed surprise at the engagement of young people in the SLP service, and supported its embedding in the YJ setting. Conclusion: YJ staff endorsed the value of a SLP service in a custodial setting. Further research should focus on refining measurement of this service and its impact. © 2017,