- Title
- Authorship attribution of IRC messages using inverse author frequency
- Creator
- Layton, Robert; McCombie, Stephen; Watters, Paul
- Date
- 2012
- Type
- Text; Conference proceedings
- Identifier
- http://researchonline.federation.edu.au/vital/access/HandleResolver/1959.17/69038
- Identifier
- vital:4965
- Identifier
-
https://doi.org/10.1109/CTC.2012.11
- Abstract
- Internet Relay Chat (IRC) is a useful and relativelysimple protocol for text based chat online, used in a variety ofareas online such as for discussion and technical support. IRC isalso used for cybercrime, with online rooms selling stolen creditcard details, botnet access and malware. The reasons for theuse of IRC in cybercrime include the widespread adoption andease of use, but also focus around the anonymity granted bythe protocol, allowing users to hide behind aliases that can bechanged regularly. In this research, we apply authorship analysistechniques to be able to attribute chat messages to known aliases.A preliminary experiment shows that this application is verydifficult, due to the short messages and repeated information.To improve the accuracy, we apply inverse-author-frequency(iaf) weighting, which gives higher weights to features used byfewer authors. This research is the first time that iaf has beenapplied to character n-gram models, previously being applied toword based models of authorship. We find that this improvesthe accuracy significantly for the RLP method and provides aplatform for successful applications of authorship analysis in thefuture. Overall, the method achieves accuracies of over 55% ina very difficult application domain. © 2012 IEEE.
- Publisher
- Ballarat, VIC IEEE Computer Society Conference Publishing Services
- Rights
- Copyright 2012 IEEE
- Rights
- This metadata is freely available under a CCO license
- Subject
- Attribution; Authorship analysis; Cybercrime; IRC; OSINT
- Hits: 1405
- Visitors: 1408
- Downloads: 0
Thumbnail | File | Description | Size | Format |
---|