Jagadeesh Patchala


2018

Authorship Attribution By Consensus Among Multiple Features
Jagadeesh Patchala | Raj Bhatnagar
Proceedings of the 27th International Conference on Computational Linguistics

Most existing research on authorship attribution uses various lexical, syntactic and semantic features. In this paper we demonstrate an effective template-based approach for combining various syntactic features of a document for authorship analysis. The parse-tree-based features that we propose are independent of the topic of a document and reflect the innate writing styles of authors. We show that the use of templates that include sub-trees of parse trees, in conjunction with other syntactic features, results in improved author attribution rates. Another contribution is the demonstration that combining evidence from syntactic features with Dempster’s rule performs better than other evidence-combination methods. We also demonstrate that our methodology works well in the case where the actual author is not included in the candidate author set.
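As an illustration of the evidence-combination step mentioned in the abstract, the following is a minimal sketch of Dempster's rule of combination for two mass functions over a set of candidate authors. It is not the authors' implementation: the function name `dempster_combine`, the author labels, and the mass values are hypothetical and chosen only to show the arithmetic. Mass assigned to the full candidate set stands for uncommitted belief.

```python
from itertools import product


def dempster_combine(m1, m2):
    """Combine two mass functions with Dempster's rule.

    Each mass function maps a frozenset of hypotheses to a mass,
    and the masses in each function are assumed to sum to 1.
    """
    combined = {}
    conflict = 0.0
    for (b, mass_b), (c, mass_c) in product(m1.items(), m2.items()):
        intersection = b & c
        if intersection:
            # Agreeing evidence: the product goes to the intersection.
            combined[intersection] = combined.get(intersection, 0.0) + mass_b * mass_c
        else:
            # Disjoint focal sets: the product counts as conflict.
            conflict += mass_b * mass_c
    if conflict >= 1.0:
        raise ValueError("Sources are in total conflict; the rule is undefined.")
    # Renormalize by the non-conflicting mass.
    return {focal: mass / (1.0 - conflict) for focal, mass in combined.items()}


# Hypothetical candidate authors and hypothetical evidence from two feature types.
candidates = frozenset({"author_a", "author_b", "author_c"})
evidence_1 = {frozenset({"author_a"}): 0.6, candidates: 0.4}
evidence_2 = {
    frozenset({"author_a"}): 0.5,
    frozenset({"author_b"}): 0.3,
    candidates: 0.2,
}

fused = dempster_combine(evidence_1, evidence_2)
for focal, mass in sorted(fused.items(), key=lambda item: -item[1]):
    print(sorted(focal), round(mass, 4))
```

In this toy example the two sources disagree on part of their mass (one supports `author_a`, the other partly supports `author_b`); that conflicting portion is discarded and the remainder is rescaled so the fused masses again sum to 1.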