| A Workshop on Machine Learning in Natural Language Processing |
Organizers: Shalom Lapin and Ido Dagan |
The Authorship Attribution Problem: Variations and Solutions
Moshe Koppel
Hebrew University
Abstract: In the standard authorship attribution problem, we are told that the author of an anonymous document is one of a given set of suspects and are asked to choose the likeliest candidate among them based on their respective known writings. Posed this way, the problem is a reasonably straightforward text categorization problem. In this talk, we will consider variations on the standard problem, focusing especially on the case in which there may be tens of thousands of candidate authors. |