Noisy Text Categorization

Vinciarelli, A. (2004) Noisy Text Categorization. In: 17th International Conference on Pattern Recognition (ICPR'04), Cambridge UK, August 23-26, 2004, pp. 554-557. (doi:10.1109/ICPR.2004.1334303)

Full text not currently available from Enlighten.

Publisher's URL: http://doi.ieeecomputersociety.org/10.1109/ICPR.2004.1334303

Abstract

This work presents a system for the categorization of noisy texts. Here, a noisy text is any text obtained through an error-prone extraction process from media other than digital text. We show that, even with an average Word Error Rate of around 50%, the loss in categorization performance relative to the clean version of the same documents is negligible.
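For readers unfamiliar with the metric, Word Error Rate is conventionally defined as the word-level edit distance (substitutions, deletions, and insertions) between an extracted text and its clean reference, divided by the number of words in the reference. The sketch below illustrates that conventional definition only; this record does not say how the paper computed the figure, and the function name `word_error_rate` and the sample strings are hypothetical.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Conventional WER: word-level edit distance / number of reference words.

    Assumption: whitespace tokenization and exact string matching; the
    paper's own preprocessing is not described in this record.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: one substitution and one deletion over four words.
print(word_error_rate("a b c d", "a x c"))
```

Under this definition, a Word Error Rate of around 50% corresponds to roughly one word-level error for every two reference words.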

Item Type: Conference Proceedings
Status: Published
Refereed: Yes
Glasgow Author(s) Enlighten ID: Vinciarelli, Professor Alessandro
Authors: Vinciarelli, A.
College/School: College of Science and Engineering > School of Computing Science