Information extraction system for invoices and receipts

Tan, Q. (M.), Cao, Q. , Seow, C. K. and Yau, C. (P.) (2023) Information extraction system for invoices and receipts. In: Huang, D.-S., Premaratne, P., Jin, B., Qu, B., Jo, K.-H. and Hussain, A. (eds.) Advanced Intelligent Computing Technology and Applications: 19th International Conference, ICIC 2023, Zhengzhou, China, August 10–13, 2023, Proceedings, Part IV. Series: Lecture notes in computer science (14089). Springer: Singapore, pp. 77-89. ISBN 9789819947515

[img] Text
303465.pdf - Accepted Version
Restricted to Repository staff only until 31 July 2024.

381kB

Abstract

Rapid growth in the digitization of documents, such as paper-based invoices or receipts, has alleviated the demand for methods to process information accurately and efficiently. However, it has become impractical for humans to extract the data manually, as it is labor-intensive and time-consuming. Digital documents contain various components such as tables, key-value pairs and figures. Existing optical character recognition (OCR) methods can recognize texts, but it is challenging to extract the key-value pairs in unformatted digital invoices or receipts. Hence, developing an information extraction system with intelligent algorithms would be beneficial, as it can increase the workflow efficiency for knowledge discovery and data recognition. In this paper, a pipeline of the information extraction system is proposed with intelligent computing and deep learning approaches for classifying key-value pairs first, followed by linking the key-value pairs. Two key-value pairing rules are developed in the proposed pipeline. Various experiments with intelligent algorithms are conducted to evaluate the performance of the pipeline of information extraction system.

Item Type:Book Sections
Additional Information:eISBN 9789819947522.
Status:Published
Glasgow Author(s) Enlighten ID:Cao, Dr Qi and Yau, Dr Peter C Y and Seow, Dr Chee Kiat
Authors: Tan, Q. (M.), Cao, Q., Seow, C. K., and Yau, C. (P.)
College/School:College of Science and Engineering > School of Computing Science
Publisher:Springer
ISBN:9789819947515
Copyright Holders:Copyright: © The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
First Published:First published in Advanced Intelligent Computing Technology and Applications: 19th International Conference, ICIC 2023, Zhengzhou, China, August 10–13, 2023, Proceedings, Part IV
Publisher Policy:Reproduced in accordance with the publisher copyright policy
Related URLs:

University Staff: Request a correction | Enlighten Editors: Update this record