Tan, Q. (M.), Cao, Q., Seow, C. K. and Yau, C. (P.) (2023) Information extraction system for invoices and receipts. In: Huang, D.-S., Premaratne, P., Jin, B., Qu, B., Jo, K.-H. and Hussain, A. (eds.) Advanced Intelligent Computing Technology and Applications: 19th International Conference, ICIC 2023, Zhengzhou, China, August 10–13, 2023, Proceedings, Part IV. Series: Lecture Notes in Computer Science (14089). Springer: Singapore, pp. 77-89. ISBN 9789819947515
Text: 303465.pdf - Accepted Version. Restricted to Repository staff only until 31 July 2024. 381kB
Abstract
Rapid growth in the digitization of documents, such as paper-based invoices and receipts, has increased the demand for methods that process information accurately and efficiently. Manual data extraction has become impractical because it is labor-intensive and time-consuming. Digital documents contain various components, such as tables, key-value pairs, and figures. Existing optical character recognition (OCR) methods can recognize text, but extracting key-value pairs from unformatted digital invoices or receipts remains challenging. Hence, an information extraction system with intelligent algorithms would be beneficial, as it can improve workflow efficiency for knowledge discovery and data recognition. This paper proposes an information extraction pipeline that uses intelligent computing and deep learning approaches to first classify key-value pairs and then link them. Two key-value pairing rules are developed in the proposed pipeline. Various experiments with intelligent algorithms are conducted to evaluate the performance of the pipeline.
Item Type: | Book Sections |
---|---|
Additional Information: | eISBN 9789819947522. |
Status: | Published |
Glasgow Author(s) Enlighten ID: | Cao, Dr Qi and Yau, Dr Peter C Y and Seow, Dr Chee Kiat |
Authors: | Tan, Q. (M.), Cao, Q., Seow, C. K., and Yau, C. (P.) |
College/School: | College of Science and Engineering > School of Computing Science |
Publisher: | Springer |
ISBN: | 9789819947515 |
Copyright Holders: | Copyright: © The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023 |
First Published: | First published in Advanced Intelligent Computing Technology and Applications: 19th International Conference, ICIC 2023, Zhengzhou, China, August 10–13, 2023, Proceedings, Part IV |
Publisher Policy: | Reproduced in accordance with the publisher copyright policy |