http://scholars.ntou.edu.tw/handle/123456789/26403 | Title: | Printed document layout analysis and optical character recognition system based on deep learning | Authors: | Li, Dong-Lin; Lee, Shih-Kai; Liu, Yin-Ting |
Keywords: | OCR; Layout analysis; CNN; YOLO; Deep learning | Issue Date: | 2025 | Publisher: | NATURE PORTFOLIO | Volume: | 15 | Issue: | 1 | Source Publication: | SCIENTIFIC REPORTS | Abstract: | This paper proposes a layout analysis and text recognition system for printed documents based on deep learning. Initially, scanned documents or image files are processed using a layout analysis algorithm based on the YOLOv4 and YOLOv8 deep learning models to identify the positions of titles, text paragraphs, tables, and images within the document. Each of these categories undergoes specific character segmentation processing. Then, the content is recognized using a text recognition algorithm based on Convolutional Neural Networks (CNN). Finally, the recognized text is integrated and output in editable formats, such as JSON or Microsoft formats. Our proposed method enables convenient, fast, and highly accurate OCR processing on a local computer. |
URI: | http://scholars.ntou.edu.tw/handle/123456789/26403 | ISSN: | 2045-2322 | DOI: | 10.1038/s41598-025-07439-y |
| Appears in Collections: | Department of Electrical Engineering |
Items in the IR system are protected by copyright, with all rights reserved, unless otherwise indicated.
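
The abstract's final stage, where recognized content is integrated and output as JSON, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the `Region` record, its field names, the `to_json` helper, and the top-to-bottom, left-to-right ordering rule are all hypothetical assumptions made for illustration. Only the four region categories (title, text paragraph, table, image) come from the abstract.

```python
import json
from dataclasses import dataclass, asdict
from typing import List


# Hypothetical record for one detected layout region. The field names are
# illustrative assumptions and are not taken from the paper.
@dataclass
class Region:
    category: str       # one of: "title", "text", "table", "image"
    x: int              # left edge of the bounding box, in pixels
    y: int              # top edge of the bounding box, in pixels
    width: int
    height: int
    content: str = ""   # recognized text; left empty for image regions


def to_json(regions: List[Region]) -> str:
    """Order regions top-to-bottom, then left-to-right, and serialize them.

    The ordering rule is an assumed, simplistic reading order; multi-column
    layouts would need something more careful.
    """
    ordered = sorted(regions, key=lambda r: (r.y, r.x))
    return json.dumps([asdict(r) for r in ordered], ensure_ascii=False, indent=2)


# Made-up example regions, listed out of reading order on purpose.
regions = [
    Region("text", 40, 120, 500, 300, "Body paragraph."),
    Region("title", 40, 30, 500, 60, "Document Title"),
    Region("image", 40, 450, 200, 150),
]
output = to_json(regions)
print(output)
```

In this sketch, the layout detector and the CNN recognizer are assumed to have already produced the bounding boxes and text; only the integration and serialization step is shown.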