Please use this identifier to cite or link to this item: http://ds.libol.fpt.edu.vn/handle/123456789/3022

Title: Key Information Extraction from Vietnamese Invoices by Combining Layout and Context
Authors: Lê, Đình Huynh
Ngô, Tuấn Anh
Trần, Mạnh Cường
Keywords: Computer Science
Key information extraction
Natural Language Processing
Layout analysis
Issue Date: 2021
Publisher: FPTU Hà Nội
Abstract: This thesis introduces a deep learning approach: an effective and robust framework for handling complex document layout, visual features, and textual semantics in Key Information Extraction (KIE). The algorithm combines graph learning with graph convolution, yielding a richer semantic representation that includes both textual and visual features along with a clear global layout. The model's input uses only the coordinates of token bounding boxes, avoiding the use of raw images. This yields a layout-aware language model that can be fine-tuned on downstream tasks. The model is evaluated on a key information extraction task using the publicly available SROIE dataset. We show that it achieves superior performance on datasets of visually rich documents while outperforming the RoBERTa baseline.
Description: Thesis: 54 pages
URI: /handle/123456789/3022
Appears in Collections: Khoa học máy tính - Trí tuệ nhân tạo (Computer Science - Artificial Intelligence)

Files in This Item:

File | Description | Size | Format
Thesis_Key Information Extraction from Vietnamese Invoices by Combining Layout and Context.pdf | Free | 2.38 MB | Adobe PDF
Slide - Key Information Extraction from Vietnamese Invoices by Combining Layout and Context.pdf | Free | 2.13 MB | Adobe PDF

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 

  Collections Copyright © FPT University
