Matches in SemOpenAlex for { <https://semopenalex.org/work/W4212821468> ?p ?o ?g. }
Showing items 1 to 85 of
85
with 100 items per page.
- W4212821468 abstract "Perhaps, PDF is the most popular format to share non-editable documents. PDF documents are often untagged. In particular, this means that positions and the cell structure of tables are not designated explicitly. PDF table detection predicts bounding boxes of tables on document pages. Some of the predictions inevitably happen to be false. This negatively affects the accuracy of table structure recognition. We argue that the page layout analysis in pre- and post-processing can refine the table detection. We suggest pre-processing algorithms for the recognition of headings, running titles, paragraphs, and images in PDF pages. This allows selecting areas of interest inside pages where real tables can be placed. Then we use deep neural networks to predict tables only in these areas. We also propose post-processing algorithms to verify predictions and filter out false table candidates after table detection. Our empirical study shows that the proposed approach reduces errors in the table detection and improve the PDF table extraction overall." @default.
- W4212821468 created "2022-02-24" @default.
- W4212821468 creator A5017095385 @default.
- W4212821468 creator A5041210817 @default.
- W4212821468 date "2021-12-01" @default.
- W4212821468 modified "2023-09-24" @default.
- W4212821468 title "Page Layout Analysis for Refining Table Extraction from PDF Documents" @default.
- W4212821468 cites W1573796710 @default.
- W4212821468 cites W184723052 @default.
- W4212821468 cites W2022351003 @default.
- W4212821468 cites W2032016603 @default.
- W4212821468 cites W2074966879 @default.
- W4212821468 cites W2107092590 @default.
- W4212821468 cites W2168459394 @default.
- W4212821468 cites W2321821989 @default.
- W4212821468 cites W2518276024 @default.
- W4212821468 cites W2618743573 @default.
- W4212821468 cites W2786480153 @default.
- W4212821468 cites W2786515133 @default.
- W4212821468 cites W2787523828 @default.
- W4212821468 cites W2804437076 @default.
- W4212821468 cites W2889497954 @default.
- W4212821468 cites W2901890385 @default.
- W4212821468 cites W2910897241 @default.
- W4212821468 cites W2914231536 @default.
- W4212821468 cites W2999605892 @default.
- W4212821468 cites W3003206728 @default.
- W4212821468 cites W3003482937 @default.
- W4212821468 cites W3003737912 @default.
- W4212821468 cites W3004186774 @default.
- W4212821468 cites W3021344331 @default.
- W4212821468 cites W3190766843 @default.
- W4212821468 cites W3205155483 @default.
- W4212821468 cites W633457721 @default.
- W4212821468 doi "https://doi.org/10.1109/ispras53967.2021.00021" @default.
- W4212821468 hasPublicationYear "2021" @default.
- W4212821468 type Work @default.
- W4212821468 citedByCount "1" @default.
- W4212821468 countsByYear W42128214682023 @default.
- W4212821468 crossrefType "proceedings-article" @default.
- W4212821468 hasAuthorship W4212821468A5017095385 @default.
- W4212821468 hasAuthorship W4212821468A5041210817 @default.
- W4212821468 hasConcept C106131492 @default.
- W4212821468 hasConcept C111012933 @default.
- W4212821468 hasConcept C124101348 @default.
- W4212821468 hasConcept C136764020 @default.
- W4212821468 hasConcept C153180895 @default.
- W4212821468 hasConcept C154945302 @default.
- W4212821468 hasConcept C172967692 @default.
- W4212821468 hasConcept C23123220 @default.
- W4212821468 hasConcept C31972630 @default.
- W4212821468 hasConcept C41008148 @default.
- W4212821468 hasConcept C45235069 @default.
- W4212821468 hasConcept C63584917 @default.
- W4212821468 hasConcept C68476402 @default.
- W4212821468 hasConceptScore W4212821468C106131492 @default.
- W4212821468 hasConceptScore W4212821468C111012933 @default.
- W4212821468 hasConceptScore W4212821468C124101348 @default.
- W4212821468 hasConceptScore W4212821468C136764020 @default.
- W4212821468 hasConceptScore W4212821468C153180895 @default.
- W4212821468 hasConceptScore W4212821468C154945302 @default.
- W4212821468 hasConceptScore W4212821468C172967692 @default.
- W4212821468 hasConceptScore W4212821468C23123220 @default.
- W4212821468 hasConceptScore W4212821468C31972630 @default.
- W4212821468 hasConceptScore W4212821468C41008148 @default.
- W4212821468 hasConceptScore W4212821468C45235069 @default.
- W4212821468 hasConceptScore W4212821468C63584917 @default.
- W4212821468 hasConceptScore W4212821468C68476402 @default.
- W4212821468 hasFunder F4320324099 @default.
- W4212821468 hasLocation W42128214681 @default.
- W4212821468 hasOpenAccess W4212821468 @default.
- W4212821468 hasPrimaryLocation W42128214681 @default.
- W4212821468 hasRelatedWork W1671988510 @default.
- W4212821468 hasRelatedWork W1863997242 @default.
- W4212821468 hasRelatedWork W2026588654 @default.
- W4212821468 hasRelatedWork W2076560287 @default.
- W4212821468 hasRelatedWork W2112343299 @default.
- W4212821468 hasRelatedWork W2116763882 @default.
- W4212821468 hasRelatedWork W2128762945 @default.
- W4212821468 hasRelatedWork W2327961366 @default.
- W4212821468 hasRelatedWork W4299352401 @default.
- W4212821468 hasRelatedWork W4361193679 @default.
- W4212821468 isParatext "false" @default.
- W4212821468 isRetracted "false" @default.
- W4212821468 workType "article" @default.