Matches in SemOpenAlex for { <https://semopenalex.org/work/W4220971275> ?p ?o ?g. }
- W4220971275 endingPage "21" @default.
- W4220971275 startingPage "1" @default.
- W4220971275 abstract "With the rapid development of mobile Internet technology and artificial intelligence technology, the digital publishing industry is in urgent need of using intelligent technology to change the current way of content production and service. Most of the e-book resources owned by publishing enterprises are in PDF format, which is not suitable for reading on mobile devices, and it is not convenient to directly extract key information and construct knowledge graph. With this in mind, this article designs a PDF automatic indexing scheme that can identify all the element information in PDF and output structured data automatically and then extract all the key information in it to generate a keyword library with tag weights. The scheme mainly involves two key technical points: parsing PDF based on text features and grammar rules and extracting keywords based on tag weights. The former visualizes the text block in PDF into a rectangular area, divides the elements by clustering algorithm, and, finally, outputs structured data containing all the information. The latter combines the tags and their weights in the structured data and extracts the keywords in it by the inter-word relation algorithm. The structured data and keywords database produced by this scheme can be used to produce intelligent e-book and build knowledge graph, thus helping publishing enterprises to transform from a content service provider to an intelligent knowledge service provider. This transformation can deeply excavate the core value of the content held by the publishing industry and promote the digitization and intelligentization process of the whole industry." @default.
- W4220971275 created "2022-04-03" @default.
- W4220971275 creator A5027668993 @default.
- W4220971275 creator A5043978699 @default.
- W4220971275 creator A5050179383 @default.
- W4220971275 creator A5079663867 @default.
- W4220971275 date "2023-03-31" @default.
- W4220971275 modified "2023-10-16" @default.
- W4220971275 title "Research and Implementation of Automatic Indexing Method of PDF for Digital Publishing" @default.
- W4220971275 cites W1861201910 @default.
- W4220971275 cites W1996852554 @default.
- W4220971275 cites W2099363719 @default.
- W4220971275 cites W2131712117 @default.
- W4220971275 cites W2142887881 @default.
- W4220971275 cites W2204508432 @default.
- W4220971275 cites W2346271454 @default.
- W4220971275 cites W2474062739 @default.
- W4220971275 cites W2604759005 @default.
- W4220971275 cites W2623794988 @default.
- W4220971275 cites W2626799581 @default.
- W4220971275 cites W2770848963 @default.
- W4220971275 cites W2774928600 @default.
- W4220971275 cites W2887681009 @default.
- W4220971275 cites W2911855264 @default.
- W4220971275 cites W2949160428 @default.
- W4220971275 cites W2969431175 @default.
- W4220971275 cites W2972142851 @default.
- W4220971275 cites W2987720654 @default.
- W4220971275 cites W2992548114 @default.
- W4220971275 cites W2995796179 @default.
- W4220971275 cites W2995848879 @default.
- W4220971275 cites W3007523155 @default.
- W4220971275 cites W3021482958 @default.
- W4220971275 cites W3025935606 @default.
- W4220971275 cites W3032036711 @default.
- W4220971275 cites W3042799967 @default.
- W4220971275 cites W3046966796 @default.
- W4220971275 cites W3121510293 @default.
- W4220971275 cites W3168067188 @default.
- W4220971275 cites W3172269557 @default.
- W4220971275 cites W4246033633 @default.
- W4220971275 doi "https://doi.org/10.1145/3501400" @default.
- W4220971275 hasPublicationYear "2023" @default.
- W4220971275 type Work @default.
- W4220971275 citedByCount "0" @default.
- W4220971275 crossrefType "journal-article" @default.
- W4220971275 hasAuthorship W4220971275A5027668993 @default.
- W4220971275 hasAuthorship W4220971275A5043978699 @default.
- W4220971275 hasAuthorship W4220971275A5050179383 @default.
- W4220971275 hasAuthorship W4220971275A5079663867 @default.
- W4220971275 hasConcept C110875604 @default.
- W4220971275 hasConcept C136264566 @default.
- W4220971275 hasConcept C136764020 @default.
- W4220971275 hasConcept C143275388 @default.
- W4220971275 hasConcept C151719136 @default.
- W4220971275 hasConcept C162324750 @default.
- W4220971275 hasConcept C17744445 @default.
- W4220971275 hasConcept C18599908 @default.
- W4220971275 hasConcept C199360897 @default.
- W4220971275 hasConcept C199539241 @default.
- W4220971275 hasConcept C23123220 @default.
- W4220971275 hasConcept C26517878 @default.
- W4220971275 hasConcept C2779308522 @default.
- W4220971275 hasConcept C2780378061 @default.
- W4220971275 hasConcept C2780801425 @default.
- W4220971275 hasConcept C31972630 @default.
- W4220971275 hasConcept C38652104 @default.
- W4220971275 hasConcept C41008148 @default.
- W4220971275 hasConcept C518677369 @default.
- W4220971275 hasConcept C75165309 @default.
- W4220971275 hasConceptScore W4220971275C110875604 @default.
- W4220971275 hasConceptScore W4220971275C136264566 @default.
- W4220971275 hasConceptScore W4220971275C136764020 @default.
- W4220971275 hasConceptScore W4220971275C143275388 @default.
- W4220971275 hasConceptScore W4220971275C151719136 @default.
- W4220971275 hasConceptScore W4220971275C162324750 @default.
- W4220971275 hasConceptScore W4220971275C17744445 @default.
- W4220971275 hasConceptScore W4220971275C18599908 @default.
- W4220971275 hasConceptScore W4220971275C199360897 @default.
- W4220971275 hasConceptScore W4220971275C199539241 @default.
- W4220971275 hasConceptScore W4220971275C23123220 @default.
- W4220971275 hasConceptScore W4220971275C26517878 @default.
- W4220971275 hasConceptScore W4220971275C2779308522 @default.
- W4220971275 hasConceptScore W4220971275C2780378061 @default.
- W4220971275 hasConceptScore W4220971275C2780801425 @default.
- W4220971275 hasConceptScore W4220971275C31972630 @default.
- W4220971275 hasConceptScore W4220971275C38652104 @default.
- W4220971275 hasConceptScore W4220971275C41008148 @default.
- W4220971275 hasConceptScore W4220971275C518677369 @default.
- W4220971275 hasConceptScore W4220971275C75165309 @default.
- W4220971275 hasFunder F4320336363 @default.
- W4220971275 hasIssue "3" @default.
- W4220971275 hasLocation W42209712751 @default.
- W4220971275 hasOpenAccess W4220971275 @default.
- W4220971275 hasPrimaryLocation W42209712751 @default.
- W4220971275 hasRelatedWork W1606406758 @default.
- W4220971275 hasRelatedWork W1863997242 @default.
- W4220971275 hasRelatedWork W1873761914 @default.