Matches in SemOpenAlex for { <https://semopenalex.org/work/W2911367192> ?p ?o ?g. }
Showing items 1 to 85 of 85 with 100 items per page.
- W2911367192 abstract "The availability of parallel corpora is limited, especially for under-resourced languages and narrow domains. On the other hand, the number of comparable documents in these areas that are freely available on the Web is continuously increasing. Algorithmic approaches to identify these documents from the Web are needed for the purpose of automatically building comparable corpora for these under-resourced languages and domains. How do we identify these comparable documents? What approaches should be used in collecting these comparable documents from different Web sources? In this chapter, we firstly present a review of previous techniques that have been developed for collecting comparable documents from the Web. Then we describe in detail three new techniques to gather comparable documents from three different types of Web sources: Wikipedia, news articles, and narrow domains." @default.
- W2911367192 created "2019-02-21" @default.
- W2911367192 creator A5003699544 @default.
- W2911367192 creator A5011889540 @default.
- W2911367192 creator A5036691231 @default.
- W2911367192 creator A5038724800 @default.
- W2911367192 creator A5040587162 @default.
- W2911367192 creator A5041232293 @default.
- W2911367192 creator A5045614251 @default.
- W2911367192 creator A5058386804 @default.
- W2911367192 creator A5062940611 @default.
- W2911367192 creator A5065296262 @default.
- W2911367192 creator A5084668427 @default.
- W2911367192 creator A5090205535 @default.
- W2911367192 date "2019-01-01" @default.
- W2911367192 modified "2023-10-17" @default.
- W2911367192 title "Collecting Comparable Corpora" @default.
- W2911367192 cites W1481125541 @default.
- W2911367192 cites W1636405317 @default.
- W2911367192 cites W1982474572 @default.
- W2911367192 cites W2001832505 @default.
- W2911367192 cites W2017726337 @default.
- W2911367192 cites W2029341294 @default.
- W2911367192 cites W2037796960 @default.
- W2911367192 cites W2044743392 @default.
- W2911367192 cites W2066636486 @default.
- W2911367192 cites W2086039194 @default.
- W2911367192 cites W2090146924 @default.
- W2911367192 cites W2102749417 @default.
- W2911367192 cites W2107695330 @default.
- W2911367192 cites W2109803107 @default.
- W2911367192 cites W2116713744 @default.
- W2911367192 cites W2120101509 @default.
- W2911367192 cites W2124718840 @default.
- W2911367192 cites W2132019450 @default.
- W2911367192 cites W2141825421 @default.
- W2911367192 cites W2145080939 @default.
- W2911367192 cites W2145685230 @default.
- W2911367192 cites W2153488166 @default.
- W2911367192 cites W2171836785 @default.
- W2911367192 cites W2405617275 @default.
- W2911367192 cites W3211848854 @default.
- W2911367192 cites W97009826 @default.
- W2911367192 doi "https://doi.org/10.1007/978-3-319-99004-0_3" @default.
- W2911367192 hasPublicationYear "2019" @default.
- W2911367192 type Work @default.
- W2911367192 sameAs 2911367192 @default.
- W2911367192 citedByCount "1" @default.
- W2911367192 countsByYear W29113671922020 @default.
- W2911367192 crossrefType "book-chapter" @default.
- W2911367192 hasAuthorship W2911367192A5003699544 @default.
- W2911367192 hasAuthorship W2911367192A5011889540 @default.
- W2911367192 hasAuthorship W2911367192A5036691231 @default.
- W2911367192 hasAuthorship W2911367192A5038724800 @default.
- W2911367192 hasAuthorship W2911367192A5040587162 @default.
- W2911367192 hasAuthorship W2911367192A5041232293 @default.
- W2911367192 hasAuthorship W2911367192A5045614251 @default.
- W2911367192 hasAuthorship W2911367192A5058386804 @default.
- W2911367192 hasAuthorship W2911367192A5062940611 @default.
- W2911367192 hasAuthorship W2911367192A5065296262 @default.
- W2911367192 hasAuthorship W2911367192A5084668427 @default.
- W2911367192 hasAuthorship W2911367192A5090205535 @default.
- W2911367192 hasConcept C136764020 @default.
- W2911367192 hasConcept C23123220 @default.
- W2911367192 hasConcept C41008148 @default.
- W2911367192 hasConceptScore W2911367192C136764020 @default.
- W2911367192 hasConceptScore W2911367192C23123220 @default.
- W2911367192 hasConceptScore W2911367192C41008148 @default.
- W2911367192 hasLocation W29113671921 @default.
- W2911367192 hasOpenAccess W2911367192 @default.
- W2911367192 hasPrimaryLocation W29113671921 @default.
- W2911367192 hasRelatedWork W1557645482 @default.
- W2911367192 hasRelatedWork W1994754711 @default.
- W2911367192 hasRelatedWork W2101955803 @default.
- W2911367192 hasRelatedWork W2144190808 @default.
- W2911367192 hasRelatedWork W2323214056 @default.
- W2911367192 hasRelatedWork W2370424357 @default.
- W2911367192 hasRelatedWork W2376314740 @default.
- W2911367192 hasRelatedWork W2384888906 @default.
- W2911367192 hasRelatedWork W2804576480 @default.
- W2911367192 hasRelatedWork W2796728524 @default.
- W2911367192 isParatext "false" @default.
- W2911367192 isRetracted "false" @default.
- W2911367192 magId "2911367192" @default.
- W2911367192 workType "book-chapter" @default.