Matches in SemOpenAlex for { <https://semopenalex.org/work/W29040286> ?p ?o ?g. }
Showing items 1 to 98 of
98
with 100 items per page.
- W29040286 endingPage "8" @default.
- W29040286 startingPage "1" @default.
- W29040286 abstract "The rapid development of language tools using machine learning techniques for less computerized languages requires appropriately tagged corpus. A Bengali news corpus has been developed from the web archive of a widely read Bengali newspaper. A web crawler retrieves the web pages in Hyper Text Markup Language (HTML) format from the news archive. At present, the corpus contains approximately 34 million wordforms. The date, location, reporter and agency tags present in the web pages have been automatically named entity (NE) tagged. A portion of this partially NE tagged corpus has been manually annotated with the sixteen NE tags with the help of Sanchay Editor 1 , a text editor for Indian languages. This NE tagged corpus contains 150K wordforms. Additionally, 30K wordforms have been manually annotated with the twelve NE tags as part of the IJCNLP08 NER Shared Task for South and South East Asian Languages 2 . A table driven semi-automatic NE tag conversion routine has been developed in order to convert the sixteen-NE tagged corpus to the twelve-NE tagged corpus. The 150K NE tagged corpus has been used to develop Named Entity Recognition (NER) system in Bengali using pattern directed shallow parsing approach, Hidden Markov Model (HMM), Maximum Entropy (ME) Model, Condi" @default.
- W29040286 created "2016-06-24" @default.
- W29040286 creator A5006218020 @default.
- W29040286 creator A5085370631 @default.
- W29040286 date "2008-01-01" @default.
- W29040286 modified "2023-09-26" @default.
- W29040286 title "Development of Bengali Named Entity Tagged Corpus and its Use in NER Systems" @default.
- W29040286 cites W1481786847 @default.
- W29040286 cites W1520377376 @default.
- W29040286 cites W1545821449 @default.
- W29040286 cites W1934019294 @default.
- W29040286 cites W2016335896 @default.
- W29040286 cites W2045993505 @default.
- W29040286 cites W2067326963 @default.
- W29040286 cites W2067830457 @default.
- W29040286 cites W2095452022 @default.
- W29040286 cites W2147880316 @default.
- W29040286 cites W2149430911 @default.
- W29040286 cites W2152941660 @default.
- W29040286 cites W2320120825 @default.
- W29040286 cites W2406626796 @default.
- W29040286 hasPublicationYear "2008" @default.
- W29040286 type Work @default.
- W29040286 sameAs 29040286 @default.
- W29040286 citedByCount "8" @default.
- W29040286 countsByYear W290402862012 @default.
- W29040286 countsByYear W290402862013 @default.
- W29040286 countsByYear W290402862018 @default.
- W29040286 countsByYear W290402862019 @default.
- W29040286 countsByYear W290402862021 @default.
- W29040286 crossrefType "proceedings-article" @default.
- W29040286 hasAuthorship W29040286A5006218020 @default.
- W29040286 hasAuthorship W29040286A5085370631 @default.
- W29040286 hasConcept C108757681 @default.
- W29040286 hasConcept C136764020 @default.
- W29040286 hasConcept C137546455 @default.
- W29040286 hasConcept C154945302 @default.
- W29040286 hasConcept C162324750 @default.
- W29040286 hasConcept C186644900 @default.
- W29040286 hasConcept C187736073 @default.
- W29040286 hasConcept C19235068 @default.
- W29040286 hasConcept C204321447 @default.
- W29040286 hasConcept C21959979 @default.
- W29040286 hasConcept C23123220 @default.
- W29040286 hasConcept C23224414 @default.
- W29040286 hasConcept C2777889803 @default.
- W29040286 hasConcept C2779135771 @default.
- W29040286 hasConcept C2780451532 @default.
- W29040286 hasConcept C41008148 @default.
- W29040286 hasConcept C45874996 @default.
- W29040286 hasConcept C8797682 @default.
- W29040286 hasConceptScore W29040286C108757681 @default.
- W29040286 hasConceptScore W29040286C136764020 @default.
- W29040286 hasConceptScore W29040286C137546455 @default.
- W29040286 hasConceptScore W29040286C154945302 @default.
- W29040286 hasConceptScore W29040286C162324750 @default.
- W29040286 hasConceptScore W29040286C186644900 @default.
- W29040286 hasConceptScore W29040286C187736073 @default.
- W29040286 hasConceptScore W29040286C19235068 @default.
- W29040286 hasConceptScore W29040286C204321447 @default.
- W29040286 hasConceptScore W29040286C21959979 @default.
- W29040286 hasConceptScore W29040286C23123220 @default.
- W29040286 hasConceptScore W29040286C23224414 @default.
- W29040286 hasConceptScore W29040286C2777889803 @default.
- W29040286 hasConceptScore W29040286C2779135771 @default.
- W29040286 hasConceptScore W29040286C2780451532 @default.
- W29040286 hasConceptScore W29040286C41008148 @default.
- W29040286 hasConceptScore W29040286C45874996 @default.
- W29040286 hasConceptScore W29040286C8797682 @default.
- W29040286 hasLocation W290402861 @default.
- W29040286 hasOpenAccess W29040286 @default.
- W29040286 hasPrimaryLocation W290402861 @default.
- W29040286 hasRelatedWork W147166030 @default.
- W29040286 hasRelatedWork W1587977534 @default.
- W29040286 hasRelatedWork W196159769 @default.
- W29040286 hasRelatedWork W1977671488 @default.
- W29040286 hasRelatedWork W2004890010 @default.
- W29040286 hasRelatedWork W2010949961 @default.
- W29040286 hasRelatedWork W2021572127 @default.
- W29040286 hasRelatedWork W2034964736 @default.
- W29040286 hasRelatedWork W2073420389 @default.
- W29040286 hasRelatedWork W2097978833 @default.
- W29040286 hasRelatedWork W2099253769 @default.
- W29040286 hasRelatedWork W2144934730 @default.
- W29040286 hasRelatedWork W2147880316 @default.
- W29040286 hasRelatedWork W2408477201 @default.
- W29040286 hasRelatedWork W2898976768 @default.
- W29040286 hasRelatedWork W3037833079 @default.
- W29040286 hasRelatedWork W3101260801 @default.
- W29040286 hasRelatedWork W3118948038 @default.
- W29040286 hasRelatedWork W94125874 @default.
- W29040286 hasRelatedWork W1758980427 @default.
- W29040286 isParatext "false" @default.
- W29040286 isRetracted "false" @default.
- W29040286 magId "29040286" @default.
- W29040286 workType "article" @default.