Matches in SemOpenAlex for { <https://semopenalex.org/work/W2186804596> ?p ?o ?g. }
- W2186804596 abstract "In this work we introduce a system based on natural language processing techniques whose aim is to enhance social news media in Bulgarian. It solves the task of multi-class, multi-label classification of documents. We apply the algorithms to a collection of media articles from Svejo.net, a popular Bulgarian web resource comprising user-generated content. Our algorithms are one-versus-all classification methods widely used in the computational linguistics community. We describe the algorithms and the features employed, and we evaluate the impact of the features on the performance of the models. We thereby show that knowledge about the user and user behavior can greatly improve performance. Also, despite the fact that our document collection is generated entirely by social media users, the quality of the classification results is comparable to that of previously reported studies. We also address the task of automatic keyword and keyphrase extraction from unstructured text, and adapt it to the needs of Svejo.net for the induction of 'themes'. Themes are defined as text snippets that summarize the essence of an article. We evaluate the performance of several generic methods for keyword and keyphrase extraction on a corpus of articles in Bulgarian. The methods that we discuss rely on widely accepted information retrieval and machine learning techniques and are language-independent. We also consider the effect of a stemmer component on the keyphrase extraction accuracy. The satisfactory performance of our models, in spite of the limited linguistic knowledge incorporated in them, recommends them as a baseline for keyword and keyphrase extraction for the Bulgarian language." @default.
- W2186804596 created "2016-06-24" @default.
- W2186804596 creator A5009647947 @default.
- W2186804596 creator A5091336182 @default.
- W2186804596 date "2012-01-01" @default.
- W2186804596 modified "2023-09-27" @default.
- W2186804596 title "Enhancing Social News Media in Bulgarian with Natural Language Processing" @default.
- W2186804596 cites W1486865875 @default.
- W2186804596 cites W1507852567 @default.
- W2186804596 cites W1510117420 @default.
- W2186804596 cites W1510372738 @default.
- W2186804596 cites W1512847390 @default.
- W2186804596 cites W1550206324 @default.
- W2186804596 cites W1585743408 @default.
- W2186804596 cites W1594797367 @default.
- W2186804596 cites W168332103 @default.
- W2186804596 cites W1907578970 @default.
- W2186804596 cites W1924689489 @default.
- W2186804596 cites W1969572066 @default.
- W2186804596 cites W202303397 @default.
- W2186804596 cites W2038670074 @default.
- W2186804596 cites W2049119796 @default.
- W2186804596 cites W2053463056 @default.
- W2186804596 cites W2059586463 @default.
- W2186804596 cites W2114535528 @default.
- W2186804596 cites W2128751639 @default.
- W2186804596 cites W2139153206 @default.
- W2186804596 cites W2139193890 @default.
- W2186804596 cites W2142129461 @default.
- W2186804596 cites W2145766604 @default.
- W2186804596 cites W2149684865 @default.
- W2186804596 cites W2158018156 @default.
- W2186804596 cites W2158873310 @default.
- W2186804596 cites W2160218441 @default.
- W2186804596 cites W2170654002 @default.
- W2186804596 hasPublicationYear "2012" @default.
- W2186804596 type Work @default.
- W2186804596 sameAs 2186804596 @default.
- W2186804596 citedByCount "0" @default.
- W2186804596 crossrefType "journal-article" @default.
- W2186804596 hasAuthorship W2186804596A5009647947 @default.
- W2186804596 hasAuthorship W2186804596A5091336182 @default.
- W2186804596 hasConcept C111472728 @default.
- W2186804596 hasConcept C136764020 @default.
- W2186804596 hasConcept C138885662 @default.
- W2186804596 hasConcept C154945302 @default.
- W2186804596 hasConcept C162324750 @default.
- W2186804596 hasConcept C187736073 @default.
- W2186804596 hasConcept C195807954 @default.
- W2186804596 hasConcept C204321447 @default.
- W2186804596 hasConcept C23123220 @default.
- W2186804596 hasConcept C2777212361 @default.
- W2186804596 hasConcept C2779530757 @default.
- W2186804596 hasConcept C2780288562 @default.
- W2186804596 hasConcept C2780343019 @default.
- W2186804596 hasConcept C2780451532 @default.
- W2186804596 hasConcept C41008148 @default.
- W2186804596 hasConcept C41895202 @default.
- W2186804596 hasConcept C518677369 @default.
- W2186804596 hasConceptScore W2186804596C111472728 @default.
- W2186804596 hasConceptScore W2186804596C136764020 @default.
- W2186804596 hasConceptScore W2186804596C138885662 @default.
- W2186804596 hasConceptScore W2186804596C154945302 @default.
- W2186804596 hasConceptScore W2186804596C162324750 @default.
- W2186804596 hasConceptScore W2186804596C187736073 @default.
- W2186804596 hasConceptScore W2186804596C195807954 @default.
- W2186804596 hasConceptScore W2186804596C204321447 @default.
- W2186804596 hasConceptScore W2186804596C23123220 @default.
- W2186804596 hasConceptScore W2186804596C2777212361 @default.
- W2186804596 hasConceptScore W2186804596C2779530757 @default.
- W2186804596 hasConceptScore W2186804596C2780288562 @default.
- W2186804596 hasConceptScore W2186804596C2780343019 @default.
- W2186804596 hasConceptScore W2186804596C2780451532 @default.
- W2186804596 hasConceptScore W2186804596C41008148 @default.
- W2186804596 hasConceptScore W2186804596C41895202 @default.
- W2186804596 hasConceptScore W2186804596C518677369 @default.
- W2186804596 hasLocation W21868045961 @default.
- W2186804596 hasOpenAccess W2186804596 @default.
- W2186804596 hasPrimaryLocation W21868045961 @default.
- W2186804596 hasRelatedWork W1698046050 @default.
- W2186804596 hasRelatedWork W1711680036 @default.
- W2186804596 hasRelatedWork W2067344302 @default.
- W2186804596 hasRelatedWork W2071049913 @default.
- W2186804596 hasRelatedWork W2166369229 @default.
- W2186804596 hasRelatedWork W2188997631 @default.
- W2186804596 hasRelatedWork W2250589143 @default.
- W2186804596 hasRelatedWork W2251844123 @default.
- W2186804596 hasRelatedWork W2281393958 @default.
- W2186804596 hasRelatedWork W2475024326 @default.
- W2186804596 hasRelatedWork W2500427647 @default.
- W2186804596 hasRelatedWork W2613724768 @default.
- W2186804596 hasRelatedWork W2902916697 @default.
- W2186804596 hasRelatedWork W2947071392 @default.
- W2186804596 hasRelatedWork W2949041962 @default.
- W2186804596 hasRelatedWork W2951086924 @default.
- W2186804596 hasRelatedWork W2965052554 @default.
- W2186804596 hasRelatedWork W3020763856 @default.
- W2186804596 hasRelatedWork W3208619206 @default.
- W2186804596 hasRelatedWork W84086436 @default.
- W2186804596 isParatext "false" @default.
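
The abstract above describes multi-label document classification with one-versus-all methods but does not say which base learner or features were used. As a rough illustration only, the sketch below trains one binary perceptron per label over bag-of-words tokens. The perceptron, the whitespace tokenizer, the function names, and the toy English documents and labels are all assumptions made for this example and are not taken from the work.

```python
from collections import defaultdict

BIAS = "__bias__"  # hypothetical key used to store each model's bias term


def tokenize(text):
    """Lowercase whitespace tokenizer (a stand-in for real preprocessing)."""
    return text.lower().split()


def train_one_vs_all(docs, label_sets, epochs=10):
    """Train one binary perceptron per label (one-versus-all).

    docs: list of strings; label_sets: list of label sets, one per document.
    Returns {label: {token: weight}}, with the bias stored under BIAS.
    """
    labels = sorted(set().union(*label_sets))
    models = {}
    for label in labels:
        w = defaultdict(float)
        for _ in range(epochs):
            for text, gold in zip(docs, label_sets):
                feats = tokenize(text)
                score = w[BIAS] + sum(w[t] for t in feats)
                # Positive example if the document carries this label.
                target = 1 if label in gold else -1
                if target * score <= 0:  # misclassified: perceptron update
                    for t in feats:
                        w[t] += target
                    w[BIAS] += target
        models[label] = dict(w)
    return models


def predict(models, text):
    """Return every label whose binary model scores the text above zero."""
    feats = tokenize(text)
    return {
        label
        for label, w in models.items()
        if w.get(BIAS, 0.0) + sum(w.get(t, 0.0) for t in feats) > 0
    }


# Toy data, invented for illustration (not from Svejo.net).
docs = [
    "football match goal",
    "election parliament vote",
    "football club election",
    "parliament vote budget",
]
label_sets = [{"sport"}, {"politics"}, {"sport", "politics"}, {"politics"}]

models = train_one_vs_all(docs, label_sets)
print(sorted(predict(models, "football goal")))
print(sorted(predict(models, "football club election")))
```

Because each label gets its own independent binary decision, a document can receive zero, one, or several labels, which is what makes the one-versus-all scheme suitable for the multi-label setting the abstract mentions.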