Matches in SemOpenAlex for { <https://semopenalex.org/work/W98102442> ?p ?o ?g. }
Showing items 1 to 50 of
50
with 100 items per page.
- W98102442 abstract "This paper describes an algorithm for document representation in a reduced vectorial space by a process of feature extraction. The algorithm is applied and evaluated in the context of the supervised classification of news articles from the collection of Le Monde newspaper issued in the years 2003 and 2004. We are generating a document representation (or profile), in a space of 800 dimensions, represented by semantic tags from a machine-readable dictionary. We are dealing with two issues : the synonymy handled by thematic conflation and polysemy for which we have developed a statistical method for word-sense disambiguation. We propose four variants for the profile generation (of a document) depending on whether a recursive system is used or not, and whether a corrective factor for polysemous words is taken into account or not. To determine the best classifier provided by our algorithm we have evaluated 32 variants, depending on the algorithm type (as previously) and on three other parameters that influence the document representation : grammatical category selection, 15% reduction of the profile, and a stop-list of semantic tags. The evaluation is done on a set of documents from six categories by calculating the precision, the recall and the F-measure to determine the best algorithm related to the threshold detection. Some parameters (like profile reduction) have low influence on the classifier performance and others (corrective factor for the ambiguous words, stop-list) improve it noticeably. Resume" @default.
- W98102442 created "2016-06-24" @default.
- W98102442 creator A5019351439 @default.
- W98102442 creator A5035388242 @default.
- W98102442 creator A5035569693 @default.
- W98102442 date "2006-01-01" @default.
- W98102442 modified "2023-09-24" @default.
- W98102442 title "Un algorithme de génération de profil de document et son évaluation dans le contexte de la classification thématique" @default.
- W98102442 cites W1493526108 @default.
- W98102442 cites W1500547895 @default.
- W98102442 cites W1570542661 @default.
- W98102442 cites W1572124604 @default.
- W98102442 cites W1731244441 @default.
- W98102442 cites W2043772506 @default.
- W98102442 cites W2058089741 @default.
- W98102442 cites W2103035252 @default.
- W98102442 cites W2114535528 @default.
- W98102442 cites W2118020653 @default.
- W98102442 cites W2159882563 @default.
- W98102442 cites W2161103800 @default.
- W98102442 cites W22702538 @default.
- W98102442 cites W2435251607 @default.
- W98102442 cites W3157411736 @default.
- W98102442 hasPublicationYear "2006" @default.
- W98102442 type Work @default.
- W98102442 sameAs 98102442 @default.
- W98102442 citedByCount "0" @default.
- W98102442 crossrefType "journal-article" @default.
- W98102442 hasAuthorship W98102442A5019351439 @default.
- W98102442 hasAuthorship W98102442A5035388242 @default.
- W98102442 hasAuthorship W98102442A5035569693 @default.
- W98102442 hasConcept C153180895 @default.
- W98102442 hasConcept C154945302 @default.
- W98102442 hasConcept C204321447 @default.
- W98102442 hasConcept C2780276568 @default.
- W98102442 hasConcept C41008148 @default.
- W98102442 hasConcept C95623464 @default.
- W98102442 hasConceptScore W98102442C153180895 @default.
- W98102442 hasConceptScore W98102442C154945302 @default.
- W98102442 hasConceptScore W98102442C204321447 @default.
- W98102442 hasConceptScore W98102442C2780276568 @default.
- W98102442 hasConceptScore W98102442C41008148 @default.
- W98102442 hasConceptScore W98102442C95623464 @default.
- W98102442 hasLocation W981024421 @default.
- W98102442 hasOpenAccess W98102442 @default.
- W98102442 hasPrimaryLocation W981024421 @default.
- W98102442 isParatext "false" @default.
- W98102442 isRetracted "false" @default.
- W98102442 magId "98102442" @default.
- W98102442 workType "article" @default.