Matches in SemOpenAlex for { <https://semopenalex.org/work/W2376948364> ?p ?o ?g. }
Showing items 1 to 91 of
91
with 100 items per page.
- W2376948364 abstract "Web text information mining is one of the important applications of applying data mining technologies into information analysis and processing,how to transform web documents into data mining to the required format,i.e.web document preprocessing becomes a significant research task.In this paper the method is : from Internet to download a large number of webpage files,webpage files are converted into a text files,and then through the algorithm to word frequency statistics the data of the text files,delete non-using words,remove high frequency words,process etyma of substantive words,extract stems,eliminate redundant words and establish word list,thus extraction word list,alphabetical index to generate word frequency index,and the dictionary file comparison,get the word ID,the last generation of Reuters-21578 Database data format.This web document data converted into standard data sets for classification and clustering to prepare in data mining." @default.
- W2376948364 created "2016-06-24" @default.
- W2376948364 creator A5091381667 @default.
- W2376948364 date "2011-01-01" @default.
- W2376948364 modified "2023-09-25" @default.
- W2376948364 title "DESIGN AND IMPLEMENTATION OF WEB DOCUMENTS CONVERSION ALGORITHM IN DATA MINING" @default.
- W2376948364 hasPublicationYear "2011" @default.
- W2376948364 type Work @default.
- W2376948364 sameAs 2376948364 @default.
- W2376948364 citedByCount "0" @default.
- W2376948364 crossrefType "journal-article" @default.
- W2376948364 hasAuthorship W2376948364A5091381667 @default.
- W2376948364 hasConcept C10551718 @default.
- W2376948364 hasConcept C110875604 @default.
- W2376948364 hasConcept C124101348 @default.
- W2376948364 hasConcept C136764020 @default.
- W2376948364 hasConcept C154945302 @default.
- W2376948364 hasConcept C162324750 @default.
- W2376948364 hasConcept C175293574 @default.
- W2376948364 hasConcept C17744445 @default.
- W2376948364 hasConcept C187736073 @default.
- W2376948364 hasConcept C197046077 @default.
- W2376948364 hasConcept C199539241 @default.
- W2376948364 hasConcept C204321447 @default.
- W2376948364 hasConcept C21959979 @default.
- W2376948364 hasConcept C23123220 @default.
- W2376948364 hasConcept C2524010 @default.
- W2376948364 hasConcept C2777382242 @default.
- W2376948364 hasConcept C2777466982 @default.
- W2376948364 hasConcept C2777530160 @default.
- W2376948364 hasConcept C2779473830 @default.
- W2376948364 hasConcept C2780451532 @default.
- W2376948364 hasConcept C2983335612 @default.
- W2376948364 hasConcept C33923547 @default.
- W2376948364 hasConcept C34736171 @default.
- W2376948364 hasConcept C41008148 @default.
- W2376948364 hasConcept C73555534 @default.
- W2376948364 hasConcept C90805587 @default.
- W2376948364 hasConceptScore W2376948364C10551718 @default.
- W2376948364 hasConceptScore W2376948364C110875604 @default.
- W2376948364 hasConceptScore W2376948364C124101348 @default.
- W2376948364 hasConceptScore W2376948364C136764020 @default.
- W2376948364 hasConceptScore W2376948364C154945302 @default.
- W2376948364 hasConceptScore W2376948364C162324750 @default.
- W2376948364 hasConceptScore W2376948364C175293574 @default.
- W2376948364 hasConceptScore W2376948364C17744445 @default.
- W2376948364 hasConceptScore W2376948364C187736073 @default.
- W2376948364 hasConceptScore W2376948364C197046077 @default.
- W2376948364 hasConceptScore W2376948364C199539241 @default.
- W2376948364 hasConceptScore W2376948364C204321447 @default.
- W2376948364 hasConceptScore W2376948364C21959979 @default.
- W2376948364 hasConceptScore W2376948364C23123220 @default.
- W2376948364 hasConceptScore W2376948364C2524010 @default.
- W2376948364 hasConceptScore W2376948364C2777382242 @default.
- W2376948364 hasConceptScore W2376948364C2777466982 @default.
- W2376948364 hasConceptScore W2376948364C2777530160 @default.
- W2376948364 hasConceptScore W2376948364C2779473830 @default.
- W2376948364 hasConceptScore W2376948364C2780451532 @default.
- W2376948364 hasConceptScore W2376948364C2983335612 @default.
- W2376948364 hasConceptScore W2376948364C33923547 @default.
- W2376948364 hasConceptScore W2376948364C34736171 @default.
- W2376948364 hasConceptScore W2376948364C41008148 @default.
- W2376948364 hasConceptScore W2376948364C73555534 @default.
- W2376948364 hasConceptScore W2376948364C90805587 @default.
- W2376948364 hasLocation W23769483641 @default.
- W2376948364 hasOpenAccess W2376948364 @default.
- W2376948364 hasPrimaryLocation W23769483641 @default.
- W2376948364 hasRelatedWork W1985414497 @default.
- W2376948364 hasRelatedWork W1990780094 @default.
- W2376948364 hasRelatedWork W2000275737 @default.
- W2376948364 hasRelatedWork W2006480487 @default.
- W2376948364 hasRelatedWork W2113433382 @default.
- W2376948364 hasRelatedWork W2117463183 @default.
- W2376948364 hasRelatedWork W2120655756 @default.
- W2376948364 hasRelatedWork W2187324811 @default.
- W2376948364 hasRelatedWork W2360747096 @default.
- W2376948364 hasRelatedWork W2382176690 @default.
- W2376948364 hasRelatedWork W2735827657 @default.
- W2376948364 hasRelatedWork W2978154720 @default.
- W2376948364 hasRelatedWork W3165695323 @default.
- W2376948364 hasRelatedWork W64370772 @default.
- W2376948364 hasRelatedWork W2475935882 @default.
- W2376948364 hasRelatedWork W2814315455 @default.
- W2376948364 hasRelatedWork W2853479280 @default.
- W2376948364 hasRelatedWork W2933204640 @default.
- W2376948364 hasRelatedWork W3140111399 @default.
- W2376948364 hasRelatedWork W943054961 @default.
- W2376948364 isParatext "false" @default.
- W2376948364 isRetracted "false" @default.
- W2376948364 magId "2376948364" @default.
- W2376948364 workType "article" @default.