Matches in SemOpenAlex for { <https://semopenalex.org/work/W2319314598> ?p ?o ?g. }
Showing items 1 to 52 of
52
with 100 items per page.
- W2319314598 endingPage "238" @default.
- W2319314598 startingPage "232" @default.
- W2319314598 abstract "在Web挖掘极度盛行的今天,收集大量网络数据已经不是问题,而如何在海量数据中抽取去噪后的有用数据成为要解决的关键问题。本文研究将网站用户的搜索关键词分析聚类,作为用户的兴趣、爱好标签,以供运营分析人员参考。文中根据世界知识或分类体系计算词语语义距离后转化为词语相似度的方法,将词语间距离依据词频、词权重等因子加工计算出关键词集合间相似度矩阵后,用欧式距离表示其关键字集的相似度;之后聚类算法利用现有R软件中开源算法包——基于隐马尔科夫模型的depmix算法包进行的用户聚类算法。最终用某搜索引擎用户的真实数据,经过数据去噪后所得实验数据进行聚类,并于前台展示聚类及用户周边相关结果。 Nowadays, as web mining is extremely prevalent, it is easy to collect huge amounts of data but to figure out which materials is useful to analyze after de-noising is more important. This article discusses how to use the result of user’s searching keywords clustering as the label of the client for operational analysts to refer to. The similarity between isolated words is calculated by turning the word semantic distance based on world knowledge or classification system. Then the similarity between clients (keyword sets) is defined as the Euclidean distance of a similarity matrix constituted by the similarities between keyword sets which determined by word frequency and word weight. The “depmix” package which based on the Hidden Markov Model in “R” software is used as the clustering algorithm and the user clustering result is displayed at last using the real data of the users of a search engine." @default.
- W2319314598 created "2016-06-24" @default.
- W2319314598 creator A5016875437 @default.
- W2319314598 date "2013-01-01" @default.
- W2319314598 modified "2023-09-26" @default.
- W2319314598 title "The Study and Implementation of Web User Mining System Based on the Similarity of Words" @default.
- W2319314598 cites W2435251607 @default.
- W2319314598 doi "https://doi.org/10.12677/csa.2013.34040" @default.
- W2319314598 hasPublicationYear "2013" @default.
- W2319314598 type Work @default.
- W2319314598 sameAs 2319314598 @default.
- W2319314598 citedByCount "0" @default.
- W2319314598 crossrefType "journal-article" @default.
- W2319314598 hasAuthorship W2319314598A5016875437 @default.
- W2319314598 hasBestOaLocation W23193145981 @default.
- W2319314598 hasConcept C103278499 @default.
- W2319314598 hasConcept C115961682 @default.
- W2319314598 hasConcept C124101348 @default.
- W2319314598 hasConcept C136764020 @default.
- W2319314598 hasConcept C154945302 @default.
- W2319314598 hasConcept C23123220 @default.
- W2319314598 hasConcept C41008148 @default.
- W2319314598 hasConcept C77088390 @default.
- W2319314598 hasConceptScore W2319314598C103278499 @default.
- W2319314598 hasConceptScore W2319314598C115961682 @default.
- W2319314598 hasConceptScore W2319314598C124101348 @default.
- W2319314598 hasConceptScore W2319314598C136764020 @default.
- W2319314598 hasConceptScore W2319314598C154945302 @default.
- W2319314598 hasConceptScore W2319314598C23123220 @default.
- W2319314598 hasConceptScore W2319314598C41008148 @default.
- W2319314598 hasConceptScore W2319314598C77088390 @default.
- W2319314598 hasIssue "04" @default.
- W2319314598 hasLocation W23193145981 @default.
- W2319314598 hasOpenAccess W2319314598 @default.
- W2319314598 hasPrimaryLocation W23193145981 @default.
- W2319314598 hasRelatedWork W2115485936 @default.
- W2319314598 hasRelatedWork W2119214692 @default.
- W2319314598 hasRelatedWork W2144190808 @default.
- W2319314598 hasRelatedWork W2153015554 @default.
- W2319314598 hasRelatedWork W2349125667 @default.
- W2319314598 hasRelatedWork W2357241418 @default.
- W2319314598 hasRelatedWork W2366644548 @default.
- W2319314598 hasRelatedWork W2376314740 @default.
- W2319314598 hasRelatedWork W2384888906 @default.
- W2319314598 hasRelatedWork W2748952813 @default.
- W2319314598 hasVolume "03" @default.
- W2319314598 isParatext "false" @default.
- W2319314598 isRetracted "false" @default.
- W2319314598 magId "2319314598" @default.
- W2319314598 workType "article" @default.