Matches in SemOpenAlex for { <https://semopenalex.org/work/W2100443987> ?p ?o ?g. }
Showing items 1 to 87 of
87
with 100 items per page.
- W2100443987 abstract "The fundamental problem of similarity studies, in the frame of data-mining, is to examine and detect similar items in articles, papers, books, with huge sizes. In this paper, we are interested in the probabilistic, and the statistical and the algorithmic aspects in studies of texts. We will be using the approach of $k$textit{-shinglings}, a $k$textit{-shingling} being defined as a sequence of $k$ consecutive characters that are extracted from a text ($kgeq 1$ ). The main stake in this field is to find accurate and quick algorithms to compute the similarity in short times. This will be achieved in using approximation methods. The first approximation method is statistical and, is based on the theorem of Glivenko-Cantelli. The second is the banding technique. And the third concerns a modification of the algorithm proposed by Rajaraman and al (% cite{AnandJeffrey}), denoted here as (RUM). The Jaccard index is the one used in this paper. We finally illustrate these results of the paper on the four Gospels. The results are very conclusive." @default.
- W2100443987 created "2016-06-24" @default.
- W2100443987 creator A5000193182 @default.
- W2100443987 creator A5030212843 @default.
- W2100443987 date "2015-08-15" @default.
- W2100443987 modified "2023-09-27" @default.
- W2100443987 title "Probabilistic, statistical and algorithmic aspects of the similarity of texts and application to Gospels comparison" @default.
- W2100443987 cites W1736726159 @default.
- W2100443987 cites W2106446203 @default.
- W2100443987 cites W2164456230 @default.
- W2100443987 cites W299839057 @default.
- W2100443987 hasPublicationYear "2015" @default.
- W2100443987 type Work @default.
- W2100443987 sameAs 2100443987 @default.
- W2100443987 citedByCount "0" @default.
- W2100443987 crossrefType "posted-content" @default.
- W2100443987 hasAuthorship W2100443987A5000193182 @default.
- W2100443987 hasAuthorship W2100443987A5030212843 @default.
- W2100443987 hasConcept C103278499 @default.
- W2100443987 hasConcept C11413529 @default.
- W2100443987 hasConcept C114289077 @default.
- W2100443987 hasConcept C115961682 @default.
- W2100443987 hasConcept C118615104 @default.
- W2100443987 hasConcept C126042441 @default.
- W2100443987 hasConcept C136764020 @default.
- W2100443987 hasConcept C153180895 @default.
- W2100443987 hasConcept C154945302 @default.
- W2100443987 hasConcept C202444582 @default.
- W2100443987 hasConcept C203519979 @default.
- W2100443987 hasConcept C2777382242 @default.
- W2100443987 hasConcept C2778112365 @default.
- W2100443987 hasConcept C33923547 @default.
- W2100443987 hasConcept C41008148 @default.
- W2100443987 hasConcept C49937458 @default.
- W2100443987 hasConcept C54355233 @default.
- W2100443987 hasConcept C76155785 @default.
- W2100443987 hasConcept C80444323 @default.
- W2100443987 hasConcept C86803240 @default.
- W2100443987 hasConcept C9652623 @default.
- W2100443987 hasConceptScore W2100443987C103278499 @default.
- W2100443987 hasConceptScore W2100443987C11413529 @default.
- W2100443987 hasConceptScore W2100443987C114289077 @default.
- W2100443987 hasConceptScore W2100443987C115961682 @default.
- W2100443987 hasConceptScore W2100443987C118615104 @default.
- W2100443987 hasConceptScore W2100443987C126042441 @default.
- W2100443987 hasConceptScore W2100443987C136764020 @default.
- W2100443987 hasConceptScore W2100443987C153180895 @default.
- W2100443987 hasConceptScore W2100443987C154945302 @default.
- W2100443987 hasConceptScore W2100443987C202444582 @default.
- W2100443987 hasConceptScore W2100443987C203519979 @default.
- W2100443987 hasConceptScore W2100443987C2777382242 @default.
- W2100443987 hasConceptScore W2100443987C2778112365 @default.
- W2100443987 hasConceptScore W2100443987C33923547 @default.
- W2100443987 hasConceptScore W2100443987C41008148 @default.
- W2100443987 hasConceptScore W2100443987C49937458 @default.
- W2100443987 hasConceptScore W2100443987C54355233 @default.
- W2100443987 hasConceptScore W2100443987C76155785 @default.
- W2100443987 hasConceptScore W2100443987C80444323 @default.
- W2100443987 hasConceptScore W2100443987C86803240 @default.
- W2100443987 hasConceptScore W2100443987C9652623 @default.
- W2100443987 hasLocation W21004439871 @default.
- W2100443987 hasOpenAccess W2100443987 @default.
- W2100443987 hasPrimaryLocation W21004439871 @default.
- W2100443987 hasRelatedWork W1487460889 @default.
- W2100443987 hasRelatedWork W1505771370 @default.
- W2100443987 hasRelatedWork W1539311325 @default.
- W2100443987 hasRelatedWork W1575372003 @default.
- W2100443987 hasRelatedWork W1584172119 @default.
- W2100443987 hasRelatedWork W1821814192 @default.
- W2100443987 hasRelatedWork W1979671533 @default.
- W2100443987 hasRelatedWork W2056138982 @default.
- W2100443987 hasRelatedWork W2062775555 @default.
- W2100443987 hasRelatedWork W2074902906 @default.
- W2100443987 hasRelatedWork W2154905744 @default.
- W2100443987 hasRelatedWork W2166778739 @default.
- W2100443987 hasRelatedWork W2241310126 @default.
- W2100443987 hasRelatedWork W2313401933 @default.
- W2100443987 hasRelatedWork W2506847904 @default.
- W2100443987 hasRelatedWork W2914485779 @default.
- W2100443987 hasRelatedWork W295695556 @default.
- W2100443987 hasRelatedWork W2976328945 @default.
- W2100443987 hasRelatedWork W3001100795 @default.
- W2100443987 hasRelatedWork W3151629053 @default.
- W2100443987 isParatext "false" @default.
- W2100443987 isRetracted "false" @default.
- W2100443987 magId "2100443987" @default.
- W2100443987 workType "article" @default.