Matches in SemOpenAlex for { <https://semopenalex.org/work/W3020882095> ?p ?o ?g. }
- W3020882095 abstract "Motivation: Alignment-free distance and similarity functions (AF functions, for short) are a computationally convenient alternative to pairwise and multiple sequence alignments for many genomic, metagenomic and epigenomic tasks. Yet, their use is still at the proof-of-principle stage: only recently has a benchmarking study coherently evaluated a handful of the functions proposed over the years, identifying a pool of well-performing ones. However, more is needed to make this pool usable on a day-to-day basis. In particular, a quantification of the statistical significance of the output of a given function would greatly help when no reference point is available. For most functions, such an analysis is bound to be based on Monte Carlo Hypothesis Test simulations, yielding a dramatic increase in computational time that transforms this into a Big Data problem. Surprisingly, this problem has hardly been considered, despite the increasing popularity of Big Data Technologies in Computational Biology. Results: We fill this important gap by providing the first user-friendly, extensible, efficient Spark platform for Alignment-free genomic analysis. Thanks to its scalability, Monte Carlo Hypothesis Test simulations on the output of AF functions can seamlessly be afforded for either small or huge collections of sequences. Thus, we are able to comparatively study, for the first time, AF functions in relation to the statistical significance of their output. This novel analysis allows us to reduce the pool of well-performing functions coming from the benchmarking study to a handful of them." @default.
- W3020882095 created "2020-05-13" @default.
- W3020882095 creator A5018051509 @default.
- W3020882095 creator A5066774697 @default.
- W3020882095 creator A5075642910 @default.
- W3020882095 creator A5078881318 @default.
- W3020882095 date "2020-05-02" @default.
- W3020882095 modified "2023-09-27" @default.
- W3020882095 title "An Extensible, Scalable Spark Platform for Alignment-free Genomic Analysis -- Version 2" @default.
- W3020882095 cites W1987591432 @default.
- W3020882095 cites W1996069010 @default.
- W3020882095 cites W2006075770 @default.
- W3020882095 cites W2053276922 @default.
- W3020882095 cites W2060425093 @default.
- W3020882095 cites W2094890728 @default.
- W3020882095 cites W2117897510 @default.
- W3020882095 cites W2120771433 @default.
- W3020882095 cites W2138486754 @default.
- W3020882095 cites W2141865968 @default.
- W3020882095 cites W2168619423 @default.
- W3020882095 cites W2173213060 @default.
- W3020882095 cites W2339602899 @default.
- W3020882095 cites W2468915207 @default.
- W3020882095 cites W2611515161 @default.
- W3020882095 cites W2774657098 @default.
- W3020882095 cites W2811072203 @default.
- W3020882095 cites W2950150251 @default.
- W3020882095 cites W2962807110 @default.
- W3020882095 cites W2892087261 @default.
- W3020882095 hasPublicationYear "2020" @default.
- W3020882095 type Work @default.
- W3020882095 sameAs 3020882095 @default.
- W3020882095 citedByCount "0" @default.
- W3020882095 crossrefType "posted-content" @default.
- W3020882095 hasAuthorship W3020882095A5018051509 @default.
- W3020882095 hasAuthorship W3020882095A5066774697 @default.
- W3020882095 hasAuthorship W3020882095A5075642910 @default.
- W3020882095 hasAuthorship W3020882095A5078881318 @default.
- W3020882095 hasConcept C105795698 @default.
- W3020882095 hasConcept C124101348 @default.
- W3020882095 hasConcept C136764020 @default.
- W3020882095 hasConcept C14036430 @default.
- W3020882095 hasConcept C144133560 @default.
- W3020882095 hasConcept C162853370 @default.
- W3020882095 hasConcept C19499675 @default.
- W3020882095 hasConcept C199360897 @default.
- W3020882095 hasConcept C2780615836 @default.
- W3020882095 hasConcept C2781215313 @default.
- W3020882095 hasConcept C33923547 @default.
- W3020882095 hasConcept C41008148 @default.
- W3020882095 hasConcept C48044578 @default.
- W3020882095 hasConcept C75684735 @default.
- W3020882095 hasConcept C77088390 @default.
- W3020882095 hasConcept C78458016 @default.
- W3020882095 hasConcept C80444323 @default.
- W3020882095 hasConcept C86251818 @default.
- W3020882095 hasConcept C86803240 @default.
- W3020882095 hasConcept C87007009 @default.
- W3020882095 hasConceptScore W3020882095C105795698 @default.
- W3020882095 hasConceptScore W3020882095C124101348 @default.
- W3020882095 hasConceptScore W3020882095C136764020 @default.
- W3020882095 hasConceptScore W3020882095C14036430 @default.
- W3020882095 hasConceptScore W3020882095C144133560 @default.
- W3020882095 hasConceptScore W3020882095C162853370 @default.
- W3020882095 hasConceptScore W3020882095C19499675 @default.
- W3020882095 hasConceptScore W3020882095C199360897 @default.
- W3020882095 hasConceptScore W3020882095C2780615836 @default.
- W3020882095 hasConceptScore W3020882095C2781215313 @default.
- W3020882095 hasConceptScore W3020882095C33923547 @default.
- W3020882095 hasConceptScore W3020882095C41008148 @default.
- W3020882095 hasConceptScore W3020882095C48044578 @default.
- W3020882095 hasConceptScore W3020882095C75684735 @default.
- W3020882095 hasConceptScore W3020882095C77088390 @default.
- W3020882095 hasConceptScore W3020882095C78458016 @default.
- W3020882095 hasConceptScore W3020882095C80444323 @default.
- W3020882095 hasConceptScore W3020882095C86251818 @default.
- W3020882095 hasConceptScore W3020882095C86803240 @default.
- W3020882095 hasConceptScore W3020882095C87007009 @default.
- W3020882095 hasLocation W30208820951 @default.
- W3020882095 hasOpenAccess W3020882095 @default.
- W3020882095 hasPrimaryLocation W30208820951 @default.
- W3020882095 hasRelatedWork W121578967 @default.
- W3020882095 hasRelatedWork W1567061291 @default.
- W3020882095 hasRelatedWork W2009571806 @default.
- W3020882095 hasRelatedWork W2017614184 @default.
- W3020882095 hasRelatedWork W2139361324 @default.
- W3020882095 hasRelatedWork W2188269859 @default.
- W3020882095 hasRelatedWork W2253028448 @default.
- W3020882095 hasRelatedWork W2398924165 @default.
- W3020882095 hasRelatedWork W2401927118 @default.
- W3020882095 hasRelatedWork W2440071369 @default.
- W3020882095 hasRelatedWork W2514134594 @default.
- W3020882095 hasRelatedWork W2623062799 @default.
- W3020882095 hasRelatedWork W2803639275 @default.
- W3020882095 hasRelatedWork W2811072203 @default.
- W3020882095 hasRelatedWork W2967298165 @default.
- W3020882095 hasRelatedWork W2983055179 @default.
- W3020882095 hasRelatedWork W2988523778 @default.
- W3020882095 hasRelatedWork W3173768211 @default.
- W3020882095 hasRelatedWork W2535897643 @default.