Matches in SemOpenAlex for { <https://semopenalex.org/work/W2059529172> ?p ?o ?g. }
- W2059529172 endingPage "1560" @default.
- W2059529172 startingPage "1549" @default.
- W2059529172 abstract "Genomic sequencing techniques introduce experimental errors into reads which can mislead sequence assembly efforts and complicate the diagnostic process. Here we present a method for detecting and removing sequencing errors from reads generated in genomic shotgun sequencing projects prior to sequence assembly. For each input read, the set of all length k substrings (k-mers) it contains are calculated. The read is evaluated based on the frequency with which each k-mer occurs in the complete data set (k-count). For each read, k-mers are clustered using the variable-bandwidth mean-shift algorithm. Based on the k-count of the cluster center, clusters are classified as error regions or non-error regions. For the 23 real and simulated data sets tested (454 and Solexa), our algorithm detected error regions that cover 99% of all errors. A heuristic algorithm is then applied to detect the location of errors in each putative error region. A read is corrected by removing the errors, thereby creating two or more smaller, error-free read fragments. After performing error removal, the error-rate for all data sets tested decreased (∼35-fold reduction, on average). EDAR has comparable accuracy to methods that correct rather than remove errors and when the error rate is greater than 3% for simulated data sets, it performs better. The performance of the Velvet assembler is generally better with error-removed data. However, for short reads, splitting at the location of errors can be problematic. Following error detection with error correction, rather than removal, may improve the assembly results." @default.
- W2059529172 created "2016-06-24" @default.
- W2059529172 creator A5008782297 @default.
- W2059529172 creator A5024682352 @default.
- W2059529172 creator A5039280850 @default.
- W2059529172 creator A5043475069 @default.
- W2059529172 creator A5044643570 @default.
- W2059529172 creator A5046111155 @default.
- W2059529172 date "2010-11-01" @default.
- W2059529172 modified "2023-09-23" @default.
- W2059529172 title "EDAR: An Efficient Error Detection and Removal Algorithm for Next Generation Sequencing Data" @default.
- W2059529172 cites W1635391495 @default.
- W2059529172 cites W1851713763 @default.
- W2059529172 cites W1966711026 @default.
- W2059529172 cites W1984021931 @default.
- W2059529172 cites W2002875415 @default.
- W2059529172 cites W2005129098 @default.
- W2059529172 cites W2023911006 @default.
- W2059529172 cites W2067191022 @default.
- W2059529172 cites W2096634299 @default.
- W2059529172 cites W2097529842 @default.
- W2059529172 cites W2100237209 @default.
- W2059529172 cites W2135196291 @default.
- W2059529172 cites W2136651963 @default.
- W2059529172 cites W2142749416 @default.
- W2059529172 cites W2144662851 @default.
- W2059529172 cites W2149107824 @default.
- W2059529172 cites W2151017710 @default.
- W2059529172 cites W2155845142 @default.
- W2059529172 cites W2158869408 @default.
- W2059529172 cites W2160969485 @default.
- W2059529172 cites W2169002484 @default.
- W2059529172 cites W4233014035 @default.
- W2059529172 doi "https://doi.org/10.1089/cmb.2010.0127" @default.
- W2059529172 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/20973743" @default.
- W2059529172 hasPublicationYear "2010" @default.
- W2059529172 type Work @default.
- W2059529172 sameAs 2059529172 @default.
- W2059529172 citedByCount "40" @default.
- W2059529172 countsByYear W20595291722012 @default.
- W2059529172 countsByYear W20595291722013 @default.
- W2059529172 countsByYear W20595291722014 @default.
- W2059529172 countsByYear W20595291722015 @default.
- W2059529172 countsByYear W20595291722016 @default.
- W2059529172 countsByYear W20595291722017 @default.
- W2059529172 countsByYear W20595291722018 @default.
- W2059529172 countsByYear W20595291722019 @default.
- W2059529172 countsByYear W20595291722023 @default.
- W2059529172 crossrefType "journal-article" @default.
- W2059529172 hasAuthorship W2059529172A5008782297 @default.
- W2059529172 hasAuthorship W2059529172A5024682352 @default.
- W2059529172 hasAuthorship W2059529172A5039280850 @default.
- W2059529172 hasAuthorship W2059529172A5043475069 @default.
- W2059529172 hasAuthorship W2059529172A5044643570 @default.
- W2059529172 hasAuthorship W2059529172A5046111155 @default.
- W2059529172 hasConcept C101985253 @default.
- W2059529172 hasConcept C103088060 @default.
- W2059529172 hasConcept C104317684 @default.
- W2059529172 hasConcept C113425843 @default.
- W2059529172 hasConcept C11413529 @default.
- W2059529172 hasConcept C150194340 @default.
- W2059529172 hasConcept C154945302 @default.
- W2059529172 hasConcept C162317418 @default.
- W2059529172 hasConcept C177264268 @default.
- W2059529172 hasConcept C182407805 @default.
- W2059529172 hasConcept C18949551 @default.
- W2059529172 hasConcept C199360897 @default.
- W2059529172 hasConcept C2279292 @default.
- W2059529172 hasConcept C2778112365 @default.
- W2059529172 hasConcept C40969351 @default.
- W2059529172 hasConcept C41008148 @default.
- W2059529172 hasConcept C51679486 @default.
- W2059529172 hasConcept C54355233 @default.
- W2059529172 hasConcept C552990157 @default.
- W2059529172 hasConcept C55493867 @default.
- W2059529172 hasConcept C86803240 @default.
- W2059529172 hasConceptScore W2059529172C101985253 @default.
- W2059529172 hasConceptScore W2059529172C103088060 @default.
- W2059529172 hasConceptScore W2059529172C104317684 @default.
- W2059529172 hasConceptScore W2059529172C113425843 @default.
- W2059529172 hasConceptScore W2059529172C11413529 @default.
- W2059529172 hasConceptScore W2059529172C150194340 @default.
- W2059529172 hasConceptScore W2059529172C154945302 @default.
- W2059529172 hasConceptScore W2059529172C162317418 @default.
- W2059529172 hasConceptScore W2059529172C177264268 @default.
- W2059529172 hasConceptScore W2059529172C182407805 @default.
- W2059529172 hasConceptScore W2059529172C18949551 @default.
- W2059529172 hasConceptScore W2059529172C199360897 @default.
- W2059529172 hasConceptScore W2059529172C2279292 @default.
- W2059529172 hasConceptScore W2059529172C2778112365 @default.
- W2059529172 hasConceptScore W2059529172C40969351 @default.
- W2059529172 hasConceptScore W2059529172C41008148 @default.
- W2059529172 hasConceptScore W2059529172C51679486 @default.
- W2059529172 hasConceptScore W2059529172C54355233 @default.
- W2059529172 hasConceptScore W2059529172C552990157 @default.
- W2059529172 hasConceptScore W2059529172C55493867 @default.
- W2059529172 hasConceptScore W2059529172C86803240 @default.
- W2059529172 hasIssue "11" @default.