Matches in SemOpenAlex for { <https://semopenalex.org/work/W367034151> ?p ?o ?g. }
Showing items 1 to 83 of
83
with 100 items per page.
- W367034151 abstract "During the last ten years, a genomics revolution has changed the ways biological research is carried out. The draft sequence of the human genome is available, as well as the sequence of 84 other completed genomes. High-throughput genomics technologies such as genome sequencing with associated bioinformatics tools have been instrumental in this process. The draft genome sequences were determined using the shotgun sequencing strategy, where long DNA molecules are randomly sheared into small pieces from which sequences are determined. These are assembled by computer programs in order to reconstruct the original genome sequence. Ubiquitous repeated sequences together with errors in the sequencing process complicate the assembly of shotgun fragments. In most genome projects gaps are caused by this complication. This thesis presents methods and algorithms to separate repeated sequences in shotgun projects. The Tandem Repeat Assembly Program (TRAP) builds multiple alignments of reads, which are then analyzed in order to discriminate sequencing errors from real differences between highly similar repeats. The method is based on the fact that sequencing errors are randomly distributed, as opposed to the systematic distribution of mutations in repeat copies. The TRAP assembler was shown to be able to correctly assemble 2000 bp repeat copies that are repeated in tandem eight times. The degree of difference between repeat copies was 1.0%, and the maximum sequencing error 11%.A refined method based on single base differences between repeat copies has been developed to further improve repeat separation. Results show that in the same sequence, 87% of all the single base differences present in the repeats can be detected, with an error of only 1.6%.In addition, a novel pattern-matching algorithm was developed. This algorithm takes advantage of the inherent symmetry between indices that can be computed for similar words of the same length and was implemented in the error correction software, MisEd. The results show that up to 99.3% of the sequencing errors can be corrected, while up to 87% of the single base differences remain.All methods described have thus been shown to be functional, and it is clear that these programs will facilitate genome sequencing and assembly." @default.
- W367034151 created "2016-06-24" @default.
- W367034151 creator A5006437281 @default.
- W367034151 creator A5014534528 @default.
- W367034151 creator A5046347871 @default.
- W367034151 creator A5064986157 @default.
- W367034151 date "2002-01-01" @default.
- W367034151 modified "2023-09-29" @default.
- W367034151 title "Correcting Errors in Shotgun Sequences Using DNPs and a Novel Pattern Matching Algorithm" @default.
- W367034151 hasPublicationYear "2002" @default.
- W367034151 type Work @default.
- W367034151 sameAs 367034151 @default.
- W367034151 citedByCount "0" @default.
- W367034151 crossrefType "journal-article" @default.
- W367034151 hasAuthorship W367034151A5006437281 @default.
- W367034151 hasAuthorship W367034151A5014534528 @default.
- W367034151 hasAuthorship W367034151A5046347871 @default.
- W367034151 hasAuthorship W367034151A5064986157 @default.
- W367034151 hasConcept C101985253 @default.
- W367034151 hasConcept C104317684 @default.
- W367034151 hasConcept C113425843 @default.
- W367034151 hasConcept C11413529 @default.
- W367034151 hasConcept C141231307 @default.
- W367034151 hasConcept C150194340 @default.
- W367034151 hasConcept C162317418 @default.
- W367034151 hasConcept C189206191 @default.
- W367034151 hasConcept C18949551 @default.
- W367034151 hasConcept C24432333 @default.
- W367034151 hasConcept C27149982 @default.
- W367034151 hasConcept C2778112365 @default.
- W367034151 hasConcept C2781434637 @default.
- W367034151 hasConcept C41008148 @default.
- W367034151 hasConcept C51679486 @default.
- W367034151 hasConcept C54355233 @default.
- W367034151 hasConcept C552990157 @default.
- W367034151 hasConcept C70721500 @default.
- W367034151 hasConcept C86803240 @default.
- W367034151 hasConceptScore W367034151C101985253 @default.
- W367034151 hasConceptScore W367034151C104317684 @default.
- W367034151 hasConceptScore W367034151C113425843 @default.
- W367034151 hasConceptScore W367034151C11413529 @default.
- W367034151 hasConceptScore W367034151C141231307 @default.
- W367034151 hasConceptScore W367034151C150194340 @default.
- W367034151 hasConceptScore W367034151C162317418 @default.
- W367034151 hasConceptScore W367034151C189206191 @default.
- W367034151 hasConceptScore W367034151C18949551 @default.
- W367034151 hasConceptScore W367034151C24432333 @default.
- W367034151 hasConceptScore W367034151C27149982 @default.
- W367034151 hasConceptScore W367034151C2778112365 @default.
- W367034151 hasConceptScore W367034151C2781434637 @default.
- W367034151 hasConceptScore W367034151C41008148 @default.
- W367034151 hasConceptScore W367034151C51679486 @default.
- W367034151 hasConceptScore W367034151C54355233 @default.
- W367034151 hasConceptScore W367034151C552990157 @default.
- W367034151 hasConceptScore W367034151C70721500 @default.
- W367034151 hasConceptScore W367034151C86803240 @default.
- W367034151 hasLocation W3670341511 @default.
- W367034151 hasOpenAccess W367034151 @default.
- W367034151 hasPrimaryLocation W3670341511 @default.
- W367034151 hasRelatedWork W1897166569 @default.
- W367034151 hasRelatedWork W1975736716 @default.
- W367034151 hasRelatedWork W2010454899 @default.
- W367034151 hasRelatedWork W2041238393 @default.
- W367034151 hasRelatedWork W2087008771 @default.
- W367034151 hasRelatedWork W2110129776 @default.
- W367034151 hasRelatedWork W2155890678 @default.
- W367034151 hasRelatedWork W2159497035 @default.
- W367034151 hasRelatedWork W2173935179 @default.
- W367034151 hasRelatedWork W2387513919 @default.
- W367034151 hasRelatedWork W2566436150 @default.
- W367034151 hasRelatedWork W2795056457 @default.
- W367034151 hasRelatedWork W2891300676 @default.
- W367034151 hasRelatedWork W2955242233 @default.
- W367034151 hasRelatedWork W2964547832 @default.
- W367034151 hasRelatedWork W3011612808 @default.
- W367034151 hasRelatedWork W3015563293 @default.
- W367034151 hasRelatedWork W3088695935 @default.
- W367034151 hasRelatedWork W3127433020 @default.
- W367034151 hasRelatedWork W3135487127 @default.
- W367034151 isParatext "false" @default.
- W367034151 isRetracted "false" @default.
- W367034151 magId "367034151" @default.
- W367034151 workType "article" @default.