Matches in SemOpenAlex for { <https://semopenalex.org/work/W4311684328> ?p ?o ?g. }
Showing items 1 to 100 of
100
with 100 items per page.
- W4311684328 endingPage "e0279280" @default.
- W4311684328 startingPage "e0279280" @default.
- W4311684328 abstract "Plasmids are important genetic elements that facilitate horizonal gene transfer between bacteria and contribute to the spread of virulence and antimicrobial resistance. Most bacterial genome sequences in the public archives exist in draft form with many contigs, making it difficult to determine if a contig is of chromosomal or plasmid origin. Using a training set of contigs comprising 10,584 chromosomes and 10,654 plasmids from the PATRIC database, we evaluated several machine learning models including random forest, logistic regression, XGBoost, and a neural network for their ability to classify chromosomal and plasmid sequences using nucleotide k-mers as features. Based on the methods tested, a neural network model that used nucleotide 6-mers as features that was trained on randomly selected chromosomal and plasmid subsequences 5kb in length achieved the best performance, outperforming existing out-of-the-box methods, with an average accuracy of 89.38% ± 2.16% over a 10-fold cross validation. The model accuracy can be improved to 92.08% by using a voting strategy when classifying holdout sequences. In both plasmids and chromosomes, subsequences encoding functions involved in horizontal gene transfer-including hypothetical proteins, transporters, phage, mobile elements, and CRISPR elements-were most likely to be misclassified by the model. This study provides a straightforward approach for identifying plasmid-encoding sequences in short read assemblies without the need for sequence alignment-based tools." @default.
- W4311684328 created "2022-12-28" @default.
- W4311684328 creator A5010147529 @default.
- W4311684328 creator A5022265757 @default.
- W4311684328 creator A5052160218 @default.
- W4311684328 creator A5065254196 @default.
- W4311684328 creator A5073549627 @default.
- W4311684328 date "2022-12-16" @default.
- W4311684328 modified "2023-09-26" @default.
- W4311684328 title "Classification of bacterial plasmid and chromosome derived sequences using machine learning" @default.
- W4311684328 cites W1822796043 @default.
- W4311684328 cites W1923559908 @default.
- W4311684328 cites W1998200698 @default.
- W4311684328 cites W2063756105 @default.
- W4311684328 cites W2085487818 @default.
- W4311684328 cites W2132801908 @default.
- W4311684328 cites W2143325926 @default.
- W4311684328 cites W2146341019 @default.
- W4311684328 cites W2149452073 @default.
- W4311684328 cites W2559429800 @default.
- W4311684328 cites W2583363792 @default.
- W4311684328 cites W2620979206 @default.
- W4311684328 cites W2737706773 @default.
- W4311684328 cites W2782623083 @default.
- W4311684328 cites W2803474517 @default.
- W4311684328 cites W2890718724 @default.
- W4311684328 cites W2909601828 @default.
- W4311684328 cites W2949831026 @default.
- W4311684328 cites W2950919540 @default.
- W4311684328 cites W2962645938 @default.
- W4311684328 cites W2971038220 @default.
- W4311684328 cites W2977580499 @default.
- W4311684328 cites W2986916574 @default.
- W4311684328 cites W3014345701 @default.
- W4311684328 cites W3102476541 @default.
- W4311684328 cites W3125845881 @default.
- W4311684328 cites W3214956918 @default.
- W4311684328 cites W4245572222 @default.
- W4311684328 cites W4296776269 @default.
- W4311684328 cites W4308616455 @default.
- W4311684328 doi "https://doi.org/10.1371/journal.pone.0279280" @default.
- W4311684328 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/36525447" @default.
- W4311684328 hasPublicationYear "2022" @default.
- W4311684328 type Work @default.
- W4311684328 citedByCount "0" @default.
- W4311684328 crossrefType "journal-article" @default.
- W4311684328 hasAuthorship W4311684328A5010147529 @default.
- W4311684328 hasAuthorship W4311684328A5022265757 @default.
- W4311684328 hasAuthorship W4311684328A5052160218 @default.
- W4311684328 hasAuthorship W4311684328A5065254196 @default.
- W4311684328 hasAuthorship W4311684328A5073549627 @default.
- W4311684328 hasBestOaLocation W43116843281 @default.
- W4311684328 hasConcept C104317684 @default.
- W4311684328 hasConcept C141231307 @default.
- W4311684328 hasConcept C154945302 @default.
- W4311684328 hasConcept C182901222 @default.
- W4311684328 hasConcept C22744801 @default.
- W4311684328 hasConcept C30481170 @default.
- W4311684328 hasConcept C41008148 @default.
- W4311684328 hasConcept C54355233 @default.
- W4311684328 hasConcept C59582021 @default.
- W4311684328 hasConcept C70721500 @default.
- W4311684328 hasConcept C86803240 @default.
- W4311684328 hasConcept C92938381 @default.
- W4311684328 hasConceptScore W4311684328C104317684 @default.
- W4311684328 hasConceptScore W4311684328C141231307 @default.
- W4311684328 hasConceptScore W4311684328C154945302 @default.
- W4311684328 hasConceptScore W4311684328C182901222 @default.
- W4311684328 hasConceptScore W4311684328C22744801 @default.
- W4311684328 hasConceptScore W4311684328C30481170 @default.
- W4311684328 hasConceptScore W4311684328C41008148 @default.
- W4311684328 hasConceptScore W4311684328C54355233 @default.
- W4311684328 hasConceptScore W4311684328C59582021 @default.
- W4311684328 hasConceptScore W4311684328C70721500 @default.
- W4311684328 hasConceptScore W4311684328C86803240 @default.
- W4311684328 hasConceptScore W4311684328C92938381 @default.
- W4311684328 hasFunder F4320321001 @default.
- W4311684328 hasFunder F4320338412 @default.
- W4311684328 hasIssue "12" @default.
- W4311684328 hasLocation W43116843281 @default.
- W4311684328 hasLocation W43116843282 @default.
- W4311684328 hasLocation W43116843283 @default.
- W4311684328 hasOpenAccess W4311684328 @default.
- W4311684328 hasPrimaryLocation W43116843281 @default.
- W4311684328 hasRelatedWork W1571771606 @default.
- W4311684328 hasRelatedWork W1976615336 @default.
- W4311684328 hasRelatedWork W2022680262 @default.
- W4311684328 hasRelatedWork W2030926190 @default.
- W4311684328 hasRelatedWork W2083308722 @default.
- W4311684328 hasRelatedWork W2108742089 @default.
- W4311684328 hasRelatedWork W2128903938 @default.
- W4311684328 hasRelatedWork W2524850911 @default.
- W4311684328 hasRelatedWork W2793602347 @default.
- W4311684328 hasRelatedWork W3153552122 @default.
- W4311684328 hasVolume "17" @default.
- W4311684328 isParatext "false" @default.
- W4311684328 isRetracted "false" @default.
- W4311684328 workType "article" @default.