Matches in SemOpenAlex for { <https://semopenalex.org/work/W4242522557> ?p ?o ?g. }
Showing items 1 to 55 of
55
with 100 items per page.
- W4242522557 abstract "Abstract truncated at 3,000 characters - the full version is available in the pdf file. Biological networks and, in particular, biological pathways are composed of thousands of nodes and edges, posing several challenge regarding analysis and storage. The primary format used to represent pathways data is BioPAX (http://biopax.org.) BioPAX is a standard language that aims to enable integration, exchange, visualization and analysis of biological pathway data. BioPAX is an open and collaborative effort made by the community of researchers, software developers, and institutions and it specifically supports data exchange between pathway data groups. BioPAX is defined in OWL and is represented in the RDF/XML format. OWL (Web Ontology Language) is a W3C standard and is designed for use by applications that need to process the content of information instead of just presenting information to humans. RDF is a standard model for data interchange on the Web. Although OWL allows a standard representation of pathways, since it is based on XML, it is a verbose and redundant language, so the storage of pathways may be very huge, preventing an efficient transmission and sharing of this data. The typical size of a pathway is related to the organism, for example, the size of Homo Sapiens pathways (from Reactome database) is near to 200 MB on disk. Moreover, integrating pathways data coming from different data sources may require GBytes of space. A second problem with pathways is related to the possibility to integrate information coming from different data sources to have updated information in a centralized way. There exist several different databases for pathways data that emphasizes different aspect of the same pathway, thus, it could be useful to integrate and annotate together pathways coming from different databases to obtain a centralized and more informative pathway data. The principal obstacle for integrating, storing and exchanging such data is the extreme size growth when several pathways data are merged together, posing several challenges from the computational and archiving point of view. Pathways data can be easily classified as big data, because they meet all the 5V (Volume, Velocity, Variety, Veracity, Value) characteristics typical of Big Data, thus, the necessity to efficiently integrate and compress pathways data arises. The methodology for pathways data integration is based on the following steps: i) aggregation and validation locally of data coming from several pathway databases, ii) identification and normalization of compounds and reactions identifier and iii) integration. Integration occurs at the level of physical entities, such as proteins and small molecules. This is accomplished by linking interaction and pathway records together if they use the same physical entities (such as from UniProt for proteins) and by adding annotation data from UniProt or GeneOntology." @default.
- W4242522557 created "2022-05-12" @default.
- W4242522557 creator A5004845138 @default.
- W4242522557 creator A5019458185 @default.
- W4242522557 creator A5070785913 @default.
- W4242522557 date "2016-07-02" @default.
- W4242522557 modified "2023-09-29" @default.
- W4242522557 title "BioPaxCOMP: an efficient system for integrating, compressing, and querying BioPAX" @default.
- W4242522557 doi "https://doi.org/10.7287/peerj.preprints.2210" @default.
- W4242522557 hasPublicationYear "2016" @default.
- W4242522557 type Work @default.
- W4242522557 citedByCount "0" @default.
- W4242522557 crossrefType "posted-content" @default.
- W4242522557 hasAuthorship W4242522557A5004845138 @default.
- W4242522557 hasAuthorship W4242522557A5019458185 @default.
- W4242522557 hasAuthorship W4242522557A5070785913 @default.
- W4242522557 hasBestOaLocation W42425225571 @default.
- W4242522557 hasConcept C101230327 @default.
- W4242522557 hasConcept C111472728 @default.
- W4242522557 hasConcept C136764020 @default.
- W4242522557 hasConcept C138885662 @default.
- W4242522557 hasConcept C147497476 @default.
- W4242522557 hasConcept C2129575 @default.
- W4242522557 hasConcept C23123220 @default.
- W4242522557 hasConcept C25810664 @default.
- W4242522557 hasConcept C41008148 @default.
- W4242522557 hasConcept C77088390 @default.
- W4242522557 hasConcept C8797682 @default.
- W4242522557 hasConceptScore W4242522557C101230327 @default.
- W4242522557 hasConceptScore W4242522557C111472728 @default.
- W4242522557 hasConceptScore W4242522557C136764020 @default.
- W4242522557 hasConceptScore W4242522557C138885662 @default.
- W4242522557 hasConceptScore W4242522557C147497476 @default.
- W4242522557 hasConceptScore W4242522557C2129575 @default.
- W4242522557 hasConceptScore W4242522557C23123220 @default.
- W4242522557 hasConceptScore W4242522557C25810664 @default.
- W4242522557 hasConceptScore W4242522557C41008148 @default.
- W4242522557 hasConceptScore W4242522557C77088390 @default.
- W4242522557 hasConceptScore W4242522557C8797682 @default.
- W4242522557 hasLocation W42425225571 @default.
- W4242522557 hasOpenAccess W4242522557 @default.
- W4242522557 hasPrimaryLocation W42425225571 @default.
- W4242522557 hasRelatedWork W10456893 @default.
- W4242522557 hasRelatedWork W10884974 @default.
- W4242522557 hasRelatedWork W11976966 @default.
- W4242522557 hasRelatedWork W13574947 @default.
- W4242522557 hasRelatedWork W15470705 @default.
- W4242522557 hasRelatedWork W196095 @default.
- W4242522557 hasRelatedWork W200971 @default.
- W4242522557 hasRelatedWork W3490918 @default.
- W4242522557 hasRelatedWork W3844123 @default.
- W4242522557 hasRelatedWork W598886 @default.
- W4242522557 isParatext "false" @default.
- W4242522557 isRetracted "false" @default.
- W4242522557 workType "article" @default.