Matches in SemOpenAlex for { <https://semopenalex.org/work/W2912077575> ?p ?o ?g. }
Showing items 1 to 67 of
67
with 100 items per page.
- W2912077575 abstract "The last decade has experienced a rapid growth in volume and diversity of biological data, thanks to the development of high-throughput technologies related to web services and embeded systems. It is common that information related to a given biological phenomenon is encoded in multiple data sources. On the one hand, this provides a great opportunity for biologists and data scientists to have more unified views about phenomenon of interest. On the other hand, this presents challenges for scientists to find optimal ways in order to wisely extract knowledge from such huge amount of data which normally cannot be done without the help of automated learning systems. Therefore, there is a high need of developing smart learning systems, whose input as set of multiple sources, to support experts to form and assess hypotheses in biology and medicine. In these systems, the problem of combining multiple data sources or data integration needs to be efficiently solved to achieve high performances. Biological data can naturally be represented as graphs. By taking graphs for data representation, we can take advantages from the access to a solid and principled mathematical framework for graphs, and the problem of data integration becomes graph-based integration. In recent years, the machine learning community has witnessed the tremendous growth in the development of kernel-based learning algorithms. Kernel methods whose kernel functions allow to separate between the representation of the data and the general learning algorithm. Interestingly, kernel representation can be applied to any type of data, including trees, graphs, vectors, etc.For this reason, kernel methods are a reasonable and logical choice for graph-based inference systems. However, there is a number of challenges for graph-based systems using kernel methods need to be effectively solved, including definition of node similarity measure, graph sparsity, scalability, efficiency, complementary property exploitation, integration methods.The contributions of the thesis aim at investigating to propose solutions that overcome the challenges faced when constructing graph-based data integration learning systems.The first contribution is the definition of a decompositional graph node kernel, named Conjunctive Disjunctive Node Kernel (CDNK), which intends to measure the similarities between nodes of graphs. Differently of existing graph node kernels that only exploit the topologies of graphs, the proposed kernel also utilizes the available information on the graph nodes. In CDNK, first, the graph is transformed into a set of linked connected components in which we distinguish between “conjunctive” links whose endpoints are in the same connected components and “disjunctive” links that connect nodes located in different connected components. Then the similarity between any couple of nodes is measured by employing a particular graph kernel on two neighborhood subgraphs rooted as each node. Next, it integrates the side information by applying convolution of the discrete information with the real valued vectors associated to graph nodes. Empirical evaluation shows that the kernel presents better performance compared to state-of-the-art graph node kernels.The second contribution aims at dealing with the graph sparsity problem. When working with sparse graphs, i.e graphs with a high number of missing links, the available information is not efficient to learn effectively. An idea to overcome this problem is to use link enrichment to enrich information for graphs. However, the performance of a link enrichment strongly depends on the adopted link prediction method. Therefore, we propose an effective link prediction method (JNSL). In this method, first, each link is represented as a joint neighborhood subgraphs. Then link prediction is considered as a binary classification. We empirically show that the proposed link prediction outperforms various other methods. Besides, we also present a method to boost the performance of diffusion-based kernels, which are most popularly used, by coupling kernel methods with link enrichment. Experimental results prove that the performances of diffusion-based graph node kernels are considerably improved by using link enrichment.The last contribution proposes a general kernel-based framework for graph integration that we name Graph-one. Graph-one is designed to overcome the challenges when handling with graph integration. In particular, it is a scalable and efficient framework. Besides, it is able to deal with unbanlanced settings where the number of positive and negative instances are much different. Numerous variations of Graph-one are evaluated in disease gene prioritization context. The results from experiments illustrate the power of the proposed framework. Precisely, Graph-one shows better performance than various methods. Moreover, Graph-one with data integration gets higher results than it with any single data source. It presents the effectiveness of Graph-one in exploiting the complementary property of graph integration." @default.
- W2912077575 created "2019-02-21" @default.
- W2912077575 creator A5088278039 @default.
- W2912077575 date "2018-04-03" @default.
- W2912077575 modified "2023-09-27" @default.
- W2912077575 title "Kernel methods for large-scale graph-based heterogeneous biological data integration" @default.
- W2912077575 hasPublicationYear "2018" @default.
- W2912077575 type Work @default.
- W2912077575 sameAs 2912077575 @default.
- W2912077575 citedByCount "0" @default.
- W2912077575 crossrefType "journal-article" @default.
- W2912077575 hasAuthorship W2912077575A5088278039 @default.
- W2912077575 hasConcept C100595998 @default.
- W2912077575 hasConcept C119857082 @default.
- W2912077575 hasConcept C122280245 @default.
- W2912077575 hasConcept C12267149 @default.
- W2912077575 hasConcept C124101348 @default.
- W2912077575 hasConcept C154945302 @default.
- W2912077575 hasConcept C160446489 @default.
- W2912077575 hasConcept C201797286 @default.
- W2912077575 hasConcept C2522767166 @default.
- W2912077575 hasConcept C41008148 @default.
- W2912077575 hasConcept C60644358 @default.
- W2912077575 hasConcept C72634772 @default.
- W2912077575 hasConcept C80444323 @default.
- W2912077575 hasConcept C86803240 @default.
- W2912077575 hasConceptScore W2912077575C100595998 @default.
- W2912077575 hasConceptScore W2912077575C119857082 @default.
- W2912077575 hasConceptScore W2912077575C122280245 @default.
- W2912077575 hasConceptScore W2912077575C12267149 @default.
- W2912077575 hasConceptScore W2912077575C124101348 @default.
- W2912077575 hasConceptScore W2912077575C154945302 @default.
- W2912077575 hasConceptScore W2912077575C160446489 @default.
- W2912077575 hasConceptScore W2912077575C201797286 @default.
- W2912077575 hasConceptScore W2912077575C2522767166 @default.
- W2912077575 hasConceptScore W2912077575C41008148 @default.
- W2912077575 hasConceptScore W2912077575C60644358 @default.
- W2912077575 hasConceptScore W2912077575C72634772 @default.
- W2912077575 hasConceptScore W2912077575C80444323 @default.
- W2912077575 hasConceptScore W2912077575C86803240 @default.
- W2912077575 hasLocation W29120775751 @default.
- W2912077575 hasOpenAccess W2912077575 @default.
- W2912077575 hasPrimaryLocation W29120775751 @default.
- W2912077575 hasRelatedWork W1647260487 @default.
- W2912077575 hasRelatedWork W2102966896 @default.
- W2912077575 hasRelatedWork W2344379281 @default.
- W2912077575 hasRelatedWork W2593157645 @default.
- W2912077575 hasRelatedWork W2725194812 @default.
- W2912077575 hasRelatedWork W2784304320 @default.
- W2912077575 hasRelatedWork W2790106378 @default.
- W2912077575 hasRelatedWork W2914086750 @default.
- W2912077575 hasRelatedWork W2935184916 @default.
- W2912077575 hasRelatedWork W2939638899 @default.
- W2912077575 hasRelatedWork W2941253203 @default.
- W2912077575 hasRelatedWork W2965608837 @default.
- W2912077575 hasRelatedWork W2981274387 @default.
- W2912077575 hasRelatedWork W3038461281 @default.
- W2912077575 hasRelatedWork W3104683355 @default.
- W2912077575 hasRelatedWork W3145608311 @default.
- W2912077575 hasRelatedWork W3167650166 @default.
- W2912077575 hasRelatedWork W3172158142 @default.
- W2912077575 hasRelatedWork W3174615437 @default.
- W2912077575 hasRelatedWork W3189403855 @default.
- W2912077575 isParatext "false" @default.
- W2912077575 isRetracted "false" @default.
- W2912077575 magId "2912077575" @default.
- W2912077575 workType "article" @default.