Matches in SemOpenAlex for { <https://semopenalex.org/work/W2964315653> ?p ?o ?g. }
- W2964315653 abstract "For tasks like code synthesis from natural language, code retrieval, and code summarization, data-driven models have shown great promise. However, creating these models require parallel data between natural language (NL) and code with fine-grained alignments. Stack Overflow (SO) is a promising source to create such a data set: the questions are diverse and most of them have corresponding answers with high quality code snippets. However, existing heuristic methods (e.g., pairing the title of a post with the code in the accepted answer) are limited both in their coverage and the correctness of the NL-code pairs obtained. In this paper, we propose a novel method to mine high-quality aligned data from SO using two sets of features: hand-crafted features considering the structure of the extracted snippets, and correspondence features obtained by training a probabilistic model to capture the correlation between NL and code using neural networks. These features are fed into a classifier that determines the quality of mined NL-code pairs. Experiments using Python and Java as test beds show that the proposed method greatly expands coverage and accuracy over existing mining methods, even when using only a small number of labeled examples. Further, we find that reasonable results are achieved even when training the classifier on one language and testing on another, showing promise for scaling NL-code mining to a wide variety of programming languages beyond those for which we are able to annotate data." @default.
- W2964315653 created "2019-07-30" @default.
- W2964315653 creator A5019923149 @default.
- W2964315653 creator A5050821883 @default.
- W2964315653 creator A5056966994 @default.
- W2964315653 creator A5068811427 @default.
- W2964315653 creator A5078519761 @default.
- W2964315653 date "2018-05-28" @default.
- W2964315653 modified "2023-09-29" @default.
- W2964315653 title "Learning to mine aligned code and natural language pairs from stack overflow" @default.
- W2964315653 cites W1588986231 @default.
- W2964315653 cites W1655078475 @default.
- W2964315653 cites W1972141422 @default.
- W2964315653 cites W1973719497 @default.
- W2964315653 cites W1974020522 @default.
- W2964315653 cites W1997358723 @default.
- W2964315653 cites W2010608861 @default.
- W2964315653 cites W2023925487 @default.
- W2964315653 cites W2033705196 @default.
- W2964315653 cites W2054855378 @default.
- W2964315653 cites W2117228548 @default.
- W2964315653 cites W2125943921 @default.
- W2964315653 cites W2136189984 @default.
- W2964315653 cites W2143960295 @default.
- W2964315653 cites W2156981320 @default.
- W2964315653 cites W2158396456 @default.
- W2964315653 cites W2242083635 @default.
- W2964315653 cites W2251957808 @default.
- W2964315653 cites W2298285108 @default.
- W2964315653 cites W2344444819 @default.
- W2964315653 cites W2516621648 @default.
- W2964315653 cites W2740220421 @default.
- W2964315653 cites W2788306232 @default.
- W2964315653 cites W2884681705 @default.
- W2964315653 cites W2962728167 @default.
- W2964315653 cites W2963617989 @default.
- W2964315653 cites W2964284687 @default.
- W2964315653 cites W3098403328 @default.
- W2964315653 cites W4289255588 @default.
- W2964315653 doi "https://doi.org/10.1145/3196398.3196408" @default.
- W2964315653 hasPublicationYear "2018" @default.
- W2964315653 type Work @default.
- W2964315653 sameAs 2964315653 @default.
- W2964315653 citedByCount "112" @default.
- W2964315653 countsByYear W29643156532018 @default.
- W2964315653 countsByYear W29643156532019 @default.
- W2964315653 countsByYear W29643156532020 @default.
- W2964315653 countsByYear W29643156532021 @default.
- W2964315653 countsByYear W29643156532022 @default.
- W2964315653 countsByYear W29643156532023 @default.
- W2964315653 crossrefType "proceedings-article" @default.
- W2964315653 hasAuthorship W2964315653A5019923149 @default.
- W2964315653 hasAuthorship W2964315653A5050821883 @default.
- W2964315653 hasAuthorship W2964315653A5056966994 @default.
- W2964315653 hasAuthorship W2964315653A5068811427 @default.
- W2964315653 hasAuthorship W2964315653A5078519761 @default.
- W2964315653 hasBestOaLocation W29643156532 @default.
- W2964315653 hasConcept C124101348 @default.
- W2964315653 hasConcept C154945302 @default.
- W2964315653 hasConcept C170858558 @default.
- W2964315653 hasConcept C195324797 @default.
- W2964315653 hasConcept C199360897 @default.
- W2964315653 hasConcept C204321447 @default.
- W2964315653 hasConcept C41008148 @default.
- W2964315653 hasConcept C43126263 @default.
- W2964315653 hasConcept C51929080 @default.
- W2964315653 hasConcept C519991488 @default.
- W2964315653 hasConcept C548217200 @default.
- W2964315653 hasConcept C55439883 @default.
- W2964315653 hasConcept C58646249 @default.
- W2964315653 hasConcept C60048249 @default.
- W2964315653 hasConcept C95623464 @default.
- W2964315653 hasConceptScore W2964315653C124101348 @default.
- W2964315653 hasConceptScore W2964315653C154945302 @default.
- W2964315653 hasConceptScore W2964315653C170858558 @default.
- W2964315653 hasConceptScore W2964315653C195324797 @default.
- W2964315653 hasConceptScore W2964315653C199360897 @default.
- W2964315653 hasConceptScore W2964315653C204321447 @default.
- W2964315653 hasConceptScore W2964315653C41008148 @default.
- W2964315653 hasConceptScore W2964315653C43126263 @default.
- W2964315653 hasConceptScore W2964315653C51929080 @default.
- W2964315653 hasConceptScore W2964315653C519991488 @default.
- W2964315653 hasConceptScore W2964315653C548217200 @default.
- W2964315653 hasConceptScore W2964315653C55439883 @default.
- W2964315653 hasConceptScore W2964315653C58646249 @default.
- W2964315653 hasConceptScore W2964315653C60048249 @default.
- W2964315653 hasConceptScore W2964315653C95623464 @default.
- W2964315653 hasLocation W29643156531 @default.
- W2964315653 hasLocation W29643156532 @default.
- W2964315653 hasOpenAccess W2964315653 @default.
- W2964315653 hasPrimaryLocation W29643156531 @default.
- W2964315653 hasRelatedWork W118725990 @default.
- W2964315653 hasRelatedWork W1575927706 @default.
- W2964315653 hasRelatedWork W270927586 @default.
- W2964315653 hasRelatedWork W3101523611 @default.
- W2964315653 hasRelatedWork W3199434107 @default.
- W2964315653 hasRelatedWork W4205531442 @default.
- W2964315653 hasRelatedWork W4288076218 @default.
- W2964315653 hasRelatedWork W4312970618 @default.
- W2964315653 hasRelatedWork W4380768849 @default.