Matches in SemOpenAlex for { <https://semopenalex.org/work/W4286911815> ?p ?o ?g. }
Showing items 1 to 79 of 79 with 100 items per page.
- W4286911815 abstract "Recent models in developing summarization systems consist of millions of parameters and the model performance is highly dependent on the abundance of training data. While most existing summarization corpora contain data in the order of thousands to one million, generation of large-scale summarization datasets in the order of a couple of millions is yet to be explored. Practically, more data is better at generalizing the training patterns to unseen data. In this paper, we introduce TLDR9+ -- a large-scale summarization dataset -- containing over 9 million training instances extracted from the Reddit discussion forum (https://github.com/sajastu/reddit_collector). This dataset is specifically gathered to perform extreme summarization (i.e., generating a one-sentence summary in high compression and abstraction) and is more than twice as large as the previously proposed dataset. We go one step further and with the help of human annotations, we distill a more fine-grained dataset by sampling High-Quality instances from TLDR9+ and call it the TLDRHQ dataset. We further pinpoint different state-of-the-art summarization models on our proposed datasets." @default.
- W4286911815 created "2022-07-25" @default.
- W4286911815 creator A5028863551 @default.
- W4286911815 creator A5036610566 @default.
- W4286911815 creator A5037219190 @default.
- W4286911815 creator A5083681053 @default.
- W4286911815 date "2021-10-03" @default.
- W4286911815 modified "2023-09-27" @default.
- W4286911815 title "TLDR9+: A Large Scale Resource for Extreme Summarization of Social Media Posts" @default.
- W4286911815 doi "https://doi.org/10.48550/arxiv.2110.01159" @default.
- W4286911815 hasPublicationYear "2021" @default.
- W4286911815 type Work @default.
- W4286911815 citedByCount "0" @default.
- W4286911815 crossrefType "posted-content" @default.
- W4286911815 hasAuthorship W4286911815A5028863551 @default.
- W4286911815 hasAuthorship W4286911815A5036610566 @default.
- W4286911815 hasAuthorship W4286911815A5037219190 @default.
- W4286911815 hasAuthorship W4286911815A5083681053 @default.
- W4286911815 hasBestOaLocation W42869118151 @default.
- W4286911815 hasConcept C106131492 @default.
- W4286911815 hasConcept C111472728 @default.
- W4286911815 hasConcept C121332964 @default.
- W4286911815 hasConcept C124101348 @default.
- W4286911815 hasConcept C124304363 @default.
- W4286911815 hasConcept C134714966 @default.
- W4286911815 hasConcept C136764020 @default.
- W4286911815 hasConcept C138885662 @default.
- W4286911815 hasConcept C140779682 @default.
- W4286911815 hasConcept C154945302 @default.
- W4286911815 hasConcept C170858558 @default.
- W4286911815 hasConcept C204321447 @default.
- W4286911815 hasConcept C206345919 @default.
- W4286911815 hasConcept C23123220 @default.
- W4286911815 hasConcept C2522767166 @default.
- W4286911815 hasConcept C2777530160 @default.
- W4286911815 hasConcept C2778755073 @default.
- W4286911815 hasConcept C31258907 @default.
- W4286911815 hasConcept C31972630 @default.
- W4286911815 hasConcept C41008148 @default.
- W4286911815 hasConcept C518677369 @default.
- W4286911815 hasConcept C62520636 @default.
- W4286911815 hasConceptScore W4286911815C106131492 @default.
- W4286911815 hasConceptScore W4286911815C111472728 @default.
- W4286911815 hasConceptScore W4286911815C121332964 @default.
- W4286911815 hasConceptScore W4286911815C124101348 @default.
- W4286911815 hasConceptScore W4286911815C124304363 @default.
- W4286911815 hasConceptScore W4286911815C134714966 @default.
- W4286911815 hasConceptScore W4286911815C136764020 @default.
- W4286911815 hasConceptScore W4286911815C138885662 @default.
- W4286911815 hasConceptScore W4286911815C140779682 @default.
- W4286911815 hasConceptScore W4286911815C154945302 @default.
- W4286911815 hasConceptScore W4286911815C170858558 @default.
- W4286911815 hasConceptScore W4286911815C204321447 @default.
- W4286911815 hasConceptScore W4286911815C206345919 @default.
- W4286911815 hasConceptScore W4286911815C23123220 @default.
- W4286911815 hasConceptScore W4286911815C2522767166 @default.
- W4286911815 hasConceptScore W4286911815C2777530160 @default.
- W4286911815 hasConceptScore W4286911815C2778755073 @default.
- W4286911815 hasConceptScore W4286911815C31258907 @default.
- W4286911815 hasConceptScore W4286911815C31972630 @default.
- W4286911815 hasConceptScore W4286911815C41008148 @default.
- W4286911815 hasConceptScore W4286911815C518677369 @default.
- W4286911815 hasConceptScore W4286911815C62520636 @default.
- W4286911815 hasLocation W42869118151 @default.
- W4286911815 hasOpenAccess W4286911815 @default.
- W4286911815 hasPrimaryLocation W42869118151 @default.
- W4286911815 hasRelatedWork W132250100 @default.
- W4286911815 hasRelatedWork W1539478205 @default.
- W4286911815 hasRelatedWork W1943954554 @default.
- W4286911815 hasRelatedWork W2093597205 @default.
- W4286911815 hasRelatedWork W2099984331 @default.
- W4286911815 hasRelatedWork W2106813246 @default.
- W4286911815 hasRelatedWork W2347941600 @default.
- W4286911815 hasRelatedWork W2389846579 @default.
- W4286911815 hasRelatedWork W2392495745 @default.
- W4286911815 hasRelatedWork W2793376154 @default.
- W4286911815 isParatext "false" @default.
- W4286911815 isRetracted "false" @default.
- W4286911815 workType "article" @default.