Matches in SemOpenAlex for { <https://semopenalex.org/work/W2336737996> ?p ?o ?g. }
- W2336737996 endingPage "10" @default.
- W2336737996 startingPage "5" @default.
- W2336737996 abstract "This article reports on the on-going CoRoLa project, aiming at creating a reference corpus of contemporary Romanian (from 1945 onwards), opened for online free exploitation by researchers in linguistics and language processing, teachers of Romanian, students. We invest serious efforts in persuading large publishing houses and other owners of IPR on relevant language data to join us and contribute the project with selections of their text and speech repositories. The CoRoLa project is coordinated by two Computer Science institutes of the Romanian Academy, but enjoys cooperation of and consulting from professional linguists from other institutes of the Romanian Academy. We foresee a written component of the corpus of more than 500 million word forms, and a speech component of about 300 hours of recordings. The entire collection of texts (covering all functional styles of the language) will be pre-processed and annotated at several levels, and also documented with standardized metadata. The pre-processing includes cleaning the data and harmonising the diacritics, sentence splitting and tokenization. Annotation will include morpho-lexical tagging and lemmatization in the first stage, followed by syntactic, semantic and discourse annotation in a later stage." @default.
- W2336737996 created "2016-06-24" @default.
- W2336737996 creator A5006567976 @default.
- W2336737996 creator A5009030184 @default.
- W2336737996 creator A5012218423 @default.
- W2336737996 creator A5012740432 @default.
- W2336737996 creator A5023193320 @default.
- W2336737996 creator A5047101281 @default.
- W2336737996 creator A5049881911 @default.
- W2336737996 creator A5054400839 @default.
- W2336737996 creator A5058597112 @default.
- W2336737996 creator A5062940611 @default.
- W2336737996 creator A5068438303 @default.
- W2336737996 date "2015-07-02" @default.
- W2336737996 modified "2023-10-18" @default.
- W2336737996 title "CoRoLa Starts Blooming - An update on the Reference Corpus of Contemporary Romanian Language" @default.
- W2336737996 cites W10032143 @default.
- W2336737996 cites W1527672225 @default.
- W2336737996 cites W1625582487 @default.
- W2336737996 cites W2095503871 @default.
- W2336737996 cites W2163925746 @default.
- W2336737996 cites W2250638379 @default.
- W2336737996 cites W2250745152 @default.
- W2336737996 cites W2251805931 @default.
- W2336737996 cites W2288416156 @default.
- W2336737996 cites W2612123278 @default.
- W2336737996 cites W2757729335 @default.
- W2336737996 cites W356062828 @default.
- W2336737996 hasPublicationYear "2015" @default.
- W2336737996 type Work @default.
- W2336737996 sameAs 2336737996 @default.
- W2336737996 citedByCount "4" @default.
- W2336737996 countsByYear W23367379962017 @default.
- W2336737996 countsByYear W23367379962018 @default.
- W2336737996 countsByYear W23367379962019 @default.
- W2336737996 crossrefType "journal-article" @default.
- W2336737996 hasAuthorship W2336737996A5006567976 @default.
- W2336737996 hasAuthorship W2336737996A5009030184 @default.
- W2336737996 hasAuthorship W2336737996A5012218423 @default.
- W2336737996 hasAuthorship W2336737996A5012740432 @default.
- W2336737996 hasAuthorship W2336737996A5023193320 @default.
- W2336737996 hasAuthorship W2336737996A5047101281 @default.
- W2336737996 hasAuthorship W2336737996A5049881911 @default.
- W2336737996 hasAuthorship W2336737996A5054400839 @default.
- W2336737996 hasAuthorship W2336737996A5058597112 @default.
- W2336737996 hasAuthorship W2336737996A5062940611 @default.
- W2336737996 hasAuthorship W2336737996A5068438303 @default.
- W2336737996 hasConcept C121332964 @default.
- W2336737996 hasConcept C129400051 @default.
- W2336737996 hasConcept C136764020 @default.
- W2336737996 hasConcept C138885662 @default.
- W2336737996 hasConcept C154945302 @default.
- W2336737996 hasConcept C157659113 @default.
- W2336737996 hasConcept C161831844 @default.
- W2336737996 hasConcept C168167062 @default.
- W2336737996 hasConcept C176982825 @default.
- W2336737996 hasConcept C204321447 @default.
- W2336737996 hasConcept C2776321320 @default.
- W2336737996 hasConcept C2777068528 @default.
- W2336737996 hasConcept C2777530160 @default.
- W2336737996 hasConcept C2780403423 @default.
- W2336737996 hasConcept C41008148 @default.
- W2336737996 hasConcept C41895202 @default.
- W2336737996 hasConcept C93518851 @default.
- W2336737996 hasConcept C97355855 @default.
- W2336737996 hasConceptScore W2336737996C121332964 @default.
- W2336737996 hasConceptScore W2336737996C129400051 @default.
- W2336737996 hasConceptScore W2336737996C136764020 @default.
- W2336737996 hasConceptScore W2336737996C138885662 @default.
- W2336737996 hasConceptScore W2336737996C154945302 @default.
- W2336737996 hasConceptScore W2336737996C157659113 @default.
- W2336737996 hasConceptScore W2336737996C161831844 @default.
- W2336737996 hasConceptScore W2336737996C168167062 @default.
- W2336737996 hasConceptScore W2336737996C176982825 @default.
- W2336737996 hasConceptScore W2336737996C204321447 @default.
- W2336737996 hasConceptScore W2336737996C2776321320 @default.
- W2336737996 hasConceptScore W2336737996C2777068528 @default.
- W2336737996 hasConceptScore W2336737996C2777530160 @default.
- W2336737996 hasConceptScore W2336737996C2780403423 @default.
- W2336737996 hasConceptScore W2336737996C41008148 @default.
- W2336737996 hasConceptScore W2336737996C41895202 @default.
- W2336737996 hasConceptScore W2336737996C93518851 @default.
- W2336737996 hasConceptScore W2336737996C97355855 @default.
- W2336737996 hasLocation W23367379961 @default.
- W2336737996 hasOpenAccess W2336737996 @default.
- W2336737996 hasPrimaryLocation W23367379961 @default.
- W2336737996 hasRelatedWork W1489378058 @default.
- W2336737996 hasRelatedWork W1495030797 @default.
- W2336737996 hasRelatedWork W167014491 @default.
- W2336737996 hasRelatedWork W2123165521 @default.
- W2336737996 hasRelatedWork W2250784218 @default.
- W2336737996 hasRelatedWork W2309058871 @default.
- W2336737996 hasRelatedWork W2340377436 @default.
- W2336737996 hasRelatedWork W2348452562 @default.
- W2336737996 hasRelatedWork W2577770804 @default.
- W2336737996 hasRelatedWork W2614307741 @default.
- W2336737996 hasRelatedWork W2781409318 @default.
- W2336737996 hasRelatedWork W2803321004 @default.
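
The listing above can be read back into a structured record. The following is a minimal sketch, assuming every entry has the shape `- subject predicate object @graph.` seen in the lines above. The names `LINE_RE` and `parse_listing` are hypothetical helpers introduced for illustration and are not part of any SemOpenAlex tooling.

```python
import re
from collections import defaultdict

# Assumed shape of one listing line: "- <subject> <predicate> <object> @<graph>."
LINE_RE = re.compile(r'^-\s+(\S+)\s+(\S+)\s+(.*)\s+@(\S+?)\.$')


def parse_listing(text):
    """Group objects by predicate; lines that do not match the shape are skipped."""
    by_predicate = defaultdict(list)
    for raw in text.splitlines():
        m = LINE_RE.match(raw.strip())
        if m is None:
            continue  # e.g. the "Matches in SemOpenAlex for ..." header line
        _subject, predicate, obj, _graph = m.groups()
        # Strip the surrounding quotes from literals; bare identifiers stay as-is.
        if len(obj) >= 2 and obj[0] == '"' and obj[-1] == '"':
            obj = obj[1:-1]
        by_predicate[predicate].append(obj)
    return dict(by_predicate)


# A few lines copied from the listing above.
sample = """\
- W2336737996 startingPage "5" @default.
- W2336737996 endingPage "10" @default.
- W2336737996 creator A5006567976 @default.
- W2336737996 creator A5009030184 @default.
- W2336737996 type Work @default.
"""

record = parse_listing(sample)
```

Repeated predicates such as `creator`, `cites`, or `hasConcept` accumulate into lists, so single-valued fields like `startingPage` come back as one-element lists.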