Matches in SemOpenAlex for { <https://semopenalex.org/work/W4313409818> ?p ?o ?g. }
Showing items 1 to 59 of
59
with 100 items per page.
- W4313409818 endingPage "152" @default.
- W4313409818 startingPage "129" @default.
- W4313409818 abstract "The digital era has unlocked unprecedented possibilities of compiling corpora of social discourse, which has brought corpus linguistic methods into closer interaction with other methods of discourse analysis and the humanities. Even when not using any specific techniques of corpus linguistics, drawing on some sort of corpus is increasingly resorted to for empirically–grounded social–scientific analysis (sometimes dubbed ‘corpus–assisted discourse analysis’ or ‘corpus–based critical discourse analysis’, cf. Hardt–Mautner 1995; Baker 2016). In the post–Yugoslav space, recent corpus developments have brought table–turning advantages in many areas of discourse research, along with an ongoing proliferation of corpora and tools. Still, for linguists and discourse analysts who embark on collecting specialized corpora for their own research purposes, many questions persist – partly due to the fast–changing background of these issues, but also due to the fact that there is still a gap in the corpus method, and in guidelines for corpus compilation, when applied beyond the anglophone contexts. In this paper we aim to discuss some possible solutions to these difficulties, by presenting one step–by–step account of a corpus building procedure specifically for Croatian, Serbian and Slovenian, through an example of compiling a thematic corpus from digital media sources (news articles and reader comments). Following an overview of corpus types, uses and advantages in social sciences and digital humanities, we present the corpus compilation possibilities in the South Slavic language contexts, including data scraping options, permissions and ethical issues, the factors that facilitate or complicate automated collection, and corpus annotation and processing possibilities. The study shows expanding possibilities for work with the given languages, but also some persistently grey areas where researchers need to make decisions based on research expectations. Overall, the paper aims to recapitulate our own corpus compilation experience in the wider context of South–Slavic corpus linguistics and corpus linguistic approaches in the humanities more generally" @default.
- W4313409818 created "2023-01-06" @default.
- W4313409818 creator A5003830491 @default.
- W4313409818 creator A5005200028 @default.
- W4313409818 creator A5073283958 @default.
- W4313409818 date "2022-12-30" @default.
- W4313409818 modified "2023-09-26" @default.
- W4313409818 title "Corpus compilation for digital humanities in lower– resourced languages: A practical look at compiling thematic digital media corpora in Serbian, Croatian and Slovenian" @default.
- W4313409818 doi "https://doi.org/10.22210/suvlin.2022.094.01" @default.
- W4313409818 hasPublicationYear "2022" @default.
- W4313409818 type Work @default.
- W4313409818 citedByCount "0" @default.
- W4313409818 crossrefType "journal-article" @default.
- W4313409818 hasAuthorship W4313409818A5003830491 @default.
- W4313409818 hasAuthorship W4313409818A5005200028 @default.
- W4313409818 hasAuthorship W4313409818A5073283958 @default.
- W4313409818 hasBestOaLocation W43134098181 @default.
- W4313409818 hasConcept C136764020 @default.
- W4313409818 hasConcept C138885662 @default.
- W4313409818 hasConcept C154945302 @default.
- W4313409818 hasConcept C204321447 @default.
- W4313409818 hasConcept C2474386 @default.
- W4313409818 hasConcept C2776321320 @default.
- W4313409818 hasConcept C2778408831 @default.
- W4313409818 hasConcept C41008148 @default.
- W4313409818 hasConcept C41895202 @default.
- W4313409818 hasConcept C518677369 @default.
- W4313409818 hasConcept C532629269 @default.
- W4313409818 hasConceptScore W4313409818C136764020 @default.
- W4313409818 hasConceptScore W4313409818C138885662 @default.
- W4313409818 hasConceptScore W4313409818C154945302 @default.
- W4313409818 hasConceptScore W4313409818C204321447 @default.
- W4313409818 hasConceptScore W4313409818C2474386 @default.
- W4313409818 hasConceptScore W4313409818C2776321320 @default.
- W4313409818 hasConceptScore W4313409818C2778408831 @default.
- W4313409818 hasConceptScore W4313409818C41008148 @default.
- W4313409818 hasConceptScore W4313409818C41895202 @default.
- W4313409818 hasConceptScore W4313409818C518677369 @default.
- W4313409818 hasConceptScore W4313409818C532629269 @default.
- W4313409818 hasIssue "94" @default.
- W4313409818 hasLocation W43134098181 @default.
- W4313409818 hasOpenAccess W4313409818 @default.
- W4313409818 hasPrimaryLocation W43134098181 @default.
- W4313409818 hasRelatedWork W1524429405 @default.
- W4313409818 hasRelatedWork W2020644894 @default.
- W4313409818 hasRelatedWork W2113322342 @default.
- W4313409818 hasRelatedWork W2251244823 @default.
- W4313409818 hasRelatedWork W2553519899 @default.
- W4313409818 hasRelatedWork W28955488 @default.
- W4313409818 hasRelatedWork W2921793654 @default.
- W4313409818 hasRelatedWork W2954189872 @default.
- W4313409818 hasRelatedWork W4293698322 @default.
- W4313409818 hasRelatedWork W4385785379 @default.
- W4313409818 hasVolume "48" @default.
- W4313409818 isParatext "false" @default.
- W4313409818 isRetracted "false" @default.
- W4313409818 workType "article" @default.