Matches in SemOpenAlex for { <https://semopenalex.org/work/W3201979047> ?p ?o ?g. }
Showing items 1 to 60 of 60 with 100 items per page.
- W3201979047 endingPage "1333" @default.
- W3201979047 startingPage "1323" @default.
- W3201979047 abstract "The anonymization of unstructured texts has become a very popular and widely researched topic. This is due not only to the latest GDPR regulation, but also due to the development of state-of-the-art models in the field of natural language processing. The texts required for building such models have to be anonymized before and very often have to be anonymized on the premises of data providers, not the machine learning teams. In this work, we present the use of machine learning models such as part-of-speech tagger or named entity recognizer and their integration with regular expressions for anonymization of unstructured texts in Polish. The goal is to create a system that recognizes many types of sensitive data and can remove, tag, and pseudo-anonymize (replace with words from the same category in an appropriate form) the detected tokens. To test the performance of this system, we prepared a manually annotated dataset containing different categories of sensitive data. The paper presents a detailed analysis of the proposed method’s performance. Moreover, a deployment architecture is discussed in the paper, that results in the creation of a scalable tool capable of processing a large amount of data that can be easily used." @default.
- W3201979047 created "2021-10-11" @default.
- W3201979047 creator A5010635837 @default.
- W3201979047 creator A5038963939 @default.
- W3201979047 creator A5084544919 @default.
- W3201979047 date "2021-01-01" @default.
- W3201979047 modified "2023-10-14" @default.
- W3201979047 title "Automated anonymization of text documents in Polish" @default.
- W3201979047 cites W1558796178 @default.
- W3201979047 cites W1863940422 @default.
- W3201979047 cites W1974710463 @default.
- W3201979047 cites W1995228216 @default.
- W3201979047 cites W2104218280 @default.
- W3201979047 cites W2123512824 @default.
- W3201979047 cites W2156061936 @default.
- W3201979047 cites W2740181738 @default.
- W3201979047 cites W2785144599 @default.
- W3201979047 cites W2891583441 @default.
- W3201979047 cites W2990955708 @default.
- W3201979047 cites W3123367219 @default.
- W3201979047 cites W4230722969 @default.
- W3201979047 cites W50158085 @default.
- W3201979047 cites W79139011 @default.
- W3201979047 doi "https://doi.org/10.1016/j.procs.2021.08.136" @default.
- W3201979047 hasPublicationYear "2021" @default.
- W3201979047 type Work @default.
- W3201979047 sameAs 3201979047 @default.
- W3201979047 citedByCount "1" @default.
- W3201979047 crossrefType "journal-article" @default.
- W3201979047 hasAuthorship W3201979047A5010635837 @default.
- W3201979047 hasAuthorship W3201979047A5038963939 @default.
- W3201979047 hasAuthorship W3201979047A5084544919 @default.
- W3201979047 hasBestOaLocation W32019790471 @default.
- W3201979047 hasConcept C136764020 @default.
- W3201979047 hasConcept C23123220 @default.
- W3201979047 hasConcept C41008148 @default.
- W3201979047 hasConceptScore W3201979047C136764020 @default.
- W3201979047 hasConceptScore W3201979047C23123220 @default.
- W3201979047 hasConceptScore W3201979047C41008148 @default.
- W3201979047 hasFunder F4320322733 @default.
- W3201979047 hasLocation W32019790471 @default.
- W3201979047 hasOpenAccess W3201979047 @default.
- W3201979047 hasPrimaryLocation W32019790471 @default.
- W3201979047 hasRelatedWork W2101955803 @default.
- W3201979047 hasRelatedWork W2115485936 @default.
- W3201979047 hasRelatedWork W2119214692 @default.
- W3201979047 hasRelatedWork W2144190808 @default.
- W3201979047 hasRelatedWork W2357241418 @default.
- W3201979047 hasRelatedWork W2366644548 @default.
- W3201979047 hasRelatedWork W2376314740 @default.
- W3201979047 hasRelatedWork W2384888906 @default.
- W3201979047 hasRelatedWork W2469626427 @default.
- W3201979047 hasRelatedWork W2748952813 @default.
- W3201979047 hasVolume "192" @default.
- W3201979047 isParatext "false" @default.
- W3201979047 isRetracted "false" @default.
- W3201979047 magId "3201979047" @default.
- W3201979047 workType "article" @default.
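
Each row above follows the shape of the query pattern: a subject, a predicate (`?p`), an object (`?o`), and a graph (`?g`). As a minimal sketch, assuming the plain-text layout shown in this listing (`- SUBJECT PREDICATE OBJECT @GRAPH.`), the rows could be split back into tuples as follows. The names `ROW` and `parse_row` are hypothetical helpers introduced for illustration; they are not part of SemOpenAlex or any library.

```python
import re

# Assumed layout of one listing row: "- SUBJECT PREDICATE OBJECT @GRAPH."
# The object may contain spaces (e.g. a quoted title), so it is matched greedily.
ROW = re.compile(r'^- (\S+) (\S+) (.+) @(\S+)\.$')


def parse_row(line):
    """Split one listing row into (subject, predicate, object, graph)."""
    m = ROW.match(line.strip())
    if m is None:
        raise ValueError(f"not a listing row: {line!r}")
    subject, predicate, obj, graph = m.groups()
    # Quoted objects are literals; drop the surrounding double quotes.
    if len(obj) >= 2 and obj[0] == '"' and obj[-1] == '"':
        obj = obj[1:-1]
    return subject, predicate, obj, graph


# Three rows copied from the listing above.
rows = [
    '- W3201979047 title "Automated anonymization of text documents in Polish" @default.',
    '- W3201979047 cites W1558796178 @default.',
    '- W3201979047 hasVolume "192" @default.',
]

triples = [parse_row(r) for r in rows]

# Collect the objects of every "cites" row.
cited = [o for _, p, o, _ in triples if p == "cites"]
```

Filtering on the predicate, as in the last line, is one way to pull out a single relation such as `cites` or `hasRelatedWork` from the full listing.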