Rechtliche Bedingungen für die Bereitstellung eines Chat-Korpus in CLARIN-D: Ergebnisse eines Rechtsgutachtens [Legal Conditions for Making a Chat Corpus Available in CLARIN-D: Results of a Legal Opinion]
Angelika Storrer | Michael Beißwenger | Julia Wildgans | John H. Weitzmann | Axel Herold | Harald Lüngen | Jan Schallaböck | Pawel Kamocki
[1] Swantje Westpfahl, et al. STTS 2.0? Improving the Tagset for the Part-of-Speech-Tagging of German Spoken Data, 2014, LAW@COLING.
[2] Swantje Westpfahl, et al. FOLK-Gold – A Gold Standard for Part-of-Speech-Tagging of Spoken German, 2016, LREC.
[3] Jörg Höhne. Verfahren zur Anonymisierung von Einzeldaten, 2010.
[4] Harald Lüngen, et al. A TEI P5 Document Grammar for the IDS Text Model, 2012.
[5] Michael Beißwenger. Das Dortmunder Chat-Korpus, 2013.
[6] Ben Medlock. An Introduction to NLP-based Textual Anonymisation, 2006, LREC.
[7] Thomas Bartz, et al. Optimierung des Stuttgart-Tübingen-Tagset für die linguistische Annotation von Korpora zur internetbasierten Kommunikation: Phänomene, Herausforderungen, Erweiterungsvorschläge, 2013, J. Lang. Technol. Comput. Linguistics.
[8] Harald Lüngen, et al. Integrating corpora of computer-mediated communication in CLARIN-D: Results from the curation project ChatCorpus2CLARIN, 2016, KONVENS.
[9] Eliza Margaretha, et al. Building Linguistic Corpora from Wikipedia Articles and Discussions, 2014, J. Lang. Technol. Comput. Linguistics.
[10] Benoît Sagot, et al. The CoMeRe corpus for French: structuring and annotating heterogeneous CMC genres, 2014, J. Lang. Technol. Comput. Linguistics.
[11] Erhard W. Hinrichs, et al. The TüBa-D/Z Treebank: Annotating German with a Context-Free Backbone, 2004, LREC.
[12] Angelika Storrer, et al. A TEI Schema for the Representation of Computer-mediated Communication, 2012.
[13] Rachel Panckhurst. A Large SMS Corpus in French: From Design and Collation to Anonymisation, Transcoding and Analysis, 2013.
[14] Joachim Bingel, et al. Named Entity Tagging a Very Large Unbalanced Corpus: Training and Evaluating NE Classifiers, 2014, LREC.
[15] Stefan Thater, et al. Improving the Performance of Standard Part-of-Speech Taggers for Computer-Mediated Communication, 2014, KONVENS.
[16] Stefan Evert, et al. EmpiriST 2015: A Shared Task on the Automatic Linguistic Annotation of Computer-Mediated Communication and Web Corpora, 2016, WAC@ACL.