EmpiriST: AIPHES - Robust Tokenization and POS-Tagging for Different Genres
Christian Biemann | Judith Eckle-Kohler | Thomas Arnold | Margot Mieskes | Christian M. Meyer | Darina Benikova | Steffen Remus | Gerold Hintz
[1] Andrew McCallum, et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data, 2001, ICML.
[2] Wolfgang Lezius, et al. TIGER: Linguistic Interpretation of a German Corpus, 2004.
[3] Jason Weston, et al. Natural Language Processing (Almost) from Scratch, 2011, J. Mach. Learn. Res.
[4] Chunyu Kit, et al. Tokenization as the Initial Phase in NLP, 1992, COLING.
[5] Mihai Surdeanu, et al. The Stanford CoreNLP Natural Language Processing Toolkit, 2014, ACL.
[6] David A. Ferrucci, et al. UIMA: an architectural approach to unstructured information processing in the corporate research environment, 2004, Natural Language Engineering.
[7] Thorsten Brants, et al. TnT – A Statistical Part-of-Speech Tagger, 2000, ANLP.
[8] Christian Biemann, et al. GermaNER: Free Open German Named Entity Recognition Tool, 2015, GSCL.
[9] Bryan Jurish, et al. Word and Sentence Tokenization with Hidden Markov Models, 2013, J. Lang. Technol. Comput. Linguistics.
[10] Christian Biemann, et al. Text: now in 2D! A framework for lexical expansion with contextual similarity, 2013, J. Lang. Model.
[11] Tibor Kiss, et al. Unsupervised Multilingual Sentence Boundary Detection, 2006, CL.
[12] Praveen Paritosh, et al. Freebase: a collaboratively created graph database for structuring human knowledge, 2008, SIGMOD Conference.