Color Aesthetics and Social Networks in Complete Tang Poems: Explorations and Discoveries

The Complete Tang Poems (CTP) is the most important source to study Tang poems. We look into CTP with computational tools from specific linguistic perspectives, including distributional semantics and collocational analysis. From such quantitative viewpoints, we compare the usage of "wind" and "moon" in the poems of Li Bai and Du Fu. Colors in poems function like sounds in movies, and play a crucial role in the imageries of poems. Thus, words for colors are studied, and "white" is the main focus because it is the most frequent color in CTP. We also explore some cases of using colored words in antithesis pairs that were central for fostering the imageries of the poems. CTP also contains useful historical information, and we extract person names in CTP to study the social networks of the Tang poets. Such information can then be integrated with the China Biographical Database of Harvard University.

[1]  G. Miller,et al.  Contextual correlates of semantic similarity , 1991 .

[2]  Hu Junfeng The Computer Aided Research Work of Chinese Ancient Poems , 2001 .

[3]  Chao-Lin Liu,et al.  《全唐詩》的分析、探勘與應用-風格、對仗、社會網路與對聯(Textual Analysis of Complete Tang Poems for Discoveries and Applications - Style, Antitheses, Social Networks, and Couplets)[In Chinese] , 2015, ROCLING.

[4]  Lee-Feng Chien,et al.  PAT-tree-based keyword extraction for Chinese information retrieval , 1997, SIGIR '97.

[5]  J. R. Firth,et al.  A Synopsis of Linguistic Theory, 1930-1955 , 1957 .

[6]  Jack W. Chen The Poetics of Sovereignty: On Emperor Taizong of the Tang Dynasty , 2011 .

[7]  Dekang Lin,et al.  Automatic Retrieval and Clustering of Similar Words , 1998, ACL.

[8]  Dekang Lin Automatic Retrieval and Clustering of Similar Words , 2022, COLING.

[9]  Chu-Ren Huang,et al.  From General Ontology to Specialized Ontology: A study based on a single author historical corpus , 2005, OntoLex@IJCNLP.

[10]  John Lee,et al.  A Dependency Treebank of Classical Chinese Poems , 2012, NAACL.

[11]  Long Jiang,et al.  Generating Chinese Couplets and Quatrain Using a Statistical Approach , 2009, PACLIC.

[12]  Long Jiang,et al.  Generating Chinese Couplets using a Statistical MT Approach , 2008, COLING.

[13]  John Lee,et al.  Treebanking for Data-driven Research in the Classroom , 2013 .

[14]  Peggy Cellier,et al.  What about Sequential Data Mining Techniques to Identify Linguistic Patterns for Stylistics? , 2012, CICLing.

[15]  John Lee,et al.  Glimpses of Ancient China from Classical Chinese Poems , 2012, COLING.

[16]  Chu-Ren Huang Text-based Construction and Comparison of Domain Ontology : A Study Based on Classical Poetry , 2004, PACLIC.

[17]  Daniel Jurafsky,et al.  Tradition and Modernity in 20th Century Chinese Poetry , 2013, CLfL@NAACL-HLT.

[18]  Geoffrey Williams Collocational networks: Interlocking patterns of lexis in a Corpusof plant biology research articles , 1998 .

[19]  John Lee A Classical Chinese Corpus with Nested Part-of-Speech Tags , 2012, LaTeCH@EACL.

[20]  Alex Chengyu Fang,et al.  Adapting NLP and Corpus Analysis Techniques to Structured Imagery Analysis in Classical Chinese Poetry , 2009 .