An Chinese Couplet Generation Model Based on Statistics and Rules

This paper presents an approach to computer generation of Chinese couplets.After dividing the composition of Chinese couplets into hard rules and soft rules,this paper further points out the soft rules consists of character correspondence and context correspondence.A probabilistic graphical model is proposed for couplet generation based on the soft rules,with parameters estimated by EM(Expectation-Maximization) algorithm.The decoding of the model integrates hard rules as heuristics.The experiment result demonstrates that the candidate characters produced by this model are better than those produced simply by frequency.The model can even learn parameters from the data set containing some couplets with poor quality.The couplet generation program implemented by this approach bears an acceptable performance.