A Study on Text Mining and a Chinese Text Mining Framework
Text mining, also known as text data mining or knowledge discovery in texts, focuses on the computerized exploration of large amounts of text and on the discovery of implicit, previously unknown, and potentially useful patterns within it. First, text mining is introduced, including its definition, characteristics, and progress. Then, the problems and research directions of Chinese text mining are identified, based on an analysis of the state of the art in Chinese text mining research. Finally, the Unified Chinese Text Mining Framework (UCTMF) is presented. The framework is hierarchical, open, and scalable. It provides a unified, public framework for Chinese text mining systems.