A study and application of textual information extraction and discriminative learning for text analysis