A connectionist-based model for predicting the linguistic origin of surnames

This paper describes an application to the prediction of the linguistic origin of surnames. The problem can be stated as follows: "Given an input string representing a surname, decide which language the surname belongs to". We present an approach that integrates methodologies from rule-based systems, evidential reasoning, and neural networks. Our hybrid solution maximizes the used information and allows one to deal with aspects of the problem that could have not been solved otherwise. In fact, our predictor exploits both knowledge from experts on languages (by means of a rule-based system) and knowledge automatically acquired by examples (through statistical analysis and a neural network).