Lamb - A Lexical Analyzer with Ambiguity Support
Lexical ambiguities may naturally arise in language specifications. We present Lamb, a lexical analyzer that captures the overlapping tokens these ambiguities produce. Lamb scans the input string and produces a lexical analysis graph that describes every sequence of tokens that can be found within the string. The graph can then be fed to a parser, which discards any token sequence that does not yield a syntactically valid sentence. In summary, our approach enables context-sensitive lexical analysis that supports lexically ambiguous language specifications.
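To make the idea of a lexical analysis graph concrete, the following is a minimal Python sketch and not Lamb's actual implementation. The token set (`INT`, `REAL`, `DOT`), the function names `build_lexical_graph` and `token_sequences`, and the graph representation (a dictionary from start offset to outgoing token edges) are all assumptions made for illustration. The sketch also assumes one greedy regular-expression match per token type per position and no whitespace handling.

```python
import re
from collections import defaultdict

# Hypothetical token specification, chosen so that the tokens overlap.
TOKEN_PATTERNS = [
    ("INT", re.compile(r"\d+")),
    ("REAL", re.compile(r"\d+\.\d+")),
    ("DOT", re.compile(r"\.")),
]


def build_lexical_graph(text, patterns=TOKEN_PATTERNS):
    """Map each start offset to its (name, lexeme, end) token edges.

    Only edges that lie on a complete path from offset 0 to len(text)
    are kept, so every path through the graph tokenizes the whole input.
    """
    n = len(text)

    # Try every token type at every offset instead of committing to one match.
    edges = defaultdict(list)
    for start in range(n):
        for name, pattern in patterns:
            m = pattern.match(text, start)
            if m and m.end() > start:
                edges[start].append((name, m.group(), m.end()))

    # Backward pass: offsets from which the end of the input is reachable.
    alive = {n}
    for start in range(n - 1, -1, -1):
        if any(end in alive for _, _, end in edges[start]):
            alive.add(start)

    # Forward pass: keep only edges reachable from offset 0.
    graph = {}
    frontier = [0] if 0 in alive else []
    seen = set(frontier)
    while frontier:
        start = frontier.pop()
        kept = [edge for edge in edges[start] if edge[2] in alive]
        if kept:
            graph[start] = kept
        for _, _, end in kept:
            if end not in seen:
                seen.add(end)
                frontier.append(end)
    return graph


def token_sequences(graph, n, start=0):
    """Yield every token sequence that covers text[start:n]."""
    if start == n:
        yield []
        return
    for name, lexeme, end in graph.get(start, []):
        for rest in token_sequences(graph, n, end):
            yield [(name, lexeme)] + rest


if __name__ == "__main__":
    source = "1.5"
    graph = build_lexical_graph(source)
    for sequence in token_sequences(graph, len(source)):
        print(sequence)
```

For the input `1.5`, the graph holds two complete paths: `INT DOT INT` and a single `REAL`. In the approach described above, a parser consuming the graph would keep only the paths that form a valid sentence of the grammar; enumerating every path explicitly, as `token_sequences` does, is only practical for small inputs.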