A Note on Entropy of Telugu Prose

An optimum code is constructed for the Telugu alphabet using the proportions of letters estimated from a large sample of Telugu prose. The unbiased estimates of one-gram entropies of the different forms of prose writings are obtained. It is shown that this entropy can neither be treated as a language characteristic nor as a style characteristic. Further the digram entropy and an approximation of the entropy of Telugu prose are obtained.

[1]  G. Basharin On a Statistical Estimate for the Entropy of a Sequence of Independent Random Variables , 1959 .

[2]  G. Siromoney,et al.  Style as Information in Karnatic Music , 1964 .

[3]  Robert B. Ash,et al.  Information Theory , 2020, The SAGE International Encyclopedia of Mass Media and Society.

[4]  Gustav Herdan,et al.  Language as choice and chance , 1957 .

[5]  L. Brillouin,et al.  Science and information theory , 1956 .

[6]  J. Licklider,et al.  Long-range constraints in the statistical structure of printed English. , 1955, The American journal of psychology.

[7]  Gift Siromoney,et al.  Entropy of Tamil Prose , 1963, Inf. Control..

[8]  Fazlollah M. Reza,et al.  Introduction to Information Theory , 2004, Lecture Notes in Electrical Engineering.

[9]  Claude E. Shannon,et al.  Prediction and Entropy of Printed English , 1951 .

[10]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.