Can Basic ML Techniques Illuminate Rateless Erasure Codes?

Rateless erasure codes are a recently developed, near-optimal channel coding technique that guarantees low overhead and fast decoding. Both the underlying theory and current implementations of these codes assume that a network transmitter encodes according to a pre-specified probability distribution. In this report, we use basic machine learning techniques to try to understand what happens when this assumption is false. We train several classes of models on features that describe the empirical distribution realized at a network receiver, and we investigate whether these models can “learn” to predict whether a given encoding will require extra overhead. Our results are mixed.
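
To make "features that describe the empirical distribution realized at a network receiver" concrete, the sketch below computes one plausible feature: a Pearson chi-square statistic comparing the degrees observed at a receiver against a pre-specified degree distribution. It rests on assumptions not stated in the abstract. The ideal soliton distribution stands in for the pre-specified distribution, and the function names `ideal_soliton` and `chi_square_statistic` are hypothetical. The features actually used in the report may differ.

```python
from collections import Counter


def ideal_soliton(k):
    """Ideal soliton distribution over degrees 1..k (assumed target distribution).

    rho(1) = 1/k and rho(d) = 1/(d*(d-1)) for d = 2..k; the values sum to 1.
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    dist = {1: 1.0 / k}
    for d in range(2, k + 1):
        dist[d] = 1.0 / (d * (d - 1))
    return dist


def chi_square_statistic(observed_degrees, expected_dist):
    """Pearson chi-square statistic: sum over degrees of (O - E)^2 / E.

    observed_degrees: list of encoded-block degrees seen at the receiver.
    expected_dist: {degree: probability} the transmitter is assumed to use.
    Observed degrees outside expected_dist are ignored in this sketch.
    """
    n = len(observed_degrees)
    counts = Counter(observed_degrees)
    stat = 0.0
    for degree, p in expected_dist.items():
        expected = n * p
        stat += (counts.get(degree, 0) - expected) ** 2 / expected
    return stat


if __name__ == "__main__":
    target = ideal_soliton(4)
    received = [1, 2, 2, 2, 3, 4, 2, 1]  # made-up degrees for illustration
    print(chi_square_statistic(received, target))
```

A larger statistic indicates a larger departure from the assumed distribution. A value like this could serve as one input feature to a classifier that predicts whether an encoding will need extra overhead.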
