Capturing ghosts: predicting the used IPv4 space by inferring unobserved addresses

The pool of unused routable IPv4 prefixes is dwindling, with less than 4% remaining for allocation at the end of June 2014. Yet the adoption of IPv6 remains slow. We demonstrate a new capture-recapture technique for improved estimation of the size of "IPv4 reserves" (allocated yet unused IPv4 addresses or routable prefixes) from multiple incomplete data sources. A key contribution of our approach is the plausible estimation of both the observed and the unobserved-yet-active ("ghost") IPv4 address space. This significantly improves our community's understanding of IPv4 address space exhaustion and the likely pressure for IPv6 adoption. Using "ping scans", network traces and server logs, we estimate that 6.3 million /24 subnets and 1.2 billion IPv4 addresses are currently in use (roughly 60% and 45% of the publicly routed space, respectively). We also show how utilisation has changed over the last 2-3 years and provide an up-to-date estimate of the potentially usable remaining IPv4 space.
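
To make the idea of inferring unobserved addresses concrete, here is a minimal sketch of the classic two-source Lincoln-Petersen estimator, together with Chapman's small-sample correction. It is not the method of this paper, which combines more than two sources. It is the simplest member of the capture-recapture family and assumes the two sources sample a closed population independently. The function names, the variable names and the toy address sets are all hypothetical.

```python
def lincoln_petersen(source_a, source_b):
    """Two-source estimate of the total population (observed plus unobserved).

    Assumes a closed population and independent sources (a simplification).
    """
    a, b = set(source_a), set(source_b)
    overlap = len(a & b)  # addresses "recaptured" by the second source
    if overlap == 0:
        raise ValueError("sources share no addresses; estimate is undefined")
    return len(a) * len(b) / overlap


def chapman(source_a, source_b):
    """Chapman's variant, which is less biased when the overlap is small."""
    a, b = set(source_a), set(source_b)
    overlap = len(a & b)
    return (len(a) + 1) * (len(b) + 1) / (overlap + 1) - 1


# Hypothetical toy data: two incomplete views of the same address block.
ping_scan = {f"192.0.2.{i}" for i in range(0, 60)}     # 60 addresses
server_log = {f"192.0.2.{i}" for i in range(40, 100)}  # 60 addresses, 20 shared

observed = len(ping_scan | server_log)              # seen by at least one source
estimate = lincoln_petersen(ping_scan, server_log)  # estimated total in use
ghosts = estimate - observed                        # estimated unobserved-yet-active

print(f"observed={observed}, estimated total={estimate:.0f}, ghosts={ghosts:.0f}")
```

With real measurement data the independence assumption rarely holds, because some hosts are systematically more visible than others. That is one reason to combine several sources with models that allow for dependence between them rather than rely on a two-source formula.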
