Determining Process Death Based on Censored Activity Data

This article addresses the problem of estimating the time of apparent death in a binary stochastic process. We show that, when only censored data are available, a fitted logistic regression model may estimate the time of death incorrectly. We improve this estimation by utilizing discrete-event simulation to produce simulated complete time series data. The proposed methodology may be applied to situations where time of death cannot be formally determined and has to be estimated based on prolonged inactivity. As an illustration, we use observed monthly activity patterns from 300 real Open Source Software development projects sampled from Sourceforge.net.
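As a rough illustration of why prolonged inactivity is only indirect evidence of death, the following sketch simulates complete monthly activity histories and then looks only at the censored observation window. It is not the authors' simulation model: the per-month activity probability `p_active`, the constant monthly death probability `hazard`, the 36-month `horizon`, and all function names are assumptions chosen for illustration.

```python
import random


def simulate_project(p_active, hazard, horizon, rng):
    """Simulate one project's monthly binary activity over the observation window.

    Returns (activity, death_month). death_month is the first month in which
    the project is dead, or None if it is still alive when observation ends.
    """
    activity = []
    death_month = None
    for month in range(horizon):
        # Hypothetical constant monthly hazard of dying while still alive.
        if death_month is None and rng.random() < hazard:
            death_month = month
        alive = death_month is None
        # A living project is active in a given month with probability p_active;
        # a dead project is never active.
        activity.append(1 if alive and rng.random() < p_active else 0)
    return activity, death_month


def trailing_inactivity(activity):
    """Number of consecutive inactive months at the end of the window."""
    run = 0
    for active in reversed(activity):
        if active:
            break
        run += 1
    return run


def dead_fraction_by_gap(n_projects=20000, p_active=0.6, hazard=0.02,
                         horizon=36, seed=0):
    """Fraction of simulated projects that are truly dead, per trailing gap length."""
    rng = random.Random(seed)
    counts = {}  # gap length -> (number dead, number of projects)
    for _ in range(n_projects):
        activity, death_month = simulate_project(p_active, hazard, horizon, rng)
        gap = trailing_inactivity(activity)
        dead, total = counts.get(gap, (0, 0))
        counts[gap] = (dead + (death_month is not None), total + 1)
    return {gap: dead / total for gap, (dead, total) in sorted(counts.items())}


if __name__ == "__main__":
    for gap, fraction in dead_fraction_by_gap().items():
        print(f"trailing inactive months = {gap:2d}: fraction truly dead = {fraction:.3f}")
```

Because the simulation knows each project's true state, it can report how often a given run of trailing inactive months corresponds to a project that is actually dead, which the censored activity record alone cannot reveal.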
