Estimating LOC for information systems from their conceptual data models

Effort and cost estimation are crucial in software management, and estimating software size plays a key role in both. Lines of Code (LOC) is still a commonly used measure of software size. Although software sizing has been recognized as an important problem for more than two decades, existing methods still have many shortcomings. Conceptual data models are widely used in requirements analysis for information systems, and they are not difficult to construct in the early stages of development. Many characteristics of an information system are reflected in its conceptual data model. We explore the use of conceptual data models for estimating LOC. This paper proposes a novel method for estimating the LOC of an information system from its conceptual data model using a multiple linear regression model. We have validated the method on samples collected from both industry and open-source systems.
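To make the idea of a multiple linear regression over data-model measurements concrete, here is a minimal sketch in Python. It is not the paper's model: the abstract does not name the predictors, so the three counts used here (entities, relationships, attributes) are assumptions, and the sample values and coefficients are synthetic, chosen only so that the fit can be checked. The helper names `fit_linear_regression` and `predict_loc` are likewise hypothetical.

```python
def fit_linear_regression(X, y):
    """Ordinary least squares via the normal equations (X^T X) b = X^T y.

    Returns [intercept, b1, b2, ...].
    """
    rows = [[1.0] + [float(v) for v in r] for r in X]  # prepend intercept column
    k = len(rows[0])
    # Augmented matrix [X^T X | X^T y]
    A = [
        [sum(r[i] * r[j] for r in rows) for j in range(k)]
        + [sum(r[i] * t for r, t in zip(rows, y))]
        for i in range(k)
    ]
    # Gauss-Jordan elimination with partial pivoting
    for col in range(k):
        pivot = max(range(col, k), key=lambda i: abs(A[i][col]))
        if abs(A[pivot][col]) < 1e-12:
            raise ValueError("predictors are collinear; cannot fit")
        A[col], A[pivot] = A[pivot], A[col]
        for i in range(k):
            if i != col:
                f = A[i][col] / A[col][col]
                A[i] = [a - f * b for a, b in zip(A[i], A[col])]
    return [A[i][k] / A[i][i] for i in range(k)]


def predict_loc(coeffs, features):
    """Apply the fitted model: intercept + sum of coefficient * feature."""
    return coeffs[0] + sum(c * x for c, x in zip(coeffs[1:], features))


# Synthetic samples (NOT real project data): each tuple is
# (entity count, relationship count, attribute count) for one system.
samples = [(5, 4, 40), (8, 10, 70), (12, 9, 110),
           (20, 25, 150), (15, 12, 90), (30, 28, 260)]
# LOC generated from made-up coefficients so the fit is verifiable.
true_coeffs = [1000.0, 300.0, 150.0, 20.0]
loc = [predict_loc(true_coeffs, s) for s in samples]

coeffs = fit_linear_regression(samples, loc)
estimate = predict_loc(coeffs, (10, 8, 80))  # size estimate for a new data model
```

With real data the observed LOC would not lie exactly on a plane, so the fitted coefficients would carry error and the model would need the usual regression diagnostics before being trusted.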
