论文信息 - Verification of Pointer Programs

Verification of Pointer Programs

With this dissertation we present an abstraction and verification framework for pointer programs operating on unbounded heaps. To this end, we introduce two different abstraction methods for pointer-manipulating programs: an abstraction technique for singlylinked structures that guarantees a finite abstract semantics for any given program and a more general approach, employing context-free hyperedge replacement graph grammars to model the data structures and compute the abstraction mappings. The graph grammars are user defined and therefore this approach can handle a variety of different data structures. By means of partial concretization steps we avoid the necessity for explicitly defining the effect of pointer-manipulating operations on abstracted parts of the heap: it is obtained “for free” by combining partial concretization, the concrete pointer operation, and re-abstraction of the transformed state. Besides the possibility to check for pointer safety, assuring the absence of null dereferences, and shape safety, the preservation of the data structure, we establish an expressive pointer logic that is based on LTL. It allows to specify safety as well as liveness properties for the executions of the system. We show that the corresponding model checking problem can be reduced to an LTL model checking problem enabling the application of existing, highly optimized model checkers. We show the practical feasibility of our approach by applying it to the well-known Deutsch-Schorr-Waite traversal algorithm for binary trees – a stackless traversal algorithm that uses destructive updates. Finally, we introduce an extension of our framework to concurrent pointer programs with unbounded thread creation. For that purpose we model the control-flow and heap semantics separately as Petri nets. Abstracting the heap only, we obtain a data-abstract semantics for which we can show that the model checking problem is decidable. To obtain practically feasible results, however, we are forced to apply in a second step abstraction to the control-flow semantics as well. It turns out that the resulting Petri net can be represented as a finite transition system, to that our model checking method can be applied. Zusammenfassung In dieser Dissertation stellen wir ein Konzept zur Abstraktion und Verifikation zeigermanipulierender Programme, welche über unbeschränkten Speicher verfügen, vor. Dafür führen wir zwei unterschiedliche Abstraktionstechniken ein: Die erste dient der Abstraktion von einfach verketteten Datenstrukturen und garantiert die Endlichkeit der abstrakten Semantik für alle Eingaben, während ein erweiterter Ansatz Hyperkantenersetzungsgrammatiken zur Modellierung komplexerer Datenstrukturen und zur Berechnung der zugehörigen Abstraktionsabbildungen einsetzt. Die verwendeten Graphgrammatiken werden vom Benutzer vorgegeben und sind daher nicht auf bestimmte Datenstrukturen beschränkt. Durch die Verwendung partieller Konkretisierungsschritte können wir es vermeiden, für jede Programmoperation eine abstrakte Version bereitstellen müssen. Dabei ergibt eine Kombination aus partieller Konkretisierung, Ausführung der konkreten Programmoperation und anschließender Reabstraktion den entsprechenden abstrakten Transformationsschritt. Die vorgestellten Verifikationsmethoden ermöglichen es uns nicht nur, mögliche Laufzeitfehler, wie sie etwa durch Dereferenzierung von Null-Zeigern entstehen, festzustellen, und die Invarianz von Datenstrukturen im Hinblick auf einen gegebenen Algorithmus zu testen. Die Einführung einer auf LTL basierenden, ausdrucksstarkenHeap-Logik erlaubt es uns, auch Sicherheitsund Lebendigkeitseigenschaften aller Läufe des Systems zu überprüfen. Wir zeigen, dass das zugehörige Model Checking-Problem auf LTL Model Checking zurückgeführt werden kann, und somit die Anwendung vorhandener, bewährter Model Checking-Verfahren möglich ist. Anhand des Deutsch-Schorr-Waite-Traversierungsalgorithmus, der ohne zusätzlichen Kellerspeicher oder andere Hilfsstrukturen auskommt, zeigen wir die praktische Anwendbarkeit unseres Konzepts, indem wir verschiedene Korrektheitseigenschaften nachweisen. Abschließend erweitern wir unseren Ansatz um dynamische und unbeschränkte Threaderzeugung zur Laufzeit. Dazu modellieren wir Kontrollflussund Heapsemantik unabhängig voneinander als Petri-Netze. Durch Abstraktion des Heaps erhalten wir eine datenabstrakte Semantik, für die die Entscheidbarkeit des Model Checking-Problems nachgewiesen werden kann. Für praktisch nutzbare Ergebnisse sind wir jedoch gezwungen, in einem zweiten Schritt auch die Kontrollflusssemantik zu abstrahieren. Das daraus resultierende Petri-Netz kann als endliches Transitionssystem dargestellt werden, auf welches wiederum unsere Verifikationsverfahren anwendbar sind.

Stefan Rieger | Stefan Rieger

[1] Neil Immerman,et al. Simulating Reachability Using First-Order Logic with Applications to Verification of Linked Data Structures , 2005, CADE.

[2] John C. Reynolds,et al. Separation logic: a logic for shared mutable data structures , 2002, Proceedings 17th Annual IEEE Symposium on Logic in Computer Science.

[3] Annegret Habel,et al. Hyperedge Replacement, Graph Grammars , 1997, Handbook of Graph Grammars.

[4] Reinhard Wilhelm,et al. Solving shape-analysis problems in languages with destructive updating , 1998, TOPL.

[5] Hsu-Chun Yen,et al. A Unified Approach for Deciding the Existence of Certain Petri Net Paths , 1992, Inf. Comput..

[6] Thomas Noll,et al. Juggrnaut: Graph Grammar Abstraction for Unbounded Heap Structures , 2010, TTSS.

[7] J. A. Robinson,et al. Handbook of Automated Reasoning (in 2 volumes) , 2001 .

[8] Andreas Podelski,et al. Boolean Heaps , 2005, SAS.

[9] Martin C. Rinard,et al. Pointer and escape analysis for multithreaded programs , 2001, PPoPP '01.

[10] Shin Nakajima,et al. The SPIN Model Checker : Primer and Reference Manual , 2004 .

[11] Jan Friso Groote,et al. An Efficient Algorithm for Branching Bisimulation and Stuttering Equivalence , 1990, ICALP.