Memory-Efficient and Parallel Simulation of Super Carbon Nanotubes

Carbon nanotubes (CNTs) received much attention since their description in Nature in 1991. In principle, a carbon nanotube is a rolled up sheet of graphene, which can be imagined as a honeycomb grid of carbon atoms. This allotrope of carbon has many interesting properties like high tensile strength at very low weight or its high temperature resistance. This motivates the application of CNTs in material science to create new carbon nanotube enforced materials. They also possess interesting electronic properties since CNTs show either metallic or semiconducting behavior, depending on their configuration. The synthesis of branched carbon nanotubes allows the connection of straight CNTs to carbon nanotubes networks with branched tubes employed as junction elements. One of these networks are the so-called super carbon nanotubes (SCNTs) that were proposed in 2006. In that case, each carbon-carbon bond within the honeycomb grid is replaced by a CNT of equal size and each carbon atom by a Y-branched tube with three arms of equal length and a regular angle of 120° between the arms. This results in a structure that originates from tubes and regains the outer shape of a tube. It is also possible to repeat this process, replacing carbon-carbon bonds not with CNTs but with SCNTs, leading to very regular and self-similar structures of increasingly higher orders. Simulations demonstrate that the SCNTs also exhibit very interesting mechanical properties. They are even more flexible than CNTs and thus are good candidates for high strength com- posites or actuators with very low weight. Other applications arise again in microelectronics because of their configurable electronic behavior and in biology due to the biocompatibility of SCNTs. Despite progress in synthesizing processes for straight and branched CNTs, the production of SCNTs is still beyond current technological capabilities. In addition, real experiments at nanoscale are expensive and complex and hence, simulations are important to predict properties of SCNTs and to guide the experimental research. The atomic-scale finite element method (AFEM) already provides a well-established approach for simulations of CNTs at the atomic level. However, the model size of SCNTs grows very fast for larger tubes and the arising n-body and linear equation systems quickly exceed the memory capacity of available computer systems. This renders infeasible the simulation of large SCNTs on an atomic level, unless the regular structure of SCNTs can be taken into account to reduce the memory footprint. This thesis presents ways to exploit the symmetry and hierarchy within SCNTs enabling the simulation of higher order SCNTs. We develop structure-tailored and memory-saving data struc- tures which allow the storage of very large SCNTs models up to several billions of atoms while providing fast data access. We realize this with a novel graph data structure called Compressed Symmetric Graphs which is able to dynamically recompute large parts of structural information for tubes instead of storing them. We also present a new structure-aware and SMP-parallelized matrix-free solver for the linear equation systems involving the stiffness matrix, which employs an efficient caching mechanism for the data during the sparse matrix-vector multiplication. The matrix-free solver is twice as fast as a compressed row storage format-based reference solver, requiring only half the memory while caching all contributions of the matrix employed. We demonstrate that this solver, in combination with the Compressed Symmetric Graphs, is able to instantiate equation systems with matrices of an order higher than 5∗10^7 on a single compute node, while still fully caching all matrix data.

[1]  Mat Buckland,et al.  Programming Game AI by Example , 2004 .

[2]  Chinya V. Ravishankar,et al.  Block-Oriented Compression Techniques for Large Statistical Databases , 1997, IEEE Trans. Knowl. Data Eng..

[3]  J. A. Bondy,et al.  Graph Theory with Applications , 1978 .

[4]  J. Meijerink,et al.  An iterative solution method for linear systems of which the coefficient matrix is a symmetric -matrix , 1977 .

[5]  S. Seraphin,et al.  Complex branching phenomena in the growth of carbon nanotubes , 1995 .

[6]  Riichiro Saito,et al.  Electronic structure of chiral graphene tubules , 1992 .

[7]  Ado Jorio,et al.  Atomistic simulations of the mechanical properties of ‘super’ carbon nanotubes , 2007, 0707.3961.

[8]  Chih-Jen Lin,et al.  Incomplete Cholesky Factorizations with Limited Memory , 1999, SIAM J. Sci. Comput..

[9]  Raymond H. Chan,et al.  Conjugate Gradient Methods for Toeplitz Systems , 1996, SIAM Rev..

[10]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[11]  Michele Benzi,et al.  A Sparse Approximate Inverse Preconditioner for the Conjugate Gradient Method , 1996, SIAM J. Sci. Comput..

[12]  X. Qiu,et al.  A comprehensive study on the mechanical properties of super carbon nanotubes , 2008 .

[13]  Joseph L. Greathouse,et al.  Efficient Sparse Matrix-Vector Multiplication on GPUs Using the CSR Storage Format , 2014, SC14: International Conference for High Performance Computing, Networking, Storage and Analysis.

[14]  T. Mehlhorn,et al.  PHYSICS OF CARBON NANOTUBES , 1995 .

[15]  Abraham Lempel,et al.  A universal algorithm for sequential data compression , 1977, IEEE Trans. Inf. Theory.

[16]  Christian H. Bischof,et al.  An Improved Algorithm for Simulating the Mechanical Behavior of Super Carbon Nanotubes , 2015, 2015 IEEE 18th International Conference on Computational Science and Engineering.

[17]  R. Ehlich,et al.  Y-branching of single walled carbon nanotubes , 2000 .

[18]  X. Qiu,et al.  Chirality independence in critical buckling forces of super carbon nanotubes , 2008 .

[19]  Bui Tuong Phong Illumination for computer generated pictures , 1975, Commun. ACM.

[20]  Huang Yong-gang,et al.  Nonlinear Deformation Processes and Damage Modes of Super Carbon Nanotubes with Armchair-Armchair Topology , 2008 .

[21]  G. M. Ostrovsky,et al.  Optimization of large-scale complex systems , 1986 .

[22]  Ying Li,et al.  Super carbon nanotubes, fractal super tubes and fractal super fibres , 2010 .

[23]  Enrico Clementi,et al.  Atomic Screening Constants from SCF Functions , 1963 .

[24]  Christian H. Bischof,et al.  Compressed symmetric graphs for the simulation of super carbon nanotubes , 2016, 2016 International Conference on High Performance Computing & Simulation (HPCS).

[25]  Wonbong Choi,et al.  Synthesis of Y-junction single-wall carbon nanotubes , 2005 .

[26]  Yonggang Huang,et al.  Atomic-scale finite element method in multiscale computation with applications to carbon nanotubes , 2005 .

[27]  Xi-Qiao Feng,et al.  Superior flexibility of super carbon nanotubes: Molecular dynamics simulations , 2007 .

[28]  Chunyu Li,et al.  A STRUCTURAL MECHANICS APPROACH FOR THE ANALYSIS OF CARBON NANOTUBES , 2003 .

[29]  Olaf Schenk,et al.  Fast Methods for Computing Selected Elements of the Green's Function in Massively Parallel Nanoelectronic Device Simulations , 2013, Euro-Par.

[30]  J. Shewchuk An Introduction to the Conjugate Gradient Method Without the Agonizing Pain , 1994 .

[31]  Sergiy Fialko Application of AVX (Advanced Vector Extensions) for improved performance of the PARFES - finite element Parallel Direct Solver , 2013, 2013 Federated Conference on Computer Science and Information Systems.

[32]  S. L. Mayo,et al.  DREIDING: A generic force field for molecular simulations , 1990 .

[33]  J. Nagy,et al.  From straight carbon nanotubes to Y-branched and coiled carbon nanotubes , 2002 .

[34]  Christian H. Bischof,et al.  Exploiting Structural Properties During Carbon Nanotube Simulation , 2015, ICCSA.

[35]  Anna C. Gilbert,et al.  Compressing Network Graphs , 2004 .

[36]  Eduardo F. D'Azevedo,et al.  Vectorized Sparse Matrix Multiply for Compressed Row Storage Format , 2005, International Conference on Computational Science.

[37]  Gene H. Golub,et al.  Matrix computations (3rd ed.) , 1996 .

[38]  Jose-Ignacio Agulleiro,et al.  Tomo3D 2.0--exploitation of advanced vector extensions (AVX) for 3D reconstruction. , 2015, Journal of structural biology.

[39]  Sebastiano Vigna,et al.  The webgraph framework I: compression techniques , 2004, WWW '04.

[40]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[41]  Xiong Zhang,et al.  Mechanical properties of super honeycomb structures based on carbon nanotubes , 2007, Nanotechnology.

[42]  S. Pellegrino,et al.  AN INTRODUCTION TO THE ANALYSIS OF SYMMETRIC STRUCTURES , 1999 .

[43]  James Demmel,et al.  SuperLU_DIST: A scalable distributed-memory sparse direct solver for unsymmetric linear systems , 2003, TOMS.

[44]  Boris Schling The Boost C++ Libraries , 2011 .

[45]  M. Benzi Preconditioning techniques for large linear systems: a survey , 2002 .

[46]  Yajun Yin,et al.  Geometric conditions for fractal super carbon nanotubes with strict self-similarities , 2008 .

[47]  S. Ashby,et al.  A parallel multigrid preconditioned conjugate gradient algorithm for groundwater flow simulations , 1996 .

[48]  Sylvain Lefebvre,et al.  Perfect spatial hashing , 2006, SIGGRAPH 2006.

[49]  J. M. Bull,et al.  Measuring Synchronisation and Scheduling Overheads in OpenMP , 2007 .

[50]  A. Oberlin,et al.  Filamentous growth of carbon through benzene decomposition , 1976 .

[51]  D. Brenner,et al.  Empirical potential for hydrocarbons for use in simulating the chemical vapor deposition of diamond films. , 1990, Physical review. B, Condensed matter.

[52]  Ataru Tanikawa,et al.  N-body simulation for self-gravitating collisional systems with a new SIMD instruction set extension to the x86 architecture, Advanced Vector eXtensions , 2012 .

[53]  B. Alder,et al.  Studies in Molecular Dynamics. I. General Method , 1959 .

[54]  Nisheeth Shrivastava,et al.  Graph summarization with bounded error , 2008, SIGMOD Conference.

[55]  Jörg Schröder,et al.  Technische Mechanik : Band 1 : Statik , 2004 .

[56]  Christian Schröppel,et al.  Algebraic graph theory and its applications for mesh generation , 2012 .

[57]  Christian Bischof,et al.  Using Instancing to Efficiently Render Carbon Nanotubes , 2014 .

[58]  Silvio Lattanzi,et al.  On compressing social networks , 2009, KDD.

[59]  John H. Reif,et al.  Efficient lossless compression of trees and graphs , 1996, Proceedings of Data Compression Conference - DCC '96.

[60]  M. Terrones,et al.  Covalent 2D and 3D networks from 1D nanostructures: designing new materials. , 2007, Nano letters.

[61]  M. Hestenes,et al.  Methods of conjugate gradients for solving linear systems , 1952 .

[62]  Peter Weiner,et al.  Linear Pattern Matching Algorithms , 1973, SWAT.

[63]  Benedict,et al.  Pure carbon nanoscale devices: Nanotube heterojunctions. , 1996, Physical review letters.

[64]  J. Mark Bull,et al.  A microbenchmark suite for OpenMP 2.0 , 2001, CARN.

[65]  K. Koohestani,et al.  Exploitation of symmetry in graphs with applications to finite and boundary elements analysis , 2012 .

[66]  Edward M. McCreight,et al.  A Space-Economical Suffix Tree Construction Algorithm , 1976, JACM.

[67]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[68]  R. Ruoff,et al.  Mechanical and thermal properties of carbon nanotubes , 1995 .

[69]  Rudolf A. Römer,et al.  On large‐scale diagonalization techniques for the Anderson model of localization , 2005, SIAM J. Sci. Comput..

[70]  Christian H. Bischof,et al.  Methods to model and simulate super carbon nanotubes of higher order , 2017, Concurr. Comput. Pract. Exp..

[71]  R. Baughman,et al.  Carbon nanotubes: An explosive thrust for nanotubes. , 2010, Nature materials.

[72]  Yajun Yin,et al.  Geometric conservation laws for perfect Y-branched carbon nanotubes , 2006 .

[73]  Jim X. Chen,et al.  OpenGL Shading Language , 2009 .

[74]  Christian H. Bischof,et al.  A Unified and Memory Efficient Framework for Simulating Mechanical Behavior of Carbon Nanotubes , 2015, ICCS.

[75]  T. Ichihashi,et al.  Single-shell carbon nanotubes of 1-nm diameter , 1993, Nature.

[76]  S. Iijima Helical microtubules of graphitic carbon , 1991, Nature.

[77]  Christian H. Bischof,et al.  Extending Perfect Spatial Hashing to Index Tuple-based Graphs Representing Super Carbon Nanotubes , 2017, ICCS.

[78]  M. Malakouti,et al.  Nanomechanics analysis of perfect and defected graphene sheets via a novel atomic-scale finite element method , 2016 .

[79]  G. Meurant The block preconditioned conjugate gradient method on vector computers , 1984 .

[80]  Inderjit S. Dhillon,et al.  A fast kernel-based multilevel algorithm for graph clustering , 2005, KDD '05.

[81]  Mark T. Jones,et al.  An improved incomplete Cholesky factorization , 1995, TOMS.

[82]  Jeffrey A. Fessler,et al.  Conjugate-gradient preconditioning methods for shift-variant PET image reconstruction , 1999, IEEE Trans. Image Process..

[83]  Gregory D. Peterson,et al.  Sparse Matrix-Vector Multiplication Design on FPGAs , 2007 .

[84]  Nicola Pugno,et al.  Mimicking nacre with super-nanotubes for producing optimized super-composites , 2006 .

[85]  W. Haensch,et al.  High-density integration of carbon nanotubes via chemical self-assembly. , 2012, Nature nanotechnology.

[86]  Olaf Schenk,et al.  Solving unsymmetric sparse systems of linear equations with PARDISO , 2004, Future Gener. Comput. Syst..

[87]  Jens Wackerfuß,et al.  Molecular mechanics in the context of the finite element method , 2009 .

[88]  Gustavo E. Scuseria,et al.  Negative curvature and hyperfullerenes , 1992 .

[89]  O. Axelsson,et al.  On the rate of convergence of the preconditioned conjugate gradient method , 1986 .

[90]  S. Stuart,et al.  A reactive potential for hydrocarbons with intermolecular interactions , 2000 .

[91]  Gerhard Wellein,et al.  A Unified Sparse Matrix Data Format for Efficient General Sparse Matrix-Vector Multiplication on Modern Processors with Wide SIMD Units , 2013, SIAM J. Sci. Comput..

[92]  Y. Mai,et al.  Molecular mechanics modeling of deformation and failure of super carbon nanotube networks , 2011, Nanotechnology.

[93]  Ado Jorio,et al.  Geometric and electronic structure of carbon nanotube networks: ‘super’-carbon nanotubes , 2006 .

[94]  HornusSamuel,et al.  Coherent parallel hashing , 2011 .

[95]  Zhifeng Ren,et al.  Straight carbon nanotube Y junctions , 2001 .

[96]  Wolfgang Fichtner,et al.  Efficient Sparse LU Factorization with Left-Right Looking Strategy on Shared Memory Multiprocessors , 2000 .

[97]  Seifert,et al.  Construction of tight-binding-like potentials on the basis of density-functional theory: Application to carbon. , 1995, Physical review. B, Condensed matter.

[98]  Linus Pauling,et al.  The Nature of the Chemical Bond and the Structure of Molecules and Crystals , 1941, Nature.

[99]  Bora Uçar,et al.  On Shared-Memory Parallelization of a Sparse Matrix Scaling Algorithm , 2012, 2012 41st International Conference on Parallel Processing.

[100]  John D. Owens,et al.  Real-time parallel hashing on the GPU , 2009, SIGGRAPH 2009.

[101]  John C. Slater,et al.  Atomic Radii in Crystals , 1964 .

[102]  D. Kershaw The incomplete Cholesky—conjugate gradient method for the iterative solution of systems of linear equations , 1978 .

[103]  Edward D. Lazowska,et al.  Speedup Versus Efficiency in Parallel Systems , 1989, IEEE Trans. Computers.

[104]  Maurice V. Wilkes,et al.  The memory gap and the future of high performance memories , 2001, CARN.

[105]  Christian Schröppel,et al.  Meshing highly regular structures: the case of super carbon nanotubes of arbitrary order , 2015 .

[106]  Yajun Yin,et al.  The effective modulus of super carbon nanotubes predicted by molecular structure mechanics. , 2008, Nanotechnology.

[107]  Samuel Williams,et al.  Roofline: an insightful visual performance model for multicore architectures , 2009, CACM.

[108]  M. Meo,et al.  Prediction of Young's modulus of single wall carbon nanotubes by molecular-mechanics based finite element modelling , 2006 .