Harnessing the Computing Continuum for Programming Our World

The number of network-connected devices (sensors, actuators, instruments, computers, and data stores) now substantially exceeds the number of humans on this planet. This is a tipping point, and the societal and intellectual effects of this are not yet fully understood. Billions of things that sense, think, and act are now connected to a planet-spanning network of cloud and high-performance computing (HPC) centers that contain more computers than the entire Internet did just a few years ago. We are now critically dependent on this expanding network for our communications and social discourse; our food, health, and safety; our manufacturing, transportation, and logistics; and our creative and intellectual endeavors, including research and technical innovation. Despite our increasing dependence on this massive, interconnected system of systems in nearly every aspect of our social, political, economic, and cultural lives, we lack ways to analyze its emergent properties, specify its operating constraints, or coordinate its behavior. Simply put, today we have the tools to instrument and embed intelligence in everything, and we are doing so at a prodigious pace. Although we are the globally distributed designers, builders, and users of this immense, multi-layered environment, we are not truly its masters. Each of us manages only some of the networks components, and we can neither predict its aggregate behavior nor easily specify our intensional goals in intuitive language. For all of us collectively, and each of us individually, this must change. Today, we program in the relatively small confines of a single node, defining individual device, instrument, and computing element behaviors, and we are regularly confounded by unanticipated outcomes and unexpected behavior that results once this individual node/device is exposed to the network collective. As consumers, we want our Internet-capable environmental devices (e.g., thermostats, lighting, and entertainment preferences) to adapt seamlessly to our changing roles and expectations, regardless of location. And yet, rather than specifying the ends we seek, we must specify detailed behaviors for home, office, car, and transient locale. In environmental health, we build and deploy arrays of wireless environmental sensors and edge devices when our goal may really be to “reprioritize edge resources to search for mosquitoes, given a statistically significant change in seasonal temperature and humidity across the nearby river basin”. In disaster planning, when satellites show hurricane formation, we manually redirect data streams and simulation software stacks, when our goal is really to “retarget advanced computing resources to predict storm surge levels along the eastern seaboard”. In science, when the Laser Interferometer Gravitational-Wave Observatory (LIGO) detects

[1]  Lotfi A. Zadeh,et al.  Commonsense reasoning based on fuzzy logic , 1986, WSC '86.

[2]  Bertram Ludäscher,et al.  Kepler: an extensible system for design and execution of scientific workflows , 2004, Proceedings. 16th International Conference on Scientific and Statistical Database Management, 2004..

[3]  Murray Cole,et al.  MaSiF: Machine learning guided auto-tuning of parallel skeletons , 2013, HiPC.

[4]  Lavanya Ramakrishnan,et al.  Performability modeling for scheduling and fault tolerance strategies for scientific workflows , 2008, HPDC '08.

[5]  Rahul Ramachandran,et al.  Service-oriented environments for dynamically interacting with mesoscale weather , 2005, Computing in Science & Engineering.

[6]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[7]  Judy Qiu,et al.  Benchmarking Harp-DAAL: High Performance Hadoop on KNL Clusters , 2017, 2017 IEEE 10th International Conference on Cloud Computing (CLOUD).

[8]  Jack J. Dongarra,et al.  Algorithm 679: A set of level 3 basic linear algebra subprograms: model implementation and test programs , 1990, TOMS.

[9]  Xavier Messeguer,et al.  DnaSP, DNA polymorphism analyses by the coalescent and other methods , 2003, Bioinform..

[10]  Geoffrey C. Fox,et al.  Twister:Net - Communication Library for Big Data Processing in HPC and Cloud Environments , 2018, 2018 IEEE 11th International Conference on Cloud Computing (CLOUD).

[11]  Geoffrey C. Fox,et al.  Twister: a runtime for iterative MapReduce , 2010, HPDC '10.

[12]  Timothy W. Finin,et al.  A policy language for a pervasive computing environment , 2003, Proceedings POLICY 2003. IEEE 4th International Workshop on Policies for Distributed Systems and Networks.

[13]  Geoffrey Fox,et al.  Twister2: Design of a big data toolkit , 2020, Concurr. Comput. Pract. Exp..

[14]  Juan-Carlos Cano,et al.  A Proposal for a Publish/Subscribe, Disruption Tolerant Content Island for Fog Computing , 2017, SmartObjects@MobiCom.

[15]  Martin Schulz,et al.  Clustering performance data efficiently at massive scales , 2010, ICS '10.

[16]  Jeffrey M. Bradshaw,et al.  KAoS policy management for semantic Web services , 2004, IEEE Intelligent Systems.

[17]  Liliana Pasquale,et al.  Fuzzy Time in Linear Temporal Logic , 2014, TOCL.

[18]  Pete Beckman,et al.  Waggle: An open sensor platform for edge computing , 2016, 2016 IEEE SENSORS.

[19]  Ewa Deelman,et al.  Pegasus in the Cloud: Science Automation through Workflow Technologies , 2016, IEEE Internet Computing.

[20]  Daniel A. Reed,et al.  A Cloud-Based Scientific Gateway for Internet of Things Data Analytics , 2018, PEARC.