EasiCrawl: A Sleep-Aware Schedule Method for Crawling IoT Sensors

With rapid development of smart hardwares and networking protocols, more and more IoT sensors are becoming publicly accessible through the Internet. Many semantic enhanced IoT sensors store the captured events in their description files, making the build of a generic IoT search engine possible. Crawling the events captured by these sensors is a fundamental step towards building this IoT search engine. However, this step faces a challenge due to sensors' sleep behavior and limited energy supply. Using traditional web access strategy for IoT application may cause unpredictable latency in receiving events with low power efficiency. In this paper, firstly the issue how to crawl newly captured events from periodically sleeping sensors is formulated as a schedule problem, which can be solved by constrained optimization. We take expected latency as the optimization object, as this indicates whether the wanted events can be gathered by crawlers in time. Then a sleep-aware schedule method, named EasiCrawl, is proposed for achieving near-optimal expected latency in receiving events. Finally, EasiCrawl is evaluated by simulations and a case study with real-world data from Xively. The simulation results show that EasiCrawl has lower latency than the periodic and greedy crawl strategy.