Abstract
World Wide Web contains a huge amount of periodically-updated values originally sensed from the physical world; they include, for example, density of air pollutant, road traffic condition, and car park occupancy. In many cases, however, those data are not easily accessible from a computer program due to the lack of APIs to fetch them. In this paper, to cope with this problem, we propose an architecture for discovering, excavating, and streaming the entombed web contents (EWC). This architecture, called Sensorizer, leverages crowd sourcing for accurate EWC discovery, periodic web scraping with a headless browser for excavation from dynamic web pages, and a standardized communication protocol (XMPP) for data streaming to wide variety of applications.
Original language | English |
---|---|
Title of host publication | UbiComp and ISWC 2015 - Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and the Proceedings of the 2015 ACM International Symposium on Wearable Computers |
Publisher | Association for Computing Machinery, Inc |
Pages | 1599-1606 |
Number of pages | 8 |
ISBN (Print) | 9781450335751 |
DOIs | |
Publication status | Published - 2015 Sept 7 |
Event | ACM International Joint Conference on Pervasive and Ubiquitous Computing and the 2015 ACM International Symposium on Wearable Computers, UbiComp and ISWC 2015 - Osaka, Japan Duration: 2015 Sept 7 → 2015 Sept 11 |
Other
Other | ACM International Joint Conference on Pervasive and Ubiquitous Computing and the 2015 ACM International Symposium on Wearable Computers, UbiComp and ISWC 2015 |
---|---|
Country/Territory | Japan |
City | Osaka |
Period | 15/9/7 → 15/9/11 |
ASJC Scopus subject areas
- Computer Networks and Communications
- Hardware and Architecture
- Software