RTA: A framework for the integration of local and relational open data

Yusuke Kosaka, Shu Murakami, Thomas Laurent, Kento Goto, Motomichi Toyama

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

There are currently massive amounts of public data, also refereed to as open data, for example stock price data or weather data. However, such data is distributed in a variety of ways, such as downloadable files like CSV or XML files, or through API calls to web services. Each data source thus requires a specific workflow, making it a burden for the users to process and use this data. This barrier to use diminishes the openness of this data We thus propose the Remote Table Access (RTA) system, a simple and safe architecture for publishing, i.e. giving open read only access to relational data, and easily integrating it with the user's local data. RTA enables the user to query relational open data and their own local data seamlessly through a single SQL query. To allow this, we designed a three parties architecture featuring a client-side application, an optional server-side module and a "Public Table Library" (PTL). The client side application processes the RTA query and fetches the necessary data, the server side system acts as an agent between the remote database and the client, offering added security as well as scalability in terms of connections, and the PTL list all the published data and stores its access information. We implemented an early prototype of this architecture as a proof of concept. We validated it against two datasets, including data from the TPC-C benchmark and make it available1. Our results show the feasability of RTA and possible significant reduction of query processing time mainly because of the reduction on transmission volume by condition pushing and semijoin.

Original languageEnglish
Title of host publicationProceedings of the 21st International Database Engineering and Applications Symposium, IDEAS 2017
PublisherAssociation for Computing Machinery
Pages135-142
Number of pages8
VolumePart F129476
ISBN (Electronic)9781450352208
DOIs
Publication statusPublished - 2017 Jul 12
Event21st International Database Engineering and Applications Symposium, IDEAS 2017 - Bristol, United Kingdom
Duration: 2017 Jul 122017 Jul 14

Other

Other21st International Database Engineering and Applications Symposium, IDEAS 2017
CountryUnited Kingdom
CityBristol
Period17/7/1217/7/14

Fingerprint

Servers
Query processing
Application programming interfaces (API)
XML
Web services
Scalability

ASJC Scopus subject areas

  • Human-Computer Interaction
  • Computer Networks and Communications
  • Computer Vision and Pattern Recognition
  • Software

Cite this

Kosaka, Y., Murakami, S., Laurent, T., Goto, K., & Toyama, M. (2017). RTA: A framework for the integration of local and relational open data. In Proceedings of the 21st International Database Engineering and Applications Symposium, IDEAS 2017 (Vol. Part F129476, pp. 135-142). Association for Computing Machinery. https://doi.org/10.1145/3105831.3105852

RTA : A framework for the integration of local and relational open data. / Kosaka, Yusuke; Murakami, Shu; Laurent, Thomas; Goto, Kento; Toyama, Motomichi.

Proceedings of the 21st International Database Engineering and Applications Symposium, IDEAS 2017. Vol. Part F129476 Association for Computing Machinery, 2017. p. 135-142.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Kosaka, Y, Murakami, S, Laurent, T, Goto, K & Toyama, M 2017, RTA: A framework for the integration of local and relational open data. in Proceedings of the 21st International Database Engineering and Applications Symposium, IDEAS 2017. vol. Part F129476, Association for Computing Machinery, pp. 135-142, 21st International Database Engineering and Applications Symposium, IDEAS 2017, Bristol, United Kingdom, 17/7/12. https://doi.org/10.1145/3105831.3105852
Kosaka Y, Murakami S, Laurent T, Goto K, Toyama M. RTA: A framework for the integration of local and relational open data. In Proceedings of the 21st International Database Engineering and Applications Symposium, IDEAS 2017. Vol. Part F129476. Association for Computing Machinery. 2017. p. 135-142 https://doi.org/10.1145/3105831.3105852
Kosaka, Yusuke ; Murakami, Shu ; Laurent, Thomas ; Goto, Kento ; Toyama, Motomichi. / RTA : A framework for the integration of local and relational open data. Proceedings of the 21st International Database Engineering and Applications Symposium, IDEAS 2017. Vol. Part F129476 Association for Computing Machinery, 2017. pp. 135-142
@inproceedings{b8b472d3507c4f978f99b5ff31e15c47,
title = "RTA: A framework for the integration of local and relational open data",
abstract = "There are currently massive amounts of public data, also refereed to as open data, for example stock price data or weather data. However, such data is distributed in a variety of ways, such as downloadable files like CSV or XML files, or through API calls to web services. Each data source thus requires a specific workflow, making it a burden for the users to process and use this data. This barrier to use diminishes the openness of this data We thus propose the Remote Table Access (RTA) system, a simple and safe architecture for publishing, i.e. giving open read only access to relational data, and easily integrating it with the user's local data. RTA enables the user to query relational open data and their own local data seamlessly through a single SQL query. To allow this, we designed a three parties architecture featuring a client-side application, an optional server-side module and a {"}Public Table Library{"} (PTL). The client side application processes the RTA query and fetches the necessary data, the server side system acts as an agent between the remote database and the client, offering added security as well as scalability in terms of connections, and the PTL list all the published data and stores its access information. We implemented an early prototype of this architecture as a proof of concept. We validated it against two datasets, including data from the TPC-C benchmark and make it available1. Our results show the feasability of RTA and possible significant reduction of query processing time mainly because of the reduction on transmission volume by condition pushing and semijoin.",
author = "Yusuke Kosaka and Shu Murakami and Thomas Laurent and Kento Goto and Motomichi Toyama",
year = "2017",
month = "7",
day = "12",
doi = "10.1145/3105831.3105852",
language = "English",
volume = "Part F129476",
pages = "135--142",
booktitle = "Proceedings of the 21st International Database Engineering and Applications Symposium, IDEAS 2017",
publisher = "Association for Computing Machinery",

}

TY - GEN

T1 - RTA

T2 - A framework for the integration of local and relational open data

AU - Kosaka, Yusuke

AU - Murakami, Shu

AU - Laurent, Thomas

AU - Goto, Kento

AU - Toyama, Motomichi

PY - 2017/7/12

Y1 - 2017/7/12

N2 - There are currently massive amounts of public data, also refereed to as open data, for example stock price data or weather data. However, such data is distributed in a variety of ways, such as downloadable files like CSV or XML files, or through API calls to web services. Each data source thus requires a specific workflow, making it a burden for the users to process and use this data. This barrier to use diminishes the openness of this data We thus propose the Remote Table Access (RTA) system, a simple and safe architecture for publishing, i.e. giving open read only access to relational data, and easily integrating it with the user's local data. RTA enables the user to query relational open data and their own local data seamlessly through a single SQL query. To allow this, we designed a three parties architecture featuring a client-side application, an optional server-side module and a "Public Table Library" (PTL). The client side application processes the RTA query and fetches the necessary data, the server side system acts as an agent between the remote database and the client, offering added security as well as scalability in terms of connections, and the PTL list all the published data and stores its access information. We implemented an early prototype of this architecture as a proof of concept. We validated it against two datasets, including data from the TPC-C benchmark and make it available1. Our results show the feasability of RTA and possible significant reduction of query processing time mainly because of the reduction on transmission volume by condition pushing and semijoin.

AB - There are currently massive amounts of public data, also refereed to as open data, for example stock price data or weather data. However, such data is distributed in a variety of ways, such as downloadable files like CSV or XML files, or through API calls to web services. Each data source thus requires a specific workflow, making it a burden for the users to process and use this data. This barrier to use diminishes the openness of this data We thus propose the Remote Table Access (RTA) system, a simple and safe architecture for publishing, i.e. giving open read only access to relational data, and easily integrating it with the user's local data. RTA enables the user to query relational open data and their own local data seamlessly through a single SQL query. To allow this, we designed a three parties architecture featuring a client-side application, an optional server-side module and a "Public Table Library" (PTL). The client side application processes the RTA query and fetches the necessary data, the server side system acts as an agent between the remote database and the client, offering added security as well as scalability in terms of connections, and the PTL list all the published data and stores its access information. We implemented an early prototype of this architecture as a proof of concept. We validated it against two datasets, including data from the TPC-C benchmark and make it available1. Our results show the feasability of RTA and possible significant reduction of query processing time mainly because of the reduction on transmission volume by condition pushing and semijoin.

UR - http://www.scopus.com/inward/record.url?scp=85028087353&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85028087353&partnerID=8YFLogxK

U2 - 10.1145/3105831.3105852

DO - 10.1145/3105831.3105852

M3 - Conference contribution

AN - SCOPUS:85028087353

VL - Part F129476

SP - 135

EP - 142

BT - Proceedings of the 21st International Database Engineering and Applications Symposium, IDEAS 2017

PB - Association for Computing Machinery

ER -