Project Title: III: Small: Indexing, Querying, and Visualizing Big Spatial and Spatio-temporal Data Project Award Number: IIS-1525953 PI Name: Mohamed Mokbel Department: Computer Science and Engineering Institution: University of Minnesota Address: 200 Union ST SE, Minneapolis, MN, 55455, USA Email: mokbel@cs.umn.edu URL: www.cs.umn.edu/~mokbel |
|
||
SpatialHadoop is an open source MapReduce framework with built-in support for spatial data. It employs the MapReduce programming paradigm for distributed processing to build a general purpose tool for large scale analysis of spatial data on large clusters. Users can interact easily with SpatialHadoop through a high level language with built-in support for spatial data types and spatial operations. Existing spatial data sets can be loaded in SpatialHadoop with the built in spatial data types point, polygon and rectangle. SpatialHadoop is also extensible and more data types can be added by users. In addition, the data sets are stored efficiently using built-in indexes (Grid file or R-tree) which speed up the retrieval and processing of these data sets. Users can build an index of their choice with a single command that runs in parallel on the machines in the cluster. Once the index is built, users can start analyzing their data sets using the built in spatial operations (range query, k nearest neighbor and spatial join). The extensibility of SpatialHadoop allows users to implement more spatial operations as MapReduce programs. For more information, please visit: "http://spatialhadoop.cs.umn.edu/" | ||
|
||
ST-Hadoop is a MapReduce framework that acknowledges the fact that space and time play a crucial role in query processing. ST-Hadoop is an open-source extension of a Hadoop framework that injects the spatiotemporal awareness in the code base of four layers inside SpatialHadoop, namely, language, indexing, MapReduce, and operations layers. The spatio-temporal indexing techniques inside ST-Hadoop primarily tuned to provide the accommodation of new updated dataset efficiently without the need to rebuild its index. The key point behind the performance gain of ST-Hadoop is the idea of indexing, where data are temporary loaded and divided across computation nodes. For more information, please visit: "http://st-hadoop.cs.umn.edu/" | ||