commit | 6d61c9ee3e15e4e8fa00209c15f294730ab136bf | |
---|---|---|
author | Jia Yu <jiayu2@asu.edu> | Mon Feb 17 14:29:00 2020 -0700 |
committer | Jia Yu <jiayu2@asu.edu> | Mon Feb 17 14:36:16 2020 -0700 |
tree | 7f0d783ecf8c97a80504496abc54f9b319cda62b | |
parent | 8b6538614681ea323ede06c5d8ff9a78cb921f37 | |
[New version release] Set GeoSpark version to 1.3.1
GeoSpark is a cluster computing system for processing large-scale spatial data. It extends Apache Spark and SparkSQL with out-of-the-box Spatial Resilient Distributed Datasets (SRDDs) and SpatialSQL that efficiently load, process, and analyze large-scale spatial data across machines.
GeoSpark contains several modules:
Name | API | Spark compatibility | Dependency |
---|---|---|---|
GeoSpark-core | RDD | Spark 2.X/1.X | Spark-core |
GeoSpark-SQL | SQL/DataFrame | SparkSQL 2.1 and later | Spark-core, Spark-SQL, GeoSpark-core |
GeoSpark-Viz | RDD, SQL/DataFrame | RDD - Spark 2.X/1.X, SQL - Spark 2.1 and later | Spark-core, Spark-SQL, GeoSpark-core, GeoSpark-SQL |
GeoSpark-Zeppelin | Apache Zeppelin | Spark 2.1+, Zeppelin 0.8.1+ | Spark-core, Spark-SQL, GeoSpark-core, GeoSpark-SQL, GeoSpark-Viz |
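The dependency column above can be read as a build definition. The following is a minimal sbt sketch for a project that uses the RDD, SQL, and Viz modules at the 1.3.1 version set by this commit. The artifact names (`geospark`, `geospark-sql_2.3`, `geospark-viz_2.3`), the `_2.3` Spark-compatibility suffix, and the Spark version 2.3.4 are assumptions made for illustration, not taken from this README; check the GeoSpark website for the coordinates that match your Spark version. GeoSpark-Zeppelin is left out of the sketch.

```scala
// build.sbt -- illustrative sketch, not an official build file.
// Assumed values: Spark 2.3.4 and the "_2.3" artifact suffix.
val sparkVersion = "2.3.4"           // assumption: any Spark 2.3.x release
val sparkCompatibleVersion = "2.3"   // assumption: suffix of the SQL/Viz artifacts
val geosparkVersion = "1.3.1"        // version set by this commit

libraryDependencies ++= Seq(
  // Spark itself is normally supplied by the cluster, hence "provided".
  "org.apache.spark" %% "spark-core" % sparkVersion % "provided",
  "org.apache.spark" %% "spark-sql"  % sparkVersion % "provided",
  // GeoSpark-core: RDD API, depends on Spark-core.
  "org.datasyslab" % "geospark" % geosparkVersion,
  // GeoSpark-SQL: SQL/DataFrame API, depends on Spark-SQL and GeoSpark-core.
  "org.datasyslab" % s"geospark-sql_$sparkCompatibleVersion" % geosparkVersion,
  // GeoSpark-Viz: depends on GeoSpark-core and GeoSpark-SQL.
  "org.datasyslab" % s"geospark-viz_$sparkCompatibleVersion" % geosparkVersion
)
```

A project that only needs the RDD API could keep just the `spark-core` and `geospark` lines, since GeoSpark-core depends on Spark-core alone.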
Please visit the GeoSpark website for details and documentation.
The GeoSpark development team has published four papers about GeoSpark. Please read Publications.
GeoSpark was evaluated in the PVLDB 2018 paper “How Good Are Modern Spatial Analytics Systems?” by Varun Pandey, Andreas Kipf, Thomas Neumann, and Alfons Kemper (Technical University of Munich), which states:
> GeoSpark comes close to a complete spatial analytics system. It also exhibits the best performance in most cases.