[New version release] Set GeoSpark version to 1.3.1
5 files changed
tree: 7f0d783ecf8c97a80504496abc54f9b319cda62b
  1. .github/
  2. contrib/
  3. core/
  4. docs/
  5. python/
  6. sql/
  7. viz/
  8. zeppelin/
  9. .gitignore
  10. .gitmodules
  11. .travis.yml
  12. _config.yml
  13. GeoSpark_logo.png
  14. LICENSE
  15. LICENSE-before-1.2.0
  16. mkdocs.yml
  17. Pipfile
  18. pom.xml
  19. README.md
README.md

GeoSpark Logo

StableLatestSource code
Maven Central with version prefix filterSonatype Nexus (Snapshots)Build Status

GeoSpark@Twitter || GeoSpark Discussion Board || Join the chat at https://gitter.im/geospark-datasys/Lobby || HitCount(since Jan. 2018)

GeoSpark is a cluster computing system for processing large-scale spatial data. GeoSpark extends Apache Spark / SparkSQL with a set of out-of-the-box Spatial Resilient Distributed Datasets (SRDDs)/ SpatialSQL that efficiently load, process, and analyze large-scale spatial data across machines.

GeoSpark contains several modules:

NameAPISpark compatibilityDependency
GeoSpark-coreRDDSpark 2.X/1.XSpark-core
GeoSpark-SQLSQL/DataFrameSparkSQL 2.1 and laterSpark-core, Spark-SQL, GeoSpark-core
GeoSpark-VizRDD, SQL/DataFrameRDD - Spark 2.X/1.X, SQL - Spark 2.1 and laterSpark-core, Spark-SQL, GeoSpark-core, GeoSpark-SQL
GeoSpark-ZeppelinApache ZeppelinSpark 2.1+, Zeppelin 0.8.1+Spark-core, Spark-SQL, GeoSpark-core, GeoSpark-SQL, GeoSpark-Viz
  • Core: GeoSpark SpatialRDDs and Query Operators.
  • SQL: SQL interfaces for GeoSpark core.
  • Viz: Visualization extension of GeoSpark Spatial RDD and DataFrame.
  • GeoSpark-Zeppelin: GeoSpark visualization plugin for Apache Zeppelin

Please visit GeoSpark website for details and documentations.

News!

Social impact

GeoSpark development team has published four papers about GeoSpark. Please read Publications.

GeoSpark received an evaluation from PVLDB 2018 paper “How Good Are Modern Spatial Analytics Systems?” Varun Pandey, Andreas Kipf, Thomas Neumann, Alfons Kemper (Technical University of Munich), quoted as follows:

GeoSpark comes close to a complete spatial analytics system. It also exhibits the best performance in most cases.