A cluster computing framework for processing large-scale geospatial data

Launch the interactive Sedona Python Jupyter Notebook on Binder to try Sedona right away.

Apache Sedona™ (incubating) is a cluster computing system for processing large-scale spatial data. Sedona extends Apache Spark and SparkSQL with a set of out-of-the-box Spatial Resilient Distributed Datasets (SRDDs) and SpatialSQL that efficiently load, process, and analyze large-scale spatial data across machines.
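To make the idea of partitioned spatial data and a query operator more concrete, here is a minimal, single-machine sketch in plain Python. It does not use Sedona or Spark, and it is not how Sedona is implemented. The function names (`grid_cell`, `partition_points`, `range_query`), the uniform grid, and the sample points are all assumptions made for illustration. Each grid cell stands in for one partition, and the range query only scans the cells that overlap the query window.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

Point = Tuple[float, float]
Cell = Tuple[int, int]
Envelope = Tuple[float, float, float, float]  # (min_x, min_y, max_x, max_y)


def grid_cell(point: Point, cell_size: float) -> Cell:
    """Map a point to the integer grid cell that contains it."""
    x, y = point
    return (int(x // cell_size), int(y // cell_size))


def partition_points(points: List[Point], cell_size: float) -> Dict[Cell, List[Point]]:
    """Group points by grid cell; each cell stands in for one partition."""
    partitions: Dict[Cell, List[Point]] = defaultdict(list)
    for p in points:
        partitions[grid_cell(p, cell_size)].append(p)
    return dict(partitions)


def range_query(partitions: Dict[Cell, List[Point]],
                window: Envelope,
                cell_size: float) -> List[Point]:
    """Return the points inside the window, scanning only overlapping cells."""
    min_x, min_y, max_x, max_y = window
    lo_cx, lo_cy = grid_cell((min_x, min_y), cell_size)
    hi_cx, hi_cy = grid_cell((max_x, max_y), cell_size)
    hits: List[Point] = []
    for cx in range(lo_cx, hi_cx + 1):
        for cy in range(lo_cy, hi_cy + 1):
            # Cells with no points are simply absent from the dictionary.
            for x, y in partitions.get((cx, cy), []):
                if min_x <= x <= max_x and min_y <= y <= max_y:
                    hits.append((x, y))
    return sorted(hits)


# Hypothetical sample data, not taken from any Sedona dataset.
points = [(1.0, 1.0), (2.5, 3.5), (7.0, 8.0), (12.0, 4.0), (4.9, 4.9)]
partitions = partition_points(points, cell_size=5.0)
result = range_query(partitions, window=(0.0, 0.0, 5.0, 5.0), cell_size=5.0)
print(result)
```

In a distributed setting the partitions would live on different machines and the per-cell scans would run in parallel. The sketch only shows why partitioning by location lets a range query skip data that cannot match.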

System architecture

Modules in the source code

Module   | API                | Description
Core     | RDD                | SpatialRDDs and Query Operators.
SQL      | SQL/DataFrame      | SQL interfaces for Sedona core.
Viz      | RDD, SQL/DataFrame | Visualization for Spatial RDD and DataFrame.
Zeppelin | Apache Zeppelin    | Plugin for Apache Zeppelin 0.8.1+.

Sedona supports several programming languages: Scala, Java, SQL, Python and R.

Compile the source code

Please refer to the Sedona website for build instructions.


Feedback to improve Apache Sedona: Google Form

Twitter: Sedona@Twitter

Gitter chat: Gitter

Sedona JIRA: Bugs, Pull Requests, and other similar issues

Sedona Mailing Lists:

Please visit the Apache Sedona website for detailed information.