BABYLON: Large-Scale GeoSpatial Visual Analytics in Apache Spark

Status               Stable   Latest          Source code
GeoSpark             0.8.1    Maven Central   Build Status, codecov.io
Babylon Viz System   0.2.2    Maven Central   Build Status, codecov.io

Join the chat at https://gitter.im/geospark-datasys/Lobby

Supported Apache Spark versions: 2.0+ (master branch), 1.0+ (1.X branch)

Babylon in this repository (v0.1.X-0.2.X) provides native support for general cartographic design.

Babylon artifacts are hosted in Maven Central: Maven Central Coordinates

Babylon GeoSpark compatibility: Reference Sheet

Version release notes: click here

News!

We have changed the goal of the Babylon project and redesigned the entire system.

Code in this repository will only receive maintenance updates, and its version will stay at 0.2.X.

If you are interested in the new Babylon system, please refer to the Babylon Project.

Babylon Gallery

Scatter Plot: USA mainland rail network

Heat Map: New York City Taxi Trips (with a given map background)

Choropleth Map + Overlay Operator: USA mainland tweets per USA county (Spatial Join Query)

Main Features

Extensible Visualization operator (just like playing LEGO bricks)!

  • High resolution: supports very high resolution image generation through parallel map image rendering
  • Visualizes Spatial RDDs and spatial queries (Spatial Range, Spatial K Nearest Neighbors, Spatial Join)
  • Customizable: accepts any user-supplied colors or coloring rule
  • Extensible: can be extended to any visualization effect
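
To make the idea of a user-supplied coloring rule concrete, here is a minimal sketch in plain Java. It is not Babylon's API: the class `ColoringRuleSketch` and the method `colorFor` are hypothetical names, and the blue-to-red ramp is only one possible rule. It assumes each pixel carries a non-negative count and that the maximum count in the image is known.

```java
import java.awt.Color;

// Hypothetical illustration of a coloring rule; not part of Babylon.
final class ColoringRuleSketch {
    /**
     * Maps a pixel count to a color on a blue-to-red ramp.
     * Pixels with no data stay fully transparent so a background can show through.
     */
    static Color colorFor(double count, double maxCount) {
        if (count <= 0 || maxCount <= 0) {
            return new Color(0, 0, 0, 0); // transparent: nothing to draw
        }
        double t = Math.min(1.0, count / maxCount); // normalize into [0, 1]
        int red = (int) Math.round(255 * t);
        int blue = 255 - red;
        return new Color(red, 0, blue, 255);
    }
}
```

A different rule (for example a logarithmic ramp for heavily skewed counts) would only change the body of `colorFor`.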

Overlay Operator

Overlay one map layer with many other map layers!
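
As a rough picture of what overlaying layers means at the image level, the sketch below stacks several layers on one back layer using only the JDK's `java.awt` classes. It is an assumption-laden illustration, not Babylon's Overlay Operator: `OverlaySketch` and `overlay` are hypothetical names, and all layers are assumed to have the same pixel dimensions.

```java
import java.awt.Graphics2D;
import java.awt.image.BufferedImage;

// Hypothetical illustration of layer overlay; not part of Babylon.
final class OverlaySketch {
    /** Draws each front layer, in order, over a copy of the back layer. */
    static BufferedImage overlay(BufferedImage back, BufferedImage... fronts) {
        BufferedImage out = new BufferedImage(
                back.getWidth(), back.getHeight(), BufferedImage.TYPE_INT_ARGB);
        Graphics2D g = out.createGraphics();
        try {
            g.drawImage(back, 0, 0, null);
            // The default composite is source-over, so transparent pixels
            // in a front layer leave the layers beneath them visible.
            for (BufferedImage front : fronts) {
                g.drawImage(front, 0, 0, null);
            }
        } finally {
            g.dispose();
        }
        return out;
    }
}
```

With this approach a scatter plot rendered on a transparent background can sit on top of a choropleth map without hiding it.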

Various Image Filters

  • Gaussian Blur
  • Box Blur
  • Emboss
  • Outline
  • Sharpen
  • More!

You can also build your own image filter by extending the photo filter!
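
Filters such as box blur, Gaussian blur, emboss, and sharpen are commonly implemented as a convolution with a small kernel. The sketch below shows a box blur using the JDK's `java.awt.image.ConvolveOp`. It does not use Babylon's photo filter classes: `BoxBlurSketch`, `boxKernel`, and `blur` are hypothetical names chosen for this example.

```java
import java.awt.image.BufferedImage;
import java.awt.image.ConvolveOp;
import java.awt.image.Kernel;
import java.util.Arrays;

// Hypothetical illustration of a convolution filter; not part of Babylon.
final class BoxBlurSketch {
    /** Builds a size x size kernel whose weights are equal and sum to 1. */
    static Kernel boxKernel(int size) {
        float[] weights = new float[size * size];
        Arrays.fill(weights, 1.0f / (size * size));
        return new Kernel(size, size, weights);
    }

    /** Applies the box kernel; EDGE_NO_OP copies border pixels unchanged. */
    static BufferedImage blur(BufferedImage src, int size) {
        ConvolveOp op = new ConvolveOp(boxKernel(size), ConvolveOp.EDGE_NO_OP, null);
        return op.filter(src, null); // null destination: a new image is allocated
    }
}
```

Swapping in different kernel weights would give a different effect from the same code path, which is one way a new filter could be derived from an existing one.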

Various Image Types

  • Raster image: PNG
  • Vector image: SVG (supported only for Scatter Plot and Choropleth Map)
  • More!

You can also support your desired image type by extending the image generator! (JPG format is temporarily unavailable due to the lack of OpenJDK support.)
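
For readers unfamiliar with raster output on the JVM, this is a minimal sketch of encoding an in-memory image as PNG bytes with the JDK's `javax.imageio.ImageIO`. It is not Babylon's image generator: `PngSketch` and `toPng` are hypothetical names, and writing the bytes to local disk, HDFS, or S3 is left out.

```java
import java.awt.image.BufferedImage;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import javax.imageio.ImageIO;

// Hypothetical illustration of PNG encoding; not part of Babylon.
final class PngSketch {
    /** Encodes the image as PNG and returns the encoded bytes. */
    static byte[] toPng(BufferedImage image) {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try {
            // ImageIO.write returns false when no writer exists for the format.
            if (!ImageIO.write(image, "png", buffer)) {
                throw new IllegalStateException("No PNG writer available");
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return buffer.toByteArray();
    }
}
```

Another format would follow the same shape as long as the runtime ships a matching writer.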

Current Visualization Effects

  • Scatter Plot
  • Heat Map
  • Choropleth Map
  • More!
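
Effects such as scatter plots and heat maps generally start from the same step: mapping each spatial coordinate to a pixel and aggregating per pixel. The single-machine sketch below illustrates that step only. It is an assumption about the general technique, not Babylon's implementation (which renders in parallel on Spark), and `PixelCountSketch` and `countPerPixel` are hypothetical names.

```java
// Hypothetical illustration of rasterizing points; not part of Babylon.
final class PixelCountSketch {
    /**
     * Counts points per pixel. Each point is {x, y}; points outside the
     * bounding box are skipped. Row 0 is the top of the image, so larger
     * y values land in smaller row indexes.
     */
    static int[][] countPerPixel(double[][] points,
                                 double minX, double minY,
                                 double maxX, double maxY,
                                 int width, int height) {
        int[][] counts = new int[height][width];
        for (double[] p : points) {
            if (p[0] < minX || p[0] > maxX || p[1] < minY || p[1] > maxY) {
                continue; // outside the map extent
            }
            // Clamp so points exactly on the max edge fall in the last pixel.
            int col = Math.min(width - 1, (int) ((p[0] - minX) / (maxX - minX) * width));
            int row = Math.min(height - 1, (int) ((maxY - p[1]) / (maxY - minY) * height));
            counts[row][col]++;
        }
        return counts;
    }
}
```

A scatter plot would color every pixel with a non-zero count the same way, while a heat map would feed the counts into a coloring rule and typically blur the result.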

Current Output Storage

  • Local disk
  • Hadoop Distributed File System (HDFS)
  • Amazon Simple Storage Service (Amazon S3)

You can also build your own visualization effects by extending the visualization operator!

Babylon Tutorial

The full Babylon tutorial is available on the GeoSpark GitHub Wiki: https://github.com/DataSystemsLab/GeoSpark/wiki

Supported Spatial Objects and Input Formats

All spatial objects and input formats supported by GeoSpark

Acknowledgement

Babylon makes use of JFreeSVG plus (an extension of JFreeSVG 3.2) for some SVG image operations.

Please refer to the JFreeSVG website and JFreeSVG plus for more details.

Contact

Questions

  • Please join the chat at https://gitter.im/geospark-datasys/Lobby

  • Email us!

Project website

Please visit the GeoSpark project website for the latest news and releases.

Data Systems Lab

Babylon is one of the projects under the Data Systems Lab at Arizona State University. The mission of the Data Systems Lab is to design and develop experimental data management systems (e.g., database systems).