Simply download PredictionIO's binary distribution and extract it.
$ wget http://download.prediction.io/PredictionIO-{{ site.pio_version }}.tar.gz $ tar zxvf PredictionIO-{{ site.pio_version }}.tar.gz $ cd PredictionIO-{{ site.pio_version }}
Apache Spark is the default processing engine for PredictionIO. Download Apache Spark release 1.1.0 package hadoop2.4. Extract the file, and set the SPARK_HOME
configuration in conf/pio-env.sh
to the Spark directory.
$ wget http://d3kbcqa49mib13.cloudfront.net/{{ site.spark_download_filename }}.tgz $ tar zxvf {{ site.spark_download_filename }}.tgz
After that, edit conf/pio-env.sh
in your PredictionIO installation directory. For example,
SPARK_HOME=/home/abc/Downloads/{{ site.spark_download_filename }}
By default, PredictionIO uses Elasticsearch at localhost as the data store to store its metadata. Simply install and run Elasticsearch, which looks like this:
$ wget https://download.elasticsearch.org/elasticsearch/elasticsearch/{{ site.elasticsearch_download_filename }}.tar.gz $ tar zxvf {{ site.elasticsearch_download_filename }}.tar.gz $ cd {{ site.elasticsearch_download_filename }} $ bin/elasticsearch
You may change the settings or even use another data store such as MongoDB. For details, please read Changing the Data Store.
By default, PredictionIO's Data API uses HBase at localhost as the data store for event data.
$ wget http://archive.apache.org/dist/hbase/{{ site.hbase_basename }}/{{ site.hbase_basename }}-{{ site.hbase_variant }}.tar.gz $ tar zxvf {{ site.hbase_basename }}-{{ site.hbase_variant }}.tar.gz $ cd {{ site.hbase_basename }}-{{ site.hbase_variant }}
You will need to at least add a minimal configuration to HBase to start it in standalone mode. Details can be found here. Here, we are showing a sample minimal configuration.
For production deployment, run a fully distributed HBase configuration.
Edit conf/hbase-site.xml
and put the following in. You may replace /home/abc
with your own home directory.
<configuration> <property> <name>hbase.rootdir</name> <value>file:///home/abc/hbase</value> </property> <property> <name>hbase.zookeeper.property.dataDir</name> <value>/home/abc/zookeeper</value> </property> </configuration>
Now you may start HBase.
$ bin/start-hbase.sh
Now you have installed everything you need to run PredictionIO!
Next: Reading Quick Start