blob: d1f799ae9c160c6c74d9b25e3711655563d2ec94 [file] [log] [blame]
These notes are for Pig 0.13.0 release.
Highlights
==========
This release includes several new features such as pluggable execution engines (to allow pig run on non-mapreduce engines in future), auto-local mode (to jobs with small input data size to run in-process), fetch optimization (to improve interactiveness of grunt), fixed counters for local-mode, support for user level jar cache, support for blacklisting and whitelisting pig commands. This also includes several performance fixes and debuggability features. A few non-backwards compatible interface modifications have been introduced in this release to make pig work with non-mapreduce engines (eg- PigProgressNotificationListener). You can find complete list of changes in CHANGES.txt.
System Requirements
===================
1. Java 1.6.x or newer, preferably from Sun. Set JAVA_HOME to the root of your
Java installation
2. Ant build tool: http://ant.apache.org - to build source only
3. Run under Unix and Windows
4. This release is compatible with all Hadoop 0.20.X, 1.X, 0.23.X and 2.X releases
Trying the Release
==================
1. Download pig-0.13.0.tar.gz
2. Unpack the file: tar -xzvf pig-0.13.0.tar.gz
3. Move into the installation directory: cd pig-0.13.0
4. To run pig without Hadoop cluster, execute the command below. This will
take you into an interactive shell called grunt that allows you to navigate
the local file system and execute Pig commands against the local files
bin/pig -x local
5. To run on your Hadoop cluster, you need to set PIG_CLASSPATH environment
variable to point to the directory with your hadoop-site.xml file and then run
pig. The commands below will take you into an interactive shell called grunt
that allows you to navigate Hadoop DFS and execute Pig commands against it
export PIG_CLASSPATH=/hadoop/conf
bin/pig
6. To build your own version of pig.jar run
ant
7. To run unit tests run
ant test
8. To build jar file with available user defined functions run commands below.
cd contrib/piggybank/java
ant
9. To build the tutorial:
cd tutorial
ant
10. To run tutorial follow instructions in http://wiki.apache.org/pig/PigTutorial
Relevant Documentation
======================
Pig Language Manual(including Grunt commands):
http://wiki.apache.org/pig-data/attachments/FrontPage/attachments/plrm.htm
UDF Manual: http://wiki.apache.org/pig/UDFManual
Piggy Bank: http://wiki.apache.org/pig/PiggyBank
Pig Tutorial: http://wiki.apache.org/pig/PigTutorial
Pig Eclipse Plugin (PigPen): http://wiki.apache.org/pig/PigPen