 Spark Change Log
 ----------------

 Release 1.4.1

   [SPARK-8548] [SPARKR] Remove the trailing whitespaces from the SparkR files
   Yu ISHIKAWA <yuu.ishikawa@gmail.com>
   2015-06-22 20:55:38 -0700
   Commit: 2501794, github.com/apache/spark/pull/6945

   [SPARK-7859] [SQL] Collect_set() behavior differences which fails the unit test under jdk8
   Cheng Hao <hao.cheng@intel.com>
   2015-06-22 20:04:49 -0700
   Commit: d73900a, github.com/apache/spark/pull/6402

   [SPARK-8532] [SQL] In Python's DataFrameWriter, save/saveAsTable/json/parquet/jdbc always override mode
   Yin Huai <yhuai@databricks.com>
   2015-06-22 13:51:23 -0700
   Commit: 994abba, github.com/apache/spark/pull/6937

   [SPARK-8511] [PYSPARK] Modify a test to remove a saved model in `regression.py`
   Yu ISHIKAWA <yuu.ishikawa@gmail.com>
   2015-06-22 11:53:11 -0700
   Commit: 507381d, github.com/apache/spark/pull/6926

   [SPARK-8420] [SQL] Fix comparison of timestamps/dates with strings (branch-1.4)
   Michael Armbrust <michaeldatabricks.com>, Michael Armbrust <michael@databricks.com>
   2015-06-22 10:45:33 -0700
   Commit: 6598161, github.com/apache/spark/pull/6888

   [SPARK-8406] [SQL] Backports SPARK-8406 and PR #6864 to branch-1.4
   Cheng Lian <lian@databricks.com>
   2015-06-22 10:04:29 -0700
   Commit: 451c872, github.com/apache/spark/pull/6932

   [HOTFIX] Hotfix branch-1.4 building by removing avgMetrics in CrossValidatorSuite
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-06-21 22:25:08 -0700
   Commit: b836bac, github.com/apache/spark/pull/6929

   [SPARK-7715] [MLLIB] [ML] [DOC] Updated MLlib programming guide for release 1.4
   Joseph K. Bradley <joseph@databricks.com>
   2015-06-21 16:25:25 -0700
   Commit: 2a7ea31, github.com/apache/spark/pull/6897

   [SPARK-8379] [SQL] avoid speculative tasks write to the same file
   jeanlyn <jeanlyn92@gmail.com>
   2015-06-21 00:13:40 -0700
   Commit: f0e4040, github.com/apache/spark/pull/6833

   [SPARK-8468] [ML] Take the negative of some metrics in RegressionEvaluator to get correct cross validation
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-06-20 13:01:59 -0700
   Commit: fe59a4a, github.com/apache/spark/pull/6905

   [HOTFIX] [SPARK-8489] Correct JIRA number in previous commit
   Andrew Or <andrew@databricks.com>
   2015-06-19 17:39:26 -0700
   Commit: 9b16508

   [SPARK-8390] [STREAMING] [KAFKA] fix docs related to HasOffsetRanges
   cody koeninger <cody@koeninger.org>
   2015-06-19 17:16:56 -0700
   Commit: a7b773a, github.com/apache/spark/pull/6863

   [SPARK-8389] [STREAMING] [KAFKA] Example of getting offset ranges out o…
   cody koeninger <cody@koeninger.org>
   2015-06-19 14:51:19 +0200
   Commit: 78d0cee, github.com/apache/spark/pull/6846

   [SPARK-8498] [SQL] Add regression test for SPARK-8470
   Andrew Or <andrew@databricks.com>
   2015-06-19 17:34:09 -0700
   Commit: 2248ad8, github.com/apache/spark/pull/6909

   [HOT-FIX] Fix compilation (caused by 0131142d98b191f6cc112d383aa10582a3ac35bf)
   Yin Huai <yhuai@databricks.com>
   2015-06-19 17:29:51 -0700
   Commit: 2510365, github.com/apache/spark/pull/6913

   [SPARK-8093] [SQL] Remove empty structs inferred from JSON documents
   Nathan Howell <nhowell@godaddy.com>
   2015-06-19 16:19:28 -0700
   Commit: 0131142, github.com/apache/spark/pull/6799

   [SPARK-8452] [SPARKR] expose jobGroup API in SparkR
   Hossein <hossein@databricks.com>
   2015-06-19 15:47:22 -0700
   Commit: 1a6b510, github.com/apache/spark/pull/6889

   [SPARK-8368] [SPARK-8058] [SQL] HiveContext may override the context class loader of the current thread (branch 1.4)
   Yin Huai <yhuai@databricks.com>
   2015-06-19 11:15:28 -0700
   Commit: 9ac8393, github.com/apache/spark/pull/6895

   [SPARK-7180] [SPARK-8090] [SPARK-8091] Fix a number of SerializationDebugger bugs and limitations
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-06-19 10:52:30 -0700
   Commit: 4b2c793, github.com/apache/spark/pull/6625

   [SPARK-5836] [DOCS] [STREAMING] Clarify what may cause long-running Spark apps to preserve shuffle files
   Sean Owen <sowen@cloudera.com>
   2015-06-19 11:03:04 -0700
   Commit: 3415fb9, github.com/apache/spark/pull/6901

   [SPARK-8451] [SPARK-7287] SparkSubmitSuite should check exit code
   Andrew Or <andrew@databricks.com>
   2015-06-19 10:56:19 -0700
   Commit: aedd893, github.com/apache/spark/pull/6886

   [SPARK-8430] ExternalShuffleBlockResolver of shuffle service should support UnsafeShuffleManager
   Lianhui Wang <lianhuiwang09@gmail.com>
   2015-06-19 10:47:07 -0700
   Commit: 6f2e411, github.com/apache/spark/pull/6873

   [SPARK-8151] [MLLIB] pipeline components should correctly implement copy
   Xiangrui Meng <meng@databricks.com>
   2015-06-19 09:46:51 -0700
   Commit: 1f2dafb, github.com/apache/spark/pull/6622

   [SPARK-8339] [PYSPARK] integer division for python 3
   Kevin Conor <kevin@discoverybayconsulting.com>
   2015-06-19 00:12:20 -0700
   Commit: 164b9d3, github.com/apache/spark/pull/6794

   [SPARK-8458] [SQL] Don't strip scheme part of output path when writing ORC files
   Cheng Lian <lian@databricks.com>
   2015-06-18 22:01:52 -0700
   Commit: f48f3a2, github.com/apache/spark/pull/6892

   [SPARK-8080] [STREAMING] Receiver.store with Iterator does not give correct count at Spark UI
   Dibyendu Bhattacharya <dibyendu.bhattacharya1@pearson.com>, U-PEROOT\UBHATD1 <UBHATD1@PIN-L-PI046.PEROOT.com>
   2015-06-18 19:58:47 -0700
   Commit: b55e4b9, github.com/apache/spark/pull/6707

   [SPARK-8462] [DOCS] Documentation fixes for Spark SQL
   Lars Francke <lars.francke@gmail.com>
   2015-06-18 19:40:32 -0700
   Commit: bd9bbd6, github.com/apache/spark/pull/6890

   [SPARK-8446] [SQL] Add helper functions for testing SparkPlan physical operators
   Josh Rosen <joshrosen@databricks.com>, Josh Rosen <rosenville@gmail.com>, Michael Armbrust <michael@databricks.com>
   2015-06-18 16:45:14 -0700
   Commit: 152f446, github.com/apache/spark/pull/6885

   [SPARK-8376] [DOCS] Add common lang3 to the Spark Flume Sink doc
   zsxwing <zsxwing@gmail.com>
   2015-06-18 16:00:27 -0700
   Commit: 9f293a9, github.com/apache/spark/pull/6829

   [SPARK-8353] [DOCS] Show anchor links when hovering over documentation headers
   Josh Rosen <joshrosen@databricks.com>
   2015-06-18 15:10:09 -0700
   Commit: c1da5cf, github.com/apache/spark/pull/6808

   [SPARK-8202] [PYSPARK] fix infinite loop during external sort in PySpark
   Davies Liu <davies@databricks.com>
   2015-06-18 13:45:58 -0700
   Commit: ca23c3b, github.com/apache/spark/pull/6714

   [SPARK-8095] Resolve dependencies of --packages in local ivy cache
   Burak Yavuz <brkyvz@gmail.com>
   2015-06-17 22:33:37 -0700
   Commit: 9dabc12, github.com/apache/spark/pull/6788

   [SPARK-8392] RDDOperationGraph: getting cached nodes is slow
   xutingjun <xutingjun@huawei.com>
   2015-06-17 22:31:01 -0700
   Commit: 67ad12d, github.com/apache/spark/pull/6839

   [SPARK-8306] [SQL] AddJar command needs to set the new class loader to the HiveConf inside executionHive.state.
   Yin Huai <yhuai@databricks.com>
   2015-06-17 14:52:43 -0700
   Commit: 73cf5de, github.com/apache/spark/pull/6758

   [SPARK-8404] [STREAMING] [TESTS] Use thread-safe collections to make the tests more reliable
   zsxwing <zsxwing@gmail.com>
   2015-06-17 15:00:03 -0700
   Commit: 5aedfa2, github.com/apache/spark/pull/6852

   [SPARK-8373] [PYSPARK] Add emptyRDD to pyspark and fix the issue when calling sum on an empty RDD
   zsxwing <zsxwing@gmail.com>
   2015-06-17 13:59:39 -0700
   Commit: 5e7973d, github.com/apache/spark/pull/6826

   [SPARK-8372] History server shows incorrect information for application not started
   Carson Wang <carson.wang@intel.com>
   2015-06-17 13:41:36 -0700
   Commit: f051373, github.com/apache/spark/pull/6827

   [SPARK-8161] Set externalBlockStoreInitialized to be true, after ExternalBlockStore is initialized
   Mingfei <mingfei.shi@intel.com>
   2015-06-17 13:40:07 -0700
   Commit: d75c53d, github.com/apache/spark/pull/6702

   [SPARK-7515] [DOC] Update documentation for PySpark on YARN with cluster mode
   Kousuke Saruta <sarutakoss.nttdata.co.jp>, Punya Biswal <pbiswal@palantir.com>, Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-06-17 13:37:20 -0700
   Commit: a7f6979, github.com/apache/spark/pull/6040

   [SPARK-8395] [DOCS] start-slave.sh docs incorrect
   Sean Owen <sowen@cloudera.com>
   2015-06-17 13:31:10 -0700
   Commit: 320c442, github.com/apache/spark/pull/6855

   [SPARK-8309] [CORE] Support for more than 12M items in OpenHashMap
   Vyacheslav Baranov <slavik.baranov@gmail.com>
   2015-06-17 09:42:29 +0100
   Commit: a5f602e, github.com/apache/spark/pull/6763

   Fix break introduced by backport
   Punya Biswal <pbiswal@palantir.com>
   2015-06-16 22:31:49 -0700
   Commit: 877deb0, github.com/apache/spark/pull/6850

   [SPARK-7916] [MLLIB] MLlib Python doc parity check for classification and regression
   Yanbo Liang <ybliang8@gmail.com>
   2015-06-16 14:30:30 -0700
   Commit: 15d973f, github.com/apache/spark/pull/6460

   [SPARK-8126] [BUILD] Make sure temp dir exists when running tests.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-06-16 21:10:18 +0100
   Commit: b9e5d3c, github.com/apache/spark/pull/6805

   [SQL] [DOC] improved a comment
   Radek Ostrowski <dest.hawaii@gmail.com>, radek <radek@radeks-MacBook-Pro-2.local>
   2015-06-16 21:04:26 +0100
   Commit: 4da0686, github.com/apache/spark/pull/6332

   [SPARK-DOCS] [SPARK-SQL] Update sql-programming-guide.md
   Moussa Taifi <moutai10@gmail.com>
   2015-06-16 20:59:22 +0100
   Commit: 1378bdc, github.com/apache/spark/pull/6847

   [SPARK-8367] [STREAMING] Add a limit for `spark.streaming.blockInterval` since a data loss bug.
   huangzhaowei <carlmartinmax@gmail.com>, huangzhaowei <SaintBacchus@users.noreply.github.com>
   2015-06-16 08:16:09 +0200
   Commit: f287f7e, github.com/apache/spark/pull/6818

   SPARK-8336 Fix NullPointerException with functions.rand()
   tedyu <yuzhihong@gmail.com>
   2015-06-15 17:00:38 -0700
   Commit: fff8d7e, github.com/apache/spark/pull/6793

   fix read/write mixup
   Peter Hoffmann <ph@peter-hoffmann.com>
   2015-06-14 11:41:16 -0700
   Commit: 0ffbf08, github.com/apache/spark/pull/6815

   [SPARK-8358] [SQL] Wait for child resolution when resolving generators
   Michael Armbrust <michael@databricks.com>
   2015-06-14 11:21:42 -0700
   Commit: 2805d14, github.com/apache/spark/pull/6811

   [SPARK-8354] [SQL] Fix off-by-factor-of-8 error when allocating scratch space in UnsafeFixedWidthAggregationMap
   Josh Rosen <joshrosen@databricks.com>
   2015-06-14 09:34:35 -0700
   Commit: 4634be5, github.com/apache/spark/pull/6809

   [Spark-8343] [Streaming] [Docs] Improve Spark Streaming Guides.
   Mike Dusenberry <dusenberrymw@gmail.com>
   2015-06-13 21:22:46 -0700
   Commit: 187a3d5, github.com/apache/spark/pull/6801

   [SPARK-8329][SQL] Allow _ in DataSource options
   Michael Armbrust <michael@databricks.com>
   2015-06-12 23:11:16 -0700
   Commit: 1ca431e, github.com/apache/spark/pull/6786

   [SPARK-7284] [STREAMING] Updated streaming documentation
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-06-12 15:22:59 -0700
   Commit: 7c11ccf, github.com/apache/spark/pull/6781

   [SPARK-8330] DAG visualization: trim whitespace from input
   Andrew Or <andrew@databricks.com>
   2015-06-12 11:14:55 -0700
   Commit: 7608373, github.com/apache/spark/pull/6787

   [SPARK-8322] [EC2] Added spark 1.4.0 into the VALID_SPARK_VERSIONS and…
   Mark Smith <mark.smith@bronto.com>
   2015-06-12 10:28:30 -0700
   Commit: 141eab7, github.com/apache/spark/pull/6777

   [SPARK-6511] [docs] Fix example command in hadoop-provided docs.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-06-11 15:29:03 -0700
   Commit: 8b25f62, github.com/apache/spark/pull/6766

   [SPARK-8310] [EC2] Update spark-ec2 branch to 1.4
   Shivaram Venkataraman <shivaram@cs.berkeley.edu>
   2015-06-11 13:22:08 -0700
   Commit: 3a62569, github.com/apache/spark/pull/6765

   [SPARK-8289] Specify stack size for consistency with Java tests - resolves test failures
   Adam Roberts <aroberts@uk.ibm.com>, a-roberts <aroberts@uk.ibm.com>
   2015-06-11 08:40:46 +0100
   Commit: b313920, github.com/apache/spark/pull/6727

   [SPARK-8285] [SQL] CombineSum should be calculated as unlimited decimal first
   navis.ryu <navis@apache.org>
   2015-06-10 18:19:12 -0700
   Commit: 5c05b5c, github.com/apache/spark/pull/6736

   [SPARK-8200] [MLLIB] Check for empty RDDs in StreamingLinearAlgorithm
   Paavo <pparkkin@gmail.com>
   2015-06-10 23:17:42 +0100
   Commit: 59fc3f1, github.com/apache/spark/pull/6713

   [SPARK-8273] Driver hangs up when yarn shutdown in client mode
   WangTaoTheTonic <wangtao111@huawei.com>
   2015-06-10 13:34:19 -0700
   Commit: 2846a35, github.com/apache/spark/pull/6717

   [SPARK-7756] CORE RDDOperationScope fix for IBM Java
   Adam Roberts <aroberts@uk.ibm.com>, a-roberts <aroberts@uk.ibm.com>
   2015-06-10 13:21:01 -0700
   Commit: 568d1d5, github.com/apache/spark/pull/6740

   [SPARK-8282] [SPARKR] Make number of threads used in RBackend configurable
   Hossein <hossein@databricks.com>
   2015-06-10 13:18:48 -0700
   Commit: 28e8a6e, github.com/apache/spark/pull/6730

   [SQL] [MINOR] Fixes a minor Java example error in SQL programming guide
   Cheng Lian <lian@databricks.com>
   2015-06-10 11:48:14 -0700
   Commit: 7b88e6a, github.com/apache/spark/pull/6749

   [SPARK-6511] [DOCUMENTATION] Explain how to use Hadoop provided builds
   Patrick Wendell <patrick@databricks.com>
   2015-06-09 16:14:21 -0700
   Commit: a0a7f2f, github.com/apache/spark/pull/6729

   [MINOR] [UI] DAG visualization: trim whitespace from input
   Andrew Or <andrew@databricks.com>
   2015-06-09 15:44:02 -0700
   Commit: 1175cfe, github.com/apache/spark/pull/6732

   [SPARK-8274] [DOCUMENTATION-MLLIB] Fix wrong URLs in MLlib Frequent Pattern Mining Documentation
   FavioVazquez <favio.vazquezp@gmail.com>
   2015-06-09 15:02:18 +0100
   Commit: a7b7a19, github.com/apache/spark/pull/6722

   [SPARK-6820] [SPARKR] Convert NAs to null type in SparkR DataFrames
   hqzizania <qian.huang@intel.com>
   2015-06-08 21:40:12 -0700
   Commit: 0a9383d, github.com/apache/spark/pull/6190

   [SPARK-8162] [HOTFIX] Fix NPE in spark-shell
   Andrew Or <andrew@databricks.com>
   2015-06-08 18:09:21 -0700
   Commit: e9a8372, github.com/apache/spark/pull/6711

   [SPARK-8126] [BUILD] Use custom temp directory during build.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-06-08 15:37:28 +0100
   Commit: 99c2a57, github.com/apache/spark/pull/6674

   [SPARK-8121] [SQL] Fixes InsertIntoHadoopFsRelation job initialization for Hadoop 1.x (branch 1.4 backport based on https://github.com/apache/spark/pull/6669)
   Yin Huai <yhuai@databricks.com>
   2015-06-08 11:35:30 -0700
   Commit: 69197c3

   [SPARK-7705] [YARN] Cleanup of .sparkStaging directory fails if application is killed
   linweizhong <linweizhong@huawei.com>
   2015-06-08 09:34:16 +0100
   Commit: a3afc2c, github.com/apache/spark/pull/6409

   [SPARK-4761] [DOC] [SQL] kryo default setting in SQL Thrift server
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-06-08 01:07:50 -0700
   Commit: 58bfdd6, github.com/apache/spark/pull/6639

   [SPARK-8004][SQL] Quote identifier in JDBC data source.
   Reynold Xin <rxin@databricks.com>
   2015-06-07 10:52:02 -0700
   Commit: b9c046f, github.com/apache/spark/pull/6689

   [SPARK-8146] DataFrame Python API: Alias replace in df.na
   Reynold Xin <rxin@databricks.com>
   2015-06-07 01:21:02 -0700
   Commit: ff26767, github.com/apache/spark/pull/6688

   [SPARK-8141] [SQL] Precompute datatypes for partition columns and reuse it
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-06-07 15:33:48 +0800
   Commit: b4d5441, github.com/apache/spark/pull/6687

   [SPARK-8145] [WEBUI] Trigger a double click on the span to show full job description.
   979969786 <q79969786@gmail.com>
   2015-06-06 23:15:27 -0700
   Commit: 9d1f4d6, github.com/apache/spark/pull/6646

   [SPARK-8004][SQL] Enclose column names by JDBC Dialect
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-06-06 22:59:31 -0700
   Commit: b6fdc6c, github.com/apache/spark/pull/6577

   [SPARK-7955] [CORE] Ensure executors with cached RDD blocks are not re…
   Hari Shreedharan <hshreedharan@apache.org>
   2015-06-06 21:13:26 -0700
   Commit: 6faaf15, github.com/apache/spark/pull/6508

   [SPARK-8079] [SQL] Makes InsertIntoHadoopFsRelation job/task abortion more robust
   Cheng Lian <lian@databricks.com>
   2015-06-06 17:23:12 +0800
   Commit: d8a53fb, github.com/apache/spark/pull/6612

   [SPARK-7991] [PySpark] Adding support for passing lists to describe.
   amey <amey@skytree.net>
   2015-06-05 13:49:33 -0700
   Commit: 84523fc, github.com/apache/spark/pull/6655

   [SPARK-7747] [SQL] [DOCS] spark.sql.planner.externalSort
   Luca Martinetti <luca@luca.io>
   2015-06-05 13:40:11 -0700
   Commit: 94f65bc, github.com/apache/spark/pull/6272

   [SPARK-8112] [STREAMING] Fix the negative event count issue
   zsxwing <zsxwing@gmail.com>
   2015-06-05 12:46:02 -0700
   Commit: 200c980, github.com/apache/spark/pull/6659

   Revert "[MINOR] [BUILD] Use custom temp directory during build."
   Andrew Or <andrew@databricks.com>
   2015-06-05 10:54:06 -0700
   Commit: 429c658

   [SPARK-8085] [SPARKR] Support user-specified schema in read.df
   Shivaram Venkataraman <shivaram@cs.berkeley.edu>
   2015-06-05 10:19:03 -0700
   Commit: 3e3151e, github.com/apache/spark/pull/6620

   [STREAMING] Update streaming-kafka-integration.md
   Akhil Das <akhld@darktech.ca>
   2015-06-05 14:23:23 +0200
   Commit: 0ef2e9d, github.com/apache/spark/pull/6666

   [MINOR] [BUILD] Use custom temp directory during build.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-06-05 14:11:38 +0200
   Commit: 9b3e4c1, github.com/apache/spark/pull/6653

   [MINOR] remove unused interpolation var in log message
   Sean Owen <sowen@cloudera.com>
   2015-06-05 00:32:46 -0700
   Commit: 90cf686, github.com/apache/spark/pull/6650

   [SPARK-8116][PYSPARK] Allow sc.range() to take a single argument.
   Ted Blackman <ted.blackman@gmail.com>
   2015-06-04 22:21:11 -0700
   Commit: f02af7c, github.com/apache/spark/pull/6656

   [SPARK-8098] [WEBUI] Show correct length of bytes on log page
   Carson Wang <carson.wang@intel.com>
   2015-06-04 16:24:50 -0700
   Commit: 3ba6fc5, github.com/apache/spark/pull/6640

   [SPARK-8027] [SPARKR] Move man pages creation to install-dev.sh
   Shivaram Venkataraman <shivaram@cs.berkeley.edu>
   2015-06-04 12:52:16 -0700
   Commit: 0b71b85, github.com/apache/spark/pull/6593

   [SPARK-7969] [SQL] Added a DataFrame.drop function that accepts a Column reference.
   Mike Dusenberry <dusenberrymw@gmail.com>
   2015-06-04 11:30:07 -0700
   Commit: 81ff7a9, github.com/apache/spark/pull/6585

   Fix maxTaskFailures comment
   Daniel Darabos <darabos.daniel@gmail.com>
   2015-06-04 13:46:49 +0200
   Commit: daf9451, github.com/apache/spark/pull/6621

   [BUILD] Fix Maven build for Kinesis
   Andrew Or <andrew@databricks.com>
   2015-06-03 20:45:31 -0700
   Commit: 84da653

   [SPARK-7558] Demarcate tests in unit-tests.log (1.4)
   Andrew Or <andrew@databricks.com>
   2015-06-03 20:46:44 -0700
   Commit: bfe74b3, github.com/apache/spark/pull/6598

   [BUILD] Use right branch when checking against Hive (1.4)
   Andrew Or <andrew@databricks.com>
   2015-06-03 18:09:14 -0700
   Commit: 584a2ba, github.com/apache/spark/pull/6630

   [BUILD] Increase Jenkins test timeout
   Andrew Or <andrew@databricks.com>
   2015-06-03 17:40:14 -0700
   Commit: 96f71b1

   [SPARK-8084] [SPARKR] Make SparkR scripts fail on error
   Shivaram Venkataraman <shivaram@cs.berkeley.edu>
   2015-06-03 17:02:16 -0700
   Commit: c2c1290, github.com/apache/spark/pull/6623

   [SPARK-8088] don't attempt to lower number of executors by 0
   Ryan Williams <ryan.blake.williams@gmail.com>
   2015-06-03 16:54:46 -0700
   Commit: 1674869, github.com/apache/spark/pull/6624

   [HOTFIX] [TYPO] Fix typo in #6546
   Andrew Or <andrew@databricks.com>
   2015-06-03 16:04:02 -0700
   Commit: 0bc9a3e

   [HOTFIX] Unbreak build from backporting #6546
   Andrew Or <andrew@databricks.com>
   2015-06-03 15:25:35 -0700
   Commit: d0be950

   [SPARK-8051] [MLLIB] make StringIndexerModel silent if input column does not exist
   Xiangrui Meng <meng@databricks.com>
   2015-06-03 15:16:24 -0700
   Commit: b2a22a6, github.com/apache/spark/pull/6595

   [SPARK-3674] [EC2] Clear SPARK_WORKER_INSTANCES when using YARN
   Shivaram Venkataraman <shivaram@cs.berkeley.edu>
   2015-06-03 15:14:38 -0700
   Commit: ca21fff, github.com/apache/spark/pull/6424

   [SPARK-7989] [CORE] [TESTS] Fix flaky tests in ExternalShuffleServiceSuite and SparkListenerWithClusterSuite
   zsxwing <zsxwing@gmail.com>
   2015-06-03 15:04:20 -0700
   Commit: 7e46ea0, github.com/apache/spark/pull/6546

   [SPARK-8001] [CORE] Make AsynchronousListenerBus.waitUntilEmpty throw TimeoutException if timeout
   zsxwing <zsxwing@gmail.com>
   2015-06-03 15:03:07 -0700
   Commit: 306837e, github.com/apache/spark/pull/6550

   [SPARK-8083] [MESOS] Use the correct base path in mesos driver page.
   Timothy Chen <tnachen@gmail.com>
   2015-06-03 14:57:23 -0700
   Commit: 59399a8, github.com/apache/spark/pull/6615

   [MINOR] [UI] Improve confusing message on log page
   Andrew Or <andrew@databricks.com>
   2015-06-03 12:10:12 -0700
   Commit: 31e0ae9

   [SPARK-8054] [MLLIB] Added several Java-friendly APIs + unit tests
   Joseph K. Bradley <joseph@databricks.com>
   2015-06-03 14:34:20 -0700
   Commit: bfab61f, github.com/apache/spark/pull/6562

   [SPARK-8074] Parquet should throw AnalysisException during setup for data type/name related failures.
   Reynold Xin <rxin@databricks.com>
   2015-06-03 13:57:57 -0700
   Commit: 1f90a06, github.com/apache/spark/pull/6608

   [SPARK-8063] [SPARKR] Spark master URL conflict between MASTER env variable and --master command line option.
   Sun Rui <rui.sun@intel.com>
   2015-06-03 11:56:35 -0700
   Commit: f67a27d, github.com/apache/spark/pull/6605

   [SPARK-7980] [SQL] Support SQLContext.range(end)
   animesh <animesh@apache.spark>
   2015-06-03 11:28:18 -0700
   Commit: 0a1dad6, github.com/apache/spark/pull/6609

   [SPARK-7973] [SQL] Increase the timeout of two CliSuite tests.
   Yin Huai <yhuai@databricks.com>
   2015-06-03 09:26:21 -0700
   Commit: 54a4ea4, github.com/apache/spark/pull/6525

   [SPARK-8060] Improve DataFrame Python test coverage and documentation.
   Reynold Xin <rxin@databricks.com>
   2015-06-03 00:23:34 -0700
   Commit: ee7f365, github.com/apache/spark/pull/6601

   [SPARK-8032] [PYSPARK] Make version checking for NumPy in MLlib more robust
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-06-02 23:24:47 -0700
   Commit: bd57af3, github.com/apache/spark/pull/6579

   [SPARK-8043] [MLLIB] [DOC] update NaiveBayes and SVM examples in doc
   Yuhao Yang <hhbyyh@gmail.com>
   2015-06-02 23:15:38 -0700
   Commit: 33edb2b, github.com/apache/spark/pull/6584

   [SPARK-8053] [MLLIB] renamed scalingVector to scalingVec
   Joseph K. Bradley <joseph@databricks.com>
   2015-06-02 22:56:56 -0700
   Commit: 88399c3, github.com/apache/spark/pull/6596

   [SPARK-7547] [ML] Scala Example code for ElasticNet
   DB Tsai <dbt@netflix.com>
   2015-06-02 19:12:08 -0700
   Commit: 6391be8, github.com/apache/spark/pull/6576

   [SPARK-7387] [ML] [DOC] CrossValidator example code in Python
   Ram Sriharsha <rsriharsha@hw11853.local>
   2015-06-02 18:53:04 -0700
   Commit: 6a3e32a, github.com/apache/spark/pull/6358

   Preparing development version 1.4.0-SNAPSHOT
   Patrick Wendell <pwendell@gmail.com>
   2015-06-02 18:06:41 -0700
   Commit: ab713af


 Release 1.4.0

   [HOTFIX] Revert "[SPARK-7092] Update spark scala version to 2.11.6"
   Patrick Wendell <patrick@databricks.com>
   2015-05-19 02:28:41 -0700
   Commit: 31f5d53

   Revert "Preparing Spark release v1.4.0-rc1"
   Patrick Wendell <patrick@databricks.com>
   2015-05-19 02:27:14 -0700
   Commit: 586ede6

   Revert "Preparing development version 1.4.1-SNAPSHOT"
   Patrick Wendell <patrick@databricks.com>
   2015-05-19 02:27:07 -0700
   Commit: e7309ec

   Fixing a few basic typos in the Programming Guide.
   Mike Dusenberry <dusenberrymw@gmail.com>
   2015-05-19 08:59:45 +0100
   Commit: 0748263, github.com/apache/spark/pull/6240

   Preparing development version 1.4.1-SNAPSHOT
   Patrick Wendell <patrick@databricks.com>
   2015-05-19 07:13:24 +0000
   Commit: a1d896b

   Preparing Spark release v1.4.0-rc1
   Patrick Wendell <patrick@databricks.com>
   2015-05-19 07:13:24 +0000
   Commit: 79fb01a

   Updating CHANGES.txt for Spark 1.4
   Patrick Wendell <patrick@databricks.com>
   2015-05-19 00:12:20 -0700
   Commit: 30bf333

   Revert "Preparing Spark release v1.4.0-rc1"
   Patrick Wendell <patrick@databricks.com>
   2015-05-19 00:10:39 -0700
   Commit: b0c63d2

   Revert "Preparing development version 1.4.1-SNAPSHOT"
   Patrick Wendell <patrick@databricks.com>
   2015-05-19 00:10:37 -0700
   Commit: 198a186

   [SPARK-7581] [ML] [DOC] User guide for spark.ml PolynomialExpansion
   Xusen Yin <yinxusen@gmail.com>
   2015-05-19 00:06:33 -0700
   Commit: 38a3fc8, github.com/apache/spark/pull/6113

   [HOTFIX] Fixing style failures in Kinesis source
   Patrick Wendell <patrick@databricks.com>
   2015-05-19 00:02:06 -0700
   Commit: de60c2e

   Preparing development version 1.4.1-SNAPSHOT
   Patrick Wendell <patrick@databricks.com>
   2015-05-19 06:06:41 +0000
   Commit: 40190ce

   Preparing Spark release v1.4.0-rc1
   Patrick Wendell <patrick@databricks.com>
   2015-05-19 06:06:40 +0000
   Commit: 38ccef3

   Revert "Preparing Spark release v1.4.0-rc1"
   Patrick Wendell <patrick@databricks.com>
   2015-05-18 23:06:15 -0700
   Commit: 152b029

   Revert "Preparing development version 1.4.1-SNAPSHOT"
   Patrick Wendell <patrick@databricks.com>
   2015-05-18 23:06:13 -0700
   Commit: 4d098bc

   [HOTFIX]: Java 6 Build Breaks
   Patrick Wendell <patrick@databricks.com>
   2015-05-19 06:00:13 +0000
   Commit: be1fc93

   Preparing development version 1.4.1-SNAPSHOT
   Patrick Wendell <patrick@databricks.com>
   2015-05-19 05:01:11 +0000
   Commit: 758ca74

   Preparing Spark release v1.4.0-rc1
   Patrick Wendell <patrick@databricks.com>
   2015-05-19 05:01:11 +0000
   Commit: e8e97e3

   [SPARK-7687] [SQL] DataFrame.describe() should cast all aggregates to String
   Josh Rosen <joshrosen@databricks.com>
   2015-05-18 21:53:44 -0700
   Commit: 99436bd, github.com/apache/spark/pull/6218

   CHANGES.txt and changelist updates for Spark 1.4.
   Patrick Wendell <patrick@databricks.com>
   2015-05-18 21:44:13 -0700
   Commit: 914ecd0

   [SPARK-7150] SparkContext.range() and SQLContext.range()
   Daoyuan Wang <daoyuan.wang@intel.com>, Davies Liu <davies@databricks.com>
   2015-05-18 21:43:12 -0700
   Commit: 7fcbb2c, github.com/apache/spark/pull/6081

   Version updates for Spark 1.4.0
   Patrick Wendell <patrick@databricks.com>
   2015-05-18 21:38:37 -0700
   Commit: 9d0b7fb

   [SPARK-7681] [MLLIB] Add SparseVector support for gemv
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-05-18 21:32:36 -0700
   Commit: dd9f873, github.com/apache/spark/pull/6209

   [SPARK-7692] Updated Kinesis examples
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-05-18 18:24:15 -0700
   Commit: 9c48548, github.com/apache/spark/pull/6249

   [SPARK-7621] [STREAMING] Report Kafka errors to StreamingListeners
   jerluc <jeremyalucas@gmail.com>
   2015-05-18 18:13:29 -0700
   Commit: 9188ad8, github.com/apache/spark/pull/6204

   [SPARK-7624] Revert #4147
   Davies Liu <davies@databricks.com>
   2015-05-18 16:55:45 -0700
   Commit: 60cb33d, github.com/apache/spark/pull/6172

   [SQL] Fix serializability of ORC table scan
   Michael Armbrust <michael@databricks.com>
   2015-05-18 15:24:31 -0700
   Commit: f8f23c4, github.com/apache/spark/pull/6247

   [SPARK-7501] [STREAMING] DAG visualization: show DStream operations
   Andrew Or <andrew@databricks.com>
   2015-05-18 14:33:33 -0700
   Commit: a475cbc, github.com/apache/spark/pull/6034

   [HOTFIX] Fix ORC build break
   Michael Armbrust <michael@databricks.com>
   2015-05-18 14:04:04 -0700
   Commit: ba502ab, github.com/apache/spark/pull/6244

   [SPARK-7658] [STREAMING] [WEBUI] Update the mouse behaviors for the timeline graphs
   zsxwing <zsxwing@gmail.com>
   2015-05-18 13:34:43 -0700
   Commit: 39add3d, github.com/apache/spark/pull/6168

   [SPARK-6216] [PYSPARK] check python version of worker with driver
   Davies Liu <davies@databricks.com>
   2015-05-18 12:55:13 -0700
   Commit: a833209, github.com/apache/spark/pull/6203

   [SPARK-7673] [SQL] WIP: HadoopFsRelation and ParquetRelation2 performance optimizations
   Cheng Lian <lian@databricks.com>
   2015-05-18 12:45:37 -0700
   Commit: 3962348, github.com/apache/spark/pull/6225

   [SPARK-7567] [SQL] [follow-up] Use a new flag to set output committer based on mapreduce apis
   Yin Huai <yhuai@databricks.com>
   2015-05-18 12:17:10 -0700
   Commit: a385f4b, github.com/apache/spark/pull/6130

   [SPARK-7269] [SQL] Incorrect analysis for aggregation(use semanticEquals)
   Wenchen Fan <cloud0fan@outlook.com>
   2015-05-18 12:08:28 -0700
   Commit: d6f5f37, github.com/apache/spark/pull/6173

   [SPARK-7631] [SQL] treenode argString should not print children
   scwf <wangfei1@huawei.com>
   2015-05-18 12:05:14 -0700
   Commit: dbd4ec8, github.com/apache/spark/pull/6144

   [SPARK-2883] [SQL] ORC data source for Spark SQL
   Zhan Zhang <zhazhan@gmail.com>, Cheng Lian <lian@databricks.com>
   2015-05-18 12:03:27 -0700
   Commit: 65d71bd, github.com/apache/spark/pull/6194

   [SPARK-7380] [MLLIB] pipeline stages should be copyable in Python
   Xiangrui Meng <meng@databricks.com>, Joseph K. Bradley <joseph@databricks.com>
   2015-05-18 12:02:18 -0700
   Commit: cf4e04a, github.com/apache/spark/pull/6088

   [SQL] [MINOR] [THIS] use private for internal field in ScalaUdf
   Wenchen Fan <cloud0fan@outlook.com>
   2015-05-18 12:01:30 -0700
   Commit: 7d44c01, github.com/apache/spark/pull/6235

   [SPARK-7570] [SQL] Ignores _temporary during partition discovery
   Cheng Lian <lian@databricks.com>
   2015-05-18 11:59:44 -0700
   Commit: c7623a2, github.com/apache/spark/pull/6091

   [SPARK-6888] [SQL] Make the jdbc driver handling user-definable
   Rene Treffer <treffer@measite.de>
   2015-05-18 11:55:36 -0700
   Commit: b41301a, github.com/apache/spark/pull/5555

   [SPARK-7627] [SPARK-7472] DAG visualization: style skipped stages
   Andrew Or <andrew@databricks.com>
   2015-05-18 10:59:35 -0700
   Commit: a0ae8ce, github.com/apache/spark/pull/6171

   [SPARK-7272] [MLLIB] User guide for PMML model export
   Vincenzo Selvaggio <vselvaggio@hotmail.it>
   2015-05-18 08:46:33 -0700
   Commit: a95d4e1, github.com/apache/spark/pull/6219

   [SPARK-6657] [PYSPARK] Fix doc warnings
   Xiangrui Meng <meng@databricks.com>
   2015-05-18 08:35:14 -0700
   Commit: 2c94ffe, github.com/apache/spark/pull/6221

   [SPARK-7299][SQL] Set precision and scale for Decimal according to JDBC metadata instead of returned BigDecimal
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-05-18 01:10:55 -0700
   Commit: 0e7cd8f, github.com/apache/spark/pull/5833

   [SPARK-7694] [MLLIB] Use getOrElse for getting the threshold of LR model
   Shuo Xiang <shuoxiangpub@gmail.com>
   2015-05-17 21:16:52 -0700
   Commit: 0b6bc8a, github.com/apache/spark/pull/6224

   [SPARK-7693][Core] Remove "import scala.concurrent.ExecutionContext.Implicits.global"
   zsxwing <zsxwing@gmail.com>
   2015-05-17 20:37:19 -0700
   Commit: 2a42d2d, github.com/apache/spark/pull/6223

   [SQL] [MINOR] use catalyst type converter in ScalaUdf
   Wenchen Fan <cloud0fan@outlook.com>
   2015-05-17 16:51:57 -0700
   Commit: be66d19, github.com/apache/spark/pull/6182

   [SPARK-6514] [SPARK-5960] [SPARK-6656] [SPARK-7679] [STREAMING] [KINESIS] Updates to the Kinesis API
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-05-17 16:49:07 -0700
   Commit: e0632ff, github.com/apache/spark/pull/6147

   [SPARK-7491] [SQL] Allow configuration of classloader isolation for hive
   Michael Armbrust <michael@databricks.com>
   2015-05-17 12:43:15 -0700
   Commit: a855608, github.com/apache/spark/pull/6167

   [SPARK-7686] [SQL] DescribeCommand is assigned wrong output attributes in SparkStrategies
   Josh Rosen <joshrosen@databricks.com>
   2015-05-17 11:59:28 -0700
   Commit: 53d6ab5, github.com/apache/spark/pull/6217

   [SPARK-7660] Wrap SnappyOutputStream to work around snappy-java bug
   Josh Rosen <joshrosen@databricks.com>
   2015-05-17 09:30:49 -0700
   Commit: 6df71eb, github.com/apache/spark/pull/6176

   [SPARK-7669] Builds against Hadoop 2.6+ get inconsistent curator depend…
   Steve Loughran <stevel@hortonworks.com>
   2015-05-17 17:03:11 +0100
   Commit: 0feb3de, github.com/apache/spark/pull/6191

   [SPARK-7447] [SQL] Don't re-merge Parquet schema when the relation is deserialized
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-05-17 15:42:21 +0800
   Commit: 898be62, github.com/apache/spark/pull/6012

   [MINOR] Add 1.3, 1.3.1 to master branch EC2 scripts
   Shivaram Venkataraman <shivaram@cs.berkeley.edu>
   2015-05-17 00:12:20 -0700
   Commit: 0ed376a, github.com/apache/spark/pull/6215

   [MINOR] [SQL] Removes an unreachable case clause
   Cheng Lian <lian@databricks.com>
   2015-05-16 23:20:09 -0700
   Commit: 671a6bc, github.com/apache/spark/pull/6214

   [SPARK-7654][SQL] Move JDBC into DataFrame's reader/writer interface.
   Reynold Xin <rxin@databricks.com>
   2015-05-16 22:01:53 -0700
   Commit: 17e0786, github.com/apache/spark/pull/6210

   [SPARK-7655][Core] Deserializing value should not hold the TaskSchedulerImpl lock
   zsxwing <zsxwing@gmail.com>
   2015-05-16 21:03:22 -0700
   Commit: 8494910, github.com/apache/spark/pull/6195

   [SPARK-7654][MLlib] Migrate MLlib to the DataFrame reader/writer API.
   Reynold Xin <rxin@databricks.com>
   2015-05-16 15:03:57 -0700
   Commit: bd057f8, github.com/apache/spark/pull/6211

   [BUILD] update jblas dependency version to 1.2.4
   Matthew Brandyberry <mbrandy@us.ibm.com>
   2015-05-16 18:17:48 +0100
   Commit: 8bde352, github.com/apache/spark/pull/6199

   [HOTFIX] [SQL] Fixes DataFrameWriter.mode(String)
   Cheng Lian <lian@databricks.com>
   2015-05-16 20:55:10 +0800
   Commit: 856619d, github.com/apache/spark/pull/6212

   [SPARK-7655][Core][SQL] Remove 'scala.concurrent.ExecutionContext.Implicits.global' in 'ask' and 'BroadcastHashJoin'
   zsxwing <zsxwing@gmail.com>
   2015-05-16 00:44:29 -0700
   Commit: ad5b0b1, github.com/apache/spark/pull/6200

   [SPARK-7672] [CORE] Use int conversion in translating kryoserializer.buffer.mb to kryoserializer.buffer
   Nishkam Ravi <nravi@cloudera.com>, nishkamravi2 <nishkamravi@gmail.com>, nravi <nravi@c1704.halxg.cloudera.com>
   2015-05-16 08:24:21 +0100
   Commit: e7607e5, github.com/apache/spark/pull/6198

   [SPARK-4556] [BUILD] binary distribution assembly can't run in local mode
   Sean Owen <sowen@cloudera.com>
   2015-05-16 08:18:41 +0100
   Commit: 1fc3560, github.com/apache/spark/pull/6186

   [SPARK-7671] Fix wrong URLs in MLlib Data Types Documentation
   FavioVazquez <favio.vazquezp@gmail.com>
   2015-05-16 08:07:03 +0100
   Commit: 7e3f9fe, github.com/apache/spark/pull/6196

   [SPARK-7654][SQL] DataFrameReader and DataFrameWriter for input/output API
   Reynold Xin <rxin@databricks.com>
   2015-05-15 22:00:31 -0700
   Commit: 9da55b5, github.com/apache/spark/pull/6175

   [SPARK-7473] [MLLIB] Add reservoir sample in RandomForest
   AiHe <ai.he@ussuning.com>
   2015-05-15 20:42:35 -0700
   Commit: f41be8f, github.com/apache/spark/pull/5988

   [SPARK-7543] [SQL] [PySpark] split dataframe.py into multiple files
   Davies Liu <davies@databricks.com>
   2015-05-15 20:09:15 -0700
   Commit: 8164fbc, github.com/apache/spark/pull/6201

   [SPARK-7073] [SQL] [PySpark] Clean up SQL data type hierarchy in Python
   Davies Liu <davies@databricks.com>
   2015-05-15 20:05:26 -0700
   Commit: 61806f6, github.com/apache/spark/pull/6206

   [SPARK-7575] [ML] [DOC] Example code for OneVsRest
   Ram Sriharsha <rsriharsha@hw11853.local>
   2015-05-15 19:33:20 -0700
   Commit: 04323ba, github.com/apache/spark/pull/6115

   [SPARK-7563] OutputCommitCoordinator.stop() should only run on the driver
   Josh Rosen <joshrosen@databricks.com>
   2015-05-15 18:06:01 -0700
   Commit: ed75cc0, github.com/apache/spark/pull/6197

   [SPARK-7676] Bug fix and cleanup of stage timeline view
   Kay Ousterhout <kayousterhout@gmail.com>
   2015-05-15 17:45:14 -0700
   Commit: 6f78d03, github.com/apache/spark/pull/6202

   [SPARK-7556] [ML] [DOC] Add user guide for spark.ml Binarizer, including Scala, Java and Python examples
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-05-15 15:05:04 -0700
   Commit: e847d86, github.com/apache/spark/pull/6116

   [SPARK-7677] [STREAMING] Add Kafka modules to the 2.11 build.
   Iulian Dragos <jaguarul@gmail.com>
   2015-05-15 14:57:29 -0700
   Commit: 31e6404, github.com/apache/spark/pull/6149

   [SPARK-7226] [SPARKR] Support math functions in R DataFrame
   qhuang <qian.huang@intel.com>
   2015-05-15 14:06:16 -0700
   Commit: 9ef6d74, github.com/apache/spark/pull/6170

   [SPARK-7296] Add timeline visualization for stages in the UI.
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-05-15 13:54:09 -0700
   Commit: a5f7b3b, github.com/apache/spark/pull/5843

   [SPARK-7504] [YARN] NullPointerException when initializing SparkContext in YARN-cluster mode
   ehnalis <zoltan.zvara@gmail.com>
   2015-05-15 12:14:02 -0700
   Commit: 7dc0ff3, github.com/apache/spark/pull/6083

   [SPARK-7664] [WEBUI] DAG visualization: Fix incorrect link paths of DAG.
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-05-15 11:54:13 -0700
   Commit: e319719, github.com/apache/spark/pull/6184

   [SPARK-5412] [DEPLOY] Cannot bind Master to a specific hostname as per the documentation
   Sean Owen <sowen@cloudera.com>
   2015-05-15 11:30:19 -0700
   Commit: fe3c734, github.com/apache/spark/pull/6185

   [CORE] Protect additional test vars from early GC
   Tim Ellison <t.p.ellison@gmail.com>
   2015-05-15 11:27:24 -0700
   Commit: 866e4b5, github.com/apache/spark/pull/6187

   [SPARK-7233] [CORE] Detect REPL mode once
   Oleksii Kostyliev <etander@gmail.com>, Oleksii Kostyliev <okostyliev@thunderhead.com>
   2015-05-15 11:19:56 -0700
   Commit: c58b9c6, github.com/apache/spark/pull/5835

   [SPARK-7651] [MLLIB] [PYSPARK] GMM predict, predictSoft should raise error on bad input
   FlytxtRnD <meethu.mathew@flytxt.com>
   2015-05-15 10:43:18 -0700
   Commit: dfdae58, github.com/apache/spark/pull/6180

   [SPARK-7668] [MLLIB] Preserve isTransposed property for Matrix after calling map function
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-05-15 10:03:29 -0700
   Commit: d1f5651, github.com/apache/spark/pull/6188

   [SPARK-7503] [YARN] Resources in .sparkStaging directory can't be cleaned up on error
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-05-15 11:37:34 +0100
   Commit: a17a0ee, github.com/apache/spark/pull/6026

   [SPARK-7591] [SQL] Partitioning support API tweaks
   Cheng Lian <lian@databricks.com>
   2015-05-15 16:20:49 +0800
   Commit: bcb2c5d, github.com/apache/spark/pull/6150

   [SPARK-6258] [MLLIB] GaussianMixture Python API parity check
   Yanbo Liang <ybliang8@gmail.com>
   2015-05-15 00:18:39 -0700
   Commit: c0bb974, github.com/apache/spark/pull/6087

   [SPARK-7650] [STREAMING] [WEBUI] Move streaming css and js files to the streaming project
   zsxwing <zsxwing@gmail.com>
   2015-05-14 23:51:41 -0700
   Commit: 0ba99f0, github.com/apache/spark/pull/6160

   [CORE] Remove unreachable Heartbeat message from Worker
   Kan Zhang <kzhang@apache.org>
   2015-05-14 23:50:50 -0700
   Commit: 6742b4e, github.com/apache/spark/pull/6163

   [HOTFIX] Add workaround for SPARK-7660 to fix JavaAPISuite failures.
   Josh Rosen <joshrosen@databricks.com>
   2015-05-14 23:17:41 -0700
   Commit: 1206a55

   [SQL] When creating partitioned table scan, explicitly create UnionRDD.
   Yin Huai <yhuai@databricks.com>
   2015-05-15 12:04:26 +0800
   Commit: 7aa269f, github.com/apache/spark/pull/6162

   [SPARK-7098][SQL] Make the WHERE clause with timestamp show consistent result
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-05-14 20:49:21 -0700
   Commit: bac4522, github.com/apache/spark/pull/5682

   [SPARK-7548] [SQL] Add explode function for DataFrames
   Michael Armbrust <michael@databricks.com>
   2015-05-14 19:49:44 -0700
   Commit: 778a054, github.com/apache/spark/pull/6107

   [SPARK-7619] [PYTHON] fix docstring signature
   Xiangrui Meng <meng@databricks.com>
   2015-05-14 18:16:22 -0700
   Commit: a238c23, github.com/apache/spark/pull/6161

   [SPARK-7648] [MLLIB] Add weights and intercept to GLM wrappers in spark.ml
   Xiangrui Meng <meng@databricks.com>
   2015-05-14 18:13:58 -0700
   Commit: f91bb57, github.com/apache/spark/pull/6156

   [SPARK-7645] [STREAMING] [WEBUI] Show milliseconds in the UI if the batch interval < 1 second
   zsxwing <zsxwing@gmail.com>
   2015-05-14 16:58:36 -0700
   Commit: 79983f1, github.com/apache/spark/pull/6154

   [SPARK-7649] [STREAMING] [WEBUI] Use window.localStorage to store the status rather than the url
   zsxwing <zsxwing@gmail.com>
   2015-05-14 16:57:33 -0700
   Commit: 3358485, github.com/apache/spark/pull/6158

   [SPARK-7643] [UI] use the correct size in RDDPage for storage info and partitions
   Xiangrui Meng <meng@databricks.com>
   2015-05-14 16:56:32 -0700
   Commit: 8d8876d, github.com/apache/spark/pull/6157

   [SPARK-7598] [DEPLOY] Add aliveWorkers metrics in Master
   Rex Xiong <pengx@microsoft.com>
   2015-05-14 16:55:31 -0700
   Commit: 894214f, github.com/apache/spark/pull/6117

   Make SPARK prefix a variable
   tedyu <yuzhihong@gmail.com>
   2015-05-14 15:26:35 -0700
   Commit: fceaffc, github.com/apache/spark/pull/6153

   [SPARK-7278] [PySpark] DateType should find datetime.datetime acceptable
   ksonj <kson@siberie.de>
   2015-05-14 15:10:58 -0700
   Commit: a49a145, github.com/apache/spark/pull/6057

   [SQL][minor] rename apply for QueryPlanner
   Wenchen Fan <cloud0fan@outlook.com>
   2015-05-14 10:25:18 -0700
   Commit: aa8a0f9, github.com/apache/spark/pull/6142

   [SPARK-7249] Updated Hadoop dependencies due to inconsistency in the versions
   FavioVazquez <favio.vazquezp@gmail.com>
   2015-05-14 15:22:58 +0100
   Commit: 67ed0aa, github.com/apache/spark/pull/5786

   [SPARK-7568] [ML] ml.LogisticRegression doesn't output the right prediction
   DB Tsai <dbt@netflix.com>
   2015-05-14 01:26:08 -0700
   Commit: 58534b0, github.com/apache/spark/pull/6109

   [SPARK-7407] [MLLIB] use uid + name to identify parameters
   Xiangrui Meng <meng@databricks.com>
   2015-05-14 01:22:15 -0700
   Commit: e45cd9f, github.com/apache/spark/pull/6019

   [SPARK-7595] [SQL] Window will cause resolve failed with self join
   linweizhong <linweizhong@huawei.com>
   2015-05-14 00:23:27 -0700
   Commit: c80e0cf, github.com/apache/spark/pull/6114

   [SPARK-7620] [ML] [MLLIB] Removed calling size, length in while condition to avoid extra JVM call
   DB Tsai <dbt@netflix.com>
   2015-05-13 22:23:21 -0700
   Commit: 9ab4db2, github.com/apache/spark/pull/6137

   [SPARK-7612] [MLLIB] update NB training to use mllib's BLAS
   Xiangrui Meng <meng@databricks.com>
   2015-05-13 21:27:17 -0700
   Commit: 82f387f, github.com/apache/spark/pull/6128

   [HOT FIX #6125] Do not wait for all stages to start rendering
   Andrew Or <andrew@databricks.com>
   2015-05-13 21:04:13 -0700
   Commit: 2d4a961, github.com/apache/spark/pull/6138

   [HOTFIX] Use 'new Job' in fsBasedParquet.scala
   zsxwing <zsxwing@gmail.com>
   2015-05-13 17:58:29 -0700
   Commit: d518c03, github.com/apache/spark/pull/6136

   [SPARK-6752] [STREAMING] [REVISED] Allow StreamingContext to be recreated from checkpoint and existing SparkContext
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-05-13 17:33:15 -0700
   Commit: aec8394, github.com/apache/spark/pull/6096

   [SPARK-7601] [SQL] Support Insert into JDBC Datasource
   Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
   2015-05-13 17:24:04 -0700
   Commit: 820aaa6, github.com/apache/spark/pull/6121

   [SPARK-7081] Faster sort-based shuffle path using binary processing cache-aware sort
   Josh Rosen <joshrosen@databricks.com>
   2015-05-13 17:07:31 -0700
   Commit: c53ebea, github.com/apache/spark/pull/5868

   [SPARK-7356] [STREAMING] Fix flakey tests in FlumePollingStreamSuite using SparkSink's batch CountDownLatch.
   Hari Shreedharan <hshreedharan@apache.org>
   2015-05-13 16:43:30 -0700
   Commit: 6c0644a, github.com/apache/spark/pull/5918

   [STREAMING] [MINOR] Keep streaming.UIUtils private
   Andrew Or <andrew@databricks.com>
   2015-05-13 16:31:24 -0700
   Commit: e499a1e, github.com/apache/spark/pull/6134

   [SPARK-7502] DAG visualization: gracefully handle removed stages
   Andrew Or <andrew@databricks.com>
   2015-05-13 16:29:52 -0700
   Commit: 895d46a, github.com/apache/spark/pull/6132

   [SPARK-7464] DAG visualization: highlight the same RDDs on hover
   Andrew Or <andrew@databricks.com>
   2015-05-13 16:29:10 -0700
   Commit: 4b4f10b, github.com/apache/spark/pull/6100

   [SPARK-7399] Spark compilation error for scala 2.11
   Andrew Or <andrew@databricks.com>
   2015-05-13 16:28:37 -0700
   Commit: e6b8cef, github.com/apache/spark/pull/6129

   [SPARK-7608] Clean up old state in RDDOperationGraphListener
   Andrew Or <andrew@databricks.com>
   2015-05-13 16:27:48 -0700
   Commit: ec34230, github.com/apache/spark/pull/6125

   [SQL] Move some classes into packages that are more appropriate.
   Reynold Xin <rxin@databricks.com>
   2015-05-13 16:15:31 -0700
   Commit: acd872b, github.com/apache/spark/pull/6108

   [SPARK-7303] [SQL] push down project if possible when the child is sort
   scwf <wangfei1@huawei.com>
   2015-05-13 16:13:48 -0700
   Commit: d5c52d9, github.com/apache/spark/pull/5838

   [SPARK-7382] [MLLIB] Feature Parity in PySpark for ml.classification
   Burak Yavuz <brkyvz@gmail.com>
   2015-05-13 15:13:09 -0700
   Commit: 51230f2, github.com/apache/spark/pull/6106

   [SPARK-7545] [MLLIB] Added check in Bernoulli Naive Bayes to make sure that both training and predict features have values of 0 or 1
   leahmcguire <lmcguire@salesforce.com>
   2015-05-13 14:13:19 -0700
   Commit: d9fb905, github.com/apache/spark/pull/6073

   [SPARK-7593] [ML] Python Api for ml.feature.Bucketizer
   Burak Yavuz <brkyvz@gmail.com>
   2015-05-13 13:21:36 -0700
   Commit: 11911b0, github.com/apache/spark/pull/6124

   [SPARK-7551][DataFrame] support backticks for DataFrame attribute resolution
   Wenchen Fan <cloud0fan@outlook.com>
   2015-05-13 12:47:48 -0700
   Commit: 3a60bcb, github.com/apache/spark/pull/6074

   [SPARK-7567] [SQL] Migrating Parquet data source to FSBasedRelation
   Cheng Lian <lian@databricks.com>
   2015-05-13 11:04:10 -0700
   Commit: 90f304b, github.com/apache/spark/pull/6090

   [SPARK-7589] [STREAMING] [WEBUI] Make "Input Rate" in the Streaming page consistent with other pages
   zsxwing <zsxwing@gmail.com>
   2015-05-13 10:01:26 -0700
   Commit: 10007fb, github.com/apache/spark/pull/6102

   [SPARK-6734] [SQL] Add UDTF.close support in Generate
   Cheng Hao <hao.cheng@intel.com>
   2015-05-14 00:14:59 +0800
   Commit: 42cf4a2, github.com/apache/spark/pull/5383

   [MINOR] [SQL] Removes debugging println
   Cheng Lian <lian@databricks.com>
   2015-05-13 23:40:13 +0800
   Commit: d78f0e1, github.com/apache/spark/pull/6123

   [SQL] In InsertIntoFSBasedRelation.insert, log cause before abort job/task.
   Yin Huai <yhuai@databricks.com>
   2015-05-13 23:36:19 +0800
   Commit: 9ca28d9, github.com/apache/spark/pull/6105

   [SPARK-7599] [SQL] Don't restrict customized output committers to be subclasses of FileOutputCommitter
   Cheng Lian <lian@databricks.com>
   2015-05-13 07:35:55 -0700
   Commit: cb1fe81, github.com/apache/spark/pull/6118

   [SPARK-6568] spark-shell.cmd --jars option does not accept the jar that has space in its path
   Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>, Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-05-13 09:43:40 +0100
   Commit: bfdecac, github.com/apache/spark/pull/5447

   [SPARK-7526] [SPARKR] Specify ip of RBackend, MonitorServer and RRDD Socket server
   linweizhong <linweizhong@huawei.com>
   2015-05-12 23:55:44 -0700
   Commit: 7bd5274, github.com/apache/spark/pull/6053

   [SPARK-7482] [SPARKR] Rename some DataFrame API methods in SparkR to match their counterparts in Scala.
   Sun Rui <rui.sun@intel.com>
   2015-05-12 23:52:30 -0700
   Commit: b18f1c6, github.com/apache/spark/pull/6007

   [SPARK-7566][SQL] Add type to HiveContext.analyzer
   Santiago M. Mola <santi@mola.io>
   2015-05-12 23:44:21 -0700
   Commit: 6ff3379, github.com/apache/spark/pull/6086

   [SPARK-7321][SQL] Add Column expression for conditional statements (when/otherwise)
   Reynold Xin <rxin@databricks.com>, kaka1992 <kaka_1992@163.com>
   2015-05-12 21:43:34 -0700
   Commit: 219a904, github.com/apache/spark/pull/6072

   [SPARK-7588] Document all SQL/DataFrame public methods with @since tag
   Reynold Xin <rxin@databricks.com>
   2015-05-12 18:37:02 -0700
   Commit: bdd5db9, github.com/apache/spark/pull/6101

   [HOTFIX] Use the old Job API to support old Hadoop versions
   zsxwing <zsxwing@gmail.com>
   2015-05-13 08:33:24 +0800
   Commit: 2cc3301, github.com/apache/spark/pull/6095

   [SPARK-7572] [MLLIB] do not import Param/Params under pyspark.ml
   Xiangrui Meng <meng@databricks.com>
   2015-05-12 17:15:39 -0700
   Commit: 08ec1af, github.com/apache/spark/pull/6094

   [SPARK-7554] [STREAMING] Throw exception when an active/stopped StreamingContext is used to create DStreams and output operations
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-05-12 17:07:21 -0700
   Commit: bb81b15, github.com/apache/spark/pull/6099

   [SPARK-7528] [MLLIB] make RankingMetrics Java-friendly
   Xiangrui Meng <meng@databricks.com>
   2015-05-12 16:53:47 -0700
   Commit: 6c292a2, github.com/apache/spark/pull/6098

   [SPARK-7553] [STREAMING] Added methods to maintain a singleton StreamingContext
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-05-12 16:44:14 -0700
   Commit: 91fbd93, github.com/apache/spark/pull/6070

   [SPARK-7573] [ML] OneVsRest cleanups
   Joseph K. Bradley <joseph@databricks.com>
   2015-05-12 16:42:30 -0700
   Commit: 612247f, github.com/apache/spark/pull/6097

   [SPARK-7557] [ML] [DOC] User guide for spark.ml HashingTF, Tokenizer
   Joseph K. Bradley <joseph@databricks.com>
   2015-05-12 16:39:56 -0700
   Commit: d080df1, github.com/apache/spark/pull/6093

   [SPARK-7496] [MLLIB] Update Programming guide with Online LDA
   Yuhao Yang <hhbyyh@gmail.com>
   2015-05-12 15:12:29 -0700
   Commit: fe34a59, github.com/apache/spark/pull/6046

   [SPARK-7406] [STREAMING] [WEBUI] Add tooltips for "Scheduling Delay", "Processing Time" and "Total Delay"
   zsxwing <zsxwing@gmail.com>
   2015-05-12 14:41:21 -0700
   Commit: 221375e, github.com/apache/spark/pull/5952

   [SPARK-7571] [MLLIB] rename Math to math
   Xiangrui Meng <meng@databricks.com>
   2015-05-12 14:39:03 -0700
   Commit: 2555517, github.com/apache/spark/pull/6092

   [SPARK-7484][SQL] Support jdbc connection properties
   Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
   2015-05-12 14:37:23 -0700
   Commit: 32819fc, github.com/apache/spark/pull/6009

   [SPARK-7559] [MLLIB] Bucketizer should include the rightmost boundary in the last bucket.
   Xiangrui Meng <meng@databricks.com>
   2015-05-12 14:24:26 -0700
   Commit: 98ccd93, github.com/apache/spark/pull/6075

   [SPARK-7569][SQL] Better error for invalid binary expressions
   Michael Armbrust <michael@databricks.com>
   2015-05-12 13:36:55 -0700
   Commit: c68485e, github.com/apache/spark/pull/6089

   [SPARK-7015] [MLLIB] [WIP] Multiclass to Binary Reduction: One Against All
   Ram Sriharsha <rsriharsha@hw11853.local>
   2015-05-12 13:35:12 -0700
   Commit: fd16709, github.com/apache/spark/pull/5830

   [SPARK-2018] [CORE] Upgrade LZF library to fix endian serialization p…
   Tim Ellison <t.p.ellison@gmail.com>
   2015-05-12 20:48:26 +0100
   Commit: eadda92, github.com/apache/spark/pull/6077

   [SPARK-7487] [ML] Feature Parity in PySpark for ml.regression
   Burak Yavuz <brkyvz@gmail.com>
   2015-05-12 12:17:05 -0700
   Commit: 432694c, github.com/apache/spark/pull/6016

   [HOT FIX #6076] DAG visualization: curve the edges
   Andrew Or <andrew@databricks.com>
   2015-05-12 12:06:30 -0700
   Commit: ce6c400

   [SPARK-7276] [DATAFRAME] speed up DataFrame.select by collapsing Project
   Wenchen Fan <cloud0fan@outlook.com>
   2015-05-12 11:51:55 -0700
   Commit: 8be43f8, github.com/apache/spark/pull/5831

   [SPARK-7500] DAG visualization: move cluster labeling to dagre-d3
   Andrew Or <andrew@databricks.com>
   2015-05-12 11:17:59 -0700
   Commit: a236104, github.com/apache/spark/pull/6076

   [DataFrame][minor] support column in field accessor
   Wenchen Fan <cloud0fan@outlook.com>
   2015-05-12 10:37:57 -0700
   Commit: ec89286, github.com/apache/spark/pull/6080

   [SPARK-3928] [SPARK-5182] [SQL] Partitioning support for the data sources API
   Cheng Lian <lian@databricks.com>
   2015-05-13 01:32:28 +0800
   Commit: d232813, github.com/apache/spark/pull/5526

   [DataFrame][minor] cleanup unapply methods in DataTypes
   Wenchen Fan <cloud0fan@outlook.com>
   2015-05-12 10:28:40 -0700
   Commit: a9d84a9, github.com/apache/spark/pull/6079

   [SPARK-6876] [PySpark] [SQL] add DataFrame na.replace in pyspark
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-05-12 10:23:41 -0700
   Commit: 653db0a, github.com/apache/spark/pull/6003

   [SPARK-7532] [STREAMING] StreamingContext.start() made to logWarning and not throw exception
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-05-12 08:48:24 -0700
   Commit: 2bbb685, github.com/apache/spark/pull/6060

   [SPARK-7467] DAG visualization: treat checkpoint as an RDD operation
   Andrew Or <andrew@databricks.com>
   2015-05-12 01:40:55 -0700
   Commit: 5601632, github.com/apache/spark/pull/6004

   [SPARK-7485] [BUILD] Remove pyspark files from assembly.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-05-12 01:39:21 -0700
   Commit: afe54b7, github.com/apache/spark/pull/6022

   [MINOR] [PYSPARK] Set PYTHONPATH to python/lib/pyspark.zip rather than python/pyspark
   linweizhong <linweizhong@huawei.com>
   2015-05-12 01:36:27 -0700
   Commit: 4092a2e, github.com/apache/spark/pull/6047

   [SPARK-7534] [CORE] [WEBUI] Fix the Stage table when a stage is missing
   zsxwing <zsxwing@gmail.com>
   2015-05-12 01:34:33 -0700
   Commit: af374ed, github.com/apache/spark/pull/6061

   [SPARK-6994][SQL] Update docs for fetching Row fields by name
   vidmantas zemleris <vidmantas@vinted.com>
   2015-05-11 22:29:24 -0700
   Commit: 6523fb8, github.com/apache/spark/pull/6030

   [SQL] Rename Dialect -> ParserDialect.
   Reynold Xin <rxin@databricks.com>
   2015-05-11 22:06:56 -0700
   Commit: c6b8148, github.com/apache/spark/pull/6071

   [SPARK-7435] [SPARKR] Make DataFrame.show() consistent with that of Scala and pySpark
   Joshi <rekhajoshm@gmail.com>, Rekha Joshi <rekhajoshm@gmail.com>
   2015-05-11 21:02:34 -0700
   Commit: 835a770, github.com/apache/spark/pull/5989

   [SPARK-7509][SQL] DataFrame.drop in Python for dropping columns.
   Reynold Xin <rxin@databricks.com>
   2015-05-11 20:04:36 -0700
   Commit: ed40ab5, github.com/apache/spark/pull/6068

   [SPARK-7437] [SQL] Fold "literal in (item1, item2, ..., literal, ...)" into true or false directly
   Zhongshuai Pei <799203320@qq.com>, DoingDone9 <799203320@qq.com>
   2015-05-11 19:22:44 -0700
   Commit: c30982d, github.com/apache/spark/pull/5972

   [SPARK-7411] [SQL] Support SerDe for HiveQl in CTAS
   Cheng Hao <hao.cheng@intel.com>
   2015-05-11 19:21:16 -0700
   Commit: 1a664a0, github.com/apache/spark/pull/5963

   [SPARK-7324] [SQL] DataFrame.dropDuplicates
   Reynold Xin <rxin@databricks.com>
   2015-05-11 19:15:14 -0700
   Commit: 8a9d234, github.com/apache/spark/pull/6066

   [SPARK-7530] [STREAMING] Added StreamingContext.getState() to expose the current state of the context
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-05-11 18:53:50 -0700
   Commit: c16b47f, github.com/apache/spark/pull/6058

   [SPARK-5893] [ML] Add bucketizer
   Xusen Yin <yinxusen@gmail.com>, Joseph K. Bradley <joseph@databricks.com>
   2015-05-11 18:41:22 -0700
   Commit: f188815, github.com/apache/spark/pull/5980

   Updated DataFrame.saveAsTable Hive warning to include SPARK-7550 ticket.
   Reynold Xin <rxin@databricks.com>
   2015-05-11 18:10:45 -0700
   Commit: e1e599d, github.com/apache/spark/pull/6067

   [SPARK-7462][SQL] Update documentation for retaining grouping columns in DataFrames.
   Reynold Xin <rxin@databricks.com>
   2015-05-11 18:07:12 -0700
   Commit: eaa6116, github.com/apache/spark/pull/6062

   [SPARK-7084] improve saveAsTable documentation
   madhukar <phatak.dev@gmail.com>
   2015-05-11 17:04:11 -0700
   Commit: 0dbfe16, github.com/apache/spark/pull/5654

   [SQL] Show better error messages for incorrect join types in DataFrames.
   Reynold Xin <rxin@databricks.com>
   2015-05-11 17:02:11 -0700
   Commit: 0ff34f80, github.com/apache/spark/pull/6064

   Update Documentation: leftsemi instead of semijoin
   LCY Vincent <lauchunyin@gmail.com>
   2015-05-11 14:48:10 -0700
   Commit: 788503a, github.com/apache/spark/pull/5944

   [STREAMING] [MINOR] Close files correctly when iterator is finished in streaming WAL recovery
   jerryshao <saisai.shao@intel.com>
   2015-05-11 14:38:58 -0700
   Commit: 9e226e1, github.com/apache/spark/pull/6050

   [SPARK-7516] [Minor] [DOC] Replace deprecated inferSchema() with createDataFrame()
   gchen <chenguancheng@gmail.com>
   2015-05-11 14:37:18 -0700
   Commit: 1538b10, github.com/apache/spark/pull/6041

   [SPARK-7508] JettyUtils-generated servlets to log & report all errors
   Steve Loughran <stevel@hortonworks.com>
   2015-05-11 13:35:06 -0700
   Commit: 779174a, github.com/apache/spark/pull/6033

   [SPARK-7462] By default retain group by columns in aggregate
   Reynold Xin <rxin@databricks.com>, Shivaram Venkataraman <shivaram@cs.berkeley.edu>
   2015-05-11 11:35:16 -0700
   Commit: 9c35f02, github.com/apache/spark/pull/5996

   [SPARK-7361] [STREAMING] Throw unambiguous exception when attempting to start multiple StreamingContexts in the same JVM
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-05-11 10:58:56 -0700
   Commit: 11648fa, github.com/apache/spark/pull/5907

   [SPARK-7522] [EXAMPLES] Removed angle brackets from dataFormat option
   Bryan Cutler <bjcutler@us.ibm.com>
   2015-05-11 09:23:47 -0700
   Commit: c234d78, github.com/apache/spark/pull/6049

   [SPARK-6092] [MLLIB] Add RankingMetrics in PySpark/MLlib
   Yanbo Liang <ybliang8@gmail.com>
   2015-05-11 09:14:20 -0700
   Commit: 017f9fa, github.com/apache/spark/pull/6044

   [SPARK-7326] [STREAMING] Performing window() on a WindowedDStream doesn't work all the time
   Wesley Miao <wesley.miao@gmail.com>, Wesley <wesley.miao@autodesk.com>
   2015-05-11 12:20:06 +0100
   Commit: da1be15, github.com/apache/spark/pull/5871

   [SPARK-7519] [SQL] fix minor bugs in thrift server UI
   tianyi <tianyi.asiainfo@gmail.com>
   2015-05-11 14:08:15 +0800
   Commit: fff3c86, github.com/apache/spark/pull/6048

   [SPARK-7512] [SPARKR] Fix RDD's show method to use getJRDD
   Shivaram Venkataraman <shivaram@cs.berkeley.edu>
   2015-05-10 19:49:42 -0700
   Commit: 5f227fd, github.com/apache/spark/pull/6035

   [SPARK-7427] [PYSPARK] Make sharedParams match in Scala, Python
   Glenn Weidner <gweidner@us.ibm.com>
   2015-05-10 19:18:32 -0700
   Commit: 051864e, github.com/apache/spark/pull/6023

   [SPARK-5521] PCA wrapper for easy transform vectors
   Kirill A. Korinskiy <catap@catap.ru>, Joseph K. Bradley <joseph@databricks.com>
   2015-05-10 13:34:00 -0700
   Commit: 193ff69, github.com/apache/spark/pull/4304

   [SPARK-7431] [ML] [PYTHON] Made CrossValidatorModel call parent init in PySpark
   Joseph K. Bradley <joseph@databricks.com>
   2015-05-10 13:29:27 -0700
   Commit: d49b72c, github.com/apache/spark/pull/5968

   [MINOR] [SQL] Fixes variable name typo
   Cheng Lian <lian@databricks.com>
   2015-05-10 21:26:36 +0800
   Commit: fd87b2a, github.com/apache/spark/pull/6038

   [SPARK-7345][SQL] Spark cannot detect renamed columns using JDBC connector
   Oleg Sidorkin <oleg.sidorkin@gmail.com>
   2015-05-10 01:31:34 -0700
   Commit: 5c40403, github.com/apache/spark/pull/6032

   [SPARK-6091] [MLLIB] Add MulticlassMetrics in PySpark/MLlib
   Yanbo Liang <ybliang8@gmail.com>
   2015-05-10 00:57:14 -0700
   Commit: fe46374, github.com/apache/spark/pull/6011

   [SPARK-7475] [MLLIB] adjust ldaExample for online LDA
   Yuhao Yang <hhbyyh@gmail.com>
   2015-05-09 15:40:46 -0700
   Commit: e96fc86, github.com/apache/spark/pull/6000

   [BUILD] Reference fasterxml.jackson.version in sql/core/pom.xml
   tedyu <yuzhihong@gmail.com>
   2015-05-09 13:19:07 -0700
   Commit: 5110f3e, github.com/apache/spark/pull/6031

   Upgrade version of jackson-databind in sql/core/pom.xml
   tedyu <yuzhihong@gmail.com>
   2015-05-09 10:41:30 -0700
   Commit: 6c5b9ff, github.com/apache/spark/pull/6028

   [STREAMING] [DOCS] Fix wrong url about API docs of StreamingListener
   dobashim <dobashim@oss.nttdata.co.jp>
   2015-05-09 10:14:46 +0100
   Commit: 5dbc7bb, github.com/apache/spark/pull/6024

   [SPARK-7403] [WEBUI] Link URL in objects on Timeline View is wrong in case of running on YARN
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-05-09 10:10:29 +0100
   Commit: 869a52d, github.com/apache/spark/pull/5947

   [SPARK-7438] [SPARK CORE] Fixed validation of relativeSD in countApproxDistinct
   Vinod K C <vinod.kc@huawei.com>
   2015-05-09 10:03:15 +0100
   Commit: b0460f4, github.com/apache/spark/pull/5974

   [SPARK-7498] [ML] removed varargs annotation from Params.setDefaults
   Joseph K. Bradley <joseph@databricks.com>
   2015-05-08 21:55:54 -0700
   Commit: 25972d3, github.com/apache/spark/pull/6021

   [SPARK-7262] [ML] Binary LogisticRegression with L1/L2 (elastic net) using OWLQN in new ML package
   DB Tsai <dbt@netflix.com>
   2015-05-08 21:43:05 -0700
   Commit: 80bbe72, github.com/apache/spark/pull/5967

   [SPARK-7375] [SQL] Avoid row copying in exchange when sort.serializeMapOutputs takes effect
   Josh Rosen <joshrosen@databricks.com>
   2015-05-08 22:09:55 -0400
   Commit: 21212a2, github.com/apache/spark/pull/5948

   [SPARK-7231] [SPARKR] Changes to make SparkR DataFrame dplyr friendly.
   Shivaram Venkataraman <shivaram@cs.berkeley.edu>
   2015-05-08 18:29:57 -0700
   Commit: 448ff33, github.com/apache/spark/pull/6005

   [SPARK-7451] [YARN] Preemption of executors is counted as failure causing Spark job to fail
   Ashwin Shankar <ashankar@netflix.com>
   2015-05-08 17:51:00 -0700
   Commit: 959c7b6, github.com/apache/spark/pull/5993

   [SPARK-7488] [ML] Feature Parity in PySpark for ml.recommendation
   Burak Yavuz <brkyvz@gmail.com>
   2015-05-08 17:24:32 -0700
   Commit: 85cab34, github.com/apache/spark/pull/6015

   [SPARK-7237] Clean function in several RDD methods
   tedyu <yuzhihong@gmail.com>
   2015-05-08 17:16:38 -0700
   Commit: 45b6215, github.com/apache/spark/pull/5959

   [SPARK-7469] [SQL] DAG visualization: show SQL query operators
   Andrew Or <andrew@databricks.com>
   2015-05-08 17:15:10 -0700
   Commit: cafffd0, github.com/apache/spark/pull/5999

   [SPARK-6955] Perform port retries at NettyBlockTransferService level
   Aaron Davidson <aaron@databricks.com>
   2015-05-08 17:13:55 -0700
   Commit: 1eae476, github.com/apache/spark/pull/5575

   updated ec2 instance types
   Brendan Collins <bcollins@blueraster.com>
   2015-05-08 15:59:34 -0700
   Commit: 6e35cb5, github.com/apache/spark/pull/6014

   [SPARK-5913] [MLLIB] Python API for ChiSqSelector
   Yanbo Liang <ybliang8@gmail.com>
   2015-05-08 15:48:39 -0700
   Commit: ab48df3, github.com/apache/spark/pull/5939

   [SPARK-4699] [SQL] Make caseSensitive configurable in spark sql analyzer
   Jacky Li <jacky.likun@huawei.com>, wangfei <wangfei1@huawei.com>, scwf <wangfei1@huawei.com>
   2015-05-08 15:25:54 -0700
   Commit: 21bd722, github.com/apache/spark/pull/5806

   [SPARK-7390] [SQL] Only merge other CovarianceCounter when its count is greater than zero
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-05-08 14:41:16 -0700
   Commit: 5205eb4, github.com/apache/spark/pull/5931

   [SPARK-7378] [CORE] Handle deep links to unloaded apps.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-05-08 14:12:58 -0700
   Commit: 3024f6b, github.com/apache/spark/pull/5922

   [MINOR] [CORE] Allow History Server to read kerberos opts from config file.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-05-08 14:10:27 -0700
   Commit: 3da5f8b, github.com/apache/spark/pull/5998

   [SPARK-7466] DAG visualization: fix orphan nodes
   Andrew Or <andrew@databricks.com>
   2015-05-08 14:09:39 -0700
   Commit: ca2f1c5, github.com/apache/spark/pull/6002

   [MINOR] Defeat early garbage collection of test suite variable
   Tim Ellison <t.p.ellison@gmail.com>
   2015-05-08 14:08:52 -0700
   Commit: f734c58, github.com/apache/spark/pull/6010

   [SPARK-7489] [SPARK SHELL] Spark shell crashes when compiled with scala 2.11
   vinodkc <vinod.kc.in@gmail.com>
   2015-05-08 14:07:53 -0700
   Commit: 3b7fb7a, github.com/apache/spark/pull/6013

   [WEBUI] Remove debug feature for vis.js
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-05-08 14:06:37 -0700
   Commit: 1dde3b3, github.com/apache/spark/pull/5994

   [MINOR] Ignore python/lib/pyspark.zip
   zsxwing <zsxwing@gmail.com>
   2015-05-08 14:06:02 -0700
   Commit: ab0caa0, github.com/apache/spark/pull/6017

   [SPARK-7490] [CORE] [Minor] MapOutputTracker.deserializeMapStatuses: close input streams
   Evan Jones <ejones@twitter.com>
   2015-05-08 22:00:39 +0100
   Commit: 6230809, github.com/apache/spark/pull/5982

   [SPARK-6627] Finished rename to ShuffleBlockResolver
   Kay Ousterhout <kayousterhout@gmail.com>
   2015-05-08 12:24:06 -0700
   Commit: 82be68f, github.com/apache/spark/pull/5764

   [SPARK-7133] [SQL] Implement struct, array, and map field accessor
   Wenchen Fan <cloud0fan@outlook.com>
   2015-05-08 11:49:38 -0700
   Commit: f8468c4, github.com/apache/spark/pull/5744

   [SPARK-7298] Harmonize style of new visualizations
   Matei Zaharia <matei@databricks.com>
   2015-05-08 14:41:42 -0400
   Commit: 0b2c252, github.com/apache/spark/pull/5942

   [SPARK-7436] Fixed instantiation of custom recovery mode factory and added tests
   Jacek Lewandowski <lewandowski.jacek@gmail.com>
   2015-05-08 11:38:09 -0700
   Commit: 89d9487, github.com/apache/spark/pull/5976

   [SPARK-6824] Fill the docs for DataFrame API in SparkR
   hqzizania <qian.huang@intel.com>, qhuang <qian.huang@intel.com>
   2015-05-08 11:25:04 -0700
   Commit: 4f01f5b, github.com/apache/spark/pull/5969

   [SPARK-7474] [MLLIB] update ParamGridBuilder doctest
   Xiangrui Meng <meng@databricks.com>
   2015-05-08 11:16:04 -0700
   Commit: 75fed0c, github.com/apache/spark/pull/6001

   [SPARK-7383] [ML] Feature Parity in PySpark for ml.features
   Burak Yavuz <brkyvz@gmail.com>
   2015-05-08 11:14:39 -0700
   Commit: 85e1154, github.com/apache/spark/pull/5991

   [SPARK-3454] separate json endpoints for data in the UI
   Imran Rashid <irashid@cloudera.com>
   2015-05-08 16:54:32 +0100
   Commit: 532bfda, github.com/apache/spark/pull/5940

   [SPARK-6869] [PYSPARK] Add pyspark archives path to PYTHONPATH
   Lianhui Wang <lianhuiwang09@gmail.com>
   2015-05-08 08:44:46 -0500
   Commit: acf4bc1, github.com/apache/spark/pull/5580

   [SPARK-7392] [CORE] bugfix: Kryo buffer size cannot be larger than 2M
   Zhang, Liye <liye.zhang@intel.com>
   2015-05-08 09:10:58 +0100
   Commit: f5e9678, github.com/apache/spark/pull/5934

   [SPARK-7232] [SQL] Add a Substitution batch for spark sql analyzer
   wangfei <wangfei1@huawei.com>
   2015-05-07 22:55:42 -0700
   Commit: bb5872f, github.com/apache/spark/pull/5776

   [SPARK-7470] [SQL] Spark shell SQLContext crashes without hive
   Andrew Or <andrew@databricks.com>
   2015-05-07 22:32:13 -0700
   Commit: 1a3e9e9, github.com/apache/spark/pull/5997

   [SPARK-6986] [SQL] Use Serializer2 in more cases.
   Yin Huai <yhuai@databricks.com>
   2015-05-07 20:59:42 -0700
   Commit: 9d0d289, github.com/apache/spark/pull/5849

   [SPARK-7452] [MLLIB] fix bug in topByKey and update test
   Shuo Xiang <shuoxiangpub@gmail.com>
   2015-05-07 20:55:08 -0700
   Commit: 28d4238, github.com/apache/spark/pull/5990

   [SPARK-6908] [SQL] Use isolated Hive client
   Michael Armbrust <michael@databricks.com>
   2015-05-07 19:36:24 -0700
   Commit: 05454fd, github.com/apache/spark/pull/5876

   [SPARK-7305] [STREAMING] [WEBUI] Make BatchPage show friendly information when jobs are dropped by SparkListener
   zsxwing <zsxwing@gmail.com>
   2015-05-07 17:34:44 -0700
   Commit: 2e8a141, github.com/apache/spark/pull/5840

   [SPARK-7450] Use UNSAFE.getLong() to speed up BitSetMethods#anySet()
   tedyu <yuzhihong@gmail.com>
   2015-05-07 16:53:59 -0700
   Commit: 99897fe, github.com/apache/spark/pull/5897

   [SPARK-2155] [SQL] [WHEN D THEN E] [ELSE F] add CaseKeyWhen for "CASE a WHEN b THEN c * END"
   Wenchen Fan <cloud0fan@outlook.com>
   2015-05-07 16:26:49 -0700
   Commit: 622a0c5, github.com/apache/spark/pull/5979

   [SPARK-5281] [SQL] Registering table on RDD is giving MissingRequirementError
   Iulian Dragos <jaguarul@gmail.com>
   2015-05-07 16:24:11 -0700
   Commit: 9fd25f7, github.com/apache/spark/pull/5981

   [SPARK-7277] [SQL] Throw exception if the property mapred.reduce.tasks is set to -1
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-05-07 16:22:45 -0700
   Commit: 7064ea0, github.com/apache/spark/pull/5811

   [SQL] [MINOR] make star and multialias extend NamedExpression
   scwf <wangfei1@huawei.com>
   2015-05-07 16:21:24 -0700
   Commit: 2425e4d, github.com/apache/spark/pull/5928

   [SPARK-6948] [MLLIB] compress vectors in VectorAssembler
   Xiangrui Meng <meng@databricks.com>
   2015-05-07 15:45:37 -0700
   Commit: 475143a, github.com/apache/spark/pull/5985

   [SPARK-5726] [MLLIB] Elementwise (Hadamard) Vector Product Transformer
   Octavian Geagla <ogeagla@gmail.com>, Joseph K. Bradley <joseph@databricks.com>
   2015-05-07 14:49:55 -0700
   Commit: 76e58b5, github.com/apache/spark/pull/4580

   [SPARK-7328] [MLLIB] [PYSPARK] Pyspark.mllib.linalg.Vectors: Missing items
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-05-07 14:02:05 -0700
   Commit: 4436e26, github.com/apache/spark/pull/5872

   [SPARK-7347] DAG visualization: add tooltips to RDDs
   Andrew Or <andrew@databricks.com>
   2015-05-07 12:29:56 -0700
   Commit: 1b742a4, github.com/apache/spark/pull/5957

   [SPARK-7391] DAG visualization: auto expand if linked from another viz
   Andrew Or <andrew@databricks.com>
   2015-05-07 12:29:18 -0700
   Commit: 800c0fc, github.com/apache/spark/pull/5958

   [SPARK-7373] [MESOS] Add docker support for launching drivers in mesos cluster mode.
   Timothy Chen <tnachen@gmail.com>
   2015-05-07 12:23:16 -0700
   Commit: 226033c, github.com/apache/spark/pull/5917

   [SPARK-7399] [SPARK CORE] Fixed compilation error in scala 2.11
   Tijo Thomas <tijoparacka@gmail.com>
   2015-05-07 12:21:09 -0700
   Commit: d4e31bf, github.com/apache/spark/pull/5966

   [SPARK-5213] [SQL] Remove the duplicated SparkSQLParser
   Cheng Hao <hao.cheng@intel.com>
   2015-05-07 12:09:54 -0700
   Commit: 2b0c423, github.com/apache/spark/pull/5965

   [SPARK-7116] [SQL] [PYSPARK] Remove cache() causing memory leak
   ksonj <kson@siberie.de>
   2015-05-07 12:04:19 -0700
   Commit: 86f141c, github.com/apache/spark/pull/5973

   [SPARK-1442] [SQL] [FOLLOW-UP] Address minor comments in Window Function PR (#5604).
   Yin Huai <yhuai@databricks.com>
   2015-05-07 11:46:49 -0700
   Commit: 9dcf4f7, github.com/apache/spark/pull/5945

   [SPARK-6093] [MLLIB] Add RegressionMetrics in PySpark/MLlib
   Yanbo Liang <ybliang8@gmail.com>
   2015-05-07 11:18:32 -0700
   Commit: ef835dc, github.com/apache/spark/pull/5941

   [SPARK-7118] [Python] Add the coalesce Spark SQL function available in PySpark
   Olivier Girardot <o.girardot@lateral-thoughts.com>
   2015-05-07 10:58:35 -0700
   Commit: 3038b26, github.com/apache/spark/pull/5698

   [SPARK-7388] [SPARK-7383] wrapper for VectorAssembler in Python
   Burak Yavuz <brkyvz@gmail.com>, Xiangrui Meng <meng@databricks.com>
   2015-05-07 10:25:41 -0700
   Commit: 6b9737a, github.com/apache/spark/pull/5930

   [SPARK-7330] [SQL] avoid NPE at jdbc rdd
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-05-07 10:05:01 -0700
   Commit: 84ee348, github.com/apache/spark/pull/5877

   [SPARK-7429] [ML] Params cleanups
   Joseph K. Bradley <joseph@databricks.com>
   2015-05-07 01:28:44 -0700
   Commit: 91ce131, github.com/apache/spark/pull/5960

   [SPARK-7421] [MLLIB] OnlineLDA cleanups
   Joseph K. Bradley <joseph@databricks.com>
   2015-05-07 01:12:14 -0700
   Commit: a038c51, github.com/apache/spark/pull/5956

   [SPARK-7035] Encourage __getitem__ over __getattr__ on column access in the Python DataFrame API
   ksonj <kson@siberie.de>
   2015-05-07 01:02:00 -0700
   Commit: b929a75, github.com/apache/spark/pull/5971

   [SPARK-7295][SQL] bitwise operations for DataFrame DSL
   Shiti <ssaxena.ece@gmail.com>
   2015-05-07 01:00:29 -0700
   Commit: 703211b, github.com/apache/spark/pull/5867

   [SPARK-7217] [STREAMING] Add configuration to control the default behavior of StreamingContext.stop() implicitly calling SparkContext.stop()
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-05-07 00:24:44 -0700
   Commit: cb13c98, github.com/apache/spark/pull/5929

   [SPARK-7430] [STREAMING] [TEST] General improvements to streaming tests to increase debuggability
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-05-07 00:21:10 -0700
   Commit: 065d114, github.com/apache/spark/pull/5961

   [SPARK-5938] [SPARK-5443] [SQL] Improve JsonRDD performance
   Nathan Howell <nhowell@godaddy.com>
   2015-05-06 22:56:53 -0700
   Commit: 2337ccc1, github.com/apache/spark/pull/5801

   [SPARK-6812] [SPARKR] filter() on DataFrame does not work as expected.
   Sun Rui <rui.sun@intel.com>
   2015-05-06 22:48:16 -0700
   Commit: 4948f42, github.com/apache/spark/pull/5938

   [SPARK-7432] [MLLIB] disable cv doctest
   Xiangrui Meng <meng@databricks.com>
   2015-05-06 22:29:07 -0700
   Commit: fb4967b, github.com/apache/spark/pull/5962

   [SPARK-7405] [STREAMING] Fix the bug that ReceiverInputDStream doesn't report InputInfo
   zsxwing <zsxwing@gmail.com>
   2015-05-06 18:07:00 -0700
   Commit: d6e76cb, github.com/apache/spark/pull/5950

   [HOT FIX] For DAG visualization #5954
   Andrew Or <andrew@databricks.com>
   2015-05-06 18:02:08 -0700
   Commit: 85a644b

   [SPARK-7371] [SPARK-7377] [SPARK-7408] DAG visualization addendum (#5729)
   Andrew Or <andrew@databricks.com>
   2015-05-06 17:52:34 -0700
   Commit: 76e8344, github.com/apache/spark/pull/5954

   [SPARK-7396] [STREAMING] [EXAMPLE] Update KafkaWordCountProducer to use new Producer API
   jerryshao <saisai.shao@intel.com>
   2015-05-06 17:44:43 -0700
   Commit: ba24dfa, github.com/apache/spark/pull/5936

   [SPARK-6799] [SPARKR] Remove SparkR RDD examples, add dataframe examples
   Shivaram Venkataraman <shivaram@cs.berkeley.edu>
   2015-05-06 17:28:11 -0700
   Commit: 4b91e18, github.com/apache/spark/pull/5949

   [HOT FIX] [SPARK-7418] Ignore flaky SparkSubmitUtilsSuite test
   Andrew Or <andrew@databricks.com>
   2015-05-06 17:08:39 -0700
   Commit: c0ec20a

   [SPARK-5995] [ML] Make Prediction dev API public
   Joseph K. Bradley <joseph@databricks.com>
   2015-05-06 16:15:51 -0700
   Commit: b681b93, github.com/apache/spark/pull/5913

   [HOT-FIX] Move HiveWindowFunctionQuerySuite.scala to hive compatibility dir.
   Yin Huai <yhuai@databricks.com>
   2015-05-06 14:48:25 -0700
   Commit: 14bcb84, github.com/apache/spark/pull/5951

   Add `Private` annotation.
   Josh Rosen <joshrosen@databricks.com>
   2015-05-06 11:03:17 -0700
   Commit: 2163367

   [SPARK-7311] Introduce internal Serializer API for determining if serializers support object relocation
   Josh Rosen <joshrosen@databricks.com>
   2015-05-06 10:52:55 -0700
   Commit: d651e28, github.com/apache/spark/pull/5924

   [SPARK-1442] [SQL] Window Function Support for Spark SQL
   Yin Huai <yhuai@databricks.com>
   2015-05-06 10:43:00 -0700
   Commit: b521a3b, github.com/apache/spark/pull/5604

   [SPARK-6201] [SQL] promote string and do widen types for IN
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-05-06 10:30:42 -0700
   Commit: 7212897, github.com/apache/spark/pull/4945

   [SPARK-5456] [SQL] fix decimal compare for jdbc rdd
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-05-06 10:05:10 -0700
   Commit: f1a5caf, github.com/apache/spark/pull/5803

   [SQL] JavaDoc update for various DataFrame functions.
   Reynold Xin <rxin@databricks.com>
   2015-05-06 08:50:56 -0700
   Commit: 389b755, github.com/apache/spark/pull/5935

   [SPARK-6940] [MLLIB] Add CrossValidator to Python ML pipeline API
   Xiangrui Meng <meng@databricks.com>
   2015-05-06 01:28:43 -0700
   Commit: 3e27a54, github.com/apache/spark/pull/5926

   [SPARK-7384][Core][Tests] Fix flaky tests for distributed mode in BroadcastSuite
   zsxwing <zsxwing@gmail.com>
   2015-05-05 23:25:28 -0700
   Commit: 20f9237, github.com/apache/spark/pull/5925

   [SPARK-6267] [MLLIB] Python API for IsotonicRegression
   Yanbo Liang <ybliang8@gmail.com>, Xiangrui Meng <meng@databricks.com>
   2015-05-05 22:57:13 -0700
   Commit: 384ac3c, github.com/apache/spark/pull/5890

   [SPARK-7358][SQL] Move DataFrame mathfunctions into functions
   Burak Yavuz <brkyvz@gmail.com>
   2015-05-05 22:56:01 -0700
   Commit: 8aa6681, github.com/apache/spark/pull/5923

   [SPARK-6841] [SPARKR] add support for mean, median, stdev etc.
   qhuang <qian.huang@intel.com>
   2015-05-05 20:39:56 -0700
   Commit: b5cd7dc, github.com/apache/spark/pull/5446

   Revert "[SPARK-3454] separate json endpoints for data in the UI"
   Reynold Xin <rxin@databricks.com>
   2015-05-05 19:28:35 -0700
   Commit: 765f6e1

   [SPARK-6231][SQL/DF] Automatically resolve join condition ambiguity for self-joins.
   Reynold Xin <rxin@databricks.com>
   2015-05-05 18:59:46 -0700
   Commit: e61083c, github.com/apache/spark/pull/5919

   Some minor cleanup after SPARK-4550.
   Sandy Ryza <sandy@cloudera.com>
   2015-05-05 18:32:16 -0700
   Commit: 762ff2e, github.com/apache/spark/pull/5916

   [SPARK-7230] [SPARKR] Make RDD private in SparkR.
   Shivaram Venkataraman <shivaram@cs.berkeley.edu>
   2015-05-05 14:40:33 -0700
   Commit: 4afb578, github.com/apache/spark/pull/5895

   [SQL][Minor] make StringComparison extend ExpectsInputTypes
   wangfei <wangfei1@huawei.com>
   2015-05-05 14:24:37 -0700
   Commit: b6566a2, github.com/apache/spark/pull/5905

   [SPARK-7351] [STREAMING] [DOCS] Add spark.streaming.ui.retainedBatches to docs
   zsxwing <zsxwing@gmail.com>
   2015-05-05 13:42:23 -0700
   Commit: 4c95fe5, github.com/apache/spark/pull/5899

   [SPARK-7294][SQL] ADD BETWEEN
   云峤 <chensong.cs@alibaba-inc.com>, kaka1992 <kaka_1992@163.com>
   2015-05-05 13:23:53 -0700
   Commit: c68d0e2, github.com/apache/spark/pull/5839

   [SPARK-6939] [STREAMING] [WEBUI] Add timeline and histogram graphs for streaming statistics
   zsxwing <zsxwing@gmail.com>
   2015-05-05 12:52:16 -0700
   Commit: 8109c9e, github.com/apache/spark/pull/5533

   [SPARK-5888] [MLLIB] Add OneHotEncoder as a Transformer
   Sandy Ryza <sandy@cloudera.com>
   2015-05-05 12:34:02 -0700
   Commit: 94ac9eb, github.com/apache/spark/pull/5500

   [SPARK-7333] [MLLIB] Add BinaryClassificationEvaluator to PySpark
   Xiangrui Meng <meng@databricks.com>
   2015-05-05 11:45:37 -0700
   Commit: dfb6bfc, github.com/apache/spark/pull/5885

   [SPARK-7243][SQL] Reduce size for Contingency Tables in DataFrames
   Burak Yavuz <brkyvz@gmail.com>
   2015-05-05 11:01:25 -0700
   Commit: 598902b, github.com/apache/spark/pull/5900

   [SPARK-7007] [CORE] Add a metric source for ExecutorAllocationManager
   jerryshao <saisai.shao@intel.com>
   2015-05-05 09:43:49 -0700
   Commit: 29350ee, github.com/apache/spark/pull/5589

   [SPARK-7318] [STREAMING] DStream cleans objects that are not closures
   Andrew Or <andrew@databricks.com>
   2015-05-05 09:37:49 -0700
   Commit: acc877a, github.com/apache/spark/pull/5860

   [SPARK-7237] Many user provided closures are not actually cleaned
   Andrew Or <andrew@databricks.com>
   2015-05-05 09:37:04 -0700
   Commit: 01d4022, github.com/apache/spark/pull/5787

   [SPARK-6612] [MLLIB] [PYSPARK] Python KMeans parity
   Hrishikesh Subramonian <hrishikesh.subramonian@flytxt.com>
   2015-05-05 07:57:39 -0700
   Commit: 8b63103, github.com/apache/spark/pull/5647

   [SPARK-7202] [MLLIB] [PYSPARK] Add SparseMatrixPickler to SerDe
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-05-05 07:53:11 -0700
   Commit: cd55e9a, github.com/apache/spark/pull/5775

   [SPARK-7350] [STREAMING] [WEBUI] Attach the Streaming tab when calling ssc.start()
   zsxwing <zsxwing@gmail.com>
   2015-05-05 15:09:58 +0100
   Commit: 49923f7, github.com/apache/spark/pull/5898

   [SPARK-5074] [CORE] [TESTS] Fix the flaky test 'run shuffle with map stage failure' in DAGSchedulerSuite
   zsxwing <zsxwing@gmail.com>
   2015-05-05 15:04:14 +0100
   Commit: 6f35dac, github.com/apache/spark/pull/5903

   [MINOR] Minor update for document
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-05-05 14:44:02 +0100
   Commit: d288322, github.com/apache/spark/pull/5906

   [SPARK-3454] separate json endpoints for data in the UI
   Imran Rashid <irashid@cloudera.com>
   2015-05-05 07:25:40 -0500
   Commit: ff8b449, github.com/apache/spark/pull/4435

   [SPARK-5112] Expose SizeEstimator as a developer api
   Sandy Ryza <sandy@cloudera.com>
   2015-05-05 12:38:46 +0100
   Commit: 0327ca2, github.com/apache/spark/pull/3913

   [SPARK-6653] [YARN] New config to specify port for sparkYarnAM actor system
   shekhar.bansal <shekhar.bansal@guavus.com>
   2015-05-05 11:09:51 +0100
   Commit: 93af96a, github.com/apache/spark/pull/5719

   [SPARK-7341] [STREAMING] [TESTS] Fix the flaky test: org.apache.spark.stre...
   zsxwing <zsxwing@gmail.com>
   2015-05-05 02:15:39 -0700
   Commit: 0634510, github.com/apache/spark/pull/5891

   [SPARK-7113] [STREAMING] Support input information reporting for Direct Kafka stream
   jerryshao <saisai.shao@intel.com>
   2015-05-05 02:01:06 -0700
   Commit: becdb81, github.com/apache/spark/pull/5879

   [HOTFIX] [TEST] Ignoring flaky tests
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-05-05 01:58:51 -0700
   Commit: e8f847a, github.com/apache/spark/pull/5901

   [SPARK-7139] [STREAMING] Allow received block metadata to be saved to WAL and recovered on driver failure
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-05-05 01:45:19 -0700
   Commit: ae27c0e, github.com/apache/spark/pull/5732

   [MINOR] [BUILD] Declare ivy dependency in root pom.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-05-05 08:56:16 +0100
   Commit: 5160437, github.com/apache/spark/pull/5893

   [SPARK-7314] [SPARK-3524] [PYSPARK] upgrade Pyrolite to 4.4
   Xiangrui Meng <meng@databricks.com>
   2015-05-04 23:52:42 -0700
   Commit: 21ed108, github.com/apache/spark/pull/5850

   [SPARK-7236] [CORE] Fix to prevent AkkaUtils askWithReply from sleeping on final attempt
   Bryan Cutler <bjcutler@us.ibm.com>
   2015-05-04 18:29:22 -0700
   Commit: 48655d1, github.com/apache/spark/pull/5896

   [SPARK-7266] Add ExpectsInputTypes to expressions when possible.
   Reynold Xin <rxin@databricks.com>
   2015-05-04 18:03:07 -0700
   Commit: 1388a46, github.com/apache/spark/pull/5796

   [SPARK-7243][SQL] Contingency Tables for DataFrames
   Burak Yavuz <brkyvz@gmail.com>
   2015-05-04 17:02:49 -0700
   Commit: ecf0d8a, github.com/apache/spark/pull/5842

   [SPARK-6943] [SPARK-6944] DAG visualization on SparkUI
   Andrew Or <andrew@databricks.com>
   2015-05-04 16:21:36 -0700
   Commit: 863ec0c, github.com/apache/spark/pull/5729

   [SPARK-7319][SQL] Improve the output from DataFrame.show()
   云峤 <chensong.cs@alibaba-inc.com>
   2015-05-04 12:08:38 -0700
   Commit: 34edaa8, github.com/apache/spark/pull/5865

   [SPARK-5956] [MLLIB] Pipeline components should be copyable.
   Xiangrui Meng <meng@databricks.com>
   2015-05-04 11:28:59 -0700
   Commit: 893b310, github.com/apache/spark/pull/5820

   [SPARK-5100] [SQL] add webui for thriftserver
   tianyi <tianyi.asiainfo@gmail.com>
   2015-05-04 16:59:34 +0800
   Commit: 343d3bf, github.com/apache/spark/pull/5730

   [SPARK-5563] [MLLIB] LDA with online variational inference
   Yuhao Yang <hhbyyh@gmail.com>, Joseph K. Bradley <joseph@databricks.com>
   2015-05-04 00:06:25 -0700
   Commit: 3539cb7, github.com/apache/spark/pull/4419

   [SPARK-7241] Pearson correlation for DataFrames
   Burak Yavuz <brkyvz@gmail.com>
   2015-05-03 21:44:39 -0700
   Commit: 9646018, github.com/apache/spark/pull/5858

   [SPARK-7329] [MLLIB] simplify ParamGridBuilder impl
   Xiangrui Meng <meng@databricks.com>
   2015-05-03 18:06:48 -0700
   Commit: 1ffa8cb, github.com/apache/spark/pull/5873

   [SPARK-7302] [DOCS] SPARK building documentation still mentions building for yarn 0.23
   Sean Owen <sowen@cloudera.com>
   2015-05-03 21:22:31 +0100
   Commit: 9e25b09, github.com/apache/spark/pull/5863

   [SPARK-6907] [SQL] Isolated client for HiveMetastore
   Michael Armbrust <michael@databricks.com>
   2015-05-03 13:12:50 -0700
   Commit: daa70bf, github.com/apache/spark/pull/5851

   [SPARK-7022] [PYSPARK] [ML] Add ML.Tuning.ParamGridBuilder to PySpark
   Omede Firouz <ofirouz@palantir.com>, Omede <omedefirouz@gmail.com>
   2015-05-03 11:42:02 -0700
   Commit: f4af925, github.com/apache/spark/pull/5601

   [SPARK-7031] [THRIFTSERVER] let thrift server take SPARK_DAEMON_MEMORY and SPARK_DAEMON_JAVA_OPTS
   WangTaoTheTonic <wangtao111@huawei.com>
   2015-05-03 00:47:47 +0100
   Commit: 49549d5, github.com/apache/spark/pull/5609

   [SPARK-7255] [STREAMING] [DOCUMENTATION] Added documentation for spark.streaming.kafka.maxRetries
   BenFradet <benjamin.fradet@gmail.com>
   2015-05-02 23:41:14 +0100
   Commit: ea841ef, github.com/apache/spark/pull/5808

   [SPARK-5213] [SQL] Pluggable SQL Parser Support
   Cheng Hao <hao.cheng@intel.com>, scwf <wangfei1@huawei.com>
   2015-05-02 15:20:07 -0700
   Commit: 5d6b90d, github.com/apache/spark/pull/5827

   [MINOR] [HIVE] Fix QueryPartitionSuite.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-05-02 23:10:35 +0100
   Commit: 82c8c37, github.com/apache/spark/pull/5854

   [SPARK-6030] [CORE] Using simulated field layout method to compute class shellSize
   Ye Xianjin <advancedxy@gmail.com>
   2015-05-02 23:08:09 +0100
   Commit: bfcd528, github.com/apache/spark/pull/4783

   [SPARK-7323] [SPARK CORE] Use insertAll instead of insert while merging combiners in reducer
   Mridul Muralidharan <mridulm@yahoo-inc.com>
   2015-05-02 23:05:51 +0100
   Commit: da30352, github.com/apache/spark/pull/5862

   [SPARK-3444] Fix typo in Dataframes.py introduced in []
   Dean Chen <deanchen5@gmail.com>
   2015-05-02 23:04:13 +0100
   Commit: 856a571, github.com/apache/spark/pull/5866

   [SPARK-7315] [STREAMING] [TEST] Fix flaky WALBackedBlockRDDSuite
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-05-02 01:53:14 -0700
   Commit: ecc6eb5, github.com/apache/spark/pull/5853

   [SPARK-7120] [SPARK-7121] Closure cleaner nesting + documentation + tests
   Andrew Or <andrew@databricks.com>
   2015-05-01 23:57:58 -0700
   Commit: 7394e7a, github.com/apache/spark/pull/5685

   [SPARK-7242] added python api for freqItems in DataFrames
   Burak Yavuz <brkyvz@gmail.com>
   2015-05-01 23:43:24 -0700
   Commit: 2e0f357, github.com/apache/spark/pull/5859

   [SPARK-7317] [Shuffle] Expose shuffle handle
   Mridul Muralidharan <mridulm@yahoo-inc.com>
   2015-05-01 21:23:42 -0700
   Commit: b79aeb9, github.com/apache/spark/pull/5857

   [SPARK-6229] Add SASL encryption to network library.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-05-01 19:01:46 -0700
   Commit: 38d4e9e, github.com/apache/spark/pull/5377

   [SPARK-2691] [MESOS] Support for Mesos DockerInfo
   Chris Heller <hellertime@gmail.com>
   2015-05-01 18:41:22 -0700
   Commit: 8f50a07, github.com/apache/spark/pull/3074

   [SPARK-6443] [SPARK SUBMIT] Could not submit app in standalone cluster mode when HA is enabled
   WangTaoTheTonic <wangtao111@huawei.com>
   2015-05-01 18:38:20 -0700
   Commit: b4b43df, github.com/apache/spark/pull/5116

   [SPARK-7216] [MESOS] Add driver details page to Mesos cluster UI.
   Timothy Chen <tnachen@gmail.com>
   2015-05-01 18:36:42 -0700
   Commit: 2022193, github.com/apache/spark/pull/5763

   [SPARK-6954] [YARN] ExecutorAllocationManager can end up requesting a negative n...
   Sandy Ryza <sandy@cloudera.com>
   2015-05-01 18:32:46 -0700
   Commit: 099327d, github.com/apache/spark/pull/5704

   [SPARK-3444] Provide an easy way to change log level
   Holden Karau <holden@pigscanfly.ca>
   2015-05-01 18:02:10 -0700
   Commit: ae98eec, github.com/apache/spark/pull/5791

   [SPARK-2808][Streaming][Kafka] update kafka to 0.8.2
   cody koeninger <cody@koeninger.org>, Helena Edelson <helena.edelson@datastax.com>
   2015-05-01 17:54:56 -0700
   Commit: 4786484, github.com/apache/spark/pull/4537

   [SPARK-7112][Streaming][WIP] Add an InputInfoTracker to track all the input streams
   jerryshao <saisai.shao@intel.com>, Saisai Shao <saisai.shao@intel.com>
   2015-05-01 17:46:06 -0700
   Commit: b88c275, github.com/apache/spark/pull/5680

   [SPARK-7309] [CORE] [STREAMING] Shutdown the thread pools in ReceivedBlockHandler and DAGScheduler
   zsxwing <zsxwing@gmail.com>
   2015-05-01 17:41:55 -0700
   Commit: ebc25a4, github.com/apache/spark/pull/5845

   [SPARK-6999] [SQL] Remove the infinite recursive method (useless)
   Cheng Hao <hao.cheng@intel.com>
   2015-05-01 19:39:30 -0500
   Commit: 98e7045, github.com/apache/spark/pull/5804

   [SPARK-7304] [BUILD] Include $@ in call to mvn consistently in make-distribution.sh
   Rajendra Gokhale (rvgcentos) <rvg@cloudera.com>
   2015-05-01 17:01:36 -0700
   Commit: e6fb377, github.com/apache/spark/pull/5846

   [SPARK-7312][SQL] SPARK-6913 broke jdk6 build
   Yin Huai <yhuai@databricks.com>
   2015-05-01 16:47:00 -0700
   Commit: 41c6a44, github.com/apache/spark/pull/5847

   Ignore flaky test in SparkSubmitUtilsSuite
   Patrick Wendell <patrick@databricks.com>
   2015-05-01 14:42:58 -0700
   Commit: 5c1faba

   [SPARK-5342] [YARN] Allow long running Spark apps to run on secure YARN/HDFS
   Hari Shreedharan <hshreedharan@apache.org>
   2015-05-01 15:32:09 -0500
   Commit: b1f4ca8, github.com/apache/spark/pull/5823

   [SPARK-7240][SQL] Single pass covariance calculation for dataframes
   Burak Yavuz <brkyvz@gmail.com>
   2015-05-01 13:29:17 -0700
   Commit: 4dc8d74, github.com/apache/spark/pull/5825

   [SPARK-7281] [YARN] Add option to set AM's lib path in client mode.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-05-01 21:20:46 +0100
   Commit: 7b5dd3e, github.com/apache/spark/pull/5813

   [SPARK-7213] [YARN] Check for read permissions before copying a Hadoop config file
   Nishkam Ravi <nravi@cloudera.com>, nishkamravi2 <nishkamravi@gmail.com>, nravi <nravi@c1704.halxg.cloudera.com>
   2015-05-01 21:14:16 +0100
   Commit: f53a488, github.com/apache/spark/pull/5760

   Revert "[SPARK-7224] added mock repository generator for --packages tests"
   Patrick Wendell <patrick@databricks.com>
   2015-05-01 13:01:43 -0700
   Commit: c6d9a42

   Revert "[SPARK-7287] enabled fixed test"
   Patrick Wendell <patrick@databricks.com>
   2015-05-01 13:01:14 -0700
   Commit: 58d6584

   [SPARK-7274] [SQL] Create Column expression for array/struct creation.
   Reynold Xin <rxin@databricks.com>
   2015-05-01 12:49:02 -0700
   Commit: 3753776, github.com/apache/spark/pull/5802

   [SPARK-7183] [NETWORK] Fix memory leak of TransportRequestHandler.streamIds
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-05-01 11:59:12 -0700
   Commit: 1686032, github.com/apache/spark/pull/5743

   [SPARK-6846] [WEBUI] [HOTFIX] return to GET for kill link in UI since YARN AM won't proxy POST
   Sean Owen <sowen@cloudera.com>
   2015-05-01 19:57:37 +0100
   Commit: 1262e31, github.com/apache/spark/pull/5837

   [SPARK-5854] personalized page rank
   Dan McClary <dan.mcclary@gmail.com>, dwmclary <dan.mcclary@gmail.com>
   2015-05-01 11:55:43 -0700
   Commit: 7d42722, github.com/apache/spark/pull/4774

   changing persistence engine trait to an abstract class
   niranda <niranda.perera@gmail.com>
   2015-05-01 11:27:45 -0700
   Commit: 27de6fe, github.com/apache/spark/pull/5832

   Limit help option regex
   Chris Biow <chris.biow@10gen.com>
   2015-05-01 19:26:55 +0100
   Commit: c8c481d, github.com/apache/spark/pull/5816

   [SPARK-5891] [ML] Add Binarizer ML Transformer
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-05-01 08:31:01 -0700
   Commit: 7630213, github.com/apache/spark/pull/5699

   [SPARK-3066] [MLLIB] Support recommendAll in matrix factorization model
   Debasish Das <debasish.das@one.verizon.com>, Xiangrui Meng <meng@databricks.com>
   2015-05-01 08:27:46 -0700
   Commit: 3b514af, github.com/apache/spark/pull/3098

   [SPARK-4705] Handle multiple app attempts event logs, history server.
   Marcelo Vanzin <vanzin@cloudera.com>, twinkle sachdeva <twinkle@kite.ggn.in.guavus.com>, twinkle.sachdeva <twinkle.sachdeva@guavus.com>, twinkle sachdeva <twinkle.sachdeva@guavus.com>
   2015-05-01 09:50:55 -0500
   Commit: 3052f49, github.com/apache/spark/pull/5432

   [SPARK-3468] [WEBUI] Timeline-View feature
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-05-01 01:39:56 -0700
   Commit: 7fe0f3f, github.com/apache/spark/pull/2342

   [SPARK-6257] [PYSPARK] [MLLIB] MLlib API missing items in Recommendation
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-04-30 23:51:00 -0700
   Commit: c24aeb6, github.com/apache/spark/pull/5807

   [SPARK-7291] [CORE] Fix a flaky test in AkkaRpcEnvSuite
   zsxwing <zsxwing@gmail.com>
   2015-04-30 23:44:33 -0700
   Commit: 14b3288, github.com/apache/spark/pull/5822

   [SPARK-7287] enabled fixed test
   Burak Yavuz <brkyvz@gmail.com>
   2015-04-30 23:39:58 -0700
   Commit: 7cf1eb7, github.com/apache/spark/pull/5826

   [SPARK-4550] In sort-based shuffle, store map outputs in serialized form
   Sandy Ryza <sandy@cloudera.com>
   2015-04-30 23:14:14 -0700
   Commit: 0a2b15c, github.com/apache/spark/pull/4450

   HOTFIX: Disable buggy dependency checker
   Patrick Wendell <patrick@databricks.com>
   2015-04-30 22:39:58 -0700
   Commit: a9fc505

   [SPARK-6479] [BLOCK MANAGER] Create off-heap block storage API
   Zhan Zhang <zhazhan@gmail.com>
   2015-04-30 22:24:31 -0700
   Commit: 36a7a68, github.com/apache/spark/pull/5430

   [SPARK-7248] implemented random number generators for DataFrames
   Burak Yavuz <brkyvz@gmail.com>
   2015-04-30 21:56:03 -0700
   Commit: b5347a4, github.com/apache/spark/pull/5819

   [SPARK-7282] [STREAMING] Fix the race conditions in StreamingListenerSuite
   zsxwing <zsxwing@gmail.com>
   2015-04-30 21:32:11 -0700
   Commit: 69a739c, github.com/apache/spark/pull/5812

   Revert "[SPARK-5213] [SQL] Pluggable SQL Parser Support"
   Patrick Wendell <patrick@databricks.com>
   2015-04-30 20:33:36 -0700
   Commit: beeafcf

   [SPARK-7123] [SQL] support table.star in sqlcontext
   scwf <wangfei1@huawei.com>
   2015-04-30 18:50:14 -0700
   Commit: 473552f, github.com/apache/spark/pull/5690

   [SPARK-5213] [SQL] Pluggable SQL Parser Support
   Cheng Hao <hao.cheng@intel.com>
   2015-04-30 18:49:06 -0700
   Commit: 3ba5aaa, github.com/apache/spark/pull/4015

   [SPARK-6913][SQL] Fixed "java.sql.SQLException: No suitable driver found"
   Vyacheslav Baranov <slavik.baranov@gmail.com>
   2015-04-30 18:45:14 -0700
   Commit: e991255, github.com/apache/spark/pull/5782

   [SPARK-7109] [SQL] Push down left side filter for left semi join
   wangfei <wangfei1@huawei.com>, scwf <wangfei1@huawei.com>
   2015-04-30 18:18:54 -0700
   Commit: a0d8a61, github.com/apache/spark/pull/5677

   [SPARK-7093] [SQL] Using newPredicate in NestedLoopJoin to enable code generation
   scwf <wangfei1@huawei.com>
   2015-04-30 18:15:56 -0700
   Commit: 0797338, github.com/apache/spark/pull/5665

   [SPARK-7280][SQL] Add "drop" column/s on a data frame
   rakeshchalasani <vnit.rakesh@gmail.com>
   2015-04-30 17:42:50 -0700
   Commit: ee04413, github.com/apache/spark/pull/5818

   [SPARK-7242][SQL][MLLIB] Frequent items for DataFrames
   Burak Yavuz <brkyvz@gmail.com>
   2015-04-30 16:40:32 -0700
   Commit: 149b3ee, github.com/apache/spark/pull/5799

   [SPARK-7279] Removed diffSum, which is theoretically zero, in LinearRegression; fixed code formatting
   DB Tsai <dbt@netflix.com>
   2015-04-30 16:26:51 -0700
   Commit: 1c3e402, github.com/apache/spark/pull/5809

   [Build] Enable MiMa checks for SQL
   Josh Rosen <joshrosen@databricks.com>
   2015-04-30 16:23:01 -0700
   Commit: fa01bec, github.com/apache/spark/pull/5727

   [SPARK-7267][SQL] Push down Project when its child is Limit
   Zhongshuai Pei <799203320@qq.com>, DoingDone9 <799203320@qq.com>
   2015-04-30 15:22:13 -0700
   Commit: 77cc25f, github.com/apache/spark/pull/5797

   [SPARK-7288] Suppress compiler warnings due to use of sun.misc.Unsafe; add facade in front of Unsafe; remove use of Unsafe.setMemory
   Josh Rosen <joshrosen@databricks.com>
   2015-04-30 15:21:00 -0700
   Commit: 07a8620, github.com/apache/spark/pull/5814

   [SPARK-7196][SQL] Support precision and scale of decimal type for JDBC
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-04-30 15:13:43 -0700
   Commit: 6702324, github.com/apache/spark/pull/5777

   Revert "[SPARK-5342] [YARN] Allow long running Spark apps to run on secure YARN/HDFS"
   Patrick Wendell <patrick@databricks.com>
   2015-04-30 14:59:20 -0700
   Commit: e0628f2

   [SPARK-7207] [ML] [BUILD] Added ml.recommendation, ml.regression to SparkBuild
   Joseph K. Bradley <joseph@databricks.com>
   2015-04-30 14:39:27 -0700
   Commit: adbdb19, github.com/apache/spark/pull/5758

   [SPARK-5342] [YARN] Allow long running Spark apps to run on secure YARN/HDFS
   Hari Shreedharan <hshreedharan@apache.org>
   2015-04-30 13:03:23 -0500
   Commit: 6c65da6, github.com/apache/spark/pull/4688

   [SPARK-7224] added mock repository generator for --packages tests
   Burak Yavuz <brkyvz@gmail.com>
   2015-04-30 10:19:08 -0700
   Commit: 7dacc08, github.com/apache/spark/pull/5790

   [HOTFIX] Disabling flaky test (fix in progress as part of SPARK-7224)
   Patrick Wendell <patrick@databricks.com>
   2015-04-30 01:02:33 -0700
   Commit: 47bf406

   [SPARK-1406] MLlib PMML model export
   Vincenzo Selvaggio <vselvaggio@hotmail.it>, Xiangrui Meng <meng@databricks.com>, selvinsource <vselvaggio@hotmail.it>
   2015-04-29 23:21:21 -0700
   Commit: 254e050, github.com/apache/spark/pull/3062

   [SPARK-7225][SQL] CombineLimits optimizer does not work
   Zhongshuai Pei <799203320@qq.com>, DoingDone9 <799203320@qq.com>
   2015-04-29 22:44:14 -0700
   Commit: 4459514, github.com/apache/spark/pull/5770

   Some code clean up.
   DB Tsai <dbt@netflix.com>
   2015-04-29 21:44:41 -0700
   Commit: ba49eb1, github.com/apache/spark/pull/5794

   [SPARK-7156][SQL] Addressed follow up comments for randomSplit
   Burak Yavuz <brkyvz@gmail.com>
   2015-04-29 19:13:47 -0700
   Commit: 5553198, github.com/apache/spark/pull/5795

   [SPARK-7234][SQL] Fix DateType mismatch when codegen on.
   云峤 <chensong.cs@alibaba-inc.com>
   2015-04-29 18:23:42 -0700
   Commit: 7143f6e, github.com/apache/spark/pull/5778

   [SPARK-6862] [STREAMING] [WEBUI] Add BatchPage to display details of a batch
   zsxwing <zsxwing@gmail.com>
   2015-04-29 18:22:14 -0700
   Commit: 1b7106b, github.com/apache/spark/pull/5473

   [SPARK-7176] [ML] Add validation functionality to Param
   Joseph K. Bradley <joseph@databricks.com>
   2015-04-29 17:26:46 -0700
   Commit: 114bad6, github.com/apache/spark/pull/5740

   [SQL] [Minor] Print detailed query execution info when the Spark answer is wrong
   wangfei <wangfei1@huawei.com>
   2015-04-29 17:00:24 -0700
   Commit: 1fdfdb4, github.com/apache/spark/pull/5774

   [SPARK-7259] [ML] VectorIndexer: do not copy non-ML metadata to output column
   Joseph K. Bradley <joseph@databricks.com>
   2015-04-29 16:35:17 -0700
   Commit: b1ef6a6, github.com/apache/spark/pull/5789

   [SPARK-7229] [SQL] SpecificMutableRow should take integer type as internal representation for Date
   Cheng Hao <hao.cheng@intel.com>
   2015-04-29 16:23:34 -0700
   Commit: f8cbb0a, github.com/apache/spark/pull/5772

   [SPARK-7155] [CORE] Allow newAPIHadoopFile to support comma-separated list of files as input
   yongtang <yongtang@users.noreply.github.com>
   2015-04-29 23:55:51 +0100
   Commit: 3fc6cfd, github.com/apache/spark/pull/5708

   [SPARK-7181] [CORE] fix infinite loop in ExternalSorter's mergeWithAggregation
   Qiping Li <liqiping1991@gmail.com>
   2015-04-29 23:52:16 +0100
   Commit: 7f4b583, github.com/apache/spark/pull/5737

   [SPARK-7156][SQL] support RandomSplit in DataFrames
   Burak Yavuz <brkyvz@gmail.com>
   2015-04-29 15:34:05 -0700
   Commit: d7dbce8, github.com/apache/spark/pull/5761

   [SPARK-6529] [ML] Add Word2Vec transformer
   Xusen Yin <yinxusen@gmail.com>
   2015-04-29 14:55:32 -0700
   Commit: c9d530e, github.com/apache/spark/pull/5596

   [SPARK-7222] [ML] Added mathematical derivation in comment and compressed the model, removed the correction terms in LinearRegression with ElasticNet
   DB Tsai <dbt@netflix.com>
   2015-04-29 14:53:37 -0700
   Commit: 15995c8, github.com/apache/spark/pull/5767

   [SPARK-6629] cancelJobGroup() may not work for jobs whose job groups are inherited from parent threads
   Josh Rosen <joshrosen@databricks.com>
   2015-04-29 13:31:52 -0700
   Commit: 3a180c1, github.com/apache/spark/pull/5288

   [SPARK-6752] [STREAMING] [REOPENED] Allow StreamingContext to be recreated from checkpoint and existing SparkContext
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-04-29 13:10:31 -0700
   Commit: a9c4e29, github.com/apache/spark/pull/5773

   [SPARK-7056] [STREAMING] Make the Write Ahead Log pluggable
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-04-29 13:06:11 -0700
   Commit: 1868bd4, github.com/apache/spark/pull/5645

   Fix a typo of "threshold"
   Xusen Yin <yinxusen@gmail.com>
   2015-04-29 10:13:48 -0700
   Commit: c0c0ba6, github.com/apache/spark/pull/5769

   [SQL][Minor] fix java doc for DataFrame.agg
   Wenchen Fan <cloud0fan@outlook.com>
   2015-04-29 09:49:24 -0700
   Commit: 81ea42b, github.com/apache/spark/pull/5712

   Better error message on access to non-existing attribute
   ksonj <kson@siberie.de>
   2015-04-29 09:48:47 -0700
   Commit: 3df9c5d, github.com/apache/spark/pull/5771

   [SPARK-7223] Rename RPC askWithReply -> askWithRetry, sendWithReply -> ask.
   Reynold Xin <rxin@databricks.com>
   2015-04-29 09:46:37 -0700
   Commit: 687273d, github.com/apache/spark/pull/5768

   [SPARK-6918] [YARN] Secure HBase support.
   Dean Chen <deanchen5@gmail.com>
   2015-04-29 08:58:33 -0500
   Commit: baed3f2, github.com/apache/spark/pull/5586

   [SPARK-7076][SPARK-7077][SPARK-7080][SQL] Use managed memory for aggregations
   Josh Rosen <joshrosen@databricks.com>
   2015-04-29 01:07:26 -0700
   Commit: f49284b, github.com/apache/spark/pull/5725

   [SPARK-7204] [SQL] Fix callSite for Dataframe and SQL operations
   Patrick Wendell <patrick@databricks.com>
   2015-04-29 00:35:08 -0700
   Commit: 1fd6ed9, github.com/apache/spark/pull/5757

   [SPARK-7188] added python support for math DataFrame functions
   Burak Yavuz <brkyvz@gmail.com>
   2015-04-29 00:09:24 -0700
   Commit: fe917f5, github.com/apache/spark/pull/5750

   MAINTENANCE: Automated closing of pull requests.
   Patrick Wendell <patrick@databricks.com>
   2015-04-28 23:38:59 -0700
   Commit: 8dee274, github.com/apache/spark/pull/3205

   [SPARK-7205] Support `.ivy2/local` and `.m2/repositories/` in --packages
   Burak Yavuz <brkyvz@gmail.com>
   2015-04-28 23:05:02 -0700
   Commit: f98773a, github.com/apache/spark/pull/5755

   [SPARK-7215] made coalesce and repartition a part of the query plan
   Burak Yavuz <brkyvz@gmail.com>
   2015-04-28 22:48:04 -0700
   Commit: 271c4c6, github.com/apache/spark/pull/5762

   [SPARK-6756] [MLLIB] add toSparse, toDense, numActives, numNonzeros, and compressed to Vector
   Xiangrui Meng <meng@databricks.com>
   2015-04-28 21:49:53 -0700
   Commit: 5ef006f, github.com/apache/spark/pull/5756

   [SPARK-7208] [ML] [PYTHON] Added Matrix, SparseMatrix to __all__ list in linalg.py
   Joseph K. Bradley <joseph@databricks.com>
   2015-04-28 21:15:47 -0700
   Commit: a8aeadb, github.com/apache/spark/pull/5759

   [SPARK-7138] [STREAMING] Add method to BlockGenerator to add multiple records to BlockGenerator with single callback
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-04-28 19:31:57 -0700
   Commit: 5c8f4bd, github.com/apache/spark/pull/5695

   [SPARK-6965] [MLLIB] StringIndexer handles numeric input.
   Xiangrui Meng <meng@databricks.com>
   2015-04-28 17:41:09 -0700
   Commit: d36e673, github.com/apache/spark/pull/5753

   Closes #4807 Closes #5055 Closes #3583
   Xiangrui Meng <meng@databricks.com>
   2015-04-28 14:21:25 -0700
   Commit: 555213e

   [SPARK-7201] [MLLIB] move Identifiable to ml.util
   Xiangrui Meng <meng@databricks.com>
   2015-04-28 14:07:26 -0700
   Commit: f0a1f90, github.com/apache/spark/pull/5749

   [MINOR] [CORE] Warn users who try to cache RDDs with dynamic allocation on.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-04-28 13:49:29 -0700
   Commit: 28b1af7, github.com/apache/spark/pull/5751

   [SPARK-5338] [MESOS] Add cluster mode support for Mesos
   Timothy Chen <tnachen@gmail.com>, Luc Bourlier <luc.bourlier@typesafe.com>
   2015-04-28 13:31:08 -0700
   Commit: 53befac, github.com/apache/spark/pull/5144

   [SPARK-6314] [CORE] handle JsonParseException for history server
   Zhang, Liye <liye.zhang@intel.com>
   2015-04-28 12:33:48 -0700
   Commit: 8009810, github.com/apache/spark/pull/5736

   [SPARK-5932] [CORE] Use consistent naming for size properties
   Ilya Ganelin <ilya.ganelin@capitalone.com>
   2015-04-28 12:18:55 -0700
   Commit: 2d222fb, github.com/apache/spark/pull/5574

   [SPARK-4286] Add an external shuffle service that can be run as a daemon.
   Iulian Dragos <jaguarul@gmail.com>
   2015-04-28 12:08:18 -0700
   Commit: 8aab94d, github.com/apache/spark/pull/4990

   [Core][test][minor] replace try finally block with tryWithSafeFinally
   Zhang, Liye <liye.zhang@intel.com>
   2015-04-28 10:24:00 -0700
   Commit: 52ccf1d, github.com/apache/spark/pull/5739

   [SPARK-7140] [MLLIB] only scan the first 16 entries in Vector.hashCode
   Xiangrui Meng <meng@databricks.com>
   2015-04-28 09:59:36 -0700
   Commit: b14cd23, github.com/apache/spark/pull/5697

   [SPARK-5253] [ML] LinearRegression with L1/L2 (ElasticNet) using OWLQN
   DB Tsai <dbt@netflix.com>, DB Tsai <dbtsai@alpinenow.com>
   2015-04-28 09:46:08 -0700
   Commit: 6a827d5, github.com/apache/spark/pull/4259

   [SPARK-6435] spark-shell --jars option does not add all jars to classpath
   Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
   2015-04-28 07:55:21 -0400
   Commit: 268c419, github.com/apache/spark/pull/5227

   [SPARK-7100] [MLLIB] Fix persisted RDD leak in GradientBoostedTrees
   Jim Carroll <jim@dontcallme.com>
   2015-04-28 07:51:02 -0400
   Commit: 75905c5, github.com/apache/spark/pull/5669

   [SPARK-7168] [BUILD] Update plugin versions in Maven build and centralize versions
   Sean Owen <sowen@cloudera.com>
   2015-04-28 07:48:34 -0400
   Commit: 7f3b3b7, github.com/apache/spark/pull/5720

   [SPARK-6352] [SQL] Custom parquet output committer
   Pei-Lun Lee <pllee@appier.com>
   2015-04-28 16:50:18 +0800
   Commit: e13cd86, github.com/apache/spark/pull/5525

   [SPARK-7135][SQL] DataFrame expression for monotonically increasing IDs.
   Reynold Xin <rxin@databricks.com>
   2015-04-28 00:39:08 -0700
   Commit: d94cd1a, github.com/apache/spark/pull/5709

   [SPARK-7187] SerializationDebugger should not crash user code
   Andrew Or <andrew@databricks.com>
   2015-04-28 00:38:14 -0700
   Commit: bf35edd, github.com/apache/spark/pull/5734

   [SPARK-5946] [STREAMING] Add Python API for direct Kafka stream
   jerryshao <saisai.shao@intel.com>, Saisai Shao <saisai.shao@intel.com>
   2015-04-27 23:48:02 -0700
   Commit: 9e4e82b, github.com/apache/spark/pull/4723

   [SPARK-6829] Added math functions for DataFrames
   Burak Yavuz <brkyvz@gmail.com>
   2015-04-27 23:10:14 -0700
   Commit: 29576e7, github.com/apache/spark/pull/5616

   [SPARK-7174][Core] Move calling `TaskScheduler.executorHeartbeatReceived` to another thread
   zsxwing <zsxwing@gmail.com>
   2015-04-27 21:45:40 -0700
   Commit: 874a2ca, github.com/apache/spark/pull/5723

   [SPARK-7090] [MLLIB] Introduce LDAOptimizer to LDA to further improve extensibility
   Yuhao Yang <hhbyyh@gmail.com>
   2015-04-27 19:02:51 -0700
   Commit: 4d9e560, github.com/apache/spark/pull/5661

   [SPARK-7162] [YARN] Launcher error in yarn-client
   GuoQiang Li <witgo@qq.com>
   2015-04-27 19:52:41 -0400
   Commit: 62888a4, github.com/apache/spark/pull/5716

   [SPARK-7145] [CORE] commons-lang (2.x) classes used instead of commons-lang3 (3.x); commons-io used without dependency
   Sean Owen <sowen@cloudera.com>
   2015-04-27 19:50:55 -0400
   Commit: ab5adb7, github.com/apache/spark/pull/5703

   [SPARK-3090] [CORE] Stop SparkContext if user forgets to.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-04-27 19:46:17 -0400
   Commit: 5d45e1f, github.com/apache/spark/pull/5696

   [SPARK-6738] [CORE] Improve size estimation for large arrays
   Hong Shen <hongshen@tencent.com>
   2015-04-27 18:57:31 -0400
   Commit: 8e1c00d, github.com/apache/spark/pull/5608

   [SPARK-7103] Fix crash with SparkContext.union when RDD has no partitioner
   Steven She <steven@canopylabs.com>
   2015-04-27 18:55:02 -0400
   Commit: b9de9e0, github.com/apache/spark/pull/5679

   [SPARK-6991] [SPARKR] Adds support for zipPartitions.
   hlin09 <hlin09pu@gmail.com>
   2015-04-27 15:04:37 -0700
   Commit: ca9f4eb, github.com/apache/spark/pull/5568

   SPARK-7107 Add parameter for zookeeper.znode.parent to hbase_inputformat...
   tedyu <yuzhihong@gmail.com>
   2015-04-27 14:42:40 -0700
   Commit: ef82bdd, github.com/apache/spark/pull/5673

   [SPARK-6856] [R] Make RDD information more useful in SparkR
   Jeff Harrison <jeffrharrison@gmail.com>
   2015-04-27 13:38:25 -0700
   Commit: 7078f60, github.com/apache/spark/pull/5667

   [SPARK-4925] Publish Spark SQL hive-thriftserver maven artifact
   Misha Chernetsov <chernetsov@gmail.com>
   2015-04-27 11:27:56 -0700
   Commit: 998aac2, github.com/apache/spark/pull/5429

   [SPARK-6505] [SQL] Remove the reflection call in HiveFunctionWrapper
   baishuo <vc_java@hotmail.com>
   2015-04-27 14:08:05 +0800
   Commit: 82bb7fd, github.com/apache/spark/pull/5660

   [SQL][Minor] rename DataTypeParser.apply to DataTypeParser.parse
   wangfei <wangfei1@huawei.com>
   2015-04-26 21:08:47 -0700
   Commit: d188b8b, github.com/apache/spark/pull/5710

   [SPARK-7152][SQL] Add a Column expression for partition ID.
   Reynold Xin <rxin@databricks.com>
   2015-04-26 11:46:58 -0700
   Commit: ca55dc9, github.com/apache/spark/pull/5705

   [MINOR] [MLLIB] Refactor toString method in MLLIB
   Alain <aihe@usc.edu>
   2015-04-26 07:14:24 -0400
   Commit: 9a5bbe0, github.com/apache/spark/pull/5687

   [SPARK-6014] [CORE] [HOTFIX] Add try-catch block around ShutDownHook
   Nishkam Ravi <nravi@cloudera.com>, nishkamravi2 <nishkamravi@gmail.com>, nravi <nravi@c1704.halxg.cloudera.com>
   2015-04-25 20:02:23 -0400
   Commit: f5473c2, github.com/apache/spark/pull/5672

   [SPARK-7092] Update spark scala version to 2.11.6
   Prashant Sharma <prashant.s@imaginea.com>
   2015-04-25 18:07:34 -0400
   Commit: a11c868, github.com/apache/spark/pull/5662

   [SQL] Update SQL readme to include instructions on generating golden answer files based on Hive 0.13.1.
   Yin Huai <yhuai@databricks.com>
   2015-04-25 13:43:39 -0700
   Commit: aa6966f, github.com/apache/spark/pull/5702

   [SPARK-6113] [ML] Tree ensembles for Pipelines API
   Joseph K. Bradley <joseph@databricks.com>
   2015-04-25 12:27:19 -0700
   Commit: a7160c4, github.com/apache/spark/pull/5626

   Revert "[SPARK-6752][Streaming] Allow StreamingContext to be recreated from checkpoint and existing SparkContext"
   Patrick Wendell <patrick@databricks.com>
   2015-04-25 10:37:34 -0700
   Commit: a61d65f

   update the deprecated CountMinSketchMonoid function to TopPctCMS function
   KeheCAI <caikehe@gmail.com>
   2015-04-25 08:42:38 -0400
   Commit: cca9905, github.com/apache/spark/pull/5629

   [SPARK-7136][Docs] Spark SQL and DataFrame Guide fix example file and paths
   Deborah Siegel <deborah.siegel@gmail.com>, DEBORAH SIEGEL <deborahsiegel@d-140-142-0-49.dhcp4.washington.edu>, DEBORAH SIEGEL <deborahsiegel@DEBORAHs-MacBook-Pro.local>, DEBORAH SIEGEL <deborahsiegel@d-69-91-154-197.dhcp4.washington.edu>
   2015-04-24 20:25:07 -0700
   Commit: 59b7cfc, github.com/apache/spark/pull/5693

   [PySpark][Minor] Update SQL example so that it reads the file correctly
   linweizhong <linweizhong@huawei.com>
   2015-04-24 20:23:19 -0700
   Commit: d874f8b, github.com/apache/spark/pull/5684

   [SPARK-6122] [CORE] Upgrade tachyon-client version to 0.6.3
   Calvin Jia <jia.calvin@gmail.com>
   2015-04-24 17:57:41 -0400
   Commit: 438859e, github.com/apache/spark/pull/5354

   [SPARK-6852] [SPARKR] Accept numeric as numPartitions in SparkR.
   Sun Rui <rui.sun@intel.com>
   2015-04-24 12:52:07 -0700
   Commit: caf0136, github.com/apache/spark/pull/5613

   [SPARK-7033] [SPARKR] Clean usage of split. Use partition instead where applicable.
   Sun Rui <rui.sun@intel.com>
   2015-04-24 11:00:19 -0700
   Commit: ebb77b2, github.com/apache/spark/pull/5628

   [SPARK-6528] [ML] Add IDF transformer
   Xusen Yin <yinxusen@gmail.com>
   2015-04-24 08:29:49 -0700
   Commit: 6e57d57, github.com/apache/spark/pull/5266

   [SPARK-7115] [MLLIB] skip the very first 1 in poly expansion
   Xiangrui Meng <meng@databricks.com>
   2015-04-24 08:27:48 -0700
   Commit: 78b39c7, github.com/apache/spark/pull/5681

   [SPARK-5894] [ML] Add polynomial mapper
   Xusen Yin <yinxusen@gmail.com>, Xiangrui Meng <meng@databricks.com>
   2015-04-24 00:39:29 -0700
   Commit: 8509519, github.com/apache/spark/pull/5245

   Fixed a typo from the previous commit.
   Reynold Xin <rxin@databricks.com>
   2015-04-23 22:39:00 -0700
   Commit: 4c722d7

   [SQL] Fixed expression data type matching.
   Reynold Xin <rxin@databricks.com>
   2015-04-23 21:21:03 -0700
   Commit: d3a302d, github.com/apache/spark/pull/5675

   Update sql-programming-guide.md
   Ken Geis <geis.ken@gmail.com>
   2015-04-23 20:45:33 -0700
   Commit: 67bccbd, github.com/apache/spark/pull/5674

   [SPARK-7060][SQL] Add alias function to python dataframe
   Yin Huai <yhuai@databricks.com>
   2015-04-23 18:52:55 -0700
   Commit: 2d010f7, github.com/apache/spark/pull/5634

   [SPARK-7037] [CORE] Inconsistent behavior for non-spark config properties in spark-shell and spark-submit
   Cheolsoo Park <cheolsoop@netflix.com>
   2015-04-23 20:10:55 -0400
   Commit: 336f7f5, github.com/apache/spark/pull/5617

   [SPARK-6818] [SPARKR] Support column deletion in SparkR DataFrame API.
   Sun Rui <rui.sun@intel.com>
   2015-04-23 16:08:14 -0700
   Commit: 73db132, github.com/apache/spark/pull/5655

   [SQL] Break dataTypes.scala into multiple files.
   Reynold Xin <rxin@databricks.com>
   2015-04-23 14:48:19 -0700
   Commit: 6220d93, github.com/apache/spark/pull/5670

   [SPARK-7070] [MLLIB] LDA.setBeta should call setTopicConcentration.
   Xiangrui Meng <meng@databricks.com>
   2015-04-23 14:46:54 -0700
   Commit: 1ed46a6, github.com/apache/spark/pull/5649

   [SPARK-7087] [BUILD] Fix path issue change version script
   Tijo Thomas <tijoparacka@gmail.com>
   2015-04-23 17:23:15 -0400
   Commit: 6d0749c, github.com/apache/spark/pull/5656

   [SPARK-6879] [HISTORYSERVER] check if app is completed before clean it up
   WangTaoTheTonic <wangtao111@huawei.com>
   2015-04-23 17:20:17 -0400
   Commit: baa83a9, github.com/apache/spark/pull/5491

   [SPARK-7085][MLlib] Fix miniBatchFraction parameter in train method called with 4 arguments
   wizz <wizz@wizz-dev01.kawasaki.flab.fujitsu.com>
   2015-04-23 14:00:07 -0700
   Commit: 3e91cc2, github.com/apache/spark/pull/5658

   [SPARK-7058] Include RDD deserialization time in "task deserialization time" metric
   Josh Rosen <joshrosen@databricks.com>
   2015-04-23 13:19:03 -0700
   Commit: 6afde2c, github.com/apache/spark/pull/5635

   [SPARK-7055][SQL]Use correct ClassLoader for JDBC Driver in JDBCRDD.getConnector
   Vinod K C <vinod.kc@huawei.com>
   2015-04-23 12:00:23 -0700
   Commit: c1213e6, github.com/apache/spark/pull/5633

   [SPARK-6752][Streaming] Allow StreamingContext to be recreated from checkpoint and existing SparkContext
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-04-23 11:29:34 -0700
   Commit: 534f2a4, github.com/apache/spark/pull/5428

   [SPARK-7044] [SQL] Fix the deadlock in script transformation
   Cheng Hao <hao.cheng@intel.com>
   2015-04-23 10:35:22 -0700
   Commit: cc48e63, github.com/apache/spark/pull/5625

   [minor][streaming]fixed scala string interpolation error
   Prabeesh K <prabeesh.k@namshi.com>
   2015-04-23 10:33:13 -0700
   Commit: 975f53e, github.com/apache/spark/pull/5653

   [HOTFIX] [SQL] Fix compilation for scala 2.11.
   Prashant Sharma <prashant.s@imaginea.com>
   2015-04-23 16:45:26 +0530
   Commit: a7d65d3, github.com/apache/spark/pull/5652

   [SPARK-7069][SQL] Rename NativeType -> AtomicType.
   Reynold Xin <rxin@databricks.com>
   2015-04-23 01:43:40 -0700
   Commit: f60bece, github.com/apache/spark/pull/5651

   [SPARK-7068][SQL] Remove PrimitiveType
   Reynold Xin <rxin@databricks.com>
   2015-04-22 23:55:20 -0700
   Commit: 29163c5, github.com/apache/spark/pull/5646

   [MLlib] Add support for BooleanType to VectorAssembler.
   Reynold Xin <rxin@databricks.com>
   2015-04-22 23:54:48 -0700
   Commit: 2d33323, github.com/apache/spark/pull/5648

   [HOTFIX][SQL] Fix broken cached test
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-04-22 22:18:56 -0700
   Commit: d9e70f3, github.com/apache/spark/pull/5640

   [SPARK-7046] Remove InputMetrics from BlockResult
   Kay Ousterhout <kayousterhout@gmail.com>
   2015-04-22 21:42:09 -0700
   Commit: 03e85b4, github.com/apache/spark/pull/5627

   [SPARK-7066][MLlib] VectorAssembler should use NumericType not NativeType.
   Reynold Xin <rxin@databricks.com>
   2015-04-22 21:35:42 -0700
   Commit: d206860, github.com/apache/spark/pull/5642

   [MLlib] UnaryTransformer nullability should not depend on PrimitiveType.
   Reynold Xin <rxin@databricks.com>
   2015-04-22 21:35:12 -0700
   Commit: 1b85e08, github.com/apache/spark/pull/5644

   Disable flaky test: ReceiverSuite "block generator throttling".
   Reynold Xin <rxin@databricks.com>
   2015-04-22 21:24:22 -0700
   Commit: b69c4f9

   [SPARK-6967] [SQL] fix date type conversion in JDBCRDD
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-04-22 19:14:28 -0700
   Commit: 04525c0, github.com/apache/spark/pull/5590

   [SPARK-6827] [MLLIB] Wrap FPGrowthModel.freqItemsets and make it consistent with Java API
   Yanbo Liang <ybliang8@gmail.com>
   2015-04-22 17:22:26 -0700
   Commit: f4f3998, github.com/apache/spark/pull/5614

   [SPARK-7059][SQL] Create a DataFrame join API to facilitate equijoin.
   Reynold Xin <rxin@databricks.com>
   2015-04-22 15:26:58 -0700
   Commit: baf865d, github.com/apache/spark/pull/5638

   [SPARK-7039][SQL]JDBCRDD: Add support on type NVARCHAR
   szheng79 <szheng.code@gmail.com>
   2015-04-22 13:02:55 -0700
   Commit: fbe7106, github.com/apache/spark/pull/5618

   [SQL] Rename some apply functions.
   Reynold Xin <rxin@databricks.com>
   2015-04-22 11:18:01 -0700
   Commit: cdf0328, github.com/apache/spark/pull/5624

   [SPARK-7052][Core] Add ThreadUtils and move thread methods from Utils to ThreadUtils
   zsxwing <zsxwing@gmail.com>
   2015-04-22 11:08:59 -0700
   Commit: 33b8562, github.com/apache/spark/pull/5631

   [SPARK-6889] [DOCS] CONTRIBUTING.md updates to accompany contribution doc updates
   Sean Owen <sowen@cloudera.com>
   2015-04-21 22:34:31 -0700
   Commit: bdc5c16, github.com/apache/spark/pull/5623

   [SPARK-6113] [ML] Small cleanups after original tree API PR
   Joseph K. Bradley <joseph@databricks.com>
   2015-04-21 21:44:44 -0700
   Commit: 607eff0, github.com/apache/spark/pull/5567

   [MINOR] Comment improvements in ExternalSorter.
   Patrick Wendell <patrick@databricks.com>
   2015-04-21 21:04:04 -0700
   Commit: 70f9f8f, github.com/apache/spark/pull/5620

   [SPARK-6490][Docs] Add docs for rpc configurations
   zsxwing <zsxwing@gmail.com>
   2015-04-21 18:37:53 -0700
   Commit: 3a3f710, github.com/apache/spark/pull/5607

   [SPARK-1684] [PROJECT INFRA] Merge script should standardize SPARK-XXX prefix
   texasmichelle <texasmichelle@gmail.com>
   2015-04-21 18:08:29 -0700
   Commit: a0761ec, github.com/apache/spark/pull/5149

   Closes #5427
   Reynold Xin <rxin@databricks.com>
   2015-04-21 17:52:52 -0700
   Commit: 41ef78a

   [SPARK-6953] [PySpark] speed up python tests
   Reynold Xin <rxin@databricks.com>, Xiangrui Meng <meng@databricks.com>
   2015-04-21 17:49:55 -0700
   Commit: 3134c3f, github.com/apache/spark/pull/5605

   [SPARK-6014] [core] Revamp Spark shutdown hooks, fix shutdown races.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-04-21 20:33:57 -0400
   Commit: e72c16e, github.com/apache/spark/pull/5560

   Avoid warning message about invalid refuse_seconds value in Mesos >=0.21...
   mweindel <m.weindel@usu-software.de>
   2015-04-21 20:19:33 -0400
   Commit: b063a61, github.com/apache/spark/pull/5597

   [Minor][MLLIB] Fix a minor formatting bug in toString method in Node.scala
   Alain <aihe@usc.edu>
   2015-04-21 16:46:17 -0700
   Commit: ae036d0, github.com/apache/spark/pull/5621

   [SPARK-7036][MLLIB] ALS.train should support DataFrames in PySpark
   Xiangrui Meng <meng@databricks.com>
   2015-04-21 16:44:52 -0700
   Commit: 686dd74, github.com/apache/spark/pull/5619

   [SPARK-6065] [MLlib] Optimize word2vec.findSynonyms using blas calls
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-04-21 16:42:45 -0700
   Commit: 7fe6142, github.com/apache/spark/pull/5467

   [minor] [build] Set java options when generating mima ignores.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-04-21 16:35:37 -0700
   Commit: a70e849, github.com/apache/spark/pull/5615

   [SPARK-3386] Share and reuse SerializerInstances in shuffle paths
   Josh Rosen <joshrosen@databricks.com>
   2015-04-21 16:24:15 -0700
   Commit: f83c0f1, github.com/apache/spark/pull/5606

   [SPARK-5817] [SQL] Fix bug of udtf with column names
   Cheng Hao <hao.cheng@intel.com>
   2015-04-21 15:11:15 -0700
   Commit: 7662ec2, github.com/apache/spark/pull/4602

   [SPARK-6996][SQL] Support map types in java beans
   Punya Biswal <pbiswal@palantir.com>
   2015-04-21 14:50:02 -0700
   Commit: 2a24bf9, github.com/apache/spark/pull/5578

   [SPARK-6969][SQL] Refresh the cached table when REFRESH TABLE is used
   Yin Huai <yhuai@databricks.com>
   2015-04-21 14:48:42 -0700
   Commit: 6265cba, github.com/apache/spark/pull/5583

   [SQL][minor] make it more clear that we only need to re-throw GetField exception for UnresolvedAttribute
   Wenchen Fan <cloud0fan@outlook.com>
   2015-04-21 14:48:02 -0700
   Commit: 03fd921, github.com/apache/spark/pull/5588

   [SPARK-6994] Allow fetching field values by name in sql.Row
   vidmantas zemleris <vidmantas@vinted.com>
   2015-04-21 14:47:09 -0700
   Commit: 2e8c6ca, github.com/apache/spark/pull/5573

   [SPARK-7011] Build(compilation) fails with scala 2.11 option, because a protected[sql] type is accessed in ml package.
   Prashant Sharma <prashant.s@imaginea.com>
   2015-04-21 14:43:46 -0700
   Commit: 04bf34e, github.com/apache/spark/pull/5593

   [SPARK-6845] [MLlib] [PySpark] Add isTransposed flag to DenseMatrix
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-04-21 14:36:50 -0700
   Commit: 45c47fa, github.com/apache/spark/pull/5455

   SPARK-3276 Added a new configuration spark.streaming.minRememberDuration
   emres <emre.sevinc@gmail.com>
   2015-04-21 16:39:56 -0400
   Commit: c25ca7c, github.com/apache/spark/pull/5438

   [SPARK-5360] [SPARK-6606] Eliminate duplicate objects in serialized CoGroupedRDD
   Kay Ousterhout <kayousterhout@gmail.com>
   2015-04-21 11:01:18 -0700
   Commit: c035c0f, github.com/apache/spark/pull/4145

   [SPARK-6985][streaming] Receiver maxRate over 1000 causes a StackOverflowError
   David McGuire <david.mcguire2@nike.com>
   2015-04-21 07:21:10 -0400
   Commit: 5fea3e5, github.com/apache/spark/pull/5559

   [SPARK-5990] [MLLIB] Model import/export for IsotonicRegression
   Yanbo Liang <ybliang8@gmail.com>
   2015-04-21 00:14:16 -0700
   Commit: 1f2f723, github.com/apache/spark/pull/5270

   [SPARK-6949] [SQL] [PySpark] Support Date/Timestamp in Column expression
   Davies Liu <davies@databricks.com>
   2015-04-21 00:08:18 -0700
   Commit: ab9128f, github.com/apache/spark/pull/5570

   [SPARK-6490][Core] Add spark.rpc.* and deprecate spark.akka.*
   zsxwing <zsxwing@gmail.com>
   2015-04-20 23:18:42 -0700
   Commit: 8136810, github.com/apache/spark/pull/5595

   [SPARK-6635][SQL] DataFrame.withColumn should replace columns with identical column names
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-04-20 18:54:01 -0700
   Commit: c736220, github.com/apache/spark/pull/5541

   [SPARK-6368][SQL] Build a specialized serializer for Exchange operator.
   Yin Huai <yhuai@databricks.com>
   2015-04-20 18:42:50 -0700
   Commit: ce7ddab, github.com/apache/spark/pull/5497

   [doc][streaming] Fixed broken link in mllib section
   BenFradet <benjamin.fradet@gmail.com>
   2015-04-20 13:46:55 -0700
   Commit: 517bdf3, github.com/apache/spark/pull/5600

   fixed doc
   Eric Chiang <eric.chiang.m@gmail.com>
   2015-04-20 13:11:21 -0700
   Commit: 97fda73, github.com/apache/spark/pull/5599

   [Minor][MLlib] Incorrect path to test data is used in DecisionTreeExample
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-04-20 10:47:37 -0700
   Commit: 1ebceaa, github.com/apache/spark/pull/5594

   [SPARK-6661] Python type errors should print type, not object
   Elisey Zanko <elisey.zanko@gmail.com>
   2015-04-20 10:44:09 -0700
   Commit: 7717661, github.com/apache/spark/pull/5361

   [SPARK-7003] Improve reliability of connection failure detection between Netty block transfer service endpoints
   Aaron Davidson <aaron@databricks.com>
   2015-04-20 09:54:21 -0700
   Commit: 968ad97, github.com/apache/spark/pull/5584

   [SPARK-5924] Add the ability to specify withMean or withStd parameters with StandardScaler
   jrabary <Jaonary@gmail.com>
   2015-04-20 09:47:56 -0700
   Commit: 1be2070, github.com/apache/spark/pull/4704

   [doc][mllib] Fix typo of the page title in Isotonic regression documents
   dobashim <dobashim@oss.nttdata.co.jp>
   2015-04-20 00:03:23 -0400
   Commit: 6fe690d, github.com/apache/spark/pull/5581

   [SPARK-6979][Streaming] Replace JobScheduler.eventActor and JobGenerator.eventActor with EventLoop
   zsxwing <zsxwing@gmail.com>
   2015-04-19 20:48:36 -0700
   Commit: c776ee8, github.com/apache/spark/pull/5554

   [SPARK-6983][Streaming] Update ReceiverTrackerActor to use the new Rpc interface
   zsxwing <zsxwing@gmail.com>
   2015-04-19 20:35:43 -0700
   Commit: d8e1b7b, github.com/apache/spark/pull/5557

   [SPARK-6998][MLlib] Make StreamingKMeans 'Serializable'
   zsxwing <zsxwing@gmail.com>
   2015-04-19 20:33:51 -0700
   Commit: fa73da0, github.com/apache/spark/pull/5582

   [SPARK-6963][CORE]Flaky test: o.a.s.ContextCleanerSuite automatically cleanup checkpoint
   GuoQiang Li <witgo@qq.com>
   2015-04-19 09:37:09 +0100
   Commit: 0424da6, github.com/apache/spark/pull/5548

   SPARK-6993 : Add default min, max methods for JavaDoubleRDD
   Olivier Girardot <o.girardot@lateral-thoughts.com>
   2015-04-18 18:21:44 -0700
   Commit: 8fbd45c, github.com/apache/spark/pull/5571

   Fixed doc
   Gaurav Nanda <gaurav324@gmail.com>
   2015-04-18 17:20:46 -0700
   Commit: 729885e, github.com/apache/spark/pull/5576

   [SPARK-6219] Reuse pep8.py
   Nicholas Chammas <nicholas.chammas@gmail.com>
   2015-04-18 16:46:28 -0700
   Commit: 28683b4, github.com/apache/spark/pull/5561

   [core] [minor] Make sure ConnectionManager stops.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-04-18 10:14:56 +0100
   Commit: 327ebf0, github.com/apache/spark/pull/5566

   SPARK-6992 : Fix documentation example for Spark SQL on StructType
   Olivier Girardot <o.girardot@lateral-thoughts.com>
   2015-04-18 00:31:01 -0700
   Commit: 5f095d5, github.com/apache/spark/pull/5569

   [SPARK-6975][Yarn] Fix argument validation error
   jerryshao <saisai.shao@intel.com>
   2015-04-17 19:17:06 -0700
   Commit: d850b4b, github.com/apache/spark/pull/5551

   [SPARK-5933] [core] Move config deprecation warnings to SparkConf.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-04-17 19:02:07 -0700
   Commit: 1991337, github.com/apache/spark/pull/5562

   [SPARK-6350][Mesos] Make mesosExecutorCores configurable in mesos "fine-grained" mode
   Jongyoul Lee <jongyoul@gmail.com>
   2015-04-17 18:30:55 -0700
   Commit: 6fbeb82, github.com/apache/spark/pull/5063

   [SPARK-6703][Core] Provide a way to discover existing SparkContext's
   Ilya Ganelin <ilya.ganelin@capitalone.com>
   2015-04-17 18:28:42 -0700
   Commit: c5ed510, github.com/apache/spark/pull/5501

   Minor fix to SPARK-6958: Improve Python docstring for DataFrame.sort.
   Reynold Xin <rxin@databricks.com>
   2015-04-17 16:30:13 -0500
   Commit: a452c59, github.com/apache/spark/pull/5558

   SPARK-6988 : Fix documentation regarding DataFrames using the Java API
   Olivier Girardot <o.girardot@lateral-thoughts.com>
   2015-04-17 16:23:10 -0500
   Commit: d305e68, github.com/apache/spark/pull/5564

   [SPARK-6807] [SparkR] Merge recent SparkR-pkg changes
   cafreeman <cfreeman@alteryx.com>, Davies Liu <davies@databricks.com>, Zongheng Yang <zongheng.y@gmail.com>, Shivaram Venkataraman <shivaram.venkataraman@gmail.com>, Shivaram Venkataraman <shivaram@cs.berkeley.edu>, Sun Rui <rui.sun@intel.com>
   2015-04-17 13:42:19 -0700
   Commit: 59e206d, github.com/apache/spark/pull/5436

   [SPARK-6113] [ml] Stabilize DecisionTree API
   Joseph K. Bradley <joseph@databricks.com>
   2015-04-17 13:15:36 -0700
   Commit: a83571a, github.com/apache/spark/pull/5530

   [SPARK-2669] [yarn] Distribute client configuration to AM.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-04-17 14:21:51 -0500
   Commit: 50ab8a6, github.com/apache/spark/pull/4142

   [SPARK-6957] [SPARK-6958] [SQL] improve API compatibility to pandas
   Davies Liu <davies@databricks.com>
   2015-04-17 11:29:27 -0500
   Commit: c84d916, github.com/apache/spark/pull/5544

   [SPARK-6604][PySpark]Specify ip of python server socket
   linweizhong <linweizhong@huawei.com>
   2015-04-17 12:04:02 +0100
   Commit: dc48ba9, github.com/apache/spark/pull/5256

   [SPARK-6952] Handle long args when detecting PID reuse
   Punya Biswal <pbiswal@palantir.com>
   2015-04-17 11:08:37 +0100
   Commit: f6a9a57, github.com/apache/spark/pull/5535

   [SPARK-6046] [core] Reorganize deprecated config support in SparkConf.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-04-17 11:06:01 +0100
   Commit: 4527761, github.com/apache/spark/pull/5514

   SPARK-6846 [WEBUI] Stage kill URL easy to accidentally trigger and possibility for security issue
   Sean Owen <sowen@cloudera.com>
   2015-04-17 11:02:31 +0100
   Commit: f7a2564, github.com/apache/spark/pull/5528

   [SPARK-6972][SQL] Add Coalesce to DataFrame
   Michael Armbrust <michael@databricks.com>
   2015-04-16 21:49:26 -0500
   Commit: 8220d52, github.com/apache/spark/pull/5545

   [SPARK-6966][SQL] Use correct ClassLoader for JDBC Driver
   Michael Armbrust <michael@databricks.com>
   2015-04-16 17:59:49 -0700
   Commit: e5949c2, github.com/apache/spark/pull/5543

   [SPARK-6899][SQL] Fix type mismatch when using codegen with Average on DecimalType
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-04-16 17:50:20 -0700
   Commit: 1e43851, github.com/apache/spark/pull/5517

   [SQL][Minor] Fix foreachUp of treenode
   scwf <wangfei1@huawei.com>, Fei Wang <wangfei1@huawei.com>
   2015-04-16 17:35:51 -0700
   Commit: d966086, github.com/apache/spark/pull/5518

   [SPARK-6911] [SQL] improve accessor for nested types
   Davies Liu <davies@databricks.com>
   2015-04-16 17:33:57 -0700
   Commit: 6183b5e, github.com/apache/spark/pull/5513

   SPARK-6927 [SQL] Sorting Error when codegen on
   云峤 <chensong.cs@alibaba-inc.com>
   2015-04-16 17:32:42 -0700
   Commit: 5fe4343, github.com/apache/spark/pull/5524

   [SPARK-4897] [PySpark] Python 3 support
   Davies Liu <davies@databricks.com>, twneale <twneale@gmail.com>, Josh Rosen <joshrosen@databricks.com>
   2015-04-16 16:20:57 -0700
   Commit: 04e44b3, github.com/apache/spark/pull/5173

   [SPARK-6855] [SPARKR] Set R includes to get the right collate order.
   Shivaram Venkataraman <shivaram@cs.berkeley.edu>
   2015-04-16 13:06:34 -0700
   Commit: 55f553a, github.com/apache/spark/pull/5462

   [SPARK-6934][Core] Use 'spark.akka.askTimeout' for the ask timeout
   zsxwing <zsxwing@gmail.com>
   2015-04-16 13:45:55 -0500
   Commit: ef3fb80, github.com/apache/spark/pull/5529

   [SPARK-6694][SQL]SparkSQL CLI must be able to specify an option --database on the command line.
   Jin Adachi <adachij2002@yahoo.co.jp>, adachij <adachij@nttdata.co.jp>
   2015-04-16 23:41:04 +0800
   Commit: 3ae37b9, github.com/apache/spark/pull/5345

   [SPARK-4194] [core] Make SparkContext initialization exception-safe.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-04-16 10:48:31 +0100
   Commit: de4fa6b, github.com/apache/spark/pull/5335

   SPARK-4783 [CORE] System.exit() calls in SparkContext disrupt applications embedding Spark
   Sean Owen <sowen@cloudera.com>
   2015-04-16 10:45:32 +0100
   Commit: 6179a94, github.com/apache/spark/pull/5492

   [Streaming][minor] Remove additional quote and unneeded imports
   jerryshao <saisai.shao@intel.com>
   2015-04-16 10:39:02 +0100
   Commit: 8370550, github.com/apache/spark/pull/5540

   [SPARK-6893][ML] default pipeline parameter handling in python
   Xiangrui Meng <meng@databricks.com>
   2015-04-15 23:49:42 -0700
   Commit: 57cd1e8, github.com/apache/spark/pull/5534

   SPARK-6938: All require statements now have an informative error message.
   Juliet Hougland <juliet@cloudera.com>
   2015-04-15 21:52:25 -0700
   Commit: 52c3439, github.com/apache/spark/pull/5532

   [SPARK-5277][SQL] - SparkSqlSerializer doesn't always register user specified KryoRegistrators
   Max Seiden <max@platfora.com>
   2015-04-15 16:15:11 -0700
   Commit: 8a53de1, github.com/apache/spark/pull/5237

   [SPARK-2312] Logging Unhandled messages
   Isaias Barroso <isaias.barroso@gmail.com>
   2015-04-15 22:40:52 +0100
   Commit: d5f1b96, github.com/apache/spark/pull/2055

   [SPARK-2213] [SQL] sort merge join for spark sql
   Daoyuan Wang <daoyuan.wang@intel.com>, Michael Armbrust <michael@databricks.com>
   2015-04-15 14:06:10 -0700
   Commit: 585638e, github.com/apache/spark/pull/5208

   [SPARK-6898][SQL] completely support special chars in column names
   Wenchen Fan <cloud0fan@outlook.com>
   2015-04-15 13:39:12 -0700
   Commit: 4754e16, github.com/apache/spark/pull/5511

   [SPARK-6937][MLLIB] Fixed bug in PICExample in which the radius were not being accepted on c...
   sboeschhuawei <stephen.boesch@huawei.com>
   2015-04-15 13:28:10 -0700
   Commit: 557a797, github.com/apache/spark/pull/5531

   [SPARK-6844][SQL] Clean up accumulators used in InMemoryRelation when it is uncached
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-04-15 13:15:58 -0700
   Commit: cf38fe0, github.com/apache/spark/pull/5475

   [SPARK-6638] [SQL] Improve performance of StringType in SQL
   Davies Liu <davies@databricks.com>
   2015-04-15 13:06:38 -0700
   Commit: 8584276, github.com/apache/spark/pull/5350

   [SPARK-6887][SQL] ColumnBuilder misses FloatType
   Yin Huai <yhuai@databricks.com>
   2015-04-15 13:04:03 -0700
   Commit: 785f955, github.com/apache/spark/pull/5499

   [SPARK-6800][SQL] Update doc for JDBCRelation's columnPartition
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-04-15 13:01:29 -0700
   Commit: e3e4e9a, github.com/apache/spark/pull/5488

   [SPARK-6730][SQL] Allow using keyword as identifier in OPTIONS
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-04-15 13:00:19 -0700
   Commit: b75b307, github.com/apache/spark/pull/5520

   [SPARK-6886] [PySpark] fix big closure with shuffle
   Davies Liu <davies@databricks.com>
   2015-04-15 12:58:02 -0700
   Commit: f11288d, github.com/apache/spark/pull/5496

   SPARK-6861 [BUILD] Scalastyle config prevents building Maven child modules alone
   Sean Owen <sowen@cloudera.com>
   2015-04-15 15:17:58 +0100
   Commit: 6c5ed8a, github.com/apache/spark/pull/5471

   [HOTFIX] [SPARK-6896] [SQL] fix compile error in hive-thriftserver
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-04-15 10:23:53 +0100
   Commit: 29aabdd, github.com/apache/spark/pull/5507

   [SPARK-6871][SQL] WITH clause in CTE cannot follow another WITH clause
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-04-14 23:47:16 -0700
   Commit: 6be9189, github.com/apache/spark/pull/5480

   [SPARK-5634] [core] Show correct message in HS when no incomplete apps f...
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-04-14 18:52:48 -0700
   Commit: 30a6e0d, github.com/apache/spark/pull/5515

   [SPARK-6890] [core] Fix launcher lib work with SPARK_PREPEND_CLASSES.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-04-14 18:51:39 -0700
   Commit: 9717389, github.com/apache/spark/pull/5504

   [SPARK-6796][Streaming][WebUI] Add "Active Batches" and "Completed Batches" lists to StreamingPage
   zsxwing <zsxwing@gmail.com>
   2015-04-14 16:51:36 -0700
   Commit: 6de282e, github.com/apache/spark/pull/5434

   Revert "[SPARK-6352] [SQL] Add DirectParquetOutputCommitter"
   Josh Rosen <joshrosen@databricks.com>
   2015-04-14 14:07:25 -0700
   Commit: a76b921

   [SPARK-6769][YARN][TEST] Usage of the ListenerBus in YarnClusterSuite is wrong
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-04-14 14:00:49 -0700
   Commit: 4d4b249, github.com/apache/spark/pull/5417

   [SPARK-5808] [build] Package pyspark files in sbt assembly.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-04-14 13:41:38 -0700
   Commit: 6577437, github.com/apache/spark/pull/5461

   [SPARK-6905] Upgrade to snappy-java 1.1.1.7
   Josh Rosen <joshrosen@databricks.com>
   2015-04-14 13:40:07 -0700
   Commit: 6adb8bc, github.com/apache/spark/pull/5512

   [SPARK-6700] [yarn] Re-enable flaky test.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-04-14 13:34:44 -0700
   Commit: b075e4b, github.com/apache/spark/pull/5459

   SPARK-1706: Allow multiple executors per worker in Standalone mode
   CodingCat <zhunansjtu@gmail.com>
   2015-04-14 13:32:06 -0700
   Commit: 8f8dc45, github.com/apache/spark/pull/731

   [SPARK-2033] Automatically cleanup checkpoint
   GuoQiang Li <witgo@qq.com>
   2015-04-14 12:56:47 -0700
   Commit: 25998e4, github.com/apache/spark/pull/855

   [CORE] SPARK-6880: Fixed null check when all the dependent stages are cancelled due to previous stage failure
   pankaj arora <pankaj.arora@guavus.com>
   2015-04-14 12:06:46 -0700
   Commit: dcf8a9f, github.com/apache/spark/pull/5494

   [SPARK-6894]spark.executor.extraLibraryOptions => spark.executor.extraLibraryPath
   WangTaoTheTonic <wangtao111@huawei.com>
   2015-04-14 12:02:11 -0700
   Commit: f63b44a, github.com/apache/spark/pull/5506

   [SPARK-6081] Support fetching http/https uris in driver runner.
   Timothy Chen <tnachen@gmail.com>
   2015-04-14 11:48:12 -0700
   Commit: 320bca4, github.com/apache/spark/pull/4832

   SPARK-6878 [CORE] Fix for sum on empty RDD fails with exception
   Erik van Oosten <evanoosten@ebay.com>
   2015-04-14 12:39:56 +0100
   Commit: 51b306b, github.com/apache/spark/pull/5489

   [SPARK-6731] Bump version of apache commons-math3
   Punyashloka Biswal <punya.biswal@gmail.com>
   2015-04-14 11:43:06 +0100
   Commit: 628a72f, github.com/apache/spark/pull/5380

   [WIP][HOTFIX][SPARK-4123]: Fix bug in PR dependency (all deps. removed issue)
   Brennon York <brennon.york@capitalone.com>
   2015-04-13 22:31:44 -0700
   Commit: 77eeb10, github.com/apache/spark/pull/5443

   [SPARK-5957][ML] better handling of parameters
   Xiangrui Meng <meng@databricks.com>
   2015-04-13 21:18:05 -0700
   Commit: 971b95b, github.com/apache/spark/pull/5431

   [Minor][SparkR] Minor refactor and removes redundancy related to cleanClosure.
   hlin09 <hlin09pu@gmail.com>
   2015-04-13 20:43:24 -0700
   Commit: 0ba3fdd, github.com/apache/spark/pull/5495

   [SPARK-5794] [SQL] fix add jar
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-04-13 18:26:00 -0700
   Commit: b45059d, github.com/apache/spark/pull/4586

   [SQL] [Minor] Fix for SqlApp.scala
   Fei Wang <wangfei1@huawei.com>
   2015-04-13 18:23:35 -0700
   Commit: 3782e1f, github.com/apache/spark/pull/5485

   [Spark-4848] Allow different Worker configurations in standalone cluster
   Nathan Kronenfeld <nkronenfeld@oculusinfo.com>
   2015-04-13 18:21:16 -0700
   Commit: 435b877, github.com/apache/spark/pull/5140

   [SPARK-6877][SQL] Add code generation support for Min
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-04-13 18:16:33 -0700
   Commit: 4898dfa, github.com/apache/spark/pull/5487

   [SPARK-6303][SQL] Remove unnecessary Average in GeneratedAggregate
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-04-13 18:15:29 -0700
   Commit: 5b8b324, github.com/apache/spark/pull/4996

   [SPARK-6881][SparkR] Changes the checkpoint directory name.
   hlin09 <hlin09pu@gmail.com>
   2015-04-13 16:53:50 -0700
   Commit: d7f2c19, github.com/apache/spark/pull/5493

   [SPARK-5931][CORE] Use consistent naming for time properties
   Ilya Ganelin <ilya.ganelin@capitalone.com>, Ilya Ganelin <ilganeli@gmail.com>
   2015-04-13 16:28:07 -0700
   Commit: c4ab255, github.com/apache/spark/pull/5236

   [SPARK-5941] [SQL] Unit Test loads the table `src` twice for leftsemijoin.q
   Cheng Hao <hao.cheng@intel.com>
   2015-04-13 16:02:18 -0700
   Commit: c5602bd, github.com/apache/spark/pull/4506

   [SPARK-6872] [SQL] add copy in external sort
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-04-13 16:00:58 -0700
   Commit: e63a86a, github.com/apache/spark/pull/5481

   [SPARK-5972] [MLlib] Cache residuals and gradient in GBT during training and validation
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-04-13 15:36:33 -0700
   Commit: 2a55cb4, github.com/apache/spark/pull/5330

   [SQL][SPARK-6742]: Don't push down predicates which reference partition column(s)
   Yash Datta <Yash.Datta@guavus.com>
   2015-04-13 14:43:07 -0700
   Commit: 3a205bb, github.com/apache/spark/pull/5390

   [SPARK-6130] [SQL] support if not exists for insert overwrite into partition in hiveQl
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-04-13 14:29:07 -0700
   Commit: 85ee0ca, github.com/apache/spark/pull/4865

   [SPARK-5988][MLlib] add save/load for PowerIterationClusteringModel
   Xusen Yin <yinxusen@gmail.com>
   2015-04-13 11:53:17 -0700
   Commit: 1e340c3, github.com/apache/spark/pull/5450

   [SPARK-6662][YARN] Allow variable substitution in spark.yarn.historyServer.address
   Cheolsoo Park <cheolsoop@netflix.com>
   2015-04-13 13:45:10 -0500
   Commit: 6cc5b3e, github.com/apache/spark/pull/5321

   [SPARK-6765] Enable scalastyle on test code.
   Reynold Xin <rxin@databricks.com>
   2015-04-13 09:29:04 -0700
   Commit: c5b0b29, github.com/apache/spark/pull/5486

   [SPARK-6207] [YARN] [SQL] Adds delegation tokens for metastore to conf.
   Doug Balog <doug.balogtarget.com>, Doug Balog <doug.balog@target.com>
   2015-04-13 09:49:58 -0500
   Commit: 77620be, github.com/apache/spark/pull/5031

   [SPARK-6352] [SQL] Add DirectParquetOutputCommitter
   Pei-Lun Lee <pllee@appier.com>
   2015-04-13 21:52:00 +0800
   Commit: b29663e, github.com/apache/spark/pull/5042

   [SPARK-6870][Yarn] Catch InterruptedException when yarn application state monitor thread has been interrupted
   linweizhong <linweizhong@huawei.com>
   2015-04-13 13:06:54 +0100
   Commit: 202ebf0, github.com/apache/spark/pull/5479

   [SPARK-6671] Add status command for spark daemons
   Pradeep Chanumolu <pchanumolu@maprtech.com>
   2015-04-13 13:02:55 +0100
   Commit: 240ea03, github.com/apache/spark/pull/5327

   [SPARK-6440][CORE]Handle IPv6 addresses properly when constructing URI
   nyaapa <nyaapa@gmail.com>
   2015-04-13 12:55:25 +0100
   Commit: 9d117ce, github.com/apache/spark/pull/5424

   [SPARK-6860][Streaming][WebUI] Fix the possible inconsistency of StreamingPage
   zsxwing <zsxwing@gmail.com>
   2015-04-13 12:21:29 +0100
   Commit: 14ce3ea, github.com/apache/spark/pull/5470

   [SPARK-6762]Fix potential resource leaks in CheckPoint CheckpointWriter and CheckpointReader
   lisurprise <zhichao.li@intel.com>
   2015-04-13 12:18:05 +0100
   Commit: cadd7d7, github.com/apache/spark/pull/5407

   [SPARK-6868][YARN] Fix broken container log link on executor page when HTTPS_ONLY.
   Dean Chen <deanchen5@gmail.com>
   2015-04-13 12:08:55 +0100
   Commit: 950645d, github.com/apache/spark/pull/5477

   [SPARK-6562][SQL] DataFrame.replace
   Reynold Xin <rxin@databricks.com>
   2015-04-12 22:56:12 -0700
   Commit: 68d1faa, github.com/apache/spark/pull/5282

   [SPARK-5885][MLLIB] Add VectorAssembler as a feature transformer
   Xiangrui Meng <meng@databricks.com>
   2015-04-12 22:42:01 -0700
   Commit: 9294044, github.com/apache/spark/pull/5196

   [SPARK-5886][ML] Add StringIndexer as a feature transformer
   Xiangrui Meng <meng@databricks.com>
   2015-04-12 22:41:05 -0700
   Commit: 685ddcf, github.com/apache/spark/pull/4735

   [SPARK-4081] [mllib]  VectorIndexer
   Joseph K. Bradley <joseph@databricks.com>
   2015-04-12 22:38:27 -0700
   Commit: d3792f5, github.com/apache/spark/pull/3000

   [SPARK-6643][MLLIB] Implement StandardScalerModel missing methods
   lewuathe <lewuathe@me.com>
   2015-04-12 22:17:16 -0700
   Commit: fc17661, github.com/apache/spark/pull/5310

   [SPARK-6765] Fix test code style for core.
   Reynold Xin <rxin@databricks.com>
   2015-04-12 20:50:49 -0700
   Commit: a1fe59d, github.com/apache/spark/pull/5484

   [MINOR] a typo: coalesce
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-04-12 18:58:53 +0100
   Commit: 04bcd67, github.com/apache/spark/pull/5482

   [SPARK-6431][Streaming][Kafka] Error message for partition metadata requ...
   cody koeninger <cody@koeninger.org>
   2015-04-12 17:37:30 +0100
   Commit: 6ac8eea, github.com/apache/spark/pull/5454

   [SPARK-6843][core]Add volatile for the "state"
   lisurprise <zhichao.li@intel.com>
   2015-04-12 13:41:44 +0100
   Commit: ddc1743, github.com/apache/spark/pull/5448

   [SPARK-6866][Build] Remove duplicated dependency in launcher/pom.xml
   Guancheng (G.C.) Chen <chenguancheng@gmail.com>
   2015-04-12 11:36:41 +0100
   Commit: e9445b1, github.com/apache/spark/pull/5476

   [SPARK-6677] [SQL] [PySpark] fix cached classes
   Davies Liu <davies@databricks.com>
   2015-04-11 22:33:23 -0700
   Commit: 5d8f7b9, github.com/apache/spark/pull/5445

   MAINTENANCE: Automated closing of pull requests.
   Patrick Wendell <patrick@databricks.com>
   2015-04-11 22:12:56 -0700
   Commit: 0cc8fcb, github.com/apache/spark/pull/4994

   SPARK-6710 GraphX Fixed Wrong initial bias in GraphX SVDPlusPlus
   Michael Malak <michaelmalak@yahoo.com>
   2015-04-11 21:01:23 -0700
   Commit: 1205f7e, github.com/apache/spark/pull/5464

   [HOTFIX] Add explicit return types to fix lint errors
   Josh Rosen <joshrosen@databricks.com>
   2015-04-11 20:12:40 -0700
   Commit: dea5dac

   [SQL][minor] move `resolveGetField` into a object
   Wenchen Fan <cloud0fan@outlook.com>
   2015-04-11 19:35:56 -0700
   Commit: 5c2844c, github.com/apache/spark/pull/5435

   [SPARK-6367][SQL] Use the proper data type for those expressions that are hijacking existing data types.
   Yin Huai <yhuai@databricks.com>
   2015-04-11 19:26:15 -0700
   Commit: 6d4e854, github.com/apache/spark/pull/5094

   [SQL] Handle special characters in the authority of a Path's URI.
   Yin Huai <yhuai@databricks.com>
   2015-04-11 18:44:54 -0700
   Commit: d2383fb, github.com/apache/spark/pull/5381

   [SPARK-6379][SQL] Support a function to call user-defined functions registered in SQLContext
   Takeshi YAMAMURO <linguin.m.s@gmail.com>
   2015-04-11 18:41:12 -0700
   Commit: 352a5da, github.com/apache/spark/pull/5061

   [SPARK-6179][SQL] Add token for "SHOW PRINCIPALS role_name" and "SHOW TRANSACTIONS" and "SHOW COMPACTIONS"
   DoingDone9 <799203320@qq.com>, Zhongshuai Pei <799203320@qq.com>, Xu Tingjun <xutingjun@huawei.com>
   2015-04-11 18:34:17 -0700
   Commit: 48cc840, github.com/apache/spark/pull/4902

   [Spark-5068][SQL]Fix bug query data when path doesn't exist for HiveContext
   lazymam500 <lazyman500@gmail.com>, lazyman <lazyman500@gmail.com>
   2015-04-11 18:33:14 -0700
   Commit: 1f39a61, github.com/apache/spark/pull/5059

   [SPARK-6199] [SQL] Support CTE in HiveContext and SQLContext
   haiyang <huhaiyang@huawei.com>
   2015-04-11 18:30:17 -0700
   Commit: 2f53588, github.com/apache/spark/pull/4929

   [Minor][SQL] Fix typo in sql
   Guancheng (G.C.) Chen <chenguancheng@gmail.com>
   2015-04-11 15:43:12 -0700
   Commit: 7dbd371, github.com/apache/spark/pull/5474

   [SPARK-6863] Fix formatting on SQL programming guide.
   Santiago M. Mola <santiago.mola@sap.com>
   2015-04-11 15:42:03 -0700
   Commit: 6437e7c, github.com/apache/spark/pull/5472

   [SPARK-6611][SQL] Add support for INTEGER as synonym of INT.
   Santiago M. Mola <santiago.mola@sap.com>
   2015-04-11 14:52:49 -0700
   Commit: 5f7b7cd, github.com/apache/spark/pull/5271

   [SPARK-6858][SQL] Register Java HashMap for SparkSqlSerializer
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-04-11 14:50:50 -0700
   Commit: 198cf2a, github.com/apache/spark/pull/5465

   [SPARK-6835] [SQL] Fix bug of Hive UDTF in Lateral View (ClassNotFound)
   Cheng Hao <hao.cheng@intel.com>
   2015-04-11 22:11:03 +0800
   Commit: 3ceb810, github.com/apache/spark/pull/5444

   [hotfix] [build] Make sure JAVA_HOME is set for tests.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-04-11 13:10:01 +0100
   Commit: 694aef0, github.com/apache/spark/pull/5441

   [Minor][Core] Fix typo
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-04-11 13:07:41 +0100
   Commit: 95a0759, github.com/apache/spark/pull/5466

   [SQL] [SPARK-6620] Speed up toDF() and rdd() functions by constructing converters in ScalaReflection
   Volodymyr Lyubinets <vlyubin@gmail.com>
   2015-04-10 16:27:56 -0700
   Commit: 67d0688, github.com/apache/spark/pull/5279

   [SPARK-6851][SQL] Create new instance for each converted parquet relation
   Michael Armbrust <michael@databricks.com>
   2015-04-10 16:05:14 -0700
   Commit: 23d5f88, github.com/apache/spark/pull/5458

   [SPARK-6850] [SparkR] use one partition when we need to compare the whole result
   Davies Liu <davies@databricks.com>
   2015-04-10 15:35:45 -0700
   Commit: 68ecdb7, github.com/apache/spark/pull/5460

   [SPARK-6216] [PySpark] check the python version in worker
   Davies Liu <davies@databricks.com>
   2015-04-10 14:04:53 -0700
   Commit: 4740d6a, github.com/apache/spark/pull/5404

   [SPARK-5969][PySpark] Fix descending pyspark.rdd.sortByKey.
   Milan Straka <fox@ucw.cz>
   2015-04-10 13:50:32 -0700
   Commit: 0375134, github.com/apache/spark/pull/4761

   [SQL] [SPARK-6794] Use kryo-based SparkSqlSerializer for GeneralHashedRelation
   Volodymyr Lyubinets <vlyubin@gmail.com>
   2015-04-10 12:09:54 -0700
   Commit: b9baa4c, github.com/apache/spark/pull/5433

   [SPARK-6773][Tests]Fix issue where RAT checks still passed when downloading the rat jar failed
   June.He <jun.hejun@huawei.com>
   2015-04-10 20:02:35 +0100
   Commit: 9f5ed99, github.com/apache/spark/pull/5421

   [SPARK-6766][Streaming] Fix issue about StreamingListenerBatchSubmitted and StreamingListenerBatchStarted
   zsxwing <zsxwing@gmail.com>
   2015-04-10 01:51:42 -0700
   Commit: 18ca089, github.com/apache/spark/pull/5414

   [SPARK-6211][Streaming] Add Python Kafka API unit test
   jerryshao <saisai.shao@intel.com>, Saisai Shao <saisai.shao@intel.com>
   2015-04-09 23:14:24 -0700
   Commit: 3290d2d, github.com/apache/spark/pull/4961

   [SPARK-6577] [MLlib] [PySpark] SparseMatrix should be supported in PySpark
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-04-09 23:10:13 -0700
   Commit: e236081, github.com/apache/spark/pull/5355

   [SPARK-3074] [PySpark] support groupByKey() with single huge key
   Davies Liu <davies.liu@gmail.com>, Davies Liu <davies@databricks.com>
   2015-04-09 17:07:23 -0700
   Commit: b5c51c8, github.com/apache/spark/pull/1977

   [Spark-6693][MLlib]add tostring with max lines and width for matrix
   Yuhao Yang <hhbyyh@gmail.com>
   2015-04-09 15:37:45 -0700
   Commit: 9c67049, github.com/apache/spark/pull/5344

   [SPARK-6264] [MLLIB] Support FPGrowth algorithm in Python API
   Yanbo Liang <ybliang8@gmail.com>
   2015-04-09 15:10:10 -0700
   Commit: a0411ae, github.com/apache/spark/pull/5213

   [SPARK-6758]block the right jetty package in log
   WangTaoTheTonic <wangtao111@huawei.com>
   2015-04-09 17:44:08 -0400
   Commit: 7d92db3, github.com/apache/spark/pull/5406

   [minor] [examples] Avoid packaging duplicate classes.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-04-09 07:07:50 -0400
   Commit: 470d745, github.com/apache/spark/pull/5379

   SPARK-4924 addendum. Minor assembly directory fix in load-spark-env.sh
   raschild <raschild@users.noreply.github.com>
   2015-04-09 07:04:18 -0400
   Commit: 53f6bb1, github.com/apache/spark/pull/5261

   [SPARK-6343] Doc driver-worker network reqs
   Peter Parente <pparent@us.ibm.com>
   2015-04-09 06:37:20 -0400
   Commit: b9c51c0, github.com/apache/spark/pull/5382

   [SPARK-5654] Integrate SparkR
   Shivaram Venkataraman <shivaram@cs.berkeley.edu>, Shivaram Venkataraman <shivaram.venkataraman@gmail.com>, Zongheng Yang <zongheng.y@gmail.com>, cafreeman <cfreeman@alteryx.com>, Shivaram Venkataraman <shivaram@eecs.berkeley.edu>, Davies Liu <davies@databricks.com>, Davies Liu <davies.liu@gmail.com>, hlin09 <hlin09pu@gmail.com>, Sun Rui <rui.sun@intel.com>, lythesia <iranaikimi@gmail.com>, oscaroboto <oscarjr@gmail.com>, Antonio Piccolboni <antonio@piccolboni.info>, root <edward>, edwardt <edwardt.tril@gmail.com>, hqzizania <qian.huang@intel.com>, dputler <dan.putler@gmail.com>, Todd Gao <todd.gao.2013@gmail.com>, Chris Freeman <cfreeman@alteryx.com>, Felix Cheung <fcheung@AVVOMAC-119.local>, Hossein <hossein@databricks.com>, Evert Lammerts <evert@apache.org>, Felix Cheung <fcheung@avvomac-119.t-mobile.com>, felixcheung <felixcheung_m@hotmail.com>, Ryan Hafen <rhafen@gmail.com>, Ashutosh Raina <ashutoshraina@users.noreply.github.com>, Oscar Olmedo <oscarjr@gmail.com>, Josh Rosen <rosenville@gmail.com>, Yi Lu <iranaikimi@gmail.com>, Harihar Nahak <hnahak87@users.noreply.github.com>
   2015-04-08 22:45:40 -0700
   Commit: 2fe0a1a, github.com/apache/spark/pull/5096

   [SPARK-6765] Fix test code style for SQL
   Reynold Xin <rxin@databricks.com>
   2015-04-08 20:35:29 -0700
   Commit: 1b2aab8, github.com/apache/spark/pull/5412

   [SPARK-6696] [SQL] Adds HiveContext.refreshTable to PySpark
   Cheng Lian <lian@databricks.com>
   2015-04-08 18:47:39 -0700
   Commit: 891ada5, github.com/apache/spark/pull/5349

   [SPARK-6451][SQL] supported code generation for CombineSum
   Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
   2015-04-08 18:42:34 -0700
   Commit: 7d7384c, github.com/apache/spark/pull/5138

   [SQL][minor] remove duplicated resolveGetField and update comment
   Wenchen Fan <cloud0fan@outlook.com>
   2015-04-08 13:57:01 -0700
   Commit: 9418280, github.com/apache/spark/pull/5304

   [SPARK-4346][SPARK-3596][YARN] Commonize the monitor logic
   unknown <l00251599@HGHY1L002515991.china.huawei.com>, Sephiroth-Lin <linwzhong@gmail.com>
   2015-04-08 13:56:42 -0700
   Commit: 55a92ef, github.com/apache/spark/pull/5305

   [SPARK-5242]: Add --private-ips flag to EC2 script
   Michelangelo D'Agostino <mdagostino@civisanalytics.com>
   2015-04-08 16:48:45 -0400
   Commit: 86403f5, github.com/apache/spark/pull/5244

   [SPARK-6767][SQL] Fixed Query DSL error in spark sql Readme
   Tijo Thomas <tijoparacka@gmail.com>
   2015-04-08 13:42:29 -0700
   Commit: 2f482d7, github.com/apache/spark/pull/5415

   [SPARK-6781] [SQL] use sqlContext in python shell
   Davies Liu <davies@databricks.com>
   2015-04-08 13:31:45 -0700
   Commit: 6ada4f6, github.com/apache/spark/pull/5425

   [SPARK-6765] Fix test code style for mllib.
   Reynold Xin <rxin@databricks.com>
   2015-04-08 11:32:44 -0700
   Commit: 66159c3, github.com/apache/spark/pull/5411

   [SPARK-6765] Fix test code style for graphx.
   Reynold Xin <rxin@databricks.com>
   2015-04-08 11:31:48 -0700
   Commit: 8d812f9, github.com/apache/spark/pull/5410

   [SPARK-6753] Clone SparkConf in ShuffleSuite tests
   Kay Ousterhout <kayousterhout@gmail.com>
   2015-04-08 10:26:45 -0700
   Commit: 9d44ddc, github.com/apache/spark/pull/5401

   [SPARK-6506] [pyspark] Do not try to retrieve SPARK_HOME when not needed...
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-04-08 10:14:52 -0700
   Commit: f7e21dd, github.com/apache/spark/pull/5405

   [SPARK-6765] Fix test code style for streaming.
   Reynold Xin <rxin@databricks.com>
   2015-04-08 00:24:59 -0700
   Commit: 15e0d2b, github.com/apache/spark/pull/5409

   [SPARK-6754] Remove unnecessary TaskContextHelper
   Kay Ousterhout <kayousterhout@gmail.com>
   2015-04-07 22:40:42 -0700
   Commit: 8d2a36c, github.com/apache/spark/pull/5402

   [SPARK-6705][MLLIB] Add fit intercept api to ml logisticregression
   Omede Firouz <ofirouz@palantir.com>
   2015-04-07 23:36:31 -0400
   Commit: d138aa8, github.com/apache/spark/pull/5301

   [SPARK-6737] Fix memory leak in OutputCommitCoordinator
   Josh Rosen <joshrosen@databricks.com>
   2015-04-07 16:18:55 -0700
   Commit: c83e039, github.com/apache/spark/pull/5397

   [SPARK-6748] [SQL] Makes QueryPlan.schema a lazy val
   Cheng Lian <lian@databricks.com>
   2015-04-08 07:00:56 +0800
   Commit: 77bcceb, github.com/apache/spark/pull/5398

   [SPARK-6720][MLLIB] PySpark MultivariateStatisticalSummary unit test for normL1...
   lewuathe <lewuathe@me.com>
   2015-04-07 14:36:57 -0700
   Commit: fc957dc, github.com/apache/spark/pull/5374

   Revert "[SPARK-6568] spark-shell.cmd --jars option does not accept the jar that has space in its path"
   Xiangrui Meng <meng@databricks.com>
   2015-04-07 14:34:15 -0700
   Commit: e6f08fb

   [SPARK-6568] spark-shell.cmd --jars option does not accept the jar that has space in its path
   Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
   2015-04-07 14:29:53 -0700
   Commit: 596ba77, github.com/apache/spark/pull/5347

   [SPARK-6750] Upgrade ScalaStyle to 0.7.
   Reynold Xin <rxin@databricks.com>
   2015-04-07 12:37:33 -0700
   Commit: 1232215, github.com/apache/spark/pull/5399

   Replace use of .size with .length for Arrays
   sksamuel <sam@sksamuel.com>
   2015-04-07 10:43:22 -0700
   Commit: 2c32bef, github.com/apache/spark/pull/5376

   [SPARK-6733][Scheduler]Added scala.language.existentials
   Vinod K C <vinod.kc@huawei.com>
   2015-04-07 10:42:08 -0700
   Commit: 7162ecf, github.com/apache/spark/pull/5384

   [SPARK-3591][YARN]fire and forget for YARN cluster mode
   WangTaoTheTonic <wangtao111@huawei.com>
   2015-04-07 08:36:25 -0500
   Commit: b65bad6, github.com/apache/spark/pull/5297

   [SPARK-6736][GraphX][Doc]Example of Graph#aggregateMessages has error
   Sasaki Toru <sasakitoa@nttdata.co.jp>
   2015-04-07 01:55:32 -0700
   Commit: ae980eb, github.com/apache/spark/pull/5388

   [SPARK-6636] Use public DNS hostname everywhere in spark_ec2.py
   Matt Aasted <aasted@twitch.tv>
   2015-04-06 23:50:48 -0700
   Commit: 6f0d55d, github.com/apache/spark/pull/5302

   [SPARK-6716] Change SparkContext.DRIVER_IDENTIFIER from <driver> to driver
   Josh Rosen <joshrosen@databricks.com>
   2015-04-06 23:33:16 -0700
   Commit: a0846c4, github.com/apache/spark/pull/5372

   [Minor] [SQL] [SPARK-6729] Minor fix for DriverQuirks get
   Volodymyr Lyubinets <vlyubin@gmail.com>
   2015-04-06 18:00:51 -0700
   Commit: e40ea87, github.com/apache/spark/pull/5378

   [MLlib] [SPARK-6713] Iterators in columnSimilarities for mapPartitionsWithIndex
   Reza Zadeh <reza@databricks.com>
   2015-04-06 13:15:01 -0700
   Commit: 30363ed, github.com/apache/spark/pull/5364

   SPARK-6569 [STREAMING] Down-grade same-offset message in Kafka streaming to INFO
   Sean Owen <sowen@cloudera.com>
   2015-04-06 10:18:56 +0100
   Commit: 9fe4125, github.com/apache/spark/pull/5366

   [SPARK-6673] spark-shell.cmd can't start in Windows even when spark was built
   Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
   2015-04-06 10:11:20 +0100
   Commit: 49f3882, github.com/apache/spark/pull/5328

   [SPARK-6602][Core] Update MapOutputTrackerMasterActor to MapOutputTrackerMasterEndpoint
   zsxwing <zsxwing@gmail.com>
   2015-04-05 21:57:15 -0700
   Commit: 0b5d028, github.com/apache/spark/pull/5371

   [SPARK-6262][MLLIB]Implement missing methods for MultivariateStatisticalSummary
   lewuathe <lewuathe@me.com>
   2015-04-05 16:13:31 -0700
   Commit: acffc43, github.com/apache/spark/pull/5359

   [SPARK-6602][Core] Replace direct use of Akka with Spark RPC interface - part 1
   zsxwing <zsxwing@gmail.com>
   2015-04-04 11:52:05 -0700
   Commit: f15806a, github.com/apache/spark/pull/5268

   [SPARK-6607][SQL] Check invalid characters for Parquet schema and show error messages
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-04-05 00:20:43 +0800
   Commit: 7bca62f, github.com/apache/spark/pull/5263

   [SQL] Use path.makeQualified in newParquet.
   Yin Huai <yhuai@databricks.com>
   2015-04-04 23:26:10 +0800
   Commit: da25c86, github.com/apache/spark/pull/5353

   [SPARK-6700] disable flaky test
   Davies Liu <davies@databricks.com>
   2015-04-03 15:22:21 -0700
   Commit: 9b40c17, github.com/apache/spark/pull/5356

   [SPARK-6647][SQL] Make trait StringComparison as BinaryPredicate and fix unit tests of string data source Filter
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-04-03 12:35:00 -0700
   Commit: 26b415e, github.com/apache/spark/pull/5309

   [SPARK-6688] [core] Always use resolved URIs in EventLoggingListener.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-04-03 11:54:31 -0700
   Commit: 14632b7, github.com/apache/spark/pull/5340

   Closes #3158
   Reynold Xin <rxin@databricks.com>
   2015-04-03 11:53:07 -0700
   Commit: ffe8cc9

   [SPARK-6640][Core] Fix the race condition of creating HeartbeatReceiver and retrieving HeartbeatReceiver
   zsxwing <zsxwing@gmail.com>
   2015-04-03 11:44:27 -0700
   Commit: 88504b7, github.com/apache/spark/pull/5306

   [SPARK-6492][CORE] SparkContext.stop() can deadlock when DAGSchedulerEventProcessLoop dies
   Ilya Ganelin <ilya.ganelin@capitalone.com>
   2015-04-03 19:23:11 +0100
   Commit: 2c43ea3, github.com/apache/spark/pull/5277

   [SPARK-5203][SQL] fix union with different decimal type
   guowei2 <guowei2@asiainfo.com>
   2015-04-04 02:02:30 +0800
   Commit: c23ba81, github.com/apache/spark/pull/4004

   [Minor][SQL] Fix typo
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-04-03 18:31:48 +0100
   Commit: dc6dff2, github.com/apache/spark/pull/5352

   [SPARK-6615][MLLIB] Python API for Word2Vec
   lewuathe <lewuathe@me.com>
   2015-04-03 09:49:50 -0700
   Commit: 512a2f1, github.com/apache/spark/pull/5296

   [MLLIB] Remove println in LogisticRegression.scala
   Omede Firouz <ofirouz@palantir.com>
   2015-04-03 10:26:43 +0100
   Commit: b52c7f9, github.com/apache/spark/pull/5338

   [SPARK-6560][CORE] Do not suppress exceptions from writer.write.
   Stephen Haberman <stephen@exigencecorp.com>
   2015-04-03 09:48:37 +0100
   Commit: b0d884f, github.com/apache/spark/pull/5223

   [SPARK-6428] Turn on explicit type checking for public methods.
   Reynold Xin <rxin@databricks.com>
   2015-04-03 01:25:02 -0700
   Commit: 82701ee, github.com/apache/spark/pull/5342

   [SPARK-6575][SQL] Converted Parquet Metastore tables no longer cache metadata
   Yin Huai <yhuai@databricks.com>
   2015-04-03 14:40:36 +0800
   Commit: c42c3fc, github.com/apache/spark/pull/5339

   [SPARK-6621][Core] Fix the bug that calling EventLoop.stop in EventLoop.onReceive/onError/onStart doesn't call onStop
   zsxwing <zsxwing@gmail.com>
   2015-04-02 22:54:30 -0700
   Commit: 440ea31, github.com/apache/spark/pull/5280

   [SPARK-6345][STREAMING][MLLIB] Fix for training with prediction
   freeman <the.freeman.lab@gmail.com>
   2015-04-02 21:37:44 -0700
   Commit: 6e1c1ec, github.com/apache/spark/pull/5037

   [CORE] The description of jobHistory config should be spark.history.fs.logDirectory
   KaiXinXiaoLei <huleilei1@huawei.com>
   2015-04-02 20:24:31 -0700
   Commit: 8a0aa81, github.com/apache/spark/pull/5332

   [SPARK-6575][SQL] Converted Parquet Metastore tables no longer cache metadata
   Yin Huai <yhuai@databricks.com>
   2015-04-02 20:23:08 -0700
   Commit: 4b82bd7, github.com/apache/spark/pull/5339

   [SPARK-6650] [core] Stop ExecutorAllocationManager when context stops.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-04-02 19:48:55 -0700
   Commit: 45134ec, github.com/apache/spark/pull/5311

   [SPARK-6686][SQL] Use resolved output instead of names for toDF rename
   Michael Armbrust <michael@databricks.com>
   2015-04-02 18:30:55 -0700
   Commit: 052dee0, github.com/apache/spark/pull/5337

   [SPARK-6243][SQL] The Operation of match did not consider the scenarios that order.dataType does not match NativeType
   DoingDone9 <799203320@qq.com>
   2015-04-02 17:23:51 -0700
   Commit: 947802c, github.com/apache/spark/pull/4959

   [SQL][Minor] Use analyzed logical instead of unresolved in HiveComparisonTest
   Cheng Hao <hao.cheng@intel.com>
   2015-04-02 17:20:31 -0700
   Commit: dfd2982, github.com/apache/spark/pull/4946

   [SPARK-6618][SPARK-6669][SQL] Lock Hive metastore client correctly.
   Yin Huai <yhuai@databricks.com>, Michael Armbrust <michael@databricks.com>
   2015-04-02 16:46:50 -0700
   Commit: 5db8912, github.com/apache/spark/pull/5333

   [Minor] [SQL] Follow-up of PR #5210
   Cheng Lian <lian@databricks.com>
   2015-04-02 16:15:34 -0700
   Commit: d3944b6, github.com/apache/spark/pull/5219

   [SPARK-6655][SQL] We need to read the schema of a data source table stored in spark.sql.sources.schema property
   Yin Huai <yhuai@databricks.com>
   2015-04-02 16:02:31 -0700
   Commit: 251698f, github.com/apache/spark/pull/5313

   [SQL] Throw UnsupportedOperationException instead of NotImplementedError
   Michael Armbrust <michael@databricks.com>
   2015-04-02 16:01:03 -0700
   Commit: 4214e50, github.com/apache/spark/pull/5315

   SPARK-6414: Spark driver failed with NPE on job cancelation
   Hung Lin <hung.lin@gmail.com>
   2015-04-02 14:01:43 -0700
   Commit: e3202aa, github.com/apache/spark/pull/5124

   [SPARK-6667] [PySpark] remove setReuseAddress
   Davies Liu <davies@databricks.com>
   2015-04-02 12:18:33 -0700
   Commit: 0cce545, github.com/apache/spark/pull/5324

   [SPARK-6672][SQL] convert row to catalyst in createDataFrame(RDD[Row], ...)
   Xiangrui Meng <meng@databricks.com>
   2015-04-02 17:57:01 +0800
   Commit: 424e987, github.com/apache/spark/pull/5329

   [SPARK-6627] Some clean-up in shuffle code.
   Patrick Wendell <patrick@databricks.com>
   2015-04-01 23:42:09 -0700
   Commit: 6562787, github.com/apache/spark/pull/5286

   [SPARK-6663] [SQL] use Literal.create instead of constructor
   Davies Liu <davies@databricks.com>
   2015-04-01 23:11:38 -0700
   Commit: 40df5d4, github.com/apache/spark/pull/5320

   Revert "[SPARK-6618][SQL] HiveMetastoreCatalog.lookupRelation should use fine-grained lock"
   Cheng Lian <lian@databricks.com>
   2015-04-02 12:56:34 +0800
   Commit: 2bc7fe7

   [SPARK-6658][SQL] Update DataFrame documentation to fix type references.
   Chet Mancini <chetmancini@gmail.com>
   2015-04-01 21:39:46 -0700
   Commit: 191524e, github.com/apache/spark/pull/5316

   [SPARK-6578] Small rewrite to make the logic more clear in MessageWithHeader.transferTo.
   Reynold Xin <rxin@databricks.com>
   2015-04-01 18:36:06 -0700
   Commit: 899ebcb, github.com/apache/spark/pull/5319

   [SPARK-6660][MLLIB] pythonToJava doesn't recognize object arrays
   Xiangrui Meng <meng@databricks.com>
   2015-04-01 18:17:07 -0700
   Commit: 4815bc2, github.com/apache/spark/pull/5318

   [SPARK-6553] [pyspark] Support functools.partial as UDF
   ksonj <kson@siberie.de>
   2015-04-01 17:23:57 -0700
   Commit: 757b2e9, github.com/apache/spark/pull/5206

   [SPARK-6580] [MLLIB] Optimize LogisticRegressionModel.predictPoint
   Yanbo Liang <ybliang8@gmail.com>
   2015-04-01 17:19:36 -0700
   Commit: 86b4399, github.com/apache/spark/pull/5249

   [SPARK-6576] [MLlib] [PySpark] DenseMatrix in PySpark should support indexing
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-04-01 17:03:39 -0700
   Commit: 2fa3b47, github.com/apache/spark/pull/5232

   [SPARK-6642][MLLIB] use 1.2 lambda scaling and remove addImplicit from NormalEquation
   Xiangrui Meng <meng@databricks.com>
   2015-04-01 16:47:18 -0700
   Commit: ccafd75, github.com/apache/spark/pull/5314

   [SPARK-6578] [core] Fix thread-safety issue in outbound path of network library.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-04-01 16:06:11 -0700
   Commit: f084c5d, github.com/apache/spark/pull/5234

   [SPARK-6657] [Python] [Docs] fixed python doc build warnings
   Joseph K. Bradley <joseph@databricks.com>
   2015-04-01 15:15:47 -0700
   Commit: fb25e8c, github.com/apache/spark/pull/5317

   [SPARK-6651][MLLIB] delegate dense vector arithmetics to the underlying numpy array
   Xiangrui Meng <meng@databricks.com>
   2015-04-01 13:29:04 -0700
   Commit: 2275acc, github.com/apache/spark/pull/5312

   SPARK-6433 hive tests to import spark-sql test JAR for QueryTest access
   Steve Loughran <stevel@hortonworks.com>
   2015-04-01 16:26:54 +0100
   Commit: ee11be2, github.com/apache/spark/pull/5119

   [SPARK-6608] [SQL] Makes DataFrame.rdd a lazy val
   Cheng Lian <lian@databricks.com>
   2015-04-01 21:34:45 +0800
   Commit: d36c5fc, github.com/apache/spark/pull/5265

   SPARK-6626 [DOCS]: Corrected Scala:TwitterUtils parameters
   jayson <jayson@ziprecruiter.com>
   2015-04-01 11:12:55 +0100
   Commit: 0358b08, github.com/apache/spark/pull/5295

   [SPARK-6597][Minor] Replace `input:checkbox` with `input[type="checkbox"]` in additional-metrics.js
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-04-01 11:11:56 +0100
   Commit: d824c11, github.com/apache/spark/pull/5254

   [EC2] [SPARK-6600] Open ports in ec2/spark_ec2.py to allow HDFS NFS gateway
   Florian Verhein <florian.verhein@gmail.com>
   2015-04-01 11:10:43 +0100
   Commit: 4122623, github.com/apache/spark/pull/5257

   [SPARK-4655][Core] Split Stage into ShuffleMapStage and ResultStage subclasses
   Ilya Ganelin <ilya.ganelin@capitalone.com>, Ilya Ganelin <ilganeli@gmail.com>
   2015-04-01 11:09:00 +0100
   Commit: ff1915e, github.com/apache/spark/pull/4708

   [Doc] Improve Python DataFrame documentation
   Reynold Xin <rxin@databricks.com>
   2015-03-31 18:31:36 -0700
   Commit: 305abe1, github.com/apache/spark/pull/5287

   [SPARK-6614] OutputCommitCoordinator should clear authorized committer only after authorized committer fails, not after any failure
   Josh Rosen <joshrosen@databricks.com>
   2015-03-31 16:18:39 -0700
   Commit: 3732607, github.com/apache/spark/pull/5276

   [SPARK-5692] [MLlib] Word2Vec save/load
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-03-31 16:01:08 -0700
   Commit: 0e00f12, github.com/apache/spark/pull/5291

   [SPARK-6633][SQL] Should be "Contains" instead of "EndsWith" when constructing sources.StringContains
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-03-31 13:18:07 -0700
   Commit: 2036bc5, github.com/apache/spark/pull/5299

   [SPARK-5371][SQL] Propagate types after function conversion, before further resolution
   Michael Armbrust <michael@databricks.com>
   2015-03-31 11:34:29 -0700
   Commit: beebb7f, github.com/apache/spark/pull/5278

   [SPARK-6255] [MLLIB] Support multiclass classification in Python API
   Yanbo Liang <ybliang8@gmail.com>
   2015-03-31 11:32:14 -0700
   Commit: b5bd75d, github.com/apache/spark/pull/5137

   [SPARK-6598][MLLIB] Python API for IDFModel
   lewuathe <lewuathe@me.com>
   2015-03-31 11:25:21 -0700
   Commit: 46de6c0, github.com/apache/spark/pull/5264

   [SPARK-6145][SQL] fix ORDER BY on nested fields
   Michael Armbrust <michael@databricks.com>
   2015-03-31 11:23:18 -0700
   Commit: cd48ca5, github.com/apache/spark/pull/5189

   [SPARK-6575] [SQL] Adds configuration to disable schema merging while converting metastore Parquet tables
   Cheng Lian <lian@databricks.com>
   2015-03-31 11:21:15 -0700
   Commit: 8102014, github.com/apache/spark/pull/5231

   [SPARK-6555] [SQL] Overrides equals() and hashCode() for MetastoreRelation
   Cheng Lian <lian@databricks.com>
   2015-03-31 11:18:25 -0700
   Commit: a7992ff, github.com/apache/spark/pull/5289

   [SPARK-4894][mllib] Added Bernoulli option to NaiveBayes model in mllib
   leahmcguire <lmcguire@salesforce.com>, Joseph K. Bradley <joseph@databricks.com>, Leah McGuire <lmcguire@salesforce.com>
   2015-03-31 11:16:55 -0700
   Commit: d01a6d8, github.com/apache/spark/pull/4087

   [SPARK-6542][SQL] add CreateStruct
   Xiangrui Meng <meng@databricks.com>
   2015-03-31 17:05:23 +0800
   Commit: a05835b, github.com/apache/spark/pull/5195

   [SPARK-6618][SQL] HiveMetastoreCatalog.lookupRelation should use fine-grained lock
   Yin Huai <yhuai@databricks.com>
   2015-03-31 16:28:40 +0800
   Commit: 314afd0, github.com/apache/spark/pull/5281

   [SPARK-6623][SQL] Alias DataFrame.na.drop and DataFrame.na.fill in Python.
   Reynold Xin <rxin@databricks.com>
   2015-03-31 00:25:23 -0700
   Commit: b80a030, github.com/apache/spark/pull/5284

   [SPARK-6625][SQL] Add common string filters to data sources.
   Reynold Xin <rxin@databricks.com>
   2015-03-31 00:19:51 -0700
   Commit: f07e714, github.com/apache/spark/pull/5285

   [SPARK-5124][Core] Move StopCoordinator to the receive method since it does not require a reply
   zsxwing <zsxwing@gmail.com>
   2015-03-30 22:10:49 -0700
   Commit: 5677557, github.com/apache/spark/pull/5283

   [SPARK-6119][SQL] DataFrame support for missing data handling
   Reynold Xin <rxin@databricks.com>
   2015-03-30 20:47:10 -0700
   Commit: b8ff2bc, github.com/apache/spark/pull/5274

   [SPARK-6369] [SQL] Uses commit coordinator to help committing Hive and Parquet tables
   Cheng Lian <lian@databricks.com>
   2015-03-31 07:48:37 +0800
   Commit: fde6945, github.com/apache/spark/pull/5139

   [SPARK-6603] [PySpark] [SQL] add SQLContext.udf and deprecate inferSchema() and applySchema
   Davies Liu <davies@databricks.com>
   2015-03-30 15:47:00 -0700
   Commit: f76d2e5, github.com/apache/spark/pull/5273

   [HOTFIX][SPARK-4123]: Updated to fix bug where multiple dependencies added breaks Github output
   Brennon York <brennon.york@capitalone.com>
   2015-03-30 12:48:26 -0700
   Commit: df35500, github.com/apache/spark/pull/5269

   [SPARK-6592][SQL] fix filter for scaladoc to generate API doc for Row class under catalyst dir
   CodingCat <zhunansjtu@gmail.com>
   2015-03-30 11:54:44 -0700
   Commit: 32259c6, github.com/apache/spark/pull/5252

   [SPARK-6595][SQL] MetastoreRelation should be a MultiInstanceRelation
   Michael Armbrust <michael@databricks.com>
   2015-03-30 22:24:12 +0800
   Commit: fe81f6c, github.com/apache/spark/pull/5251

   [HOTFIX] Update start-slave.sh
   Jose Manuel Gomez <jmgomez@stratio.com>
   2015-03-30 14:59:08 +0100
   Commit: 19d4c39, github.com/apache/spark/pull/5262

   [SPARK-5750][SPARK-3441][SPARK-5836][CORE] Added documentation explaining shuffle
   Ilya Ganelin <ilya.ganelin@capitalone.com>, Ilya Ganelin <ilganeli@gmail.com>
   2015-03-30 11:52:02 +0100
   Commit: 4bdfb7b, github.com/apache/spark/pull/5074

   [SPARK-6596] fix the instruction on building scaladoc
   CodingCat <zhunansjtu@gmail.com>
   2015-03-30 11:41:43 +0100
   Commit: de67330, github.com/apache/spark/pull/5253

   [spark-sql] a better exception message than "scala.MatchError" for unsupported types in Schema creation
   Eran Medan <ehrann.mehdan@gmail.com>
   2015-03-30 00:02:52 -0700
   Commit: 17b13c5, github.com/apache/spark/pull/5235

   Fix string interpolator error in HeartbeatReceiver
   Li Zhihui <zhihui.li@intel.com>
   2015-03-29 21:30:37 -0700
   Commit: 01dc9f5, github.com/apache/spark/pull/5255

   [SPARK-5124][Core] A standard RPC interface and an Akka implementation
   zsxwing <zsxwing@gmail.com>
   2015-03-29 21:25:09 -0700
   Commit: a8d53af, github.com/apache/spark/pull/4588

   [SPARK-6585][Tests]Fix FileServerSuite testcase in some Env.
   June.He <jun.hejun@huawei.com>
   2015-03-29 12:47:22 +0100
   Commit: 0e2753f, github.com/apache/spark/pull/5239

   [SPARK-6558] Utils.getCurrentUserName returns the full principal name instead of login name
   Thomas Graves <tgraves@apache.org>
   2015-03-29 12:43:30 +0100
   Commit: 52ece26, github.com/apache/spark/pull/5229

   [SPARK-6406] Launch Spark using assembly jar instead of a separate launcher jar
   Nishkam Ravi <nravi@cloudera.com>, nishkamravi2 <nishkamravi@gmail.com>, nravi <nravi@c1704.halxg.cloudera.com>
   2015-03-29 12:40:37 +0100
   Commit: e3eb393, github.com/apache/spark/pull/5085

   [SPARK-4123][Project Infra]: Show new dependencies added in pull requests
   Brennon York <brennon.york@capitalone.com>
   2015-03-29 12:37:53 +0100
   Commit: 55153f5, github.com/apache/spark/pull/5093

   [DOC] Improvements to Python docs.
   Reynold Xin <rxin@databricks.com>
   2015-03-28 23:59:27 -0700
   Commit: 5eef00d, github.com/apache/spark/pull/5238

   [SPARK-6571][MLLIB] use wrapper in MatrixFactorizationModel.load
   Xiangrui Meng <meng@databricks.com>
   2015-03-28 15:08:05 -0700
   Commit: f75f633, github.com/apache/spark/pull/5243

   [SPARK-6552][Deploy][Doc]expose start-slave.sh to user and update outdated doc
   WangTaoTheTonic <wangtao111@huawei.com>
   2015-03-28 12:32:35 +0000
   Commit: 9963143, github.com/apache/spark/pull/5205

   [SPARK-6538][SQL] Add missing nullable Metastore fields when merging a Parquet schema
   Adam Budde <budde@amazon.com>
   2015-03-28 09:14:09 +0800
   Commit: 5909f09, github.com/apache/spark/pull/5214

   [SPARK-6564][SQL] SQLContext.emptyDataFrame should contain 0 row, not 1 row
   Reynold Xin <rxin@databricks.com>
   2015-03-27 14:56:57 -0700
   Commit: 3af7334, github.com/apache/spark/pull/5226

   [SPARK-6526][ML] Add Normalizer transformer in ML package
   Xusen Yin <yinxusen@gmail.com>
   2015-03-27 13:29:10 -0700
   Commit: d5497ab, github.com/apache/spark/pull/5181

   [SPARK-6574] [PySpark] fix sql example
   Davies Liu <davies@databricks.com>
   2015-03-27 11:42:26 -0700
   Commit: 887e1b7, github.com/apache/spark/pull/5230

   [SPARK-6550][SQL] Use analyzed plan in DataFrame
   Michael Armbrust <michael@databricks.com>
   2015-03-27 11:40:00 -0700
   Commit: 5d9c37c, github.com/apache/spark/pull/5217

   [SPARK-6544][build] Increment Avro version from 1.7.6 to 1.7.7
   Dean Chen <deanchen5@gmail.com>
   2015-03-27 14:32:51 +0000
   Commit: aa2b991, github.com/apache/spark/pull/5193

   [SPARK-6556][Core] Fix wrong parsing logic of executorTimeoutMs and checkTimeoutIntervalMs in HeartbeatReceiver
   zsxwing <zsxwing@gmail.com>
   2015-03-27 12:31:06 +0000
   Commit: da546b7, github.com/apache/spark/pull/5209

   [SPARK-6341][mllib] Upgrade breeze from 0.11.1 to 0.11.2
   Yu ISHIKAWA <yuu.ishikawa@gmail.com>
   2015-03-27 00:15:02 -0700
   Commit: f43a610, github.com/apache/spark/pull/5222

   [SPARK-6405] Limiting the maximum Kryo buffer size to be 2GB.
   mcheah <mcheah@palantir.com>
   2015-03-26 22:48:42 -0700
   Commit: 49d2ec6, github.com/apache/spark/pull/5218

   [SPARK-6510][GraphX]: Add Graph#minus method to act as Set#difference
   Brennon York <brennon.york@capitalone.com>
   2015-03-26 19:08:09 -0700
   Commit: 39fb579, github.com/apache/spark/pull/5175

   [DOCS][SQL] Fix JDBC example
   Michael Armbrust <michael@databricks.com>
   2015-03-26 14:51:46 -0700
   Commit: aad0032, github.com/apache/spark/pull/5192

   [SPARK-6554] [SQL] Don't push down predicates which reference partition column(s)
   Cheng Lian <lian@databricks.com>
   2015-03-26 13:11:37 -0700
   Commit: 71a0d40, github.com/apache/spark/pull/5210

   [SPARK-6117] [SQL] Improvements to DataFrame.describe()
   Reynold Xin <rxin@databricks.com>
   2015-03-26 12:26:13 -0700
   Commit: 784fcd5, github.com/apache/spark/pull/5201

   SPARK-6532 [BUILD] LDAModel.scala fails scalastyle on Windows
   Sean Owen <sowen@cloudera.com>
   2015-03-26 10:52:31 -0700
   Commit: c3a52a0, github.com/apache/spark/pull/5211

   SPARK-6480 [CORE] histogram() bucket function is wrong in some simple edge cases
   Sean Owen <sowen@cloudera.com>
   2015-03-26 15:00:23 +0000
   Commit: fe15ea9, github.com/apache/spark/pull/5148

   [MLlib]remove unused import
   Yuhao Yang <hhbyyh@gmail.com>
   2015-03-26 13:27:05 +0000
   Commit: 3ddb975, github.com/apache/spark/pull/5207

   [SQL][SPARK-6471]: Metastore schema should only be a subset of parquet schema to support dropping of columns using replace columns
   Yash Datta <Yash.Datta@guavus.com>
   2015-03-26 21:13:38 +0800
   Commit: 1c05027, github.com/apache/spark/pull/5141

   [SPARK-6468][Block Manager] Fix the race condition of subDirs in DiskBlockManager
   zsxwing <zsxwing@gmail.com>
   2015-03-26 12:54:48 +0000
   Commit: 0c88ce5, github.com/apache/spark/pull/5136

   [SPARK-6465][SQL] Fix serialization of GenericRowWithSchema using kryo
   Michael Armbrust <michael@databricks.com>
   2015-03-26 18:46:57 +0800
   Commit: f88f51b, github.com/apache/spark/pull/5191

   [SPARK-6546][Build] Using the wrong code that will make spark compile failed!!
   DoingDone9 <799203320@qq.com>
   2015-03-26 17:04:19 +0800
   Commit: 855cba8, github.com/apache/spark/pull/5198

   [SPARK-6117] [SQL] add describe function to DataFrame for summary statis...
   azagrebin <azagrebin@gmail.com>
   2015-03-26 00:25:04 -0700
   Commit: 5bbcd13, github.com/apache/spark/pull/5073

   [SPARK-6536] [PySpark] Column.inSet() in Python
   Davies Liu <davies@databricks.com>
   2015-03-26 00:01:24 -0700
   Commit: f535802, github.com/apache/spark/pull/5190

   [SPARK-6463][SQL] AttributeSet.equal should compare size
   sisihj <jun.hejun@huawei.com>, Michael Armbrust <michael@databricks.com>
   2015-03-25 19:21:54 -0700
   Commit: 276ef1c, github.com/apache/spark/pull/5194

   The UT test of spark is failed. Because there is a test in SQLQuerySuite about creating table “test”
   KaiXinXiaoLei <huleilei1@huawei.com>
   2015-03-25 19:15:30 -0700
   Commit: e87bf37, github.com/apache/spark/pull/5150

   [SPARK-6202] [SQL] enable variable substitution on test framework
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-03-25 18:43:26 -0700
   Commit: 5ab6e9f, github.com/apache/spark/pull/4930

   [SPARK-6271][SQL] Sort these tokens in alphabetic order to avoid further duplicate in HiveQl
   DoingDone9 <799203320@qq.com>
   2015-03-25 18:41:59 -0700
   Commit: 328daf6, github.com/apache/spark/pull/4973

   [SPARK-6326][SQL] Improve castStruct to be faster
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-03-25 17:52:23 -0700
   Commit: 73d5775, github.com/apache/spark/pull/5017

   [SPARK-5498][SQL]fix query exception when partition schema does not match table schema
   jeanlyn <jeanlyn92@gmail.com>
   2015-03-25 17:47:45 -0700
   Commit: e6d1406, github.com/apache/spark/pull/4289

   [SPARK-6450] [SQL] Fixes metastore Parquet table conversion
   Cheng Lian <lian@databricks.com>
   2015-03-25 17:40:19 -0700
   Commit: 8c3b005, github.com/apache/spark/pull/5183

   [SPARK-6079] Use index to speed up StatusTracker.getJobIdsForGroup()
   Josh Rosen <joshrosen@databricks.com>
   2015-03-25 17:40:00 -0700
   Commit: d44a336, github.com/apache/spark/pull/4830

   [SPARK-5987] [MLlib] Save/load for GaussianMixtureModels
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-03-25 14:45:23 -0700
   Commit: 4fc4d03, github.com/apache/spark/pull/4986

   [SPARK-6256] [MLlib] MLlib Python API parity check for regression
   Yanbo Liang <ybliang8@gmail.com>
   2015-03-25 13:38:33 -0700
   Commit: 4353373, github.com/apache/spark/pull/4997

   [SPARK-5771] Master UI inconsistently displays application cores
   Andrew Or <andrew@databricks.com>
   2015-03-25 13:28:32 -0700
   Commit: c1b74df, github.com/apache/spark/pull/5177

   [SPARK-6537] UIWorkloadGenerator: The main thread should not stop SparkContext until all jobs finish
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-03-25 13:27:15 -0700
   Commit: acef51d, github.com/apache/spark/pull/5187

   [SPARK-6076][Block Manager] Fix a potential OOM issue when StorageLevel is MEMORY_AND_DISK_SER
   zsxwing <zsxwing@gmail.com>
   2015-03-25 12:09:30 -0700
   Commit: 883b7e9, github.com/apache/spark/pull/4827

   [SPARK-6409][SQL] It is not necessary that avoid old interface of hive, because this will make some UDAF can not work.
   DoingDone9 <799203320@qq.com>
   2015-03-25 11:11:52 -0700
   Commit: 968408b, github.com/apache/spark/pull/5131

   [ML][FEATURE] SPARK-5566: RegEx Tokenizer
   Augustin Borsu <augustin@sagacify.com>, Augustin Borsu <a.borsu@gmail.com>, Augustin Borsu <aborsu@gmail.com>, Xiangrui Meng <meng@databricks.com>
   2015-03-25 10:16:39 -0700
   Commit: 982952f, github.com/apache/spark/pull/4504

   [SPARK-6496] [MLLIB] GeneralizedLinearAlgorithm.run(input, initialWeights) should initialize numFeatures
   Yanbo Liang <ybliang8@gmail.com>
   2015-03-25 17:05:56 +0000
   Commit: 10c7860, github.com/apache/spark/pull/5167

   [SPARK-6483][SQL]Improve ScalaUdf called performance.
   zzcclp <xm_zzc@sina.com>
   2015-03-25 19:11:04 +0800
   Commit: 64262ed, github.com/apache/spark/pull/5154

   [DOCUMENTATION]Fixed Missing Type Import in Documentation
   Bill Chambers <wchambers@ischool.berkeley.edu>, anabranch <wac.chambers@gmail.com>
   2015-03-24 22:24:35 -0700
   Commit: c5cc414, github.com/apache/spark/pull/5179

   [SPARK-6515] update OpenHashSet impl
   Xiangrui Meng <meng@databricks.com>
   2015-03-24 18:58:27 -0700
   Commit: c14ddd9, github.com/apache/spark/pull/5176

   [SPARK-6428][Streaming] Added explicit types for all public methods.
   Reynold Xin <rxin@databricks.com>
   2015-03-24 17:08:25 -0700
   Commit: 9459865, github.com/apache/spark/pull/5110

   [SPARK-6512] add contains to OpenHashMap
   Xiangrui Meng <meng@databricks.com>
   2015-03-24 17:06:22 -0700
   Commit: 6930e96, github.com/apache/spark/pull/5171

   [SPARK-6469] Improving documentation on YARN local directories usage
   Christophe Préaud <christophe.preaud@kelkoo.com>
   2015-03-24 17:05:49 -0700
   Commit: 05c2214, github.com/apache/spark/pull/5165

   Revert "[SPARK-5771] Number of Cores in Completed Applications of Standalone Master Web Page always be 0 if sc.stop() is called"
   Andrew Or <andrew@databricks.com>
   2015-03-24 16:49:27 -0700
   Commit: dd907d1

   Revert "[SPARK-5771][UI][hotfix] Change Requested Cores into * if default cores is not set"
   Andrew Or <andrew@databricks.com>
   2015-03-24 16:41:31 -0700
   Commit: f7c3668

   [SPARK-3570] Include time to open files in shuffle write time.
   Kay Ousterhout <kayousterhout@gmail.com>
   2015-03-24 16:29:40 -0700
   Commit: d8ccf65, github.com/apache/spark/pull/4550

   [SPARK-6088] Correct how tasks that get remote results are shown in UI.
   Kay Ousterhout <kayousterhout@gmail.com>
   2015-03-24 16:26:43 -0700
   Commit: 6948ab6, github.com/apache/spark/pull/4839

   [SPARK-6428][SQL] Added explicit types for all public methods in catalyst
   Reynold Xin <rxin@databricks.com>
   2015-03-24 16:03:55 -0700
   Commit: 7334801, github.com/apache/spark/pull/5162

   [SPARK-6209] Clean up connections in ExecutorClassLoader after failing to load classes (master branch PR)
   Josh Rosen <joshrosen@databricks.com>
   2015-03-24 14:38:20 -0700
   Commit: 7215aa74, github.com/apache/spark/pull/4944

   [SPARK-6458][SQL] Better error messages for invalid data sources
   Michael Armbrust <michael@databricks.com>
   2015-03-24 14:10:56 -0700
   Commit: a8f51b8, github.com/apache/spark/pull/5158

   [SPARK-6376][SQL] Avoid eliminating subqueries until optimization
   Michael Armbrust <michael@databricks.com>
   2015-03-24 14:08:20 -0700
   Commit: cbeaf9e, github.com/apache/spark/pull/5160

   [SPARK-6375][SQL] Fix formatting of error messages.
   Michael Armbrust <michael@databricks.com>
   2015-03-24 13:22:46 -0700
   Commit: 046c1e2, github.com/apache/spark/pull/5155

   [SPARK-6054][SQL] Fix transformations of TreeNodes that hold StructTypes
   Michael Armbrust <michael@databricks.com>
   2015-03-24 12:28:01 -0700
   Commit: 3fa3d12, github.com/apache/spark/pull/5157

   [SPARK-6437][SQL] Use completion iterator to close external sorter
   Michael Armbrust <michael@databricks.com>
   2015-03-24 12:10:30 -0700
   Commit: 26c6ce3, github.com/apache/spark/pull/5161

   [SPARK-6459][SQL] Warn when constructing trivially true equals predicate
   Michael Armbrust <michael@databricks.com>
   2015-03-24 12:09:02 -0700
   Commit: 32efadd, github.com/apache/spark/pull/5163

   [SPARK-6361][SQL] support adding a column with metadata in DF
   Xiangrui Meng <meng@databricks.com>
   2015-03-24 12:08:19 -0700
   Commit: 6bdddb6, github.com/apache/spark/pull/5151

   [SPARK-6475][SQL] recognize array types when infer data types from JavaBeans
   Xiangrui Meng <meng@databricks.com>
   2015-03-24 10:11:27 -0700
   Commit: a1d1529, github.com/apache/spark/pull/5146

   [ML][docs][minor] Define LabeledDocument/Document classes in CV example
   Peter Rudenko <petro.rudenko@gmail.com>
   2015-03-24 16:33:38 +0000
   Commit: 08d4528, github.com/apache/spark/pull/5135

   [SPARK-5559] [Streaming] [Test] Remove opportunity we met flakiness when running FlumeStreamSuite
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-03-24 16:13:25 +0000
   Commit: 85cf063, github.com/apache/spark/pull/4337

   [SPARK-6473] [core] Do not try to figure out Scala version if not needed...
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-03-24 13:48:33 +0000
   Commit: b293afc, github.com/apache/spark/pull/5143

   Update the command to use IPython notebook
   Cong Yue <yuecong1104@gmail.com>
   2015-03-24 12:56:13 +0000
   Commit: c12312f, github.com/apache/spark/pull/5111

   [SPARK-6477][Build]: Run MIMA tests before the Spark test suite
   Brennon York <brennon.york@capitalone.com>
   2015-03-24 10:33:04 +0000
   Commit: 37fac1d, github.com/apache/spark/pull/5145

   [SPARK-6452] [SQL] Checks for missing attributes and unresolved operator for all types of operator
   Cheng Lian <lian@databricks.com>
   2015-03-24 01:12:11 -0700
   Commit: 1afcf77, github.com/apache/spark/pull/5129

   [SPARK-6428] Added explicit types for all public methods in core.
   Reynold Xin <rxin@databricks.com>
   2015-03-23 23:41:06 -0700
   Commit: 4ce2782, github.com/apache/spark/pull/5125

   [SPARK-6124] Support jdbc connection properties in OPTIONS part of the query
   Volodymyr Lyubinets <vlyubin@gmail.com>
   2015-03-23 17:00:27 -0700
   Commit: bfd3ee9, github.com/apache/spark/pull/4859

   Revert "[SPARK-6122][Core] Upgrade Tachyon client version to 0.6.1."
   Patrick Wendell <patrick@databricks.com>
   2015-03-23 15:08:39 -0700
   Commit: 6cd7058

   [SPARK-6308] [MLlib] [Sql] Override TypeName in VectorUDT and MatrixUDT
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-03-23 13:30:21 -0700
   Commit: 474d132, github.com/apache/spark/pull/5118

   [SPARK-6397][SQL] Check the missingInput simply
   Yadong Qi <qiyadong2010@gmail.com>
   2015-03-23 18:16:49 +0800
   Commit: 9f3273b, github.com/apache/spark/pull/5132

   Revert "[SPARK-6397][SQL] Check the missingInput simply"
   Cheng Lian <lian@databricks.com>
   2015-03-23 12:15:19 +0800
   Commit: bf044de

   [SPARK-6397][SQL] Check the missingInput simply
   q00251598 <qiyadong@huawei.com>
   2015-03-23 12:06:13 +0800
   Commit: e566fe5, github.com/apache/spark/pull/5082

   [SPARK-4985] [SQL] parquet support for date type
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-03-23 11:46:16 +0800
   Commit: 4659468, github.com/apache/spark/pull/3822

   [SPARK-6337][Documentation, SQL]Spark 1.3 doc fixes
   vinodkc <vinod.kc.in@gmail.com>
   2015-03-22 20:00:08 +0000
   Commit: 2bf40c5, github.com/apache/spark/pull/5112

   [HOTFIX] Build break due to https://github.com/apache/spark/pull/5128
   Reynold Xin <rxin@databricks.com>
   2015-03-22 12:08:15 -0700
   Commit: 7a0da47

   [SPARK-6122][Core] Upgrade Tachyon client version to 0.6.1.
   Calvin Jia <jia.calvin@gmail.com>
   2015-03-22 11:11:29 -0700
   Commit: a41b9c6, github.com/apache/spark/pull/4867

   SPARK-6454 [DOCS] Fix links to pyspark api
   Kamil Smuga <smugakamil@gmail.com>, stderr <smugakamil@gmail.com>
   2015-03-22 15:56:25 +0000
   Commit: 6ef4863, github.com/apache/spark/pull/5120

   [SPARK-6453][Mesos] Some Mesos*Suite have a different package with their classes
   Jongyoul Lee <jongyoul@gmail.com>
   2015-03-22 15:53:18 +0000
   Commit: adb2ff7, github.com/apache/spark/pull/5126

   [SPARK-6455] [docs] Correct some mistakes and typos
   Hangchen Yu <yuhc@gitcafe.com>
   2015-03-22 15:51:10 +0000
   Commit: ab4f516, github.com/apache/spark/pull/5128

   [SPARK-6448] Make history server log parse exceptions
   Ryan Williams <ryan.blake.williams@gmail.com>
   2015-03-22 11:54:23 +0000
   Commit: b9fe504, github.com/apache/spark/pull/5122

   [SPARK-6408] [SQL] Fix JDBCRDD filtering string literals
   ypcat <ypcat6@gmail.com>, Pei-Lun Lee <pllee@appier.com>
   2015-03-22 15:49:13 +0800
   Commit: 9b1e1f2, github.com/apache/spark/pull/5087

   [SPARK-6428][SQL] Added explicit type for all public methods for Hive module
   Reynold Xin <rxin@databricks.com>
   2015-03-21 14:30:04 -0700
   Commit: b6090f9, github.com/apache/spark/pull/5108

   [SPARK-6250][SPARK-6146][SPARK-5911][SQL] Types are now reserved words in DDL parser.
   Yin Huai <yhuai@databricks.com>
   2015-03-21 13:27:53 -0700
   Commit: 94a102a, github.com/apache/spark/pull/5078

   [SPARK-5680][SQL] Sum function on all null values, should return zero
   Venkata Ramana G <ramana.gollamudi@huawei.com>, Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
   2015-03-21 13:24:24 -0700
   Commit: ee569a0, github.com/apache/spark/pull/4466

   [SPARK-5320][SQL]Add statistics method at NoRelation (override super).
   x1- <viva008@gmail.com>
   2015-03-21 13:22:34 -0700
   Commit: 52dd4b2, github.com/apache/spark/pull/5105

   [SPARK-5821] [SQL] JSON CTAS command should throw error message when delete path failure
   Yanbo Liang <ybliang8@gmail.com>, Yanbo Liang <yanbohappy@gmail.com>
   2015-03-21 11:23:28 +0800
   Commit: e5d2c37, github.com/apache/spark/pull/4610

   [SPARK-6315] [SQL] Also tries the case class string parser while reading Parquet schema
   Cheng Lian <lian@databricks.com>
   2015-03-21 11:18:45 +0800
   Commit: 937c1e5, github.com/apache/spark/pull/5034

   [SPARK-5821] [SQL] ParquetRelation2 CTAS should check if delete is successful
   Yanbo Liang <ybliang8@gmail.com>
   2015-03-21 10:53:04 +0800
   Commit: bc37c97, github.com/apache/spark/pull/5107

   [SPARK-6025] [MLlib] Add helper method evaluateEachIteration to extract learning curve
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-03-20 17:14:09 -0700
   Commit: 25e271d, github.com/apache/spark/pull/4906

   [SPARK-6428][SQL] Added explicit type for all public methods in sql/core
   Reynold Xin <rxin@databricks.com>
   2015-03-20 15:47:07 -0700
   Commit: a95043b, github.com/apache/spark/pull/5104

   [SPARK-6421][MLLIB] _regression_train_wrapper does not test initialWeights correctly
   lewuathe <lewuathe@me.com>
   2015-03-20 17:18:18 -0400
   Commit: 257cde7, github.com/apache/spark/pull/5101

   [SPARK-6309] [SQL] [MLlib] Implement MatrixUDT
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-03-20 17:13:18 -0400
   Commit: 11e0259, github.com/apache/spark/pull/5048

   [SPARK-6423][Mesos] MemoryUtils should use memoryOverhead if it's set
   Jongyoul Lee <jongyoul@gmail.com>
   2015-03-20 19:14:35 +0000
   Commit: 49a01c7, github.com/apache/spark/pull/5099

   [SPARK-5955][MLLIB] add checkpointInterval to ALS
   Xiangrui Meng <meng@databricks.com>
   2015-03-20 15:02:57 -0400
   Commit: 6b36470, github.com/apache/spark/pull/5076

   [SPARK-6096][MLlib] Add Naive Bayes load save methods in Python
   Xusen Yin <yinxusen@gmail.com>
   2015-03-20 14:53:59 -0400
   Commit: 25636d9, github.com/apache/spark/pull/5090

   [MLlib] SPARK-5954: Top by key
   Shuo Xiang <shuoxiangpub@gmail.com>
   2015-03-20 14:45:44 -0400
   Commit: 5e6ad24, github.com/apache/spark/pull/5075

   [SPARK-6095] [MLLIB] Support model save/load in Python's linear models
   Yanbo Liang <ybliang8@gmail.com>
   2015-03-20 14:44:21 -0400
   Commit: 48866f7, github.com/apache/spark/pull/5016

   [SPARK-6371] [build] Update version to 1.4.0-SNAPSHOT.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-03-20 18:43:57 +0000
   Commit: a745645, github.com/apache/spark/pull/5056

   [SPARK-6426][Doc]User could also point the yarn cluster config directory via YARN_CONF_DI...
   WangTaoTheTonic <wangtao111@huawei.com>
   2015-03-20 18:42:18 +0000
   Commit: 385b2ff, github.com/apache/spark/pull/5103

   [SPARK-6370][core] Documentation: Improve all 3 docs for RDD.sample
   mbonaci <mbonaci@gmail.com>
   2015-03-20 18:30:45 +0000
   Commit: 28bcb9e, github.com/apache/spark/pull/5097

   [SPARK-6428][MLlib] Added explicit type for public methods and implemented hashCode when equals is defined.
   Reynold Xin <rxin@databricks.com>
   2015-03-20 14:13:02 -0400
   Commit: db4d317, github.com/apache/spark/pull/5102

   SPARK-6338 [CORE] Use standard temp dir mechanisms in tests to avoid orphaned temp files
   Sean Owen <sowen@cloudera.com>
   2015-03-20 14:16:21 +0000
   Commit: 6f80c3e, github.com/apache/spark/pull/5029

   SPARK-5134 [BUILD] Bump default Hadoop version to 2+
   Sean Owen <sowen@cloudera.com>
   2015-03-20 14:14:53 +0000
   Commit: d08e3eb, github.com/apache/spark/pull/5027

   [SPARK-6286][Mesos][minor] Handle missing Mesos case TASK_ERROR
   Jongyoul Lee <jongyoul@gmail.com>
   2015-03-20 12:24:34 +0000
   Commit: 116c553, github.com/apache/spark/pull/5088

   Tighten up field/method visibility in Executor and made some code more clear to read.
   Reynold Xin <rxin@databricks.com>
   2015-03-19 22:12:01 -0400
   Commit: 0745a30, github.com/apache/spark/pull/4850

   [SPARK-6219] [Build] Check that Python code compiles
   Nicholas Chammas <nicholas.chammas@gmail.com>
   2015-03-19 12:46:10 -0700
   Commit: f17d43b, github.com/apache/spark/pull/4941

   [Core][minor] remove unused `visitedStages` in `DAGScheduler.stageDependsOn`
   Wenchen Fan <cloud0fan@outlook.com>
   2015-03-19 15:25:32 -0400
   Commit: 3b5aaa6, github.com/apache/spark/pull/5086

   [SPARK-5313][Project Infra]: Create simple framework for highlighting changes introduced in a PR
   Brennon York <brennon.york@capitalone.com>
   2015-03-19 11:18:24 -0400
   Commit: 8cb23a1, github.com/apache/spark/pull/5072

   [SPARK-6291] [MLLIB] GLM toString & toDebugString
   Yanbo Liang <ybliang8@gmail.com>
   2015-03-19 11:10:20 -0400
   Commit: dda4ded, github.com/apache/spark/pull/5038

   [SPARK-5843] [API] Allowing map-side combine to be specified in Java.
   mcheah <mcheah@palantir.com>
   2015-03-19 08:51:49 -0400
   Commit: 3c4e486, github.com/apache/spark/pull/4634

   [SPARK-6402][DOC] - Remove some references to shark in docs and ec2
   Pierre Borckmans <pierre.borckmans@realimpactanalytics.com>
   2015-03-19 08:02:06 -0400
   Commit: 797f8a0, github.com/apache/spark/pull/5083

   [SPARK-4012] stop SparkContext when the exception is thrown from an infinite loop
   CodingCat <zhunansjtu@gmail.com>
   2015-03-18 23:48:45 -0700
   Commit: 2c3f83c, github.com/apache/spark/pull/5004

   [SPARK-6222][Streaming] Don't delete checkpoint data when doing pre-batch-start checkpoint
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-03-19 02:15:50 -0400
   Commit: 645cf3f, github.com/apache/spark/pull/5008

   [SPARK-6394][Core] cleanup BlockManager companion object and improve the getCacheLocs method in DAGScheduler
   Wenchen Fan <cloud0fan@outlook.com>
   2015-03-18 19:43:04 -0700
   Commit: 540b2a4, github.com/apache/spark/pull/5043

   SPARK-6085 Part. 2 Increase default value for memory overhead
   Jongyoul Lee <jongyoul@gmail.com>
   2015-03-18 20:54:22 -0400
   Commit: 3db1387, github.com/apache/spark/pull/5065

   [SPARK-6374] [MLlib] add get for GeneralizedLinearAlgo
   Yuhao Yang <hhbyyh@gmail.com>
   2015-03-18 13:44:37 -0400
   Commit: a95ee24, github.com/apache/spark/pull/5058

   [SPARK-6325] [core,yarn] Do not change target executor count when killing executors.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-03-18 09:18:28 -0400
   Commit: 981fbaf, github.com/apache/spark/pull/5018

   [SPARK-6286][minor] Handle missing Mesos case TASK_ERROR.
   Iulian Dragos <jaguarul@gmail.com>
   2015-03-18 09:15:33 -0400
   Commit: 9d112a9, github.com/apache/spark/pull/5000

   SPARK-6389 YARN app diagnostics report doesn't report NPEs
   Steve Loughran <stevel@hortonworks.com>
   2015-03-18 09:09:32 -0400
   Commit: e09c852, github.com/apache/spark/pull/5070

   [SPARK-6372] [core] Propagate --conf to child processes.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-03-18 09:06:57 -0400
   Commit: 6205a25, github.com/apache/spark/pull/5057

   [SPARK-6247][SQL] Fix resolution of ambiguous joins caused by new aliases
   Michael Armbrust <michael@databricks.com>
   2015-03-17 19:47:51 -0700
   Commit: 3579003, github.com/apache/spark/pull/5062

   [SPARK-5651][SQL] Add input64 in blacklist and add test suite for create table within backticks
   watermen <qiyadong2010@gmail.com>, q00251598 <qiyadong@huawei.com>
   2015-03-17 19:35:18 -0700
   Commit: a6ee2f7, github.com/apache/spark/pull/4427

   [SPARK-5404] [SQL] Update the default statistic number
   Cheng Hao <hao.cheng@intel.com>
   2015-03-17 19:32:38 -0700
   Commit: 78cb08a, github.com/apache/spark/pull/4914

   [SPARK-5908][SQL] Resolve UdtfsAlias when only single Alias is used
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-03-17 18:58:52 -0700
   Commit: 5c80643, github.com/apache/spark/pull/4692

   [SPARK-6383][SQL]Fixed compiler and errors in Dataframe examples
   Tijo Thomas <tijoparacka@gmail.com>
   2015-03-17 18:50:19 -0700
   Commit: a012e08, github.com/apache/spark/pull/5068

   [SPARK-6366][SQL] In Python API, the default save mode for save and saveAsTable should be "error" instead of "append".
   Yin Huai <yhuai@databricks.com>
   2015-03-18 09:41:06 +0800
   Commit: dc9c919, github.com/apache/spark/pull/5053

   [SPARK-6330] [SQL] Add a test case for SPARK-6330
   Pei-Lun Lee <pllee@appier.com>
   2015-03-18 08:34:46 +0800
   Commit: 4633a87, github.com/apache/spark/pull/5039

   [SPARK-6226][MLLIB] add save/load in PySpark's KMeansModel
   Xiangrui Meng <meng@databricks.com>
   2015-03-17 12:14:40 -0700
   Commit: c94d062, github.com/apache/spark/pull/5049

   [SPARK-6336] LBFGS should document what convergenceTol means
   lewuathe <lewuathe@me.com>
   2015-03-17 12:11:57 -0700
   Commit: d9f3e01, github.com/apache/spark/pull/5033

   [SPARK-6313] Add config option to disable file locks/fetchFile cache to ...
   nemccarthy <nathan@nemccarthy.me>
   2015-03-17 09:33:11 -0700
   Commit: 4cca391, github.com/apache/spark/pull/5036

   [SPARK-3266] Use intermediate abstract classes to fix type erasure issues in Java APIs
   Josh Rosen <joshrosen@databricks.com>
   2015-03-17 09:18:57 -0700
   Commit: 0f673c2, github.com/apache/spark/pull/5050

   [SPARK-6365] jetty-security also needed for SPARK_PREPEND_CLASSES to work
   Imran Rashid <irashid@cloudera.com>
   2015-03-17 09:41:06 -0500
   Commit: e9f22c6, github.com/apache/spark/pull/5052

   [SPARK-6331] Load new master URL if present when recovering streaming context from checkpoint
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-03-17 05:31:27 -0700
   Commit: c928796, github.com/apache/spark/pull/5024

   [docs] [SPARK-4820] Spark build encounters "File name too long" on some encrypted filesystems
   Theodore Vasiloudis <tvas@sics.se>
   2015-03-17 11:25:01 +0000
   Commit: e26db9be, github.com/apache/spark/pull/5041

   [SPARK-6269] [CORE] Use ScalaRunTime's array methods instead of java.lang.reflect.Array in size estimation
   mcheah <mcheah@palantir.com>, Justin Uang <justin.uang@gmail.com>
   2015-03-17 11:20:20 +0000
   Commit: 005d1c5, github.com/apache/spark/pull/4972

   [SPARK-4011] tighten the visibility of the members in Master/Worker class
   CodingCat <zhunansjtu@gmail.com>
   2015-03-17 11:18:27 +0000
   Commit: 25f3580, github.com/apache/spark/pull/4844

   SPARK-6044 [CORE] RDD.aggregate() should not use the closure serializer on the zero value
   Sean Owen <sowen@cloudera.com>
   2015-03-16 23:58:52 -0700
   Commit: b2d8c02, github.com/apache/spark/pull/5028

   [SPARK-6357][GraphX] Add unapply in EdgeContext
   Takeshi YAMAMURO <linguin.m.s@gmail.com>
   2015-03-16 23:54:54 -0700
   Commit: b3e6eca, github.com/apache/spark/pull/5047

   [SQL][docs][minor] Fixed sample code in SQLContext scaladoc
   Lomig Mégard <lomig.megard@gmail.com>
   2015-03-16 23:52:42 -0700
   Commit: 6870722, github.com/apache/spark/pull/5051

   [SPARK-6299][CORE] ClassNotFoundException in standalone mode when running groupByKey with class defined in REPL
   Kevin (Sangwoo) Kim <sangwookim.me@gmail.com>
   2015-03-16 23:49:23 -0700
   Commit: f0edeae, github.com/apache/spark/pull/5046

   [SPARK-5712] [SQL] fix comment with semicolon at end
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-03-17 12:29:15 +0800
   Commit: 9667b9f, github.com/apache/spark/pull/4500

   [SPARK-6327] [PySpark] fix launch spark-submit from python
   Davies Liu <davies@databricks.com>
   2015-03-16 16:26:55 -0700
   Commit: e3f315a, github.com/apache/spark/pull/5019

   [SPARK-6077] Remove streaming tab while stopping StreamingContext
   lisurprise <zhichao.li@intel.com>
   2015-03-16 13:10:32 -0700
   Commit: f149b8b, github.com/apache/spark/pull/4828

   [SPARK-6330] Fix filesystem bug in newParquet relation
   Volodymyr Lyubinets <vlyubin@gmail.com>
   2015-03-16 12:13:18 -0700
   Commit: d19efed, github.com/apache/spark/pull/5020

   [SPARK-2087] [SQL] Multiple thriftserver sessions with single HiveContext instance
   Cheng Hao <hao.cheng@intel.com>
   2015-03-17 01:09:27 +0800
   Commit: 12a345a, github.com/apache/spark/pull/4885

   [SPARK-6300][Spark Core] sc.addFile(path) does not support the relative path.
   DoingDone9 <799203320@qq.com>
   2015-03-16 12:27:15 +0000
   Commit: 00e730b, github.com/apache/spark/pull/4993

   [SPARK-5922][GraphX]: Add diff(other: RDD[VertexId, VD]) in VertexRDD
   Brennon York <brennon.york@capitalone.com>
   2015-03-16 01:06:26 -0700
   Commit: 45f4c66, github.com/apache/spark/pull/4733

   [SPARK-3619] Part 2. Upgrade to Mesos 0.21 to work around MESOS-1688
   Jongyoul Lee <jongyoul@gmail.com>
   2015-03-15 15:46:55 +0000
   Commit: aa6536f, github.com/apache/spark/pull/4361

   [SPARK-6285][SQL]Remove ParquetTestData in SparkBuild.scala and in README.md
   OopsOutOfMemory <victorshengli@126.com>
   2015-03-15 20:44:45 +0800
   Commit: 62ede53, github.com/apache/spark/pull/5032

   [SPARK-5790][GraphX]: VertexRDD's won't zip properly for `diff` capability (added tests)
   Brennon York <brennon.york@capitalone.com>
   2015-03-14 17:38:12 +0000
   Commit: c49d156, github.com/apache/spark/pull/5023

   [SPARK-6329][Docs]: Minor doc changes for Mesos and TOC
   Brennon York <brennon.york@capitalone.com>
   2015-03-14 17:28:13 +0000
   Commit: 127268b, github.com/apache/spark/pull/5022

   [SPARK-6195] [SQL] Adds in-memory column type for fixed-precision decimals
   Cheng Lian <lian@databricks.com>
   2015-03-14 19:53:54 +0800
   Commit: 5be6b0e, github.com/apache/spark/pull/4938

   [SQL] Delete some duplicate code in HiveThriftServer2
   ArcherShao <ArcherShao@users.noreply.github.com>, ArcherShao <shaochuan@huawei.com>
   2015-03-14 08:27:18 +0000
   Commit: ee15404, github.com/apache/spark/pull/5007

   [SPARK-6210] [SQL] use prettyString as column name in agg()
   Davies Liu <davies@databricks.com>
   2015-03-14 00:43:33 -0700
   Commit: b38e073, github.com/apache/spark/pull/5006

   [SPARK-6317][SQL]Fixed HIVE console startup issue
   vinodkc <vinod.kc.in@gmail.com>, Vinod K C <vinod.kc@huawei.com>
   2015-03-14 07:17:54 +0800
   Commit: e360d5e, github.com/apache/spark/pull/5011

   [SPARK-6285] [SQL] Removes unused ParquetTestData and duplicated TestGroupWriteSupport
   Cheng Lian <lian@databricks.com>
   2015-03-14 07:09:53 +0800
   Commit: cdc34ed, github.com/apache/spark/pull/5010

   [SPARK-4600][GraphX]: org.apache.spark.graphx.VertexRDD.diff does not work
   Brennon York <brennon.york@capitalone.com>
   2015-03-13 18:48:31 +0000
   Commit: b943f5d, github.com/apache/spark/pull/5015

   [SPARK-6278][MLLIB] Mention the change of objective in linear regression
   Xiangrui Meng <meng@databricks.com>
   2015-03-13 10:27:28 -0700
   Commit: 7f13434, github.com/apache/spark/pull/4978

   [SPARK-6252] [mllib] Added getLambda to Scala NaiveBayes
   Joseph K. Bradley <joseph.kurata.bradley@gmail.com>, Joseph K. Bradley <joseph@databricks.com>
   2015-03-13 10:26:09 -0700
   Commit: dc4abd4, github.com/apache/spark/pull/4969

   [CORE][minor] remove unnecessary ClassTag in `DAGScheduler`
   Wenchen Fan <cloud0fan@outlook.com>
   2015-03-13 14:08:56 +0000
   Commit: ea3d2ee, github.com/apache/spark/pull/4992

   [SPARK-6197][CORE] handle json exception when history file not finished writing
   Zhang, Liye <liye.zhang@intel.com>
   2015-03-13 13:59:54 +0000
   Commit: 9048e81, github.com/apache/spark/pull/4927

   [SPARK-5310] [SQL] [DOC] Parquet section for the SQL programming guide
   Cheng Lian <lian@databricks.com>
   2015-03-13 21:34:50 +0800
   Commit: 69ff8e8, github.com/apache/spark/pull/5001

   [SPARK-5845][Shuffle] Time to cleanup spilled shuffle files not included in shuffle write time
   Ilya Ganelin <ilya.ganelin@capitalone.com>
   2015-03-13 13:21:04 +0000
   Commit: 0af9ea7, github.com/apache/spark/pull/4965

   HOTFIX: Changes to release script.
   Patrick Wendell <patrick@databricks.com>
   2015-03-12 18:36:17 -0700
   Commit: 3980ebd

   [mllib] [python] Add LassoModel to __all__ in regression.py
   Joseph K. Bradley <joseph@databricks.com>
   2015-03-12 16:46:29 -0700
   Commit: 17c309c, github.com/apache/spark/pull/4970

   [SPARK-4588] ML Attributes
   Xiangrui Meng <meng@databricks.com>, Sean Owen <sowen@cloudera.com>
   2015-03-12 16:34:56 -0700
   Commit: a4b2716, github.com/apache/spark/pull/4925

   [SPARK-6268][MLlib] KMeans parameter getter methods
   Yuhao Yang <hhbyyh@gmail.com>
   2015-03-12 15:17:46 -0700
   Commit: fb4787c, github.com/apache/spark/pull/4974

   [build] [hotfix] Fix make-distribution.sh for Scala 2.11.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-03-12 19:16:58 +0000
   Commit: 8f1bc79, github.com/apache/spark/pull/5002

   [SPARK-6275][Documentation]Miss toDF() function in docs/sql-programming-guide.md
   zzcclp <xm_zzc@sina.com>
   2015-03-12 15:07:15 +0000
   Commit: 304366c, github.com/apache/spark/pull/4977

   [docs] [SPARK-6306] Readme points to dead link
   Theodore Vasiloudis <tvas@sics.se>
   2015-03-12 15:01:33 +0000
   Commit: 4e47d54, github.com/apache/spark/pull/4999

   [SPARK-5814][MLLIB][GRAPHX] Remove JBLAS from runtime
   Xiangrui Meng <meng@databricks.com>
   2015-03-12 01:39:04 -0700
   Commit: 0cba802, github.com/apache/spark/pull/4699

   [SPARK-6294] fix hang when call take() in JVM on PythonRDD
   Davies Liu <davies@databricks.com>
   2015-03-12 01:34:38 -0700
   Commit: 712679a, github.com/apache/spark/pull/4987

   [SPARK-6296] [SQL] Added equals to Column
   Volodymyr Lyubinets <vlyubin@gmail.com>
   2015-03-12 00:55:26 -0700
   Commit: 25b71d8, github.com/apache/spark/pull/4988

   BUILD: Adding more known contributor names
   Patrick Wendell <patrick@databricks.com>
   2015-03-11 22:24:08 -0700
   Commit: e921a66

   [SPARK-6128][Streaming][Documentation] Updates to Spark Streaming Programming Guide
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-03-11 18:48:21 -0700
   Commit: cd3b68d, github.com/apache/spark/pull/4956

   [SPARK-6274][Streaming][Examples] Added examples streaming + sql examples.
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-03-11 11:19:51 -0700
   Commit: 51a79a7, github.com/apache/spark/pull/4975

   SPARK-6245 [SQL] jsonRDD() of empty RDD results in exception
   Sean Owen <sowen@cloudera.com>
   2015-03-11 14:09:09 +0000
   Commit: 55c4831, github.com/apache/spark/pull/4971

   SPARK-3642. Document the nuances of shared variables.
   Sandy Ryza <sandy@cloudera.com>
   2015-03-11 13:22:05 +0000
   Commit: 2d87a41, github.com/apache/spark/pull/2490

   [SPARK-4423] Improve foreach() documentation to avoid confusion between local- and cluster-mode behavior
   Ilya Ganelin <ilya.ganelin@capitalone.com>
   2015-03-11 13:20:15 +0000
   Commit: 548643a, github.com/apache/spark/pull/4696

   [SPARK-6228] [network] Move SASL classes from network/shuffle to network...
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-03-11 13:16:22 +0000
   Commit: 5b335bd, github.com/apache/spark/pull/4953

   SPARK-6225 [CORE] [SQL] [STREAMING] Resolve most build warnings, 1.3.0 edition
   Sean Owen <sowen@cloudera.com>
   2015-03-11 13:15:19 +0000
   Commit: 6e94c4e, github.com/apache/spark/pull/4950

   [SPARK-6279][Streaming]In KafkaRDD.scala, Miss expressions flag "s" at logging string
   zzcclp <xm_zzc@sina.com>
   2015-03-11 12:22:24 +0000
   Commit: ec30c17, github.com/apache/spark/pull/4979

   [SQL][Minor] fix typo in comments
   Hongbo Liu <liuhb86@gmail.com>
   2015-03-11 12:18:24 +0000
   Commit: 40f4979, github.com/apache/spark/pull/4976

   [MINOR] [DOCS] Fix map -> mapToPair in Streaming Java example
   Sean Owen <sowen@cloudera.com>
   2015-03-11 12:16:32 +0000
   Commit: 35b2564, github.com/apache/spark/pull/4967

   [SPARK-4924] Add a library for launching Spark jobs programmatically.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-03-11 01:03:01 -0700
   Commit: 517975d, github.com/apache/spark/pull/3916

   [SPARK-5986][MLLib] Add save/load for k-means
   Xusen Yin <yinxusen@gmail.com>
   2015-03-11 00:24:55 -0700
   Commit: 2d4e00e, github.com/apache/spark/pull/4951

   [SPARK-5183][SQL] Update SQL Docs with JDBC and Migration Guide
   Michael Armbrust <michael@databricks.com>
   2015-03-10 18:13:09 -0700
   Commit: 2672374, github.com/apache/spark/pull/4958

   Minor doc: Remove the extra blank line in data types javadoc.
   Reynold Xin <rxin@databricks.com>
   2015-03-10 17:25:04 -0700
   Commit: 74fb433, github.com/apache/spark/pull/4955

   [SPARK-6186] [EC2] Make Tachyon version configurable in EC2 deployment script
   cheng chang <myairia@gmail.com>
   2015-03-10 11:02:12 +0000
   Commit: 7c7d2d5, github.com/apache/spark/pull/4901

   [SPARK-6191] [EC2] Generalize ability to download libs
   Nicholas Chammas <nicholas.chammas@gmail.com>
   2015-03-10 10:58:31 +0000
   Commit: d14df06, github.com/apache/spark/pull/4919

   [SPARK-6087][CORE] Provide actionable exception if Kryo buffer is not large enough
   Lev Khomich <levkhomich@gmail.com>
   2015-03-10 10:55:42 +0000
   Commit: c4c4b07, github.com/apache/spark/pull/4947

   [SPARK-6177][MLlib]Add note in LDA example to remind possible coalesce
   Yuhao Yang <hhbyyh@gmail.com>
   2015-03-10 10:51:44 +0000
   Commit: 9a0272f, github.com/apache/spark/pull/4899

   [SPARK-6194] [SPARK-677] [PySpark] fix memory leak in collect()
   Davies Liu <davies@databricks.com>
   2015-03-09 16:24:06 -0700
   Commit: 8767565, github.com/apache/spark/pull/4923

   [SPARK-5310][Doc] Update SQL Programming Guide to include DataFrames.
   Reynold Xin <rxin@databricks.com>
   2015-03-09 16:16:16 -0700
   Commit: 3cac199, github.com/apache/spark/pull/4954

   [Docs] Replace references to SchemaRDD with DataFrame
   Reynold Xin <rxin@databricks.com>
   2015-03-09 13:29:19 -0700
   Commit: 70f8814, github.com/apache/spark/pull/4952

   [EC2] [SPARK-6188] Instance types can be mislabeled when re-starting cluster with default arguments
   Theodore Vasiloudis <thvasilo@users.noreply.github.com>, Theodore Vasiloudis <tvas@sics.se>
   2015-03-09 14:16:07 +0000
   Commit: f7c7992, github.com/apache/spark/pull/4916

   [GraphX] Improve LiveJournalPageRank example
   Jacky Li <jacky.likun@huawei.com>
   2015-03-08 19:47:35 +0000
   Commit: 55b1b32, github.com/apache/spark/pull/4917

   SPARK-6205 [CORE] UISeleniumSuite fails for Hadoop 2.x test with NoClassDefFoundError
   Sean Owen <sowen@cloudera.com>
   2015-03-08 14:09:40 +0000
   Commit: f16b7b0, github.com/apache/spark/pull/4933

   [SPARK-6193] [EC2] Push group filter up to EC2
   Nicholas Chammas <nicholas.chammas@gmail.com>
   2015-03-08 14:01:26 +0000
   Commit: 52ed7da, github.com/apache/spark/pull/4922

   [SPARK-5641] [EC2] Allow spark_ec2.py to copy arbitrary files to cluster
   Florian Verhein <florian.verhein@gmail.com>
   2015-03-07 12:56:59 +0000
   Commit: 334c5bd, github.com/apache/spark/pull/4583

   [Minor]fix the wrong description
   WangTaoTheTonic <wangtao111@huawei.com>
   2015-03-07 12:35:26 +0000
   Commit: 729c05b, github.com/apache/spark/pull/4936

   [EC2] Reorder print statements on termination
   Nicholas Chammas <nicholas.chammas@gmail.com>
   2015-03-07 12:33:41 +0000
   Commit: 2646794, github.com/apache/spark/pull/4932

   Fix python typo (+ Scala, Java typos)
   RobertZK <technoguyrob@gmail.com>, Robert Krzyzanowski <technoguyrob@gmail.com>
   2015-03-07 00:16:50 +0000
   Commit: 48a723c, github.com/apache/spark/pull/4840

   [SPARK-6178][Shuffle] Removed unused imports
   Vinod K C <vinod.kc@huawei.com>, Vinod K C <vinod.kc@huawei.com>
   2015-03-06 14:43:09 +0000
   Commit: dba0b2e, github.com/apache/spark/pull/4900

   [Minor] Resolve sbt warnings: postfix operator second should be enabled
   GuoQiang Li <witgo@qq.com>
   2015-03-06 13:20:20 +0000
   Commit: 05cb6b3, github.com/apache/spark/pull/4908

   [core] [minor] Don't pollute source directory when running UtilsSuite.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-03-06 09:43:24 +0000
   Commit: cd7594c, github.com/apache/spark/pull/4921

   [CORE, DEPLOY][minor] align arguments order with docs of worker
   Zhang, Liye <liye.zhang@intel.com>
   2015-03-06 09:34:07 +0000
   Commit: d8b3da9, github.com/apache/spark/pull/4924

   [SQL] Make Strategies a public developer API
   Michael Armbrust <michael@databricks.com>
   2015-03-05 14:50:25 -0800
   Commit: eb48fd6, github.com/apache/spark/pull/4920

   [SPARK-6163][SQL] jsonFile should be backed by the data source API
   Yin Huai <yhuai@databricks.com>
   2015-03-05 14:49:44 -0800
   Commit: 1b4bb25, github.com/apache/spark/pull/4896

   [SPARK-6145][SQL] fix ORDER BY on nested fields
   Wenchen Fan <cloud0fan@outlook.com>, Michael Armbrust <michael@databricks.com>
   2015-03-05 14:49:01 -0800
   Commit: 5873c71, github.com/apache/spark/pull/4918

   [SPARK-6175] Fix standalone executor log links when ephemeral ports or SPARK_PUBLIC_DNS are used
   Josh Rosen <joshrosen@databricks.com>
   2015-03-05 12:04:00 -0800
   Commit: 424a86a, github.com/apache/spark/pull/4903

   [SPARK-6090][MLLIB] add a basic BinaryClassificationMetrics to PySpark/MLlib
   Xiangrui Meng <meng@databricks.com>
   2015-03-05 11:50:09 -0800
   Commit: 0bfacd5, github.com/apache/spark/pull/4863

   SPARK-6182 [BUILD] spark-parent pom needs to be published for both 2.10 and 2.11
   Sean Owen <sowen@cloudera.com>
   2015-03-05 11:31:48 -0800
   Commit: c9cfba0, github.com/apache/spark/pull/4912

   [SPARK-6153] [SQL] promote guava dep for hive-thriftserver
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-03-05 16:35:17 +0800
   Commit: e06c7df, github.com/apache/spark/pull/4884

   SPARK-5143 [BUILD] [WIP] spark-network-yarn 2.11 depends on spark-network-shuffle 2.10
   Sean Owen <sowen@cloudera.com>
   2015-03-04 21:00:51 -0800
   Commit: 7ac072f, github.com/apache/spark/pull/4876

   [SPARK-6149] [SQL] [Build] Excludes Guava 15 referenced by jackson-module-scala_2.10
   Cheng Lian <lian@databricks.com>
   2015-03-04 20:52:58 -0800
   Commit: 1aa90e3, github.com/apache/spark/pull/4890

   [SPARK-6144] [core] Fix addFile when source files are on "hdfs:"
   Marcelo Vanzin <vanzin@cloudera.com>, trystanleftwich <trystan@atscale.com>
   2015-03-04 12:58:39 -0800
   Commit: 3a35a0d, github.com/apache/spark/pull/4894

   [SPARK-6107][CORE] Display inprogress application information for event log history for standalone mode
   Zhang, Liye <liye.zhang@intel.com>
   2015-03-04 12:28:27 +0000
   Commit: f6773ed, github.com/apache/spark/pull/4848

   [SPARK-6134][SQL] Fix wrong datatype for casting FloatType and default LongType value in defaultPrimitive
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-03-04 20:23:43 +0800
   Commit: aef8a84, github.com/apache/spark/pull/4870

   [SPARK-6136] [SQL] Removed JDBC integration tests which depends on docker-client
   Cheng Lian <lian@databricks.com>
   2015-03-04 19:39:02 +0800
   Commit: 76b472f, github.com/apache/spark/pull/4872

   [SPARK-3355][Core]: Allow running maven tests in run-tests
   Brennon York <brennon.york@capitalone.com>
   2015-03-04 11:02:33 +0000
   Commit: 418f38d, github.com/apache/spark/pull/4734

   SPARK-6085 Increase default value for memory overhead
   tedyu <yuzhihong@gmail.com>
   2015-03-04 11:00:52 +0000
   Commit: 8d3e241, github.com/apache/spark/pull/4836

   [SPARK-6141][MLlib] Upgrade Breeze from 0.10 to 0.11 to fix convergence bug
   Xiangrui Meng <meng@databricks.com>, DB Tsai <dbtsai@alpinenow.com>, DB Tsai <dbtsai@dbtsai.com>
   2015-03-03 23:52:02 -0800
   Commit: 76e20a0, github.com/apache/spark/pull/4879

   [SPARK-6132][HOTFIX] ContextCleaner InterruptedException should be quiet
   Andrew Or <andrew@databricks.com>
   2015-03-03 20:49:45 -0800
   Commit: d334bfb, github.com/apache/spark/pull/4882

   [SPARK-5949] HighlyCompressedMapStatus needs more classes registered w/ kryo
   Imran Rashid <irashid@cloudera.com>
   2015-03-03 15:33:19 -0800
   Commit: 1f1fccc, github.com/apache/spark/pull/4877

   [SPARK-6133] Make sc.stop() idempotent
   Andrew Or <andrew@databricks.com>
   2015-03-03 15:09:57 -0800
   Commit: 6c20f35, github.com/apache/spark/pull/4871

   [SPARK-6132] ContextCleaner race condition across SparkContexts
   Andrew Or <andrew@databricks.com>
   2015-03-03 13:44:05 -0800
   Commit: fe63e82, github.com/apache/spark/pull/4869

   SPARK-1911 [DOCS] Warn users if their assembly jars are not built with Java 6
   Sean Owen <sowen@cloudera.com>
   2015-03-03 13:40:11 -0800
   Commit: e750a6b, github.com/apache/spark/pull/4874

   Revert "[SPARK-5423][Core] Cleanup resources in DiskMapIterator.finalize to ensure deleting the temp file"
   Andrew Or <andrew@databricks.com>
   2015-03-03 13:03:52 -0800
   Commit: 9af0017

   [SPARK-6138][CORE][minor] enhance the `toArray` method in `SizeTrackingVector`
   Wenchen Fan <cloud0fan@outlook.com>
   2015-03-03 12:12:23 +0000
   Commit: e359794, github.com/apache/spark/pull/4825

   [SPARK-6118] making package name of deploy.worker.CommandUtils and deploy.CommandUtilsSuite consistent
   CodingCat <zhunansjtu@gmail.com>
   2015-03-03 10:32:57 +0000
   Commit: 975643c, github.com/apache/spark/pull/4856

   BUILD: Minor tweaks to internal build scripts
   Patrick Wendell <patrick@databricks.com>
   2015-03-03 00:38:12 -0800
   Commit: 0c9a8ea

   HOTFIX: Bump HBase version in MapR profiles.
   Patrick Wendell <patrick@databricks.com>
   2015-03-03 01:38:07 -0800
   Commit: 165ff36

   [SPARK-5537][MLlib][Docs] Add user guide for multinomial logistic regression
   DB Tsai <dbtsai@alpinenow.com>
   2015-03-02 22:37:12 -0800
   Commit: b196056, github.com/apache/spark/pull/4866

   [SPARK-6120] [mllib] Warnings about memory in tree, ensemble model save
   Joseph K. Bradley <joseph@databricks.com>
   2015-03-02 22:33:51 -0800
   Commit: c2fe3a6, github.com/apache/spark/pull/4864

   [SPARK-6097][MLLIB] Support tree model save/load in PySpark/MLlib
   Xiangrui Meng <meng@databricks.com>
   2015-03-02 22:27:01 -0800
   Commit: 7e53a79, github.com/apache/spark/pull/4854

   [SPARK-5310][SQL] Fixes to Docs and Datasources API
   Reynold Xin <rxin@databricks.com>, Michael Armbrust <michael@databricks.com>
   2015-03-02 22:14:08 -0800
   Commit: 54d1968, github.com/apache/spark/pull/4868

   [SPARK-5950][SQL]Insert array into a metastore table saved as parquet should work when using datasource api
   Yin Huai <yhuai@databricks.com>
   2015-03-02 19:31:55 -0800
   Commit: 1259994, github.com/apache/spark/pull/4826

   [SPARK-6127][Streaming][Docs] Add Kafka to Python api docs
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-03-02 18:40:46 -0800
   Commit: 9eb22ec, github.com/apache/spark/pull/4860

   [SPARK-5537] Add user guide for multinomial logistic regression
   Xiangrui Meng <meng@databricks.com>, DB Tsai <dbtsai@alpinenow.com>
   2015-03-02 18:10:50 -0800
   Commit: 9d6c5ae, github.com/apache/spark/pull/4801

   [SPARK-6121][SQL][MLLIB] simpleString for UDT
   Xiangrui Meng <meng@databricks.com>
   2015-03-02 17:14:34 -0800
   Commit: 2db6a85, github.com/apache/spark/pull/4858

   [SPARK-4777][CORE] Some block memory after unrollSafely not count into used memory(memoryStore.entrys or unrollMemory)
   hushan[胡珊] <hushan@xiaomi.com>
   2015-03-02 16:53:54 -0800
   Commit: e3a88d1, github.com/apache/spark/pull/3629

   [SPARK-6048] SparkConf should not translate deprecated configs on set
   Andrew Or <andrew@databricks.com>
   2015-03-02 16:36:42 -0800
   Commit: 258d154, github.com/apache/spark/pull/4799

   [SPARK-6066] Make event log format easier to parse
   Andrew Or <andrew@databricks.com>
   2015-03-02 16:34:32 -0800
   Commit: 6776cb3, github.com/apache/spark/pull/4821

   [SPARK-6082] [SQL] Provides better error message for malformed rows when caching tables
   Cheng Lian <lian@databricks.com>
   2015-03-02 16:18:00 -0800
   Commit: 1a49496, github.com/apache/spark/pull/4842

   [SPARK-6114][SQL] Avoid metastore conversions before plan is resolved
   Michael Armbrust <michael@databricks.com>
   2015-03-02 16:10:54 -0800
   Commit: 8223ce6, github.com/apache/spark/pull/4855

   [SPARK-5522] Accelerate the History Server start
   guliangliang <guliangliang@qiyi.com>
   2015-03-02 15:33:23 -0800
   Commit: 26c1c56, github.com/apache/spark/pull/4525

   [SPARK-6050] [yarn] Relax matching of vcore count in received containers.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-03-02 16:41:43 -0600
   Commit: 6b348d9, github.com/apache/spark/pull/4818

   [SPARK-6040][SQL] Fix the percent bug in tablesample
   q00251598 <qiyadong@huawei.com>
   2015-03-02 13:16:29 -0800
   Commit: 582e5a2, github.com/apache/spark/pull/4789

   [Minor] Fix doc typo for describing primitiveTerm effectiveness condition
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-03-02 13:11:17 -0800
   Commit: 3f9def8, github.com/apache/spark/pull/4762

   SPARK-5390 [DOCS] Encourage users to post on Stack Overflow in Community Docs
   Sean Owen <sowen@cloudera.com>
   2015-03-02 21:10:08 +0000
   Commit: 0b472f6, github.com/apache/spark/pull/4843

   [DOCS] Refactored Dataframe join comment to use correct parameter ordering
   Paul Power <paul.power@peerside.com>
   2015-03-02 13:08:47 -0800
   Commit: d9a8bae, github.com/apache/spark/pull/4847

   [SPARK-6080] [PySpark] correct LogisticRegressionWithLBFGS regType parameter for pyspark
   Yanbo Liang <ybliang8@gmail.com>
   2015-03-02 10:17:24 -0800
   Commit: af2effd, github.com/apache/spark/pull/4831

   aggregateMessages example in graphX doc
   DEBORAH SIEGEL <deborahsiegel@DEBORAHs-MacBook-Pro.local>
   2015-03-02 10:15:32 -0800
   Commit: e7d8ae4, github.com/apache/spark/pull/4853

   [SPARK-5741][SQL] Support the path contains comma in HiveContext
   q00251598 <qiyadong@huawei.com>
   2015-03-02 10:13:11 -0800
   Commit: 9ce12aa, github.com/apache/spark/pull/4532

   [SPARK-6111] Fixed usage string in documentation.
   Kenneth Myers <myerske@us.ibm.com>
   2015-03-02 17:25:24 +0000
   Commit: 95ac68b, github.com/apache/spark/pull/4852

   [SPARK-6052][SQL]In JSON schema inference, we should always set containsNull of an ArrayType to true
   Yin Huai <yhuai@databricks.com>
   2015-03-02 23:18:07 +0800
   Commit: 3efd8bb, github.com/apache/spark/pull/4806

   [SPARK-6073][SQL] Need to refresh metastore cache after append data in CreateMetastoreDataSourceAsSelect
   Yin Huai <yhuai@databricks.com>
   2015-03-02 22:42:18 +0800
   Commit: 39a54b4, github.com/apache/spark/pull/4824

   [SPARK-6103][Graphx]remove unused class to import in EdgeRDDImpl
   Lianhui Wang <lianhuiwang09@gmail.com>
   2015-03-02 09:06:56 +0000
   Commit: 49c7a8f, github.com/apache/spark/pull/4846

   SPARK-3357 [CORE] Internal log messages should be set at DEBUG level instead of INFO
   Sean Owen <sowen@cloudera.com>
   2015-03-02 08:51:03 +0000
   Commit: 948c239, github.com/apache/spark/pull/4838

   [Streaming][Minor]Fix some error docs in streaming examples
   Saisai Shao <saisai.shao@intel.com>
   2015-03-02 08:49:19 +0000
   Commit: d8fb40e, github.com/apache/spark/pull/4837

   [SPARK-6083] [MLLib] [DOC] Make Python API example consistent in NaiveBayes
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-03-01 16:28:15 -0800
   Commit: 3f00bb3, github.com/apache/spark/pull/4834

   [SPARK-6053][MLLIB] support save/load in PySpark's ALS
   Xiangrui Meng <meng@databricks.com>
   2015-03-01 16:26:57 -0800
   Commit: aedbbaa, github.com/apache/spark/pull/4811

   [SPARK-6074] [sql] Package pyspark sql bindings.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-03-01 11:05:10 +0000
   Commit: fd8d283, github.com/apache/spark/pull/4822

   [SPARK-6075] Fix bug that caused lost accumulator updates: do not store WeakReferences in localAccums map
   Josh Rosen <joshrosen@databricks.com>
   2015-02-28 22:51:01 -0800
   Commit: 2df5f1f, github.com/apache/spark/pull/4835

   SPARK-5984: Fix TimSort bug causes ArrayOutOfBoundsException
   Evan Yu <ehotou@gmail.com>
   2015-02-28 18:55:34 -0800
   Commit: 643300a, github.com/apache/spark/pull/4804

   SPARK-1965 [WEBUI] Spark UI throws NPE on trying to load the app page for non-existent app
   Sean Owen <sowen@cloudera.com>
   2015-02-28 15:34:08 +0000
   Commit: 86fcdae, github.com/apache/spark/pull/4777

   SPARK-5983 [WEBUI] Don't respond to HTTP TRACE in HTTP-based UIs
   Sean Owen <sowen@cloudera.com>
   2015-02-28 15:23:59 +0000
   Commit: f91298e, github.com/apache/spark/pull/4765

   SPARK-6063 MLlib doesn't pass mvn scalastyle check due to UTF chars in LDAModel.scala
   Michael Griffiths <msjgriffiths@gmail.com>, Griffiths, Michael (NYC-RPM) <michael.griffiths@reprisemedia.com>
   2015-02-28 14:47:39 +0000
   Commit: b36b1bc, github.com/apache/spark/pull/4815

   [SPARK-5775] [SQL] BugFix: GenericRow cannot be cast to SpecificMutableRow when nested data and partitioned table
   Cheng Lian <lian@databricks.com>, Cheng Lian <liancheng@users.noreply.github.com>, Yin Huai <yhuai@databricks.com>
   2015-02-28 21:15:43 +0800
   Commit: e6003f0, github.com/apache/spark/pull/4792

   MAINTENANCE: Automated closing of pull requests.
   Patrick Wendell <patrick@databricks.com>
   2015-02-27 23:10:09 -0800
   Commit: 9168259, github.com/apache/spark/pull/1128

   [SPARK-5979][SPARK-6032] Smaller safer --packages fix
   Burak Yavuz <brkyvz@gmail.com>
   2015-02-27 22:59:35 -0800
   Commit: 6d8e5fb, github.com/apache/spark/pull/4802

   [SPARK-6070] [yarn] Remove unneeded classes from shuffle service jar.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-02-27 22:44:11 -0800
   Commit: dba08d1, github.com/apache/spark/pull/4820

   [SPARK-6055] [PySpark] fix incorrect __eq__ of DataType
   Davies Liu <davies@databricks.com>
   2015-02-27 20:07:17 -0800
   Commit: e0e64ba, github.com/apache/spark/pull/4808

   [SPARK-5751] [SQL] Sets SPARK_HOME as SPARK_PID_DIR when running Thrift server test suites
   Cheng Lian <lian@databricks.com>
   2015-02-28 08:41:49 +0800
   Commit: 8c468a6, github.com/apache/spark/pull/4758

   [Streaming][Minor] Remove useless type signature of Java Kafka direct stream API
   Saisai Shao <saisai.shao@intel.com>
   2015-02-27 13:01:42 -0800
   Commit: 5f7f3b9, github.com/apache/spark/pull/4817

   [SPARK-4587] [mllib] [docs] Fixed save,load calls in ML guide examples
   Joseph K. Bradley <joseph@databricks.com>
   2015-02-27 13:00:36 -0800
   Commit: d17cb2b, github.com/apache/spark/pull/4816

   [SPARK-6059][Yarn] Add volatile to ApplicationMaster's reporterThread and allocator
   zsxwing <zsxwing@gmail.com>
   2015-02-27 13:33:39 +0000
   Commit: 57566d0, github.com/apache/spark/pull/4814

   [SPARK-6058][Yarn] Log the user class exception in ApplicationMaster
   zsxwing <zsxwing@gmail.com>
   2015-02-27 13:31:46 +0000
   Commit: e747e98, github.com/apache/spark/pull/4813

   [SPARK-6036][CORE] avoid race condition between eventlogListener and akka actor system
   Zhang, Liye <liye.zhang@intel.com>
   2015-02-26 23:11:43 -0800
   Commit: 8cd1692, github.com/apache/spark/pull/4785

   fix spark-6033, clarify the spark.worker.cleanup behavior in standalone mode
   许鹏 <peng.xu@fraudmetrix.cn>
   2015-02-26 23:05:56 -0800
   Commit: 0375a41, github.com/apache/spark/pull/4803

   [SPARK-6046] Privatize SparkConf.translateConfKey
   Andrew Or <andrew@databricks.com>
   2015-02-26 22:39:46 -0800
   Commit: 7c99a01, github.com/apache/spark/pull/4797

   SPARK-2168 [Spark core] Use relative URIs for the app links in the History Server.
   Lukasz Jastrzebski <lukasz.jastrzebski@gmail.com>
   2015-02-26 22:38:06 -0800
   Commit: 4a8a0a8, github.com/apache/spark/pull/4778

   [SPARK-5495][UI] Add app and driver kill function in master web UI
   jerryshao <saisai.shao@intel.com>
   2015-02-26 22:36:48 -0800
   Commit: 67595eb, github.com/apache/spark/pull/4288

   [SPARK-5771][UI][hotfix] Change Requested Cores into * if default cores is not set
   jerryshao <saisai.shao@intel.com>
   2015-02-26 22:35:43 -0800
   Commit: 12135e9, github.com/apache/spark/pull/4800

   [SPARK-6024][SQL] When a data source table has too many columns, its schema cannot be stored in metastore.
   Yin Huai <yhuai@databricks.com>
   2015-02-26 20:46:05 -0800
   Commit: 5e5ad65, github.com/apache/spark/pull/4795

   [SPARK-6037][SQL] Avoiding duplicate Parquet schema merging
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-27 11:06:47 +0800
   Commit: 4ad5153, github.com/apache/spark/pull/4786

   [SPARK-5529][CORE]Add expireDeadHosts in HeartbeatReceiver
   Hong Shen <hongshen@tencent.com>
   2015-02-26 18:43:23 -0800
   Commit: 18f2098, github.com/apache/spark/pull/4363

   SPARK-4579 [WEBUI] Scheduling Delay appears negative
   Sean Owen <sowen@cloudera.com>
   2015-02-26 17:35:09 -0800
   Commit: fbc4694, github.com/apache/spark/pull/4796

   SPARK-6045 RecordWriter should be checked against null in PairRDDFunctio...
   tedyu <yuzhihong@gmail.com>
   2015-02-26 23:26:07 +0000
   Commit: e60ad2f, github.com/apache/spark/pull/4794

   [SPARK-5951][YARN] Remove unreachable driver memory properties in yarn client mode
   mohit.goyal <mohit.goyal@guavus.com>
   2015-02-26 14:27:47 -0800
   Commit: b38dec2, github.com/apache/spark/pull/4730

   Add a note for context termination for History server on Yarn
   moussa taifi <moutai10@gmail.com>
   2015-02-26 14:19:43 -0800
   Commit: c871e2d, github.com/apache/spark/pull/4721

   SPARK-4300 [CORE] Race condition during SparkWorker shutdown
   Sean Owen <sowen@cloudera.com>
   2015-02-26 14:08:56 -0800
   Commit: 3fb53c0, github.com/apache/spark/pull/4787

   [SPARK-6018] [YARN] NoSuchMethodError in Spark app is swallowed by YARN AM
   Cheolsoo Park <cheolsoop@netflix.com>
   2015-02-26 13:53:49 -0800
   Commit: 5f3238b, github.com/apache/spark/pull/4773

   [SPARK-6027][SPARK-5546] Fixed --jar and --packages not working for KafkaUtils and improved error message
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-02-26 13:46:07 -0800
   Commit: aa63f63, github.com/apache/spark/pull/4779

   [SPARK-3562]Periodic cleanup event logs
   xukun 00228947 <xukun.xu@huawei.com>
   2015-02-26 13:24:00 -0800
   Commit: 8942b52, github.com/apache/spark/pull/4214

   Modify default value description for spark.scheduler.minRegisteredResourcesRatio on docs.
   Li Zhihui <zhihui.li@intel.com>
   2015-02-26 13:07:07 -0800
   Commit: 10094a5, github.com/apache/spark/pull/4781

   SPARK-4704 [CORE] SparkSubmitDriverBootstrap doesn't flush output
   Sean Owen <sowen@cloudera.com>
   2015-02-26 12:56:54 -0800
   Commit: cd5c8d7, github.com/apache/spark/pull/4788

   [SPARK-5363] Fix bug in PythonRDD: remove() inside iterator is not safe
   Davies Liu <davies@databricks.com>
   2015-02-26 11:54:17 -0800
   Commit: 7fa960e, github.com/apache/spark/pull/4776

   [SPARK-6004][MLlib] Pick the best model when training GradientBoostedTrees with validation
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-26 10:51:47 -0800
   Commit: cfff397, github.com/apache/spark/pull/4763

   [SPARK-6007][SQL] Add numRows param in DataFrame.show()
   Jacky Li <jacky.likun@huawei.com>
   2015-02-26 10:40:58 -0800
   Commit: 2358657, github.com/apache/spark/pull/4767

   [SPARK-5801] [core] Avoid creating nested directories.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-02-26 17:35:03 +0000
   Commit: df3d559, github.com/apache/spark/pull/4747

   [SPARK-6016][SQL] Cannot read the parquet table after overwriting the existing table when spark.sql.parquet.cacheMetadata=true
   Yin Huai <yhuai@databricks.com>
   2015-02-27 01:01:32 +0800
   Commit: 192e42a, github.com/apache/spark/pull/4775

   [SPARK-6023][SQL] ParquetConversions fails to replace the destination MetastoreRelation of an InsertIntoTable node to ParquetRelation2
   Yin Huai <yhuai@databricks.com>
   2015-02-26 22:39:49 +0800
   Commit: f02394d, github.com/apache/spark/pull/4782

   [SPARK-5914] to run spark-submit requiring only user perm on windows
   Judy Nash <judynash@microsoft.com>
   2015-02-26 11:14:37 +0000
   Commit: 51a6f90, github.com/apache/spark/pull/4742

   [SPARK-5976][MLLIB] Add partitioner to factors returned by ALS
   Xiangrui Meng <meng@databricks.com>
   2015-02-25 23:43:29 -0800
   Commit: e43139f, github.com/apache/spark/pull/4748

   [SPARK-5974] [SPARK-5980] [mllib] [python] [docs] Update ML guide with save/load, Python GBT
   Joseph K. Bradley <joseph@databricks.com>
   2015-02-25 16:13:17 -0800
   Commit: d20559b, github.com/apache/spark/pull/4750

   [SPARK-1182][Docs] Sort the configuration parameters in configuration.md
   Brennon York <brennon.york@capitalone.com>
   2015-02-25 16:12:56 -0800
   Commit: 46a044a, github.com/apache/spark/pull/3863

   [SPARK-5926] [SQL] make DataFrame.explain leverage queryExecution.logical
   Yanbo Liang <ybliang8@gmail.com>
   2015-02-25 15:37:13 -0800
   Commit: 41e2e5a, github.com/apache/spark/pull/4707

   [SPARK-5999][SQL] Remove duplicate Literal matching block
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-25 15:22:33 -0800
   Commit: 12dbf98, github.com/apache/spark/pull/4760

   [SPARK-6010] [SQL] Merging compatible Parquet schemas before computing splits
   Cheng Lian <lian@databricks.com>
   2015-02-25 15:15:22 -0800
   Commit: e0fdd46, github.com/apache/spark/pull/4768

   [SPARK-5944] [PySpark] fix version in Python API docs
   Davies Liu <davies@databricks.com>
   2015-02-25 15:13:34 -0800
   Commit: f3f4c87, github.com/apache/spark/pull/4731

   [SPARK-5982] Remove incorrect Local Read Time Metric
   Kay Ousterhout <kayousterhout@gmail.com>
   2015-02-25 14:55:24 -0800
   Commit: 838a480, github.com/apache/spark/pull/4749

   [SPARK-1955][GraphX]: VertexRDD can incorrectly assume index sharing
   Brennon York <brennon.york@capitalone.com>
   2015-02-25 14:11:12 -0800
   Commit: 9f603fc, github.com/apache/spark/pull/4705

   [SPARK-5970][core] Register directory created in getOrCreateLocalRootDirs for automatic deletion.
   Milan Straka <fox@ucw.cz>
   2015-02-25 21:33:34 +0000
   Commit: a777c65, github.com/apache/spark/pull/4759

   SPARK-5930 [DOCS] Documented default of spark.shuffle.io.retryWait is confusing
   Sean Owen <sowen@cloudera.com>
   2015-02-25 12:20:44 -0800
   Commit: 7d8e6a2, github.com/apache/spark/pull/4769

   [SPARK-5996][SQL] Fix specialized outbound conversions
   Michael Armbrust <michael@databricks.com>
   2015-02-25 10:13:40 -0800
   Commit: f84c799, github.com/apache/spark/pull/4757

   [SPARK-5771] Number of Cores in Completed Applications of Standalone Master Web Page always be 0 if sc.stop() is called
   guliangliang <guliangliang@qiyi.com>
   2015-02-25 14:48:02 +0000
   Commit: dd077ab, github.com/apache/spark/pull/4567

   [GraphX] fixing 3 typos in the graphx programming guide
   Benedikt Linse <benedikt.linse@gmail.com>
   2015-02-25 14:46:17 +0000
   Commit: 5b8480e, github.com/apache/spark/pull/4766

   [SPARK-5666][streaming][MQTT streaming] some trivial fixes
   prabs <prabsmails@gmail.com>, Prabeesh K <prabsmails@gmail.com>
   2015-02-25 14:37:35 +0000
   Commit: d51ed26, github.com/apache/spark/pull/4178

   [SPARK-5994] [SQL] Python DataFrame documentation fixes
   Davies Liu <davies@databricks.com>
   2015-02-24 20:51:55 -0800
   Commit: d641fbb, github.com/apache/spark/pull/4756

   [SPARK-5286][SQL] SPARK-5286 followup
   Yin Huai <yhuai@databricks.com>
   2015-02-24 19:51:36 -0800
   Commit: 769e092, github.com/apache/spark/pull/4755

   [SPARK-5993][Streaming][Build] Fix assembly jar location of kafka-assembly
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-02-24 19:10:37 -0800
   Commit: 922b43b, github.com/apache/spark/pull/4753

   [SPARK-5985][SQL] DataFrame sortBy -> orderBy in Python.
   Reynold Xin <rxin@databricks.com>
   2015-02-24 18:59:23 -0800
   Commit: fba11c2, github.com/apache/spark/pull/4752

   [SPARK-5904][SQL] DataFrame Java API test suites.
   Reynold Xin <rxin@databricks.com>
   2015-02-24 18:51:41 -0800
   Commit: 53a1ebf, github.com/apache/spark/pull/4751

   [SPARK-5751] [SQL] [WIP] Revamped HiveThriftServer2Suite for robustness
   Cheng Lian <lian@databricks.com>
   2015-02-25 08:34:55 +0800
   Commit: f816e73, github.com/apache/spark/pull/4720

   [SPARK-5436] [MLlib] Validate GradientBoostedTrees using runWithValidation
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-02-24 15:13:22 -0800
   Commit: 2a0fe34, github.com/apache/spark/pull/4677

   [SPARK-5973] [PySpark] fix zip with two RDDs with AutoBatchedSerializer
   Davies Liu <davies@databricks.com>
   2015-02-24 14:50:00 -0800
   Commit: da505e5, github.com/apache/spark/pull/4745

   [SPARK-5952][SQL] Lock when using hive metastore client
   Michael Armbrust <michael@databricks.com>
   2015-02-24 13:39:29 -0800
   Commit: a2b9137, github.com/apache/spark/pull/4746

   [Spark-5708] Add Slf4jSink to Spark Metrics
   Judy <judynash@microsoft.com>, judynash <judynash@microsoft.com>
   2015-02-24 20:50:16 +0000
   Commit: c5ba975, github.com/apache/spark/pull/4644

   [MLLIB] Change x_i to y_i in Variance's user guide
   Xiangrui Meng <meng@databricks.com>
   2015-02-24 11:38:59 -0800
   Commit: 105791e, github.com/apache/spark/pull/4740

   [SPARK-5965] Standalone Worker UI displays {{USER_JAR}}
   Andrew Or <andrew@databricks.com>
   2015-02-24 11:08:07 -0800
   Commit: 6d2caa5, github.com/apache/spark/pull/4739

   [Spark-5967] [UI] Correctly clean JobProgressListener.stageIdToActiveJobIds
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-02-24 11:02:47 -0800
   Commit: 64d2c01, github.com/apache/spark/pull/4741

   [SPARK-5532][SQL] Repartition should not use external rdd representation
   Michael Armbrust <michael@databricks.com>
   2015-02-24 10:52:18 -0800
   Commit: 2012366, github.com/apache/spark/pull/4738

   [SPARK-5910][SQL] Support for as in selectExpr
   Michael Armbrust <michael@databricks.com>
   2015-02-24 10:49:51 -0800
   Commit: 0a59e45, github.com/apache/spark/pull/4736

   [SPARK-5968] [SQL] Suppresses ParquetOutputCommitter WARN logs
   Cheng Lian <lian@databricks.com>
   2015-02-24 10:45:38 -0800
   Commit: 8403331, github.com/apache/spark/pull/4744

   [SPARK-5958][MLLIB][DOC] update block matrix user guide
   Xiangrui Meng <meng@databricks.com>
   2015-02-23 22:08:44 -0800
   Commit: cf2e416, github.com/apache/spark/pull/4737

   [SPARK-5873][SQL] Allow viewing of partially analyzed plans in queryExecution
   Michael Armbrust <michael@databricks.com>
   2015-02-23 17:34:54 -0800
   Commit: 1ed5708, github.com/apache/spark/pull/4684

   [SPARK-5935][SQL] Accept MapType in the schema provided to a JSON dataset.
   Yin Huai <yhuai@databricks.com>, Yin Huai <huai@cse.ohio-state.edu>
   2015-02-23 17:16:34 -0800
   Commit: 48376bf, github.com/apache/spark/pull/4710

   [SPARK-5912] [docs] [mllib] Small fixes to ChiSqSelector docs
   Joseph K. Bradley <joseph@databricks.com>
   2015-02-23 16:15:57 -0800
   Commit: 59536cc, github.com/apache/spark/pull/4732

   [MLLIB] SPARK-5912 Programming guide for feature selection
   Alexander Ulanov <nashb@yandex.ru>
   2015-02-23 12:09:40 -0800
   Commit: 28ccf5e, github.com/apache/spark/pull/4709

   [SPARK-5939][MLLib] make FPGrowth example app take parameters
   Jacky Li <jacky.likun@huawei.com>
   2015-02-23 08:47:28 -0800
   Commit: 651a1c0, github.com/apache/spark/pull/4714

   [SPARK-5724] fix the misconfiguration in AkkaUtils
   CodingCat <zhunansjtu@gmail.com>
   2015-02-23 11:29:25 +0000
   Commit: 242d495, github.com/apache/spark/pull/4512

   [SPARK-5943][Streaming] Update the test to use new API to reduce the warning
   Saisai Shao <saisai.shao@intel.com>
   2015-02-23 11:27:27 +0000
   Commit: 757b14b, github.com/apache/spark/pull/4722

   [EXAMPLES] fix typo.
   Makoto Fukuhara <fukuo33@gmail.com>
   2015-02-23 09:24:33 +0000
   Commit: 9348767, github.com/apache/spark/pull/4724

   [SPARK-3885] Provide mechanism to remove accumulators once they are no longer used
   Ilya Ganelin <ilya.ganelin@capitalone.com>
   2015-02-22 22:43:04 -0800
   Commit: 95cd643, github.com/apache/spark/pull/4021

   [SPARK-911] allow efficient queries for a range if RDD is partitioned wi...
   Aaron Josephs <ajoseph4@binghamton.edu>
   2015-02-22 22:09:06 -0800
   Commit: e4f9d03, github.com/apache/spark/pull/1381

   [DataFrame] [Typo] Fix the typo
   Cheng Hao <hao.cheng@intel.com>
   2015-02-22 08:56:30 +0000
   Commit: 275b1be, github.com/apache/spark/pull/4717

   [DOCS] Fix typo in API for custom InputFormats based on the “new” MapReduce API
   Alexander <abezzubov@nflabs.com>
   2015-02-22 08:53:05 +0000
   Commit: a7f9039, github.com/apache/spark/pull/4718

   MAINTENANCE: Automated closing of pull requests.
   Patrick Wendell <patrick@databricks.com>
   2015-02-21 23:07:30 -0800
   Commit: 46462ff, github.com/apache/spark/pull/3490

   [SPARK-5860][CORE] JdbcRDD: overflow on large range with high number of partitions
   Evan Yu <ehotou@gmail.com>
   2015-02-21 20:40:21 +0000
   Commit: 7683982, github.com/apache/spark/pull/4701

   [SPARK-5937][YARN] Fix ClientSuite to set YARN mode, so that the correct class is used in t...
   Hari Shreedharan <hshreedharan@apache.org>
   2015-02-21 10:01:01 -0800
   Commit: 7138816, github.com/apache/spark/pull/4711

   SPARK-5841 [CORE] [HOTFIX 2] Memory leak in DiskBlockManager
   Nishkam Ravi <nravi@cloudera.com>, nishkamravi2 <nishkamravi@gmail.com>, nravi <nravi@c1704.halxg.cloudera.com>
   2015-02-21 09:59:28 -0800
   Commit: d3cbd38, github.com/apache/spark/pull/4690

   [MLlib] fix typo
   Jacky Li <jackylk@users.noreply.github.com>
   2015-02-21 13:00:16 +0000
   Commit: e155324, github.com/apache/spark/pull/4713

   [SPARK-5898] [SPARK-5896] [SQL] [PySpark] create DataFrame from pandas and tuple/list
   Davies Liu <davies@databricks.com>
   2015-02-20 15:35:05 -0800
   Commit: 5b0a42c, github.com/apache/spark/pull/4679

   [SPARK-5867] [SPARK-5892] [doc] [ml] [mllib] Doc cleanups for 1.3 release
   Joseph K. Bradley <joseph@databricks.com>
   2015-02-20 02:31:32 -0800
   Commit: 4a17eed, github.com/apache/spark/pull/4675

   SPARK-5744 [CORE] Take 2. RDD.isEmpty / take fails for (empty) RDD of Nothing
   Sean Owen <sowen@cloudera.com>
   2015-02-20 10:21:39 +0000
   Commit: d3dfebe, github.com/apache/spark/pull/4698

   [SPARK-5909][SQL] Add a clearCache command to Spark SQL's cache manager
   Yin Huai <yhuai@databricks.com>
   2015-02-20 16:20:02 +0800
   Commit: 70bfb5c, github.com/apache/spark/pull/4694

   [SPARK-4808] Removing minimum number of elements read before spill check
   mcheah <mcheah@palantir.com>
   2015-02-19 18:09:22 -0800
   Commit: 3be92cd, github.com/apache/spark/pull/4420

   [SPARK-5900][MLLIB] make PIC and FPGrowth Java-friendly
   Xiangrui Meng <meng@databricks.com>
   2015-02-19 18:06:16 -0800
   Commit: 0cfd2ce, github.com/apache/spark/pull/4695

   SPARK-5570: No docs stating that `new SparkConf().set("spark.driver.memory", ...)` will not work
   Ilya Ganelin <ilya.ganelin@capitalone.com>
   2015-02-19 15:50:58 -0800
   Commit: 6bddc40, github.com/apache/spark/pull/4665

   SPARK-4682 [CORE] Consolidate various 'Clock' classes
   Sean Owen <sowen@cloudera.com>
   2015-02-19 15:35:23 -0800
   Commit: 34b7c35, github.com/apache/spark/pull/4514

   [Spark-5889] Remove pid file after stopping service.
   Zhan Zhang <zhazhan@gmail.com>
   2015-02-19 23:13:02 +0000
   Commit: ad6b169, github.com/apache/spark/pull/4676

   [SPARK-5902] [ml] Made PipelineStage.transformSchema public instead of private to ml
   Joseph K. Bradley <joseph@databricks.com>
   2015-02-19 12:46:27 -0800
   Commit: a5fed34, github.com/apache/spark/pull/4682

   [SPARK-5904][SQL] DataFrame API fixes.
   Reynold Xin <rxin@databricks.com>
   2015-02-19 12:09:44 -0800
   Commit: 8ca3418, github.com/apache/spark/pull/4686

   [SPARK-5825] [Spark Submit] Remove the double checking instance name when stopping the service
   Cheng Hao <hao.cheng@intel.com>
   2015-02-19 12:07:51 -0800
   Commit: 94cdb05, github.com/apache/spark/pull/4611

   [SPARK-5423][Core] Cleanup resources in DiskMapIterator.finalize to ensure deleting the temp file
   zsxwing <zsxwing@gmail.com>
   2015-02-19 18:37:31 +0000
   Commit: 90095bf, github.com/apache/spark/pull/4219

   [SPARK-5816] Add huge compatibility warning in DriverWrapper
   Andrew Or <andrew@databricks.com>
   2015-02-19 09:56:25 -0800
   Commit: 38e624a, github.com/apache/spark/pull/4687

   SPARK-5548: Fix for AkkaUtilsSuite failure - attempt 2
   Jacek Lewandowski <lewandowski.jacek@gmail.com>
   2015-02-19 09:53:36 -0800
   Commit: fb87f44, github.com/apache/spark/pull/4653

   [SPARK-5846] Correctly set job description and pool for SQL jobs
   Kay Ousterhout <kayousterhout@gmail.com>
   2015-02-19 09:49:34 +0800
   Commit: e945aa6, github.com/apache/spark/pull/4630

   [SPARK-5879][MLLIB] update PIC user guide and add a Java example
   Xiangrui Meng <meng@databricks.com>
   2015-02-18 16:29:32 -0800
   Commit: d12d2ad, github.com/apache/spark/pull/4680

   [SPARK-5722] [SQL] [PySpark] infer int as LongType
   Davies Liu <davies@databricks.com>
   2015-02-18 14:17:04 -0800
   Commit: aa8f10e, github.com/apache/spark/pull/4666

   [SPARK-5840][SQL] HiveContext cannot be serialized due to tuple extraction
   Reynold Xin <rxin@databricks.com>
   2015-02-18 14:02:32 -0800
   Commit: f0e3b71, github.com/apache/spark/pull/4628

   [SPARK-5507] Added documentation for BlockMatrix
   Burak Yavuz <brkyvz@gmail.com>
   2015-02-18 10:11:08 -0800
   Commit: a8eb92d, github.com/apache/spark/pull/4664

   [SPARK-5519][MLLIB] add user guide with example code for fp-growth
   Xiangrui Meng <meng@databricks.com>
   2015-02-18 10:09:56 -0800
   Commit: 85e9d09, github.com/apache/spark/pull/4661

   SPARK-5669 [BUILD] [HOTFIX] Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS
   Sean Owen <sowen@cloudera.com>
   2015-02-18 14:41:44 +0000
   Commit: 5aecdcf, github.com/apache/spark/pull/4673

   [SPARK-4949]shutdownCallback in SparkDeploySchedulerBackend should be enclosed by synchronized block.
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-02-18 12:20:11 +0000
   Commit: 82197ed, github.com/apache/spark/pull/3781

   SPARK-4610 addendum: [Minor] [MLlib] Minor doc fix in GBT classification example
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-02-18 10:13:28 +0000
   Commit: e79a7a6, github.com/apache/spark/pull/4672

   [SPARK-5878] fix DataFrame.repartition() in Python
   Davies Liu <davies@databricks.com>
   2015-02-18 01:00:54 -0800
   Commit: c1b6fa9, github.com/apache/spark/pull/4667

   Avoid deprecation warnings in JDBCSuite.
   Tor Myklebust <tmyklebu@gmail.com>
   2015-02-18 01:00:13 -0800
   Commit: de0dd6d, github.com/apache/spark/pull/4668

   [Minor] [SQL] Cleans up DataFrame variable names and toDF() calls
   Cheng Lian <lian@databricks.com>
   2015-02-17 23:36:20 -0800
   Commit: 61ab085, github.com/apache/spark/pull/4670

   [SPARK-5731][Streaming][Test] Fix incorrect test in DirectKafkaStreamSuite
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-02-17 22:44:16 -0800
   Commit: 3912d33, github.com/apache/spark/pull/4597

   [SPARK-5723][SQL]Change the default file format to Parquet for CTAS statements.
   Yin Huai <yhuai@databricks.com>
   2015-02-17 18:14:33 -0800
   Commit: e50934f, github.com/apache/spark/pull/4639

   [SPARK-5875][SQL]logical.Project should not be resolved if it contains aggregates or generators
   Yin Huai <yhuai@databricks.com>
   2015-02-17 17:50:39 -0800
   Commit: d5f12bf, github.com/apache/spark/pull/4663

   [SPARK-4454] Revert getOrElse() cleanup in DAGScheduler.getCacheLocs()
   Josh Rosen <joshrosen@databricks.com>
   2015-02-17 17:45:16 -0800
   Commit: a51fc7e

   [SPARK-4454] Properly synchronize accesses to DAGScheduler cacheLocs map
   Josh Rosen <joshrosen@databricks.com>
   2015-02-17 17:39:58 -0800
   Commit: d46d624, github.com/apache/spark/pull/4660

   [SPARK-5811] Added documentation for maven coordinates and added Spark Packages support
   Burak Yavuz <brkyvz@gmail.com>, Davies Liu <davies@databricks.com>
   2015-02-17 17:15:43 -0800
   Commit: ae6cfb3, github.com/apache/spark/pull/4662

   [SPARK-5785] [PySpark] narrow dependency for cogroup/join in PySpark
   Davies Liu <davies@databricks.com>
   2015-02-17 16:54:57 -0800
   Commit: c3d2b90, github.com/apache/spark/pull/4629

   [SPARK-5852][SQL]Fail to convert a newly created empty metastore parquet table to a data source parquet table.
   Yin Huai <yhuai@databricks.com>, Cheng Hao <hao.cheng@intel.com>
   2015-02-17 15:47:59 -0800
   Commit: 117121a, github.com/apache/spark/pull/4655

   [SPARK-5872] [SQL] create a sqlCtx in pyspark shell
   Davies Liu <davies@databricks.com>
   2015-02-17 15:44:37 -0800
   Commit: 4d4cc76, github.com/apache/spark/pull/4659

   [SPARK-5871] output explain in Python
   Davies Liu <davies@databricks.com>
   2015-02-17 13:48:38 -0800
   Commit: 3df85dc, github.com/apache/spark/pull/4658

   [SPARK-4172] [PySpark] Progress API in Python
   Davies Liu <davies@databricks.com>
   2015-02-17 13:36:43 -0800
   Commit: 445a755, github.com/apache/spark/pull/3027

   [SPARK-5868][SQL] Fix python UDFs in HiveContext and checks in SQLContext
   Michael Armbrust <michael@databricks.com>
   2015-02-17 13:23:45 -0800
   Commit: de4836f, github.com/apache/spark/pull/4657

   [SQL] [Minor] Update the HiveContext Unittest
   Cheng Hao <hao.cheng@intel.com>
   2015-02-17 12:25:35 -0800
   Commit: 9d281fa, github.com/apache/spark/pull/4584

   [Minor][SQL] Use same function to check path parameter in JSONRelation
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-17 12:24:13 -0800
   Commit: ac506b7, github.com/apache/spark/pull/4649

   [SPARK-5862][SQL] Only transformUp the given plan once in HiveMetastoreCatalog
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-17 12:23:18 -0800
   Commit: 4611de1, github.com/apache/spark/pull/4651

   [Minor] fix typo in SQL document
   CodingCat <zhunansjtu@gmail.com>
   2015-02-17 12:16:52 -0800
   Commit: 31efb39, github.com/apache/spark/pull/4656

   [SPARK-5864] [PySpark] support .jar as python package
   Davies Liu <davies@databricks.com>
   2015-02-17 12:05:06 -0800
   Commit: fc4eb95, github.com/apache/spark/pull/4652

   SPARK-5841 [CORE] [HOTFIX] Memory leak in DiskBlockManager
   Sean Owen <sowen@cloudera.com>
   2015-02-17 19:40:06 +0000
   Commit: 49c19fd, github.com/apache/spark/pull/4648

   MAINTENANCE: Automated closing of pull requests.
   Patrick Wendell <patrick@databricks.com>
   2015-02-17 11:35:26 -0800
   Commit: 24f358b, github.com/apache/spark/pull/3297

   [SPARK-3381] [MLlib] Eliminate bins for unordered features in DecisionTrees
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-02-17 11:19:23 -0800
   Commit: 9b746f3, github.com/apache/spark/pull/4231

   [SPARK-5661]function hasShutdownDeleteTachyonDir should use shutdownDeleteTachyonPaths to determine whether contains file
   xukun 00228947 <xukun.xu@huawei.com>, viper-kun <xukun.xu@huawei.com>
   2015-02-17 18:59:41 +0000
   Commit: b271c26, github.com/apache/spark/pull/4418

   [SPARK-5778] throw if nonexistent metrics config file provided
   Ryan Williams <ryan.blake.williams@gmail.com>
   2015-02-17 10:57:16 -0800
   Commit: d8f69cf, github.com/apache/spark/pull/4571

   [SPARK-5859] [PySpark] [SQL] fix DataFrame Python API
   Davies Liu <davies@databricks.com>
   2015-02-17 10:22:48 -0800
   Commit: d8adefe, github.com/apache/spark/pull/4645

   [SPARK-5166][SPARK-5247][SPARK-5258][SQL] API Cleanup / Documentation
   Michael Armbrust <michael@databricks.com>
   2015-02-17 10:21:17 -0800
   Commit: c74b07f, github.com/apache/spark/pull/4642

   [SPARK-5858][MLLIB] Remove unnecessary first() call in GLM
   Xiangrui Meng <meng@databricks.com>
   2015-02-17 10:17:45 -0800
   Commit: c76da36, github.com/apache/spark/pull/4647

   SPARK-5856: In Maven build script, launch Zinc with more memory
   Patrick Wendell <patrick@databricks.com>
   2015-02-17 10:10:01 -0800
   Commit: 3ce46e9, github.com/apache/spark/pull/4643

   Revert "[SPARK-5363] [PySpark] check ending mark in non-block way"
   Josh Rosen <joshrosen@databricks.com>
   2015-02-17 07:48:27 -0800
   Commit: ee6e3ef

   [SPARK-5826][Streaming] Fix Configuration not serializable problem
   jerryshao <saisai.shao@intel.com>
   2015-02-17 10:45:18 +0000
   Commit: a65766b, github.com/apache/spark/pull/4612

   HOTFIX: Style issue causing build break
   Patrick Wendell <patrick@databricks.com>
   2015-02-16 22:10:39 -0800
   Commit: c06e42f

   [SPARK-5802][MLLIB] cache transformed data in glm
   Xiangrui Meng <meng@databricks.com>
   2015-02-16 22:09:04 -0800
   Commit: fd84229, github.com/apache/spark/pull/4593

   [SPARK-5853][SQL] Schema support in Row.
   Reynold Xin <rxin@databricks.com>
   2015-02-16 20:42:57 -0800
   Commit: d380f32, github.com/apache/spark/pull/4640

   SPARK-5850: Remove experimental label for Scala 2.11 and FlumePollingStream
   Patrick Wendell <patrick@databricks.com>
   2015-02-16 20:33:33 -0800
   Commit: a51d51f, github.com/apache/spark/pull/4638

   [SPARK-5363] [PySpark] check ending mark in non-block way
   Davies Liu <davies@databricks.com>
   2015-02-16 20:32:03 -0800
   Commit: ac6fe67, github.com/apache/spark/pull/4601

   [SQL] Various DataFrame doc changes.
   Reynold Xin <rxin@databricks.com>
   2015-02-16 19:00:30 -0800
   Commit: 0e180bf, github.com/apache/spark/pull/4636

   [SPARK-5849] Handle more types of invalid JSON requests in SubmitRestProtocolMessage.parseAction
   Josh Rosen <joshrosen@databricks.com>
   2015-02-16 18:08:02 -0800
   Commit: 58a82a7, github.com/apache/spark/pull/4637

   [SPARK-3340] Deprecate ADD_JARS and ADD_FILES
   azagrebin <azagrebin@gmail.com>
   2015-02-16 18:06:19 -0800
   Commit: 1668765, github.com/apache/spark/pull/4616

   [SPARK-5788] [PySpark] capture the exception in python write thread
   Davies Liu <davies@databricks.com>
   2015-02-16 17:57:14 -0800
   Commit: b1bd1dd, github.com/apache/spark/pull/4577

   SPARK-5848: tear down the ConsoleProgressBar timer
   Matt Whelan <mwhelan@perka.com>
   2015-02-17 00:59:49 +0000
   Commit: 1294a6e, github.com/apache/spark/pull/4635

   [SPARK-4865][SQL]Include temporary tables in SHOW TABLES
   Yin Huai <yhuai@databricks.com>
   2015-02-16 15:59:23 -0800
   Commit: e189cbb, github.com/apache/spark/pull/4618

   [SQL] Optimize arithmetic and predicate operators
   kai <kaizeng@eecs.berkeley.edu>
   2015-02-16 15:58:05 -0800
   Commit: cb6c48c, github.com/apache/spark/pull/4472

   [SPARK-5839][SQL]HiveMetastoreCatalog does not recognize table names and aliases of data source tables.
   Yin Huai <yhuai@databricks.com>
   2015-02-16 15:54:01 -0800
   Commit: f3ff1eb, github.com/apache/spark/pull/4626

   [SPARK-5746][SQL] Check invalid cases for the write path of data source API
   Yin Huai <yhuai@databricks.com>
   2015-02-16 15:51:59 -0800
   Commit: 5b6cd65, github.com/apache/spark/pull/4617

   HOTFIX: Break in Jekyll build from #4589
   Patrick Wendell <patrick@databricks.com>
   2015-02-16 15:43:56 -0800
   Commit: 04b401d

   [SPARK-2313] Use socket to communicate GatewayServer port back to Python driver
   Josh Rosen <joshrosen@databricks.com>
   2015-02-16 15:25:11 -0800
   Commit: 0cfda84, github.com/apache/spark/pull/3424

   SPARK-5357: Update commons-codec version to 1.10 (current)
   Matt Whelan <mwhelan@perka.com>
   2015-02-16 23:05:34 +0000
   Commit: c01c4eb, github.com/apache/spark/pull/4153

   SPARK-5841: remove DiskBlockManager shutdown hook on stop
   Matt Whelan <mwhelan@perka.com>
   2015-02-16 22:54:32 +0000
   Commit: bb05982, github.com/apache/spark/pull/4627

   [SPARK-5833] [SQL] Adds REFRESH TABLE command
   Cheng Lian <lian@databricks.com>
   2015-02-16 12:52:05 -0800
   Commit: c51ab37, github.com/apache/spark/pull/4624

   [SPARK-5296] [SQL] Add more filter types for data sources API
   Cheng Lian <lian@databricks.com>
   2015-02-16 12:48:55 -0800
   Commit: 6f54dee, github.com/apache/spark/pull/4623

   [SQL] Add fetched row count in SparkSQLCLIDriver
   OopsOutOfMemory <victorshengli@126.com>
   2015-02-16 12:34:09 -0800
   Commit: b4d7c70, github.com/apache/spark/pull/4604

   [SQL] Initial support for reporting location of error in sql string
   Michael Armbrust <michael@databricks.com>
   2015-02-16 12:32:56 -0800
   Commit: 104b2c4, github.com/apache/spark/pull/4587

   [SPARK-5824] [SQL] add null format in ctas and set default col comment to null
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-02-16 12:31:36 -0800
   Commit: 275a0c0, github.com/apache/spark/pull/4609

   [SQL] [Minor] Update the SpecificMutableRow.copy
   Cheng Hao <hao.cheng@intel.com>
   2015-02-16 12:21:08 -0800
   Commit: cc552e0, github.com/apache/spark/pull/4619

   SPARK-5795 [STREAMING] api.java.JavaPairDStream.saveAsNewAPIHadoopFiles may not friendly to java
   Sean Owen <sowen@cloudera.com>
   2015-02-16 19:32:31 +0000
   Commit: 8e25373, github.com/apache/spark/pull/4608

   Minor fixes for commit https://github.com/apache/spark/pull/4592.
   Reynold Xin <rxin@databricks.com>
   2015-02-16 10:09:55 -0800
   Commit: 9baac56

   [SPARK-5799][SQL] Compute aggregation function on specified numeric columns
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-16 10:06:11 -0800
   Commit: 5c78be7, github.com/apache/spark/pull/4592

   SPARK-5815 [MLLIB] Part 2. Deprecate SVDPlusPlus APIs that expose DoubleMatrix from JBLAS
   Sean Owen <sowen@cloudera.com>
   2015-02-16 17:04:30 +0000
   Commit: a3afa4a, github.com/apache/spark/pull/4625

   [SPARK-5831][Streaming]When checkpoint file size is bigger than 10, then delete the old ones
   Xutingjun <1039320815@qq.com>
   2015-02-16 14:54:23 +0000
   Commit: 1115e8e, github.com/apache/spark/pull/4621

   [SPARK-4553] [SPARK-5767] [SQL] Wires Parquet data source with the newly introduced write support for data source API
   Cheng Lian <lian@databricks.com>
   2015-02-16 01:38:31 -0800
   Commit: 3ce58cf, github.com/apache/spark/pull/4563

   [Minor] [SQL] Renames stringRddToDataFrame to stringRddToDataFrameHolder for consistency
   Cheng Lian <lian@databricks.com>
   2015-02-16 01:33:37 -0800
   Commit: 199a9e8, github.com/apache/spark/pull/4613

   [Ml] SPARK-5804 Explicitly manage cache in Crossvalidator k-fold loop
   Peter Rudenko <petro.rudenko@gmail.com>
   2015-02-16 00:07:23 -0800
   Commit: d51d6ba, github.com/apache/spark/pull/4595

   [Ml] SPARK-5796 Don't transform data on a last estimator in Pipeline
   Peter Rudenko <petro.rudenko@gmail.com>
   2015-02-15 20:51:32 -0800
   Commit: c78a12c, github.com/apache/spark/pull/4590

   SPARK-5815 [MLLIB] Deprecate SVDPlusPlus APIs that expose DoubleMatrix from JBLAS
   Sean Owen <sowen@cloudera.com>
   2015-02-15 20:41:27 -0800
   Commit: acf2558, github.com/apache/spark/pull/4614

   [SPARK-5769] Set params in constructors and in setParams in Python ML pipelines
   Xiangrui Meng <meng@databricks.com>
   2015-02-15 20:29:26 -0800
   Commit: cd4a153, github.com/apache/spark/pull/4564

   SPARK-5669 [BUILD] Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS
   Sean Owen <sowen@cloudera.com>
   2015-02-15 09:15:48 -0800
   Commit: 836577b, github.com/apache/spark/pull/4453

   [MLLIB][SPARK-5502] User guide for isotonic regression
   martinzapletal <zapletal-martin@email.cz>
   2015-02-15 09:10:03 -0800
   Commit: 61eb126, github.com/apache/spark/pull/4536

   [SPARK-5827][SQL] Add missing import in the example of SqlContext
   Takeshi Yamamuro <linguin.m.s@gmail.com>
   2015-02-15 14:42:20 +0000
   Commit: c771e47, github.com/apache/spark/pull/4615

   SPARK-5822 [BUILD] cannot import src/main/scala & src/test/scala into eclipse as source folder
   gli <gli@redhat.com>
   2015-02-14 20:43:27 +0000
   Commit: ed5f4bb, github.com/apache/spark/pull/4531

   Revise formatting of previous commit f80e2629bb74bc62960c61ff313f7e7802d61319
   Sean Owen <sowen@cloudera.com>
   2015-02-14 20:12:29 +0000
   Commit: 15a2ab5

   [SPARK-5800] Streaming Docs. Change linked files according the selected language
   gasparms <gmunoz@stratio.com>
   2015-02-14 20:10:29 +0000
   Commit: f80e262, github.com/apache/spark/pull/4589

   [SPARK-5752][SQL] Don't implicitly convert RDDs directly to DataFrames
   Reynold Xin <rxin@databricks.com>, Davies Liu <davies@databricks.com>
   2015-02-13 23:03:22 -0800
   Commit: e98dfe6, github.com/apache/spark/pull/4556

   SPARK-3290 [GRAPHX] No unpersist calls in SVDPlusPlus
   Sean Owen <sowen@cloudera.com>
   2015-02-13 20:12:52 -0800
   Commit: 0ce4e43, github.com/apache/spark/pull/4234

   [SPARK-5227] [SPARK-5679] Disable FileSystem cache in WholeTextFileRecordReaderSuite
   Josh Rosen <joshrosen@databricks.com>
   2015-02-13 17:45:31 -0800
   Commit: d06d5ee, github.com/apache/spark/pull/4599

   [SPARK-5730][ML] add doc groups to spark.ml components
   Xiangrui Meng <meng@databricks.com>
   2015-02-13 16:45:59 -0800
   Commit: 4f4c6d5, github.com/apache/spark/pull/4600

   [SPARK-5803][MLLIB] use ArrayBuilder to build primitive arrays
   Xiangrui Meng <meng@databricks.com>
   2015-02-13 16:43:49 -0800
   Commit: d50a91d, github.com/apache/spark/pull/4594

   [SPARK-5806] re-organize sections in mllib-clustering.md
   Xiangrui Meng <meng@databricks.com>
   2015-02-13 15:09:27 -0800
   Commit: cc56c87, github.com/apache/spark/pull/4598

   [SPARK-5789][SQL]Throw a better error message if JsonRDD.parseJson encounters unrecoverable parsing errors.
   Yin Huai <yhuai@databricks.com>
   2015-02-13 13:51:06 -0800
   Commit: 2e0c084, github.com/apache/spark/pull/4582

   [SPARK-5642] [SQL] Apply column pruning on unused aggregation fields
   Daoyuan Wang <daoyuan.wang@intel.com>, Michael Armbrust <michael@databricks.com>
   2015-02-13 13:46:50 -0800
   Commit: 2cbb3e4, github.com/apache/spark/pull/4415

   [HOTFIX] Fix build break in MesosSchedulerBackendSuite
   Andrew Or <andrew@databricks.com>
   2015-02-13 13:10:29 -0800
   Commit: 5d3cc6b

   [HOTFIX] Ignore DirectKafkaStreamSuite.
   Reynold Xin <rxin@databricks.com>
   2015-02-13 12:43:53 -0800
   Commit: 378c7eb

   SPARK-5805 Fixed the type error in documentation.
   Emre Sevinç <emre.sevinc@gmail.com>
   2015-02-13 12:31:27 -0800
   Commit: 9f31db0, github.com/apache/spark/pull/4596

   [SPARK-5735] Replace uses of EasyMock with Mockito
   Josh Rosen <joshrosen@databricks.com>
   2015-02-13 09:53:57 -0800
   Commit: 077eec2, github.com/apache/spark/pull/4578

   [SPARK-5783] Better eventlog-parsing error messages
   Ryan Williams <ryan.blake.williams@gmail.com>
   2015-02-13 09:47:26 -0800
   Commit: fc6d3e7, github.com/apache/spark/pull/4573

   [SPARK-5503][MLLIB] Example code for Power Iteration Clustering
   sboeschhuawei <stephen.boesch@huawei.com>
   2015-02-13 09:45:57 -0800
   Commit: e1a1ff8, github.com/apache/spark/pull/4495

   [SPARK-5732][CORE]:Add an option to print the spark version in spark script.
   uncleGen <hustyugm@gmail.com>, genmao.ygm <genmao.ygm@alibaba-inc.com>
   2015-02-13 09:43:10 -0800
   Commit: c0ccd25, github.com/apache/spark/pull/4522

   [SPARK-4832][Deploy]some other processes might take the daemon pid
   WangTaoTheTonic <barneystinson@aliyun.com>, WangTaoTheTonic <wangtao111@huawei.com>
   2015-02-13 10:27:23 +0000
   Commit: 1768bd5, github.com/apache/spark/pull/3683

   [SPARK-3365][SQL]Wrong schema generated for List type
   tianyi <tianyi.asiainfo@gmail.com>
   2015-02-12 22:18:39 -0800
   Commit: 1c8633f, github.com/apache/spark/pull/4581

   [SQL] Fix docs of SQLContext.tables
   Yin Huai <yhuai@databricks.com>
   2015-02-12 20:37:55 -0800
   Commit: 2aea892, github.com/apache/spark/pull/4579

   [SPARK-3299][SQL]Public API in SQLContext to list tables
   Yin Huai <yhuai@databricks.com>
   2015-02-12 18:08:01 -0800
   Commit: 1d0596a, github.com/apache/spark/pull/4547

   [SQL] Move SaveMode to SQL package.
   Yin Huai <yhuai@databricks.com>
   2015-02-12 15:32:17 -0800
   Commit: c025a46, github.com/apache/spark/pull/4542

   [SPARK-5335] Fix deletion of security groups within a VPC
   Vladimir Grigor <vladimir@kiosked.com>, Vladimir Grigor <vladimir@voukka.com>
   2015-02-12 23:26:24 +0000
   Commit: ada993e, github.com/apache/spark/pull/4122

   [SPARK-5755] [SQL] remove unnecessary Add
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-02-12 15:22:07 -0800
   Commit: d5fc514, github.com/apache/spark/pull/4551

   [SPARK-5573][SQL] Add explode to dataframes
   Michael Armbrust <michael@databricks.com>
   2015-02-12 15:19:19 -0800
   Commit: ee04a8b, github.com/apache/spark/pull/4546

   [SPARK-5758][SQL] Use LongType as the default type for integers in JSON schema inference.
   Yin Huai <yhuai@databricks.com>
   2015-02-12 15:17:25 -0800
   Commit: c352ffb, github.com/apache/spark/pull/4544

   [SPARK-5780] [PySpark] Mute the logging during unit tests
   Davies Liu <davies@databricks.com>
   2015-02-12 14:54:38 -0800
   Commit: 0bf0315, github.com/apache/spark/pull/4572

   SPARK-5747: Fix wordsplitting bugs in make-distribution.sh
   David Y. Ross <dyross@gmail.com>
   2015-02-12 14:52:38 -0800
   Commit: 26c816e, github.com/apache/spark/pull/4540

   [SPARK-5759][Yarn]ExecutorRunnable should catch YarnException while NMClient start contain...
   lianhuiwang <lianhuiwang09@gmail.com>
   2015-02-12 14:50:16 -0800
   Commit: 947b8bd, github.com/apache/spark/pull/4554

   [SPARK-5760][SPARK-5761] Fix standalone rest protocol corner cases + revamp tests
   Andrew Or <andrew@databricks.com>
   2015-02-12 14:47:52 -0800
   Commit: 1d5663e, github.com/apache/spark/pull/4557

   [SPARK-5762] Fix shuffle write time for sort-based shuffle
   Kay Ousterhout <kayousterhout@gmail.com>
   2015-02-12 14:46:37 -0800
   Commit: 47c73d4, github.com/apache/spark/pull/4559

   [SPARK-5765][Examples]Fixed word split problem in run-example and compute-classpath
   Venkata Ramana G <ramana.gollamudihuawei.com>, Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
   2015-02-12 14:44:21 -0800
   Commit: 629d014, github.com/apache/spark/pull/4561

   [EC2] Update default Spark version to 1.2.1
   Katsunori Kanda <potix2@gmail.com>
   2015-02-12 14:38:42 -0800
   Commit: 9c80765, github.com/apache/spark/pull/4566

   [SPARK-5645] Added local read bytes/time to task metrics
   Kay Ousterhout <kayousterhout@gmail.com>
   2015-02-12 14:35:44 -0800
   Commit: 893d6fd, github.com/apache/spark/pull/4510

   [SQL] Improve error messages
   Michael Armbrust <michael@databricks.com>, wangfei <wangfei1@huawei.com>
   2015-02-12 13:11:28 -0800
   Commit: aa4ca8b, github.com/apache/spark/pull/4558

   [SQL][DOCS] Update sql documentation
   Antonio Navarro Perez <ajnavarro@users.noreply.github.com>
   2015-02-12 12:46:17 -0800
   Commit: 6a1be02, github.com/apache/spark/pull/4560

   SPARK-5776 JIRA version not of form x.y.z breaks merge_spark_pr.py
   Sean Owen <sowen@cloudera.com>
   2015-02-12 20:14:45 +0000
   Commit: bc57789, github.com/apache/spark/pull/4570

   [SPARK-5757][MLLIB] replace SQL JSON usage in model import/export by json4s
   Xiangrui Meng <meng@databricks.com>
   2015-02-12 10:48:13 -0800
   Commit: 99bd500, github.com/apache/spark/pull/4555

   [SPARK-5655] Don't chmod700 application files if running in YARN
   Andrew Rowson <github@growse.com>
   2015-02-12 18:41:39 +0000
   Commit: 466b1f6, github.com/apache/spark/pull/4509

   ignore cache paths for RAT tests
   Oren Mazor <oren.mazor@gmail.com>
   2015-02-12 18:37:00 +0000
   Commit: 9a6efbc, github.com/apache/spark/pull/4569

   SPARK-5727 [BUILD] Remove Debian packaging
   Sean Owen <sowen@cloudera.com>
   2015-02-12 12:36:26 +0000
   Commit: 9a3ea49, github.com/apache/spark/pull/4526

   [SQL] Make dataframe more tolerant of being serialized
   Michael Armbrust <michael@databricks.com>
   2015-02-11 19:05:49 -0800
   Commit: a38e23c, github.com/apache/spark/pull/4545

   [SQL] Two DataFrame fixes.
   Reynold Xin <rxin@databricks.com>
   2015-02-11 18:32:48 -0800
   Commit: d931b01, github.com/apache/spark/pull/4543

   [SPARK-3688][SQL] More inline comments for LogicalPlan.
   Reynold Xin <rxin@databricks.com>
   2015-02-11 15:26:31 -0800
   Commit: fa6bdc6, github.com/apache/spark/pull/4539

   [SPARK-3688][SQL]LogicalPlan can't resolve column correctly
   tianyi <tianyi.asiainfo@gmail.com>
   2015-02-11 12:50:17 -0800
   Commit: 44b2311, github.com/apache/spark/pull/4524

   [SPARK-5454] More robust handling of self joins
   Michael Armbrust <michael@databricks.com>
   2015-02-11 12:31:56 -0800
   Commit: a60d2b7, github.com/apache/spark/pull/4520

   Remove outdated remark about take(n).
   Daniel Darabos <darabos.daniel@gmail.com>
   2015-02-11 20:24:17 +0000
   Commit: 03bf704, github.com/apache/spark/pull/4533

   [SPARK-5677] [SPARK-5734] [SQL] [PySpark] Python DataFrame API remaining tasks
   Davies Liu <davies@databricks.com>
   2015-02-11 12:13:16 -0800
   Commit: b694eb9, github.com/apache/spark/pull/4528

   [SPARK-5733] Error Link in Pagination of HistoryPage when showing Incomplete Applications
   guliangliang <guliangliang@qiyi.com>
   2015-02-11 15:55:49 +0000
   Commit: 1ac099e, github.com/apache/spark/pull/4523

   SPARK-5727 [BUILD] Deprecate Debian packaging
   Sean Owen <sowen@cloudera.com>
   2015-02-11 08:30:16 +0000
   Commit: bd0d6e0, github.com/apache/spark/pull/4516

   SPARK-5728 [STREAMING] MQTTStreamSuite leaves behind ActiveMQ database files
   Sean Owen <sowen@cloudera.com>
   2015-02-11 08:13:51 +0000
   Commit: da89720, github.com/apache/spark/pull/4517

   [SPARK-4964] [Streaming] refactor createRDD to take leaders via map instead of array
   cody koeninger <cody@koeninger.org>
   2015-02-11 00:13:27 -0800
   Commit: 658687b, github.com/apache/spark/pull/4511

   HOTFIX: Adding Junit to Hive tests for Maven build
   Patrick Wendell <patrick@databricks.com>
   2015-02-10 23:39:21 -0800
   Commit: c2131c0

   HOTFIX: Java 6 compilation error in Spark SQL
   Patrick Wendell <patrick@databricks.com>
   2015-02-10 22:43:32 -0800
   Commit: 7e2f882

   [SPARK-5714][Mllib] Refactor initial step of LDA to remove redundant operations
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-10 21:51:15 -0800
   Commit: f86a89a, github.com/apache/spark/pull/4501

   [SPARK-5702][SQL] Allow short names for built-in data sources.
   Reynold Xin <rxin@databricks.com>
   2015-02-10 20:40:21 -0800
   Commit: b8f88d3, github.com/apache/spark/pull/4489

   [SPARK-5729] Potential NPE in standalone REST API
   Andrew Or <andrew@databricks.com>
   2015-02-10 20:19:14 -0800
   Commit: b969182, github.com/apache/spark/pull/4518

   [SPARK-4879] Use driver to coordinate Hadoop output committing for speculative tasks
   mcheah <mcheah@palantir.com>, Josh Rosen <joshrosen@databricks.com>
   2015-02-10 20:12:18 -0800
   Commit: 1cb3770, github.com/apache/spark/pull/4155

   [SQL][DataFrame] Fix column computability bug.
   Reynold Xin <rxin@databricks.com>
   2015-02-10 19:50:44 -0800
   Commit: 7e24249, github.com/apache/spark/pull/4519

   [SPARK-5709] [SQL] Add EXPLAIN support in DataFrame API for debugging purpose
   Cheng Hao <hao.cheng@intel.com>
   2015-02-10 19:40:51 -0800
   Commit: 45df77b, github.com/apache/spark/pull/4496

   [SPARK-5704] [SQL] [PySpark] createDataFrame from RDD with columns
   Davies Liu <davies@databricks.com>
   2015-02-10 19:40:12 -0800
   Commit: ea60284, github.com/apache/spark/pull/4498

   [SPARK-5683] [SQL] Avoid multiple json generator created
   Cheng Hao <hao.cheng@intel.com>
   2015-02-10 18:19:56 -0800
   Commit: a60aea8, github.com/apache/spark/pull/4468

   [SQL] Add an exception for analysis errors.
   Michael Armbrust <michael@databricks.com>
   2015-02-10 17:32:42 -0800
   Commit: 6195e24, github.com/apache/spark/pull/4439

   [SPARK-5658][SQL] Finalize DDL and write support APIs
   Yin Huai <yhuai@databricks.com>
   2015-02-10 17:29:52 -0800
   Commit: aaf50d0, github.com/apache/spark/pull/4446

   [SPARK-5493] [core] Add option to impersonate user.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-02-10 17:19:10 -0800
   Commit: ed167e7, github.com/apache/spark/pull/4405

   [SQL] Make Options in the data source API CREATE TABLE statements optional.
   Yin Huai <yhuai@databricks.com>
   2015-02-10 17:06:12 -0800
   Commit: e28b6bd, github.com/apache/spark/pull/4515

   [SPARK-5725] [SQL] Fixes ParquetRelation2.equals
   Cheng Lian <lian@databricks.com>
   2015-02-10 17:02:44 -0800
   Commit: 2d50a01, github.com/apache/spark/pull/4513

   [SQL][Minor] correct some comments
   Sheng, Li <OopsOutOfMemory@users.noreply.github.com>, OopsOutOfMemory <victorshengli@126.com>
   2015-02-11 00:59:46 +0000
   Commit: 91e3512, github.com/apache/spark/pull/4508

   [SPARK-5644] [Core]Delete tmp dir when sc is stop
   Sephiroth-Lin <linwzhong@gmail.com>
   2015-02-10 23:23:35 +0000
   Commit: 52983d7, github.com/apache/spark/pull/4412

   [SPARK-5343][GraphX]: ShortestPaths traverses backwards
   Brennon York <brennon.york@capitalone.com>
   2015-02-10 14:57:00 -0800
   Commit: 5820961, github.com/apache/spark/pull/4478

   [SPARK-5021] [MLlib] Gaussian Mixture now supports Sparse Input
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-02-10 14:05:55 -0800
   Commit: fd2c032, github.com/apache/spark/pull/4459

   [SPARK-5686][SQL] Add show current roles command in HiveQl
   OopsOutOfMemory <victorshengli@126.com>
   2015-02-10 13:20:15 -0800
   Commit: f98707c, github.com/apache/spark/pull/4471

   [SQL] Add toString to DataFrame/Column
   Michael Armbrust <michael@databricks.com>
   2015-02-10 13:14:01 -0800
   Commit: de80b1b, github.com/apache/spark/pull/4436

   [SPARK-5668] Display region in spark_ec2.py get_existing_cluster()
   Miguel Peralvo <miguel.peralvo@gmail.com>
   2015-02-10 19:54:52 +0000
   Commit: c49a404, github.com/apache/spark/pull/4457

   [SPARK-5592][SQL] java.net.URISyntaxException when insert data to a partitioned table
   wangfei <wangfei1@huawei.com>, Fei Wang <wangfei1@huawei.com>
   2015-02-10 11:54:30 -0800
   Commit: 59272da, github.com/apache/spark/pull/4368

   [HOTFIX][SPARK-4136] Fix compilation and tests
   Andrew Or <andrew@databricks.com>
   2015-02-10 11:18:01 -0800
   Commit: b640c84

   SPARK-4136. Under dynamic allocation, cancel outstanding executor requests when no longer needed
   Sandy Ryza <sandy@cloudera.com>
   2015-02-10 11:07:25 -0800
   Commit: 69bc3bb, github.com/apache/spark/pull/4168

   [SPARK-5716] [SQL] Support TOK_CHARSETLITERAL in HiveQl
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-02-10 11:08:21 -0800
   Commit: c7ad80a, github.com/apache/spark/pull/4502

   [Spark-5717] [MLlib] add stop and reorganize import
   JqueryFan <firing@126.com>, Yuhao Yang <hhbyyh@gmail.com>
   2015-02-10 17:37:32 +0000
   Commit: 6cc96cf, github.com/apache/spark/pull/4503

   [SPARK-1805] [EC2] Validate instance types
   Nicholas Chammas <nicholas.chammas@gmail.com>
   2015-02-10 15:45:38 +0000
   Commit: 50820f1, github.com/apache/spark/pull/4455

   [SPARK-5700] [SQL] [Build] Bumps jets3t to 0.9.3 for hadoop-2.3 and hadoop-2.4 profiles
   Cheng Lian <lian@databricks.com>
   2015-02-10 02:28:47 -0800
   Commit: ba66793, github.com/apache/spark/pull/4499

   SPARK-5239 [CORE] JdbcRDD throws "java.lang.AbstractMethodError: oracle.jdbc.driver.xxxxxx.isClosed()Z"
   Sean Owen <sowen@cloudera.com>
   2015-02-10 09:19:01 +0000
   Commit: 2d1e916, github.com/apache/spark/pull/4470

   [SPARK-4964][Streaming][Kafka] More updates to Exactly-once Kafka stream
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-02-09 22:45:48 -0800
   Commit: c151346, github.com/apache/spark/pull/4384

   [SPARK-5597][MLLIB] save/load for decision trees and ensembles
   Joseph K. Bradley <joseph@databricks.com>, Xiangrui Meng <meng@databricks.com>
   2015-02-09 22:09:07 -0800
   Commit: ef2f55b, github.com/apache/spark/pull/4444

   [SQL] Remove the duplicated code
   Cheng Hao <hao.cheng@intel.com>
   2015-02-09 21:33:34 -0800
   Commit: bd0b5ea, github.com/apache/spark/pull/4494

   [SPARK-5701] Only set ShuffleReadMetrics when task has shuffle deps
   Kay Ousterhout <kayousterhout@gmail.com>
   2015-02-09 21:22:09 -0800
   Commit: a2d33d0, github.com/apache/spark/pull/4488

   [SPARK-5703] AllJobsPage throws empty.max exception
   Andrew Or <andrew@databricks.com>
   2015-02-09 21:18:48 -0800
   Commit: a95ed52, github.com/apache/spark/pull/4490

   [SPARK-2996] Implement userClassPathFirst for driver, yarn.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-02-09 21:17:06 -0800
   Commit: 20a6013, github.com/apache/spark/pull/3233

   SPARK-4900 [MLLIB] MLlib SingularValueDecomposition ARPACK IllegalStateException
   Sean Owen <sowen@cloudera.com>
   2015-02-09 21:13:58 -0800
   Commit: 36c4e1d, github.com/apache/spark/pull/4485

   Add a config option to print DAG.
   KaiXinXiaoLei <huleilei1@huawei.com>
   2015-02-09 20:58:58 -0800
   Commit: 31d435e, github.com/apache/spark/pull/4257

   [SPARK-5469] restructure pyspark.sql into multiple files
   Davies Liu <davies@databricks.com>
   2015-02-09 20:49:22 -0800
   Commit: 08488c1, github.com/apache/spark/pull/4479

   [SPARK-5698] Do not let user request negative # of executors
   Andrew Or <andrew@databricks.com>
   2015-02-09 17:33:29 -0800
   Commit: d302c48, github.com/apache/spark/pull/4483

   [SPARK-5699] [SQL] [Tests] Runs hive-thriftserver tests whenever SQL code is modified
   Cheng Lian <lian@databricks.com>
   2015-02-09 16:52:05 -0800
   Commit: 3ec3ad2, github.com/apache/spark/pull/4486

   [SPARK-5648][SQL] support "alter ... unset tblproperties("key")"
   DoingDone9 <799203320@qq.com>
   2015-02-09 16:40:26 -0800
   Commit: d08e7c2, github.com/apache/spark/pull/4424

   [SPARK-2096][SQL] support dot notation on array of struct
   Wenchen Fan <cloud0fan@outlook.com>
   2015-02-09 16:39:34 -0800
   Commit: 0ee53eb, github.com/apache/spark/pull/2405

   [SPARK-5614][SQL] Predicate pushdown through Generate.
   Lu Yan <luyan02@baidu.com>
   2015-02-09 16:25:38 -0800
   Commit: 2a36292, github.com/apache/spark/pull/4394

   [SPARK-5696] [SQL] [HOTFIX] Asks HiveThriftServer2 to re-initialize log4j using Hive configurations
   Cheng Lian <lian@databricks.com>
   2015-02-09 16:23:12 -0800
   Commit: b8080aa, github.com/apache/spark/pull/4484

   [SQL] Code cleanup.
   Yin Huai <yhuai@databricks.com>
   2015-02-09 16:20:42 -0800
   Commit: 5f0b30e, github.com/apache/spark/pull/4482

   [SQL] Add some missing DataFrame functions.
   Michael Armbrust <michael@databricks.com>
   2015-02-09 16:02:56 -0800
   Commit: 68b25cf, github.com/apache/spark/pull/4437

   [SPARK-5611] [EC2] Allow spark-ec2 repo and branch to be set on CLI of spark_ec2.py
   Florian Verhein <florian.verhein@gmail.com>
   2015-02-09 23:47:07 +0000
   Commit: b884daa, github.com/apache/spark/pull/4385

   [SPARK-5675][SQL] XyzType companion object should subclass XyzType
   Reynold Xin <rxin@databricks.com>
   2015-02-09 14:51:46 -0800
   Commit: f48199e, github.com/apache/spark/pull/4463

   [SPARK-4905][STREAMING] FlumeStreamSuite fix.
   Hari Shreedharan <hshreedharan@apache.org>
   2015-02-09 14:17:14 -0800
   Commit: 0765af9, github.com/apache/spark/pull/4371

   [SPARK-5691] Fixing wrong data structure lookup for dupe app registratio...
   mcheah <mcheah@palantir.com>
   2015-02-09 13:20:14 -0800
   Commit: 6fe70d8, github.com/apache/spark/pull/4477

   [SPARK-5664][BUILD] Restore stty settings when exiting from SBT's spark-shell
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-09 11:45:12 -0800
   Commit: dae2161, github.com/apache/spark/pull/4451

   [SPARK-5678] Convert DataFrame to pandas.DataFrame and Series
   Davies Liu <davies@databricks.com>
   2015-02-09 11:42:52 -0800
   Commit: afb1316, github.com/apache/spark/pull/4476

   SPARK-4267 [YARN] Failing to launch jobs on Spark on YARN with Hadoop 2.5.0 or later
   Sean Owen <sowen@cloudera.com>
   2015-02-09 10:33:57 -0800
   Commit: de78060, github.com/apache/spark/pull/4452

   SPARK-2149. [MLLIB] Univariate kernel density estimation
   Sandy Ryza <sandy@cloudera.com>
   2015-02-09 10:12:12 +0000
   Commit: 0793ee1, github.com/apache/spark/pull/1093

   [SPARK-5473] [EC2] Expose SSH failures after status checks pass
   Nicholas Chammas <nicholas.chammas@gmail.com>
   2015-02-09 09:44:53 +0000
   Commit: 4dfe180, github.com/apache/spark/pull/4262

   [SPARK-5539][MLLIB] LDA guide
   Xiangrui Meng <meng@databricks.com>, Joseph K. Bradley <joseph@databricks.com>
   2015-02-08 23:40:36 -0800
   Commit: 855d12a, github.com/apache/spark/pull/4465

   [SPARK-5472][SQL] Fix Scala code style
   Hung Lin <hung@zoomdata.com>
   2015-02-08 22:36:42 -0800
   Commit: 4575c56, github.com/apache/spark/pull/4464

   SPARK-4405 [MLLIB] Matrices.* construction methods should check for rows x cols overflow
   Sean Owen <sowen@cloudera.com>
   2015-02-08 21:08:50 -0800
   Commit: 4396dfb, github.com/apache/spark/pull/4461

   [SPARK-5660][MLLIB] Make Matrix apply public
   Joseph K. Bradley <joseph@databricks.com>, Xiangrui Meng <meng@databricks.com>
   2015-02-08 21:07:36 -0800
   Commit: c171611, github.com/apache/spark/pull/4447

   [SPARK-5643][SQL] Add a show method to print the content of a DataFrame in tabular format.
   Reynold Xin <rxin@databricks.com>
   2015-02-08 18:56:51 -0800
   Commit: a052ed4, github.com/apache/spark/pull/4416

   SPARK-5665 [DOCS] Update netlib-java documentation
   Sam Halliday <sam.halliday@Gmail.com>, Sam Halliday <sam.halliday@gmail.com>
   2015-02-08 16:34:26 -0800
   Commit: 56aff4b, github.com/apache/spark/pull/4448

   [SPARK-5598][MLLIB] model save/load for ALS
   Xiangrui Meng <meng@databricks.com>
   2015-02-08 16:26:20 -0800
   Commit: 5c299c5, github.com/apache/spark/pull/4422

   [SQL] Set sessionState in QueryExecution.
   Yin Huai <yhuai@databricks.com>
   2015-02-08 14:55:07 -0800
   Commit: 804949d, github.com/apache/spark/pull/4445

   [SPARK-3039] [BUILD] Spark assembly for new hadoop API (hadoop 2) contai...
   medale <medale94@yahoo.com>
   2015-02-08 10:35:29 +0000
   Commit: 75fdccc, github.com/apache/spark/pull/4315

   [SPARK-5672][Web UI] Don't return `ERROR 500` when args are missing
   Kirill A. Korinskiy <catap@catap.ru>
   2015-02-08 10:31:46 +0000
   Commit: 23a99da, github.com/apache/spark/pull/4239

   [SPARK-5656] Fail gracefully for large values of k and/or n that will ex...
   mbittmann <mbittmann@gmail.com>, bittmannm <mark.bittmann@agilex.com>
   2015-02-08 10:13:29 +0000
   Commit: 4878313, github.com/apache/spark/pull/4433

   [SPARK-5366][EC2] Check the mode of private key
   liuchang0812 <liuchang0812@gmail.com>
   2015-02-08 10:08:51 +0000
   Commit: 6fb141e, github.com/apache/spark/pull/4162

   [SPARK-5671] Upgrade jets3t to 0.9.2 in hadoop-2.3 and 2.4 profiles
   Josh Rosen <joshrosen@databricks.com>
   2015-02-07 17:19:08 -0800
   Commit: 5de14cc, github.com/apache/spark/pull/4454

   [SPARK-5108][BUILD] Jackson dependency management for Hadoop-2.6.0 support
   Zhan Zhang <zhazhan@gmail.com>
   2015-02-07 19:41:30 +0000
   Commit: ecbbed2, github.com/apache/spark/pull/3938

   SPARK-5408: Use -XX:MaxPermSize specified by user instead of default in ...
   Jacek Lewandowski <lewandowski.jacek@gmail.com>
   2015-02-07 15:58:04 +0000
   Commit: dd4cb33, github.com/apache/spark/pull/4203

   [BUILD] Add the ability to launch spark-shell from SBT.
   Michael Armbrust <michael@databricks.com>
   2015-02-07 00:14:38 -0800
   Commit: e9a4fe1, github.com/apache/spark/pull/4438

   [SPARK-5388] Provide a stable application submission gateway for standalone cluster mode
   Andrew Or <andrew@databricks.com>
   2015-02-06 15:57:06 -0800
   Commit: 1390e56, github.com/apache/spark/pull/4216

   SPARK-5403: Ignore UserKnownHostsFile in SSH calls
   Grzegorz Dubicki <grzegorz.dubicki@gmail.com>
   2015-02-06 15:43:58 -0800
   Commit: e772b4e, github.com/apache/spark/pull/4196

   [SPARK-5601][MLLIB] make streaming linear algorithms Java-friendly
   Xiangrui Meng <meng@databricks.com>
   2015-02-06 15:42:59 -0800
   Commit: 0e23ca9, github.com/apache/spark/pull/4432

   [SQL] [Minor] HiveParquetSuite was disabled by mistake, re-enable them
   Cheng Lian <lian@databricks.com>
   2015-02-06 15:23:42 -0800
   Commit: c402140, github.com/apache/spark/pull/4440

   [SQL] Use TestSQLContext in Java tests
   Michael Armbrust <michael@databricks.com>
   2015-02-06 15:11:02 -0800
   Commit: 76c4bf5, github.com/apache/spark/pull/4441

   [SPARK-4994][network] Cleanup removed executors' ShuffleInfo in yarn shuffle service
   lianhuiwang <lianhuiwang09@gmail.com>
   2015-02-06 14:47:52 -0800
   Commit: 61073f8, github.com/apache/spark/pull/3828

   [SPARK-5444][Network] Add a retry to deal with the conflict port in netty server.
   huangzhaowei <carlmartinmax@gmail.com>
   2015-02-06 14:35:29 -0800
   Commit: 2bda1c1, github.com/apache/spark/pull/4240

   [SPARK-4874] [CORE] Collect record count metrics
   Kostas Sakellis <kostas@cloudera.com>
   2015-02-06 14:31:20 -0800
   Commit: dcd1e42, github.com/apache/spark/pull/4067

   [HOTFIX] Fix the maven build after adding sqlContext to spark-shell
   Michael Armbrust <michael@databricks.com>
   2015-02-06 14:27:06 -0800
   Commit: 5796156, github.com/apache/spark/pull/4443

   [SPARK-5600] [core] Clean up FsHistoryProvider test, fix app sort order.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-02-06 14:23:09 -0800
   Commit: 5687bab, github.com/apache/spark/pull/4370

   SPARK-5613: Catch the ApplicationNotFoundException exception to avoid thread from getting killed on yarn restart.
   Kashish Jain <kashish.jain@guavus.com>
   2015-02-06 13:47:23 -0800
   Commit: ca66159, github.com/apache/spark/pull/4392

   SPARK-5633 pyspark saveAsTextFile support for compression codec
   Vladimir Vladimirov <vladimir.vladimirov@magnetic.com>
   2015-02-06 13:55:02 -0800
   Commit: b3872e0, github.com/apache/spark/pull/4403

   [HOTFIX][MLLIB] fix a compilation error with java 6
   Xiangrui Meng <meng@databricks.com>
   2015-02-06 13:52:35 -0800
   Commit: 65181b7, github.com/apache/spark/pull/4442

   [SPARK-4983] Insert waiting time before tagging EC2 instances
   GenTang <gen.tang86@gmail.com>, Gen TANG <gen.tang86@gmail.com>
   2015-02-06 13:27:34 -0800
   Commit: 0f3a360, github.com/apache/spark/pull/3986

   [SPARK-5586][Spark Shell][SQL] Make `sqlContext` available in spark shell
   OopsOutOfMemory <victorshengli@126.com>
   2015-02-06 13:20:10 -0800
   Commit: 3d3ecd7, github.com/apache/spark/pull/4387

   [SPARK-5278][SQL] Introduce UnresolvedGetField and complete the check of ambiguous reference to fields
   Wenchen Fan <cloud0fan@outlook.com>
   2015-02-06 13:08:09 -0800
   Commit: 4793c84, github.com/apache/spark/pull/4068

   [SQL][Minor] Remove cache keyword in SqlParser
   wangfei <wangfei1@huawei.com>
   2015-02-06 12:42:23 -0800
   Commit: bc36356, github.com/apache/spark/pull/4393

   [SQL][HiveConsole][DOC] HiveConsole `correct hiveconsole imports`
   OopsOutOfMemory <victorshengli@126.com>
   2015-02-06 12:41:28 -0800
   Commit: b62c352, github.com/apache/spark/pull/4389

   [SPARK-5595][SPARK-5603][SQL] Add a rule to do PreInsert type casting and field renaming and invalidating in memory cache after INSERT
   Yin Huai <yhuai@databricks.com>
   2015-02-06 12:38:07 -0800
   Commit: 3eccf29, github.com/apache/spark/pull/4373

   [SPARK-5324][SQL] Results of describe can't be queried
   OopsOutOfMemory <victorshengli@126.com>, Sheng, Li <OopsOutOfMemory@users.noreply.github.com>
   2015-02-06 12:33:20 -0800
   Commit: 0b7eb3f, github.com/apache/spark/pull/4249

   [SPARK-5619][SQL] Support 'show roles' in HiveContext
   q00251598 <qiyadong@huawei.com>
   2015-02-06 12:29:26 -0800
   Commit: a958d60, github.com/apache/spark/pull/4397

   [SPARK-5640] Synchronize ScalaReflection where necessary
   Tobias Schlatter <tobias@meisch.ch>
   2015-02-06 12:15:02 -0800
   Commit: 500dc2b, github.com/apache/spark/pull/4431

   [SPARK-5650][SQL] Support optional 'FROM' clause
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-06 12:13:44 -0800
   Commit: d433816, github.com/apache/spark/pull/4426

   [SPARK-5628] Add version option to spark-ec2
   Nicholas Chammas <nicholas.chammas@gmail.com>
   2015-02-06 12:08:22 -0800
   Commit: 70e5b03, github.com/apache/spark/pull/4414

   [SPARK-2945][YARN][Doc] add doc for spark.executor.instances
   WangTaoTheTonic <wangtao111@huawei.com>
   2015-02-06 11:57:02 -0800
   Commit: d34f79c, github.com/apache/spark/pull/4350

   [SPARK-4361][Doc] Add more docs for Hadoop Configuration
   zsxwing <zsxwing@gmail.com>
   2015-02-06 11:50:20 -0800
   Commit: af2a2a2, github.com/apache/spark/pull/3225

   [HOTFIX] Fix test build break in ExecutorAllocationManagerSuite.
   Josh Rosen <joshrosen@databricks.com>
   2015-02-06 11:47:32 -0800
   Commit: fb6c0cb

   [SPARK-5652][Mllib] Use broadcasted weights in LogisticRegressionModel
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-06 11:22:11 -0800
   Commit: 80f3bcb, github.com/apache/spark/pull/4429

   [SPARK-5555] Enable UISeleniumSuite tests
   Josh Rosen <joshrosen@databricks.com>
   2015-02-06 11:14:58 -0800
   Commit: 0d74bd7, github.com/apache/spark/pull/4334

   SPARK-2450 Adds executor log links to Web UI
   Kostas Sakellis <kostas@cloudera.com>, Josh Rosen <joshrosen@databricks.com>
   2015-02-06 11:13:00 -0800
   Commit: 32e964c, github.com/apache/spark/pull/3486

   [SPARK-5618][Spark Core][Minor] Optimise utility code.
   Makoto Fukuhara <fukuo33@gmail.com>
   2015-02-06 11:11:38 -0800
   Commit: 4cdb26c, github.com/apache/spark/pull/4396

   [SPARK-5593][Core] Replace BlockManagerListener with ExecutorListener in ExecutorAllocationListener
   lianhuiwang <lianhuiwang09@gmail.com>
   2015-02-06 11:09:37 -0800
   Commit: 6072fcc, github.com/apache/spark/pull/4369

   [SPARK-4877] Allow user first classes to extend classes in the parent.
   Stephen Haberman <stephen@exigencecorp.com>
   2015-02-06 11:03:56 -0800
   Commit: 9792bec, github.com/apache/spark/pull/3725

   [SPARK-5396] Syntax error in spark scripts on windows.
   Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
   2015-02-06 10:58:26 -0800
   Commit: c01b985, github.com/apache/spark/pull/4428

   [SPARK-5636] Ramp up faster in dynamic allocation
   Andrew Or <andrew@databricks.com>
   2015-02-06 10:54:23 -0800
   Commit: fe3740c, github.com/apache/spark/pull/4409

   SPARK-4337. [YARN] Add ability to cancel pending requests
   Sandy Ryza <sandy@cloudera.com>
   2015-02-06 10:53:16 -0800
   Commit: 1a88f20, github.com/apache/spark/pull/4141

   [SPARK-5653][YARN] In ApplicationMaster rename isDriver to isClusterMode
   lianhuiwang <lianhuiwang09@gmail.com>
   2015-02-06 10:48:31 -0800
   Commit: cc6e531, github.com/apache/spark/pull/4430

   [SPARK-5013] [MLlib] Added documentation and sample data file for GaussianMixture
   Travis Galoppo <tjg2107@columbia.edu>
   2015-02-06 10:26:51 -0800
   Commit: 9ad56ad, github.com/apache/spark/pull/4401

   [SPARK-5416] init Executor.threadPool before ExecutorSource
   Ryan Williams <ryan.blake.williams@gmail.com>
   2015-02-06 12:22:25 +0000
   Commit: 37d35ab, github.com/apache/spark/pull/4212

   [Build] Set all Debian package permissions to 755
   Nicholas Chammas <nicholas.chammas@gmail.com>
   2015-02-06 11:38:39 +0000
   Commit: cf6778e, github.com/apache/spark/pull/4277

   Update ec2-scripts.md
   Miguel Peralvo <miguel.peralvo@gmail.com>
   2015-02-06 11:04:48 +0000
   Commit: f827ef4, github.com/apache/spark/pull/4300

   [SPARK-5470][Core] use defaultClassLoader to load classes in KryoSerializer
   lianhuiwang <lianhuiwang09@gmail.com>
   2015-02-06 11:00:35 +0000
   Commit: ed3aac7, github.com/apache/spark/pull/4258

   [SPARK-5582] [history] Ignore empty log directories.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-02-06 10:07:20 +0000
   Commit: 8569289, github.com/apache/spark/pull/4352

   [SPARK-5157][YARN] Configure more JVM options properly when we use ConcMarkSweepGC for AM.
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-02-06 09:39:12 +0000
   Commit: 24dbc50, github.com/apache/spark/pull/3956

   [Minor] Remove permission for execution from spark-shell.cmd
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-02-06 09:33:36 +0000
   Commit: f6ba813, github.com/apache/spark/pull/3983

   [SPARK-5380][GraphX] Solve an ArrayIndexOutOfBoundsException when building a graph with a file format error
   Leolh <leosandylh@gmail.com>
   2015-02-06 09:01:53 +0000
   Commit: 575d2df, github.com/apache/spark/pull/4176

   [SPARK-4789] [SPARK-4942] [SPARK-5031] [mllib] Standardize ML Prediction APIs
   Joseph K. Bradley <joseph@databricks.com>
   2015-02-05 23:43:47 -0800
   Commit: dc0c449, github.com/apache/spark/pull/3637

   [SPARK-5604][MLLIB] remove checkpointDir from trees
   Xiangrui Meng <meng@databricks.com>
   2015-02-05 23:32:09 -0800
   Commit: 6b88825, github.com/apache/spark/pull/4407

   [SPARK-5639][SQL] Support DataFrame.renameColumn.
   Reynold Xin <rxin@databricks.com>
   2015-02-05 23:02:40 -0800
   Commit: 7dc4965, github.com/apache/spark/pull/4410

   Revert "SPARK-5607: Update to Kryo 2.24.0 to avoid including objenesis 1.2."
   Patrick Wendell <patrick@databricks.com>
   2015-02-05 18:36:48 -0800
   Commit: 6d3b7cb

   SPARK-5557: Explicitly include servlet API in dependencies.
   Patrick Wendell <patrick@databricks.com>
   2015-02-05 18:14:54 -0800
   Commit: 793dbae, github.com/apache/spark/pull/4411

   [HOTFIX] [SQL] Disables Metastore Parquet table conversion for "SQLQuerySuite.CTAS with serde"
   Cheng Lian <lian@databricks.com>
   2015-02-05 18:09:18 -0800
   Commit: 7c0a648, github.com/apache/spark/pull/4413

   [SPARK-5638][SQL] Add a config flag to disable eager analysis of DataFrames
   Reynold Xin <rxin@databricks.com>
   2015-02-05 18:07:10 -0800
   Commit: e8a5d50, github.com/apache/spark/pull/4408

   [SPARK-5620][DOC] group methods in generated unidoc
   Xiangrui Meng <meng@databricks.com>
   2015-02-05 16:26:51 -0800
   Commit: 85ccee8, github.com/apache/spark/pull/4404

   [SPARK-5182] [SPARK-5528] [SPARK-5509] [SPARK-3575] [SQL] Parquet data source improvements
   Cheng Lian <lian@databricks.com>
   2015-02-05 15:29:56 -0800
   Commit: a9ed511, github.com/apache/spark/pull/4308

   [SPARK-5604][MLLIB] remove checkpointDir from LDA
   Xiangrui Meng <meng@databricks.com>
   2015-02-05 15:07:33 -0800
   Commit: c19152c, github.com/apache/spark/pull/4390

   [SPARK-5460][MLlib] Wrapped `Try` around `deleteAllCheckpoints` - RandomForest.
   x1- <viva008@gmail.com>
   2015-02-05 15:02:04 -0800
   Commit: 62371ad, github.com/apache/spark/pull/4347

   [SPARK-5135][SQL] Add support for describe table to DDL in SQLContext
   OopsOutOfMemory <victorshengli@126.com>
   2015-02-05 13:07:48 -0800
   Commit: 4d8d070, github.com/apache/spark/pull/4227

   [SPARK-5617][SQL] fix test failure of SQLQuerySuite
   wangfei <wangfei1@huawei.com>
   2015-02-05 12:44:12 -0800
   Commit: a83936e, github.com/apache/spark/pull/4395

   [Branch-1.3] [DOC] doc fix for date
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-02-05 12:42:27 -0800
   Commit: 6fa4ac1, github.com/apache/spark/pull/4400

   SPARK-5548: Fixed a race condition in AkkaUtilsSuite
   Jacek Lewandowski <lewandowski.jacek@gmail.com>
   2015-02-05 12:00:04 -0800
   Commit: 081ac69, github.com/apache/spark/pull/4343

   [SPARK-5474][Build] curl should support URL redirection in build/mvn
   GuoQiang Li <witgo@qq.com>
   2015-02-05 12:03:13 -0800
   Commit: 3414754, github.com/apache/spark/pull/4263

   [SPARK-5608] Improve SEO of Spark documentation pages
   Matei Zaharia <matei@databricks.com>
   2015-02-05 11:12:50 -0800
   Commit: 4d74f06, github.com/apache/spark/pull/4381

   SPARK-4687. Add a recursive option to the addFile API
   Sandy Ryza <sandy@cloudera.com>
   2015-02-05 10:15:55 -0800
   Commit: c4b1108, github.com/apache/spark/pull/3670

   [HOTFIX] MLlib build break.
   Reynold Xin <rxin@databricks.com>
   2015-02-05 00:42:50 -0800
   Commit: 6580929

   [MLlib] Minor: UDF style update.
   Reynold Xin <rxin@databricks.com>
   2015-02-04 23:57:53 -0800
   Commit: c3ba4d4, github.com/apache/spark/pull/4388

   [SPARK-5612][SQL] Move DataFrame implicit functions into SQLContext.implicits.
   Reynold Xin <rxin@databricks.com>
   2015-02-04 23:44:34 -0800
   Commit: 7d789e1, github.com/apache/spark/pull/4386

   [SPARK-5606][SQL] Support plus sign in HiveContext
   q00251598 <qiyadong@huawei.com>
   2015-02-04 23:16:01 -0800
   Commit: 9d3a75e, github.com/apache/spark/pull/4378

   [SPARK-5599] Check MLlib public APIs for 1.3
   Xiangrui Meng <meng@databricks.com>
   2015-02-04 23:03:47 -0800
   Commit: db34690, github.com/apache/spark/pull/4377

   [SPARK-5596] [mllib] ML model import/export for GLMs, NaiveBayes
   Joseph K. Bradley <joseph@databricks.com>
   2015-02-04 22:46:48 -0800
   Commit: 975bcef, github.com/apache/spark/pull/4233

   SPARK-5607: Update to Kryo 2.24.0 to avoid including objenesis 1.2.
   Patrick Wendell <patrick@databricks.com>
   2015-02-04 22:39:44 -0800
   Commit: c23ac03, github.com/apache/spark/pull/4383

   [SPARK-5602][SQL] Better support for creating DataFrame from local data collection
   Reynold Xin <rxin@databricks.com>
   2015-02-04 19:53:57 -0800
   Commit: 84acd08, github.com/apache/spark/pull/4372

   [SPARK-5538][SQL] Fix flaky CachedTableSuite
   Reynold Xin <rxin@databricks.com>
   2015-02-04 19:52:41 -0800
   Commit: 206f9bc, github.com/apache/spark/pull/4379

   [SQL][DataFrame] Minor cleanup.
   Reynold Xin <rxin@databricks.com>
   2015-02-04 19:51:48 -0800
   Commit: 6b4c7f0, github.com/apache/spark/pull/4374

   [SPARK-4520] [SQL] This pr fixes the ArrayIndexOutOfBoundsException as r...
   Sadhan Sood <sadhan@tellapart.com>
   2015-02-04 19:18:06 -0800
   Commit: dba98bf, github.com/apache/spark/pull/4148

   [SPARK-5605][SQL][DF] Allow using String to specify column name in DSL aggregate functions
   Reynold Xin <rxin@databricks.com>
   2015-02-04 18:35:51 -0800
   Commit: 1fbd124, github.com/apache/spark/pull/4376

   [SPARK-5411] Allow SparkListeners to be specified in SparkConf and loaded when creating SparkContext
   Josh Rosen <joshrosen@databricks.com>
   2015-02-04 17:18:03 -0800
   Commit: 9a7ce70, github.com/apache/spark/pull/4111

   [SPARK-5577] Python udf for DataFrame
   Davies Liu <davies@databricks.com>
   2015-02-04 15:55:09 -0800
   Commit: dc101b0, github.com/apache/spark/pull/4351

   [SPARK-5118][SQL] Fix: create table test stored as parquet as select ..
   guowei2 <guowei2@asiainfo.com>
   2015-02-04 15:26:10 -0800
   Commit: e0490e2, github.com/apache/spark/pull/3921

   [SQL] Use HiveContext's sessionState in HiveMetastoreCatalog.hiveDefaultTableFilePath
   Yin Huai <yhuai@databricks.com>
   2015-02-04 15:22:40 -0800
   Commit: 548c9c2, github.com/apache/spark/pull/4355

   [SQL] Correct the default size of TimestampType and expose NumericType
   Yin Huai <yhuai@databricks.com>
   2015-02-04 15:14:49 -0800
   Commit: 0d81645, github.com/apache/spark/pull/4314

   [SQL][Hiveconsole] Bring hive console code up to date and update README.md
   OopsOutOfMemory <victorshengli@126.com>, Sheng, Li <OopsOutOfMemory@users.noreply.github.com>
   2015-02-04 15:13:54 -0800
   Commit: b73d5ff, github.com/apache/spark/pull/4330

   [SPARK-5367][SQL] Support star expression in udfs
   wangfei <wangfei1@huawei.com>, scwf <wangfei1@huawei.com>
   2015-02-04 15:12:07 -0800
   Commit: 417d111, github.com/apache/spark/pull/4353

   [SPARK-5426][SQL] Add SparkSQL Java API helper methods.
   kul <kuldeep.bora@gmail.com>
   2015-02-04 15:08:37 -0800
   Commit: 424cb69, github.com/apache/spark/pull/4243

   [SPARK-5587][SQL] Support change database owner
   wangfei <wangfei1@huawei.com>
   2015-02-04 14:35:12 -0800
   Commit: b90dd39, github.com/apache/spark/pull/4357

   [SPARK-5591][SQL] Fix NoSuchObjectException for CTAS
   wangfei <wangfei1@huawei.com>
   2015-02-04 14:33:07 -0800
   Commit: a9f0db1, github.com/apache/spark/pull/4365

   [SPARK-4939] move to next locality when no pending tasks
   Davies Liu <davies@databricks.com>
   2015-02-04 14:22:07 -0800
   Commit: 0a89b15, github.com/apache/spark/pull/3779

   [SPARK-4707][STREAMING] Reliable Kafka Receiver can lose data if the blo...
   Hari Shreedharan <hshreedharan@apache.org>
   2015-02-04 14:20:44 -0800
   Commit: f0500f9, github.com/apache/spark/pull/3655

   [SPARK-4964] [Streaming] Exactly-once semantics for Kafka
   cody koeninger <cody@koeninger.org>
   2015-02-04 12:06:34 -0800
   Commit: b0c0021, github.com/apache/spark/pull/3798

   [SPARK-5588] [SQL] support select/filter by SQL expression
   Davies Liu <davies@databricks.com>
   2015-02-04 11:34:46 -0800
   Commit: ac0b2b7, github.com/apache/spark/pull/4359

   [SPARK-5585] Flaky test in MLlib python
   Davies Liu <davies@databricks.com>
   2015-02-04 08:54:20 -0800
   Commit: 38a416f, github.com/apache/spark/pull/4358

   [SPARK-5574] use given name prefix in dir
   Imran Rashid <irashid@cloudera.com>
   2015-02-04 01:02:20 -0800
   Commit: 5aa0f21, github.com/apache/spark/pull/4344

   [Minor] Fix incorrect warning log
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-04 00:52:41 -0800
   Commit: a74cbbf, github.com/apache/spark/pull/4360

   [SPARK-5379][Streaming] Add awaitTerminationOrTimeout
   zsxwing <zsxwing@gmail.com>
   2015-02-04 00:40:28 -0800
   Commit: 4cf4cba, github.com/apache/spark/pull/4171

   [SPARK-5341] Use maven coordinates as dependencies in spark-shell and spark-submit
   Burak Yavuz <brkyvz@gmail.com>
   2015-02-03 22:39:17 -0800
   Commit: 6aed719, github.com/apache/spark/pull/4215

   [SPARK-4939] revive offers periodically in LocalBackend
   Davies Liu <davies@databricks.com>
   2015-02-03 22:30:23 -0800
   Commit: 83de71c, github.com/apache/spark/pull/4147

   [SPARK-4969][STREAMING][PYTHON] Add binaryRecords to streaming
   freeman <the.freeman.lab@gmail.com>
   2015-02-03 22:24:30 -0800
   Commit: 242b4f0, github.com/apache/spark/pull/3803

   [SPARK-5579][SQL][DataFrame] Support for project/filter using SQL expressions
   Reynold Xin <rxin@databricks.com>
   2015-02-03 22:15:35 -0800
   Commit: 40c4cb2, github.com/apache/spark/pull/4348

   [FIX][MLLIB] fix seed handling in Python GMM
   Xiangrui Meng <meng@databricks.com>
   2015-02-03 20:39:11 -0800
   Commit: eb15631, github.com/apache/spark/pull/4349

   [SPARK-4795][Core] Redesign the "primitive type => Writable" implicit APIs to make them be activated automatically
   zsxwing <zsxwing@gmail.com>
   2015-02-03 20:17:12 -0800
   Commit: d37978d, github.com/apache/spark/pull/3642

   [SPARK-5578][SQL][DataFrame] Provide a convenient way for Scala users to use UDFs
   Reynold Xin <rxin@databricks.com>
   2015-02-03 20:07:46 -0800
   Commit: 1077f2e, github.com/apache/spark/pull/4345

   [SPARK-5520][MLlib] Make FP-Growth implementation take generic item types (WIP)
   Jacky Li <jacky.likun@huawei.com>, Jacky Li <jackylk@users.noreply.github.com>, Xiangrui Meng <meng@databricks.com>
   2015-02-03 17:02:42 -0800
   Commit: e380d2d, github.com/apache/spark/pull/4340

   [SPARK-5554] [SQL] [PySpark] add more tests for DataFrame Python API
   Davies Liu <davies@databricks.com>
   2015-02-03 16:01:56 -0800
   Commit: 068c0e2, github.com/apache/spark/pull/4331

   [STREAMING] SPARK-4986 Wait for receivers to deregister and receiver job to terminate
   Jesper Lundgren <jesper.lundgren@vpon.com>
   2015-02-03 14:53:39 -0800
   Commit: 1e8b539, github.com/apache/spark/pull/4338

   [SPARK-5153][Streaming][Test] Increased timeout to deal with flaky KafkaStreamSuite
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-02-03 13:46:02 -0800
   Commit: 681f9df, github.com/apache/spark/pull/4342

   [SPARK-4508] [SQL] build native date type to conform behavior to Hive
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-02-03 12:21:45 -0800
   Commit: db821ed, github.com/apache/spark/pull/4325

   [SPARK-5383][SQL] Support alias for udtfs
   wangfei <wangfei1@huawei.com>, scwf <wangfei1@huawei.com>, Fei Wang <wangfei1@huawei.com>
   2015-02-03 12:16:31 -0800
   Commit: 5adbb39, github.com/apache/spark/pull/4186

   [SPARK-5550] [SQL] Support case insensitivity for UDF
   Cheng Hao <hao.cheng@intel.com>
   2015-02-03 12:12:26 -0800
   Commit: ca7a6cd, github.com/apache/spark/pull/4326

   [SPARK-4987] [SQL] parquet timestamp type support
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-02-03 12:06:06 -0800
   Commit: 0c20ce6, github.com/apache/spark/pull/3820


 Release 1.3.1

   [SQL] Use path.makeQualified in newParquet.
   Yin Huai <yhuai@databricks.com>
   2015-04-04 23:26:10 +0800
   Commit: eb57d4f, github.com/apache/spark/pull/5353

   [SPARK-6700] disable flaky test
   Davies Liu <davies@databricks.com>
   2015-04-03 15:22:21 -0700
   Commit: 3366af6, github.com/apache/spark/pull/5356

   [SPARK-6688] [core] Always use resolved URIs in EventLoggingListener.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-04-03 11:54:31 -0700
   Commit: f17a2fe, github.com/apache/spark/pull/5340

   [SPARK-6575][SQL] Converted Parquet Metastore tables no longer cache metadata
   Yin Huai <yhuai@databricks.com>
   2015-04-03 14:40:36 +0800
   Commit: 0c1b78b, github.com/apache/spark/pull/5339

   [SPARK-6621][Core] Fix the bug that calling EventLoop.stop in EventLoop.onReceive/onError/onStart doesn't call onStop
   zsxwing <zsxwing@gmail.com>
   2015-04-02 22:54:30 -0700
   Commit: ac705aa, github.com/apache/spark/pull/5280

   [SPARK-6345][STREAMING][MLLIB] Fix for training with prediction
   freeman <the.freeman.lab@gmail.com>
   2015-04-02 21:37:44 -0700
   Commit: d21f779, github.com/apache/spark/pull/5037

   [CORE] The description of jobHistory config should be spark.history.fs.logDirectory
   KaiXinXiaoLei <huleilei1@huawei.com>
   2015-04-02 20:24:31 -0700
   Commit: 17ab6b0, github.com/apache/spark/pull/5332

   [SPARK-6575][SQL] Converted Parquet Metastore tables no longer cache metadata
   Yin Huai <yhuai@databricks.com>
   2015-04-02 20:23:08 -0700
   Commit: 0c1c0fb, github.com/apache/spark/pull/5339

   [SPARK-6650] [core] Stop ExecutorAllocationManager when context stops.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-04-02 19:48:55 -0700
   Commit: 0ef46b2, github.com/apache/spark/pull/5311

   [SPARK-6686][SQL] Use resolved output instead of names for toDF rename
   Michael Armbrust <michael@databricks.com>
   2015-04-02 18:30:55 -0700
   Commit: 2927af1, github.com/apache/spark/pull/5337

   [SPARK-6672][SQL] convert row to catalyst in createDataFrame(RDD[Row], ...)
   Xiangrui Meng <meng@databricks.com>
   2015-04-02 17:57:01 +0800
   Commit: c2694bb, github.com/apache/spark/pull/5329

   [SPARK-6618][SPARK-6669][SQL] Lock Hive metastore client correctly.
   Yin Huai <yhuai@databricks.com>, Michael Armbrust <michael@databricks.com>
   2015-04-02 16:46:50 -0700
   Commit: e6ee95c, github.com/apache/spark/pull/5333

   [Minor] [SQL] Follow-up of PR #5210
   Cheng Lian <lian@databricks.com>
   2015-04-02 16:15:34 -0700
   Commit: 4f1fe3f, github.com/apache/spark/pull/5219

   [SPARK-6655][SQL] We need to read the schema of a data source table stored in spark.sql.sources.schema property
   Yin Huai <yhuai@databricks.com>
   2015-04-02 16:02:31 -0700
   Commit: aecec07, github.com/apache/spark/pull/5313

   [SQL] Throw UnsupportedOperationException instead of NotImplementedError
   Michael Armbrust <michael@databricks.com>
   2015-04-02 16:01:03 -0700
   Commit: 78ba245, github.com/apache/spark/pull/5315

   SPARK-6414: Spark driver failed with NPE on job cancelation
   Hung Lin <hung.lin@gmail.com>
   2015-04-02 14:01:43 -0700
   Commit: 58e2b3f, github.com/apache/spark/pull/5124

   [SPARK-6079] Use index to speed up StatusTracker.getJobIdsForGroup()
   Josh Rosen <joshrosen@databricks.com>
   2015-03-25 17:40:00 -0700
   Commit: a6664dc, github.com/apache/spark/pull/4830

   [SPARK-6667] [PySpark] remove setReuseAddress
   Davies Liu <davies@databricks.com>
   2015-04-02 12:18:33 -0700
   Commit: ee2bd70, github.com/apache/spark/pull/5324

   Revert "[SPARK-6618][SQL] HiveMetastoreCatalog.lookupRelation should use fine-grained lock"
   Cheng Lian <lian@databricks.com>
   2015-04-02 12:59:38 +0800
   Commit: 1160cc9

   [SQL] SPARK-6658: Update DataFrame documentation to refer to correct types
   Michael Armbrust <michael@databricks.com>
   2015-04-01 18:00:07 -0400
   Commit: 223dd3f

   [SPARK-6578] Small rewrite to make the logic more clear in MessageWithHeader.transferTo.
   Reynold Xin <rxin@databricks.com>
   2015-04-01 18:36:06 -0700
   Commit: d697b76, github.com/apache/spark/pull/5319

   [SPARK-6660][MLLIB] pythonToJava doesn't recognize object arrays
   Xiangrui Meng <meng@databricks.com>
   2015-04-01 18:17:07 -0700
   Commit: 0d1e476, github.com/apache/spark/pull/5318

   [SPARK-6553] [pyspark] Support functools.partial as UDF
   ksonj <kson@siberie.de>
   2015-04-01 17:23:57 -0700
   Commit: 98f72df, github.com/apache/spark/pull/5206

   [SPARK-6642][MLLIB] use 1.2 lambda scaling and remove addImplicit from NormalEquation
   Xiangrui Meng <meng@databricks.com>
   2015-04-01 16:47:18 -0700
   Commit: bc04fa2, github.com/apache/spark/pull/5314

   [SPARK-6578] [core] Fix thread-safety issue in outbound path of network library.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-04-01 16:06:11 -0700
   Commit: 1c31ebd, github.com/apache/spark/pull/5234

   [SPARK-6657] [Python] [Docs] fixed python doc build warnings
   Joseph K. Bradley <joseph@databricks.com>
   2015-04-01 15:15:47 -0700
   Commit: e347a7a, github.com/apache/spark/pull/5317

   [SPARK-6651][MLLIB] delegate dense vector arithmetics to the underlying numpy array
   Xiangrui Meng <meng@databricks.com>
   2015-04-01 13:29:04 -0700
   Commit: f50d95a, github.com/apache/spark/pull/5312

   SPARK-6626 [DOCS]: Corrected Scala:TwitterUtils parameters
   jayson <jayson@ziprecruiter.com>
   2015-04-01 11:12:55 +0100
   Commit: 7d029cb, github.com/apache/spark/pull/5295

   [Doc] Improve Python DataFrame documentation
   Reynold Xin <rxin@databricks.com>
   2015-03-31 18:31:36 -0700
   Commit: e527b35, github.com/apache/spark/pull/5287

   [SPARK-6614] OutputCommitCoordinator should clear authorized committer only after authorized committer fails, not after any failure
   Josh Rosen <joshrosen@databricks.com>
   2015-03-31 16:18:39 -0700
   Commit: c4c982a, github.com/apache/spark/pull/5276

   [SPARK-6633][SQL] Should be "Contains" instead of "EndsWith" when constructing sources.StringContains
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-03-31 13:18:07 -0700
   Commit: d851646, github.com/apache/spark/pull/5299

   [SPARK-5371][SQL] Propagate types after function conversion, before further resolution
   Michael Armbrust <michael@databricks.com>
   2015-03-31 11:34:29 -0700
   Commit: 5a957fe, github.com/apache/spark/pull/5278

   [SPARK-6145][SQL] fix ORDER BY on nested fields
   Michael Armbrust <michael@databricks.com>
   2015-03-31 11:23:18 -0700
   Commit: 045228f, github.com/apache/spark/pull/5189

   [SPARK-6575] [SQL] Adds configuration to disable schema merging while converting metastore Parquet tables
   Cheng Lian <lian@databricks.com>
   2015-03-31 11:21:15 -0700
   Commit: 778c876, github.com/apache/spark/pull/5231

   [SPARK-6555] [SQL] Overrides equals() and hashCode() for MetastoreRelation
   Cheng Lian <lian@databricks.com>
   2015-03-31 11:18:25 -0700
   Commit: 9ebefb1, github.com/apache/spark/pull/5289

   [SPARK-6618][SQL] HiveMetastoreCatalog.lookupRelation should use fine-grained lock
   Yin Huai <yhuai@databricks.com>
   2015-03-31 16:28:40 +0800
   Commit: fd600ce, github.com/apache/spark/pull/5281

   [SPARK-6623][SQL] Alias DataFrame.na.drop and DataFrame.na.fill in Python.
   Reynold Xin <rxin@databricks.com>
   2015-03-31 00:25:23 -0700
   Commit: cf651a4, github.com/apache/spark/pull/5284

   [SPARK-6625][SQL] Add common string filters to data sources.
   Reynold Xin <rxin@databricks.com>
   2015-03-31 00:19:51 -0700
   Commit: a97d4e6, github.com/apache/spark/pull/5285

   [SPARK-6119][SQL] DataFrame support for missing data handling
   Reynold Xin <rxin@databricks.com>
   2015-03-30 20:47:10 -0700
   Commit: 67c885e, github.com/apache/spark/pull/5274

   [SPARK-6369] [SQL] Uses commit coordinator to help committing Hive and Parquet tables
   Cheng Lian <lian@databricks.com>
   2015-03-31 07:48:37 +0800
   Commit: fedbfc7, github.com/apache/spark/pull/5139

   [SPARK-6603] [PySpark] [SQL] add SQLContext.udf and deprecate inferSchema() and applySchema
   Davies Liu <davies@databricks.com>
   2015-03-30 15:47:00 -0700
   Commit: 30e7c63, github.com/apache/spark/pull/5273

   [SPARK-6592][SQL] fix filter for scaladoc to generate API doc for Row class under catalyst dir
   CodingCat <zhunansjtu@gmail.com>
   2015-03-30 11:54:44 -0700
   Commit: f9d4efa, github.com/apache/spark/pull/5252

   [SPARK-6571][MLLIB] use wrapper in MatrixFactorizationModel.load
   Xiangrui Meng <meng@databricks.com>
   2015-03-28 15:08:05 -0700
   Commit: 93a7166, github.com/apache/spark/pull/5243

   [SPARK-6595][SQL] MetastoreRelation should be a MultiInstanceRelation
   Michael Armbrust <michael@databricks.com>
   2015-03-30 22:24:12 +0800
   Commit: c411530, github.com/apache/spark/pull/5251

   [SPARK-6558] Utils.getCurrentUserName returns the full principal name instead of login name
   Thomas Graves <tgraves@apache.org>
   2015-03-29 12:43:30 +0100
   Commit: f8132de, github.com/apache/spark/pull/5229

   [SPARK-5750][SPARK-3441][SPARK-5836][CORE] Added documentation explaining shuffle
   Ilya Ganelin <ilya.ganelin@capitalone.com>, Ilya Ganelin <ilganeli@gmail.com>
   2015-03-30 11:52:02 +0100
   Commit: 1c59a4b, github.com/apache/spark/pull/5074

   [spark-sql] a better exception message than "scala.MatchError" for unsupported types in Schema creation
   Eran Medan <ehrann.mehdan@gmail.com>
   2015-03-30 00:02:52 -0700
   Commit: 4859c40, github.com/apache/spark/pull/5235

   [HOTFIX] Build break due to NoRelation cherry-pick.
   Reynold Xin <rxin@databricks.com>
   2015-03-29 12:07:28 -0700
   Commit: 6181366

   [DOC] Improvements to Python docs.
   Reynold Xin <rxin@databricks.com>
   2015-03-28 23:59:27 -0700
   Commit: 3db0844, github.com/apache/spark/pull/5238

   [SPARK-6538][SQL] Add missing nullable Metastore fields when merging a Parquet schema
   Adam Budde <budde@amazon.com>
   2015-03-28 09:14:09 +0800
   Commit: 5e04f45, github.com/apache/spark/pull/5214

   [SPARK-6564][SQL] SQLContext.emptyDataFrame should contain 0 row, not 1 row
   Reynold Xin <rxin@databricks.com>
   2015-03-27 14:56:57 -0700
   Commit: 7006858, github.com/apache/spark/pull/5226

   [SPARK-6544][build] Increment Avro version from 1.7.6 to 1.7.7
   Dean Chen <deanchen5@gmail.com>
   2015-03-27 14:32:51 +0000
   Commit: fefd49f, github.com/apache/spark/pull/5193

   [SPARK-6574] [PySpark] fix sql example
   Davies Liu <davies@databricks.com>
   2015-03-27 11:42:26 -0700
   Commit: b902a95, github.com/apache/spark/pull/5230

   [SPARK-6550][SQL] Use analyzed plan in DataFrame
   Michael Armbrust <michael@databricks.com>
   2015-03-27 11:40:00 -0700
   Commit: bc75189, github.com/apache/spark/pull/5217

   [SPARK-6341][mllib] Upgrade breeze from 0.11.1 to 0.11.2
   Yu ISHIKAWA <yuu.ishikawa@gmail.com>
   2015-03-27 00:15:02 -0700
   Commit: b318858, github.com/apache/spark/pull/5222

   [DOCS][SQL] Fix JDBC example
   Michael Armbrust <michael@databricks.com>
   2015-03-26 14:51:46 -0700
   Commit: 54d92b5, github.com/apache/spark/pull/5192

   [SPARK-6554] [SQL] Don't push down predicates which reference partition column(s)
   Cheng Lian <lian@databricks.com>
   2015-03-26 13:11:37 -0700
   Commit: 3d54578, github.com/apache/spark/pull/5210

   [SPARK-6117] [SQL] Improvements to DataFrame.describe()
   Reynold Xin <rxin@databricks.com>
   2015-03-26 12:26:13 -0700
   Commit: 28e3a1e, github.com/apache/spark/pull/5201

   [SPARK-6117] [SQL] add describe function to DataFrame for summary statis...
   azagrebin <azagrebin@gmail.com>
   2015-03-26 00:25:04 -0700
   Commit: 84735c3, github.com/apache/spark/pull/5073

   SPARK-6480 [CORE] histogram() bucket function is wrong in some simple edge cases
   Sean Owen <sowen@cloudera.com>
   2015-03-26 15:00:23 +0000
   Commit: aa2d157, github.com/apache/spark/pull/5148

   [SPARK-6491] Spark will put the current working dir to the CLASSPATH
   guliangliang <guliangliang@qiyi.com>
   2015-03-26 13:28:56 +0000
   Commit: 5b5f0e2, github.com/apache/spark/pull/5156

   [SQL][SPARK-6471]: Metastore schema should only be a subset of parquet schema to support dropping of columns using replace columns
   Yash Datta <Yash.Datta@guavus.com>
   2015-03-26 21:13:38 +0800
   Commit: 836c921, github.com/apache/spark/pull/5141

   [SPARK-6465][SQL] Fix serialization of GenericRowWithSchema using kryo
   Michael Armbrust <michael@databricks.com>
   2015-03-26 18:46:57 +0800
   Commit: 8254996, github.com/apache/spark/pull/5191

   [SPARK-6536] [PySpark] Column.inSet() in Python
   Davies Liu <davies@databricks.com>
   2015-03-26 00:01:24 -0700
   Commit: 0ba7599, github.com/apache/spark/pull/5190

   [SPARK-6463][SQL] AttributeSet.equal should compare size
   sisihj <jun.hejun@huawei.com>, Michael Armbrust <michael@databricks.com>
   2015-03-25 19:21:54 -0700
   Commit: 9edb34f, github.com/apache/spark/pull/5194

   [SPARK-6450] [SQL] Fixes metastore Parquet table conversion
   Cheng Lian <lian@databricks.com>
   2015-03-25 17:40:19 -0700
   Commit: 0cd4748, github.com/apache/spark/pull/5183

   [SPARK-6409][SQL] It is not necessary to avoid the old interface of Hive, because doing so makes some UDAFs unable to work.
   DoingDone9 <799203320@qq.com>
   2015-03-25 11:11:52 -0700
   Commit: 4efa6c5, github.com/apache/spark/pull/5131

   SPARK-6063 MLlib doesn't pass mvn scalastyle check due to UTF chars in LDAModel.scala
   Michael Griffiths <msjgriffiths@gmail.com>, Griffiths, Michael (NYC-RPM) <michael.griffiths@reprisemedia.com>
   2015-02-28 14:47:39 +0000
   Commit: 6791f42, github.com/apache/spark/pull/4815

   [SPARK-6496] [MLLIB] GeneralizedLinearAlgorithm.run(input, initialWeights) should initialize numFeatures
   Yanbo Liang <ybliang8@gmail.com>
   2015-03-25 17:05:56 +0000
   Commit: 2be4255, github.com/apache/spark/pull/5167

   [DOCUMENTATION]Fixed Missing Type Import in Documentation
   Bill Chambers <wchambers@ischool.berkeley.edu>, anabranch <wac.chambers@gmail.com>
   2015-03-24 22:24:35 -0700
   Commit: 8e4e2e3, github.com/apache/spark/pull/5179

   [SPARK-6469] Improving documentation on YARN local directories usage
   Christophe Préaud <christophe.preaud@kelkoo.com>
   2015-03-24 17:05:49 -0700
   Commit: 6af9408, github.com/apache/spark/pull/5165

   [SPARK-3570] Include time to open files in shuffle write time.
   Kay Ousterhout <kayousterhout@gmail.com>
   2015-03-24 16:29:40 -0700
   Commit: e4db5a3, github.com/apache/spark/pull/4550

   [SPARK-6088] Correct how tasks that get remote results are shown in UI.
   Kay Ousterhout <kayousterhout@gmail.com>
   2015-03-24 16:26:43 -0700
   Commit: de8b2d4, github.com/apache/spark/pull/4839

   [SPARK-6428][SQL] Added explicit types for all public methods in catalyst
   Reynold Xin <rxin@databricks.com>
   2015-03-24 16:03:55 -0700
   Commit: 586e0d9, github.com/apache/spark/pull/5162

   [SPARK-6209] Clean up connections in ExecutorClassLoader after failing to load classes (master branch PR)
   Josh Rosen <joshrosen@databricks.com>
   2015-03-24 14:38:20 -0700
   Commit: dcf56aa, github.com/apache/spark/pull/4944

   [SPARK-6458][SQL] Better error messages for invalid data sources
   Michael Armbrust <michael@databricks.com>
   2015-03-24 14:10:56 -0700
   Commit: f48c16d, github.com/apache/spark/pull/5158

   [SPARK-6376][SQL] Avoid eliminating subqueries until optimization
   Michael Armbrust <michael@databricks.com>
   2015-03-24 14:08:20 -0700
   Commit: df671bc, github.com/apache/spark/pull/5160

   [SPARK-6375][SQL] Fix formatting of error messages.
   Michael Armbrust <michael@databricks.com>
   2015-03-24 13:22:46 -0700
   Commit: 92bf888, github.com/apache/spark/pull/5155

   Revert "[SPARK-5680][SQL] Sum function on all null values, should return zero"
   Michael Armbrust <michael@databricks.com>
   2015-03-24 12:32:25 -0700
   Commit: 930b667

   [SPARK-6054][SQL] Fix transformations of TreeNodes that hold StructTypes
   Michael Armbrust <michael@databricks.com>
   2015-03-24 12:28:01 -0700
   Commit: c699e2b, github.com/apache/spark/pull/5157

   [SPARK-6437][SQL] Use completion iterator to close external sorter
   Michael Armbrust <michael@databricks.com>
   2015-03-24 12:10:30 -0700
   Commit: c0101d3, github.com/apache/spark/pull/5161

   [SPARK-6459][SQL] Warn when constructing trivially true equals predicate
   Michael Armbrust <michael@databricks.com>
   2015-03-24 12:09:02 -0700
   Commit: f0141ca, github.com/apache/spark/pull/5163

   [SPARK-5955][MLLIB] add checkpointInterval to ALS
   Xiangrui Meng <meng@databricks.com>
   2015-03-20 15:02:57 -0400
   Commit: bc92a2e, github.com/apache/spark/pull/5076

   [ML][docs][minor] Define LabeledDocument/Document classes in CV example
   Peter Rudenko <petro.rudenko@gmail.com>
   2015-03-24 16:33:38 +0000
   Commit: 4ff5771, github.com/apache/spark/pull/5135

   [SPARK-5559] [Streaming] [Test] Remove opportunity for flakiness when running FlumeStreamSuite
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-03-24 16:13:25 +0000
   Commit: 8722369, github.com/apache/spark/pull/4337

   Update the command to use IPython notebook
   Cong Yue <yuecong1104@gmail.com>
   2015-03-24 12:56:13 +0000
   Commit: e545143, github.com/apache/spark/pull/5111

   [SPARK-6452] [SQL] Checks for missing attributes and unresolved operator for all types of operator
   Cheng Lian <lian@databricks.com>
   2015-03-24 01:12:11 -0700
   Commit: 6f10142, github.com/apache/spark/pull/5129

   [SPARK-6124] Support jdbc connection properties in OPTIONS part of the query
   Volodymyr Lyubinets <vlyubin@gmail.com>
   2015-03-23 17:00:27 -0700
   Commit: 04b2078, github.com/apache/spark/pull/4859

   [SPARK-6397][SQL] Check the missingInput simply
   Yadong Qi <qiyadong2010@gmail.com>
   2015-03-23 18:16:49 +0800
   Commit: a29f493, github.com/apache/spark/pull/5132

   [SPARK-4985] [SQL] parquet support for date type
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-03-23 11:46:16 +0800
   Commit: 60b9b96, github.com/apache/spark/pull/3822

   [SPARK-6337][Documentation, SQL]Spark 1.3 doc fixes
   vinodkc <vinod.kc.in@gmail.com>
   2015-03-22 20:00:08 +0000
   Commit: 857e8a6, github.com/apache/spark/pull/5112

   SPARK-6454 [DOCS] Fix links to pyspark api
   Kamil Smuga <smugakamil@gmail.com>, stderr <smugakamil@gmail.com>
   2015-03-22 15:56:25 +0000
   Commit: 3ba295f, github.com/apache/spark/pull/5120

   [SPARK-6408] [SQL] Fix JDBCRDD filtering string literals
   ypcat <ypcat6@gmail.com>, Pei-Lun Lee <pllee@appier.com>
   2015-03-22 15:49:13 +0800
   Commit: e60fbf6, github.com/apache/spark/pull/5087

   [SPARK-6428][SQL] Added explicit type for all public methods for Hive module
   Reynold Xin <rxin@databricks.com>
   2015-03-21 14:30:04 -0700
   Commit: 0021d22, github.com/apache/spark/pull/5108

   [SPARK-6428][SQL] Added explicit type for all public methods in sql/core
   Reynold Xin <rxin@databricks.com>
   2015-03-20 15:47:07 -0700
   Commit: c964588, github.com/apache/spark/pull/5104

   [SPARK-6250][SPARK-6146][SPARK-5911][SQL] Types are now reserved words in DDL parser.
   Yin Huai <yhuai@databricks.com>
   2015-03-21 13:27:53 -0700
   Commit: 102daaf, github.com/apache/spark/pull/5078

   [SPARK-5680][SQL] Sum function on all null values, should return zero
   Venkata Ramana G <ramana.gollamudi@huawei.com>, Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
   2015-03-21 13:24:24 -0700
   Commit: 93975a3, github.com/apache/spark/pull/4466

   [SPARK-5320][SQL]Add statistics method at NoRelation (override super).
   x1- <viva008@gmail.com>
   2015-03-21 13:22:34 -0700
   Commit: cba6842, github.com/apache/spark/pull/5105

   [SPARK-5821] [SQL] JSON CTAS command should throw error message when delete path failure
   Yanbo Liang <ybliang8@gmail.com>, Yanbo Liang <yanbohappy@gmail.com>
   2015-03-21 11:23:28 +0800
   Commit: 8de90c7, github.com/apache/spark/pull/4610

   [SPARK-6315] [SQL] Also tries the case class string parser while reading Parquet schema
   Cheng Lian <lian@databricks.com>
   2015-03-21 11:18:45 +0800
   Commit: b75943f, github.com/apache/spark/pull/5034

   [SPARK-5821] [SQL] ParquetRelation2 CTAS should check if delete is successful
   Yanbo Liang <ybliang8@gmail.com>
   2015-03-21 10:53:04 +0800
   Commit: df83e21, github.com/apache/spark/pull/5107

   [SPARK-6421][MLLIB] _regression_train_wrapper does not test initialWeights correctly
   lewuathe <lewuathe@me.com>
   2015-03-20 17:18:18 -0400
   Commit: aff9f8d, github.com/apache/spark/pull/5101

   [SPARK-6286][Mesos][minor] Handle missing Mesos case TASK_ERROR
   Jongyoul Lee <jongyoul@gmail.com>
   2015-03-20 12:24:34 +0000
   Commit: db812d9, github.com/apache/spark/pull/5088

   [SPARK-6222][Streaming] Don't delete checkpoint data when doing pre-batch-start checkpoint
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-03-19 02:15:50 -0400
   Commit: 03e263f, github.com/apache/spark/pull/5008

   [SPARK-6325] [core,yarn] Do not change target executor count when killing executors.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-03-18 09:18:28 -0400
   Commit: 1723f05, github.com/apache/spark/pull/5018

   [SPARK-6286][minor] Handle missing Mesos case TASK_ERROR.
   Iulian Dragos <jaguarul@gmail.com>
   2015-03-18 09:15:33 -0400
   Commit: ff0a7f4, github.com/apache/spark/pull/5000

   [SPARK-6247][SQL] Fix resolution of ambiguous joins caused by new aliases
   Michael Armbrust <michael@databricks.com>
   2015-03-17 19:47:51 -0700
   Commit: ba8352c, github.com/apache/spark/pull/5062

   [SPARK-6383][SQL]Fixed compiler and errors in Dataframe examples
   Tijo Thomas <tijoparacka@gmail.com>
   2015-03-17 18:50:19 -0700
   Commit: cee6d08, github.com/apache/spark/pull/5068

   [SPARK-6366][SQL] In Python API, the default save mode for save and saveAsTable should be "error" instead of "append".
   Yin Huai <yhuai@databricks.com>
   2015-03-18 09:41:06 +0800
   Commit: 3ea38bc, github.com/apache/spark/pull/5053

   [SPARK-6330] [SQL] Add a test case for SPARK-6330
   Pei-Lun Lee <pllee@appier.com>
   2015-03-18 08:34:46 +0800
   Commit: 9d88f0c, github.com/apache/spark/pull/5039

   [SPARK-6336] LBFGS should document what convergenceTol means
   lewuathe <lewuathe@me.com>
   2015-03-17 12:11:57 -0700
   Commit: 476c4e1, github.com/apache/spark/pull/5033

   [SPARK-6365] jetty-security also needed for SPARK_PREPEND_CLASSES to work
   Imran Rashid <irashid@cloudera.com>
   2015-03-17 12:03:54 -0500
   Commit: ac0e7cc, github.com/apache/spark/pull/5071

   [SPARK-6313] Add config option to disable file locks/fetchFile cache to ...
   nemccarthy <nathan@nemccarthy.me>
   2015-03-17 09:33:11 -0700
   Commit: febb123, github.com/apache/spark/pull/5036

   [SPARK-3266] Use intermediate abstract classes to fix type erasure issues in Java APIs
   Josh Rosen <joshrosen@databricks.com>
   2015-03-17 09:18:57 -0700
   Commit: 29e39e1, github.com/apache/spark/pull/5050

   [SPARK-6331] Load new master URL if present when recovering streaming context from checkpoint
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-03-17 05:31:27 -0700
   Commit: 95f8d1c, github.com/apache/spark/pull/5024

   [SQL][docs][minor] Fixed sample code in SQLContext scaladoc
   Lomig Mégard <lomig.megard@gmail.com>
   2015-03-16 23:52:42 -0700
   Commit: 426816b, github.com/apache/spark/pull/5051

   [SPARK-6299][CORE] ClassNotFoundException in standalone mode when running groupByKey with class defined in REPL
   Kevin (Sangwoo) Kim <sangwookim.me@gmail.com>
   2015-03-16 23:49:23 -0700
   Commit: 5c16ced, github.com/apache/spark/pull/5046

   [SPARK-6077] Remove streaming tab while stopping StreamingContext
   lisurprise <zhichao.li@intel.com>
   2015-03-16 13:10:32 -0700
   Commit: 47cce98, github.com/apache/spark/pull/4828

   [SPARK-6330] Fix filesystem bug in newParquet relation
   Volodymyr Lyubinets <vlyubin@gmail.com>
   2015-03-16 12:13:18 -0700
   Commit: 67fa6d1, github.com/apache/spark/pull/5020

   SPARK-6245 [SQL] jsonRDD() of empty RDD results in exception
   Sean Owen <sowen@cloudera.com>
   2015-03-11 14:09:09 +0000
   Commit: 684ff24, github.com/apache/spark/pull/4971

   [SPARK-6300][Spark Core] sc.addFile(path) does not support the relative path.
   DoingDone9 <799203320@qq.com>
   2015-03-16 12:27:15 +0000
   Commit: 724aab4, github.com/apache/spark/pull/4993

   [SPARK-3619] Part 2. Upgrade to Mesos 0.21 to work around MESOS-1688
   Jongyoul Lee <jongyoul@gmail.com>
   2015-03-15 15:46:55 +0000
   Commit: 43fcab0, github.com/apache/spark/pull/4361

   [SPARK-6210] [SQL] use prettyString as column name in agg()
   Davies Liu <davies@databricks.com>
   2015-03-14 00:43:33 -0700
   Commit: ad47563, github.com/apache/spark/pull/5006

   [SPARK-6275][Documentation]Miss toDF() function in docs/sql-programming-guide.md
   zzcclp <xm_zzc@sina.com>
   2015-03-12 15:07:15 +0000
   Commit: 3012781, github.com/apache/spark/pull/4977

   [SPARK-6133] Make sc.stop() idempotent
   Andrew Or <andrew@databricks.com>
   2015-03-03 15:09:57 -0800
   Commit: a08588c, github.com/apache/spark/pull/4871

   [SPARK-6132][HOTFIX] ContextCleaner InterruptedException should be quiet
   Andrew Or <andrew@databricks.com>
   2015-03-03 20:49:45 -0800
   Commit: 338bea7, github.com/apache/spark/pull/4882

   [SPARK-6132] ContextCleaner race condition across SparkContexts
   Andrew Or <andrew@databricks.com>
   2015-03-03 13:44:05 -0800
   Commit: 3cdc8a3, github.com/apache/spark/pull/4869

   [SPARK-6087][CORE] Provide actionable exception if Kryo buffer is not large enough
   Lev Khomich <levkhomich@gmail.com>
   2015-03-10 10:55:42 +0000
   Commit: 9846790, github.com/apache/spark/pull/4947

   [SPARK-6036][CORE] avoid race condition between eventlogListener and akka actor system
   Zhang, Liye <liye.zhang@intel.com>
   2015-02-26 23:11:43 -0800
   Commit: f81611d, github.com/apache/spark/pull/4785

   SPARK-4044 [CORE] Thriftserver fails to start when JAVA_HOME points to JRE instead of JDK
   Sean Owen <sowen@cloudera.com>
   2015-03-13 17:59:31 +0000
   Commit: 4aa4132, github.com/apache/spark/pull/4981

   SPARK-4300 [CORE] Race condition during SparkWorker shutdown
   Sean Owen <sowen@cloudera.com>
   2015-02-26 14:08:56 -0800
   Commit: a3493eb, github.com/apache/spark/pull/4787

   [SPARK-6194] [SPARK-677] [PySpark] fix memory leak in collect()
   Davies Liu <davies@databricks.com>
   2015-03-09 16:24:06 -0700
   Commit: 170af49, github.com/apache/spark/pull/4923

   SPARK-4704 [CORE] SparkSubmitDriverBootstrap doesn't flush output
   Sean Owen <sowen@cloudera.com>
   2015-02-26 12:56:54 -0800
   Commit: dbee7e1, github.com/apache/spark/pull/4788

   [SPARK-6278][MLLIB] Mention the change of objective in linear regression
   Xiangrui Meng <meng@databricks.com>
   2015-03-13 10:27:28 -0700
   Commit: 214f681, github.com/apache/spark/pull/4978

   [SPARK-5310] [SQL] [DOC] Parquet section for the SQL programming guide
   Cheng Lian <lian@databricks.com>
   2015-03-13 21:34:50 +0800
   Commit: dc287f3, github.com/apache/spark/pull/5001

   [mllib] [python] Add LassoModel to __all__ in regression.py
   Joseph K. Bradley <joseph@databricks.com>
   2015-03-12 16:46:29 -0700
   Commit: 23069bd, github.com/apache/spark/pull/4970

   [SPARK-6294] fix hang when call take() in JVM on PythonRDD
   Davies Liu <davies@databricks.com>
   2015-03-12 01:34:38 -0700
   Commit: 850e694, github.com/apache/spark/pull/4987

   [SPARK-6296] [SQL] Added equals to Column
   Volodymyr Lyubinets <vlyubin@gmail.com>
   2015-03-12 00:55:26 -0700
   Commit: d9e141c, github.com/apache/spark/pull/4988

   [SPARK-6128][Streaming][Documentation] Updates to Spark Streaming Programming Guide
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-03-11 18:48:21 -0700
   Commit: bdc4682, github.com/apache/spark/pull/4956

   [SPARK-6274][Streaming][Examples] Added streaming + sql examples.
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-03-11 11:19:51 -0700
   Commit: ac61466, github.com/apache/spark/pull/4975

   [SPARK-5183][SQL] Update SQL Docs with JDBC and Migration Guide
   Michael Armbrust <michael@databricks.com>
   2015-03-10 18:13:09 -0700
   Commit: edbcb6f, github.com/apache/spark/pull/4958

   Minor doc: Remove the extra blank line in data types javadoc.
   Reynold Xin <rxin@databricks.com>
   2015-03-10 17:25:04 -0700
   Commit: 7295192, github.com/apache/spark/pull/4955

   [SPARK-5310][Doc] Update SQL Programming Guide to include DataFrames.
   Reynold Xin <rxin@databricks.com>
   2015-03-09 16:16:16 -0700
   Commit: bc53d3d, github.com/apache/spark/pull/4954

   [Docs] Replace references to SchemaRDD with DataFrame
   Reynold Xin <rxin@databricks.com>
   2015-03-09 13:29:19 -0700
   Commit: 5e58f76, github.com/apache/spark/pull/4952

   Preparing development version 1.3.1-SNAPSHOT
   Patrick Wendell <patrick@databricks.com>
   2015-03-05 23:02:08 +0000
   Commit: c152f9a


 Release 1.3.0

   [SQL] Make Strategies a public developer API
   Michael Armbrust <michael@databricks.com>
   2015-03-05 14:50:25 -0800
   Commit: 556e0de, github.com/apache/spark/pull/4920

   [SPARK-6163][SQL] jsonFile should be backed by the data source API
   Yin Huai <yhuai@databricks.com>
   2015-03-05 14:49:44 -0800
   Commit: 083fed5, github.com/apache/spark/pull/4896

   [SPARK-6145][SQL] fix ORDER BY on nested fields
   Wenchen Fan <cloud0fan@outlook.com>, Michael Armbrust <michael@databricks.com>
   2015-03-05 14:49:01 -0800
   Commit: e358f55, github.com/apache/spark/pull/4918

   [SPARK-6175] Fix standalone executor log links when ephemeral ports or SPARK_PUBLIC_DNS are used
   Josh Rosen <joshrosen@databricks.com>
   2015-03-05 12:04:00 -0800
   Commit: 988b498, github.com/apache/spark/pull/4903

   SPARK-6182 [BUILD] spark-parent pom needs to be published for both 2.10 and 2.11
   Sean Owen <sowen@cloudera.com>
   2015-03-05 11:31:48 -0800
   Commit: ae315d2, github.com/apache/spark/pull/4912

   Revert "[SPARK-6153] [SQL] promote guava dep for hive-thriftserver"
   Cheng Lian <lian@databricks.com>
   2015-03-05 17:58:18 +0800
   Commit: f8205d3

   [SPARK-6153] [SQL] promote guava dep for hive-thriftserver
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-03-05 16:35:17 +0800
   Commit: b92d925, github.com/apache/spark/pull/4884

   Updating CHANGES file
   Patrick Wendell <patrick@databricks.com>
   2015-03-04 21:19:49 -0800
   Commit: 87eac3c

   SPARK-5143 [BUILD] [WIP] spark-network-yarn 2.11 depends on spark-network-shuffle 2.10
   Sean Owen <sowen@cloudera.com>
   2015-03-04 21:00:51 -0800
   Commit: f509159, github.com/apache/spark/pull/4876

   [SPARK-6149] [SQL] [Build] Excludes Guava 15 referenced by jackson-module-scala_2.10
   Cheng Lian <lian@databricks.com>
   2015-03-04 20:52:58 -0800
   Commit: a0aa24a, github.com/apache/spark/pull/4890

   [SPARK-6144] [core] Fix addFile when source files are on "hdfs:"
   Marcelo Vanzin <vanzin@cloudera.com>, trystanleftwich <trystan@atscale.com>
   2015-03-04 12:58:39 -0800
   Commit: 3fc74f4, github.com/apache/spark/pull/4894

   [SPARK-6134][SQL] Fix wrong datatype for casting FloatType and default LongType value in defaultPrimitive
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-03-04 20:23:43 +0800
   Commit: bfa4e31, github.com/apache/spark/pull/4870

   [SPARK-6136] [SQL] Removed JDBC integration tests which depends on docker-client
   Cheng Lian <lian@databricks.com>
   2015-03-04 19:39:02 +0800
   Commit: 035243d, github.com/apache/spark/pull/4872

   [SPARK-6141][MLlib] Upgrade Breeze from 0.10 to 0.11 to fix convergence bug
   Xiangrui Meng <meng@databricks.com>, DB Tsai <dbtsai@alpinenow.com>, DB Tsai <dbtsai@dbtsai.com>
   2015-03-03 23:52:02 -0800
   Commit: 9f24977, github.com/apache/spark/pull/4879

   [SPARK-5949] HighlyCompressedMapStatus needs more classes registered w/ kryo
   Imran Rashid <irashid@cloudera.com>
   2015-03-03 15:33:19 -0800
   Commit: 9a0b75c, github.com/apache/spark/pull/4877

   SPARK-1911 [DOCS] Warn users if their assembly jars are not built with Java 6
   Sean Owen <sowen@cloudera.com>
   2015-03-03 13:40:11 -0800
   Commit: 8446ad0, github.com/apache/spark/pull/4874

   Revert "[SPARK-5423][Core] Cleanup resources in DiskMapIterator.finalize to ensure deleting the temp file"
   Andrew Or <andrew@databricks.com>
   2015-03-03 13:04:15 -0800
   Commit: ee4929d

   Adding CHANGES.txt for Spark 1.3
   Patrick Wendell <patrick@databricks.com>
   2015-03-03 02:19:19 -0800
   Commit: ce7158c

   BUILD: Minor tweaks to internal build scripts
   Patrick Wendell <patrick@databricks.com>
   2015-03-03 00:38:12 -0800
   Commit: ae60eb9

   HOTFIX: Bump HBase version in MapR profiles.
   Patrick Wendell <patrick@databricks.com>
   2015-03-03 01:38:07 -0800
   Commit: 1aa8461

   [SPARK-5537][MLlib][Docs] Add user guide for multinomial logistic regression
   DB Tsai <dbtsai@alpinenow.com>
   2015-03-02 22:37:12 -0800
   Commit: 841d2a2, github.com/apache/spark/pull/4866

   [SPARK-6120] [mllib] Warnings about memory in tree, ensemble model save
   Joseph K. Bradley <joseph@databricks.com>
   2015-03-02 22:33:51 -0800
   Commit: 81648a7, github.com/apache/spark/pull/4864

   [SPARK-6097][MLLIB] Support tree model save/load in PySpark/MLlib
   Xiangrui Meng <meng@databricks.com>
   2015-03-02 22:27:01 -0800
   Commit: 62c53be, github.com/apache/spark/pull/4854

   [SPARK-5310][SQL] Fixes to Docs and Datasources API
   Reynold Xin <rxin@databricks.com>, Michael Armbrust <michael@databricks.com>
   2015-03-02 22:14:08 -0800
   Commit: 4e6e008, github.com/apache/spark/pull/4868

   [SPARK-5950][SQL]Insert array into a metastore table saved as parquet should work when using datasource api
   Yin Huai <yhuai@databricks.com>
   2015-03-02 19:31:55 -0800
   Commit: 1b490e9, github.com/apache/spark/pull/4826

   [SPARK-6127][Streaming][Docs] Add Kafka to Python api docs
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-03-02 18:40:46 -0800
   Commit: ffd0591, github.com/apache/spark/pull/4860

   [SPARK-5537] Add user guide for multinomial logistic regression
   Xiangrui Meng <meng@databricks.com>, DB Tsai <dbtsai@alpinenow.com>
   2015-03-02 18:10:50 -0800
   Commit: 11389f0, github.com/apache/spark/pull/4801

   [SPARK-6121][SQL][MLLIB] simpleString for UDT
   Xiangrui Meng <meng@databricks.com>
   2015-03-02 17:14:34 -0800
   Commit: 1b8ab57, github.com/apache/spark/pull/4858

   [SPARK-6048] SparkConf should not translate deprecated configs on set
   Andrew Or <andrew@databricks.com>
   2015-03-02 16:36:42 -0800
   Commit: ea69cf2, github.com/apache/spark/pull/4799

   [SPARK-6066] Make event log format easier to parse
   Andrew Or <andrew@databricks.com>
   2015-03-02 16:34:32 -0800
   Commit: 8100b79, github.com/apache/spark/pull/4821

   [SPARK-6082] [SQL] Provides better error message for malformed rows when caching tables
   Cheng Lian <lian@databricks.com>
   2015-03-02 16:18:00 -0800
   Commit: 866f281, github.com/apache/spark/pull/4842

   [SPARK-6114][SQL] Avoid metastore conversions before plan is resolved
   Michael Armbrust <michael@databricks.com>
   2015-03-02 16:10:54 -0800
   Commit: 3899c7c, github.com/apache/spark/pull/4855

   [SPARK-6050] [yarn] Relax matching of vcore count in received containers.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-03-02 16:41:43 -0600
   Commit: 650d1e7, github.com/apache/spark/pull/4818

   [SPARK-6040][SQL] Fix the percent bug in tablesample
   q00251598 <qiyadong@huawei.com>
   2015-03-02 13:16:29 -0800
   Commit: a83b9bb, github.com/apache/spark/pull/4789

   [Minor] Fix doc typo for describing primitiveTerm effectiveness condition
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-03-02 13:11:17 -0800
   Commit: f92876a, github.com/apache/spark/pull/4762

   SPARK-5390 [DOCS] Encourage users to post on Stack Overflow in Community Docs
   Sean Owen <sowen@cloudera.com>
   2015-03-02 21:10:08 +0000
   Commit: 58e7198, github.com/apache/spark/pull/4843

   [DOCS] Refactored Dataframe join comment to use correct parameter ordering
   Paul Power <paul.power@peerside.com>
   2015-03-02 13:08:47 -0800
   Commit: 54ac243, github.com/apache/spark/pull/4847

   [SPARK-6080] [PySpark] correct LogisticRegressionWithLBFGS regType parameter for pyspark
   Yanbo Liang <ybliang8@gmail.com>
   2015-03-02 10:17:24 -0800
   Commit: 4ffaf85, github.com/apache/spark/pull/4831

   [SPARK-5741][SQL] Support the path contains comma in HiveContext
   q00251598 <qiyadong@huawei.com>
   2015-03-02 10:13:11 -0800
   Commit: f476108, github.com/apache/spark/pull/4532

   [SPARK-6111] Fixed usage string in documentation.
   Kenneth Myers <myerske@us.ibm.com>
   2015-03-02 17:25:24 +0000
   Commit: b2b7f01, github.com/apache/spark/pull/4852

   [SPARK-6052][SQL]In JSON schema inference, we should always set containsNull of an ArrayType to true
   Yin Huai <yhuai@databricks.com>
   2015-03-02 23:18:07 +0800
   Commit: a3fef2c, github.com/apache/spark/pull/4806

   [SPARK-6073][SQL] Need to refresh metastore cache after append data in CreateMetastoreDataSourceAsSelect
   Yin Huai <yhuai@databricks.com>
   2015-03-02 22:42:18 +0800
   Commit: c59871c, github.com/apache/spark/pull/4824

   [Streaming][Minor]Fix some error docs in streaming examples
   Saisai Shao <saisai.shao@intel.com>
   2015-03-02 08:49:19 +0000
   Commit: 1fe677a, github.com/apache/spark/pull/4837

   [SPARK-6083] [MLLib] [DOC] Make Python API example consistent in NaiveBayes
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-03-01 16:28:15 -0800
   Commit: 6a2fc85, github.com/apache/spark/pull/4834

   [SPARK-6053][MLLIB] support save/load in PySpark's ALS
   Xiangrui Meng <meng@databricks.com>
   2015-03-01 16:26:57 -0800
   Commit: b570d98, github.com/apache/spark/pull/4811

   [SPARK-6074] [sql] Package pyspark sql bindings.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-03-01 11:05:10 +0000
   Commit: bb16618, github.com/apache/spark/pull/4822

   SPARK-5984: Fix TimSort bug that causes ArrayOutOfBoundsException
   Evan Yu <ehotou@gmail.com>
   2015-02-28 18:55:34 -0800
   Commit: 317694c, github.com/apache/spark/pull/4804

   [SPARK-5775] [SQL] BugFix: GenericRow cannot be cast to SpecificMutableRow when nested data and partitioned table
   Cheng Lian <lian@databricks.com>, Cheng Lian <liancheng@users.noreply.github.com>, Yin Huai <yhuai@databricks.com>
   2015-02-28 21:15:43 +0800
   Commit: aa39460, github.com/apache/spark/pull/4792

   [SPARK-5979][SPARK-6032] Smaller safer --packages fix
   Burak Yavuz <brkyvz@gmail.com>
   2015-02-27 22:59:35 -0800
   Commit: 5a55c96, github.com/apache/spark/pull/4802

   [SPARK-6070] [yarn] Remove unneeded classes from shuffle service jar.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-02-27 22:44:11 -0800
   Commit: 1747e0a, github.com/apache/spark/pull/4820

   [SPARK-6055] [PySpark] fix incorrect __eq__ of DataType
   Davies Liu <davies@databricks.com>
   2015-02-27 20:07:17 -0800
   Commit: 49f2187, github.com/apache/spark/pull/4808

   [SPARK-5751] [SQL] Sets SPARK_HOME as SPARK_PID_DIR when running Thrift server test suites
   Cheng Lian <lian@databricks.com>
   2015-02-28 08:41:49 +0800
   Commit: 5d19cf0, github.com/apache/spark/pull/4758

   [Streaming][Minor] Remove useless type signature of Java Kafka direct stream API
   Saisai Shao <saisai.shao@intel.com>
   2015-02-27 13:01:42 -0800
   Commit: ceebe3c, github.com/apache/spark/pull/4817

   [SPARK-4587] [mllib] [docs] Fixed save,load calls in ML guide examples
   Joseph K. Bradley <joseph@databricks.com>
   2015-02-27 13:00:36 -0800
   Commit: 117e10c, github.com/apache/spark/pull/4816

   [SPARK-6058][Yarn] Log the user class exception in ApplicationMaster
   zsxwing <zsxwing@gmail.com>
   2015-02-27 13:31:46 +0000
   Commit: bff8088, github.com/apache/spark/pull/4813

   fix spark-6033, clarify the spark.worker.cleanup behavior in standalone mode
   许鹏 <peng.xu@fraudmetrix.cn>
   2015-02-26 23:05:56 -0800
   Commit: b8db84c, github.com/apache/spark/pull/4803

   SPARK-2168 [Spark core] Use relative URIs for the app links in the History Server.
   Lukasz Jastrzebski <lukasz.jastrzebski@gmail.com>
   2015-02-26 22:38:06 -0800
   Commit: 485b919, github.com/apache/spark/pull/4778

   [SPARK-6024][SQL] When a data source table has too many columns, its schema cannot be stored in metastore.
   Yin Huai <yhuai@databricks.com>
   2015-02-26 20:46:05 -0800
   Commit: 6200f07, github.com/apache/spark/pull/4795

   [SPARK-6037][SQL] Avoiding duplicate Parquet schema merging
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-27 11:06:47 +0800
   Commit: 25a109e, github.com/apache/spark/pull/4786

   SPARK-4579 [WEBUI] Scheduling Delay appears negative
   Sean Owen <sowen@cloudera.com>
   2015-02-26 17:35:09 -0800
   Commit: b83a93e, github.com/apache/spark/pull/4796

   [SPARK-5951][YARN] Remove unreachable driver memory properties in yarn client mode
   mohit.goyal <mohit.goyal@guavus.com>
   2015-02-26 14:27:47 -0800
   Commit: 5b426cb, github.com/apache/spark/pull/4730

   Add a note for context termination for History server on Yarn
   moussa taifi <moutai10@gmail.com>
   2015-02-26 14:19:43 -0800
   Commit: 297c3ef, github.com/apache/spark/pull/4721

   [SPARK-6018] [YARN] NoSuchMethodError in Spark app is swallowed by YARN AM
   Cheolsoo Park <cheolsoop@netflix.com>
   2015-02-26 13:53:49 -0800
   Commit: fe79674, github.com/apache/spark/pull/4773

   [SPARK-6027][SPARK-5546] Fixed --jar and --packages not working for KafkaUtils and improved error message
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-02-26 13:46:07 -0800
   Commit: 731a997, github.com/apache/spark/pull/4779

   Modify default value description for spark.scheduler.minRegisteredResourcesRatio on docs.
   Li Zhihui <zhihui.li@intel.com>
   2015-02-26 13:07:07 -0800
   Commit: 62652dc, github.com/apache/spark/pull/4781

   [SPARK-5363] Fix bug in PythonRDD: remove() inside iterator is not safe
   Davies Liu <davies@databricks.com>
   2015-02-26 11:54:17 -0800
   Commit: 5d309ad, github.com/apache/spark/pull/4776

   [SPARK-6015] fix links to source code in Python API docs
   Davies Liu <davies@databricks.com>
   2015-02-26 10:45:29 -0800
   Commit: dafb3d2, github.com/apache/spark/pull/4772

   [SPARK-6007][SQL] Add numRows param in DataFrame.show()
   Jacky Li <jacky.likun@huawei.com>
   2015-02-26 10:40:58 -0800
   Commit: 7c779d8, github.com/apache/spark/pull/4767

   [SPARK-6016][SQL] Cannot read the parquet table after overwriting the existing table when spark.sql.parquet.cacheMetadata=true
   Yin Huai <yhuai@databricks.com>
   2015-02-27 01:01:32 +0800
   Commit: b5c5e93, github.com/apache/spark/pull/4775

   [SPARK-6023][SQL] ParquetConversions fails to replace the destination MetastoreRelation of an InsertIntoTable node to ParquetRelation2
   Yin Huai <yhuai@databricks.com>
   2015-02-26 22:39:49 +0800
   Commit: e0f5fb0, github.com/apache/spark/pull/4782

   [SPARK-5976][MLLIB] Add partitioner to factors returned by ALS
   Xiangrui Meng <meng@databricks.com>
   2015-02-25 23:43:29 -0800
   Commit: a51d9db, github.com/apache/spark/pull/4748

   [SPARK-1182][Docs] Sort the configuration parameters in configuration.md
   Brennon York <brennon.york@capitalone.com>
   2015-02-25 16:12:56 -0800
   Commit: 56fa38a, github.com/apache/spark/pull/3863

   [SPARK-5724] fix the misconfiguration in AkkaUtils
   CodingCat <zhunansjtu@gmail.com>
   2015-02-23 11:29:25 +0000
   Commit: b32a653, github.com/apache/spark/pull/4512

   [SPARK-5974] [SPARK-5980] [mllib] [python] [docs] Update ML guide with save/load, Python GBT
   Joseph K. Bradley <joseph@databricks.com>
   2015-02-25 16:13:17 -0800
   Commit: a1b4856, github.com/apache/spark/pull/4750

   [SPARK-5926] [SQL] make DataFrame.explain leverage queryExecution.logical
   Yanbo Liang <ybliang8@gmail.com>
   2015-02-25 15:37:13 -0800
   Commit: 5bd4b49, github.com/apache/spark/pull/4707

   [SPARK-5999][SQL] Remove duplicate Literal matching block
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-25 15:22:33 -0800
   Commit: 6fff9b8, github.com/apache/spark/pull/4760

   [SPARK-6010] [SQL] Merging compatible Parquet schemas before computing splits
   Cheng Lian <lian@databricks.com>
   2015-02-25 15:15:22 -0800
   Commit: 016f1f8, github.com/apache/spark/pull/4768

   [SPARK-5944] [PySpark] fix version in Python API docs
   Davies Liu <davies@databricks.com>
   2015-02-25 15:13:34 -0800
   Commit: 9aca3c6, github.com/apache/spark/pull/4731

   [SPARK-5982] Remove incorrect Local Read Time Metric
   Kay Ousterhout <kayousterhout@gmail.com>
   2015-02-25 14:55:24 -0800
   Commit: 791df93, github.com/apache/spark/pull/4749

   [SPARK-1955][GraphX]: VertexRDD can incorrectly assume index sharing
   Brennon York <brennon.york@capitalone.com>
   2015-02-25 14:11:12 -0800
   Commit: 8073767, github.com/apache/spark/pull/4705

   SPARK-5930 [DOCS] Documented default of spark.shuffle.io.retryWait is confusing
   Sean Owen <sowen@cloudera.com>
   2015-02-25 12:20:44 -0800
   Commit: eaffc6e, github.com/apache/spark/pull/4769

   [SPARK-5996][SQL] Fix specialized outbound conversions
   Michael Armbrust <michael@databricks.com>
   2015-02-25 10:13:40 -0800
   Commit: fada683, github.com/apache/spark/pull/4757

   [SPARK-5994] [SQL] Python DataFrame documentation fixes
   Davies Liu <davies@databricks.com>
   2015-02-24 20:51:55 -0800
   Commit: 5c421e0, github.com/apache/spark/pull/4756

   [SPARK-5286][SQL] SPARK-5286 followup
   Yin Huai <yhuai@databricks.com>
   2015-02-24 19:51:36 -0800
   Commit: e7a748e, github.com/apache/spark/pull/4755

   [SPARK-5993][Streaming][Build] Fix assembly jar location of kafka-assembly
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-02-24 19:10:37 -0800
   Commit: 1e94894, github.com/apache/spark/pull/4753

   [SPARK-5985][SQL] DataFrame sortBy -> orderBy in Python.
   Reynold Xin <rxin@databricks.com>
   2015-02-24 18:59:23 -0800
   Commit: 5e233b2, github.com/apache/spark/pull/4752

   [SPARK-5904][SQL] DataFrame Java API test suites.
   Reynold Xin <rxin@databricks.com>
   2015-02-24 18:51:41 -0800
   Commit: 78a1781, github.com/apache/spark/pull/4751

   [SPARK-5751] [SQL] [WIP] Revamped HiveThriftServer2Suite for robustness
   Cheng Lian <lian@databricks.com>
   2015-02-25 08:34:55 +0800
   Commit: 17ee246, github.com/apache/spark/pull/4720

   [SPARK-5973] [PySpark] fix zip with two RDDs with AutoBatchedSerializer
   Davies Liu <davies@databricks.com>
   2015-02-24 14:50:00 -0800
   Commit: 91bf0f8, github.com/apache/spark/pull/4745

   [SPARK-5952][SQL] Lock when using hive metastore client
   Michael Armbrust <michael@databricks.com>
   2015-02-24 13:39:29 -0800
   Commit: 641423d, github.com/apache/spark/pull/4746

   [MLLIB] Change x_i to y_i in Variance's user guide
   Xiangrui Meng <meng@databricks.com>
   2015-02-24 11:38:59 -0800
   Commit: a4ff445, github.com/apache/spark/pull/4740

   [SPARK-5965] Standalone Worker UI displays {{USER_JAR}}
   Andrew Or <andrew@databricks.com>
   2015-02-24 11:08:07 -0800
   Commit: eaf7bf9, github.com/apache/spark/pull/4739

   [SPARK-5967] [UI] Correctly clean JobProgressListener.stageIdToActiveJobIds
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-02-24 11:02:47 -0800
   Commit: 28dd53b, github.com/apache/spark/pull/4741

   [SPARK-5532][SQL] Repartition should not use external rdd representation
   Michael Armbrust <michael@databricks.com>
   2015-02-24 10:52:18 -0800
   Commit: e46096b, github.com/apache/spark/pull/4738

   [SPARK-5910][SQL] Support for as in selectExpr
   Michael Armbrust <michael@databricks.com>
   2015-02-24 10:49:51 -0800
   Commit: ba5d60d, github.com/apache/spark/pull/4736

   [SPARK-5968] [SQL] Suppresses ParquetOutputCommitter WARN logs
   Cheng Lian <lian@databricks.com>
   2015-02-24 10:45:38 -0800
   Commit: 2b562b0, github.com/apache/spark/pull/4744

   [SPARK-5958][MLLIB][DOC] update block matrix user guide
   Xiangrui Meng <meng@databricks.com>
   2015-02-23 22:08:44 -0800
   Commit: dd42558, github.com/apache/spark/pull/4737

   [SPARK-5873][SQL] Allow viewing of partially analyzed plans in queryExecution
   Michael Armbrust <michael@databricks.com>
   2015-02-23 17:34:54 -0800
   Commit: 2d7786e, github.com/apache/spark/pull/4684

   [SPARK-5935][SQL] Accept MapType in the schema provided to a JSON dataset.
   Yin Huai <yhuai@databricks.com>, Yin Huai <huai@cse.ohio-state.edu>
   2015-02-23 17:16:34 -0800
   Commit: 33ccad2, github.com/apache/spark/pull/4710

   [SPARK-5912] [docs] [mllib] Small fixes to ChiSqSelector docs
   Joseph K. Bradley <joseph@databricks.com>
   2015-02-23 16:15:57 -0800
   Commit: ae97040, github.com/apache/spark/pull/4732

   [MLLIB] SPARK-5912 Programming guide for feature selection
   Alexander Ulanov <nashb@yandex.ru>
   2015-02-23 12:09:40 -0800
   Commit: 8355773, github.com/apache/spark/pull/4709

   [SPARK-5939][MLLib] make FPGrowth example app take parameters
   Jacky Li <jacky.likun@huawei.com>
   2015-02-23 08:47:28 -0800
   Commit: 33b9084, github.com/apache/spark/pull/4714

   [SPARK-5943][Streaming] Update the test to use new API to reduce the warning
   Saisai Shao <saisai.shao@intel.com>
   2015-02-23 11:27:27 +0000
   Commit: 67b7f79, github.com/apache/spark/pull/4722

   [EXAMPLES] fix typo.
   Makoto Fukuhara <fukuo33@gmail.com>
   2015-02-23 09:24:33 +0000
   Commit: f172387, github.com/apache/spark/pull/4724

   Revert "[SPARK-4808] Removing minimum number of elements read before spill check"
   Andrew Or <andrew@databricks.com>
   2015-02-22 09:44:52 -0800
   Commit: 4186dd3

   SPARK-5669 [BUILD] Reverse exclusion of JBLAS libs for 1.3
   Sean Owen <sowen@cloudera.com>
   2015-02-22 09:09:06 +0000
   Commit: eed7389, github.com/apache/spark/pull/4715

   [DataFrame] [Typo] Fix the typo
   Cheng Hao <hao.cheng@intel.com>
   2015-02-22 08:56:30 +0000
   Commit: 04d3b32, github.com/apache/spark/pull/4717

   [DOCS] Fix typo in API for custom InputFormats based on the “new” MapReduce API
   Alexander <abezzubov@nflabs.com>
   2015-02-22 08:53:05 +0000
   Commit: c5a5c6f, github.com/apache/spark/pull/4718

   [SPARK-5937][YARN] Fix ClientSuite to set YARN mode, so that the correct class is used in t...
   Hari Shreedharan <hshreedharan@apache.org>
   2015-02-21 10:01:01 -0800
   Commit: 76e3e65, github.com/apache/spark/pull/4711

   SPARK-5841 [CORE] [HOTFIX 2] Memory leak in DiskBlockManager
   Nishkam Ravi <nravi@cloudera.com>, nishkamravi2 <nishkamravi@gmail.com>, nravi <nravi@c1704.halxg.cloudera.com>
   2015-02-21 09:59:28 -0800
   Commit: 932338e, github.com/apache/spark/pull/4690

   [SPARK-5909][SQL] Add a clearCache command to Spark SQL's cache manager
   Yin Huai <yhuai@databricks.com>
   2015-02-20 16:20:02 +0800
   Commit: b9a6c5c, github.com/apache/spark/pull/4694

   [SPARK-5898] [SPARK-5896] [SQL] [PySpark] create DataFrame from pandas and tuple/list
   Davies Liu <davies@databricks.com>
   2015-02-20 15:35:05 -0800
   Commit: 913562a, github.com/apache/spark/pull/4679

   [SPARK-5867] [SPARK-5892] [doc] [ml] [mllib] Doc cleanups for 1.3 release
   Joseph K. Bradley <joseph@databricks.com>
   2015-02-20 02:31:32 -0800
   Commit: 8c12f31, github.com/apache/spark/pull/4675

   [SPARK-4808] Removing minimum number of elements read before spill check
   mcheah <mcheah@palantir.com>
   2015-02-19 18:09:22 -0800
   Commit: 0382dcc, github.com/apache/spark/pull/4420

   [SPARK-5900][MLLIB] make PIC and FPGrowth Java-friendly
   Xiangrui Meng <meng@databricks.com>
   2015-02-19 18:06:16 -0800
   Commit: ba941ce, github.com/apache/spark/pull/4695

   SPARK-5570: No docs stating that `new SparkConf().set("spark.driver.memory", ...)` will not work
   Ilya Ganelin <ilya.ganelin@capitalone.com>
   2015-02-19 15:50:58 -0800
   Commit: c5f3b9e, github.com/apache/spark/pull/4665

   SPARK-4682 [CORE] Consolidate various 'Clock' classes
   Sean Owen <sowen@cloudera.com>
   2015-02-19 15:35:23 -0800
   Commit: bd49e8b, github.com/apache/spark/pull/4514

   [SPARK-5889] Remove pid file after stopping service.
   Zhan Zhang <zhazhan@gmail.com>
   2015-02-19 23:13:02 +0000
   Commit: ff8976e, github.com/apache/spark/pull/4676

   [SPARK-5902] [ml] Made PipelineStage.transformSchema public instead of private to ml
   Joseph K. Bradley <joseph@databricks.com>
   2015-02-19 12:46:27 -0800
   Commit: 0c494cf, github.com/apache/spark/pull/4682

   [SPARK-5904][SQL] DataFrame API fixes.
   Reynold Xin <rxin@databricks.com>
   2015-02-19 12:09:44 -0800
   Commit: 55d91d9, github.com/apache/spark/pull/4686

   [SPARK-5825] [Spark Submit] Remove the double checking instance name when stopping the service
   Cheng Hao <hao.cheng@intel.com>
   2015-02-19 12:07:51 -0800
   Commit: fe00eb6, github.com/apache/spark/pull/4611

   [SPARK-5423][Core] Cleanup resources in DiskMapIterator.finalize to ensure deleting the temp file
   zsxwing <zsxwing@gmail.com>
   2015-02-19 18:37:31 +0000
   Commit: 25fae8e, github.com/apache/spark/pull/4219

   [SPARK-5816] Add huge compatibility warning in DriverWrapper
   Andrew Or <andrew@databricks.com>
   2015-02-19 09:56:25 -0800
   Commit: f93d4d9, github.com/apache/spark/pull/4687

   SPARK-5548: Fix for AkkaUtilsSuite failure - attempt 2
   Jacek Lewandowski <lewandowski.jacek@gmail.com>
   2015-02-19 09:53:36 -0800
   Commit: fbcb949, github.com/apache/spark/pull/4653

   [SPARK-5846] Correctly set job description and pool for SQL jobs
   Kay Ousterhout <kayousterhout@gmail.com>
   2015-02-19 09:49:34 +0800
   Commit: 092b45f, github.com/apache/spark/pull/4630

   [SPARK-5879][MLLIB] update PIC user guide and add a Java example
   Xiangrui Meng <meng@databricks.com>
   2015-02-18 16:29:32 -0800
   Commit: a64f374, github.com/apache/spark/pull/4680

   [SPARK-5722] [SQL] [PySpark] infer int as LongType
   Davies Liu <davies@databricks.com>
   2015-02-18 14:17:04 -0800
   Commit: 470cba8, github.com/apache/spark/pull/4666

   [SPARK-5840][SQL] HiveContext cannot be serialized due to tuple extraction
   Reynold Xin <rxin@databricks.com>
   2015-02-18 14:02:32 -0800
   Commit: b86e44c, github.com/apache/spark/pull/4628

   [SPARK-5507] Added documentation for BlockMatrix
   Burak Yavuz <brkyvz@gmail.com>
   2015-02-18 10:11:08 -0800
   Commit: 56f8f29, github.com/apache/spark/pull/4664

   [SPARK-5519][MLLIB] add user guide with example code for fp-growth
   Xiangrui Meng <meng@databricks.com>
   2015-02-18 10:09:56 -0800
   Commit: 661fbd3, github.com/apache/spark/pull/4661

   SPARK-5669 [BUILD] [HOTFIX] Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS
   Sean Owen <sowen@cloudera.com>
   2015-02-18 14:41:44 +0000
   Commit: 9f256ce, github.com/apache/spark/pull/4673

   SPARK-4610 addendum: [Minor] [MLlib] Minor doc fix in GBT classification example
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-02-18 10:13:28 +0000
   Commit: 3997e74, github.com/apache/spark/pull/4672

   [SPARK-5878] fix DataFrame.repartition() in Python
   Davies Liu <davies@databricks.com>
   2015-02-18 01:00:54 -0800
   Commit: aca7991, github.com/apache/spark/pull/4667

   Avoid deprecation warnings in JDBCSuite.
   Tor Myklebust <tmyklebu@gmail.com>
   2015-02-18 01:00:13 -0800
   Commit: 9a565b8, github.com/apache/spark/pull/4668

   [Minor] [SQL] Cleans up DataFrame variable names and toDF() calls
   Cheng Lian <lian@databricks.com>
   2015-02-17 23:36:20 -0800
   Commit: 2bd33ce, github.com/apache/spark/pull/4670

   [SPARK-5731][Streaming][Test] Fix incorrect test in DirectKafkaStreamSuite
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-02-17 22:44:16 -0800
   Commit: f8f9a64, github.com/apache/spark/pull/4597

   [SPARK-5723][SQL]Change the default file format to Parquet for CTAS statements.
   Yin Huai <yhuai@databricks.com>
   2015-02-17 18:14:33 -0800
   Commit: 6e82c46, github.com/apache/spark/pull/4639

   Preparing development version 1.3.1-SNAPSHOT
   Patrick Wendell <patrick@databricks.com>
   2015-02-18 01:52:06 +0000
   Commit: 2ab0ba0

   Preparing Spark release v1.3.0-rc1
   Patrick Wendell <patrick@databricks.com>
   2015-02-18 01:52:06 +0000
   Commit: f97b0d4

   [SPARK-5875][SQL]logical.Project should not be resolved if it contains aggregates or generators
   Yin Huai <yhuai@databricks.com>
   2015-02-17 17:50:39 -0800
   Commit: e8284b2, github.com/apache/spark/pull/4663

   Revert "Preparing Spark release v1.3.0-snapshot1"
   Patrick Wendell <patrick@databricks.com>
   2015-02-17 17:48:47 -0800
   Commit: 7320605

   Revert "Preparing development version 1.3.1-SNAPSHOT"
   Patrick Wendell <patrick@databricks.com>
   2015-02-17 17:48:43 -0800
   Commit: 932ae4d

   [SPARK-4454] Revert getOrElse() cleanup in DAGScheduler.getCacheLocs()
   Josh Rosen <joshrosen@databricks.com>
   2015-02-17 17:45:16 -0800
   Commit: 7e5e4d8

   [SPARK-4454] Properly synchronize accesses to DAGScheduler cacheLocs map
   Josh Rosen <joshrosen@databricks.com>
   2015-02-17 17:39:58 -0800
   Commit: 07a401a, github.com/apache/spark/pull/4660

   [SPARK-5811] Added documentation for maven coordinates and added Spark Packages support
   Burak Yavuz <brkyvz@gmail.com>, Davies Liu <davies@databricks.com>
   2015-02-17 17:15:43 -0800
   Commit: cb90584, github.com/apache/spark/pull/4662

   [SPARK-5785] [PySpark] narrow dependency for cogroup/join in PySpark
   Davies Liu <davies@databricks.com>
   2015-02-17 16:54:57 -0800
   Commit: 8120235, github.com/apache/spark/pull/4629

   [SPARK-5852][SQL]Fail to convert a newly created empty metastore parquet table to a data source parquet table.
   Yin Huai <yhuai@databricks.com>, Cheng Hao <hao.cheng@intel.com>
   2015-02-17 15:47:59 -0800
   Commit: 07d8ef9, github.com/apache/spark/pull/4655

   [SPARK-5872] [SQL] create a sqlCtx in pyspark shell
   Davies Liu <davies@databricks.com>
   2015-02-17 15:44:37 -0800
   Commit: 0dba382, github.com/apache/spark/pull/4659

   [SPARK-5871] output explain in Python
   Davies Liu <davies@databricks.com>
   2015-02-17 13:48:38 -0800
   Commit: cb06160, github.com/apache/spark/pull/4658

   [SPARK-4172] [PySpark] Progress API in Python
   Davies Liu <davies@databricks.com>
   2015-02-17 13:36:43 -0800
   Commit: 35e23ff, github.com/apache/spark/pull/3027

   [SPARK-5868][SQL] Fix python UDFs in HiveContext and checks in SQLContext
   Michael Armbrust <michael@databricks.com>
   2015-02-17 13:23:45 -0800
   Commit: e65dc1f, github.com/apache/spark/pull/4657

   [SQL] [Minor] Update the HiveContext Unittest
   Cheng Hao <hao.cheng@intel.com>
   2015-02-17 12:25:35 -0800
   Commit: 0135651, github.com/apache/spark/pull/4584

   [Minor][SQL] Use same function to check path parameter in JSONRelation
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-17 12:24:13 -0800
   Commit: d74d5e8, github.com/apache/spark/pull/4649

   [SPARK-5862][SQL] Only transformUp the given plan once in HiveMetastoreCatalog
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-17 12:23:18 -0800
   Commit: 62063b7, github.com/apache/spark/pull/4651

   [Minor] fix typo in SQL document
   CodingCat <zhunansjtu@gmail.com>
   2015-02-17 12:16:52 -0800
   Commit: 5636c4a, github.com/apache/spark/pull/4656

   [SPARK-5864] [PySpark] support .jar as python package
   Davies Liu <davies@databricks.com>
   2015-02-17 12:05:06 -0800
   Commit: 71cf6e2, github.com/apache/spark/pull/4652

   SPARK-5841 [CORE] [HOTFIX] Memory leak in DiskBlockManager
   Sean Owen <sowen@cloudera.com>
   2015-02-17 19:40:06 +0000
   Commit: e64afcd, github.com/apache/spark/pull/4648

   [SPARK-5661]function hasShutdownDeleteTachyonDir should use shutdownDeleteTachyonPaths to determine whether contains file
   xukun 00228947 <xukun.xu@huawei.com>, viper-kun <xukun.xu@huawei.com>
   2015-02-17 18:59:41 +0000
   Commit: 420bc9b, github.com/apache/spark/pull/4418

   [SPARK-5778] throw if nonexistent metrics config file provided
   Ryan Williams <ryan.blake.williams@gmail.com>
   2015-02-17 10:57:16 -0800
   Commit: 2bf2b56, github.com/apache/spark/pull/4571

   [SPARK-5859] [PySpark] [SQL] fix DataFrame Python API
   Davies Liu <davies@databricks.com>
   2015-02-17 10:22:48 -0800
   Commit: 4a581aa, github.com/apache/spark/pull/4645

   [SPARK-5166][SPARK-5247][SPARK-5258][SQL] API Cleanup / Documentation
   Michael Armbrust <michael@databricks.com>
   2015-02-17 10:21:17 -0800
   Commit: cd3d415, github.com/apache/spark/pull/4642

   [SPARK-5858][MLLIB] Remove unnecessary first() call in GLM
   Xiangrui Meng <meng@databricks.com>
   2015-02-17 10:17:45 -0800
   Commit: 97cb568, github.com/apache/spark/pull/4647

   SPARK-5856: In Maven build script, launch Zinc with more memory
   Patrick Wendell <patrick@databricks.com>
   2015-02-17 10:10:01 -0800
   Commit: 8240629, github.com/apache/spark/pull/4643

   Revert "[SPARK-5363] [PySpark] check ending mark in non-block way"
   Josh Rosen <joshrosen@databricks.com>
   2015-02-17 07:48:27 -0800
   Commit: aeb85cd

   [SPARK-5826][Streaming] Fix Configuration not serializable problem
   jerryshao <saisai.shao@intel.com>
   2015-02-17 10:45:18 +0000
   Commit: b8da5c3, github.com/apache/spark/pull/4612

   HOTFIX: Style issue causing build break
   Patrick Wendell <patrick@databricks.com>
   2015-02-16 22:10:39 -0800
   Commit: e9241fa

   [SPARK-5802][MLLIB] cache transformed data in glm
   Xiangrui Meng <meng@databricks.com>
   2015-02-16 22:09:04 -0800
   Commit: dfe0fa0, github.com/apache/spark/pull/4593

   [SPARK-5853][SQL] Schema support in Row.
   Reynold Xin <rxin@databricks.com>
   2015-02-16 20:42:57 -0800
   Commit: d0701d9, github.com/apache/spark/pull/4640

   SPARK-5850: Remove experimental label for Scala 2.11 and FlumePollingStream
   Patrick Wendell <patrick@databricks.com>
   2015-02-16 20:33:33 -0800
   Commit: c6a7069, github.com/apache/spark/pull/4638

   [SPARK-5363] [PySpark] check ending mark in non-block way
   Davies Liu <davies@databricks.com>
   2015-02-16 20:32:03 -0800
   Commit: baad6b3, github.com/apache/spark/pull/4601

   [SQL] Various DataFrame doc changes.
   Reynold Xin <rxin@databricks.com>
   2015-02-16 19:00:30 -0800
   Commit: e355b54, github.com/apache/spark/pull/4636

   [SPARK-5849] Handle more types of invalid JSON requests in SubmitRestProtocolMessage.parseAction
   Josh Rosen <joshrosen@databricks.com>
   2015-02-16 18:08:02 -0800
   Commit: 385a339, github.com/apache/spark/pull/4637

   [SPARK-3340] Deprecate ADD_JARS and ADD_FILES
   azagrebin <azagrebin@gmail.com>
   2015-02-16 18:06:19 -0800
   Commit: d8c70fb, github.com/apache/spark/pull/4616

   [SPARK-5788] [PySpark] capture the exception in python write thread
   Davies Liu <davies@databricks.com>
   2015-02-16 17:57:14 -0800
   Commit: c2a9a61, github.com/apache/spark/pull/4577

   SPARK-5848: tear down the ConsoleProgressBar timer
   Matt Whelan <mwhelan@perka.com>
   2015-02-17 00:59:49 +0000
   Commit: 52994d8, github.com/apache/spark/pull/4635

   [SPARK-4865][SQL]Include temporary tables in SHOW TABLES
   Yin Huai <yhuai@databricks.com>
   2015-02-16 15:59:23 -0800
   Commit: 8a94bf7, github.com/apache/spark/pull/4618

   [SQL] Optimize arithmetic and predicate operators
   kai <kaizeng@eecs.berkeley.edu>
   2015-02-16 15:58:05 -0800
   Commit: 639a3c2, github.com/apache/spark/pull/4472

   [SPARK-5839][SQL]HiveMetastoreCatalog does not recognize table names and aliases of data source tables.
   Yin Huai <yhuai@databricks.com>
   2015-02-16 15:54:01 -0800
   Commit: a15a0a0, github.com/apache/spark/pull/4626

   [SPARK-5746][SQL] Check invalid cases for the write path of data source API
   Yin Huai <yhuai@databricks.com>
   2015-02-16 15:51:59 -0800
   Commit: 4198654, github.com/apache/spark/pull/4617

   HOTFIX: Break in Jekyll build from #4589
   Patrick Wendell <patrick@databricks.com>
   2015-02-16 15:43:56 -0800
   Commit: ad8fd4f

   [SPARK-2313] Use socket to communicate GatewayServer port back to Python driver
   Josh Rosen <joshrosen@databricks.com>
   2015-02-16 15:25:11 -0800
   Commit: b70b8ba, github.com/apache/spark/pull/3424

   SPARK-5357: Update commons-codec version to 1.10 (current)
   Matt Whelan <mwhelan@perka.com>
   2015-02-16 23:05:34 +0000
   Commit: 8c45619, github.com/apache/spark/pull/4153

   SPARK-5841: remove DiskBlockManager shutdown hook on stop
   Matt Whelan <mwhelan@perka.com>
   2015-02-16 22:54:32 +0000
   Commit: dd977df, github.com/apache/spark/pull/4627

   [SPARK-5833] [SQL] Adds REFRESH TABLE command
   Cheng Lian <lian@databricks.com>
   2015-02-16 12:52:05 -0800
   Commit: 864d77e, github.com/apache/spark/pull/4624

   [SPARK-5296] [SQL] Add more filter types for data sources API
   Cheng Lian <lian@databricks.com>
   2015-02-16 12:48:55 -0800
   Commit: 363a9a7, github.com/apache/spark/pull/4623

   [SQL] Add fetched row count in SparkSQLCLIDriver
   OopsOutOfMemory <victorshengli@126.com>
   2015-02-16 12:34:09 -0800
   Commit: 0368494, github.com/apache/spark/pull/4604

   [SQL] Initial support for reporting location of error in sql string
   Michael Armbrust <michael@databricks.com>
   2015-02-16 12:32:56 -0800
   Commit: 63fa123, github.com/apache/spark/pull/4587

   [SPARK-5824] [SQL] add null format in ctas and set default col comment to null
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-02-16 12:31:36 -0800
   Commit: c2eaaea, github.com/apache/spark/pull/4609

   [SQL] [Minor] Update the SpecificMutableRow.copy
   Cheng Hao <hao.cheng@intel.com>
   2015-02-16 12:21:08 -0800
   Commit: 1a88955, github.com/apache/spark/pull/4619

   SPARK-5795 [STREAMING] api.java.JavaPairDStream.saveAsNewAPIHadoopFiles may not be friendly to Java
   Sean Owen <sowen@cloudera.com>
   2015-02-16 19:32:31 +0000
   Commit: fef2267, github.com/apache/spark/pull/4608

   [SPARK-5799][SQL] Compute aggregation function on specified numeric columns
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-16 10:06:11 -0800
   Commit: 0165e9d, github.com/apache/spark/pull/4592

   [SPARK-4553] [SPARK-5767] [SQL] Wires Parquet data source with the newly introduced write support for data source API
   Cheng Lian <lian@databricks.com>
   2015-02-16 01:38:31 -0800
   Commit: 78f7edb, github.com/apache/spark/pull/4563

   [Minor] [SQL] Renames stringRddToDataFrame to stringRddToDataFrameHolder for consistency
   Cheng Lian <lian@databricks.com>
   2015-02-16 01:33:37 -0800
   Commit: 066301c, github.com/apache/spark/pull/4613

   [Ml] SPARK-5804 Explicitly manage cache in Crossvalidator k-fold loop
   Peter Rudenko <petro.rudenko@gmail.com>
   2015-02-16 00:07:23 -0800
   Commit: 0d93205, github.com/apache/spark/pull/4595

   [Ml] SPARK-5796 Don't transform data on a last estimator in Pipeline
   Peter Rudenko <petro.rudenko@gmail.com>
   2015-02-15 20:51:32 -0800
   Commit: 9cf7d70, github.com/apache/spark/pull/4590

   SPARK-5815 [MLLIB] Deprecate SVDPlusPlus APIs that expose DoubleMatrix from JBLAS
   Sean Owen <sowen@cloudera.com>
   2015-02-15 20:41:27 -0800
   Commit: db3c539, github.com/apache/spark/pull/4614

   [SPARK-5769] Set params in constructors and in setParams in Python ML pipelines
   Xiangrui Meng <meng@databricks.com>
   2015-02-15 20:29:26 -0800
   Commit: d710991, github.com/apache/spark/pull/4564

   SPARK-5669 [BUILD] Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS
   Sean Owen <sowen@cloudera.com>
   2015-02-15 09:15:48 -0800
   Commit: 4e099d7, github.com/apache/spark/pull/4453

   [MLLIB][SPARK-5502] User guide for isotonic regression
   martinzapletal <zapletal-martin@email.cz>
   2015-02-15 09:10:03 -0800
   Commit: d96e188, github.com/apache/spark/pull/4536

   [HOTFIX] Ignore DirectKafkaStreamSuite.
   Patrick Wendell <patrick@databricks.com>
   2015-02-13 12:43:53 -0800
   Commit: 70ebad4

   [SPARK-5827][SQL] Add missing import in the example of SqlContext
   Takeshi Yamamuro <linguin.m.s@gmail.com>
   2015-02-15 14:42:20 +0000
   Commit: 9c1c70d, github.com/apache/spark/pull/4615

   SPARK-5822 [BUILD] cannot import src/main/scala & src/test/scala into eclipse as source folder
   gli <gli@redhat.com>
   2015-02-14 20:43:27 +0000
   Commit: f87f3b7, github.com/apache/spark/pull/4531

   Revise formatting of previous commit f80e2629bb74bc62960c61ff313f7e7802d61319
   Sean Owen <sowen@cloudera.com>
   2015-02-14 20:12:29 +0000
   Commit: 1945fcf

   [SPARK-5800] Streaming Docs. Change linked files according to the selected language
   gasparms <gmunoz@stratio.com>
   2015-02-14 20:10:29 +0000
   Commit: e99e170, github.com/apache/spark/pull/4589

   [SPARK-5752][SQL] Don't implicitly convert RDDs directly to DataFrames
   Reynold Xin <rxin@databricks.com>, Davies Liu <davies@databricks.com>
   2015-02-13 23:03:22 -0800
   Commit: ba91bf5, github.com/apache/spark/pull/4556

   SPARK-3290 [GRAPHX] No unpersist calls in SVDPlusPlus
   Sean Owen <sowen@cloudera.com>
   2015-02-13 20:12:52 -0800
   Commit: db57479, github.com/apache/spark/pull/4234

   [SPARK-5227] [SPARK-5679] Disable FileSystem cache in WholeTextFileRecordReaderSuite
   Josh Rosen <joshrosen@databricks.com>
   2015-02-13 17:45:31 -0800
   Commit: 152147f, github.com/apache/spark/pull/4599

   [SPARK-5730][ML] add doc groups to spark.ml components
   Xiangrui Meng <meng@databricks.com>
   2015-02-13 16:45:59 -0800
   Commit: fccd38d, github.com/apache/spark/pull/4600

   [SPARK-5803][MLLIB] use ArrayBuilder to build primitive arrays
   Xiangrui Meng <meng@databricks.com>
   2015-02-13 16:43:49 -0800
   Commit: 356b798, github.com/apache/spark/pull/4594

   [SPARK-5806] re-organize sections in mllib-clustering.md
   Xiangrui Meng <meng@databricks.com>
   2015-02-13 15:09:27 -0800
   Commit: 9658763, github.com/apache/spark/pull/4598

   [SPARK-5789][SQL]Throw a better error message if JsonRDD.parseJson encounters unrecoverable parsing errors.
   Yin Huai <yhuai@databricks.com>
   2015-02-13 13:51:06 -0800
   Commit: d9d0250, github.com/apache/spark/pull/4582

   [SPARK-5642] [SQL] Apply column pruning on unused aggregation fields
   Daoyuan Wang <daoyuan.wang@intel.com>, Michael Armbrust <michael@databricks.com>
   2015-02-13 13:46:50 -0800
   Commit: efffc2e, github.com/apache/spark/pull/4415

   [HOTFIX] Fix build break in MesosSchedulerBackendSuite
   Andrew Or <andrew@databricks.com>
   2015-02-13 13:10:29 -0800
   Commit: 4160371

   SPARK-5805 Fixed the type error in documentation.
   Emre Sevinç <emre.sevinc@gmail.com>
   2015-02-13 12:31:27 -0800
   Commit: ad73189, github.com/apache/spark/pull/4596

   [SPARK-5735] Replace uses of EasyMock with Mockito
   Josh Rosen <joshrosen@databricks.com>
   2015-02-13 09:53:57 -0800
   Commit: cc9eec1, github.com/apache/spark/pull/4578

   [SPARK-5783] Better eventlog-parsing error messages
   Ryan Williams <ryan.blake.williams@gmail.com>
   2015-02-13 09:47:26 -0800
   Commit: e5690a5, github.com/apache/spark/pull/4573

   [SPARK-5503][MLLIB] Example code for Power Iteration Clustering
   sboeschhuawei <stephen.boesch@huawei.com>
   2015-02-13 09:45:57 -0800
   Commit: 5e63942, github.com/apache/spark/pull/4495

   [SPARK-5732][CORE]:Add an option to print the spark version in spark script.
   uncleGen <hustyugm@gmail.com>, genmao.ygm <genmao.ygm@alibaba-inc.com>
   2015-02-13 09:43:10 -0800
   Commit: 5c883df, github.com/apache/spark/pull/4522

   [SPARK-4832][Deploy]some other processes might take the daemon pid
   WangTaoTheTonic <barneystinson@aliyun.com>, WangTaoTheTonic <wangtao111@huawei.com>
   2015-02-13 10:27:23 +0000
   Commit: 1255e83, github.com/apache/spark/pull/3683

   [SQL] Fix docs of SQLContext.tables
   Yin Huai <yhuai@databricks.com>
   2015-02-12 20:37:55 -0800
   Commit: a8f560c, github.com/apache/spark/pull/4579

   [SPARK-3365][SQL]Wrong schema generated for List type
   tianyi <tianyi.asiainfo@gmail.com>
   2015-02-12 22:18:39 -0800
   Commit: b9f332a, github.com/apache/spark/pull/4581

   [SPARK-3299][SQL]Public API in SQLContext to list tables
   Yin Huai <yhuai@databricks.com>
   2015-02-12 18:08:01 -0800
   Commit: edbac17, github.com/apache/spark/pull/4547

   [SQL] Move SaveMode to SQL package.
   Yin Huai <yhuai@databricks.com>
   2015-02-12 15:32:17 -0800
   Commit: 925fd84, github.com/apache/spark/pull/4542

   [SPARK-5335] Fix deletion of security groups within a VPC
   Vladimir Grigor <vladimir@kiosked.com>, Vladimir Grigor <vladimir@voukka.com>
   2015-02-12 23:26:24 +0000
   Commit: 5c9db4e, github.com/apache/spark/pull/4122

   [SPARK-5755] [SQL] remove unnecessary Add
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-02-12 15:22:07 -0800
   Commit: f7103b3, github.com/apache/spark/pull/4551

   [SPARK-5573][SQL] Add explode to dataframes
   Michael Armbrust <michael@databricks.com>
   2015-02-12 15:19:19 -0800
   Commit: c7eb9ee, github.com/apache/spark/pull/4546

   [SPARK-5758][SQL] Use LongType as the default type for integers in JSON schema inference.
   Yin Huai <yhuai@databricks.com>
   2015-02-12 15:17:25 -0800
   Commit: b0c79da, github.com/apache/spark/pull/4544

   [SPARK-5780] [PySpark] Mute the logging during unit tests
   Davies Liu <davies@databricks.com>
   2015-02-12 14:54:38 -0800
   Commit: bf0d15c, github.com/apache/spark/pull/4572

   SPARK-5747: Fix wordsplitting bugs in make-distribution.sh
   David Y. Ross <dyross@gmail.com>
   2015-02-12 14:52:38 -0800
   Commit: 11a0d5b, github.com/apache/spark/pull/4540

   [SPARK-5759][Yarn]ExecutorRunnable should catch YarnException while NMClient start contain...
   lianhuiwang <lianhuiwang09@gmail.com>
   2015-02-12 14:50:16 -0800
   Commit: 02d5b32, github.com/apache/spark/pull/4554

   [SPARK-5760][SPARK-5761] Fix standalone rest protocol corner cases + revamp tests
   Andrew Or <andrew@databricks.com>
   2015-02-12 14:47:52 -0800
   Commit: 11d1080, github.com/apache/spark/pull/4557

   [SPARK-5762] Fix shuffle write time for sort-based shuffle
   Kay Ousterhout <kayousterhout@gmail.com>
   2015-02-12 14:46:37 -0800
   Commit: 0040fc5, github.com/apache/spark/pull/4559

   [SPARK-5765][Examples]Fixed word split problem in run-example and compute-classpath
   Venkata Ramana G <ramana.gollamudi@huawei.com>, Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
   2015-02-12 14:44:21 -0800
   Commit: 9a1de4b, github.com/apache/spark/pull/4561

   [SPARK-5645] Added local read bytes/time to task metrics
   Kay Ousterhout <kayousterhout@gmail.com>
   2015-02-12 14:35:44 -0800
   Commit: 74f34bb, github.com/apache/spark/pull/4510

   [SQL] Improve error messages
   Michael Armbrust <michael@databricks.com>, wangfei <wangfei1@huawei.com>
   2015-02-12 13:11:28 -0800
   Commit: e3a975d, github.com/apache/spark/pull/4558

   [SQL][DOCS] Update sql documentation
   Antonio Navarro Perez <ajnavarro@users.noreply.github.com>
   2015-02-12 12:46:17 -0800
   Commit: cbd659e, github.com/apache/spark/pull/4560

   [SPARK-5757][MLLIB] replace SQL JSON usage in model import/export by json4s
   Xiangrui Meng <meng@databricks.com>
   2015-02-12 10:48:13 -0800
   Commit: e26c149, github.com/apache/spark/pull/4555

   [SPARK-5655] Don't chmod700 application files if running in YARN
   Andrew Rowson <github@growse.com>
   2015-02-12 18:41:39 +0000
   Commit: e23c8f5, github.com/apache/spark/pull/4509

   [SQL] Make dataframe more tolerant of being serialized
   Michael Armbrust <michael@databricks.com>
   2015-02-11 19:05:49 -0800
   Commit: 3c1b9bf, github.com/apache/spark/pull/4545

   [SQL] Two DataFrame fixes.
   Reynold Xin <rxin@databricks.com>
   2015-02-11 18:32:48 -0800
   Commit: bcb1382, github.com/apache/spark/pull/4543

   [SPARK-3688][SQL] More inline comments for LogicalPlan.
   Reynold Xin <rxin@databricks.com>
   2015-02-11 15:26:31 -0800
   Commit: 08ab3d2, github.com/apache/spark/pull/4539

   [SPARK-3688][SQL]LogicalPlan can't resolve column correctly
   tianyi <tianyi.asiainfo@gmail.com>
   2015-02-11 12:50:17 -0800
   Commit: e136f47, github.com/apache/spark/pull/4524

   [SPARK-5454] More robust handling of self joins
   Michael Armbrust <michael@databricks.com>
   2015-02-11 12:31:56 -0800
   Commit: 1bb3631, github.com/apache/spark/pull/4520

   Remove outdated remark about take(n).
   Daniel Darabos <darabos.daniel@gmail.com>
   2015-02-11 20:24:17 +0000
   Commit: 72adfc5, github.com/apache/spark/pull/4533

   [SPARK-5677] [SPARK-5734] [SQL] [PySpark] Python DataFrame API remaining tasks
   Davies Liu <davies@databricks.com>
   2015-02-11 12:13:16 -0800
   Commit: d66aae2, github.com/apache/spark/pull/4528

   [SPARK-5733] Error Link in Pagination of HistoryPage when showing Incomplete Applications
   guliangliang <guliangliang@qiyi.com>
   2015-02-11 15:55:49 +0000
   Commit: 864dccd, github.com/apache/spark/pull/4523

   SPARK-5727 [BUILD] Deprecate Debian packaging
   Sean Owen <sowen@cloudera.com>
   2015-02-11 08:30:16 +0000
   Commit: 057ec4f, github.com/apache/spark/pull/4516

   SPARK-5728 [STREAMING] MQTTStreamSuite leaves behind ActiveMQ database files
   Sean Owen <sowen@cloudera.com>
   2015-02-11 08:13:51 +0000
   Commit: 476b6d7, github.com/apache/spark/pull/4517

   [SPARK-4964] [Streaming] refactor createRDD to take leaders via map instead of array
   cody koeninger <cody@koeninger.org>
   2015-02-11 00:13:27 -0800
   Commit: 811d179, github.com/apache/spark/pull/4511

   Preparing development version 1.3.1-SNAPSHOT
   Patrick Wendell <patrick@databricks.com>
   2015-02-11 07:47:03 +0000
   Commit: e57c81b

   Preparing Spark release v1.3.0-snapshot1
   Patrick Wendell <patrick@databricks.com>
   2015-02-11 07:47:02 +0000
   Commit: d97bfc6

   Revert "Preparing Spark release v1.3.0-snapshot1"
   Patrick Wendell <patrick@databricks.com>
   2015-02-10 23:46:04 -0800
   Commit: 6a91d59

   Revert "Preparing development version 1.3.1-SNAPSHOT"
   Patrick Wendell <patrick@databricks.com>
   2015-02-10 23:46:02 -0800
   Commit: 3a50383

   HOTFIX: Adding Junit to Hive tests for Maven build
   Patrick Wendell <patrick@databricks.com>
   2015-02-10 23:39:21 -0800
   Commit: 0386fc4

   Preparing development version 1.3.1-SNAPSHOT
   Patrick Wendell <patrick@databricks.com>
   2015-02-11 06:45:03 +0000
   Commit: ba12b79

   Preparing Spark release v1.3.0-snapshot1
   Patrick Wendell <patrick@databricks.com>
   2015-02-11 06:45:03 +0000
   Commit: 53068f5

   HOTFIX: Java 6 compilation error in Spark SQL
   Patrick Wendell <patrick@databricks.com>
   2015-02-10 22:43:32 -0800
   Commit: 15180bc

   Revert "Preparing Spark release v1.3.0-snapshot1"
   Patrick Wendell <patrick@databricks.com>
   2015-02-10 22:44:10 -0800
   Commit: 536dae9

   Revert "Preparing development version 1.3.1-SNAPSHOT"
   Patrick Wendell <patrick@databricks.com>
   2015-02-10 22:44:07 -0800
   Commit: 01b562e

   Preparing development version 1.3.1-SNAPSHOT
   Patrick Wendell <patrick@databricks.com>
   2015-02-11 06:15:29 +0000
   Commit: db80d0f

   Preparing Spark release v1.3.0-snapshot1
   Patrick Wendell <patrick@databricks.com>
   2015-02-11 06:15:29 +0000
   Commit: c2e4001

   Updating versions for Spark 1.3
   Patrick Wendell <patrick@databricks.com>
   2015-02-10 21:54:55 -0800
   Commit: 2f52489

   [SPARK-5714][Mllib] Refactor initial step of LDA to remove redundant operations
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-10 21:51:15 -0800
   Commit: ba3aa8f, github.com/apache/spark/pull/4501

   [SPARK-5702][SQL] Allow short names for built-in data sources.
   Reynold Xin <rxin@databricks.com>
   2015-02-10 20:40:21 -0800
   Commit: 63af90c, github.com/apache/spark/pull/4489

   [SPARK-5729] Potential NPE in standalone REST API
   Andrew Or <andrew@databricks.com>
   2015-02-10 20:19:14 -0800
   Commit: 1bc75b0, github.com/apache/spark/pull/4518

   [SPARK-4879] Use driver to coordinate Hadoop output committing for speculative tasks
   mcheah <mcheah@palantir.com>, Josh Rosen <joshrosen@databricks.com>
   2015-02-10 20:12:18 -0800
   Commit: 79cd59c, github.com/apache/spark/pull/4155

   [SQL][DataFrame] Fix column computability bug.
   Reynold Xin <rxin@databricks.com>
   2015-02-10 19:50:44 -0800
   Commit: e477e91, github.com/apache/spark/pull/4519

   [SPARK-5709] [SQL] Add EXPLAIN support in DataFrame API for debugging purpose
   Cheng Hao <hao.cheng@intel.com>
   2015-02-10 19:40:51 -0800
   Commit: 7fa0d5f, github.com/apache/spark/pull/4496

   [SPARK-5704] [SQL] [PySpark] createDataFrame from RDD with columns
   Davies Liu <davies@databricks.com>
   2015-02-10 19:40:12 -0800
   Commit: 1056c5b, github.com/apache/spark/pull/4498

   [SPARK-5683] [SQL] Avoid multiple json generator created
   Cheng Hao <hao.cheng@intel.com>
   2015-02-10 18:19:56 -0800
   Commit: fc0446f, github.com/apache/spark/pull/4468

   [SQL] Add an exception for analysis errors.
   Michael Armbrust <michael@databricks.com>
   2015-02-10 17:32:42 -0800
   Commit: 748cdc1, github.com/apache/spark/pull/4439

   [SPARK-5658][SQL] Finalize DDL and write support APIs
   Yin Huai <yhuai@databricks.com>
   2015-02-10 17:29:52 -0800
   Commit: a21090e, github.com/apache/spark/pull/4446

   [SPARK-5493] [core] Add option to impersonate user.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-02-10 17:19:10 -0800
   Commit: 8e75b0e, github.com/apache/spark/pull/4405

   [SQL] Make Options in the data source API CREATE TABLE statements optional.
   Yin Huai <yhuai@databricks.com>
   2015-02-10 17:06:12 -0800
   Commit: 445dbc7, github.com/apache/spark/pull/4515

   [SPARK-5725] [SQL] Fixes ParquetRelation2.equals
   Cheng Lian <lian@databricks.com>
   2015-02-10 17:02:44 -0800
   Commit: f43bc3d, github.com/apache/spark/pull/4513

   [SPARK-5343][GraphX]: ShortestPaths traverses backwards
   Brennon York <brennon.york@capitalone.com>
   2015-02-10 14:57:00 -0800
   Commit: 5be8902, github.com/apache/spark/pull/4478

   [SPARK-5021] [MLlib] Gaussian Mixture now supports Sparse Input
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-02-10 14:05:55 -0800
   Commit: bba0953, github.com/apache/spark/pull/4459

   [HOTFIX][SPARK-4136] Fix compilation and tests
   Andrew Or <andrew@databricks.com>
   2015-02-10 11:18:01 -0800
   Commit: 4e3aa68

   [SPARK-5686][SQL] Add show current roles command in HiveQl
   OopsOutOfMemory <victorshengli@126.com>
   2015-02-10 13:20:15 -0800
   Commit: 8b7587a, github.com/apache/spark/pull/4471

   [SQL] Add toString to DataFrame/Column
   Michael Armbrust <michael@databricks.com>
   2015-02-10 13:14:01 -0800
   Commit: ef739d9, github.com/apache/spark/pull/4436

   SPARK-5613: Catch the ApplicationNotFoundException exception to avoid thread from getting killed on yarn restart.
   Kashish Jain <kashish.jain@guavus.com>
   2015-02-06 13:47:23 -0800
   Commit: c294216, github.com/apache/spark/pull/4392

   [SPARK-5592][SQL] java.net.URISyntaxException when insert data to a partitioned table
   wangfei <wangfei1@huawei.com>, Fei Wang <wangfei1@huawei.com>
   2015-02-10 11:54:30 -0800
   Commit: dbfce30, github.com/apache/spark/pull/4368

   SPARK-4136. Under dynamic allocation, cancel outstanding executor requests when no longer needed
   Sandy Ryza <sandy@cloudera.com>
   2015-02-10 11:07:25 -0800
   Commit: e53da21, github.com/apache/spark/pull/4168

   [SPARK-5716] [SQL] Support TOK_CHARSETLITERAL in HiveQl
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-02-10 11:08:21 -0800
   Commit: e508237, github.com/apache/spark/pull/4502

   [Spark-5717] [MLlib] add stop and reorganize import
   JqueryFan <firing@126.com>, Yuhao Yang <hhbyyh@gmail.com>
   2015-02-10 17:37:32 +0000
   Commit: b32f553, github.com/apache/spark/pull/4503

   [SPARK-5700] [SQL] [Build] Bumps jets3t to 0.9.3 for hadoop-2.3 and hadoop-2.4 profiles
   Cheng Lian <lian@databricks.com>
   2015-02-10 02:28:47 -0800
   Commit: d6f31e0, github.com/apache/spark/pull/4499

   SPARK-5239 [CORE] JdbcRDD throws "java.lang.AbstractMethodError: oracle.jdbc.driver.xxxxxx.isClosed()Z"
   Sean Owen <sowen@cloudera.com>
   2015-02-10 09:19:01 +0000
   Commit: 4cfc025, github.com/apache/spark/pull/4470

   [SPARK-4964][Streaming][Kafka] More updates to Exactly-once Kafka stream
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-02-09 22:45:48 -0800
   Commit: 281614d, github.com/apache/spark/pull/4384

   [SPARK-5597][MLLIB] save/load for decision trees and ensembles
   Joseph K. Bradley <joseph@databricks.com>, Xiangrui Meng <meng@databricks.com>
   2015-02-09 22:09:07 -0800
   Commit: 01905c4, github.com/apache/spark/pull/4444

   [SQL] Remove the duplicated code
   Cheng Hao <hao.cheng@intel.com>
   2015-02-09 21:33:34 -0800
   Commit: 663d34e, github.com/apache/spark/pull/4494

   [SPARK-5701] Only set ShuffleReadMetrics when task has shuffle deps
   Kay Ousterhout <kayousterhout@gmail.com>
   2015-02-09 21:22:09 -0800
   Commit: 6ddbca4, github.com/apache/spark/pull/4488

   [SPARK-5703] AllJobsPage throws empty.max exception
   Andrew Or <andrew@databricks.com>
   2015-02-09 21:18:48 -0800
   Commit: 8326255, github.com/apache/spark/pull/4490

   [SPARK-2996] Implement userClassPathFirst for driver, yarn.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-02-09 21:17:06 -0800
   Commit: 6a1e0f9, github.com/apache/spark/pull/3233

   SPARK-4900 [MLLIB] MLlib SingularValueDecomposition ARPACK IllegalStateException
   Sean Owen <sowen@cloudera.com>
   2015-02-09 21:13:58 -0800
   Commit: ebf1df0, github.com/apache/spark/pull/4485

   Add a config option to print DAG.
   KaiXinXiaoLei <huleilei1@huawei.com>
   2015-02-09 20:58:58 -0800
   Commit: dad05e0, github.com/apache/spark/pull/4257

   [SPARK-5469] restructure pyspark.sql into multiple files
   Davies Liu <davies@databricks.com>
   2015-02-09 20:49:22 -0800
   Commit: f0562b4, github.com/apache/spark/pull/4479

   [SPARK-5698] Do not let user request negative # of executors
   Andrew Or <andrew@databricks.com>
   2015-02-09 17:33:29 -0800
   Commit: 62b1e1f, github.com/apache/spark/pull/4483

   [SPARK-5699] [SQL] [Tests] Runs hive-thriftserver tests whenever SQL code is modified
   Cheng Lian <lian@databricks.com>
   2015-02-09 16:52:05 -0800
   Commit: 71f0f51, github.com/apache/spark/pull/4486

   [SPARK-5648][SQL] support "alter ... unset tblproperties("key")"
   DoingDone9 <799203320@qq.com>
   2015-02-09 16:40:26 -0800
   Commit: e2bf59a, github.com/apache/spark/pull/4424

   [SPARK-2096][SQL] support dot notation on array of struct
   Wenchen Fan <cloud0fan@outlook.com>
   2015-02-09 16:39:34 -0800
   Commit: 15f557f, github.com/apache/spark/pull/2405

   [SPARK-5614][SQL] Predicate pushdown through Generate.
   Lu Yan <luyan02@baidu.com>
   2015-02-09 16:25:38 -0800
   Commit: ce2c89c, github.com/apache/spark/pull/4394

   [SPARK-5696] [SQL] [HOTFIX] Asks HiveThriftServer2 to re-initialize log4j using Hive configurations
   Cheng Lian <lian@databricks.com>
   2015-02-09 16:23:12 -0800
   Commit: 379233c, github.com/apache/spark/pull/4484

   [SQL] Code cleanup.
   Yin Huai <yhuai@databricks.com>
   2015-02-09 16:20:42 -0800
   Commit: e241601, github.com/apache/spark/pull/4482

   [SQL] Add some missing DataFrame functions.
   Michael Armbrust <michael@databricks.com>
   2015-02-09 16:02:56 -0800
   Commit: a70dca0, github.com/apache/spark/pull/4437

   [SPARK-5675][SQL] XyzType companion object should subclass XyzType
   Reynold Xin <rxin@databricks.com>
   2015-02-09 14:51:46 -0800
   Commit: 1e2fab2, github.com/apache/spark/pull/4463

   [SPARK-4905][STREAMING] FlumeStreamSuite fix.
   Hari Shreedharan <hshreedharan@apache.org>
   2015-02-09 14:17:14 -0800
   Commit: 18c5a99, github.com/apache/spark/pull/4371

   [SPARK-5691] Fixing wrong data structure lookup for dupe app registratio...
   mcheah <mcheah@palantir.com>
   2015-02-09 13:20:14 -0800
   Commit: 6a0144c, github.com/apache/spark/pull/4477

   [SPARK-5678] Convert DataFrame to pandas.DataFrame and Series
   Davies Liu <davies@databricks.com>
   2015-02-09 11:42:52 -0800
   Commit: 43972b5, github.com/apache/spark/pull/4476

   [SPARK-5664][BUILD] Restore stty settings when exiting from SBT's spark-shell
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-09 11:45:12 -0800
   Commit: fa67877, github.com/apache/spark/pull/4451

   SPARK-4267 [YARN] Failing to launch jobs on Spark on YARN with Hadoop 2.5.0 or later
   Sean Owen <sowen@cloudera.com>
   2015-02-09 10:33:57 -0800
   Commit: c88d4ab, github.com/apache/spark/pull/4452

   [SPARK-5473] [EC2] Expose SSH failures after status checks pass
   Nicholas Chammas <nicholas.chammas@gmail.com>
   2015-02-09 09:44:53 +0000
   Commit: f2aa7b7, github.com/apache/spark/pull/4262

   [SPARK-5539][MLLIB] LDA guide
   Xiangrui Meng <meng@databricks.com>, Joseph K. Bradley <joseph@databricks.com>
   2015-02-08 23:40:36 -0800
   Commit: 5782ee2, github.com/apache/spark/pull/4465

   [SPARK-5472][SQL] Fix Scala code style
   Hung Lin <hung@zoomdata.com>
   2015-02-08 22:36:42 -0800
   Commit: 955f286, github.com/apache/spark/pull/4464

   SPARK-4405 [MLLIB] Matrices.* construction methods should check for rows x cols overflow
   Sean Owen <sowen@cloudera.com>
   2015-02-08 21:08:50 -0800
   Commit: fa8ea48, github.com/apache/spark/pull/4461

   [SPARK-5660][MLLIB] Make Matrix apply public
   Joseph K. Bradley <joseph@databricks.com>, Xiangrui Meng <meng@databricks.com>
   2015-02-08 21:07:36 -0800
   Commit: df9b105, github.com/apache/spark/pull/4447

   [SPARK-5643][SQL] Add a show method to print the content of a DataFrame in tabular format.
   Reynold Xin <rxin@databricks.com>
   2015-02-08 18:56:51 -0800
   Commit: e1996aa, github.com/apache/spark/pull/4416

   SPARK-5665 [DOCS] Update netlib-java documentation
   Sam Halliday <sam.halliday@Gmail.com>, Sam Halliday <sam.halliday@gmail.com>
   2015-02-08 16:34:26 -0800
   Commit: c515634, github.com/apache/spark/pull/4448

   [SPARK-5598][MLLIB] model save/load for ALS
   Xiangrui Meng <meng@databricks.com>
   2015-02-08 16:26:20 -0800
   Commit: 9e4d58f, github.com/apache/spark/pull/4422

   [SQL] Set sessionState in QueryExecution.
   Yin Huai <yhuai@databricks.com>
   2015-02-08 14:55:07 -0800
   Commit: 42c56b6, github.com/apache/spark/pull/4445

   [SPARK-3039] [BUILD] Spark assembly for new hadoop API (hadoop 2) contai...
   medale <medale94@yahoo.com>
   2015-02-08 10:35:29 +0000
   Commit: bc55e20, github.com/apache/spark/pull/4315

   [SPARK-5672][Web UI] Don't return `ERROR 500` when have missing args
   Kirill A. Korinskiy <catap@catap.ru>
   2015-02-08 10:31:46 +0000
   Commit: 96010fa, github.com/apache/spark/pull/4239

   [SPARK-5671] Upgrade jets3t to 0.9.2 in hadoop-2.3 and 2.4 profiles
   Josh Rosen <joshrosen@databricks.com>
   2015-02-07 17:19:08 -0800
   Commit: 0f9d765, github.com/apache/spark/pull/4454

   [SPARK-5108][BUILD] Jackson dependency management for Hadoop-2.6.0 support
   Zhan Zhang <zhazhan@gmail.com>
   2015-02-07 19:41:30 +0000
   Commit: 51fbca4, github.com/apache/spark/pull/3938

   [BUILD] Add the ability to launch spark-shell from SBT.
   Michael Armbrust <michael@databricks.com>
   2015-02-07 00:14:38 -0800
   Commit: 6bda169, github.com/apache/spark/pull/4438

   [SPARK-5388] Provide a stable application submission gateway for standalone cluster mode
   Andrew Or <andrew@databricks.com>
   2015-02-06 15:57:06 -0800
   Commit: 6ec0cdc, github.com/apache/spark/pull/4216

   SPARK-5403: Ignore UserKnownHostsFile in SSH calls
   Grzegorz Dubicki <grzegorz.dubicki@gmail.com>
   2015-02-06 15:43:58 -0800
   Commit: 3d99741, github.com/apache/spark/pull/4196

   [SPARK-5601][MLLIB] make streaming linear algorithms Java-friendly
   Xiangrui Meng <meng@databricks.com>
   2015-02-06 15:42:59 -0800
   Commit: 11b28b9, github.com/apache/spark/pull/4432

   [SQL] [Minor] HiveParquetSuite was disabled by mistake, re-enable them
   Cheng Lian <lian@databricks.com>
   2015-02-06 15:23:42 -0800
   Commit: 4005802, github.com/apache/spark/pull/4440

   [SQL] Use TestSQLContext in Java tests
   Michael Armbrust <michael@databricks.com>
   2015-02-06 15:11:02 -0800
   Commit: c950058, github.com/apache/spark/pull/4441

   [SPARK-4994][network]Cleanup removed executors' ShuffleInfo in yarn shuffle service
   lianhuiwang <lianhuiwang09@gmail.com>
   2015-02-06 14:47:52 -0800
   Commit: af6ddf8, github.com/apache/spark/pull/3828

   [SPARK-5444][Network]Add a retry to deal with the conflict port in netty server.
   huangzhaowei <carlmartinmax@gmail.com>
   2015-02-06 14:35:29 -0800
   Commit: caca15a, github.com/apache/spark/pull/4240

   [SPARK-4874] [CORE] Collect record count metrics
   Kostas Sakellis <kostas@cloudera.com>
   2015-02-06 14:31:20 -0800
   Commit: 9fa29a6, github.com/apache/spark/pull/4067

   [HOTFIX] Fix the maven build after adding sqlContext to spark-shell
   Michael Armbrust <michael@databricks.com>
   2015-02-06 14:27:06 -0800
   Commit: 11dbf71, github.com/apache/spark/pull/4443

   [SPARK-5600] [core] Clean up FsHistoryProvider test, fix app sort order.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-02-06 14:23:09 -0800
   Commit: 09feecc, github.com/apache/spark/pull/4370

   SPARK-5633 pyspark saveAsTextFile support for compression codec
   Vladimir Vladimirov <vladimir.vladimirov@magnetic.com>
   2015-02-06 13:55:02 -0800
   Commit: 1d32341, github.com/apache/spark/pull/4403

   [HOTFIX][MLLIB] fix a compilation error with java 6
   Xiangrui Meng <meng@databricks.com>
   2015-02-06 13:52:35 -0800
   Commit: 87e0f0d, github.com/apache/spark/pull/4442

   [SPARK-4983] Insert waiting time before tagging EC2 instances
   GenTang <gen.tang86@gmail.com>, Gen TANG <gen.tang86@gmail.com>
   2015-02-06 13:27:34 -0800
   Commit: 2872d83, github.com/apache/spark/pull/3986

   [SPARK-5586][Spark Shell][SQL] Make `sqlContext` available in spark shell
   OopsOutOfMemory <victorshengli@126.com>
   2015-02-06 13:20:10 -0800
   Commit: 2ef9853, github.com/apache/spark/pull/4387

   [SPARK-5278][SQL] Introduce UnresolvedGetField and complete the check of ambiguous reference to fields
   Wenchen Fan <cloud0fan@outlook.com>
   2015-02-06 13:08:09 -0800
   Commit: 1b148ad, github.com/apache/spark/pull/4068

   [SQL][Minor] Remove cache keyword in SqlParser
   wangfei <wangfei1@huawei.com>
   2015-02-06 12:42:23 -0800
   Commit: d822606, github.com/apache/spark/pull/4393

   [SQL][HiveConsole][DOC] HiveConsole `correct hiveconsole imports`
   OopsOutOfMemory <victorshengli@126.com>
   2015-02-06 12:41:28 -0800
   Commit: 2abaa6e, github.com/apache/spark/pull/4389

   [SPARK-5595][SPARK-5603][SQL] Add a rule to do PreInsert type casting and field renaming and invalidating in memory cache after INSERT
   Yin Huai <yhuai@databricks.com>
   2015-02-06 12:38:07 -0800
   Commit: 3c34d62, github.com/apache/spark/pull/4373

   [SPARK-5324][SQL] Results of describe can't be queried
   OopsOutOfMemory <victorshengli@126.com>, Sheng, Li <OopsOutOfMemory@users.noreply.github.com>
   2015-02-06 12:33:20 -0800
   Commit: 0fc35da, github.com/apache/spark/pull/4249

   [SPARK-5619][SQL] Support 'show roles' in HiveContext
   q00251598 <qiyadong@huawei.com>
   2015-02-06 12:29:26 -0800
   Commit: cc66a3c, github.com/apache/spark/pull/4397

   [SPARK-5640] Synchronize ScalaReflection where necessary
   Tobias Schlatter <tobias@meisch.ch>
   2015-02-06 12:15:02 -0800
   Commit: 779e28b, github.com/apache/spark/pull/4431

   [SPARK-5650][SQL] Support optional 'FROM' clause
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-06 12:13:44 -0800
   Commit: 921121d, github.com/apache/spark/pull/4426

   [SPARK-5628] Add version option to spark-ec2
   Nicholas Chammas <nicholas.chammas@gmail.com>
   2015-02-06 12:08:22 -0800
   Commit: ab0ffde, github.com/apache/spark/pull/4414

   [SPARK-2945][YARN][Doc]add doc for spark.executor.instances
   WangTaoTheTonic <wangtao111@huawei.com>
   2015-02-06 11:57:02 -0800
   Commit: 540f474, github.com/apache/spark/pull/4350

   [SPARK-4361][Doc] Add more docs for Hadoop Configuration
   zsxwing <zsxwing@gmail.com>
   2015-02-06 11:50:20 -0800
   Commit: 528dd34, github.com/apache/spark/pull/3225

   [HOTFIX] Fix test build break in ExecutorAllocationManagerSuite.
   Josh Rosen <joshrosen@databricks.com>
   2015-02-06 11:47:32 -0800
   Commit: 9e828f4

   [SPARK-5652][Mllib] Use broadcasted weights in LogisticRegressionModel
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-06 11:22:11 -0800
   Commit: 6fda4c1, github.com/apache/spark/pull/4429

   [SPARK-5555] Enable UISeleniumSuite tests
   Josh Rosen <joshrosen@databricks.com>
   2015-02-06 11:14:58 -0800
   Commit: 93fee7b, github.com/apache/spark/pull/4334

   SPARK-2450 Adds executor log links to Web UI
   Kostas Sakellis <kostas@cloudera.com>, Josh Rosen <joshrosen@databricks.com>
   2015-02-06 11:13:00 -0800
   Commit: e74dd04, github.com/apache/spark/pull/3486

   [SPARK-5618][Spark Core][Minor] Optimise utility code.
   Makoto Fukuhara <fukuo33@gmail.com>
   2015-02-06 11:11:38 -0800
   Commit: 3feb798, github.com/apache/spark/pull/4396

   [SPARK-5593][Core]Replace BlockManagerListener with ExecutorListener in ExecutorAllocationListener
   lianhuiwang <lianhuiwang09@gmail.com>
   2015-02-06 11:09:37 -0800
   Commit: 9387dc1, github.com/apache/spark/pull/4369

   [SPARK-4877] Allow user first classes to extend classes in the parent.
   Stephen Haberman <stephen@exigencecorp.com>
   2015-02-06 11:03:56 -0800
   Commit: 52386cf, github.com/apache/spark/pull/3725

   [SPARK-5396] Syntax error in spark scripts on windows.
   Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
   2015-02-06 10:58:26 -0800
   Commit: 2dc94cd, github.com/apache/spark/pull/4428

   [SPARK-5636] Ramp up faster in dynamic allocation
   Andrew Or <andrew@databricks.com>
   2015-02-06 10:54:23 -0800
   Commit: 0a90305, github.com/apache/spark/pull/4409

   SPARK-4337. [YARN] Add ability to cancel pending requests
   Sandy Ryza <sandy@cloudera.com>
   2015-02-06 10:53:16 -0800
   Commit: 1568391, github.com/apache/spark/pull/4141

   [SPARK-5416] init Executor.threadPool before ExecutorSource
   Ryan Williams <ryan.blake.williams@gmail.com>
   2015-02-06 12:22:25 +0000
   Commit: f9bc4ef, github.com/apache/spark/pull/4212

   [Build] Set all Debian package permissions to 755
   Nicholas Chammas <nicholas.chammas@gmail.com>
   2015-02-06 11:38:39 +0000
   Commit: 3638216, github.com/apache/spark/pull/4277

   Update ec2-scripts.md
   Miguel Peralvo <miguel.peralvo@gmail.com>
   2015-02-06 11:04:48 +0000
   Commit: f6613fc, github.com/apache/spark/pull/4300

   [SPARK-5470][Core]use defaultClassLoader to load classes in KryoSerializer
   lianhuiwang <lianhuiwang09@gmail.com>
   2015-02-06 11:00:35 +0000
   Commit: 8007a4f, github.com/apache/spark/pull/4258

   [SPARK-5653][YARN] In ApplicationMaster rename isDriver to isClusterMode
   lianhuiwang <lianhuiwang09@gmail.com>
   2015-02-06 10:48:31 -0800
   Commit: 4ff8855, github.com/apache/spark/pull/4430

   [SPARK-5582] [history] Ignore empty log directories.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-02-06 10:07:20 +0000
   Commit: faccdcb, github.com/apache/spark/pull/4352

   [SPARK-5157][YARN] Configure more JVM options properly when we use ConcMarkSweepGC for AM.
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-02-06 09:39:12 +0000
   Commit: 25d8044, github.com/apache/spark/pull/3956

   [Minor] Remove permission for execution from spark-shell.cmd
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-02-06 09:33:36 +0000
   Commit: 7c54681, github.com/apache/spark/pull/3983

   [SPARK-5380][GraphX] Solve an ArrayIndexOutOfBoundsException when build graph with a file format error
   Leolh <leosandylh@gmail.com>
   2015-02-06 09:01:53 +0000
   Commit: ffdb2e9, github.com/apache/spark/pull/4176

   [SPARK-5013] [MLlib] Added documentation and sample data file for GaussianMixture
   Travis Galoppo <tjg2107@columbia.edu>
   2015-02-06 10:26:51 -0800
   Commit: f408db6, github.com/apache/spark/pull/4401

   [SPARK-4789] [SPARK-4942] [SPARK-5031] [mllib] Standardize ML Prediction APIs
   Joseph K. Bradley <joseph@databricks.com>
   2015-02-05 23:43:47 -0800
   Commit: 45b95e7, github.com/apache/spark/pull/3637

   [SPARK-5604][MLLIB] remove checkpointDir from trees
   Xiangrui Meng <meng@databricks.com>
   2015-02-05 23:32:09 -0800
   Commit: c35a11e, github.com/apache/spark/pull/4407

   [SPARK-5639][SQL] Support DataFrame.renameColumn.
   Reynold Xin <rxin@databricks.com>
   2015-02-05 23:02:40 -0800
   Commit: 0639d3e, github.com/apache/spark/pull/4410

   Revert "SPARK-5607: Update to Kryo 2.24.0 to avoid including objenesis 1.2."
   Patrick Wendell <patrick@databricks.com>
   2015-02-05 18:37:55 -0800
   Commit: 6d31531

   SPARK-5557: Explicitly include servlet API in dependencies.
   Patrick Wendell <patrick@databricks.com>
   2015-02-05 18:14:54 -0800
   Commit: 34131fd, github.com/apache/spark/pull/4411

   [HOTFIX] [SQL] Disables Metastore Parquet table conversion for "SQLQuerySuite.CTAS with serde"
   Cheng Lian <lian@databricks.com>
   2015-02-05 18:09:18 -0800
   Commit: ce6d8bb, github.com/apache/spark/pull/4413

   [SPARK-5638][SQL] Add a config flag to disable eager analysis of DataFrames
   Reynold Xin <rxin@databricks.com>
   2015-02-05 18:07:10 -0800
   Commit: 4fd67e4, github.com/apache/spark/pull/4408

   [SPARK-5620][DOC] group methods in generated unidoc
   Xiangrui Meng <meng@databricks.com>
   2015-02-05 16:26:51 -0800
   Commit: e2be79d, github.com/apache/spark/pull/4404

   [SPARK-5182] [SPARK-5528] [SPARK-5509] [SPARK-3575] [SQL] Parquet data source improvements
   Cheng Lian <lian@databricks.com>
   2015-02-05 15:29:56 -0800
   Commit: 50c48eb, github.com/apache/spark/pull/4308

   [SPARK-5604][MLLIB] remove checkpointDir from LDA
   Xiangrui Meng <meng@databricks.com>
   2015-02-05 15:07:33 -0800
   Commit: 59798cb, github.com/apache/spark/pull/4390

   [SPARK-5460][MLlib] Wrapped `Try` around `deleteAllCheckpoints` - RandomForest.
   x1- <viva008@gmail.com>
   2015-02-05 15:02:04 -0800
   Commit: 44768f5, github.com/apache/spark/pull/4347

   [SPARK-5135][SQL] Add support for describe table to DDL in SQLContext
   OopsOutOfMemory <victorshengli@126.com>
   2015-02-05 13:07:48 -0800
   Commit: 55cebcf, github.com/apache/spark/pull/4227

   [SPARK-5617][SQL] fix test failure of SQLQuerySuite
   wangfei <wangfei1@huawei.com>
   2015-02-05 12:44:12 -0800
   Commit: 785a2e3, github.com/apache/spark/pull/4395

   [Branch-1.3] [DOC] doc fix for date
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-02-05 12:42:27 -0800
   Commit: 17ef7f9, github.com/apache/spark/pull/4400

   [SPARK-5474][Build]curl should support URL redirection in build/mvn
   GuoQiang Li <witgo@qq.com>
   2015-02-05 12:03:13 -0800
   Commit: d1066e9, github.com/apache/spark/pull/4263

   [HOTFIX] MLlib build break.
   Reynold Xin <rxin@databricks.com>
   2015-02-05 00:42:50 -0800
   Commit: c83d118

   SPARK-5548: Fixed a race condition in AkkaUtilsSuite
   Jacek Lewandowski <lewandowski.jacek@gmail.com>
   2015-02-05 12:00:04 -0800
   Commit: fba2dc6, github.com/apache/spark/pull/4343

   [SPARK-5608] Improve SEO of Spark documentation pages
   Matei Zaharia <matei@databricks.com>
   2015-02-05 11:12:50 -0800
   Commit: de112a2, github.com/apache/spark/pull/4381

   SPARK-4687. Add a recursive option to the addFile API
   Sandy Ryza <sandy@cloudera.com>
   2015-02-05 10:15:55 -0800
   Commit: c22ccc0, github.com/apache/spark/pull/3670

   [MLlib] Minor: UDF style update.
   Reynold Xin <rxin@databricks.com>
   2015-02-04 23:57:53 -0800
   Commit: 4074674, github.com/apache/spark/pull/4388

   [SPARK-5612][SQL] Move DataFrame implicit functions into SQLContext.implicits.
   Reynold Xin <rxin@databricks.com>
   2015-02-04 23:44:34 -0800
   Commit: 0040b61, github.com/apache/spark/pull/4386

   [SPARK-5606][SQL] Support plus sign in HiveContext
   q00251598 <qiyadong@huawei.com>
   2015-02-04 23:16:01 -0800
   Commit: bf43781, github.com/apache/spark/pull/4378

   [SPARK-5599] Check MLlib public APIs for 1.3
   Xiangrui Meng <meng@databricks.com>
   2015-02-04 23:03:47 -0800
   Commit: abc184e, github.com/apache/spark/pull/4377

   [SPARK-5596] [mllib] ML model import/export for GLMs, NaiveBayes
   Joseph K. Bradley <joseph@databricks.com>
   2015-02-04 22:46:48 -0800
   Commit: 885bcbb, github.com/apache/spark/pull/4233

   SPARK-5607: Update to Kryo 2.24.0 to avoid including objenesis 1.2.
   Patrick Wendell <patrick@databricks.com>
   2015-02-04 22:39:44 -0800
   Commit: 59fb5c7, github.com/apache/spark/pull/4383

   [SPARK-5602][SQL] Better support for creating DataFrame from local data collection
   Reynold Xin <rxin@databricks.com>
   2015-02-04 19:53:57 -0800
   Commit: b8f9c00, github.com/apache/spark/pull/4372

   [SPARK-5538][SQL] Fix flaky CachedTableSuite
   Reynold Xin <rxin@databricks.com>
   2015-02-04 19:52:41 -0800
   Commit: 1901b19, github.com/apache/spark/pull/4379

   [SQL][DataFrame] Minor cleanup.
   Reynold Xin <rxin@databricks.com>
   2015-02-04 19:51:48 -0800
   Commit: f05bfa6, github.com/apache/spark/pull/4374

   [SPARK-4520] [SQL] This pr fixes the ArrayIndexOutOfBoundsException as r...
   Sadhan Sood <sadhan@tellapart.com>
   2015-02-04 19:18:06 -0800
   Commit: aa6f4ca, github.com/apache/spark/pull/4148

   [SPARK-5605][SQL][DF] Allow using String to specify column name in DSL aggregate functions
   Reynold Xin <rxin@databricks.com>
   2015-02-04 18:35:51 -0800
   Commit: 478ee3f, github.com/apache/spark/pull/4376

   [SPARK-5411] Allow SparkListeners to be specified in SparkConf and loaded when creating SparkContext
   Josh Rosen <joshrosen@databricks.com>
   2015-02-04 17:18:03 -0800
   Commit: 47e4d57, github.com/apache/spark/pull/4111

   [SPARK-5577] Python udf for DataFrame
   Davies Liu <davies@databricks.com>
   2015-02-04 15:55:09 -0800
   Commit: dc9ead9, github.com/apache/spark/pull/4351

   [SPARK-5118][SQL] Fix: create table test stored as parquet as select ..
   guowei2 <guowei2@asiainfo.com>
   2015-02-04 15:26:10 -0800
   Commit: 06da868, github.com/apache/spark/pull/3921

   [SQL] Use HiveContext's sessionState in HiveMetastoreCatalog.hiveDefaultTableFilePath
   Yin Huai <yhuai@databricks.com>
   2015-02-04 15:22:40 -0800
   Commit: cb4c3e5, github.com/apache/spark/pull/4355

   [SQL] Correct the default size of TimestampType and expose NumericType
   Yin Huai <yhuai@databricks.com>
   2015-02-04 15:14:49 -0800
   Commit: 513bb2c, github.com/apache/spark/pull/4314

   [SQL][Hiveconsole] Bring hive console code up to date and update README.md
   OopsOutOfMemory <victorshengli@126.com>, Sheng, Li <OopsOutOfMemory@users.noreply.github.com>
   2015-02-04 15:13:54 -0800
   Commit: 2cdcfe3, github.com/apache/spark/pull/4330

   [SPARK-5367][SQL] Support star expression in udfs
   wangfei <wangfei1@huawei.com>, scwf <wangfei1@huawei.com>
   2015-02-04 15:12:07 -0800
   Commit: 8b803f6, github.com/apache/spark/pull/4353

   [SPARK-5426][SQL] Add SparkSQL Java API helper methods.
   kul <kuldeep.bora@gmail.com>
   2015-02-04 15:08:37 -0800
   Commit: 38ab92e, github.com/apache/spark/pull/4243

   [SPARK-5587][SQL] Support change database owner
   wangfei <wangfei1@huawei.com>
   2015-02-04 14:35:12 -0800
   Commit: 7920791, github.com/apache/spark/pull/4357

   [SPARK-5591][SQL] Fix NoSuchObjectException for CTAS
   wangfei <wangfei1@huawei.com>
   2015-02-04 14:33:07 -0800
   Commit: c79dd1e, github.com/apache/spark/pull/4365

   [SPARK-4939] move to next locality when no pending tasks
   Davies Liu <davies@databricks.com>
   2015-02-04 14:22:07 -0800
   Commit: f9bb3cb, github.com/apache/spark/pull/3779

   [SPARK-4707][STREAMING] Reliable Kafka Receiver can lose data if the blo...
   Hari Shreedharan <hshreedharan@apache.org>
   2015-02-04 14:20:44 -0800
   Commit: 14c9f32, github.com/apache/spark/pull/3655

   [SPARK-4964] [Streaming] Exactly-once semantics for Kafka
   cody koeninger <cody@koeninger.org>
   2015-02-04 12:06:34 -0800
   Commit: a119cae, github.com/apache/spark/pull/3798

   [SPARK-5588] [SQL] support select/filter by SQL expression
   Davies Liu <davies@databricks.com>
   2015-02-04 11:34:46 -0800
   Commit: 950a0d3, github.com/apache/spark/pull/4359

   [SPARK-5585] Flaky test in MLlib python
   Davies Liu <davies@databricks.com>
   2015-02-04 08:54:20 -0800
   Commit: 84c6273, github.com/apache/spark/pull/4358

   [SPARK-5574] use given name prefix in dir
   Imran Rashid <irashid@cloudera.com>
   2015-02-04 01:02:20 -0800
   Commit: 5d9278a, github.com/apache/spark/pull/4344

   [Minor] Fix incorrect warning log
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-04 00:52:41 -0800
   Commit: 316a4bb, github.com/apache/spark/pull/4360

   [SPARK-5379][Streaming] Add awaitTerminationOrTimeout
   zsxwing <zsxwing@gmail.com>
   2015-02-04 00:40:28 -0800
   Commit: 4d3dbfd, github.com/apache/spark/pull/4171

   [SPARK-5341] Use maven coordinates as dependencies in spark-shell and spark-submit
   Burak Yavuz <brkyvz@gmail.com>
   2015-02-03 22:39:17 -0800
   Commit: 3b7acd2, github.com/apache/spark/pull/4215

   [SPARK-4939] revive offers periodically in LocalBackend
   Davies Liu <davies@databricks.com>
   2015-02-03 22:30:23 -0800
   Commit: e196da8, github.com/apache/spark/pull/4147

   [SPARK-4969][STREAMING][PYTHON] Add binaryRecords to streaming
   freeman <the.freeman.lab@gmail.com>
   2015-02-03 22:24:30 -0800
   Commit: 9a33f89, github.com/apache/spark/pull/3803

   [SPARK-5579][SQL][DataFrame] Support for project/filter using SQL expressions
   Reynold Xin <rxin@databricks.com>
   2015-02-03 22:15:35 -0800
   Commit: cb7f783, github.com/apache/spark/pull/4348

   [FIX][MLLIB] fix seed handling in Python GMM
   Xiangrui Meng <meng@databricks.com>
   2015-02-03 20:39:11 -0800
   Commit: 679228b, github.com/apache/spark/pull/4349

   [SPARK-4795][Core] Redesign the "primitive type => Writable" implicit APIs to make them be activated automatically
   zsxwing <zsxwing@gmail.com>
   2015-02-03 20:17:12 -0800
   Commit: 5c63e05, github.com/apache/spark/pull/3642

   [SPARK-5578][SQL][DataFrame] Provide a convenient way for Scala users to use UDFs
   Reynold Xin <rxin@databricks.com>
   2015-02-03 20:07:46 -0800
   Commit: b22d5b5, github.com/apache/spark/pull/4345

   [SPARK-5520][MLlib] Make FP-Growth implementation take generic item types (WIP)
   Jacky Li <jacky.likun@huawei.com>, Jacky Li <jackylk@users.noreply.github.com>, Xiangrui Meng <meng@databricks.com>
   2015-02-03 17:02:42 -0800
   Commit: 298ef5b, github.com/apache/spark/pull/4340

   [SPARK-5554] [SQL] [PySpark] add more tests for DataFrame Python API
   Davies Liu <davies@databricks.com>
   2015-02-03 16:01:56 -0800
   Commit: 4640623, github.com/apache/spark/pull/4331

   [STREAMING] SPARK-4986 Wait for receivers to deregister and receiver job to terminate
   Jesper Lundgren <jesper.lundgren@vpon.com>
   2015-02-03 14:53:39 -0800
   Commit: 092d4ba, github.com/apache/spark/pull/4338

   [SPARK-5153][Streaming][Test] Increased timeout to deal with flaky KafkaStreamSuite
   Tathagata Das <tathagata.das1565@gmail.com>
   2015-02-03 13:46:02 -0800
   Commit: d644bd9, github.com/apache/spark/pull/4342

   [SPARK-4508] [SQL] build native date type to conform behavior to Hive
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-02-03 12:21:45 -0800
   Commit: 6e244cf, github.com/apache/spark/pull/4325

   [SPARK-5383][SQL] Support alias for udtfs
   wangfei <wangfei1@huawei.com>, scwf <wangfei1@huawei.com>, Fei Wang <wangfei1@huawei.com>
   2015-02-03 12:16:31 -0800
   Commit: 5dbeb21, github.com/apache/spark/pull/4186

   [SPARK-5550] [SQL] Support the case insensitive for UDF
   Cheng Hao <hao.cheng@intel.com>
   2015-02-03 12:12:26 -0800
   Commit: 654c992, github.com/apache/spark/pull/4326

   [SPARK-4987] [SQL] parquet timestamp type support
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-02-03 12:06:06 -0800
   Commit: 67d5220, github.com/apache/spark/pull/3820

   [SQL] DataFrame API update
   Reynold Xin <rxin@databricks.com>
   2015-02-03 10:34:56 -0800
   Commit: 4204a12, github.com/apache/spark/pull/4332

   Minor: Fix TaskContext deprecated annotations.
   Reynold Xin <rxin@databricks.com>
   2015-02-03 10:34:16 -0800
   Commit: f7948f3, github.com/apache/spark/pull/4333

   [SPARK-5549] Define TaskContext interface in Scala.
   Reynold Xin <rxin@databricks.com>
   2015-02-03 00:46:04 -0800
   Commit: bebf4c4, github.com/apache/spark/pull/4324

   [SPARK-5551][SQL] Create type alias for SchemaRDD for source backward compatibility
   Reynold Xin <rxin@databricks.com>
   2015-02-03 00:29:23 -0800
   Commit: 523a935, github.com/apache/spark/pull/4327

   [SQL][DataFrame] Remove DataFrameApi, ExpressionApi, and GroupedDataFrameApi
   Reynold Xin <rxin@databricks.com>
   2015-02-03 00:29:04 -0800
   Commit: 37df330, github.com/apache/spark/pull/4328

   [minor] update streaming linear algorithms
   Xiangrui Meng <meng@databricks.com>
   2015-02-03 00:14:43 -0800
   Commit: 659329f, github.com/apache/spark/pull/4329

   [SPARK-1405] [mllib] Latent Dirichlet Allocation (LDA) using EM
   Xiangrui Meng <meng@databricks.com>
   2015-02-02 23:57:35 -0800
   Commit: 980764f, github.com/apache/spark/pull/2388

   [SPARK-5536] replace old ALS implementation by the new one
   Xiangrui Meng <meng@databricks.com>
   2015-02-02 23:49:09 -0800
   Commit: 0cc7b88, github.com/apache/spark/pull/4321

   [SPARK-5414] Add SparkFirehoseListener class for consuming all SparkListener events
   Josh Rosen <joshrosen@databricks.com>
   2015-02-02 23:35:07 -0800
   Commit: b8ebebe, github.com/apache/spark/pull/4210

   [SPARK-5501][SPARK-5420][SQL] Write support for the data source API
   Yin Huai <yhuai@databricks.com>
   2015-02-02 23:30:44 -0800
   Commit: 13531dd, github.com/apache/spark/pull/4294

   [SPARK-5012][MLLib][PySpark]Python API for Gaussian Mixture Model
   FlytxtRnD <meethu.mathew@flytxt.com>
   2015-02-02 23:04:55 -0800
   Commit: 50a1a87, github.com/apache/spark/pull/4059

   [SPARK-3778] newAPIHadoopRDD doesn't properly pass credentials for secure hdfs
   Thomas Graves <tgraves@apache.org>
   2015-02-02 22:45:55 -0800
   Commit: c31c36c, github.com/apache/spark/pull/4292

   [SPARK-4979][MLLIB] Streaming logistic regression
   freeman <the.freeman.lab@gmail.com>
   2015-02-02 22:42:15 -0800
   Commit: eb0da6c, github.com/apache/spark/pull/4306

   [SPARK-5219][Core] Add locks to avoid scheduling race conditions
   zsxwing <zsxwing@gmail.com>
   2015-02-02 21:42:18 -0800
   Commit: c306555, github.com/apache/spark/pull/4019

   [Doc] Minor: Fixes several formatting issues
   Cheng Lian <lian@databricks.com>
   2015-02-02 21:14:21 -0800
   Commit: 60f67e7, github.com/apache/spark/pull/4316

   SPARK-3996: Add jetty servlet and continuations.
   Patrick Wendell <patrick@databricks.com>
   2015-02-02 21:01:36 -0800
   Commit: 7930d2b, github.com/apache/spark/pull/4323

   SPARK-5542: Decouple publishing, packaging, and tagging in release script
   Patrick Wendell <patrick@databricks.com>, Patrick Wendell <pwendell@gmail.com>
   2015-02-02 21:00:30 -0800
   Commit: 0ef38f5, github.com/apache/spark/pull/4319

   [SPARK-5543][WebUI] Remove unused import JsonUtil from JsonProtocol
   nemccarthy <nathan@nemccarthy.me>
   2015-02-02 20:03:13 -0800
   Commit: cb39f12, github.com/apache/spark/pull/4320

   [SPARK-5472][SQL] A JDBC data source for Spark SQL.
   Tor Myklebust <tmyklebu@gmail.com>
   2015-02-02 19:50:14 -0800
   Commit: 8f471a6, github.com/apache/spark/pull/4261

   [SPARK-5512][Mllib] Run the PIC algorithm with initial vector suggested by the PIC paper
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-02 19:34:25 -0800
   Commit: 1bcd465, github.com/apache/spark/pull/4301

   [SPARK-5154] [PySpark] [Streaming] Kafka streaming support in Python
   Davies Liu <davies@databricks.com>, Tathagata Das <tdas@databricks.com>
   2015-02-02 19:16:27 -0800
   Commit: 0561c45, github.com/apache/spark/pull/3715

   [SQL] Improve DataFrame API error reporting
   Reynold Xin <rxin@databricks.com>, Davies Liu <davies@databricks.com>
   2015-02-02 19:01:47 -0800
   Commit: 554403f, github.com/apache/spark/pull/4296

   Revert "[SPARK-4508] [SQL] build native date type to conform behavior to Hive"
   Patrick Wendell <patrick@databricks.com>
   2015-02-02 17:52:17 -0800
   Commit: eccb9fb

   Spark 3883: SSL support for HttpServer and Akka
   Jacek Lewandowski <lewandowski.jacek@gmail.com>, Jacek Lewandowski <jacek.lewandowski@datastax.com>
   2015-02-02 17:18:54 -0800
   Commit: cfea300, github.com/apache/spark/pull/3571

   [SPARK-5540] hide ALS.solveLeastSquares
   Xiangrui Meng <meng@databricks.com>
   2015-02-02 17:10:01 -0800
   Commit: ef65cf0, github.com/apache/spark/pull/4318

   [SPARK-5534] [graphx] Graph getStorageLevel fix
   Joseph K. Bradley <joseph@databricks.com>
   2015-02-02 17:02:29 -0800
   Commit: f133dec, github.com/apache/spark/pull/4317

   [SPARK-5514] DataFrame.collect should call executeCollect
   Reynold Xin <rxin@databricks.com>
   2015-02-02 16:55:36 -0800
   Commit: 8aa3cff, github.com/apache/spark/pull/4313

   [SPARK-5195][sql]Update HiveMetastoreCatalog.scala(override the MetastoreRelation's sameresult method only compare databasename and table name)
   seayi <405078363@qq.com>, Michael Armbrust <michael@databricks.com>
   2015-02-02 16:06:52 -0800
   Commit: dca6faa, github.com/apache/spark/pull/3898

   [SPARK-2309][MLlib] Multinomial Logistic Regression
   DB Tsai <dbtsai@alpinenow.com>
   2015-02-02 15:59:15 -0800
   Commit: b1aa8fe, github.com/apache/spark/pull/3833

   [SPARK-5513][MLLIB] Add nonnegative option to ml's ALS
   Xiangrui Meng <meng@databricks.com>
   2015-02-02 15:55:44 -0800
   Commit: 46d50f1, github.com/apache/spark/pull/4302

   [SPARK-4508] [SQL] build native date type to conform behavior to Hive
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-02-02 15:49:22 -0800
   Commit: 1646f89, github.com/apache/spark/pull/3732

   SPARK-5500. Document that feeding hadoopFile into a shuffle operation wi...
   Sandy Ryza <sandy@cloudera.com>
   2015-02-02 14:52:46 -0800
   Commit: 8309349, github.com/apache/spark/pull/4293

   [SPARK-5461] [graphx] Add isCheckpointed, getCheckpointedFiles methods to Graph
   Joseph K. Bradley <joseph@databricks.com>
   2015-02-02 14:34:48 -0800
   Commit: 842d000, github.com/apache/spark/pull/4253

   SPARK-5425: Use synchronised methods in system properties to create SparkConf
   Jacek Lewandowski <lewandowski.jacek@gmail.com>
   2015-02-02 14:07:19 -0800
   Commit: 5a55261, github.com/apache/spark/pull/4222

   Disabling Utils.chmod700 for Windows
   Martin Weindel <martin.weindel@gmail.com>, mweindel <m.weindel@usu-software.de>
   2015-02-02 13:46:18 -0800
   Commit: bff65b5, github.com/apache/spark/pull/4299

   Make sure only owner can read / write to directories created for the job.
   Josh Rosen <joshrosen@databricks.com>
   2015-01-21 14:38:14 -0800
   Commit: 52f5754

   [HOTFIX] Add jetty references to build for YARN module.
   Patrick Wendell <patrick@databricks.com>
   2015-02-02 14:00:14 -0800
   Commit: 2321dd1

   [SPARK-4631][streaming][FIX] Wait for a receiver to start before publishing test data.
   Iulian Dragos <jaguarul@gmail.com>
   2015-02-02 14:00:33 -0800
   Commit: e908322, github.com/apache/spark/pull/4270

   [SPARK-5212][SQL] Add support of schema-less, custom field delimiter and SerDe for HiveQL transform
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-02 13:53:55 -0800
   Commit: 683e938, github.com/apache/spark/pull/4014

   [SPARK-5530] Add executor container to executorIdToContainer
   Xutingjun <1039320815@qq.com>
   2015-02-02 12:37:51 -0800
   Commit: 62a93a1, github.com/apache/spark/pull/4309

   [Docs] Fix Building Spark link text
   Nicholas Chammas <nicholas.chammas@gmail.com>
   2015-02-02 12:33:49 -0800
   Commit: 3f941b6, github.com/apache/spark/pull/4312

   [SPARK-5173]support python application running on yarn cluster mode
   lianhuiwang <lianhuiwang09@gmail.com>, Wang Lianhui <lianhuiwang09@gmail.com>
   2015-02-02 12:32:28 -0800
   Commit: f5e6375, github.com/apache/spark/pull/3976

   SPARK-4585. Spark dynamic executor allocation should use minExecutors as...
   Sandy Ryza <sandy@cloudera.com>
   2015-02-02 12:27:08 -0800
   Commit: b2047b5, github.com/apache/spark/pull/4051

   [MLLIB] SPARK-5491 (ex SPARK-1473): Chi-square feature selection
   Alexander Ulanov <nashb@yandex.ru>
   2015-02-02 12:13:05 -0800
   Commit: c081b21, github.com/apache/spark/pull/1484

   SPARK-5492. Thread statistics can break with older Hadoop versions
   Sandy Ryza <sandy@cloudera.com>
   2015-02-02 00:54:06 -0800
   Commit: 6f34131, github.com/apache/spark/pull/4305

   [SPARK-5478][UI][Minor] Add missing right parentheses
   jerryshao <saisai.shao@intel.com>
   2015-02-01 23:56:13 -0800
   Commit: 63dfe21, github.com/apache/spark/pull/4267

   [SPARK-5353] Log failures in REPL class loading
   Tobias Schlatter <tobias@meisch.ch>
   2015-02-01 21:43:49 -0800
   Commit: 9f0a6e1, github.com/apache/spark/pull/4130

   [SPARK-3996]: Shade Jetty in Spark deliverables
   Patrick Wendell <patrick@databricks.com>
   2015-02-01 21:13:57 -0800
   Commit: a15f6e3, github.com/apache/spark/pull/4285

   [SPARK-4001][MLlib] adding parallel FP-Growth algorithm for frequent pattern mining in MLlib
   Jacky Li <jacky.likun@huawei.com>, Jacky Li <jackylk@users.noreply.github.com>, Xiangrui Meng <meng@databricks.com>
   2015-02-01 20:07:25 -0800
   Commit: 859f724, github.com/apache/spark/pull/2847

   [Spark-5406][MLlib] LocalLAPACK mode in RowMatrix.computeSVD should have much smaller upper bound
   Yuhao Yang <hhbyyh@gmail.com>
   2015-02-01 19:40:26 -0800
   Commit: d85cd4e, github.com/apache/spark/pull/4200

   [SPARK-5465] [SQL] Fixes filter push-down for Parquet data source
   Cheng Lian <lian@databricks.com>
   2015-02-01 18:52:39 -0800
   Commit: ec10032, github.com/apache/spark/pull/4255

   [SPARK-5262] [SPARK-5244] [SQL] add coalesce in SQLParser and widen types for parameters of coalesce
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-02-01 18:51:38 -0800
   Commit: 8cf4a1f, github.com/apache/spark/pull/4057

   [SPARK-5196][SQL] Support `comment` in Create Table Field DDL
   OopsOutOfMemory <victorshengli@126.com>
   2015-02-01 18:41:49 -0800
   Commit: 1b56f1d, github.com/apache/spark/pull/3999

   [SPARK-1825] Make Windows Spark client work fine with Linux YARN cluster
   Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
   2015-02-01 18:26:28 -0800
   Commit: 7712ed5, github.com/apache/spark/pull/3943

   [SPARK-5176] The thrift server does not support cluster mode
   Tom Panning <tom.panning@nextcentury.com>
   2015-02-01 17:57:31 -0800
   Commit: 1ca0a10, github.com/apache/spark/pull/4137

   [SPARK-5155] Build fails with spark-ganglia-lgpl profile
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-02-01 17:53:56 -0800
   Commit: c80194b, github.com/apache/spark/pull/4303

   [Minor][SQL] Little refactor DataFrame related codes
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-02-01 17:52:18 -0800
   Commit: ef89b82, github.com/apache/spark/pull/4298

   [SPARK-4859][Core][Streaming] Refactor LiveListenerBus and StreamingListenerBus
   zsxwing <zsxwing@gmail.com>
   2015-02-01 17:47:51 -0800
   Commit: 883bc88, github.com/apache/spark/pull/4006

   [SPARK-5424][MLLIB] make the new ALS impl take generic ID types
   Xiangrui Meng <meng@databricks.com>
   2015-02-01 14:13:31 -0800
   Commit: 4a17122, github.com/apache/spark/pull/4281

   [SPARK-5207] [MLLIB] StandardScalerModel mean and variance re-use
   Octavian Geagla <ogeagla@gmail.com>
   2015-02-01 09:21:14 -0800
   Commit: bdb0680, github.com/apache/spark/pull/4140

   [SPARK-5422] Add support for sending Graphite metrics via UDP
   Ryan Williams <ryan.blake.williams@gmail.com>
   2015-01-31 23:41:05 -0800
   Commit: 80bd715, github.com/apache/spark/pull/4218

   SPARK-3359 [CORE] [DOCS] `sbt/sbt unidoc` doesn't work with Java 8
   Sean Owen <sowen@cloudera.com>
   2015-01-31 10:40:42 -0800
   Commit: c84d5a1, github.com/apache/spark/pull/4193

   [SPARK-3975] Added support for BlockMatrix addition and multiplication
   Burak Yavuz <brkyvz@gmail.com>, Burak Yavuz <brkyvz@dn51t42l.sunet>, Burak Yavuz <brkyvz@dn51t4rd.sunet>, Burak Yavuz <brkyvz@dn0a221430.sunet>, Burak Yavuz <brkyvz@dn0a22b17d.sunet>
   2015-01-31 00:47:30 -0800
   Commit: ef8974b, github.com/apache/spark/pull/4274

   [MLLIB][SPARK-3278] Monotone (Isotonic) regression using parallel pool adjacent violators algorithm
   martinzapletal <zapletal-martin@email.cz>, Xiangrui Meng <meng@databricks.com>, Martin Zapletal <zapletal-martin@email.cz>
   2015-01-31 00:46:02 -0800
   Commit: 34250a6, github.com/apache/spark/pull/3519

   [SPARK-5307] Add a config option for SerializationDebugger.
   Reynold Xin <rxin@databricks.com>
   2015-01-31 00:06:36 -0800
   Commit: 6364083, github.com/apache/spark/pull/4297

   [SQL] remove redundant field "childOutput" from execution.Aggregate, use child.output instead
   kai <kaizeng@eecs.berkeley.edu>
   2015-01-30 23:19:10 -0800
   Commit: f54c9f6, github.com/apache/spark/pull/4291

   [SPARK-5307] SerializationDebugger
   Reynold Xin <rxin@databricks.com>
   2015-01-30 22:34:10 -0800
   Commit: 740a568, github.com/apache/spark/pull/4098

   [SPARK-5504] [sql] convertToCatalyst should support nested arrays
   Joseph K. Bradley <joseph@databricks.com>
   2015-01-30 15:40:14 -0800
   Commit: e643de4, github.com/apache/spark/pull/4295

   SPARK-5400 [MLlib] Changed name of GaussianMixtureEM to GaussianMixture
   Travis Galoppo <tjg2107@columbia.edu>
   2015-01-30 15:32:25 -0800
   Commit: 9869773, github.com/apache/spark/pull/4290

   [SPARK-4259][MLlib]: Add Power Iteration Clustering Algorithm with Gaussian Similarity Function
   sboeschhuawei <stephen.boesch@huawei.com>, Fan Jiang <fanjiang.sc@huawei.com>, Jiang Fan <fjiang6@gmail.com>, Stephen Boesch <stephen.boesch@huawei.com>, Xiangrui Meng <meng@databricks.com>
   2015-01-30 14:09:49 -0800
   Commit: f377431, github.com/apache/spark/pull/4254

   [SPARK-5486] Added validate method to BlockMatrix
   Burak Yavuz <brkyvz@gmail.com>
   2015-01-30 13:59:10 -0800
   Commit: 6ee8338, github.com/apache/spark/pull/4279

   [SPARK-5496][MLLIB] Allow both classification and Classification in Algo for trees.
   Xiangrui Meng <meng@databricks.com>
   2015-01-30 10:08:07 -0800
   Commit: 0a95085, github.com/apache/spark/pull/4287

   [MLLIB] SPARK-4846: throw a RuntimeException and give users hints to increase the minCount
   Joseph J.C. Tang <jinntrance@gmail.com>
   2015-01-30 10:07:26 -0800
   Commit: 54d9575, github.com/apache/spark/pull/4247

   SPARK-5393. Flood of util.RackResolver log messages after SPARK-1714
   Sandy Ryza <sandy@cloudera.com>
   2015-01-30 11:31:54 -0600
   Commit: 254eaa4, github.com/apache/spark/pull/4192

   [SPARK-5457][SQL] Add missing DSL for ApproxCountDistinct.
   Takuya UESHIN <ueshin@happy-camper.st>
   2015-01-30 01:21:35 -0800
   Commit: 6f21dce, github.com/apache/spark/pull/4250

   [SPARK-5094][MLlib] Add Python API for Gradient Boosted Trees
   Kazuki Taniguchi <kazuki.t.1018@gmail.com>
   2015-01-30 00:39:44 -0800
   Commit: bc1fc9b, github.com/apache/spark/pull/3951

   [SPARK-5322] Added transpose functionality to BlockMatrix
   Burak Yavuz <brkyvz@gmail.com>
   2015-01-29 21:26:29 -0800
   Commit: dd4d84c, github.com/apache/spark/pull/4275

   [SQL] Support df("*") to select all columns in a data frame.
   Reynold Xin <rxin@databricks.com>
   2015-01-29 19:09:08 -0800
   Commit: 80def9d, github.com/apache/spark/pull/4283

   [SPARK-5462] [SQL] Use analyzed query plan in DataFrame.apply()
   Josh Rosen <joshrosen@databricks.com>
   2015-01-29 18:23:05 -0800
   Commit: 22271f9, github.com/apache/spark/pull/4282

   [SPARK-5395] [PySpark] fix python process leak while coalesce()
   Davies Liu <davies@databricks.com>
   2015-01-29 17:28:37 -0800
   Commit: 5c746ee, github.com/apache/spark/pull/4238

   [SQL] DataFrame API improvements
   Reynold Xin <rxin@databricks.com>
   2015-01-29 17:24:00 -0800
   Commit: ce9c43b, github.com/apache/spark/pull/4280

   Revert "[WIP] [SPARK-3996]: Shade Jetty in Spark deliverables"
   Patrick Wendell <patrick@databricks.com>
   2015-01-29 17:14:27 -0800
   Commit: d2071e8

   remove 'return'
   Yoshihiro Shimizu <shimizu@amoad.com>
   2015-01-29 16:55:00 -0800
   Commit: 5338772, github.com/apache/spark/pull/4268

   [WIP] [SPARK-3996]: Shade Jetty in Spark deliverables
   Patrick Wendell <patrick@databricks.com>
   2015-01-29 16:31:19 -0800
   Commit: f240fe3, github.com/apache/spark/pull/4252

   [SPARK-5464] Fix help() for Python DataFrame instances
   Josh Rosen <joshrosen@databricks.com>
   2015-01-29 16:23:20 -0800
   Commit: 0bb15f2, github.com/apache/spark/pull/4278

   [SPARK-4296][SQL] Trims aliases when resolving and checking aggregate expressions
   Yin Huai <yhuai@databricks.com>, Cheng Lian <lian@databricks.com>
   2015-01-29 15:49:34 -0800
   Commit: c00d517, github.com/apache/spark/pull/4010

   [SPARK-5373][SQL] Literal in agg grouping expressions leads to incorrect result
   wangfei <wangfei1@huawei.com>
   2015-01-29 15:47:13 -0800
   Commit: c1b3eeb, github.com/apache/spark/pull/4169

   [SPARK-5367][SQL] Support star expression in udf
   wangfei <wangfei1@huawei.com>, scwf <wangfei1@huawei.com>
   2015-01-29 15:44:53 -0800
   Commit: fbaf9e0, github.com/apache/spark/pull/4163

   [SPARK-4786][SQL]: Parquet filter pushdown for castable types
   Yash Datta <Yash.Datta@guavus.com>
   2015-01-29 15:42:23 -0800
   Commit: de221ea, github.com/apache/spark/pull/4156

   [SPARK-5309][SQL] Add support for dictionaries in PrimitiveConverter for Strin...
   Michael Davies <Michael.BellDavies@gmail.com>
   2015-01-29 15:40:59 -0800
   Commit: 940f375, github.com/apache/spark/pull/4187

   [SPARK-5429][SQL] Use javaXML plan serialization for Hive golden answers on Hive 0.13.1
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-01-29 15:28:22 -0800
   Commit: bce0ba1, github.com/apache/spark/pull/4223

   [SPARK-5445][SQL] Consolidate Java and Scala DSL static methods.
   Reynold Xin <rxin@databricks.com>
   2015-01-29 15:13:09 -0800
   Commit: 7156322, github.com/apache/spark/pull/4276

   [SPARK-5466] Add explicit guava dependencies where needed.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-01-29 13:00:45 -0800
   Commit: f9e5694, github.com/apache/spark/pull/4272

   [SPARK-5477] refactor stat.py
   Xiangrui Meng <meng@databricks.com>
   2015-01-29 10:11:44 -0800
   Commit: a3dc618, github.com/apache/spark/pull/4266

   [SQL] Various DataFrame DSL update.
   Reynold Xin <rxin@databricks.com>
   2015-01-29 00:01:10 -0800
   Commit: 5ad78f6, github.com/apache/spark/pull/4260

   [SPARK-3977] Conversion methods for BlockMatrix to other Distributed Matrices
   Burak Yavuz <brkyvz@gmail.com>
   2015-01-28 23:42:07 -0800
   Commit: a63be1a, github.com/apache/spark/pull/4256

   [SPARK-5445][SQL] Made DataFrame dsl usable in Java
   Reynold Xin <rxin@databricks.com>
   2015-01-28 19:10:32 -0800
   Commit: 5b9760d, github.com/apache/spark/pull/4241

   [SPARK-5430] move treeReduce and treeAggregate from mllib to core
   Xiangrui Meng <meng@databricks.com>
   2015-01-28 17:26:03 -0800
   Commit: 4ee79c7, github.com/apache/spark/pull/4228

   [SPARK-4586][MLLIB] Python API for ML pipeline and parameters
   Xiangrui Meng <meng@databricks.com>, Davies Liu <davies@databricks.com>
   2015-01-28 17:14:23 -0800
   Commit: e80dc1c, github.com/apache/spark/pull/4151

   [SPARK-5441][pyspark] Make SerDeUtil PairRDD to Python conversions more robust
   Michael Nazario <mnazario@palantir.com>
   2015-01-28 13:55:01 -0800
   Commit: e023112, github.com/apache/spark/pull/4236

   [SPARK-4387][PySpark] Refactoring python profiling code to make it extensible
   Yandu Oppacher <yandu.oppacher@jadedpixel.com>, Davies Liu <davies@databricks.com>
   2015-01-28 13:48:06 -0800
   Commit: 3bead67, github.com/apache/spark/pull/3255

   [SPARK-5417] Remove redundant executor-id set() call
   Ryan Williams <ryan.blake.williams@gmail.com>
   2015-01-28 13:04:52 -0800
   Commit: a731314, github.com/apache/spark/pull/4213

   [SPARK-5434] [EC2] Preserve spaces in EC2 path
   Nicholas Chammas <nicholas.chammas@gmail.com>
   2015-01-28 12:56:03 -0800
   Commit: d44ee43, github.com/apache/spark/pull/4224

   [SPARK-5437] Fix DriverSuite and SparkSubmitSuite timeout issues
   Andrew Or <andrew@databricks.com>
   2015-01-28 12:52:31 -0800
   Commit: 84b6ecd, github.com/apache/spark/pull/4230

   [SPARK-4955]With executor dynamic scaling enabled, executor should be added or killed in yarn-cluster mode.
   lianhuiwang <lianhuiwang09@gmail.com>
   2015-01-28 12:50:57 -0800
   Commit: 81f8f34, github.com/apache/spark/pull/3962

   [SPARK-5440][pyspark] Add toLocalIterator to pyspark rdd
   Michael Nazario <mnazario@palantir.com>
   2015-01-28 12:47:12 -0800
   Commit: 456c11f, github.com/apache/spark/pull/4237

   SPARK-1934 [CORE] "this" reference escape to "selectorThread" during construction in ConnectionManager
   Sean Owen <sowen@cloudera.com>
   2015-01-28 12:44:35 -0800
   Commit: 9b18009, github.com/apache/spark/pull/4225

   [SPARK-5188][BUILD] make-distribution.sh should support curl, not only wget to get Tachyon
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-01-28 12:43:22 -0800
   Commit: e902dc4, github.com/apache/spark/pull/3988

   SPARK-5458. Refer to aggregateByKey instead of combineByKey in docs
   Sandy Ryza <sandy@cloudera.com>
   2015-01-28 12:41:23 -0800
   Commit: 406f6d3, github.com/apache/spark/pull/4251

   [SPARK-5447][SQL] Replaced reference to SchemaRDD with DataFrame.
   Reynold Xin <rxin@databricks.com>
   2015-01-28 12:10:01 -0800
   Commit: c8e934e, github.com/apache/spark/pull/4242

   [SPARK-5361]Multiple Java RDD <-> Python RDD conversions not working correctly
   Winston Chen <wchen@quid.com>
   2015-01-28 11:08:44 -0800
   Commit: 453d799, github.com/apache/spark/pull/4146

   [SPARK-5291][CORE] Add timestamp and reason why an executor is removed to SparkListenerExecutorAdded and SparkListenerExecutorRemoved
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-01-28 11:02:51 -0800
   Commit: 0b35fcd, github.com/apache/spark/pull/4082

   [SPARK-3974][MLlib] Distributed Block Matrix Abstractions
   Burak Yavuz <brkyvz@gmail.com>, Xiangrui Meng <meng@databricks.com>, Burak Yavuz <brkyvz@dn51t42l.sunet>, Burak Yavuz <brkyvz@dn51t4rd.sunet>, Burak Yavuz <brkyvz@dn0a221430.sunet>
   2015-01-28 10:06:37 -0800
   Commit: eeb53bf, github.com/apache/spark/pull/3200

   MAINTENANCE: Automated closing of pull requests.
   Patrick Wendell <patrick@databricks.com>
   2015-01-28 02:15:14 -0800
   Commit: 622ff09, github.com/apache/spark/pull/1480

   [SPARK-5415] bump sbt version to 0.13.7
   Ryan Williams <ryan.blake.williams@gmail.com>
   2015-01-28 02:13:06 -0800
   Commit: 661d3f9, github.com/apache/spark/pull/4211

   [SPARK-4809] Rework Guava library shading.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-01-28 00:29:29 -0800
   Commit: 37a5e27, github.com/apache/spark/pull/3658

   [SPARK-5097][SQL] Test cases for DataFrame expressions.
   Reynold Xin <rxin@databricks.com>
   2015-01-27 18:10:49 -0800
   Commit: d743732, github.com/apache/spark/pull/4235

   [SPARK-5097][SQL] DataFrame
   Reynold Xin <rxin@databricks.com>, Davies Liu <davies@databricks.com>
   2015-01-27 16:08:24 -0800
   Commit: 119f45d, github.com/apache/spark/pull/4173

   SPARK-5199. FS read metrics should support CombineFileSplits and track bytes from all FSs
   Sandy Ryza <sandy@cloudera.com>
   2015-01-27 15:42:55 -0800
   Commit: b1b35ca, github.com/apache/spark/pull/4050

   [MLlib] fix python example of ALS in guide
   Davies Liu <davies@databricks.com>
   2015-01-27 15:33:01 -0800
   Commit: fdaad4e, github.com/apache/spark/pull/4226

   SPARK-5308 [BUILD] MD5 / SHA1 hash format doesn't match standard Maven output
   Sean Owen <sowen@cloudera.com>
   2015-01-27 10:22:50 -0800
   Commit: ff356e2, github.com/apache/spark/pull/4161

   [SPARK-5321] Support for transposing local matrices
   Burak Yavuz <brkyvz@gmail.com>
   2015-01-27 01:46:17 -0800
   Commit: 9142674, github.com/apache/spark/pull/4109

   [SPARK-5419][Mllib] Fix the logic in Vectors.sqdist
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-01-27 01:29:14 -0800
   Commit: 7b0ed79, github.com/apache/spark/pull/4217

   [SPARK-3726] [MLlib] Allow sampling_rate not equal to 1.0 in RandomForests
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-01-26 19:46:17 -0800
   Commit: d6894b1, github.com/apache/spark/pull/4073

   [SPARK-5119] java.lang.ArrayIndexOutOfBoundsException on trying to train...
   lewuathe <lewuathe@me.com>
   2015-01-26 18:03:21 -0800
   Commit: f2ba5c6, github.com/apache/spark/pull/3975

   [SPARK-5052] Add common/base classes to fix guava methods signatures.
   Elmer Garduno <elmerg@google.com>
   2015-01-26 17:40:48 -0800
   Commit: 661e0fc, github.com/apache/spark/pull/3874

   SPARK-960 [CORE] [TEST] JobCancellationSuite "two jobs sharing the same stage" is broken
   Sean Owen <sowen@cloudera.com>
   2015-01-26 14:32:27 -0800
   Commit: 0497ea5, github.com/apache/spark/pull/4180

   Fix command spaces issue in make-distribution.sh
   David Y. Ross <dyross@gmail.com>
   2015-01-26 14:26:10 -0800
   Commit: b38034e, github.com/apache/spark/pull/4126

   SPARK-4147 [CORE] Reduce log4j dependency
   Sean Owen <sowen@cloudera.com>
   2015-01-26 14:23:42 -0800
   Commit: 54e7b45, github.com/apache/spark/pull/4190

   [SPARK-5339][BUILD] build/mvn doesn't work because of invalid URL for maven's tgz.
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-01-26 13:07:49 -0800
   Commit: c094c73, github.com/apache/spark/pull/4124

   [SPARK-5355] use j.u.c.ConcurrentHashMap instead of TrieMap
   Davies Liu <davies@databricks.com>
   2015-01-26 12:51:32 -0800
   Commit: 1420931, github.com/apache/spark/pull/4208

   [SPARK-5384][mllib] Vectors.sqdist returns inconsistent results for sparse/dense vectors when the vectors have different lengths
   Yuhao Yang <hhbyyh@gmail.com>
   2015-01-25 22:18:09 -0800
   Commit: 8125168, github.com/apache/spark/pull/4183

   [SPARK-5268] don't stop CoarseGrainedExecutorBackend for irrelevant DisassociatedEvent
   CodingCat <zhunansjtu@gmail.com>
   2015-01-25 19:28:53 -0800
   Commit: 8df9435, github.com/apache/spark/pull/4063

   SPARK-4430 [STREAMING] [TEST] Apache RAT Checks fail spuriously on test files
   Sean Owen <sowen@cloudera.com>
   2015-01-25 19:16:44 -0800
   Commit: 0528b85, github.com/apache/spark/pull/4189

   [SPARK-5326] Show fetch wait time as optional metric in the UI
   Kay Ousterhout <kayousterhout@gmail.com>
   2015-01-25 16:48:26 -0800
   Commit: fc2168f, github.com/apache/spark/pull/4110

   [SPARK-5344][WebUI] HistoryServer cannot recognize that inprogress file was renamed to completed file
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-01-25 15:34:20 -0800
   Commit: 8f5c827, github.com/apache/spark/pull/4132

   SPARK-4506 [DOCS] Addendum: Update more docs to reflect that standalone works in cluster mode
   Sean Owen <sowen@cloudera.com>
   2015-01-25 15:25:05 -0800
   Commit: 9f64357, github.com/apache/spark/pull/4160

   SPARK-5382: Use SPARK_CONF_DIR in spark-class if it is defined
   Jacek Lewandowski <lewandowski.jacek@gmail.com>
   2015-01-25 15:15:09 -0800
   Commit: 1c30afd, github.com/apache/spark/pull/4179

   SPARK-3782 [CORE] Direct use of log4j in AkkaUtils interferes with certain logging configurations
   Sean Owen <sowen@cloudera.com>
   2015-01-25 15:11:57 -0800
   Commit: 383425a, github.com/apache/spark/pull/4184

   SPARK-3852 [DOCS] Document spark.driver.extra* configs
   Sean Owen <sowen@cloudera.com>
   2015-01-25 15:08:05 -0800
   Commit: c586b45, github.com/apache/spark/pull/4185

   [SPARK-5402] log executor ID at executor-construction time
   Ryan Williams <ryan.blake.williams@gmail.com>
   2015-01-25 14:20:02 -0800
   Commit: aea2548, github.com/apache/spark/pull/4195

   [SPARK-5401] set executor ID before creating MetricsSystem
   Ryan Williams <ryan.blake.williams@gmail.com>
   2015-01-25 14:17:59 -0800
   Commit: 2d9887b, github.com/apache/spark/pull/4194

   Add comment about defaultMinPartitions
   Idan Zalzberg <idanzalz@gmail.com>
   2015-01-25 11:28:05 -0800
   Commit: 412a58e, github.com/apache/spark/pull/4102

   Closes #4157
   Reynold Xin <rxin@databricks.com>
   2015-01-25 00:24:59 -0800
   Commit: d22ca1e

   [SPARK-5214][Test] Add a test to demonstrate EventLoop can be stopped in the event thread
   zsxwing <zsxwing@gmail.com>
   2015-01-24 11:00:35 -0800
   Commit: 0d1e67e, github.com/apache/spark/pull/4174

   [SPARK-5058] Part 2. Typos and broken URL
   Jongyoul Lee <jongyoul@gmail.com>
   2015-01-23 23:34:11 -0800
   Commit: 09e09c5, github.com/apache/spark/pull/4172

   [SPARK-5351][GraphX] Do not use Partitioner.defaultPartitioner as a partitioner of EdgeRDDImp...
   Takeshi Yamamuro <linguin.m.s@gmail.com>
   2015-01-23 19:25:15 -0800
   Commit: e224dbb, github.com/apache/spark/pull/4136

   [SPARK-5063] More helpful error messages for several invalid operations
   Josh Rosen <joshrosen@databricks.com>
   2015-01-23 17:53:15 -0800
   Commit: cef1f09, github.com/apache/spark/pull/3884

   [SPARK-3541][MLLIB] New ALS implementation with improved storage
   Xiangrui Meng <meng@databricks.com>
   2015-01-22 22:09:13 -0800
   Commit: ea74365, github.com/apache/spark/pull/3720

   [SPARK-5315][Streaming] Fix reduceByWindow Java API not work bug
   jerryshao <saisai.shao@intel.com>
   2015-01-22 22:04:21 -0800
   Commit: e0f7fb7, github.com/apache/spark/pull/4104

   [SPARK-5233][Streaming] Fix error replaying of WAL introduced bug
   jerryshao <saisai.shao@intel.com>
   2015-01-22 21:58:53 -0800
   Commit: 3c3fa63, github.com/apache/spark/pull/4032

   SPARK-5370. [YARN] Remove some unnecessary synchronization in YarnAlloca...
   Sandy Ryza <sandy@cloudera.com>
   2015-01-22 13:49:35 -0600
   Commit: 820ce03, github.com/apache/spark/pull/4164

   [SPARK-5365][MLlib] Refactor KMeans to reduce redundant data
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-01-22 08:16:35 -0800
   Commit: 246111d, github.com/apache/spark/pull/4159

   [SPARK-5147][Streaming] Delete the received data WAL log periodically
   Tathagata Das <tathagata.das1565@gmail.com>, jerryshao <saisai.shao@intel.com>
   2015-01-21 23:41:44 -0800
   Commit: 3027f06, github.com/apache/spark/pull/4149

   [SPARK-5317]Set BoostingStrategy.defaultParams With Enumeration Algo.Classification or Algo.Regression
   Basin <jpsachilles@gmail.com>
   2015-01-21 23:06:34 -0800
   Commit: fcb3e18, github.com/apache/spark/pull/4103

   [SPARK-3424][MLLIB] cache point distances during k-means|| init
   Xiangrui Meng <meng@databricks.com>
   2015-01-21 21:20:31 -0800
   Commit: ca7910d, github.com/apache/spark/pull/4144

   [SPARK-5202] [SQL] Add hql variable substitution support
   Cheng Hao <hao.cheng@intel.com>
   2015-01-21 17:34:18 -0800
   Commit: 27bccc5, github.com/apache/spark/pull/4003

   [SPARK-5355] make SparkConf thread-safe
   Davies Liu <davies@databricks.com>
   2015-01-21 16:51:42 -0800
   Commit: 9bad062, github.com/apache/spark/pull/4143

   [SPARK-4984][CORE][WEBUI] Adding a pop-up containing the full job description when it is very long
   wangfei <wangfei1@huawei.com>
   2015-01-21 15:27:42 -0800
   Commit: 3be2a88, github.com/apache/spark/pull/3819

   [SQL] [Minor] Remove deprecated parquet tests
   Cheng Lian <lian@databricks.com>
   2015-01-21 14:38:10 -0800
   Commit: ba19689, github.com/apache/spark/pull/4116

   Revert "[SPARK-5244] [SQL] add coalesce() in sql parser"
   Josh Rosen <joshrosen@databricks.com>
   2015-01-21 14:27:43 -0800
   Commit: b328ac6

   [SPARK-5009] [SQL] Long keyword support in SQL Parsers
   Cheng Hao <hao.cheng@intel.com>
   2015-01-21 13:05:56 -0800
   Commit: 8361078, github.com/apache/spark/pull/3926

   [SPARK-5244] [SQL] add coalesce() in sql parser
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-01-21 12:59:41 -0800
   Commit: 812d367, github.com/apache/spark/pull/4040

   [SPARK-5064][GraphX] Add numEdges upperbound validation for R-MAT graph generator to prevent infinite loop
   Kenji Kikushima <kikushima.kenji@lab.ntt.co.jp>
   2015-01-21 12:34:00 -0800
   Commit: 3ee3ab5, github.com/apache/spark/pull/3950

   [SPARK-4749] [mllib]: Allow initializing KMeans clusters using a seed
   nate.crosswhite <nate.crosswhite@stresearch.com>, nxwhite-str <nxwhite-str@users.noreply.github.com>, Xiangrui Meng <meng@databricks.com>
   2015-01-21 10:32:10 -0800
   Commit: 7450a99, github.com/apache/spark/pull/3610

   [MLlib] [SPARK-5301] Missing conversions and operations on IndexedRowMatrix and CoordinateMatrix
   Reza Zadeh <reza@databricks.com>
   2015-01-21 09:48:38 -0800
   Commit: aa1e22b, github.com/apache/spark/pull/4089

   SPARK-1714. Take advantage of AMRMClient APIs to simplify logic in YarnA...
   Sandy Ryza <sandy@cloudera.com>
   2015-01-21 10:31:54 -0600
   Commit: 2eeada3, github.com/apache/spark/pull/3765

   [SPARK-5336][YARN]spark.executor.cores must not be less than spark.task.cpus
   WangTao <barneystinson@aliyun.com>, WangTaoTheTonic <barneystinson@aliyun.com>
   2015-01-21 09:42:30 -0600
   Commit: 8c06a5f, github.com/apache/spark/pull/4123

   [SPARK-5297][Streaming] Fix Java file stream type erasure problem
   jerryshao <saisai.shao@intel.com>
   2015-01-20 23:37:47 -0800
   Commit: 424d8c6, github.com/apache/spark/pull/4101

   [HOTFIX] Update pom.xml to pull MapR's Hadoop version 2.4.1.
   Kannan Rajah <rkannan82@gmail.com>
   2015-01-20 23:34:04 -0800
   Commit: ec5b0f2, github.com/apache/spark/pull/4108

   [SPARK-5275] [Streaming] include python source code
   Davies Liu <davies@databricks.com>
   2015-01-20 22:44:58 -0800
   Commit: bad6c57, github.com/apache/spark/pull/4128

   [SPARK-5294][WebUI] Hide tables in AllStagePages for "Active Stages, Completed Stages and Failed Stages" when they are empty
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-01-20 16:40:46 -0800
   Commit: 9a151ce, github.com/apache/spark/pull/4083

   [SPARK-5186] [MLLIB] Vector.equals and Vector.hashCode are very inefficient
   Yuhao Yang <hhbyyh@gmail.com>, Yuhao Yang <yuhao@yuhaodevbox.sh.intel.com>
   2015-01-20 15:20:20 -0800
   Commit: 2f82c84, github.com/apache/spark/pull/3997

   [SPARK-5323][SQL] Remove Row's Seq inheritance.
   Reynold Xin <rxin@databricks.com>
   2015-01-20 15:16:14 -0800
   Commit: d181c2a, github.com/apache/spark/pull/4115

   [SPARK-5287][SQL] Add defaultSizeOf to every data type.
   Yin Huai <yhuai@databricks.com>
   2015-01-20 13:26:36 -0800
   Commit: bc20a52, github.com/apache/spark/pull/4081

   SPARK-5019 [MLlib] - GaussianMixtureModel exposes instances of MultivariateGauss...
   Travis Galoppo <tjg2107@columbia.edu>
   2015-01-20 12:58:11 -0800
   Commit: 23e2554, github.com/apache/spark/pull/4088

   [SPARK-5329][WebUI] UIWorkloadGenerator should stop SparkContext.
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-01-20 12:40:55 -0800
   Commit: 769aced, github.com/apache/spark/pull/4112

   SPARK-4660: Use correct class loader in JavaSerializer (copy of PR #3840...
   Jacek Lewandowski <lewandowski.jacek@gmail.com>
   2015-01-20 12:38:01 -0800
   Commit: c93a57f, github.com/apache/spark/pull/4113

   [SQL][Minor] Refactors deeply nested FP style code in BooleanSimplification
   Cheng Lian <lian@databricks.com>
   2015-01-20 11:20:14 -0800
   Commit: 8140802, github.com/apache/spark/pull/4091

   [SPARK-5333][Mesos] MesosTaskLaunchData occurs BufferUnderflowException
   Jongyoul Lee <jongyoul@gmail.com>
   2015-01-20 10:17:29 -0800
   Commit: 9d9294a, github.com/apache/spark/pull/4119

   [SPARK-4803] [streaming] Remove duplicate RegisterReceiver message
   Ilayaperumal Gopinathan <igopinathan@pivotal.io>
   2015-01-20 01:41:10 -0800
   Commit: 4afad9c, github.com/apache/spark/pull/3648

   [SQL][minor] Add a log4j file for catalyst test.
   Reynold Xin <rxin@databricks.com>
   2015-01-20 00:55:25 -0800
   Commit: debc031, github.com/apache/spark/pull/4117

   SPARK-5270 [CORE] Provide isEmpty() function in RDD API
   Sean Owen <sowen@cloudera.com>
   2015-01-19 22:50:44 -0800
   Commit: 306ff18, github.com/apache/spark/pull/4074

   [SPARK-5214][Core] Add EventLoop and change DAGScheduler to an EventLoop
   zsxwing <zsxwing@gmail.com>
   2015-01-19 18:15:51 -0800
   Commit: e69fb8c, github.com/apache/spark/pull/4016

   [SPARK-4504][Examples] fix run-example failure if multiple assembly jars exist
   Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
   2015-01-19 11:58:16 -0800
   Commit: 74de94e, github.com/apache/spark/pull/3377

   [SPARK-5286][SQL] Fail to drop an invalid table when using the data source API
   Yin Huai <yhuai@databricks.com>
   2015-01-19 10:45:29 -0800
   Commit: 2604bc3, github.com/apache/spark/pull/4076

   [SPARK-5284][SQL] Insert into Hive throws NPE when an inner complex type field has a null value
   Yin Huai <yhuai@databricks.com>
   2015-01-19 10:44:12 -0800
   Commit: cd5da42, github.com/apache/spark/pull/4077

   [SPARK-5282][mllib]: RowMatrix easily gets int overflow in the memory size warning
   Yuhao Yang <hhbyyh@gmail.com>
   2015-01-19 10:10:15 -0800
   Commit: 4432568, github.com/apache/spark/pull/4069

   MAINTENANCE: Automated closing of pull requests.
   Patrick Wendell <patrick@databricks.com>
   2015-01-19 02:05:24 -0800
   Commit: 1ac1c1d, github.com/apache/spark/pull/3584

   [SPARK-5088] Use spark-class for running executors directly
   Jongyoul Lee <jongyoul@gmail.com>
   2015-01-19 02:01:56 -0800
   Commit: 4a4f9cc, github.com/apache/spark/pull/3897

   [SPARK-3288] All fields in TaskMetrics should be private and use getters/setters
   Ilya Ganelin <ilya.ganelin@capitalone.com>
   2015-01-19 01:32:22 -0800
   Commit: 3453d57, github.com/apache/spark/pull/4020

   SPARK-5217 Spark UI should report pending stages during job execution on AllStagesPage.
   Prashant Sharma <prashant.s@imaginea.com>
   2015-01-19 01:28:42 -0800
   Commit: 851b6a9, github.com/apache/spark/pull/4043

   [SQL] fix typo in class description
   Jacky Li <jacky.likun@gmail.com>
   2015-01-18 23:59:08 -0800
   Commit: 7dbf1fd, github.com/apache/spark/pull/4100

   [SQL][minor] Put DataTypes.java in java dir.
   Reynold Xin <rxin@databricks.com>
   2015-01-18 16:35:40 -0800
   Commit: 1955645, github.com/apache/spark/pull/4097

   [SQL][Minor] Update sql doc according to data type APIs changes
   scwf <wangfei1@huawei.com>
   2015-01-18 11:03:13 -0800
   Commit: 1a200a3, github.com/apache/spark/pull/4095

   [SPARK-5279][SQL] Use java.math.BigDecimal as the exposed Decimal type.
   Reynold Xin <rxin@databricks.com>
   2015-01-18 11:01:42 -0800
   Commit: 1727e08, github.com/apache/spark/pull/4092

   [HOTFIX]: Minor clean up regarding skipped artifacts in build files.
   Patrick Wendell <patrick@databricks.com>
   2015-01-17 23:15:12 -0800
   Commit: ad16da1, github.com/apache/spark/pull/4080

   MAINTENANCE: Automated closing of pull requests.
   Patrick Wendell <patrick@databricks.com>
   2015-01-17 20:39:54 -0800
   Commit: e12b5b6, github.com/apache/spark/pull/681

   [SQL][Minor] Added comments and examples to explain BooleanSimplification
   Reynold Xin <rxin@databricks.com>
   2015-01-17 17:35:53 -0800
   Commit: e7884bc, github.com/apache/spark/pull/4090

   [SPARK-5096] Use sbt tasks instead of vals to get hadoop version
   Michael Armbrust <michael@databricks.com>
   2015-01-17 17:03:07 -0800
   Commit: 6999910, github.com/apache/spark/pull/3905

   [SPARK-4937][SQL] Comment for the newly optimization rules in `BooleanSimplification`
   scwf <wangfei1@huawei.com>
   2015-01-17 15:51:24 -0800
   Commit: c1f3c27, github.com/apache/spark/pull/4086

   [SQL][minor] Improved Row documentation.
   Reynold Xin <rxin@databricks.com>
   2015-01-17 00:11:08 -0800
   Commit: f3bfc76, github.com/apache/spark/pull/4085

   [SPARK-5193][SQL] Remove Spark SQL Java-specific API.
   Reynold Xin <rxin@databricks.com>
   2015-01-16 21:09:06 -0800
   Commit: 61b427d, github.com/apache/spark/pull/4065

   [SPARK-4937][SQL] Adding optimization to simplify the And, Or condition in spark sql
   scwf <wangfei1@huawei.com>, wangfei <wangfei1@huawei.com>
   2015-01-16 14:01:22 -0800
   Commit: ee1c1f3, github.com/apache/spark/pull/3778

   [SPARK-733] Add documentation on use of accumulators in lazy transformation
   Ilya Ganelin <ilya.ganelin@capitalone.com>
   2015-01-16 13:25:17 -0800
   Commit: fd3a8a1, github.com/apache/spark/pull/4022

   [SPARK-4923][REPL] Add Developer API to REPL to allow re-publishing the REPL jar
   Chip Senkbeil <rcsenkbe@us.ibm.com>, Chip Senkbeil <chip.senkbeil@gmail.com>
   2015-01-16 12:56:40 -0800
   Commit: d05c9ee, github.com/apache/spark/pull/4034

   [WebUI] Fix collapse of WebUI layout
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-01-16 12:19:08 -0800
   Commit: ecf943d, github.com/apache/spark/pull/3995

   [SPARK-5231][WebUI] History Server shows wrong job submission time.
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-01-16 10:05:11 -0800
   Commit: e8422c5, github.com/apache/spark/pull/4029

   [DOCS] Fix typo in return type of cogroup
   Sean Owen <sowen@cloudera.com>
   2015-01-16 09:28:44 -0800
   Commit: f6b852a, github.com/apache/spark/pull/4072

   [SPARK-5201][CORE] deal with int overflow in the ParallelCollectionRDD.slice method
   Ye Xianjin <advancedxy@gmail.com>
   2015-01-16 09:20:53 -0800
   Commit: e200ac8, github.com/apache/spark/pull/4002

   [SPARK-1507][YARN]specify # cores for ApplicationMaster
   WangTaoTheTonic <barneystinson@aliyun.com>, WangTao <barneystinson@aliyun.com>
   2015-01-16 09:16:56 -0800
   Commit: 2be82b1, github.com/apache/spark/pull/4018

   [SPARK-4092] [CORE] Fix InputMetrics for coalesce'd Rdds
   Kostas Sakellis <kostas@cloudera.com>
   2015-01-15 18:48:39 -0800
   Commit: a79a9f9, github.com/apache/spark/pull/3120

   [SPARK-4857] [CORE] Adds Executor membership events to SparkListener
   Kostas Sakellis <kostas@cloudera.com>
   2015-01-15 17:53:42 -0800
   Commit: 96c2c71, github.com/apache/spark/pull/3711

   [Minor] Fix tiny typo in BlockManager
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-01-15 17:07:44 -0800
   Commit: 65858ba, github.com/apache/spark/pull/4046

   [SPARK-5274][SQL] Reconcile Java and Scala UDFRegistration.
   Reynold Xin <rxin@databricks.com>
   2015-01-15 16:15:12 -0800
   Commit: 1881431, github.com/apache/spark/pull/4056

   [SPARK-5224] [PySpark] improve performance of parallelize list/ndarray
   Davies Liu <davies@databricks.com>
   2015-01-15 11:40:41 -0800
   Commit: 3c8650c, github.com/apache/spark/pull/4024

   [SPARK-5193][SQL] Tighten up HiveContext API
   Reynold Xin <rxin@databricks.com>
   2015-01-14 20:31:02 -0800
   Commit: 4b325c7, github.com/apache/spark/pull/4054

   [SPARK-5254][MLLIB] remove developers section from spark.ml guide
   Xiangrui Meng <meng@databricks.com>
   2015-01-14 18:54:17 -0800
   Commit: 6abc45e, github.com/apache/spark/pull/4053

   [SPARK-5193][SQL] Tighten up SQLContext API
   Reynold Xin <rxin@databricks.com>
   2015-01-14 18:36:15 -0800
   Commit: cfa397c, github.com/apache/spark/pull/4049

   [SPARK-5254][MLLIB] Update the user guide to position spark.ml better
   Xiangrui Meng <meng@databricks.com>
   2015-01-14 17:50:33 -0800
   Commit: 13d2406, github.com/apache/spark/pull/4052

   [SPARK-5234][ml]examples for ml don't have sparkContext.stop
   Yuhao Yang <yuhao@yuhaodevbox.sh.intel.com>
   2015-01-14 11:53:43 -0800
   Commit: 76389c5, github.com/apache/spark/pull/4044

   [SPARK-5235] Make SQLConf Serializable
   Alex Baretta <alexbaretta@gmail.com>
   2015-01-14 11:51:55 -0800
   Commit: 2fd7f72, github.com/apache/spark/pull/4031

   [SPARK-4014] Add TaskContext.attemptNumber and deprecate TaskContext.attemptId
   Josh Rosen <joshrosen@databricks.com>
   2015-01-14 11:45:40 -0800
   Commit: 259936b, github.com/apache/spark/pull/3849

   [SPARK-5228][WebUI] Hide tables for "Active Jobs/Completed Jobs/Failed Jobs" when they are empty
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-01-14 11:10:29 -0800
   Commit: 9d4449c, github.com/apache/spark/pull/4028

   [SPARK-2909] [MLlib] [PySpark] SparseVector in pyspark now supports indexing
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-01-14 11:03:11 -0800
   Commit: 5840f54, github.com/apache/spark/pull/4025

   [SQL] some comments fix for GROUPING SETS
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-01-14 09:50:01 -0800
   Commit: 38bdc99, github.com/apache/spark/pull/4000

   [SPARK-5211][SQL]Restore HiveMetastoreTypes.toDataType
   Yin Huai <yhuai@databricks.com>
   2015-01-14 09:47:30 -0800
   Commit: 81f72a0, github.com/apache/spark/pull/4026

   [SPARK-5248] [SQL] move sql.types.decimal.Decimal to sql.types.Decimal
   Daoyuan Wang <daoyuan.wang@intel.com>
   2015-01-14 09:36:59 -0800
   Commit: a3f7421, github.com/apache/spark/pull/4041

   [SPARK-5167][SQL] Move Row into sql package and make it usable for Java.
   Reynold Xin <rxin@databricks.com>
   2015-01-14 00:38:55 -0800
   Commit: d5eeb35, github.com/apache/spark/pull/4030

   [SPARK-5123][SQL] Reconcile Java/Scala API for data types.
   Reynold Xin <rxin@databricks.com>
   2015-01-13 17:16:41 -0800
   Commit: f996909, github.com/apache/spark/pull/3958

   [SPARK-5168] Make SQLConf a field rather than mixin in SQLContext
   Reynold Xin <rxin@databricks.com>
   2015-01-13 13:30:35 -0800
   Commit: 14e3f11, github.com/apache/spark/pull/3965

   [SPARK-4912][SQL] Persistent tables for the Spark SQL data sources api
   Yin Huai <yhuai@databricks.com>, Michael Armbrust <michael@databricks.com>
   2015-01-13 13:01:27 -0800
   Commit: 6463e0b, github.com/apache/spark/pull/3960

   [SPARK-5223] [MLlib] [PySpark] fix MapConverter and ListConverter in MLlib
   Davies Liu <davies@databricks.com>
   2015-01-13 12:50:31 -0800
   Commit: 8ead999, github.com/apache/spark/pull/4023

   [SPARK-5131][Streaming][DOC]: There is a discrepancy in WAL implementation and configuration doc.
   uncleGen <hustyugm@gmail.com>
   2015-01-13 10:07:19 -0800
   Commit: 39e333e, github.com/apache/spark/pull/3930

   [SPARK-4697][YARN]System properties should override environment variables
   WangTaoTheTonic <barneystinson@aliyun.com>, WangTao <barneystinson@aliyun.com>
   2015-01-13 09:43:48 -0800
   Commit: 9dea64e, github.com/apache/spark/pull/3557

   [SPARK-5006][Deploy]spark.port.maxRetries doesn't work
   WangTaoTheTonic <barneystinson@aliyun.com>, WangTao <barneystinson@aliyun.com>
   2015-01-13 09:28:21 -0800
   Commit: f7741a9, github.com/apache/spark/pull/3841

   [SPARK-5138][SQL] Ensure schema can be inferred from a namedtuple
   Gabe Mulley <gabe@edx.org>
   2015-01-12 21:44:51 -0800
   Commit: 1e42e96, github.com/apache/spark/pull/3978

   [SPARK-5049][SQL] Fix ordering of partition columns in ParquetTableScan
   Michael Armbrust <michael@databricks.com>
   2015-01-12 15:19:09 -0800
   Commit: 5d9fa55, github.com/apache/spark/pull/3990

   [SPARK-4999][Streaming] Change storeInBlockManager to false by default
   jerryshao <saisai.shao@intel.com>
   2015-01-12 13:14:44 -0800
   Commit: 3aed305, github.com/apache/spark/pull/3906

   SPARK-5172 [BUILD] spark-examples-***.jar shades a wrong Hadoop distribution
   Sean Owen <sowen@cloudera.com>
   2015-01-12 12:15:34 -0800
   Commit: aff49a3, github.com/apache/spark/pull/3992

   [SPARK-5078] Optionally read from SPARK_LOCAL_HOSTNAME
   Michael Armbrust <michael@databricks.com>
   2015-01-12 11:57:59 -0800
   Commit: a3978f3, github.com/apache/spark/pull/3893

   SPARK-4159 [BUILD] Addendum: improve running of single test after enabling Java tests
   Sean Owen <sowen@cloudera.com>
   2015-01-12 11:00:56 -0800
   Commit: 13e610b, github.com/apache/spark/pull/3993

   [SPARK-5102][Core]subclass of MapStatus needs to be registered with Kryo
   lianhuiwang <lianhuiwang09@gmail.com>
   2015-01-12 10:57:12 -0800
   Commit: ef9224e, github.com/apache/spark/pull/4007

   [SPARK-5200] Disable web UI in Hive ThriftServer tests
   Josh Rosen <joshrosen@databricks.com>
   2015-01-12 10:47:12 -0800
   Commit: 82fd38d, github.com/apache/spark/pull/3998

   SPARK-5018 [MLlib] [WIP] Make MultivariateGaussian public
   Travis Galoppo <tjg2107@columbia.edu>
   2015-01-11 21:31:16 -0800
   Commit: 2130de9, github.com/apache/spark/pull/3923

   [SPARK-4033][Examples]Input of the SparkPi too big causes the emption exception
   huangzhaowei <carlmartinmax@gmail.com>
   2015-01-11 16:32:47 -0800
   Commit: f38ef65, github.com/apache/spark/pull/2874

   [SPARK-4951][Core] Fix the issue that a busy executor may be killed
   zsxwing <zsxwing@gmail.com>
   2015-01-11 16:23:28 -0800
   Commit: 6942b97, github.com/apache/spark/pull/3783

   [SPARK-5073] spark.storage.memoryMapThreshold has two default values
   lewuathe <lewuathe@me.com>
   2015-01-11 13:50:42 -0800
   Commit: 1656aae, github.com/apache/spark/pull/3900

   [SPARK-5032] [graphx] Remove GraphX MIMA exclude for 1.3
   Joseph K. Bradley <joseph@databricks.com>
   2015-01-10 17:25:39 -0800
   Commit: 3313260, github.com/apache/spark/pull/3856

   [SPARK-5029][SQL] Enable from follow multiple brackets
   scwf <wangfei1@huawei.com>
   2015-01-10 17:07:34 -0800
   Commit: d22a31f, github.com/apache/spark/pull/3853

   [SPARK-4871][SQL] Show sql statement in spark ui when run sql with spark-sql
   wangfei <wangfei1@huawei.com>
   2015-01-10 17:04:56 -0800
   Commit: 92d9a70, github.com/apache/spark/pull/3718

   [Minor]Resolve sbt warnings during build (MQTTStreamSuite.scala).
   GuoQiang Li <witgo@qq.com>
   2015-01-10 15:38:43 -0800
   Commit: 8a29dc7, github.com/apache/spark/pull/3989

   [SPARK-5181] do not print writing WAL log when WAL is disabled
   CodingCat <zhunansjtu@gmail.com>
   2015-01-10 15:35:41 -0800
   Commit: f0d558b, github.com/apache/spark/pull/3985

   [SPARK-4692] [SQL] Support ! boolean logic operator like NOT
   YanTangZhai <hakeemzhai@tencent.com>, Michael Armbrust <michael@databricks.com>
   2015-01-10 15:05:23 -0800
   Commit: 0ca51cc, github.com/apache/spark/pull/3555

   [SPARK-5187][SQL] Fix caching of tables with HiveUDFs in the WHERE clause
   Michael Armbrust <michael@databricks.com>
   2015-01-10 14:25:45 -0800
   Commit: 3684fd2, github.com/apache/spark/pull/3987

   SPARK-4963 [SQL] Add copy to SQL's Sample operator
   Yanbo Liang <yanbohappy@gmail.com>
   2015-01-10 14:16:37 -0800
   Commit: 77106df, github.com/apache/spark/pull/3827

   [SPARK-4861][SQL] Refactory command in spark sql
   scwf <wangfei1@huawei.com>
   2015-01-10 14:08:04 -0800
   Commit: b3e86dc, github.com/apache/spark/pull/3948

   [SPARK-4574][SQL] Adding support for defining schema in foreign DDL commands.
   scwf <wangfei1@huawei.com>, Yin Huai <yhuai@databricks.com>, Fei Wang <wangfei1@huawei.com>, wangfei <wangfei1@huawei.com>
   2015-01-10 13:53:21 -0800
   Commit: 693a323, github.com/apache/spark/pull/3431

   [SPARK-4943][SQL] Allow table name having dot for db/catalog
   Alex Liu <alex_liu68@yahoo.com>
   2015-01-10 13:23:09 -0800
   Commit: 4b39fd1, github.com/apache/spark/pull/3941

   [SPARK-4925][SQL] Publish Spark SQL hive-thriftserver maven artifact
   Alex Liu <alex_liu68@yahoo.com>
   2015-01-10 13:19:12 -0800
   Commit: 1e56eba, github.com/apache/spark/pull/3766

   [SPARK-5141][SQL]CaseInsensitiveMap throws java.io.NotSerializableException
   luogankun <luogankun@gmail.com>
   2015-01-09 20:38:41 -0800
   Commit: 545dfcb, github.com/apache/spark/pull/3944

   [SPARK-4406] [MLlib] FIX: Validate k in SVD
   MechCoder <manojkumarsivaraj334@gmail.com>
   2015-01-09 17:45:18 -0800
   Commit: 4554529, github.com/apache/spark/pull/3945

   [SPARK-4990][Deploy]to find default properties file, search SPARK_CONF_DIR first
   WangTaoTheTonic <barneystinson@aliyun.com>, WangTao <barneystinson@aliyun.com>
   2015-01-09 17:10:02 -0800
   Commit: 8782eb9, github.com/apache/spark/pull/3823

   [Minor] Fix import order and other coding style
   bilna <bilnap@am.amrita.edu>, Bilna P <bilna.p@gmail.com>
   2015-01-09 14:45:28 -0800
   Commit: 4e1f12d, github.com/apache/spark/pull/3966

   [DOC] Fixed Mesos version in doc from 0.18.1 to 0.21.0
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-01-09 14:40:45 -0800
   Commit: ae62872, github.com/apache/spark/pull/3982

   [SPARK-4737] Task set manager properly handles serialization errors
   mcheah <mcheah@palantir.com>
   2015-01-09 14:16:20 -0800
   Commit: e0f28e0, github.com/apache/spark/pull/3638

   [SPARK-1953][YARN]yarn client mode Application Master memory size is same as driver memory...
   WangTaoTheTonic <barneystinson@aliyun.com>
   2015-01-09 13:20:32 -0800
   Commit: e966452, github.com/apache/spark/pull/3607

   [SPARK-5015] [mllib] Random seed for GMM + make test suite deterministic
   Joseph K. Bradley <joseph@databricks.com>
   2015-01-09 13:00:15 -0800
   Commit: 7e8e62a, github.com/apache/spark/pull/3981

   [SPARK-3619] Upgrade to Mesos 0.21 to work around MESOS-1688
   Jongyoul Lee <jongyoul@gmail.com>
   2015-01-09 10:47:08 -0800
   Commit: 454fe12, github.com/apache/spark/pull/3934

   [SPARK-5145][Mllib] Add BLAS.dsyr and use it in GaussianMixtureEM
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-01-09 10:27:33 -0800
   Commit: e9ca16e, github.com/apache/spark/pull/3949

   [SPARK-1143] Separate pool tests into their own suite.
   Kay Ousterhout <kayousterhout@gmail.com>
   2015-01-09 09:47:06 -0800
   Commit: b6aa557, github.com/apache/spark/pull/3967

   HOTFIX: Minor improvements to make-distribution.sh
   Patrick Wendell <pwendell@gmail.com>
   2015-01-09 09:40:18 -0800
   Commit: 1790b38, github.com/apache/spark/pull/3973

   SPARK-5136 [DOCS] Improve documentation around setting up Spark IntelliJ project
   Sean Owen <sowen@cloudera.com>
   2015-01-09 09:35:46 -0800
   Commit: 547df97, github.com/apache/spark/pull/3952

   [Minor] Fix test RetryingBlockFetcherSuite after changed config name
   Aaron Davidson <aaron@databricks.com>
   2015-01-09 09:20:16 -0800
   Commit: b4034c3, github.com/apache/spark/pull/3972

   [SPARK-5169][YARN]fetch the correct max attempts
   WangTaoTheTonic <barneystinson@aliyun.com>
   2015-01-09 08:10:09 -0600
   Commit: f3da4bd, github.com/apache/spark/pull/3942

   [SPARK-5122] Remove Shark from spark-ec2
   Nicholas Chammas <nicholas.chammas@gmail.com>
   2015-01-08 17:42:08 -0800
   Commit: 167a5ab, github.com/apache/spark/pull/3939

   [SPARK-4048] Enhance and extend hadoop-provided profile.
   Marcelo Vanzin <vanzin@cloudera.com>
   2015-01-08 17:15:13 -0800
   Commit: 48cecf6, github.com/apache/spark/pull/2982

   [SPARK-4891][PySpark][MLlib] Add gamma/log normal/exp dist sampling to P...
   RJ Nowling <rnowling@gmail.com>
   2015-01-08 15:03:43 -0800
   Commit: c9c8b21, github.com/apache/spark/pull/3955

   [SPARK-4973][CORE] Local directory in the driver of client-mode continues remaining even if application finished when external shuffle is enabled
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-01-08 13:43:09 -0800
   Commit: a00af6b, github.com/apache/spark/pull/3811

   SPARK-5148 [MLlib] Make usersOut/productsOut storagelevel in ALS configurable
   Fernando Otero (ZeoS) <fotero@gmail.com>
   2015-01-08 12:42:54 -0800
   Commit: 72df5a3, github.com/apache/spark/pull/3953

   Document that groupByKey will OOM for large keys
   Eric Moyer <eric_moyer@yahoo.com>
   2015-01-08 11:55:23 -0800
   Commit: 538f221, github.com/apache/spark/pull/3936

   [SPARK-5130][Deploy]Take yarn-cluster as cluster mode in spark-submit
   WangTaoTheTonic <barneystinson@aliyun.com>
   2015-01-08 11:45:42 -0800
   Commit: 0760787, github.com/apache/spark/pull/3929

   [Minor] Fix the value represented by spark.executor.id for consistency.
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2015-01-08 11:35:56 -0800
   Commit: 0a59727, github.com/apache/spark/pull/3812

   [SPARK-4989][CORE] avoid wrong eventlog conf cause cluster down in standalone mode
   Zhang, Liye <liye.zhang@intel.com>
   2015-01-08 10:40:26 -0800
   Commit: 06dc4b5, github.com/apache/spark/pull/3824

   [SPARK-4917] Add a function to convert into a graph with canonical edges in GraphOps
   Takeshi Yamamuro <linguin.m.s@gmail.com>
   2015-01-08 09:55:12 -0800
   Commit: f825e19, github.com/apache/spark/pull/3760

   SPARK-5087. [YARN] Merge yarn.Client and yarn.ClientBase
   Sandy Ryza <sandy@cloudera.com>
   2015-01-08 09:25:43 -0800
   Commit: 8d45834, github.com/apache/spark/pull/3896

   MAINTENANCE: Automated closing of pull requests.
   Patrick Wendell <pwendell@gmail.com>
   2015-01-07 23:25:56 -0800
   Commit: c082385, github.com/apache/spark/pull/3880

   [SPARK-5116][MLlib] Add extractor for SparseVector and DenseVector
   Shuo Xiang <shuoxiangpub@gmail.com>
   2015-01-07 23:22:37 -0800
   Commit: c66a976, github.com/apache/spark/pull/3919

   [SPARK-5126][Core] Verify Spark urls before creating Actors so that invalid urls can crash the process.
   zsxwing <zsxwing@gmail.com>
   2015-01-07 23:01:30 -0800
   Commit: 2b729d2, github.com/apache/spark/pull/3927

   [SPARK-5132][Core]Correct stage Attempt Id key in stageInfofromJson
   hushan[胡珊] <hushan@xiaomi.com>
   2015-01-07 12:09:12 -0800
   Commit: d345ebe, github.com/apache/spark/pull/3932

   [SPARK-5128][MLLib] Add commonly used log1pExp API in MLUtils
   DB Tsai <dbtsai@alpinenow.com>
   2015-01-07 10:13:41 -0800
   Commit: 60e2d9e, github.com/apache/spark/pull/3915

   [SPARK-2458] Make failed application log visible on History Server
   Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
   2015-01-07 07:32:16 -0800
   Commit: 6e74ede, github.com/apache/spark/pull/3467

   [SPARK-2165][YARN]add support for setting maxAppAttempts in the ApplicationSubmissionContext
   WangTaoTheTonic <barneystinson@aliyun.com>
   2015-01-07 08:14:39 -0600
   Commit: 8fdd489, github.com/apache/spark/pull/3878

   [YARN][SPARK-4929] Bug fix: fix the yarn-client code to support HA
   huangzhaowei <carlmartinmax@gmail.com>
   2015-01-07 08:10:42 -0600
   Commit: 5fde661, github.com/apache/spark/pull/3771

   [SPARK-5099][Mllib] Simplify logistic loss function
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-01-06 21:23:31 -0800
   Commit: e21acc1, github.com/apache/spark/pull/3899

   [SPARK-5050][Mllib] Add unit test for sqdist
   Liang-Chi Hsieh <viirya@gmail.com>
   2015-01-06 14:00:45 -0800
   Commit: bb38ebb, github.com/apache/spark/pull/3869

   SPARK-5017 [MLlib] - Use SVD to compute determinant and inverse of covariance matrix
   Travis Galoppo <tjg2107@columbia.edu>
   2015-01-06 13:57:42 -0800
   Commit: 4108e5f, github.com/apache/spark/pull/3871

   SPARK-4159 [CORE] Maven build doesn't run JUnit test suites
   Sean Owen <sowen@cloudera.com>
   2015-01-06 12:02:08 -0800
   Commit: 4cba6eb, github.com/apache/spark/pull/3651

   [Minor] Fix comments for GraphX 2D partitioning strategy
   kj-ki <kikushima.kenji@lab.ntt.co.jp>
   2015-01-06 09:49:37 -0800
   Commit: 5e3ec11, github.com/apache/spark/pull/3904

   [SPARK-1600] Refactor FileInputStream tests to remove Thread.sleep() calls and SystemClock usage
   Josh Rosen <joshrosen@databricks.com>
   2015-01-06 00:31:19 -0800
   Commit: a6394bc, github.com/apache/spark/pull/3801

   SPARK-4843 [YARN] Squash ExecutorRunnableUtil and ExecutorRunnable
   Kostas Sakellis <kostas@cloudera.com>
   2015-01-05 23:26:33 -0800
   Commit: 451546a, github.com/apache/spark/pull/3696

   [SPARK-5040][SQL] Support expressing unresolved attributes using $"attribute name" notation in SQL DSL.
   Reynold Xin <rxin@databricks.com>
   2015-01-05 15:34:22 -0800
   Commit: 04d55d8, github.com/apache/spark/pull/3862

   [SPARK-5093] Set spark.network.timeout to 120s consistently.
   Reynold Xin <rxin@databricks.com>
   2015-01-05 15:19:53 -0800
   Commit: bbcba3a, github.com/apache/spark/pull/3903

   [SPARK-5089][PYSPARK][MLLIB] Fix vector convert
   freeman <the.freeman.lab@gmail.com>
   2015-01-05 13:10:59 -0800
   Commit: 6c6f325, github.com/apache/spark/pull/3902

   [SPARK-4465] runAsSparkUser doesn't affect TaskRunner in Mesos environme...
   Jongyoul Lee <jongyoul@gmail.com>
   2015-01-05 12:05:09 -0800
   Commit: 1c0e7ce, github.com/apache/spark/pull/3741

   [SPARK-5057] Log message in failed askWithReply attempts
   WangTao <barneystinson@aliyun.com>, WangTaoTheTonic <barneystinson@aliyun.com>
   2015-01-05 11:59:38 -0800
   Commit: ce39b34, github.com/apache/spark/pull/3875

   [SPARK-4688] Have a single shared network timeout in Spark
   Varun Saxena <vsaxena.varun@gmail.com>, varunsaxena <vsaxena.varun@gmail.com>
   2015-01-05 10:32:37 -0800
   Commit: d3f07fd, github.com/apache/spark/pull/3562

   [SPARK-5074][Core] Fix a non-deterministic test failure
   zsxwing <zsxwing@gmail.com>
   2015-01-04 21:18:33 -0800
   Commit: 5c506ce, github.com/apache/spark/pull/3889

   [SPARK-5083][Core] Fix a flaky test in TaskResultGetterSuite
   zsxwing <zsxwing@gmail.com>
   2015-01-04 21:09:21 -0800
   Commit: 27e7f5a, github.com/apache/spark/pull/3894

   [SPARK-5069][Core] Fix the race condition of TaskSchedulerImpl.dagScheduler
   zsxwing <zsxwing@gmail.com>
   2015-01-04 21:06:04 -0800
   Commit: 6c726a3, github.com/apache/spark/pull/3887

   [SPARK-5067][Core] Use '===' to compare well-defined case class
   zsxwing <zsxwing@gmail.com>
   2015-01-04 21:03:17 -0800
   Commit: 7239652, github.com/apache/spark/pull/3886

   [SPARK-4835] Disable validateOutputSpecs for Spark Streaming jobs
   Josh Rosen <joshrosen@databricks.com>
   2015-01-04 20:26:18 -0800
   Commit: 939ba1f, github.com/apache/spark/pull/3832

   [SPARK-4631] unit test for MQTT
   bilna <bilnap@am.amrita.edu>, Bilna P <bilna.p@gmail.com>
   2015-01-04 19:37:48 -0800
   Commit: e767d7d, github.com/apache/spark/pull/3844

   [SPARK-4787] Stop SparkContext if a DAGScheduler init error occurs
   Dale <tigerquoll@outlook.com>
   2015-01-04 13:28:37 -0800
   Commit: 3fddc94, github.com/apache/spark/pull/3809

   [SPARK-794][Core] Remove sleep() in ClusterScheduler.stop
   Brennon York <brennon.york@capitalone.com>
   2015-01-04 12:40:39 -0800
   Commit: b96008d, github.com/apache/spark/pull/3851

   [SPARK-5058] Updated broken links
   sigmoidanalytics <mayur@sigmoidanalytics.com>
   2015-01-03 19:46:08 -0800
   Commit: 342612b, github.com/apache/spark/pull/3877

   Fixed typos in streaming-kafka-integration.md
   Akhil Das <akhld@darktech.ca>
   2015-01-02 15:12:27 -0800
   Commit: cdccc26, github.com/apache/spark/pull/3876

   [SPARK-3325][Streaming] Add a parameter to the method print in class DStream
   Yadong Qi <qiyadong2010@gmail.com>, q00251598 <qiyadong@huawei.com>, Tathagata Das <tathagata.das1565@gmail.com>, wangfei <wangfei1@huawei.com>
   2015-01-02 15:09:41 -0800
   Commit: bd88b71, github.com/apache/spark/pull/3865

   [HOTFIX] Bind web UI to ephemeral port in DriverSuite
   Josh Rosen <joshrosen@databricks.com>
   2015-01-01 15:03:54 -0800
   Commit: 0128398, github.com/apache/spark/pull/3873

   [SPARK-5038] Add explicit return type for implicit functions.
   Reynold Xin <rxin@databricks.com>
   2014-12-31 17:07:47 -0800
   Commit: 7749dd6, github.com/apache/spark/pull/3860

   SPARK-2757 [BUILD] [STREAMING] Add Mima test for Spark Sink after 1.10 is released
   Sean Owen <sowen@cloudera.com>
   2014-12-31 16:59:17 -0800
   Commit: 4bb1248, github.com/apache/spark/pull/3842

   [SPARK-5035] [Streaming] ReceiverMessage trait should extend Serializable
   Josh Rosen <joshrosen@databricks.com>
   2014-12-31 16:02:47 -0800
   Commit: fe6efac, github.com/apache/spark/pull/3857

   SPARK-5020 [MLlib] GaussianMixtureModel.predictMembership() should take an RDD only
   Travis Galoppo <tjg2107@columbia.edu>
   2014-12-31 15:39:58 -0800
   Commit: c4f0b4f, github.com/apache/spark/pull/3854

   [SPARK-5028][Streaming]Add total received and processed records metrics to Streaming UI
   jerryshao <saisai.shao@intel.com>
   2014-12-31 14:45:31 -0800
   Commit: fdc2aa4, github.com/apache/spark/pull/3852

   [SPARK-4790][STREAMING] Fix ReceivedBlockTrackerSuite waits for old file...
   Hari Shreedharan <hshreedharan@apache.org>
   2014-12-31 14:35:07 -0800
   Commit: 3610d3c, github.com/apache/spark/pull/3726

   [SPARK-5038][SQL] Add explicit return type for implicit functions in Spark SQL
   Reynold Xin <rxin@databricks.com>
   2014-12-31 14:25:03 -0800
   Commit: c88a3d7, github.com/apache/spark/pull/3859

   [HOTFIX] Disable Spark UI in SparkSubmitSuite tests
   Josh Rosen <joshrosen@databricks.com>
   2014-12-12 12:38:37 -0800
   Commit: e24d3a9

   SPARK-4547 [MLLIB] OOM when making bins in BinaryClassificationMetrics
   Sean Owen <sowen@cloudera.com>
   2014-12-31 13:37:04 -0800
   Commit: 3d194cc, github.com/apache/spark/pull/3702

   [SPARK-4298][Core] - The spark-submit cannot read Main-Class from Manifest.
   Brennon York <brennon.york@capitalone.com>
   2014-12-31 11:54:10 -0800
   Commit: 8e14c5e, github.com/apache/spark/pull/3561

   [SPARK-4797] Replace breezeSquaredDistance
   Liang-Chi Hsieh <viirya@gmail.com>
   2014-12-31 11:50:53 -0800
   Commit: 06a9aa5, github.com/apache/spark/pull/3643

   [SPARK-1010] Clean up uses of System.setProperty in unit tests
   Josh Rosen <joshrosen@databricks.com>
   2014-12-30 18:12:20 -0800
   Commit: 352ed6b, github.com/apache/spark/pull/3739

   [SPARK-4998][MLlib]delete the "train" function
   Liu Jiongzhou <ljzzju@163.com>
   2014-12-30 15:55:56 -0800
   Commit: 035bac8, github.com/apache/spark/pull/3836

   [SPARK-4813][Streaming] Fix the issue that ContextWaiter didn't handle 'spurious wakeup'
   zsxwing <zsxwing@gmail.com>
   2014-12-30 14:39:13 -0800
   Commit: 6a89782, github.com/apache/spark/pull/3661

   [Spark-4995] Replace Vector.toBreeze.activeIterator with foreachActive
   Jakub Dubovsky <dubovsky@avast.com>
   2014-12-30 14:19:07 -0800
   Commit: 0f31992, github.com/apache/spark/pull/3846

   SPARK-3955 part 2 [CORE] [HOTFIX] Different versions between jackson-mapper-asl and jackson-core-asl
   Sean Owen <sowen@cloudera.com>
   2014-12-30 14:00:57 -0800
   Commit: b239ea1, github.com/apache/spark/pull/3829

   [SPARK-4570][SQL]add BroadcastLeftSemiJoinHash
   wangxiaojing <u9jing@gmail.com>
   2014-12-30 13:54:12 -0800
   Commit: 07fa191, github.com/apache/spark/pull/3442

   [SPARK-4935][SQL] When hive.cli.print.header configured, spark-sql aborted if passed in an invalid sql
   wangfei <wangfei1@huawei.com>, Fei Wang <wangfei1@huawei.com>
   2014-12-30 13:44:30 -0800
   Commit: 8f29b7c, github.com/apache/spark/pull/3761

   [SPARK-4386] Improve performance when writing Parquet files
   Michael Davies <Michael.BellDavies@gmail.com>
   2014-12-30 13:40:51 -0800
   Commit: 7425bec, github.com/apache/spark/pull/3843

   [SPARK-4937][SQL] Normalizes conjunctions and disjunctions to eliminate common predicates
   Cheng Lian <lian@databricks.com>
   2014-12-30 13:38:27 -0800
   Commit: 61a99f6, github.com/apache/spark/pull/3784

   [SPARK-4928][SQL] Fix: Operator '>,<,>=,<=' with decimal between different precision report error
   guowei2 <guowei2@asiainfo.com>
   2014-12-30 12:21:00 -0800
   Commit: a75dd83, github.com/apache/spark/pull/3767

   [SPARK-4930][SQL][DOCS]Update SQL programming guide, CACHE TABLE is eager
   luogankun <luogankun@gmail.com>
   2014-12-30 12:18:55 -0800
   Commit: 2deac74, github.com/apache/spark/pull/3773

   [SPARK-4916][SQL][DOCS]Update SQL programming guide about cache section
   luogankun <luogankun@gmail.com>
   2014-12-30 12:17:49 -0800
   Commit: f7a41a0, github.com/apache/spark/pull/3759

   [SPARK-4493][SQL] Tests for IsNull / IsNotNull in the ParquetFilterSuite
   Cheng Lian <lian@databricks.com>
   2014-12-30 12:16:45 -0800
   Commit: 19a8802, github.com/apache/spark/pull/3748

   [Spark-4512] [SQL] Unresolved Attribute Exception in Sort By
   Cheng Hao <hao.cheng@intel.com>
   2014-12-30 12:11:44 -0800
   Commit: 53f0a00, github.com/apache/spark/pull/3386

   [SPARK-5002][SQL] Using ascending by default when not specify order in order by
   wangfei <wangfei1@huawei.com>
   2014-12-30 12:07:24 -0800
   Commit: daac221, github.com/apache/spark/pull/3838

   [SPARK-4904] [SQL] Remove the unnecessary code change in Generic UDF
   Cheng Hao <hao.cheng@intel.com>
   2014-12-30 11:47:08 -0800
   Commit: 63b84b7, github.com/apache/spark/pull/3745

   [SPARK-4959] [SQL] Attributes are case sensitive when using a select query from a projection
   Cheng Hao <hao.cheng@intel.com>
   2014-12-30 11:33:47 -0800
   Commit: 5595eaa, github.com/apache/spark/pull/3796

   [SPARK-4975][SQL] Fix HiveInspectorSuite test failure
   scwf <wangfei1@huawei.com>, Fei Wang <wangfei1@huawei.com>
   2014-12-30 11:30:47 -0800
   Commit: 65357f1, github.com/apache/spark/pull/3814

   [SQL] enable view test
   Daoyuan Wang <daoyuan.wang@intel.com>
   2014-12-30 11:29:13 -0800
   Commit: 94d60b7, github.com/apache/spark/pull/3826

   [SPARK-4908][SQL] Prevent multiple concurrent hive native commands
   Michael Armbrust <michael@databricks.com>
   2014-12-30 11:24:46 -0800
   Commit: 480bd1d, github.com/apache/spark/pull/3834

   [SPARK-4882] Register PythonBroadcast with Kryo so that PySpark works with KryoSerializer
   Josh Rosen <joshrosen@databricks.com>
   2014-12-30 09:29:52 -0800
   Commit: efa80a53, github.com/apache/spark/pull/3831

   [SPARK-4920][UI] add version on master and worker page for standalone mode
   Zhang, Liye <liye.zhang@intel.com>
   2014-12-30 09:19:47 -0800
   Commit: 9077e72, github.com/apache/spark/pull/3769

   [SPARK-4972][MLlib] Updated the scala doc for lasso and ridge regression for the change of LeastSquaresGradient
   DB Tsai <dbtsai@alpinenow.com>
   2014-12-29 17:17:12 -0800
   Commit: 040d6f2, github.com/apache/spark/pull/3808

   Added setMinCount to Word2Vec.scala
   ganonp <ganonp@gmail.com>
   2014-12-29 15:31:19 -0800
   Commit: 343db39, github.com/apache/spark/pull/3693

   SPARK-4156 [MLLIB] EM algorithm for GMMs
   Travis Galoppo <tjg2107@columbia.edu>, Travis Galoppo <travis@localhost.localdomain>, tgaloppo <tjg2107@columbia.edu>, FlytxtRnD <meethu.mathew@flytxt.com>
   2014-12-29 15:29:15 -0800
   Commit: 6cf6fdf, github.com/apache/spark/pull/3022

   SPARK-4968: takeOrdered to skip reduce step in case mappers return no partitions
   Yash Datta <Yash.Datta@guavus.com>
   2014-12-29 13:49:45 -0800
   Commit: 9bc0df6, github.com/apache/spark/pull/3830

   [SPARK-4409][MLlib] Additional Linear Algebra Utils
   Burak Yavuz <brkyvz@gmail.com>, Xiangrui Meng <meng@databricks.com>
   2014-12-29 13:24:26 -0800
   Commit: 02b55de, github.com/apache/spark/pull/3319

   [Minor] Fix a typo of type parameter in JavaUtils.scala
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2014-12-29 12:05:08 -0800
   Commit: 8d72341, github.com/apache/spark/pull/3789

   [SPARK-4946] [CORE] Using AkkaUtils.askWithReply in MapOutputTracker.askTracker to reduce the chance of the communicating problem
   YanTangZhai <hakeemzhai@tencent.com>, yantangzhai <tyz0303@163.com>
   2014-12-29 11:30:54 -0800
   Commit: 815de54, github.com/apache/spark/pull/3785

   Added LICENSE Header to build/mvn, build/sbt and sbt/sbt
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2014-12-29 10:48:53 -0800
   Commit: 4cef05e, github.com/apache/spark/pull/3817

   [SPARK-4982][DOC] `spark.ui.retainedJobs` description is wrong in Spark UI configuration guide
   wangxiaojing <u9jing@gmail.com>
   2014-12-29 10:45:14 -0800
   Commit: 6645e52, github.com/apache/spark/pull/3818

   [SPARK-4966][YARN]The MemoryOverhead value is not set correctly
   meiyoula <1039320815@qq.com>
   2014-12-29 08:20:30 -0600
   Commit: 14fa87b, github.com/apache/spark/pull/3797

   [SPARK-4501][Core] - Create build/mvn to automatically download maven/zinc/scalac
   Brennon York <brennon.york@capitalone.com>
   2014-12-27 13:25:18 -0800
   Commit: a3e51cc, github.com/apache/spark/pull/3707

   [SPARK-4952][Core]Handle ConcurrentModificationExceptions in SparkEnv.environmentDetails
   GuoQiang Li <witgo@qq.com>
   2014-12-26 23:31:29 -0800
   Commit: 080ceb7, github.com/apache/spark/pull/3788

   [SPARK-4954][Core] add spark version information in log for standalone mode
   Zhang, Liye <liye.zhang@intel.com>
   2014-12-26 23:23:13 -0800
   Commit: 786808a, github.com/apache/spark/pull/3790

   [SPARK-3955] Different versions between jackson-mapper-asl and jackson-c...
   Jongyoul Lee <jongyoul@gmail.com>
   2014-12-26 22:59:34 -0800
   Commit: 2483c1e, github.com/apache/spark/pull/3716

   HOTFIX: Slight tweak on previous commit.
   Patrick Wendell <pwendell@gmail.com>
   2014-12-26 22:55:04 -0800
   Commit: 82bf4be

   [SPARK-3787][BUILD] Assembly jar name is wrong when we build with sbt omitting -Dhadoop.version
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2014-12-26 22:52:04 -0800
   Commit: de95c57, github.com/apache/spark/pull/3046

   MAINTENANCE: Automated closing of pull requests.
   Patrick Wendell <pwendell@gmail.com>
   2014-12-26 22:39:56 -0800
   Commit: 534f24b, github.com/apache/spark/pull/3456

   SPARK-4971: Fix typo in BlockGenerator comment
   CodingCat <zhunansjtu@gmail.com>
   2014-12-26 12:03:22 -0800
   Commit: fda4331, github.com/apache/spark/pull/3807

   [SPARK-4608][Streaming] Reorganize StreamingContext implicit to improve API convenience
   zsxwing <zsxwing@gmail.com>
   2014-12-25 19:46:05 -0800
   Commit: f9ed2b6, github.com/apache/spark/pull/3464

   [SPARK-4537][Streaming] Expand StreamingSource to add more metrics
   jerryshao <saisai.shao@intel.com>
   2014-12-25 19:39:49 -0800
   Commit: f205fe4, github.com/apache/spark/pull/3466

   [EC2] Update mesos/spark-ec2 branch to branch-1.3
   Nicholas Chammas <nicholas.chammas@gmail.com>
   2014-12-25 14:16:50 -0800
   Commit: ac82785, github.com/apache/spark/pull/3804

   [EC2] Update default Spark version to 1.2.0
   Nicholas Chammas <nicholas.chammas@gmail.com>
   2014-12-25 14:13:12 -0800
   Commit: b6b6393, github.com/apache/spark/pull/3793

   Fix "Building Spark With Maven" link in README.md
   Denny Lee <denny.g.lee@gmail.com>
   2014-12-25 14:05:55 -0800
   Commit: 08b18c7, github.com/apache/spark/pull/3802

   [SPARK-4953][Doc] Fix the description of building Spark with YARN
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2014-12-25 07:05:43 -0800
   Commit: 11dd993, github.com/apache/spark/pull/3787

   [SPARK-4873][Streaming] Use `Future.zip` instead of `Future.flatMap`(for-loop) in WriteAheadLogBasedBlockHandler
   zsxwing <zsxwing@gmail.com>
   2014-12-24 19:49:41 -0800
   Commit: b4d0db8, github.com/apache/spark/pull/3721

   SPARK-4297 [BUILD] Build warning fixes omnibus
   Sean Owen <sowen@cloudera.com>
   2014-12-24 13:32:51 -0800
   Commit: 29fabb1, github.com/apache/spark/pull/3157

   [SPARK-4881][Minor] Use SparkConf#getBoolean instead of get().toBoolean
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2014-12-23 19:14:34 -0800
   Commit: 199e59a, github.com/apache/spark/pull/3733

   [SPARK-4860][pyspark][sql] speeding up `sample()` and `takeSample()`
   jbencook <jbenjamincook@gmail.com>, J. Benjamin Cook <jbenjamincook@gmail.com>
   2014-12-23 17:46:24 -0800
   Commit: fd41eb9, github.com/apache/spark/pull/3764

   [SPARK-4606] Send EOF to child JVM when there's no more data to read.
   Marcelo Vanzin <vanzin@cloudera.com>
   2014-12-23 16:02:59 -0800
   Commit: 7e2deb7, github.com/apache/spark/pull/3460

   [SPARK-4671][Streaming]Do not replicate streaming block when WAL is enabled
   jerryshao <saisai.shao@intel.com>
   2014-12-23 15:45:53 -0800
   Commit: 3f5f4cc, github.com/apache/spark/pull/3534

   [SPARK-4802] [streaming] Remove receiverInfo once receiver is de-registered
   Ilayaperumal Gopinathan <igopinathan@pivotal.io>
   2014-12-23 15:14:54 -0800
   Commit: 10d69e9, github.com/apache/spark/pull/3647

   [SPARK-4913] Fix incorrect event log path
   Liang-Chi Hsieh <viirya@gmail.com>
   2014-12-23 14:58:33 -0800
   Commit: 96281cd, github.com/apache/spark/pull/3755

   [SPARK-4730][YARN] Warn against deprecated YARN settings
   Andrew Or <andrew@databricks.com>
   2014-12-23 14:28:36 -0800
   Commit: 27c5399, github.com/apache/spark/pull/3590

   [SPARK-4914][Build] Cleans lib_managed before compiling with Hive 0.13.1
   Cheng Lian <lian@databricks.com>
   2014-12-23 12:54:20 -0800
   Commit: 395b771, github.com/apache/spark/pull/3756

   [SPARK-4932] Add help comments in Analytics
   Takeshi Yamamuro <linguin.m.s@gmail.com>
   2014-12-23 12:39:41 -0800
   Commit: 9c251c5, github.com/apache/spark/pull/3775

   [SPARK-4834] [standalone] Clean up application files after app finishes.
   Marcelo Vanzin <vanzin@cloudera.com>
   2014-12-23 12:02:08 -0800
   Commit: dd15536, github.com/apache/spark/pull/3705

   [SPARK-4931][Yarn][Docs] Fix the format of running-on-yarn.md
   zsxwing <zsxwing@gmail.com>
   2014-12-23 11:18:06 -0800
   Commit: 2d215ae, github.com/apache/spark/pull/3774

   [SPARK-4890] Ignore downloaded EC2 libs
   Nicholas Chammas <nicholas.chammas@gmail.com>
   2014-12-23 11:12:16 -0800
   Commit: 2823c7f, github.com/apache/spark/pull/3770

   [Docs] Minor typo fixes
   Nicholas Chammas <nicholas.chammas@gmail.com>
   2014-12-22 22:54:32 -0800
   Commit: 0e532cc, github.com/apache/spark/pull/3772

   [SPARK-4907][MLlib] Inconsistent loss and gradient in LeastSquaresGradient compared with R
   DB Tsai <dbtsai@alpinenow.com>
   2014-12-22 16:42:55 -0800
   Commit: a96b727, github.com/apache/spark/pull/3746

   [SPARK-4818][Core] Add 'iterator' to reduce memory consumed by join
   zsxwing <zsxwing@gmail.com>
   2014-12-22 14:26:28 -0800
   Commit: c233ab3, github.com/apache/spark/pull/3671

   [SPARK-4920][UI]:current spark version in UI is not striking.
   genmao.ygm <genmao.ygm@alibaba-inc.com>
   2014-12-22 14:14:39 -0800
   Commit: de9d7d2, github.com/apache/spark/pull/3763

   [Minor] Fix scala doc
   Liang-Chi Hsieh <viirya@gmail.com>
   2014-12-22 14:13:31 -0800
   Commit: a61aa66, github.com/apache/spark/pull/3751

   [SPARK-4864] Add documentation to Netty-based configs
   Aaron Davidson <aaron@databricks.com>
   2014-12-22 13:09:22 -0800
   Commit: fbca6b6, github.com/apache/spark/pull/3713

   [SPARK-4079] [CORE] Consolidates Errors if a CompressionCodec is not available
   Kostas Sakellis <kostas@cloudera.com>
   2014-12-22 13:07:01 -0800
   Commit: 7c0ed13, github.com/apache/spark/pull/3119

   SPARK-4447. Remove layers of abstraction in YARN code no longer needed after dropping yarn-alpha
   Sandy Ryza <sandy@cloudera.com>
   2014-12-22 12:23:43 -0800
   Commit: d62da64, github.com/apache/spark/pull/3652

   [SPARK-4733] Add missing parameter comments in ShuffleDependency
   Takeshi Yamamuro <linguin.m.s@gmail.com>
   2014-12-22 12:19:23 -0800
   Commit: fb8e85e, github.com/apache/spark/pull/3594

   [Minor] Improve some code in BroadcastTest for short
   carlmartin <carlmartinmax@gmail.com>
   2014-12-22 12:13:53 -0800
   Commit: 1d9788e, github.com/apache/spark/pull/3750

   [SPARK-4883][Shuffle] Add a name to the directoryCleaner thread
   zsxwing <zsxwing@gmail.com>
   2014-12-22 12:11:36 -0800
   Commit: 8773705, github.com/apache/spark/pull/3734

   [SPARK-4870] Add spark version to driver log
   Zhang, Liye <liye.zhang@intel.com>
   2014-12-22 11:36:49 -0800
   Commit: 39272c8, github.com/apache/spark/pull/3717

   [SPARK-4915][YARN] Fix classname to be specified for external shuffle service.
   Tsuyoshi Ozawa <ozawa.tsuyoshi@lab.ntt.co.jp>
   2014-12-22 11:28:05 -0800
   Commit: 96606f6, github.com/apache/spark/pull/3757

   [SPARK-4918][Core] Reuse Text in saveAsTextFile
   zsxwing <zsxwing@gmail.com>
   2014-12-22 11:20:00 -0800
   Commit: 93b2f3a, github.com/apache/spark/pull/3762

   [SPARK-2075][Core] Make the compiler generate same bytecode for Hadoop 1.+ and Hadoop 2.+
   zsxwing <zsxwing@gmail.com>
   2014-12-21 22:10:19 -0800
   Commit: 6ee6aa7, github.com/apache/spark/pull/3740

   SPARK-4910 [CORE] build failed (use of FileStatus.isFile in Hadoop 1.x)
   Sean Owen <sowen@cloudera.com>
   2014-12-21 13:16:57 -0800
   Commit: c6a3c0d, github.com/apache/spark/pull/3754

   [Minor] Build Failed: value defaultProperties not found
   huangzhaowei <carlmartinmax@gmail.com>
   2014-12-19 23:32:56 -0800
   Commit: a764960, github.com/apache/spark/pull/3749

   [SPARK-4140] Document dynamic allocation
   Andrew Or <andrew@databricks.com>, Tsuyoshi Ozawa <ozawa.tsuyoshi@gmail.com>
   2014-12-19 19:36:20 -0800
   Commit: 15c03e1, github.com/apache/spark/pull/3731

   [SPARK-4831] Do not include SPARK_CLASSPATH if empty
   Daniel Darabos <darabos.daniel@gmail.com>
   2014-12-19 19:32:39 -0800
   Commit: 7cb3f54, github.com/apache/spark/pull/3678

   SPARK-2641: Passing num executors to spark arguments from properties file
   Kanwaljit Singh <kanwaljit.singh@guavus.com>
   2014-12-19 19:25:39 -0800
   Commit: 1d64812, github.com/apache/spark/pull/1657

   [SPARK-3060] spark-shell.cmd doesn't accept application options in Windows OS
   Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
   2014-12-19 19:19:53 -0800
   Commit: 8d93247, github.com/apache/spark/pull/3350

   change signature of example to match released code
   Eran Medan <ehrann.mehdan@gmail.com>
   2014-12-19 18:29:36 -0800
   Commit: c25c669, github.com/apache/spark/pull/3747

   [SPARK-2261] Make event logger use a single file.
   Marcelo Vanzin <vanzin@cloudera.com>
   2014-12-19 18:21:15 -0800
   Commit: 4564519, github.com/apache/spark/pull/1222

   [SPARK-4890] Upgrade Boto to 2.34.0; automatically download Boto from PyPi instead of packaging it
   Josh Rosen <joshrosen@databricks.com>
   2014-12-19 17:02:37 -0800
   Commit: c28083f, github.com/apache/spark/pull/3737

   [SPARK-4896] don’t redundantly overwrite executor JAR deps
   Ryan Williams <ryan.blake.williams@gmail.com>
   2014-12-19 15:24:41 -0800
   Commit: 7981f96, github.com/apache/spark/pull/2848

   [SPARK-4889] update history server example cmds
   Ryan Williams <ryan.blake.williams@gmail.com>
   2014-12-19 13:56:04 -0800
   Commit: cdb2c64, github.com/apache/spark/pull/3736

   Small refactoring to pass SparkEnv into Executor rather than creating SparkEnv in Executor.
   Reynold Xin <rxin@databricks.com>
   2014-12-19 12:51:12 -0800
   Commit: 336cd34, github.com/apache/spark/pull/3738

   [Build] Remove spark-staging-1038
   scwf <wangfei1@huawei.com>
   2014-12-19 08:29:38 -0800
   Commit: 8e253eb, github.com/apache/spark/pull/3743

   [SPARK-4901] [SQL] Hot fix for ByteWritables.copyBytes
   Cheng Hao <hao.cheng@intel.com>
   2014-12-19 08:04:41 -0800
   Commit: 5479450, github.com/apache/spark/pull/3742

   SPARK-3428. TaskMetrics for running tasks is missing GC time metrics
   Sandy Ryza <sandy@cloudera.com>
   2014-12-18 22:40:44 -0800
   Commit: 283263f, github.com/apache/spark/pull/3684

   [SPARK-4674] Refactor getCallSite
   Liang-Chi Hsieh <viirya@gmail.com>
   2014-12-18 21:41:02 -0800
   Commit: d7fc69a, github.com/apache/spark/pull/3532

   [SPARK-4728][MLLib] Add exponential, gamma, and log normal sampling to MLlib da...
   RJ Nowling <rnowling@gmail.com>
   2014-12-18 21:00:49 -0800
   Commit: ee1fb97, github.com/apache/spark/pull/3680

   [SPARK-4861][SQL] Refactor command in spark sql
   wangfei <wangfei1@huawei.com>, scwf <wangfei1@huawei.com>
   2014-12-18 20:24:56 -0800
   Commit: c3d91da, github.com/apache/spark/pull/3712

   [SPARK-4573] [SQL] Add SettableStructObjectInspector support in "wrap" function
   Cheng Hao <hao.cheng@intel.com>
   2014-12-18 20:21:52 -0800
   Commit: ae9f128, github.com/apache/spark/pull/3429

   [SPARK-2554][SQL] Supporting SumDistinct partial aggregation
   ravipesala <ravindra.pesala@huawei.com>
   2014-12-18 20:19:10 -0800
   Commit: 7687415, github.com/apache/spark/pull/3348

   [SPARK-4693] [SQL] PruningPredicates may be wrong if predicates contains an empty AttributeSet() references
   YanTangZhai <hakeemzhai@tencent.com>, yantangzhai <tyz0303@163.com>
   2014-12-18 20:13:46 -0800
   Commit: e7de7e5, github.com/apache/spark/pull/3556

   [SPARK-4756][SQL] FIX: sessionToActivePool grow infinitely, even as sessions expire
   guowei2 <guowei2@asiainfo.com>
   2014-12-18 20:10:23 -0800
   Commit: 22ddb6e, github.com/apache/spark/pull/3617

   [SPARK-3928][SQL] Support wildcard matches on Parquet files.
   Thu Kyaw <trk007@gmail.com>
   2014-12-18 20:08:32 -0800
   Commit: b68bc6d, github.com/apache/spark/pull/3407

   [SPARK-2663] [SQL] Support the Grouping Set
   Cheng Hao <hao.cheng@intel.com>
   2014-12-18 18:58:29 -0800
   Commit: f728e0f, github.com/apache/spark/pull/1567

   [SPARK-4754] Refactor SparkContext into ExecutorAllocationClient
   Andrew Or <andrew@databricks.com>
   2014-12-18 17:37:42 -0800
   Commit: 9804a75, github.com/apache/spark/pull/3614

   [SPARK-4837] NettyBlockTransferService should use spark.blockManager.port config
   Aaron Davidson <aaron@databricks.com>
   2014-12-18 16:43:16 -0800
   Commit: 105293a, github.com/apache/spark/pull/3688

   SPARK-4743 - Use SparkEnv.serializer instead of closureSerializer in aggregateByKey and foldByKey
   Ivan Vergiliev <ivan@leanplum.com>
   2014-12-18 16:29:36 -0800
   Commit: f9f58b9, github.com/apache/spark/pull/3605

   [SPARK-4884]: Improve Partition docs
   Madhu Siddalingaiah <madhu@madhu.com>
   2014-12-18 16:00:53 -0800
   Commit: d5a596d, github.com/apache/spark/pull/3722

   [SPARK-4880] remove spark.locality.wait in Analytics
   Ernest <earneyzxl@gmail.com>
   2014-12-18 15:42:26 -0800
   Commit: a7ed6f3, github.com/apache/spark/pull/3730

   [SPARK-4887][MLlib] Fix a bad unittest in LogisticRegressionSuite
   DB Tsai <dbtsai@alpinenow.com>
   2014-12-18 13:55:49 -0800
   Commit: 59a49db, github.com/apache/spark/pull/3735

   [SPARK-3607] ConnectionManager threads.max configs on the thread pools don't work
   Ilya Ganelin <ilya.ganelin@capitalone.com>
   2014-12-18 12:53:18 -0800
   Commit: 3720057, github.com/apache/spark/pull/3664

   Add mesos specific configurations into doc
   Timothy Chen <tnachen@gmail.com>
   2014-12-18 12:15:53 -0800
   Commit: d9956f8, github.com/apache/spark/pull/3349

   SPARK-3779. yarn spark.yarn.applicationMaster.waitTries config should be...
   Sandy Ryza <sandy@cloudera.com>
   2014-12-18 12:19:07 -0600
   Commit: 253b72b, github.com/apache/spark/pull/3471

   [SPARK-4461][YARN] pass extra java options to yarn application master
   Zhan Zhang <zhazhan@gmail.com>
   2014-12-18 10:01:46 -0600
   Commit: 3b76469, github.com/apache/spark/pull/3409

   [SPARK-4822] Use sphinx tags for Python doc annotations
   lewuathe <lewuathe@me.com>
   2014-12-17 17:31:24 -0800
   Commit: 3cd5161, github.com/apache/spark/pull/3685

   MAINTENANCE: Automated closing of pull requests.
   Patrick Wendell <pwendell@gmail.com>
   2014-12-17 15:50:10 -0800
   Commit: ca12608, github.com/apache/spark/pull/3137

   [SPARK-3891][SQL] Add array support to percentile, percentile_approx and constant inspectors support
   Venkata Ramana G <ramana.gollamudi@huawei.com>, Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
   2014-12-17 15:41:35 -0800
   Commit: f33d550, github.com/apache/spark/pull/2802

   [SPARK-4856] [SQL] NullType instead of StringType when sampling against empty string or nul...
   Cheng Hao <hao.cheng@intel.com>
   2014-12-17 15:01:59 -0800
   Commit: 8d0d2a6, github.com/apache/spark/pull/3708

   [HOTFIX][SQL] Fix parquet filter suite
   Michael Armbrust <michael@databricks.com>
   2014-12-17 14:27:02 -0800
   Commit: 19c0faa, github.com/apache/spark/pull/3727

   [SPARK-4821] [mllib] [python] [docs] Fix for pyspark.mllib.rand doc
   Joseph K. Bradley <joseph@databricks.com>
   2014-12-17 14:12:46 -0800
   Commit: affc3f4, github.com/apache/spark/pull/3669

   [SPARK-3739] [SQL] Update the split num base on block size for table scanning
   Cheng Hao <hao.cheng@intel.com>
   2014-12-17 13:39:36 -0800
   Commit: 636d9fc, github.com/apache/spark/pull/2589

   [SPARK-4755] [SQL] sqrt(negative value) should return null
   Daoyuan Wang <daoyuan.wang@intel.com>
   2014-12-17 12:51:27 -0800
   Commit: 902e4d5, github.com/apache/spark/pull/3616

   [SPARK-4493][SQL] Don't pushdown Eq, NotEq, Lt, LtEq, Gt and GtEq predicates with nulls for Parquet
   Cheng Lian <lian@databricks.com>
   2014-12-17 12:48:04 -0800
   Commit: 6277135, github.com/apache/spark/pull/3367

   [SPARK-3698][SQL] Fix case insensitive resolution of GetField.
   Michael Armbrust <michael@databricks.com>
   2014-12-17 12:43:51 -0800
   Commit: 7ad579e, github.com/apache/spark/pull/3724

   [SPARK-4694] Fix HiveThriftServer2 can't stop in Yarn HA mode.
   carlmartin <carlmartinmax@gmail.com>
   2014-12-17 12:24:03 -0800
   Commit: 4782def, github.com/apache/spark/pull/3576

   [SPARK-4625] [SQL] Add sort by for DSL & SimpleSqlParser
   Cheng Hao <hao.cheng@intel.com>
   2014-12-17 12:01:57 -0800
   Commit: 5fdcbdc, github.com/apache/spark/pull/3481

   [SPARK-4595][Core] Fix MetricsServlet not work issue
   Saisai Shao <saisai.shao@intel.com>, Josh Rosen <joshrosen@databricks.com>, jerryshao <saisai.shao@intel.com>
   2014-12-17 11:47:44 -0800
   Commit: cf50631, github.com/apache/spark/pull/3444

   [HOTFIX] Fix RAT exclusion for known_translations file
   Josh Rosen <joshrosen@databricks.com>
   2014-12-16 23:00:25 -0800
   Commit: 3d0c37b, github.com/apache/spark/pull/3719

   [Release] Update contributors list format and sort it
   Andrew Or <andrew@databricks.com>
   2014-12-16 22:11:03 -0800
   Commit: 4e1112e

   [SPARK-4618][SQL] Make foreign DDL commands options case-insensitive
   scwf <wangfei1@huawei.com>, wangfei <wangfei1@huawei.com>
   2014-12-16 21:26:36 -0800
   Commit: 6069880, github.com/apache/spark/pull/3470

   [SPARK-4866] support StructType as key in MapType
   Davies Liu <davies@databricks.com>
   2014-12-16 21:23:28 -0800
   Commit: ec5c427, github.com/apache/spark/pull/3714

   [SPARK-4375] [SQL] Add 0 argument support for udf
   Cheng Hao <hao.cheng@intel.com>
   2014-12-16 21:21:11 -0800
   Commit: 770d815, github.com/apache/spark/pull/3595

   [SPARK-4720][SQL] Remainder should also return null if the divider is 0.
   Takuya UESHIN <ueshin@happy-camper.st>
   2014-12-16 21:19:57 -0800
   Commit: ddc7ba3, github.com/apache/spark/pull/3581

   [SPARK-4744] [SQL] Short circuit evaluation for AND & OR in CodeGen
   Cheng Hao <hao.cheng@intel.com>
   2014-12-16 21:18:39 -0800
   Commit: 0aa834a, github.com/apache/spark/pull/3606

   [SPARK-4798][SQL] A new set of Parquet testing API and test suites
   Cheng Lian <lian@databricks.com>
   2014-12-16 21:16:03 -0800
   Commit: 3b395e1, github.com/apache/spark/pull/3644

   [Release] Cache known author translations locally
   Andrew Or <andrew@databricks.com>
   2014-12-16 19:28:43 -0800
   Commit: b85044e

   [Release] Major improvements to generate contributors script
   Andrew Or <andrew@databricks.com>
   2014-12-16 17:55:27 -0800
   Commit: 6f80b74

   [SPARK-4269][SQL] make wait time configurable in BroadcastHashJoin
   Jacky Li <jacky.likun@huawei.com>
   2014-12-16 15:34:59 -0800
   Commit: fa66ef6, github.com/apache/spark/pull/3133

   [SPARK-4827][SQL] Fix resolution of deeply nested Project(attr, Project(Star,...)).
   Michael Armbrust <michael@databricks.com>
   2014-12-16 15:31:19 -0800
   Commit: a66c23e, github.com/apache/spark/pull/3674

   [SPARK-4483][SQL]Optimization about reduce memory costs during the HashOuterJoin
   tianyi <tianyi@asiainfo-linkage.com>, tianyi <tianyi.asiainfo@gmail.com>
   2014-12-16 15:22:29 -0800
   Commit: 30f6b85, github.com/apache/spark/pull/3375

   [SPARK-4527][SQL]Add BroadcastNestedLoopJoin operator selection testsuite
   wangxiaojing <u9jing@gmail.com>
   2014-12-16 14:45:56 -0800
   Commit: ea1315e, github.com/apache/spark/pull/3395

   SPARK-4767: Add support for launching in a specified placement group to spark_ec2
   Holden Karau <holden@pigscanfly.ca>
   2014-12-16 14:37:04 -0800
   Commit: b0dfdbd, github.com/apache/spark/pull/3623

   [SPARK-4812][SQL] Fix the initialization issue of 'codegenEnabled'
   zsxwing <zsxwing@gmail.com>
   2014-12-16 14:13:40 -0800
   Commit: 6530243, github.com/apache/spark/pull/3660

   [SPARK-4847][SQL]Fix "extraStrategies cannot take effect in SQLContext" issue
   jerryshao <saisai.shao@intel.com>
   2014-12-16 14:08:28 -0800
   Commit: dc8280d, github.com/apache/spark/pull/3698

   [DOCS][SQL] Add a Note on jsonFile having separate JSON objects per line
   Peter Vandenabeele <peter@vandenabeele.com>
   2014-12-16 13:57:55 -0800
   Commit: 1a9e35e, github.com/apache/spark/pull/3517

   [SQL] SPARK-4700: Add HTTP protocol spark thrift server
   Judy Nash <judynash@microsoft.com>, judynash <judynash@microsoft.com>
   2014-12-16 12:37:26 -0800
   Commit: 17688d1, github.com/apache/spark/pull/3672

   [SPARK-3405] add subnet-id and vpc-id options to spark_ec2.py
   Mike Jennings <mvj101@gmail.com>, Mike Jennings <mvj@google.com>
   2014-12-16 12:13:21 -0800
   Commit: d12c071, github.com/apache/spark/pull/2872

   [SPARK-4855][mllib] testing the Chi-squared hypothesis test
   jbencook <jbenjamincook@gmail.com>
   2014-12-16 11:37:23 -0800
   Commit: cb48447, github.com/apache/spark/pull/3679

   [SPARK-4437] update doc for WholeCombineFileRecordReader
   Davies Liu <davies@databricks.com>, Josh Rosen <joshrosen@databricks.com>
   2014-12-16 11:19:36 -0800
   Commit: ed36200, github.com/apache/spark/pull/3301

   [SPARK-4841] fix zip with textFile()
   Davies Liu <davies@databricks.com>
   2014-12-15 22:58:26 -0800
   Commit: c246b95, github.com/apache/spark/pull/3706

   [SPARK-4792] Add error message when making local dir unsuccessfully
   meiyoula <1039320815@qq.com>
   2014-12-15 22:30:18 -0800
   Commit: c762877, github.com/apache/spark/pull/3635

   SPARK-4814 [CORE] Enable assertions in SBT, Maven tests / AssertionError from Hive's LazyBinaryInteger
   Sean Owen <sowen@cloudera.com>
   2014-12-15 17:12:05 -0800
   Commit: 81112e4, github.com/apache/spark/pull/3692

   [Minor][Core] fix comments in MapOutputTracker
   wangfei <wangfei1@huawei.com>
   2014-12-15 16:46:21 -0800
   Commit: 5c24759, github.com/apache/spark/pull/3700

   SPARK-785 [CORE] ClosureCleaner not invoked on most PairRDDFunctions
   Sean Owen <sowen@cloudera.com>
   2014-12-15 16:06:15 -0800
   Commit: 2a28bc6, github.com/apache/spark/pull/3690

   [SPARK-4668] Fix some documentation typos.
   Ryan Williams <ryan.blake.williams@gmail.com>
   2014-12-15 14:52:17 -0800
   Commit: 8176b7a, github.com/apache/spark/pull/3523

   [SPARK-1037] The name of findTaskFromList & findTask in TaskSetManager.scala is confusing
   Ilya Ganelin <ilya.ganelin@capitalone.com>
   2014-12-15 14:51:15 -0800
   Commit: 38703bb, github.com/apache/spark/pull/3665

   [SPARK-4826] Fix generation of temp file names in WAL tests
   Josh Rosen <joshrosen@databricks.com>
   2014-12-15 14:33:43 -0800
   Commit: f6b8591, github.com/apache/spark/pull/3695

   [SPARK-4494][mllib] IDFModel.transform() add support for single vector
   Yuu ISHIKAWA <yuu.ishikawa@gmail.com>
   2014-12-15 13:44:15 -0800
   Commit: 8098fab, github.com/apache/spark/pull/3603

   HOTFIX: Disabling failing block manager test
   Patrick Wendell <pwendell@gmail.com>
   2014-12-15 10:54:45 -0800
   Commit: 4c06738

   fixed spelling errors in documentation
   Peter Klipfel <peter@klipfel.me>
   2014-12-14 00:01:16 -0800
   Commit: 2a2983f, github.com/apache/spark/pull/3691

   MAINTENANCE: Automated closing of pull requests.
   Patrick Wendell <pwendell@gmail.com>
   2014-12-11 23:38:40 -0800
   Commit: ef84dab, github.com/apache/spark/pull/3488

   [SPARK-4829] [SQL] add rule to fold count(expr) if expr is not null
   Daoyuan Wang <daoyuan.wang@intel.com>
   2014-12-11 22:56:42 -0800
   Commit: 41a3f93, github.com/apache/spark/pull/3676

   [SPARK-4742][SQL] The name of Parquet File generated by AppendingParquetOutputFormat should be zero padded
   Sasaki Toru <sasakitoa@nttdata.co.jp>
   2014-12-11 22:54:21 -0800
   Commit: 8091dd6, github.com/apache/spark/pull/3602

   [SPARK-4825] [SQL] CTAS fails to resolve when created using saveAsTable
   Cheng Hao <hao.cheng@intel.com>
   2014-12-11 22:51:49 -0800
   Commit: 0abbff2, github.com/apache/spark/pull/3673

   [SQL] enable empty aggr test case
   Daoyuan Wang <daoyuan.wang@intel.com>
   2014-12-11 22:50:18 -0800
   Commit: cbb634a, github.com/apache/spark/pull/3445

   [SPARK-4828] [SQL] sum and avg on empty table should always return null
   Daoyuan Wang <daoyuan.wang@intel.com>
   2014-12-11 22:49:27 -0800
   Commit: acb3be6, github.com/apache/spark/pull/3675

   [SQL] Remove unnecessary case in HiveContext.toHiveString
   scwf <wangfei1@huawei.com>
   2014-12-11 22:48:03 -0800
   Commit: d8cf678, github.com/apache/spark/pull/3563

   [SPARK-4293][SQL] Make Cast be able to handle complex types.
   Takuya UESHIN <ueshin@happy-camper.st>
   2014-12-11 22:45:25 -0800
   Commit: 3344803, github.com/apache/spark/pull/3150

   [SPARK-4639] [SQL] Pass maxIterations in as a parameter in Analyzer
   Jacky Li <jacky.likun@huawei.com>
   2014-12-11 22:44:27 -0800
   Commit: c152dde, github.com/apache/spark/pull/3499

   [SPARK-4662] [SQL] Whitelist more unittest
   Cheng Hao <hao.cheng@intel.com>
   2014-12-11 22:43:02 -0800
   Commit: a7f07f5, github.com/apache/spark/pull/3522

   [SPARK-4713] [SQL] SchemaRDD.unpersist() should not raise exception if it is not persisted
   Cheng Hao <hao.cheng@intel.com>
   2014-12-11 22:41:36 -0800
   Commit: bf40cf8, github.com/apache/spark/pull/3572

   [SPARK-4806] Streaming doc update for 1.2
   Tathagata Das <tathagata.das1565@gmail.com>, Josh Rosen <joshrosen@databricks.com>, Josh Rosen <rosenville@gmail.com>
   2014-12-11 06:21:23 -0800
   Commit: b004150, github.com/apache/spark/pull/3653

   [SPARK-4791] [sql] Infer schema from case class with multiple constructors
   Joseph K. Bradley <joseph@databricks.com>
   2014-12-10 23:41:15 -0800
   Commit: 2a5b5fd, github.com/apache/spark/pull/3646

   [CORE]codeStyle: uniform ConcurrentHashMap define in StorageLevel.scala with other places
   Zhang, Liye <liye.zhang@intel.com>
   2014-12-10 20:44:59 -0800
   Commit: 57d37f9, github.com/apache/spark/pull/2793

   SPARK-3526 Add section about data locality to the tuning guide
   Andrew Ash <andrew@andrewash.com>
   2014-12-10 15:01:15 -0800
   Commit: 652b781, github.com/apache/spark/pull/2519

   MAINTENANCE: Automated closing of pull requests.
   Patrick Wendell <pwendell@gmail.com>
   2014-12-10 14:41:16 -0800
   Commit: 36bdb5b, github.com/apache/spark/pull/2883

   [SPARK-4759] Fix driver hanging from coalescing partitions
   Andrew Or <andrew@databricks.com>
   2014-12-10 14:27:53 -0800
   Commit: 4f93d0c, github.com/apache/spark/pull/3633

   [SPARK-4569] Rename 'externalSorting' in Aggregator
   Ilya Ganelin <ilya.ganelin@capitalone.com>
   2014-12-10 14:19:37 -0800
   Commit: 447ae2d, github.com/apache/spark/pull/3666

   [SPARK-4793] [Deploy] ensure .jar at end of line
   Daoyuan Wang <daoyuan.wang@intel.com>
   2014-12-10 13:29:27 -0800
   Commit: e230da1, github.com/apache/spark/pull/3641

   [SPARK-4215] Allow requesting / killing executors only in YARN mode
   Andrew Or <andrew@databricks.com>
   2014-12-10 12:48:24 -0800
   Commit: faa8fd8, github.com/apache/spark/pull/3615

   [SPARK-4771][Docs] Document standalone cluster supervise mode
   Andrew Or <andrew@databricks.com>
   2014-12-10 12:41:36 -0800
   Commit: 5621283, github.com/apache/spark/pull/3627

   [SPARK-4329][WebUI] HistoryPage pagination
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2014-12-10 12:29:00 -0800
   Commit: 0fc637b, github.com/apache/spark/pull/3194

   [SPARK-4161]Spark shell class path is not correctly set if "spark.driver.extraClassPath" is set in defaults.conf
   GuoQiang Li <witgo@qq.com>
   2014-12-10 12:24:04 -0800
   Commit: 742e709, github.com/apache/spark/pull/3050

   [SPARK-4772] Clear local copies of accumulators as soon as we're done with them
   Nathan Kronenfeld <nkronenfeld@oculusinfo.com>
   2014-12-09 23:53:17 -0800
   Commit: 94b377f, github.com/apache/spark/pull/3570

   [Minor] Use <sup> tag for help icon in web UI page header
   Josh Rosen <joshrosen@databricks.com>
   2014-12-09 23:47:05 -0800
   Commit: f79c1cf, github.com/apache/spark/pull/3659

   Config updates for the new shuffle transport.
   Reynold Xin <rxin@databricks.com>
   2014-12-09 19:29:09 -0800
   Commit: 9bd9334, github.com/apache/spark/pull/3657

   [SPARK-4740] Create multiple concurrent connections between two peer nodes in Netty.
   Reynold Xin <rxin@databricks.com>
   2014-12-09 17:49:59 -0800
   Commit: 2b9b726, github.com/apache/spark/pull/3625

   SPARK-4805 [CORE] BlockTransferMessage.toByteArray() trips assertion
   Sean Owen <sowen@cloudera.com>
   2014-12-09 16:38:27 -0800
   Commit: d8f84f2, github.com/apache/spark/pull/3650

   SPARK-4567. Make SparkJobInfo and SparkStageInfo serializable
   Sandy Ryza <sandy@cloudera.com>
   2014-12-09 16:26:07 -0800
   Commit: 5e4c06f, github.com/apache/spark/pull/3426

   [SPARK-4714] BlockManager.dropFromMemory() should check whether block has been removed after synchronizing on BlockInfo instance.
   hushan[胡珊] <hushan@xiaomi.com>
   2014-12-09 15:11:20 -0800
   Commit: 30dca92, github.com/apache/spark/pull/3574

   [SPARK-4765] Make GC time always shown in UI.
   Kay Ousterhout <kayousterhout@gmail.com>
   2014-12-09 15:10:36 -0800
   Commit: 1f51106, github.com/apache/spark/pull/3622

   [SPARK-4691][shuffle] Restructure a few lines in shuffle code
   maji2014 <maji3@asiainfo.com>
   2014-12-09 13:13:12 -0800
   Commit: b310744, github.com/apache/spark/pull/3553

   [SPARK-874] adding a --wait flag
   jbencook <jbenjamincook@gmail.com>
   2014-12-09 12:16:19 -0800
   Commit: 61f1a70, github.com/apache/spark/pull/3567

   SPARK-4338. [YARN] Ditch yarn-alpha.
   Sandy Ryza <sandy@cloudera.com>
   2014-12-09 11:02:43 -0800
   Commit: 912563a, github.com/apache/spark/pull/3215

   [SPARK-4785][SQL] Initialize Hive UDFs on the driver and serialize them with a wrapper
   Cheng Hao <hao.cheng@intel.com>, Cheng Lian <lian@databricks.com>
   2014-12-09 10:28:15 -0800
   Commit: 383c555, github.com/apache/spark/pull/3640

   [SPARK-3154][STREAMING] Replace ConcurrentHashMap with mutable.HashMap and remove @volatile from 'stopped'
   zsxwing <zsxwing@gmail.com>
   2014-12-08 23:54:15 -0800
   Commit: bcb5cda, github.com/apache/spark/pull/3634

   [SPARK-4769] [SQL] CTAS does not work when reading from temporary tables
   Cheng Hao <hao.cheng@intel.com>
   2014-12-08 17:39:12 -0800
   Commit: 51b1fe1, github.com/apache/spark/pull/3336

   [SQL] remove unnecessary import in spark-sql
   Jacky Li <jacky.likun@huawei.com>
   2014-12-08 17:27:46 -0800
   Commit: 9443843, github.com/apache/spark/pull/3630

   SPARK-4770. [DOC] [YARN] spark.scheduler.minRegisteredResourcesRatio doc...
   Sandy Ryza <sandy@cloudera.com>
   2014-12-08 16:28:36 -0800
   Commit: cda94d1, github.com/apache/spark/pull/3624

   SPARK-3926 [CORE] Reopened: result of JavaRDD collectAsMap() is not serializable
   Sean Owen <sowen@cloudera.com>
   2014-12-08 16:13:03 -0800
   Commit: e829bfa, github.com/apache/spark/pull/3587

   [SPARK-4750] Dynamic allocation - synchronize kills
   Andrew Or <andrew@databricks.com>
   2014-12-08 16:02:33 -0800
   Commit: 65f929d, github.com/apache/spark/pull/3612

   [SPARK-4774] [SQL] Makes HiveFromSpark more portable
   Kostas Sakellis <kostas@cloudera.com>
   2014-12-08 15:44:18 -0800
   Commit: d6a972b, github.com/apache/spark/pull/3628

   [SPARK-4764] Ensure that files are fetched atomically
   Christophe Préaud <christophe.preaud@kelkoo.com>
   2014-12-08 11:44:54 -0800
   Commit: ab2abcb, github.com/apache/spark/pull/2855

   [SPARK-4620] Add unpersist in Graph and GraphImpl
   Takeshi Yamamuro <linguin.m.s@gmail.com>
   2014-12-07 19:42:02 -0800
   Commit: 8817fc7, github.com/apache/spark/pull/3476

   [SPARK-4646] Replace Scala.util.Sorting.quickSort with Sorter(TimSort) in Spark
   Takeshi Yamamuro <linguin.m.s@gmail.com>
   2014-12-07 19:36:08 -0800
   Commit: 2e6b736, github.com/apache/spark/pull/3507

   [SPARK-3623][GraphX] GraphX should support the checkpoint operation
   GuoQiang Li <witgo@qq.com>
   2014-12-06 00:56:51 -0800
   Commit: e895e0c, github.com/apache/spark/pull/2631

   Streaming doc : do you mean inadvertently?
   CrazyJvm <crazyjvm@gmail.com>
   2014-12-05 13:42:13 -0800
   Commit: 6eb1b6f, github.com/apache/spark/pull/3620

   [SPARK-4005][CORE] handle message replies in receive instead of in the individual private methods
   Zhang, Liye <liye.zhang@intel.com>
   2014-12-05 12:00:32 -0800
   Commit: 98a7d09, github.com/apache/spark/pull/2853

   [SPARK-4761][SQL] Enables Kryo by default in Spark SQL Thrift server
   Cheng Lian <lian@databricks.com>
   2014-12-05 10:27:40 -0800
   Commit: 6f61e1f, github.com/apache/spark/pull/3621

   [SPARK-4753][SQL] Use catalyst for partition pruning in newParquet.
   Michael Armbrust <michael@databricks.com>
   2014-12-04 22:25:21 -0800
   Commit: f5801e8, github.com/apache/spark/pull/3613

   Revert "SPARK-2624 add datanucleus jars to the container in yarn-cluster"
   Andrew Or <andrew@databricks.com>
   2014-12-04 21:53:49 -0800
   Commit: fd85253

   Revert "[HOT FIX] [YARN] Check whether `/lib` exists before listing its files"
   Andrew Or <andrew@databricks.com>
   2014-12-04 21:53:38 -0800
   Commit: 87437df

   [SPARK-4464] Description about configuration options need to be modified in docs.
   Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
   2014-12-04 19:33:02 -0800
   Commit: ca37903, github.com/apache/spark/pull/3329

   Fix typo in Spark SQL docs.
   Andy Konwinski <andykonwinski@gmail.com>
   2014-12-04 18:27:02 -0800
   Commit: 15cf3b0, github.com/apache/spark/pull/3611

   [SPARK-4421] Wrong link in spark-standalone.html
   Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
   2014-12-04 18:14:36 -0800
   Commit: ddfc09c, github.com/apache/spark/pull/3279

   [SPARK-4397] Move object RDD to the front of RDD.scala.
   Reynold Xin <rxin@databricks.com>
   2014-12-04 16:32:20 -0800
   Commit: ed92b47, github.com/apache/spark/pull/3580

   [SPARK-4652][DOCS] Add docs about spark-git-repo option
   lewuathe <lewuathe@me.com>, Josh Rosen <joshrosen@databricks.com>
   2014-12-04 15:14:36 -0800
   Commit: ab8177d, github.com/apache/spark/pull/3513

   [SPARK-4459] Change groupBy type parameter from K to U
   Saldanha <saldaal1@phusca-l24858.wlan.na.novartis.net>
   2014-12-04 14:22:09 -0800
   Commit: 743a889, github.com/apache/spark/pull/3327

   [SPARK-4745] Fix get_existing_cluster() function with multiple security groups
   alexdebrie <alexdebrie1@gmail.com>
   2014-12-04 14:13:59 -0800
   Commit: 794f3ae, github.com/apache/spark/pull/3596

   [HOTFIX] Fixing two issues with the release script.
   Patrick Wendell <pwendell@gmail.com>
   2014-12-04 12:11:41 -0800
   Commit: 8dae26f, github.com/apache/spark/pull/3608

   [SPARK-4253] Ignore spark.driver.host in yarn-cluster and standalone-cluster modes
   WangTaoTheTonic <barneystinson@aliyun.com>, WangTao <barneystinson@aliyun.com>
   2014-12-04 11:52:47 -0800
   Commit: 8106b1e, github.com/apache/spark/pull/3112

   [SPARK-4683][SQL] Add a beeline.cmd to run on Windows
   Cheng Lian <lian@databricks.com>
   2014-12-04 10:21:03 -0800
   Commit: 28c7aca, github.com/apache/spark/pull/3599

   [FIX][DOC] Fix broken links in ml-guide.md
   Xiangrui Meng <meng@databricks.com>
   2014-12-04 20:16:35 +0800
   Commit: 7e758d7, github.com/apache/spark/pull/3601

   [SPARK-4575] [mllib] [docs] spark.ml pipelines doc + bug fixes
   Joseph K. Bradley <joseph@databricks.com>, jkbradley <joseph.kurata.bradley@gmail.com>, Xiangrui Meng <meng@databricks.com>
   2014-12-04 17:00:06 +0800
   Commit: 469a6e5, github.com/apache/spark/pull/3588

   [docs] Fix outdated comment in tuning guide
   Joseph K. Bradley <joseph@databricks.com>
   2014-12-04 00:59:32 -0800
   Commit: 529439b, github.com/apache/spark/pull/3592

   [SQL] Minor: Avoid calling Seq#size in a loop
   Aaron Davidson <aaron@databricks.com>
   2014-12-04 00:58:42 -0800
   Commit: c6c7165, github.com/apache/spark/pull/3593

   [SPARK-4685] Include all spark.ml and spark.mllib packages in JavaDoc's MLlib group
   lewuathe <lewuathe@me.com>, Xiangrui Meng <meng@databricks.com>
   2014-12-04 16:51:41 +0800
   Commit: 20bfea4, github.com/apache/spark/pull/3554

   [SPARK-4719][API] Consolidate various narrow dep RDD classes with MapPartitionsRDD
   Reynold Xin <rxin@databricks.com>
   2014-12-04 00:45:57 -0800
   Commit: c3ad486, github.com/apache/spark/pull/3578

   [SQL] remove unnecessary import
   Jacky Li <jacky.likun@huawei.com>
   2014-12-04 00:43:55 -0800
   Commit: ed88db4, github.com/apache/spark/pull/3585

   MAINTENANCE: Automated closing of pull requests.
   Patrick Wendell <pwendell@gmail.com>
   2014-12-03 22:15:46 -0800
   Commit: 3cdae03, github.com/apache/spark/pull/1875

   [Release] Correctly translate contributors name in release notes
   Andrew Or <andrew@databricks.com>
   2014-12-03 19:08:29 -0800
   Commit: a4dfb4e

   [SPARK-4580] [SPARK-4610] [mllib] [docs] Documentation for tree ensembles + DecisionTree API fix
   Joseph K. Bradley <joseph@databricks.com>, Joseph K. Bradley <joseph.kurata.bradley@gmail.com>
   2014-12-04 09:57:50 +0800
   Commit: 657a888, github.com/apache/spark/pull/3461

   [SPARK-4711] [mllib] [docs] Programming guide advice on choosing optimizer
   Joseph K. Bradley <joseph@databricks.com>
   2014-12-04 08:58:03 +0800
   Commit: 27ab0b8, github.com/apache/spark/pull/3569

   [SPARK-4085] Propagate FetchFailedException when Spark fails to read local shuffle file.
   Reynold Xin <rxin@databricks.com>
   2014-12-03 16:28:24 -0800
   Commit: 1826372, github.com/apache/spark/pull/3579

   [SPARK-4498][core] Don't transition ExecutorInfo to RUNNING until Driver adds Executor
   Mark Hamstra <markhamstra@gmail.com>
   2014-12-03 15:08:01 -0800
   Commit: 96b2785, github.com/apache/spark/pull/3550

   [SPARK-4552][SQL] Avoid exception when reading empty parquet data through Hive
   Michael Armbrust <michael@databricks.com>
   2014-12-03 14:13:35 -0800
   Commit: 513ef82, github.com/apache/spark/pull/3586

   [HOT FIX] [YARN] Check whether `/lib` exists before listing its files
   Andrew Or <andrew@databricks.com>
   2014-12-03 13:56:23 -0800
   Commit: 90ec643, github.com/apache/spark/pull/3589

   [SPARK-4642] Add description about spark.yarn.queue to running-on-YARN document.
   Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
   2014-12-03 13:16:24 -0800
   Commit: 692f493, github.com/apache/spark/pull/3500

   [SPARK-4715][Core] Make sure tryToAcquire won't return a negative value
   zsxwing <zsxwing@gmail.com>
   2014-12-03 12:19:40 -0800
   Commit: edd3cd4, github.com/apache/spark/pull/3575

   [SPARK-4701] Typo in sbt/sbt
   Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
   2014-12-03 12:08:00 -0800
   Commit: 96786e3, github.com/apache/spark/pull/3560

   SPARK-2624 add datanucleus jars to the container in yarn-cluster
   Jim Lim <jim@quixey.com>
   2014-12-03 11:16:02 -0800
   Commit: a975dc3, github.com/apache/spark/pull/3238

   [SPARK-4717][MLlib] Optimize BLAS library to avoid de-reference multiple times in loop
   DB Tsai <dbtsai@alpinenow.com>
   2014-12-03 22:31:39 +0800
   Commit: d005429, github.com/apache/spark/pull/3577

   [SPARK-4708][MLLib] Make k-means run two/three times faster with dense/sparse sample
   DB Tsai <dbtsai@alpinenow.com>
   2014-12-03 19:01:56 +0800
   Commit: 7fc49ed, github.com/apache/spark/pull/3565

   [SPARK-4710] [mllib] Eliminate MLlib compilation warnings
   Joseph K. Bradley <joseph@databricks.com>
   2014-12-03 18:50:03 +0800
   Commit: 4ac2151, github.com/apache/spark/pull/3568

   [SPARK-4397][Core] Change the 'since' value of '@deprecated' to '1.3.0'
   zsxwing <zsxwing@gmail.com>
   2014-12-03 02:05:17 -0800
   Commit: 8af551f, github.com/apache/spark/pull/3573

   [SPARK-4672][Core]Checkpoint() should clear f to shorten the serialization chain
   JerryLead <JerryLead@163.com>, Lijie Xu <csxulijie@gmail.com>
   2014-12-02 23:53:29 -0800
   Commit: 77be8b9, github.com/apache/spark/pull/3545

   [SPARK-4672][GraphX]Non-transient PartitionsRDDs will lead to StackOverflow error
   JerryLead <JerryLead@163.com>, Lijie Xu <csxulijie@gmail.com>
   2014-12-02 17:14:11 -0800
   Commit: 17c162f, github.com/apache/spark/pull/3544

   [SPARK-4672][GraphX]Perform checkpoint() on PartitionsRDD to shorten the lineage
   JerryLead <JerryLead@163.com>, Lijie Xu <csxulijie@gmail.com>
   2014-12-02 17:08:02 -0800
   Commit: fc0a147, github.com/apache/spark/pull/3549

   [Release] Translate unknown author names automatically
   Andrew Or <andrew@databricks.com>
   2014-12-02 16:36:12 -0800
   Commit: 5da21f0

   Minor nit style cleanup in GraphX.
   Reynold Xin <rxin@databricks.com>
   2014-12-02 14:40:26 -0800
   Commit: 2d4f6e7

   [SPARK-4695][SQL] Get result using executeCollect
   wangfei <wangfei1@huawei.com>
   2014-12-02 14:30:44 -0800
   Commit: 3ae0cda, github.com/apache/spark/pull/3547

   [SPARK-4670] [SQL] wrong symbol for bitwise not
   Daoyuan Wang <daoyuan.wang@intel.com>
   2014-12-02 14:25:12 -0800
   Commit: 1f5ddf1, github.com/apache/spark/pull/3528

   [SPARK-4593][SQL] Return null when denominator is 0
   Daoyuan Wang <daoyuan.wang@intel.com>
   2014-12-02 14:21:12 -0800
   Commit: f6df609, github.com/apache/spark/pull/3443

   [SPARK-4676][SQL] JavaSchemaRDD.schema may throw NullType MatchError if sql has null
   YanTangZhai <hakeemzhai@tencent.com>, yantangzhai <tyz0303@163.com>, Michael Armbrust <michael@databricks.com>
   2014-12-02 14:12:48 -0800
   Commit: 1066427, github.com/apache/spark/pull/3538

   [SPARK-4663][sql]add finally to avoid resource leak
   baishuo <vc_java@hotmail.com>
   2014-12-02 12:12:03 -0800
   Commit: 69b6fed, github.com/apache/spark/pull/3526

   [SPARK-4536][SQL] Add sqrt and abs to Spark SQL DSL
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2014-12-02 12:07:52 -0800
   Commit: e75e04f, github.com/apache/spark/pull/3401

   Indent license header properly for interfaces.scala.
   Reynold Xin <rxin@databricks.com>
   2014-12-02 11:59:15 -0800
   Commit: b1f8fe3, github.com/apache/spark/pull/3552

   [SPARK-4686] Link to allowed master URLs is broken
   Kay Ousterhout <kayousterhout@gmail.com>
   2014-12-02 09:06:02 -0800
   Commit: d9a148b, github.com/apache/spark/pull/3542

   [SPARK-4397][Core] Cleanup 'import SparkContext._' in core
   zsxwing <zsxwing@gmail.com>
   2014-12-02 00:18:41 -0800
   Commit: 6dfe38a, github.com/apache/spark/pull/3530

   [SPARK-4611][MLlib] Implement the efficient vector norm
   DB Tsai <dbtsai@alpinenow.com>
   2014-12-02 11:40:43 +0800
   Commit: 64f3175, github.com/apache/spark/pull/3462

   MAINTENANCE: Automated closing of pull requests.
   Patrick Wendell <pwendell@gmail.com>
   2014-12-01 17:27:14 -0800
   Commit: b0a46d8, github.com/apache/spark/pull/1612

   [SPARK-4268][SQL] Use #::: to get benefit from Stream in SqlLexical.allCaseVersions
   zsxwing <zsxwing@gmail.com>
   2014-12-01 16:39:54 -0800
   Commit: d3e02dd, github.com/apache/spark/pull/3132

   [SPARK-4529] [SQL] support view with column alias
   Daoyuan Wang <daoyuan.wang@intel.com>
   2014-12-01 16:08:51 -0800
   Commit: 4df60a8, github.com/apache/spark/pull/3396

   [SQL][DOC] Date type in SQL programming guide
   Daoyuan Wang <daoyuan.wang@intel.com>
   2014-12-01 14:03:57 -0800
   Commit: 5edbcbf, github.com/apache/spark/pull/3535

   [SQL] Minor fix for doc and comment
   wangfei <wangfei1@huawei.com>
   2014-12-01 14:02:02 -0800
   Commit: 7b79957, github.com/apache/spark/pull/3533

   [SPARK-4658][SQL] Code documentation issue in DDL of datasource API
   ravipesala <ravindra.pesala@huawei.com>
   2014-12-01 13:31:27 -0800
   Commit: bc35381, github.com/apache/spark/pull/3516

   [SPARK-4650][SQL] Supporting multi column support in countDistinct function like count(distinct c1,c2..) in Spark SQL
   ravipesala <ravindra.pesala@huawei.com>, Michael Armbrust <michael@databricks.com>
   2014-12-01 13:26:44 -0800
   Commit: 6a9ff19, github.com/apache/spark/pull/3511

   [SPARK-4358][SQL] Let BigDecimal do checking type compatibility
   Liang-Chi Hsieh <viirya@gmail.com>
   2014-12-01 13:17:56 -0800
   Commit: b57365a, github.com/apache/spark/pull/3208

   [SQL] add @group tab in limit() and count()
   Jacky Li <jacky.likun@gmail.com>
   2014-12-01 13:12:30 -0800
   Commit: bafee67, github.com/apache/spark/pull/3458

   [SPARK-4258][SQL][DOC] Documents spark.sql.parquet.filterPushdown
   Cheng Lian <lian@databricks.com>
   2014-12-01 13:09:51 -0800
   Commit: 5db8dca, github.com/apache/spark/pull/3440

   Documentation: add description for repartitionAndSortWithinPartitions
   Madhu Siddalingaiah <madhu@madhu.com>
   2014-12-01 08:45:34 -0800
   Commit: 2b233f5, github.com/apache/spark/pull/3390

   [SPARK-4661][Core] Minor code and docs cleanup
   zsxwing <zsxwing@gmail.com>
   2014-12-01 00:35:01 -0800
   Commit: 30a86ac, github.com/apache/spark/pull/3521

   [SPARK-4664][Core] Throw an exception when spark.akka.frameSize > 2047
   zsxwing <zsxwing@gmail.com>
   2014-12-01 00:32:54 -0800
   Commit: 1d238f2, github.com/apache/spark/pull/3527

   SPARK-2192 [BUILD] Examples Data Not in Binary Distribution
   Sean Owen <sowen@cloudera.com>
   2014-12-01 16:31:04 +0800
   Commit: 6384f42, github.com/apache/spark/pull/3480

   Fix wrong file name pattern in .gitignore
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2014-12-01 00:29:28 -0800
   Commit: 97eb6d7, github.com/apache/spark/pull/3529

   [SPARK-4632] version update
   Prabeesh K <prabsmails@gmail.com>
   2014-11-30 20:51:53 -0800
   Commit: 5e7a6dc, github.com/apache/spark/pull/3495

   MAINTENANCE: Automated closing of pull requests.
   Patrick Wendell <pwendell@gmail.com>
   2014-11-30 20:51:13 -0800
   Commit: 06dc1b1, github.com/apache/spark/pull/2915

   [DOC] Fixes formatting typo in SQL programming guide
   Cheng Lian <lian@databricks.com>
   2014-11-30 19:04:07 -0800
   Commit: 2a4d389, github.com/apache/spark/pull/3498

   [SPARK-4656][Doc] Typo in Programming Guide markdown
   lewuathe <lewuathe@me.com>
   2014-11-30 17:18:50 -0800
   Commit: a217ec5, github.com/apache/spark/pull/3412

   [SPARK-4623] Add some error information if using spark-sql in yarn-cluster mode
   carlmartin <carlmartinmax@gmail.com>, huangzhaowei <carlmartinmax@gmail.com>
   2014-11-30 16:19:41 -0800
   Commit: aea7a99, github.com/apache/spark/pull/3479

   SPARK-2143 [WEB UI] Add Spark version to UI footer
   Sean Owen <sowen@cloudera.com>
   2014-11-30 11:40:08 -0800
   Commit: 048ecca, github.com/apache/spark/pull/3410

   [DOCS][BUILD] Add instruction to use change-version-to-2.11.sh in 'Building for Scala 2.11'.
   Takuya UESHIN <ueshin@happy-camper.st>
   2014-11-30 00:10:31 -0500
   Commit: 0fcd24c, github.com/apache/spark/pull/3361

   SPARK-4507: PR merge script should support closing multiple JIRA tickets
   Takayuki Hasegawa <takayuki.hasegawa0311@gmail.com>
   2014-11-29 23:12:10 -0500
   Commit: 4316a7b, github.com/apache/spark/pull/3428

   [SPARK-4505][Core] Add a ClassTag parameter to CompactBuffer[T]
   zsxwing <zsxwing@gmail.com>
   2014-11-29 20:23:08 -0500
   Commit: c062224, github.com/apache/spark/pull/3378

   [SPARK-4057] Use -agentlib instead of -Xdebug in sbt-launch-lib.bash for debugging
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2014-11-29 20:14:14 -0500
   Commit: 938dc14, github.com/apache/spark/pull/2904

   Include the key name when failing on an invalid value.
   Stephen Haberman <stephen@exigencecorp.com>
   2014-11-29 20:12:05 -0500
   Commit: 95290bf, github.com/apache/spark/pull/3514

   [SPARK-3398] [SPARK-4325] [EC2] Use EC2 status checks.
   Nicholas Chammas <nicholas.chammas@gmail.com>
   2014-11-29 00:31:06 -0800
   Commit: 317e114, github.com/apache/spark/pull/3195

   MAINTENANCE: Automated closing of pull requests.
   Patrick Wendell <pwendell@gmail.com>
   2014-11-29 00:24:35 -0500
   Commit: 047ff57, github.com/apache/spark/pull/3451

   [SPARK-4597] Use proper exception and reset variable in Utils.createTempDir()
   Liang-Chi Hsieh <viirya@gmail.com>
   2014-11-28 18:04:05 -0800
   Commit: 49fe879, github.com/apache/spark/pull/3449

   SPARK-1450 [EC2] Specify the default zone in the EC2 script help
   Sean Owen <sowen@cloudera.com>
   2014-11-28 17:43:38 -0500
   Commit: 48223d8, github.com/apache/spark/pull/3454

   [SPARK-4584] [yarn] Remove security manager from Yarn AM.
   Marcelo Vanzin <vanzin@cloudera.com>
   2014-11-28 15:15:30 -0500
   Commit: 915f8ee, github.com/apache/spark/pull/3484

   [SPARK-4193][BUILD] Disable doclint in Java 8 to prevent build errors.
   Takuya UESHIN <ueshin@happy-camper.st>
   2014-11-28 13:00:15 -0500
   Commit: e464f0a, github.com/apache/spark/pull/3058

   [SPARK-4643] [Build] Remove unneeded staging repositories from build
   Daoyuan Wang <daoyuan.wang@intel.com>
   2014-11-28 12:41:38 -0500
   Commit: 53ed7f1, github.com/apache/spark/pull/3504

   Delete unnecessary function
   KaiXinXiaoLei <huleilei1@huawei.com>
   2014-11-28 12:34:07 -0500
   Commit: 052e658, github.com/apache/spark/pull/3224

   [SPARK-4645][SQL] Disables asynchronous execution in Hive 0.13.1 HiveThriftServer2
   Cheng Lian <lian@databricks.com>
   2014-11-28 11:42:40 -0500
   Commit: 5b99bf2, github.com/apache/spark/pull/3506

   [SPARK-4619][Storage] delete redundant time suffix
   maji2014 <maji3@asiainfo.com>
   2014-11-28 00:36:22 -0800
   Commit: ceb6281, github.com/apache/spark/pull/3475

   [SPARK-4613][Core] Java API for JdbcRDD
   Cheng Lian <lian@databricks.com>
   2014-11-27 18:01:14 -0800
   Commit: 120a350, github.com/apache/spark/pull/3478

   [SPARK-4626] Kill a task only if the executorId is (still) registered with the scheduler
   roxchkplusony <roxchkplusony@gmail.com>
   2014-11-27 15:54:40 -0800
   Commit: 84376d3, github.com/apache/spark/pull/3483

   SPARK-4170 [CORE] Closure problems when running Scala app that "extends App"
   Sean Owen <sowen@cloudera.com>
   2014-11-27 09:03:17 -0800
   Commit: 5d7fe17, github.com/apache/spark/pull/3497

   [Release] Automate generation of contributors list
   Andrew Or <andrew@databricks.com>
   2014-11-26 23:16:23 -0800
   Commit: c86e9bc

   [SPARK-732][SPARK-3628][CORE][RESUBMIT] eliminate duplicate update on accumulator
   CodingCat <zhunansjtu@gmail.com>
   2014-11-26 16:52:04 -0800
   Commit: 5af53ad, github.com/apache/spark/pull/2524

   [SPARK-4614][MLLIB] Slight API changes in Matrix and Matrices
   Xiangrui Meng <meng@databricks.com>
   2014-11-26 08:22:50 -0800
   Commit: 561d31d, github.com/apache/spark/pull/3468

   Removing confusing TripletFields
   Joseph E. Gonzalez <joseph.e.gonzalez@gmail.com>
   2014-11-26 00:55:28 -0800
   Commit: 288ce58, github.com/apache/spark/pull/3472

   [SPARK-4612] Reduce task latency and increase scheduling throughput by making configuration initialization lazy
   Tathagata Das <tathagata.das1565@gmail.com>
   2014-11-25 23:15:58 -0800
   Commit: e7f4d25, github.com/apache/spark/pull/3463

   [SPARK-4516] Avoid allocating Netty PooledByteBufAllocators unnecessarily
   Aaron Davidson <aaron@databricks.com>
   2014-11-26 00:32:45 -0500
   Commit: 346bc17, github.com/apache/spark/pull/3465

   [SPARK-4516] Cap default number of Netty threads at 8
   Aaron Davidson <aaron@databricks.com>
   2014-11-25 23:57:04 -0500
   Commit: f5f2d27, github.com/apache/spark/pull/3469

   [SPARK-4604][MLLIB] make MatrixFactorizationModel public
   Xiangrui Meng <meng@databricks.com>
   2014-11-25 20:11:40 -0800
   Commit: b5fb141, github.com/apache/spark/pull/3459

   [HOTFIX]: Adding back without-hive dist
   Patrick Wendell <pwendell@gmail.com>
   2014-11-25 23:10:19 -0500
   Commit: 4d95526

   [SPARK-4583] [mllib] LogLoss for GradientBoostedTrees fix + doc updates
   Joseph K. Bradley <joseph@databricks.com>
   2014-11-25 20:10:15 -0800
   Commit: c251fd7, github.com/apache/spark/pull/3439

   [Spark-4509] Revert EC2 tag-based cluster membership patch
   Xiangrui Meng <meng@databricks.com>
   2014-11-25 16:07:09 -0800
   Commit: 7eba0fb, github.com/apache/spark/pull/3453

   Fix SPARK-4471: blockManagerIdFromJson function throws exception while B...
   hushan[胡珊] <hushan@xiaomi.com>
   2014-11-25 15:51:08 -0800
   Commit: 9bdf5da, github.com/apache/spark/pull/3340

   [SPARK-4546] Improve HistoryServer first time user experience
   Andrew Or <andrew@databricks.com>
   2014-11-25 15:48:02 -0800
   Commit: 9afcbe4, github.com/apache/spark/pull/3411

   [SPARK-4592] Avoid duplicate worker registrations in standalone mode
   Andrew Or <andrew@databricks.com>
   2014-11-25 15:46:26 -0800
   Commit: 1b2ab1c, github.com/apache/spark/pull/3447

   [SPARK-4196][SPARK-4602][Streaming] Fix serialization issue in PairDStreamFunctions.saveAsNewAPIHadoopFiles
   Tathagata Das <tathagata.das1565@gmail.com>
   2014-11-25 14:16:27 -0800
   Commit: 8838ad7, github.com/apache/spark/pull/3457

   [SPARK-4581][MLlib] Refactorize StandardScaler to improve the transformation performance
   DB Tsai <dbtsai@alpinenow.com>
   2014-11-25 11:07:11 -0800
   Commit: bf1a6aa, github.com/apache/spark/pull/3435

   [SPARK-4601][Streaming] Set correct call site for streaming jobs so that it is displayed correctly on the Spark UI
   Tathagata Das <tathagata.das1565@gmail.com>
   2014-11-25 06:50:36 -0800
   Commit: 69cd53e, github.com/apache/spark/pull/3455

   [SPARK-4344][DOCS] adding documentation on spark.yarn.user.classpath.first
   arahuja <aahuja11@gmail.com>
   2014-11-25 08:23:41 -0600
   Commit: d240760, github.com/apache/spark/pull/3209

   [SPARK-4381][Streaming] Add warning log when user sets spark.master to local in Spark Streaming and there's no job executed
   jerryshao <saisai.shao@intel.com>
   2014-11-25 05:36:29 -0800
   Commit: fef27b2, github.com/apache/spark/pull/3244

   [SPARK-4535][Streaming] Fix the error in comments
   q00251598 <qiyadong@huawei.com>
   2014-11-25 04:01:56 -0800
   Commit: a51118a, github.com/apache/spark/pull/3400

   [SPARK-4526][MLLIB] GradientDescent gets a wrong gradient value according to the gradient formula.
   GuoQiang Li <witgo@qq.com>
   2014-11-25 02:01:19 -0800
   Commit: f515f94, github.com/apache/spark/pull/3399

   [SPARK-4596][MLLib] Refactorize Normalizer to make code cleaner
   DB Tsai <dbtsai@alpinenow.com>
   2014-11-25 01:57:34 -0800
   Commit: 89f9122, github.com/apache/spark/pull/3446

   [DOC][Build] Wrong command for building Spark with Apache Hadoop 2.4.X and Hive 12
   wangfei <wangfei1@huawei.com>
   2014-11-24 22:32:39 -0800
   Commit: 0fe54cf, github.com/apache/spark/pull/3335

   [SQL] Compute timeTaken correctly
   w00228970 <wangfei1@huawei.com>
   2014-11-24 21:17:24 -0800
   Commit: 723be60, github.com/apache/spark/pull/3423

   [SPARK-4582][MLLIB] get raw vectors for further processing in Word2Vec
   tkaessmann <tobias.kaessmanns24.com>, tkaessmann <tobias.kaessmann@s24.com>
   2014-11-24 19:58:01 -0800
   Commit: 9ce2bf3, github.com/apache/spark/pull/3309

   [SPARK-4525] Mesos should decline unused offers
   Patrick Wendell <pwendell@gmail.com>, Jongyoul Lee <jongyoul@gmail.com>
   2014-11-24 19:14:14 -0800
   Commit: f0afb62, github.com/apache/spark/pull/3436

   Revert "[SPARK-4525] Mesos should decline unused offers"
   Patrick Wendell <pwendell@gmail.com>
   2014-11-24 19:16:53 -0800
   Commit: a68d442

   [SPARK-4525] Mesos should decline unused offers
   Patrick Wendell <pwendell@gmail.com>, Jongyoul Lee <jongyoul@gmail.com>
   2014-11-24 19:14:14 -0800
   Commit: b043c27, github.com/apache/spark/pull/3436

   [SPARK-4266] [Web-UI] Reduce stage page load time.
   Kay Ousterhout <kayousterhout@gmail.com>
   2014-11-24 18:03:10 -0800
   Commit: d24d5bf, github.com/apache/spark/pull/3328

   [SPARK-4548] [SPARK-4517] improve performance of python broadcast
   Davies Liu <davies@databricks.com>
   2014-11-24 17:17:03 -0800
   Commit: 6cf5076, github.com/apache/spark/pull/3417

   [SPARK-4578] fix asDict() with nested Row()
   Davies Liu <davies@databricks.com>
   2014-11-24 16:41:23 -0800
   Commit: 050616b, github.com/apache/spark/pull/3434

   [SPARK-4562] [MLlib] speedup vector
   Davies Liu <davies@databricks.com>
   2014-11-24 16:37:14 -0800
   Commit: b660de7, github.com/apache/spark/pull/3420

   [SPARK-4518][SPARK-4519][Streaming] Refactored file stream to prevent files from being processed multiple times
   Tathagata Das <tathagata.das1565@gmail.com>
   2014-11-24 13:50:20 -0800
   Commit: cb0e9b0, github.com/apache/spark/pull/3419

   [SPARK-4145] Web UI job pages
   Josh Rosen <joshrosen@databricks.com>
   2014-11-24 13:18:14 -0800
   Commit: 4a90276, github.com/apache/spark/pull/3009

   [SPARK-4487][SQL] Fix attribute reference resolution error when using ORDER BY.
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2014-11-24 12:54:37 -0800
   Commit: dd1c9cb, github.com/apache/spark/pull/3363

   [SQL] Fix path in HiveFromSpark
   scwf <wangfei1@huawei.com>
   2014-11-24 12:49:08 -0800
   Commit: b384119, github.com/apache/spark/pull/3415

   [SQL] Fix comment in HiveShim
   Daniel Darabos <darabos.daniel@gmail.com>
   2014-11-24 12:45:07 -0800
   Commit: d5834f0, github.com/apache/spark/pull/3432

   [SPARK-4479][SQL] Avoids unnecessary defensive copies when sort based shuffle is on
   Cheng Lian <lian@databricks.com>
   2014-11-24 12:43:45 -0800
   Commit: a6d7b61, github.com/apache/spark/pull/3422

   SPARK-4457. Document how to build for Hadoop versions greater than 2.4
   Sandy Ryza <sandy@cloudera.com>
   2014-11-24 13:28:48 -0600
   Commit: 29372b6, github.com/apache/spark/pull/3322

   [SPARK-4377] Fixed serialization issue by switching to akka provided serializer.
   Prashant Sharma <prashant.s@imaginea.com>
   2014-11-22 14:05:38 -0800
   Commit: 9b2a3c6, github.com/apache/spark/pull/3402

   [SPARK-4431][MLlib] Implement efficient foreachActive for dense and sparse vector
   DB Tsai <dbtsai@alpinenow.com>
   2014-11-21 18:15:07 -0800
   Commit: b5d17ef, github.com/apache/spark/pull/3288

   [SPARK-4531] [MLlib] cache serialized java object
   Davies Liu <davies@databricks.com>
   2014-11-21 15:02:31 -0800
   Commit: ce95bd8, github.com/apache/spark/pull/3397

   SPARK-4532: Fix bug in detection of Hive in Spark 1.2
   Patrick Wendell <pwendell@gmail.com>
   2014-11-21 12:10:04 -0800
   Commit: a81918c, github.com/apache/spark/pull/3398

   [SPARK-4397][Core] Reorganize 'implicit's to improve the API convenience
   zsxwing <zsxwing@gmail.com>
   2014-11-21 10:06:30 -0800
   Commit: 65b987c, github.com/apache/spark/pull/3262

   [SPARK-4472][Shell] Print "Spark context available as sc." only when SparkContext is created...
   zsxwing <zsxwing@gmail.com>
   2014-11-21 00:42:43 -0800
   Commit: f1069b8, github.com/apache/spark/pull/3341

   [Doc][GraphX] Remove unused png files.
   Reynold Xin <rxin@databricks.com>
   2014-11-21 00:30:58 -0800
   Commit: 28fdc6f

   [Doc][GraphX] Remove Motivation section and did some minor update.
   Reynold Xin <rxin@databricks.com>
   2014-11-21 00:29:02 -0800
   Commit: b97070e

   [SPARK-4522][SQL] Parse schema with missing metadata.
   Michael Armbrust <michael@databricks.com>
   2014-11-20 20:34:43 -0800
   Commit: 90a6a46, github.com/apache/spark/pull/3392

   add Sphinx as a dependency of building docs
   Davies Liu <davies@databricks.com>
   2014-11-20 19:12:45 -0800
   Commit: 8cd6eea, github.com/apache/spark/pull/3388

   [SPARK-4413][SQL] Parquet support through datasource API
   Michael Armbrust <michael@databricks.com>
   2014-11-20 18:31:02 -0800
   Commit: 02ec058, github.com/apache/spark/pull/3269

   [SPARK-4244] [SQL] Support Hive Generic UDFs with constant object inspector parameters
   Cheng Hao <hao.cheng@intel.com>
   2014-11-20 16:50:59 -0800
   Commit: 84d79ee, github.com/apache/spark/pull/3109

   [SPARK-4477] [PySpark] remove numpy from RDDSampler
   Davies Liu <davies@databricks.com>, Xiangrui Meng <meng@databricks.com>
   2014-11-20 16:40:25 -0800
   Commit: d39f2e9, github.com/apache/spark/pull/3351

   [SQL] fix function description mistake
   Jacky Li <jacky.likun@gmail.com>
   2014-11-20 15:48:36 -0800
   Commit: ad5f1f3, github.com/apache/spark/pull/3344

   [SPARK-2918] [SQL] Support the CTAS in EXPLAIN command
   Cheng Hao <hao.cheng@intel.com>
   2014-11-20 15:46:00 -0800
   Commit: 6aa0fc9, github.com/apache/spark/pull/3357

   [SPARK-4318][SQL] Fix empty sum distinct.
   Takuya UESHIN <ueshin@happy-camper.st>
   2014-11-20 15:41:24 -0800
   Commit: 2c2e7a4, github.com/apache/spark/pull/3184

   [SPARK-4513][SQL] Support relational operator '<=>' in Spark SQL
   ravipesala <ravindra.pesala@huawei.com>
   2014-11-20 15:34:03 -0800
   Commit: 98e9419, github.com/apache/spark/pull/3387

   [SPARK-4439] [MLlib] add python api for random forest
   Davies Liu <davies@databricks.com>
   2014-11-20 15:31:28 -0800
   Commit: 1c53a5d, github.com/apache/spark/pull/3320

   [SPARK-4228][SQL] SchemaRDD to JSON
   Dan McClary <dan.mcclary@gmail.com>
   2014-11-20 13:36:50 -0800
   Commit: b8e6886, github.com/apache/spark/pull/3213

   [SPARK-3938][SQL] Names in-memory columnar RDD with corresponding table name
   Cheng Lian <lian@databricks.com>
   2014-11-20 13:12:24 -0800
   Commit: abf2918, github.com/apache/spark/pull/3383

   [SPARK-4486][MLLIB] Improve GradientBoosting APIs and doc
   Xiangrui Meng <meng@databricks.com>
   2014-11-20 00:48:59 -0800
   Commit: 15cacc8, github.com/apache/spark/pull/3374

   [SPARK-4446] [SPARK CORE]
   Leolh <leosandylh@gmail.com>
   2014-11-19 18:18:55 -0800
   Commit: e216ffa, github.com/apache/spark/pull/3306

   [SPARK-4480] Avoid many small spills in external data structures
   Andrew Or <andrew@databricks.com>
   2014-11-19 18:07:27 -0800
   Commit: 0eb4a7f, github.com/apache/spark/pull/3353

   [Spark-4484] Treat maxResultSize as unlimited when set to 0; improve error message
   Nishkam Ravi <nravi@cloudera.com>, nravi <nravi@c1704.halxg.cloudera.com>, nishkamravi2 <nishkamravi@gmail.com>
   2014-11-19 17:23:42 -0800
   Commit: 73fedf5, github.com/apache/spark/pull/3360

   [SPARK-4478] Keep totalRegisteredExecutors up-to-date
   Akshat Aranya <aaranya@quantcast.com>
   2014-11-19 17:20:20 -0800
   Commit: 9ccc53c, github.com/apache/spark/pull/3373

   Updating GraphX programming guide and documentation
   Joseph E. Gonzalez <joseph.e.gonzalez@gmail.com>
   2014-11-19 16:53:33 -0800
   Commit: 377b068, github.com/apache/spark/pull/3359

   [SPARK-4495] Fix memory leak in JobProgressListener
   Josh Rosen <joshrosen@databricks.com>
   2014-11-19 16:50:21 -0800
   Commit: 04d462f, github.com/apache/spark/pull/3372

   [SPARK-4294][Streaming] UnionDStream stream should express the requirements in the same way as TransformedDStream
   Yadong Qi <qiyadong2010@gmail.com>
   2014-11-19 15:53:06 -0800
   Commit: c3002c4, github.com/apache/spark/pull/3152

   [SPARK-4384] [PySpark] improve sort spilling
   Davies Liu <davies@databricks.com>
   2014-11-19 15:45:37 -0800
   Commit: 73c8ea8, github.com/apache/spark/pull/3252

   [SPARK-4429][BUILD] Build for Scala 2.11 using sbt fails.
   Takuya UESHIN <ueshin@happy-camper.st>
   2014-11-19 14:40:21 -0800
   Commit: f9adda9, github.com/apache/spark/pull/3342

   [DOC][PySpark][Streaming] Fix docstring for sphinx
   Ken Takagiwa <ugw.gi.world@gmail.com>
   2014-11-19 14:23:18 -0800
   Commit: 9b7bbce, github.com/apache/spark/pull/3311

   SPARK-3962 Marked scope as provided for external projects.
   Prashant Sharma <prashant.s@imaginea.com>, Prashant Sharma <scrapcodes@gmail.com>
   2014-11-19 14:18:10 -0800
   Commit: 1c93841, github.com/apache/spark/pull/2959

   [HOT FIX] MiMa tests are broken
   Andrew Or <andrew@databricks.com>
   2014-11-19 14:03:44 -0800
   Commit: 0df02ca, github.com/apache/spark/pull/3371

   [SPARK-4481][Streaming][Doc] Fix the wrong description of updateFunc
   zsxwing <zsxwing@gmail.com>
   2014-11-19 13:17:15 -0800
   Commit: 3bf7cee, github.com/apache/spark/pull/3356

   [SPARK-4482][Streaming] Disable ReceivedBlockTracker's write ahead log by default
   Tathagata Das <tathagata.das1565@gmail.com>
   2014-11-19 13:06:48 -0800
   Commit: 22fc4e7, github.com/apache/spark/pull/3358

   [SPARK-4470] Validate number of threads in local mode
   Kenichi Maehashi <webmaster@kenichimaehashi.com>
   2014-11-19 12:11:09 -0800
   Commit: eacc788, github.com/apache/spark/pull/3337

   [SPARK-4467] fix elements read count for ExternalSorter
   Tianshuo Deng <tdeng@twitter.com>
   2014-11-19 10:01:09 -0800
   Commit: d75579d, github.com/apache/spark/pull/3302

   SPARK-4455 Exclude dependency on hbase-annotations module
   tedyu <yuzhihong@gmail.com>
   2014-11-19 00:55:39 -0800
   Commit: 5f5ac2d, github.com/apache/spark/pull/3286

   MAINTENANCE: Automated closing of pull requests.
   Patrick Wendell <pwendell@gmail.com>
   2014-11-19 00:27:31 -0800
   Commit: 8327df6, github.com/apache/spark/pull/2777

   [Spark-4432] close InStream after the block is accessed
   Mingfei <mingfei.shi@intel.com>
   2014-11-18 22:17:06 -0800
   Commit: 165cec9, github.com/apache/spark/pull/3290

   [SPARK-4441] Close Tachyon client when TachyonBlockManager is shutdown
   Mingfei <mingfei.shi@intel.com>
   2014-11-18 22:16:36 -0800
   Commit: 67e9876, github.com/apache/spark/pull/3299

   Bumping version to 1.3.0-SNAPSHOT.
   Marcelo Vanzin <vanzin@cloudera.com>
   2014-11-18 21:24:18 -0800
   Commit: 397d3aa, github.com/apache/spark/pull/3277

   [SPARK-4468][SQL] Fixes Parquet filter creation for inequality predicates with literals on the left hand side
   Cheng Lian <lian@databricks.com>
   2014-11-18 17:41:54 -0800
   Commit: 423baea, github.com/apache/spark/pull/3334

   [SPARK-4327] [PySpark] Python API for RDD.randomSplit()
   Davies Liu <davies@databricks.com>
   2014-11-18 16:37:35 -0800
   Commit: 7f22fa8, github.com/apache/spark/pull/3193

   [SPARK-4433] fix a race condition in zipWithIndex
   Xiangrui Meng <meng@databricks.com>
   2014-11-18 16:25:44 -0800
   Commit: bb46046, github.com/apache/spark/pull/3291

   [SPARK-3721] [PySpark] broadcast objects larger than 2G
   Davies Liu <davies@databricks.com>, Davies Liu <davies.liu@gmail.com>
   2014-11-18 16:17:51 -0800
   Commit: 4a377af, github.com/apache/spark/pull/2659

   [SPARK-4306] [MLlib] Python API for LogisticRegressionWithLBFGS
   Davies Liu <davies@databricks.com>
   2014-11-18 15:57:33 -0800
   Commit: d2e2951, github.com/apache/spark/pull/3307

   [SPARK-4463] Add (de)select all button for add'l metrics.
   Kay Ousterhout <kayousterhout@gmail.com>
   2014-11-18 15:01:06 -0800
   Commit: 010bc86, github.com/apache/spark/pull/3331

   [SPARK-4017] show progress bar in console
   Davies Liu <davies@databricks.com>
   2014-11-18 13:37:21 -0800
   Commit: e34f38f, github.com/apache/spark/pull/3029

   [SPARK-4404] remove sys.exit() in shutdown hook
   Davies Liu <davies@databricks.com>
   2014-11-18 13:11:38 -0800
   Commit: 80f3177, github.com/apache/spark/pull/3289

   [SPARK-4075][SPARK-4434] Fix the URI validation logic for Application Jar name.
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2014-11-18 12:17:33 -0800
   Commit: bfebfd8, github.com/apache/spark/pull/3326

   [SQL] Support partitioned parquet tables that have the key in both the directory and the file
   Michael Armbrust <michael@databricks.com>
   2014-11-18 12:13:23 -0800
   Commit: 90d72ec, github.com/apache/spark/pull/3272

   [SPARK-4396] allow lookup by index in Python's Rating
   Xiangrui Meng <meng@databricks.com>
   2014-11-18 10:35:29 -0800
   Commit: b54c6ab, github.com/apache/spark/pull/3261

   [SPARK-4435] [MLlib] [PySpark] improve classification
   Davies Liu <davies@databricks.com>
   2014-11-18 10:11:13 -0800
   Commit: 8fbf72b, github.com/apache/spark/pull/3305

   ALS implicit: added missing parameter alpha in doc string
   Felix Maximilian Möller <felixmaximilian.moeller@immobilienscout24.de>
   2014-11-18 10:08:24 -0800
   Commit: cedc3b5, github.com/apache/spark/pull/3343

   SPARK-4466: Provide support for publishing Scala 2.11 artifacts to Maven
   Patrick Wendell <pwendell@gmail.com>
   2014-11-17 21:07:50 -0800
   Commit: c6e0c2a, github.com/apache/spark/pull/3332

   [SPARK-4453][SPARK-4213][SQL] Simplifies Parquet filter generation code
   Cheng Lian <lian@databricks.com>
   2014-11-17 16:55:12 -0800
   Commit: 36b0956, github.com/apache/spark/pull/3317

   [SPARK-4448] [SQL] unwrap for the ConstantObjectInspector
   Cheng Hao <hao.cheng@intel.com>
   2014-11-17 16:35:49 -0800
   Commit: ef7c464, github.com/apache/spark/pull/3308

   [SPARK-4443][SQL] Fix statistics for external table in spark sql hive
   w00228970 <wangfei1@huawei.com>
   2014-11-17 16:33:50 -0800
   Commit: 42389b1, github.com/apache/spark/pull/3304

   [SPARK-4309][SPARK-4407][SQL] Date type support for Thrift server, and fixes for complex types
   Cheng Lian <lian@databricks.com>
   2014-11-17 16:31:05 -0800
   Commit: 6b7f2f7, github.com/apache/spark/pull/3298

   [SQL] Construct the MutableRow from an Array
   Cheng Hao <hao.cheng@intel.com>
   2014-11-17 16:29:52 -0800
   Commit: 69e858c, github.com/apache/spark/pull/3217

   [SPARK-4425][SQL] Handle NaN or Infinity cast to Timestamp correctly.
   Takuya UESHIN <ueshin@happy-camper.st>
   2014-11-17 16:28:07 -0800
   Commit: 566c791, github.com/apache/spark/pull/3283

   [SPARK-4420][SQL] Change nullability of Cast from DoubleType/FloatType to DecimalType.
   Takuya UESHIN <ueshin@happy-camper.st>
   2014-11-17 16:26:48 -0800
   Commit: 3a81a1c, github.com/apache/spark/pull/3278

   [SQL] Makes conjunction pushdown more aggressive for in-memory table
   Cheng Lian <lian@databricks.com>
   2014-11-17 15:33:13 -0800
   Commit: 5ce7dae, github.com/apache/spark/pull/3318

   [SPARK-4180] [Core] Prevent creation of multiple active SparkContexts
   Josh Rosen <joshrosen@databricks.com>
   2014-11-17 12:48:18 -0800
   Commit: 0f3ceb5, github.com/apache/spark/pull/3121

   [DOCS][SQL] Fix broken link to Row class scaladoc
   Andy Konwinski <andykonwinski@gmail.com>
   2014-11-17 11:52:23 -0800
   Commit: cec1116, github.com/apache/spark/pull/3323

   Revert "[SPARK-4075] [Deploy] Jar url validation is not enough for Jar file"
   Andrew Or <andrew@databricks.com>
   2014-11-17 11:24:28 -0800
   Commit: dbb9da5

   [SPARK-4444] Drop VD type parameter from EdgeRDD
   Ankur Dave <ankurdave@gmail.com>
   2014-11-17 11:06:31 -0800
   Commit: 9ac2bb1, github.com/apache/spark/pull/3303

   SPARK-2811 upgrade algebird to 0.8.1
   Adam Pingel <adam@axle-lang.org>
   2014-11-17 10:47:29 -0800
   Commit: e7690ed, github.com/apache/spark/pull/3282

   SPARK-4445, Don't display storage level in toDebugString unless RDD is persisted.
   Prashant Sharma <prashant.s@imaginea.com>
   2014-11-17 10:40:33 -0800
   Commit: 5c92d47, github.com/apache/spark/pull/3310

   [SPARK-4410][SQL] Add support for external sort
   Michael Armbrust <michael@databricks.com>
   2014-11-16 21:55:57 -0800
   Commit: 64c6b9b, github.com/apache/spark/pull/3268

   [SPARK-4422][MLLIB] In some cases, Vectors.fromBreeze gets wrong results.
   GuoQiang Li <witgo@qq.com>
   2014-11-16 21:31:51 -0800
   Commit: 5168c6c, github.com/apache/spark/pull/3281

   Revert "[SPARK-4309][SPARK-4407][SQL] Date type support for Thrift server, and fixes for complex types"
   Michael Armbrust <michael@databricks.com>
   2014-11-16 15:05:04 -0800
   Commit: 45ce327, github.com/apache/spark/pull/3292

   [SPARK-4309][SPARK-4407][SQL] Date type support for Thrift server, and fixes for complex types
   Cheng Lian <lian@databricks.com>
   2014-11-16 14:26:41 -0800
   Commit: cb6bd83, github.com/apache/spark/pull/3178

   [SPARK-4393] Fix memory leak in ConnectionManager ACK timeout TimerTasks; use HashedWheelTimer
   Josh Rosen <joshrosen@databricks.com>
   2014-11-16 00:44:15 -0800
   Commit: 7850e0c, github.com/apache/spark/pull/3259

   [SPARK-4426][SQL][Minor] The symbol of BitwiseOr is wrong, should not be '&'
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2014-11-15 22:23:47 -0800
   Commit: 84468b2, github.com/apache/spark/pull/3284

   [SPARK-4419] Upgrade snappy-java to 1.1.1.6
   Josh Rosen <joshrosen@databricks.com>
   2014-11-15 22:22:34 -0800
   Commit: 7d8e152, github.com/apache/spark/pull/3287

   [SPARK-2321] Several progress API improvements / refactorings
   Josh Rosen <joshrosen@databricks.com>
   2014-11-14 23:46:25 -0800
   Commit: 40eb8b6, github.com/apache/spark/pull/3197

   Added contains(key) to Metadata
   kai <kaizeng@eecs.berkeley.edu>
   2014-11-14 23:44:23 -0800
   Commit: cbddac2, github.com/apache/spark/pull/3273

   [SPARK-4260] HttpBroadcast should set connection timeout.
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2014-11-14 22:36:56 -0800
   Commit: 60969b0, github.com/apache/spark/pull/3122

   [SPARK-4363][Doc] Update the Broadcast example
   zsxwing <zsxwing@gmail.com>
   2014-11-14 22:28:48 -0800
   Commit: 861223e, github.com/apache/spark/pull/3226

   [SPARK-4379][Core] Change Exception to SparkException in checkpoint
   zsxwing <zsxwing@gmail.com>
   2014-11-14 22:25:41 -0800
   Commit: dba1405, github.com/apache/spark/pull/3241

   [SPARK-4415] [PySpark] JVM should exit after Python exit
   Davies Liu <davies@databricks.com>
   2014-11-14 20:13:46 -0800
   Commit: 7fe08b4, github.com/apache/spark/pull/3274

   [SPARK-4404] SparkSubmitDriverBootstrapper should stop after its SparkSubmit sub-proc...
   WangTao <barneystinson@aliyun.com>, WangTaoTheTonic <barneystinson@aliyun.com>
   2014-11-14 20:11:51 -0800
   Commit: 303a4e4, github.com/apache/spark/pull/3266

   SPARK-4214. With dynamic allocation, avoid outstanding requests for more...
   Sandy Ryza <sandy@cloudera.com>
   2014-11-14 15:51:05 -0800
   Commit: ad42b28, github.com/apache/spark/pull/3204

   [SPARK-4412][SQL] Fix Spark's control of Parquet logging.
   Jim Carroll <jim@dontcallme.com>
   2014-11-14 15:33:21 -0800
   Commit: 37482ce, github.com/apache/spark/pull/3271

   [SPARK-4365][SQL] Remove unnecessary filter call on records returned from parquet library
   Yash Datta <Yash.Datta@guavus.com>
   2014-11-14 15:16:36 -0800
   Commit: 63ca3af, github.com/apache/spark/pull/3229

   [SPARK-4386] Improve performance when writing Parquet files.
   Jim Carroll <jim@dontcallme.com>
   2014-11-14 15:11:53 -0800
   Commit: f76b968, github.com/apache/spark/pull/3254

   [SPARK-4322][SQL] Enables struct fields as sub expressions of grouping fields
   Cheng Lian <lian@databricks.com>
   2014-11-14 15:09:36 -0800
   Commit: 0c7b66b, github.com/apache/spark/pull/3248

   [SQL] Don't shuffle code generated rows
   Michael Armbrust <michael@databricks.com>
   2014-11-14 15:03:23 -0800
   Commit: 4b4b50c, github.com/apache/spark/pull/3263

   [SQL] Minor cleanup of comments, errors and override.
   Michael Armbrust <michael@databricks.com>
   2014-11-14 15:00:42 -0800
   Commit: f805025, github.com/apache/spark/pull/3257

   [SPARK-4391][SQL] Configure parquet filters using SQLConf
   Michael Armbrust <michael@databricks.com>
   2014-11-14 14:59:35 -0800
   Commit: e47c387, github.com/apache/spark/pull/3258

   [SPARK-4390][SQL] Handle NaN cast to decimal correctly
   Michael Armbrust <michael@databricks.com>
   2014-11-14 14:56:57 -0800
   Commit: a0300ea, github.com/apache/spark/pull/3256

   [SPARK-4062][Streaming] Add ReliableKafkaReceiver in Spark Streaming Kafka connector
   jerryshao <saisai.shao@intel.com>, Tathagata Das <tathagata.das1565@gmail.com>, Saisai Shao <saisai.shao@intel.com>
   2014-11-14 14:33:37 -0800
   Commit: 5930f64, github.com/apache/spark/pull/2991

   [SPARK-4333][SQL] Correctly log number of iterations in RuleExecutor
   DoingDone9 <799203320@qq.com>
   2014-11-14 14:28:06 -0800
   Commit: 0cbdb01, github.com/apache/spark/pull/3180

   SPARK-4375. no longer require -Pscala-2.10
   Sandy Ryza <sandy@cloudera.com>
   2014-11-14 14:21:57 -0800
   Commit: f5f757e, github.com/apache/spark/pull/3239

   [SPARK-4245][SQL] Fix containsNull of the result ArrayType of CreateArray expression.
   Takuya UESHIN <ueshin@happy-camper.st>
   2014-11-14 14:21:16 -0800
   Commit: bbd8f5b, github.com/apache/spark/pull/3110

   [SPARK-4239] [SQL] support view in HiveQl
   Daoyuan Wang <daoyuan.wang@intel.com>
   2014-11-14 13:51:20 -0800
   Commit: ade72c4, github.com/apache/spark/pull/3131

   Update failed assert text to match code in SizeEstimatorSuite
   Jeff Hammerbacher <jeff.hammerbacher@gmail.com>
   2014-11-14 13:37:48 -0800
   Commit: c258db9, github.com/apache/spark/pull/3242

   [SPARK-4313][WebUI][Yarn] Fix link issue of the executor thread dump page in yarn-cluster mode
   zsxwing <zsxwing@gmail.com>
   2014-11-14 13:36:13 -0800
   Commit: 156cf33, github.com/apache/spark/pull/3183

   SPARK-3663 Document SPARK_LOG_DIR and SPARK_PID_DIR
   Andrew Ash <andrew@andrewash.com>
   2014-11-14 13:33:35 -0800
   Commit: 5c265cc, github.com/apache/spark/pull/2518

   [Spark Core] SPARK-4380 Edit spilling log from MB to B
   Hong Shen <hongshen@tencent.com>
   2014-11-14 13:29:41 -0800
   Commit: 0c56a03, github.com/apache/spark/pull/3243

   [SPARK-4398][PySpark] specialize sc.parallelize(xrange)
   Xiangrui Meng <meng@databricks.com>
   2014-11-14 12:43:17 -0800
   Commit: abd5817, github.com/apache/spark/pull/3264

   [SPARK-4394][SQL] Data Sources API Improvements
   Michael Armbrust <michael@databricks.com>
   2014-11-14 12:00:08 -0800
   Commit: 77e845c, github.com/apache/spark/pull/3260

   [SPARK-3722][Docs] minor improvement and fix in docs
   WangTao <barneystinson@aliyun.com>
   2014-11-14 08:09:42 -0600
   Commit: e421072, github.com/apache/spark/pull/2579

   [SPARK-4310][WebUI] Sort 'Submitted' column in Stage page by time
   zsxwing <zsxwing@gmail.com>
   2014-11-13 14:37:04 -0800
   Commit: 825709a, github.com/apache/spark/pull/3179

   [SPARK-4372][MLLIB] Make LR and SVM's default parameters consistent in Scala and Python
   Xiangrui Meng <meng@databricks.com>
   2014-11-13 13:54:16 -0800
   Commit: 3221830, github.com/apache/spark/pull/3232

   [SPARK-4326] fix unidoc
   Xiangrui Meng <meng@databricks.com>
   2014-11-13 13:16:20 -0800
   Commit: 4b0c1ed, github.com/apache/spark/pull/3253

   [HOT FIX] make-distribution.sh fails if Yarn shuffle jar DNE
   Andrew Or <andrew@databricks.com>
   2014-11-13 11:54:45 -0800
   Commit: a0fa1ba, github.com/apache/spark/pull/3250

   [SPARK-4378][MLLIB] make ALS more Java-friendly
   Xiangrui Meng <meng@databricks.com>
   2014-11-13 11:42:27 -0800
   Commit: ca26a21, github.com/apache/spark/pull/3240

   [SPARK-4348] [PySpark] [MLlib] rename random.py to rand.py
   Davies Liu <davies@databricks.com>
   2014-11-13 10:24:54 -0800
   Commit: ce0333f, github.com/apache/spark/pull/3216

   [SPARK-4256] Make Binary Evaluation Metrics functions defined in cases where there ar...
   Andrew Bullen <andrew.bullen@workday.com>
   2014-11-12 22:14:44 -0800
   Commit: 484fecb, github.com/apache/spark/pull/3118

   [SPARK-4370] [Core] Limit number of Netty cores based on executor size
   Aaron Davidson <aaron@databricks.com>
   2014-11-12 18:46:37 -0800
   Commit: b9e1c2e, github.com/apache/spark/pull/3155

   [SPARK-4373][MLLIB] fix MLlib maven tests
   Xiangrui Meng <meng@databricks.com>
   2014-11-12 18:15:14 -0800
   Commit: 23f5bdf, github.com/apache/spark/pull/3235

   [Release] Bring audit scripts up-to-date
   Andrew Or <andrew@databricks.com>
   2014-11-13 00:30:58 +0000
   Commit: 723a86b

   [SPARK-2672] support compressed file in wholeTextFile
   Davies Liu <davies@databricks.com>
   2014-11-12 15:58:12 -0800
   Commit: d7d54a4, github.com/apache/spark/pull/3005

   [SPARK-4369] [MLLib] fix TreeModel.predict() with RDD
   Davies Liu <davies@databricks.com>
   2014-11-12 13:56:41 -0800
   Commit: bd86118, github.com/apache/spark/pull/3230

   [SPARK-3666] Extract interfaces for EdgeRDD and VertexRDD
   Ankur Dave <ankurdave@gmail.com>
   2014-11-12 13:49:20 -0800
   Commit: a5ef581, github.com/apache/spark/pull/2530

   [Release] Correct make-distribution.sh log path
   Andrew Or <andrew@databricks.com>
   2014-11-12 13:46:26 -0800
   Commit: c3afd32

   Internal cleanup for aggregateMessages
   Ankur Dave <ankurdave@gmail.com>
   2014-11-12 13:44:49 -0800
   Commit: 0402be9, github.com/apache/spark/pull/3231

   [SPARK-4281][Build] Package Yarn shuffle service into its own jar
   Andrew Or <andrew@databricks.com>
   2014-11-12 13:39:45 -0800
   Commit: aa43a8d, github.com/apache/spark/pull/3147

   [Test] Better exception message from SparkSubmitSuite
   Andrew Or <andrew@databricks.com>
   2014-11-12 13:35:48 -0800
   Commit: 6e3c5a2, github.com/apache/spark/pull/3212

   [SPARK-3660][STREAMING] Initial RDD for updateStateByKey transformation
   Soumitra Kumar <kumar.soumitra@gmail.com>
   2014-11-12 12:25:31 -0800
   Commit: 36ddeb7, github.com/apache/spark/pull/2665

   [SPARK-3530][MLLIB] pipeline and parameters with examples
   Xiangrui Meng <meng@databricks.com>
   2014-11-12 10:38:57 -0800
   Commit: 4b736db, github.com/apache/spark/pull/3099

   [SPARK-4355][MLLIB] fix OnlineSummarizer.merge when other.mean is zero
   Xiangrui Meng <meng@databricks.com>
   2014-11-12 01:50:11 -0800
   Commit: 84324fb, github.com/apache/spark/pull/3220

   [SPARK-3936] Add aggregateMessages, which supersedes mapReduceTriplets
   Ankur Dave <ankurdave@gmail.com>
   2014-11-11 23:38:27 -0800
   Commit: faeb41d, github.com/apache/spark/pull/3100

   [MLLIB] SPARK-4347: Reducing GradientBoostingSuite run time.
   Manish Amde <manish9ue@gmail.com>
   2014-11-11 22:47:53 -0800
   Commit: 2ef016b, github.com/apache/spark/pull/3214

   Support cross building for Scala 2.11
   Prashant Sharma <prashant.s@imaginea.com>, Patrick Wendell <pwendell@gmail.com>
   2014-11-11 21:36:48 -0800
   Commit: daaca14, github.com/apache/spark/pull/3159

   [Release] Log build output for each distribution
   Andrew Or <andrew@databricks.com>
   2014-11-11 18:02:59 -0800
   Commit: 2ddb141

   SPARK-2269 Refactor mesos scheduler resourceOffers and add unit test
   Timothy Chen <tnachen@gmail.com>
   2014-11-11 14:29:18 -0800
   Commit: a878660, github.com/apache/spark/pull/1487

   [SPARK-4282][YARN] Stopping flag in YarnClientSchedulerBackend should be volatile
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2014-11-11 12:33:53 -0600
   Commit: 7f37188, github.com/apache/spark/pull/3143

   SPARK-4305 [BUILD] yarn-alpha profile won't build due to network/yarn module
   Sean Owen <sowen@cloudera.com>
   2014-11-11 12:30:35 -0600
   Commit: f820b56, github.com/apache/spark/pull/3167

   SPARK-1830 Deploy failover, Make Persistence engine and LeaderAgent Pluggable
   Prashant Sharma <prashant.s@imaginea.com>
   2014-11-11 09:29:48 -0800
   Commit: deefd9d, github.com/apache/spark/pull/771

   [Streaming][Minor]Replace some 'if-else' in Clock
   huangzhaowei <carlmartinmax@gmail.com>
   2014-11-11 03:02:12 -0800
   Commit: 6e03de3, github.com/apache/spark/pull/3088

   [SPARK-2492][Streaming] kafkaReceiver minor changes to align with Kafka 0.8
   jerryshao <saisai.shao@intel.com>
   2014-11-11 02:22:23 -0800
   Commit: c8850a3, github.com/apache/spark/pull/1420

   [SPARK-4295][External]Fix exception in SparkSinkSuite
   maji2014 <maji3@asiainfo.com>
   2014-11-11 02:18:27 -0800
   Commit: f8811a5, github.com/apache/spark/pull/3177

   [SPARK-4307] Initialize FileDescriptor lazily in FileRegion.
   Reynold Xin <rxin@databricks.com>, Reynold Xin <rxin@apache.org>
   2014-11-11 00:25:31 -0800
   Commit: ef29a9a, github.com/apache/spark/pull/3172

   [SPARK-4324] [PySpark] [MLlib] support numpy.array for all MLlib API
   Davies Liu <davies@databricks.com>
   2014-11-10 22:26:16 -0800
   Commit: 65083e9, github.com/apache/spark/pull/3189

   [SPARK-4330][Doc] Link to proper URL for YARN overview
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2014-11-10 22:18:00 -0800
   Commit: 3c07b8f, github.com/apache/spark/pull/3196

   [SPARK-3649] Remove GraphX custom serializers
   Ankur Dave <ankurdave@gmail.com>
   2014-11-10 19:31:52 -0800
   Commit: 300887b, github.com/apache/spark/pull/2503

   [SPARK-4274] [SQL] Fix NPE in printing the details of the query plan
   Cheng Hao <hao.cheng@intel.com>
   2014-11-10 17:46:05 -0800
   Commit: c764d0a, github.com/apache/spark/pull/3139

   [SPARK-3954][Streaming] Optimization to FileInputDStream
   surq <surq@asiainfo.com>
   2014-11-10 17:37:16 -0800
   Commit: ce6ed2a, github.com/apache/spark/pull/2811

   [SPARK-4149][SQL] ISO 8601 support for json date time strings
   Daoyuan Wang <daoyuan.wang@intel.com>
   2014-11-10 17:26:03 -0800
   Commit: a1fc059, github.com/apache/spark/pull/3012

   [SPARK-4250] [SQL] Fix bug of constant null value mapping to ConstantObjectInspector
   Cheng Hao <hao.cheng@intel.com>
   2014-11-10 17:22:57 -0800
   Commit: fa77783, github.com/apache/spark/pull/3114

   [SQL] remove a decimal case branch that has no effect at runtime
   Xiangrui Meng <meng@databricks.com>
   2014-11-10 17:20:52 -0800
   Commit: d793d80, github.com/apache/spark/pull/3192

   [SPARK-4308][SQL] Sets SQL operation state to ERROR when exception is thrown
   Cheng Lian <lian@databricks.com>
   2014-11-10 16:56:36 -0800
   Commit: acb55ae, github.com/apache/spark/pull/3175

   [SPARK-4000][Build] Uploads HiveCompatibilitySuite logs
   Cheng Lian <lian@databricks.com>
   2014-11-10 16:17:52 -0800
   Commit: 534b231, github.com/apache/spark/pull/2993

   [SPARK-4319][SQL] Enable an ignored test "null count".
   Takuya UESHIN <ueshin@happy-camper.st>
   2014-11-10 15:55:15 -0800
   Commit: dbf1058, github.com/apache/spark/pull/3185

   Revert "[SPARK-2703][Core]Make Tachyon related unit tests execute without deploying a Tachyon system locally."
   Patrick Wendell <pwendell@gmail.com>
   2014-11-10 14:56:06 -0800
   Commit: 6e7a309

   [SPARK-4047] - Generate runtime warnings for example implementation of PageRank
   Varadharajan Mukundan <srinathsmn@gmail.com>
   2014-11-10 14:32:29 -0800
   Commit: 974d334, github.com/apache/spark/pull/2894

   SPARK-1297 Upgrade HBase dependency to 0.98
   tedyu <yuzhihong@gmail.com>
   2014-11-10 13:23:33 -0800
   Commit: b32734e, github.com/apache/spark/pull/3115

   SPARK-4230. Doc for spark.default.parallelism is incorrect
   Sandy Ryza <sandy@cloudera.com>
   2014-11-10 12:40:41 -0800
   Commit: c6f4e70, github.com/apache/spark/pull/3107

   [SPARK-4312] bash doesn't have "die"
   Jey Kottalam <jey@kottalam.net>
   2014-11-10 12:37:56 -0800
   Commit: c5db8e2, github.com/apache/spark/pull/2898

   Update RecoverableNetworkWordCount.scala
   comcmipi <pitonak@fns.uniba.sk>
   2014-11-10 12:33:48 -0800
   Commit: 0340c56, github.com/apache/spark/pull/2735

   SPARK-2548 [STREAMING] JavaRecoverableWordCount is missing
   Sean Owen <sowen@cloudera.com>
   2014-11-10 11:47:27 -0800
   Commit: 3a02d41, github.com/apache/spark/pull/2564

   [SPARK-4169] [Core] Accommodate non-English Locales in unit tests
   Niklas Wilcke <1wilcke@informatik.uni-hamburg.de>
   2014-11-10 11:37:38 -0800
   Commit: ed8bf1e, github.com/apache/spark/pull/3036

   [SQL] support udt to hive types conversion (hive->udt is not supported)
   Xiangrui Meng <meng@databricks.com>
   2014-11-10 11:04:12 -0800
   Commit: 894a724, github.com/apache/spark/pull/3164

   [SPARK-2703][Core]Make Tachyon related unit tests execute without deploying a Tachyon system locally.
   RongGu <gurongwalker@gmail.com>
   2014-11-09 23:48:15 -0800
   Commit: bd86cb1, github.com/apache/spark/pull/3030

   MAINTENANCE: Automated closing of pull requests.
   Patrick Wendell <pwendell@gmail.com>
   2014-11-09 23:07:14 -0800
   Commit: 227488d, github.com/apache/spark/pull/2898

   SPARK-3179. Add task OutputMetrics.
   Sandy Ryza <sandy@cloudera.com>
   2014-11-09 22:29:03 -0800
   Commit: 3c2cff4, github.com/apache/spark/pull/2968

   SPARK-1209 [CORE] (Take 2) SparkHadoop{MapRed,MapReduce}Util should not use package org.apache.hadoop
   Sean Owen <sowen@cloudera.com>
   2014-11-09 22:11:20 -0800
   Commit: f8e5732, github.com/apache/spark/pull/3048

   MAINTENANCE: Automated closing of pull requests.
   Patrick Wendell <pwendell@gmail.com>
   2014-11-09 18:16:20 -0800
   Commit: f73b56f, github.com/apache/spark/pull/464

   SPARK-1344 [DOCS] Scala API docs for top methods
   Sean Owen <sowen@cloudera.com>
   2014-11-09 17:42:08 -0800
   Commit: d136265, github.com/apache/spark/pull/3168

   SPARK-971 [DOCS] Link to Confluence wiki from project website / documentation
   Sean Owen <sowen@cloudera.com>
   2014-11-09 17:40:48 -0800
   Commit: 8c99a47, github.com/apache/spark/pull/3169

   [SPARK-4301] StreamingContext should not allow start() to be called after calling stop()
   Josh Rosen <joshrosen@databricks.com>
   2014-11-08 18:10:23 -0800
   Commit: 7b41b17, github.com/apache/spark/pull/3160

   [Minor] [Core] Don't NPE on closeQuietly(null)
   Aaron Davidson <aaron@databricks.com>
   2014-11-08 13:03:51 -0800
   Commit: 4af5c7e, github.com/apache/spark/pull/3166

   [SPARK-4291][Build] Rename network module projects
   Andrew Or <andrew@databricks.com>
   2014-11-07 23:16:13 -0800
   Commit: 7afc856, github.com/apache/spark/pull/3148

   [MLLIB] [PYTHON] SPARK-4221: Expose nonnegative ALS in the python API
   Michelangelo D'Agostino <mdagostino@civisanalytics.com>
   2014-11-07 22:53:01 -0800
   Commit: 7e9d975, github.com/apache/spark/pull/3095

   [SPARK-4304] [PySpark] Fix sort on empty RDD
   Davies Liu <davies@databricks.com>
   2014-11-07 20:53:03 -0800
   Commit: 7779109, github.com/apache/spark/pull/3162

   MAINTENANCE: Automated closing of pull requests.
   Patrick Wendell <pwendell@gmail.com>
   2014-11-07 13:08:25 -0800
   Commit: 5923dd9, github.com/apache/spark/pull/3016

   Update JavaCustomReceiver.java
   xiao321 <1042460381@qq.com>
   2014-11-07 12:56:49 -0800
   Commit: 7c9ec52, github.com/apache/spark/pull/3153

   [SPARK-4292][SQL] Result set iterator bug in JDBC/ODBC
   wangfei <wangfei1@huawei.com>
   2014-11-07 12:55:11 -0800
   Commit: d6e5552, github.com/apache/spark/pull/3149

   [SPARK-4203][SQL] Partition directories in random order when inserting into hive table
   Matthew Taylor <matthew.t@tbfe.net>
   2014-11-07 12:53:08 -0800
   Commit: ac70c97, github.com/apache/spark/pull/3076

   [SPARK-4270][SQL] Fix Cast from DateType to DecimalType.
   Takuya UESHIN <ueshin@happy-camper.st>
   2014-11-07 12:30:47 -0800
   Commit: a6405c5, github.com/apache/spark/pull/3134

   [SPARK-4272] [SQL] Add more unwrapper functions for primitive type in TableReader
   Cheng Hao <hao.cheng@intel.com>
   2014-11-07 12:15:53 -0800
   Commit: 60ab80f, github.com/apache/spark/pull/3136

   [SPARK-4213][SQL] ParquetFilters - No support for LT, LTE, GT, GTE operators
   Kousuke Saruta <sarutak@oss.nttdata.co.jp>
   2014-11-07 11:56:40 -0800
   Commit: 14c54f1, github.com/apache/spark/pull/3083

   [SQL] Modify keyword val location according to ordering
   Jacky Li <jacky.likun@gmail.com>
   2014-11-07 11:52:08 -0800
   Commit: 68609c5, github.com/apache/spark/pull/3080

   [SQL] Support ScalaReflection of schema in different universes
   Michael Armbrust <michael@databricks.com>
   2014-11-07 11:51:20 -0800
   Commit: 8154ed7, github.com/apache/spark/pull/3096

   [SPARK-4225][SQL] Resorts to SparkContext.version to inspect Spark version
   Cheng Lian <lian@databricks.com>
   2014-11-07 11:45:25 -0800
   Commit: 86e9eaa, github.com/apache/spark/pull/3105

   [SQL][DOC][Minor] Spark SQL Hive now supports dynamic partitioning
   wangfei <wangfei1@huawei.com>
   2014-11-07 11:43:35 -0800
   Commit: 636d7bc, github.com/apache/spark/pull/3127

   [SPARK-4187] [Core] Switch to binary protocol for external shuffle service messages
   Aaron Davidson <aaron@databricks.com>
   2014-11-07 09:42:21 -0800
   Commit: d4fa04e, github.com/apache/spark/pull/3146

   [SPARK-4204][Core][WebUI] Change Utils.exceptionString to contain the inner exceptions and make the error information in Web UI more friendly
   zsxwing <zsxwing@gmail.com>
   2014-11-06 21:52:12 -0800
   Commit: 3abdb1b, github.com/apache/spark/pull/3073

   [SPARK-4236] Cleanup removed applications' files in shuffle service
   Aaron Davidson <aaron@databricks.com>
   2014-11-06 19:54:32 -0800
   Commit: 48a19a6, github.com/apache/spark/pull/3126

   [SPARK-4188] [Core] Perform network-level retry of shuffle file fetches
   Aaron Davidson <aaron@databricks.com>
   2014-11-06 18:39:14 -0800
   Commit: f165b2b, github.com/apache/spark/pull/3101

   [SPARK-4277] Support external shuffle service on Standalone Worker
   Aaron Davidson <aaron@databricks.com>
   2014-11-06 17:20:46 -0800
   Commit: 6e9ef10, github.com/apache/spark/pull/3142

   [SPARK-3797] Minor addendum to Yarn shuffle service
   Andrew Or <andrew@databricks.com>
   2014-11-06 17:18:49 -0800
   Commit: 96136f2, github.com/apache/spark/pull/3144

   [HOT FIX] Make distribution fails
   Andrew Or <andrew@databricks.com>
   2014-11-06 15:31:07 -0800
   Commit: 470881b, github.com/apache/spark/pull/3145

   [SPARK-4249][GraphX] fix a problem of EdgePartitionBuilder in GraphX
   lianhuiwang <lianhuiwang09@gmail.com>
   2014-11-06 10:46:45 -0800
   Commit: d15c6e9, github.com/apache/spark/pull/3138

   [SPARK-4264] Completion iterator should only invoke callback once
   Aaron Davidson <aaron@databricks.com>
   2014-11-06 10:45:46 -0800
   Commit: 23eaf0e, github.com/apache/spark/pull/3128

   [SPARK-4186] add binaryFiles and binaryRecords in Python
   Davies Liu <davies@databricks.com>
   2014-11-06 00:22:19 -0800
   Commit: b41a39e, github.com/apache/spark/pull/3078

   [SPARK-4255] Fix incorrect table striping
   Kay Ousterhout <kayousterhout@gmail.com>
   2014-11-06 00:03:03 -0800
   Commit: 5f27ae1, github.com/apache/spark/pull/3117

   [SPARK-4137] [EC2] Don't change working dir on user
   Nicholas Chammas <nicholas.chammas@gmail.com>
   2014-11-05 20:45:35 -0800
   Commit: db45f5a, github.com/apache/spark/pull/2988

   [SPARK-4262][SQL] add .schemaRDD to JavaSchemaRDD
   Xiangrui Meng <meng@databricks.com>
   2014-11-05 19:56:16 -0800
   Commit: 3d2b5bc, github.com/apache/spark/pull/3125

   [SPARK-4254] [mllib] MovieLensALS bug fix
   Joseph K. Bradley <joseph@databricks.com>
   2014-11-05 19:51:18 -0800
   Commit: c315d13, github.com/apache/spark/pull/3116

   [SPARK-4158] Fix for missing resources.
   Brenden Matthews <brenden@diddyinc.com>
   2014-11-05 16:02:44 -0800
   Commit: cb0eae3, github.com/apache/spark/pull/3024

   SPARK-3223 runAsSparkUser cannot change HDFS write permission properly i...
   Jongyoul Lee <jongyoul@gmail.com>
   2014-11-05 15:49:42 -0800
   Commit: f7ac8c2, github.com/apache/spark/pull/3034

   SPARK-4040. Update documentation to exemplify use of local (n) value, fo...
   jay@apache.org <jayunit100>
   2014-11-05 15:45:34 -0800
   Commit: 868cd4c, github.com/apache/spark/pull/2964

   [SPARK-3797] Run external shuffle service in Yarn NM
   Andrew Or <andrew@databricks.com>
   2014-11-05 15:42:05 -0800
   Commit: 61a5cce, github.com/apache/spark/pull/3082

   SPARK-4222 [CORE] use readFully in FixedLengthBinaryRecordReader
   industrial-sloth <industrial-sloth@users.noreply.github.com>
   2014-11-05 15:38:48 -0800
   Commit: f37817b, github.com/apache/spark/pull/3093

   [SPARK-3984] [SPARK-3983] Fix incorrect scheduler delay and display task deserialization time in UI
   Kay Ousterhout <kayousterhout@gmail.com>
   2014-11-05 15:30:31 -0800
   Commit: a46497e, github.com/apache/spark/pull/2832

   [SPARK-4242] [Core] Add SASL to external shuffle service
   Aaron Davidson <aaron@databricks.com>
   2014-11-05 14:38:43 -0800
   Commit: 4c42986, github.com/apache/spark/pull/3108

   [SPARK-4197] [mllib] GradientBoosting API cleanup and examples in Scala, Java
   Joseph K. Bradley <joseph@databricks.com>
   2014-11-05 10:33:13 -0800
   Commit: 5b3b6f6, github.com/apache/spark/pull/3094

   [SPARK-4029][Streaming] Update streaming driver to reliably save and recover received block metadata on driver failures
   Tathagata Das <tathagata.das1565@gmail.com>
   2014-11-05 01:21:53 -0800
   Commit: 5f13759, github.com/apache/spark/pull/3026

   [SPARK-3964] [MLlib] [PySpark] add Hypothesis test Python API
   Davies Liu <davies@databricks.com>
   2014-11-04 21:35:52 -0800
   Commit: c8abddc, github.com/apache/spark/pull/3091

   [SQL] Add String option for DSL AS
   Michael Armbrust <michael@databricks.com>
   2014-11-04 18:14:28 -0800
   Commit: 515abb9, github.com/apache/spark/pull/3097

   [SPARK-2938] Support SASL authentication in NettyBlockTransferService
   Aaron Davidson <aaron@databricks.com>
   2014-11-04 16:15:38 -0800
   Commit: 5e73138, github.com/apache/spark/pull/3087

   [Spark-4060] [MLlib] exposing special rdd functions to the public
   Niklas Wilcke <1wilcke@informatik.uni-hamburg.de>
   2014-11-04 09:57:03 -0800
   Commit: f90ad5d, github.com/apache/spark/pull/2907

   fixed MLlib Naive-Bayes java example bug
   Dariusz Kobylarz <darek.kobylarz@gmail.com>
   2014-11-04 09:53:43 -0800
   Commit: bcecd73, github.com/apache/spark/pull/3081

   [SPARK-3886] [PySpark] simplify serializer, use AutoBatchedSerializer by default.
   Davies Liu <davies@databricks.com>
   2014-11-03 23:56:14 -0800
   Commit: e4f4263, github.com/apache/spark/pull/2920

   [SPARK-4166][Core] Add a backward compatibility test for ExecutorLostFailure
   zsxwing <zsxwing@gmail.com>
   2014-11-03 22:47:45 -0800
   Commit: b671ce0, github.com/apache/spark/pull/3085

   [SPARK-4163][Core] Add a backward compatibility test for FetchFailed
   zsxwing <zsxwing@gmail.com>
   2014-11-03 22:40:43 -0800
   Commit: 9bdc841, github.com/apache/spark/pull/3086

   [SPARK-3573][MLLIB] Make MLlib's Vector compatible with SQL's SchemaRDD
   Xiangrui Meng <meng@databricks.com>
   2014-11-03 22:29:48 -0800
   Commit: 1a9c6cd, github.com/apache/spark/pull/3070

   [SPARK-4192][SQL] Internal API for Python UDT
   Xiangrui Meng <meng@databricks.com>
   2014-11-03 19:29:11 -0800
   Commit: 04450d1, github.com/apache/spark/pull/3068

   [FIX][MLLIB] fix seed in BaggedPointSuite
   Xiangrui Meng <meng@databricks.com>
   2014-11-03 18:50:37 -0800
   Commit: c5912ec, github.com/apache/spark/pull/3084

   [SPARK-611] Display executor thread dumps in web UI
   Josh Rosen <joshrosen@databricks.com>
   2014-11-03 18:18:47 -0800
   Commit: 4f035dd, github.com/apache/spark/pull/2944

   [SPARK-4168][WebUI] web stages number should show correctly when stages are more than 1000
   Zhang, Liye <liye.zhang@intel.com>
   2014-11-03 18:17:32 -0800
   Commit: 97a466e, github.com/apache/spark/pull/3035

   [SQL] Convert arguments to Scala UDFs
   Michael Armbrust <michael@databricks.com>
   2014-11-03 18:04:51 -0800
   Commit: 15b58a2, github.com/apache/spark/pull/3077

   SPARK-4178. Hadoop input metrics ignore bytes read in RecordReader insta...
   Sandy Ryza <sandy@cloudera.com>
   2014-11-03 15:19:01 -0800
   Commit: 2812815, github.com/apache/spark/pull/3045

   [SQL] More aggressive defaults
   Michael Armbrust <michael@databricks.com>
   2014-11-03 14:08:27 -0800
   Commit: 25bef7e, github.com/apache/spark/pull/3064

   [SPARK-4152] [SQL] Avoid data change in CTAS while table already existed
   Cheng Hao <hao.cheng@intel.com>
   2014-11-03 13:59:43 -0800
   Commit: e83f13e, github.com/apache/spark/pull/3013

   [SPARK-4202][SQL] Simple DSL support for Scala UDF
   Cheng Lian <lian@databricks.com>
   2014-11-03 13:20:33 -0800
   Commit: c238fb4, github.com/apache/spark/pull/3067

   [SPARK-3594] [PySpark] [SQL] take more rows to infer schema or sampling
   Davies Liu <davies.liu@gmail.com>, Davies Liu <davies@databricks.com>
   2014-11-03 13:17:09 -0800
   Commit: 24544fb, github.com/apache/spark/pull/2716

   [SPARK-4207][SQL] Query which has syntax like 'not like' is not working in Spark SQL
   ravipesala <ravindra.pesala@huawei.com>
   2014-11-03 13:07:41 -0800
   Commit: 2b6e1ce, github.com/apache/spark/pull/3075

   [SPARK-4211][Build] Fixes hive.version in Maven profile hive-0.13.1
   fi <coderfi@gmail.com>
   2014-11-03 12:56:56 -0800
   Commit: df607da, github.com/apache/spark/pull/3072

   [SPARK-4148][PySpark] fix seed distribution and add some tests for rdd.sample
   Xiangrui Meng <meng@databricks.com>
   2014-11-03 12:24:24 -0800
   Commit: 3cca196, github.com/apache/spark/pull/3010

   [EC2] Factor out Mesos spark-ec2 branch
   Nicholas Chammas <nicholas.chammas@gmail.com>
   2014-11-03 09:02:35 -0800
   Commit: 2aca97c, github.com/apache/spark/pull/3008