blob: 7292d91b808244cbdd5551884060904996594adf [file] [log] [blame]
Spark Change Log
----------------
Release 1.1.1
[SPARK-4480] Avoid many small spills in external data structures (1.1)
Andrew Or <andrew@databricks.com>
2014-11-19 10:45:42 -0800
Commit: 16bf5f3, github.com/apache/spark/pull/3354
[SPARK-4380] Log more precise number of bytes spilled (1.1)
Andrew Or <andrew@databricks.com>
2014-11-18 20:15:00 -0800
Commit: e22a759, github.com/apache/spark/pull/3355
[SPARK-4468][SQL] Backports #3334 to branch-1.1
Cheng Lian <lian@databricks.com>
2014-11-18 17:40:24 -0800
Commit: f9739b9, github.com/apache/spark/pull/3338
[SPARK-4433] fix a racing condition in zipWithIndex
Xiangrui Meng <meng@databricks.com>
2014-11-18 16:25:44 -0800
Commit: ae9b1f6, github.com/apache/spark/pull/3291
[SPARK-4393] Fix memory leak in ConnectionManager ACK timeout TimerTasks; use HashedWheelTimer (For branch-1.1)
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-11-18 12:09:18 -0800
Commit: 91b5fa8, github.com/apache/spark/pull/3321
[SPARK-4467] Partial fix for fetch failure in sort-based shuffle (1.1)
Andrew Or <andrew@databricks.com>
2014-11-17 18:10:49 -0800
Commit: aa9ebda, github.com/apache/spark/pull/3330
Revert "[SPARK-4075] [Deploy] Jar url validation is not enough for Jar file"
Andrew Or <andrew@databricks.com>
2014-11-17 11:25:38 -0800
Commit: b528367
[branch-1.1][SPARK-4355] OnlineSummarizer doesn't merge mean correctly
Xiangrui Meng <meng@databricks.com>
2014-11-13 15:36:03 -0800
Commit: 4b1c77c, github.com/apache/spark/pull/3251
[Release] Correct make-distribution.sh log path
Andrew Or <andrew@databricks.com>
2014-11-12 13:46:26 -0800
Commit: ba6d81d
[Release] Bring audit scripts up-to-date
Andrew Or <andrewor14@gmail.com>
2014-11-13 00:30:58 +0000
Commit: 88bc482
[Release] Log build output for each distribution
Andrew Or <andrew@databricks.com>
2014-11-11 18:02:59 -0800
Commit: e3a5ee9
Revert "SPARK-3039: Allow spark to be built using avro-mapred for hadoop2"
Andrew Or <andrew@databricks.com>
2014-11-12 00:04:30 -0800
Commit: 45a01b6
Update CHANGES.txt
Andrew Or <andrewor14@gmail.com>
2014-11-11 23:11:32 +0000
Commit: 131c626
[SPARK-4295][External]Fix exception in SparkSinkSuite
maji2014 <maji3@asiainfo.com>
2014-11-11 02:18:27 -0800
Commit: bf867c3, github.com/apache/spark/pull/3177
[branch-1.1][SPARK-3990] add a note on ALS usage
Xiangrui Meng <meng@databricks.com>
2014-11-10 22:39:09 -0800
Commit: b2cb357, github.com/apache/spark/pull/3190
[BRANCH-1.1][SPARK-2652] change the default spark.serializer in pyspark back to Kryo
Xiangrui Meng <meng@databricks.com>
2014-11-10 22:21:14 -0800
Commit: 11798d0, github.com/apache/spark/pull/3187
[SPARK-4330][Doc] Link to proper URL for YARN overview
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-11-10 22:18:00 -0800
Commit: d313be8, github.com/apache/spark/pull/3196
[SQL] Backport backtick and smallint JDBC fixes to 1.1
Michael Armbrust <michael@databricks.com>, ravipesala <ravindra.pesala@huawei.com>, scwf <wangfei1@huawei.com>
2014-11-10 19:51:07 -0800
Commit: 8a1d818, github.com/apache/spark/pull/3199
Update versions for 1.1.1 release
Andrew Or <andrew@databricks.com>
2014-11-10 18:40:34 -0800
Commit: 01d233e
[SPARK-3495][SPARK-3496] Backporting block replication fixes made in master to branch 1.1
Tathagata Das <tathagata.das1565@gmail.com>
2014-11-10 18:23:02 -0800
Commit: be0cc99, github.com/apache/spark/pull/3191
[SPARK-3954][Streaming] Optimization to FileInputDStream
surq <surq@asiainfo.com>
2014-11-10 17:37:16 -0800
Commit: 3d889df, github.com/apache/spark/pull/2811
[SPARK-3971][SQL] Backport #2843 to branch-1.1
Cheng Lian <lian@databricks.com>, Cheng Lian <lian.cs.zju@gmail.com>, Michael Armbrust <michael@databricks.com>
2014-11-10 17:04:10 -0800
Commit: 64945f8, github.com/apache/spark/pull/3113
[SPARK-4308][SQL] Follow up of #3175 for branch 1.1
Cheng Lian <lian@databricks.com>
2014-11-10 16:57:34 -0800
Commit: b3ef06b, github.com/apache/spark/pull/3176
[SPARK-2548][HOTFIX][Streaming] Removed use of o.a.s.streaming.Durations in branch 1.1
Tathagata Das <tathagata.das1565@gmail.com>
2014-11-10 14:13:42 -0800
Commit: 86b1bd0, github.com/apache/spark/pull/3188
Update RecoverableNetworkWordCount.scala
comcmipi <pitonak@fns.uniba.sk>
2014-11-10 12:33:48 -0800
Commit: 254b135, github.com/apache/spark/pull/2735
SPARK-2548 [STREAMING] JavaRecoverableWordCount is missing
Sean Owen <sowen@cloudera.com>
2014-11-10 11:47:27 -0800
Commit: cdcf546, github.com/apache/spark/pull/2564
[SPARK-4169] [Core] Accommodate non-English Locales in unit tests
Niklas Wilcke <1wilcke@informatik.uni-hamburg.de>
2014-11-10 11:37:38 -0800
Commit: dc38def, github.com/apache/spark/pull/3036
[SPARK-4301] StreamingContext should not allow start() to be called after calling stop()
Josh Rosen <joshrosen@databricks.com>
2014-11-08 18:10:23 -0800
Commit: 78cd3ab, github.com/apache/spark/pull/3160
[SPARK-4304] [PySpark] Fix sort on empty RDD
Davies Liu <davies@databricks.com>
2014-11-07 20:53:03 -0800
Commit: 4895f65, github.com/apache/spark/pull/3162
Update JavaCustomReceiver.java
xiao321 <1042460381@qq.com>
2014-11-07 12:56:49 -0800
Commit: 4fb26df, github.com/apache/spark/pull/3153
[SPARK-4249][GraphX]fix a problem of EdgePartitionBuilder in Graphx
lianhuiwang <lianhuiwang09@gmail.com>
2014-11-06 10:46:45 -0800
Commit: 0a40eac, github.com/apache/spark/pull/3138
[SPARK-4158] Fix for missing resources.
Brenden Matthews <brenden@diddyinc.com>
2014-11-05 16:02:44 -0800
Commit: c58c1bb, github.com/apache/spark/pull/3024
SPARK-3223 runAsSparkUser cannot change HDFS write permission properly i...
Jongyoul Lee <jongyoul@gmail.com>
2014-11-05 15:49:42 -0800
Commit: 590a943, github.com/apache/spark/pull/3034
[branch-1.1][SPARK-4148][PySpark] fix seed distribution and add some tests for rdd.sample
Xiangrui Meng <meng@databricks.com>
2014-11-05 10:30:10 -0800
Commit: 44751af, github.com/apache/spark/pull/3104
[SPARK-4115][GraphX] Add overrided count for edge counting of EdgeRDD.
luluorta <luluorta@gmail.com>
2014-11-01 01:22:46 -0700
Commit: 1b282cd, github.com/apache/spark/pull/2975
[SPARK-4097] Fix the race condition of 'thread'
zsxwing <zsxwing@gmail.com>
2014-10-29 14:42:50 -0700
Commit: abdb90b, github.com/apache/spark/pull/2957
[SPARK-4065] Add check for IPython on Windows
Michael Griffiths <msjgriffiths@gmail.com>
2014-10-28 12:47:21 -0700
Commit: f0c5717, github.com/apache/spark/pull/2910
[SPARK-4107] Fix incorrect handling of read() and skip() return values (branch-1.1 backport)
Josh Rosen <joshrosen@databricks.com>
2014-10-28 12:30:12 -0700
Commit: 286f1ef, github.com/apache/spark/pull/2974
[SPARK-4110] Wrong comments about default settings in spark-daemon.sh
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-10-28 12:29:01 -0700
Commit: dee3317, github.com/apache/spark/pull/2972
[MLlib] SPARK-3987: add test case on objective value for NNLS
coderxiang <shuoxiangpub@gmail.com>
2014-10-27 19:43:39 -0700
Commit: 2ef2f5a, github.com/apache/spark/pull/2965
Fix build breakage introduced by 6c10c2770c718287f9cc2af4109b701fa1057b70
Josh Rosen <joshrosen@databricks.com>
2014-10-25 20:33:17 -0700
Commit: 2eb9d7c
Revert "[SPARK-4056] Upgrade snappy-java to 1.1.1.5"
Josh Rosen <joshrosen@databricks.com>
2014-10-25 17:09:01 -0700
Commit: c1989aa
[SPARK-4056] Upgrade snappy-java to 1.1.1.5
Josh Rosen <rosenville@gmail.com>, Josh Rosen <joshrosen@databricks.com>
2014-10-24 17:21:08 -0700
Commit: b7541ae, github.com/apache/spark/pull/2911
[SPARK-4080] Only throw IOException from [write|read][Object|External]
Josh Rosen <joshrosen@databricks.com>
2014-10-24 15:06:15 -0700
Commit: 6c10c27, github.com/apache/spark/pull/2932
[SPARK-4006] In long running contexts, we encountered the situation of d...
Tal Sliwowicz <tal.s@taboola.com>
2014-10-24 13:51:25 -0700
Commit: 59297e9, github.com/apache/spark/pull/2915
[SPARK-4075] [Deploy] Jar url validation is not enough for Jar file
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-10-24 13:08:21 -0700
Commit: 80dde80, github.com/apache/spark/pull/2925
[SPARK-4076] Parameter expansion in spark-config is wrong
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-10-24 13:04:35 -0700
Commit: 386fc46, github.com/apache/spark/pull/2930
[SPARK-2652] [PySpark] donot use KyroSerializer as default serializer
Davies Liu <davies@databricks.com>
2014-10-23 23:58:00 -0700
Commit: 926f8ca, github.com/apache/spark/pull/2916
[SPARK-3426] Fix sort-based shuffle error when spark.shuffle.compress and spark.shuffle.spill.compress settings are different
Josh Rosen <joshrosen@databricks.com>
2014-10-22 14:49:58 -0700
Commit: 5e191fa, github.com/apache/spark/pull/2890
[SPARK-3877][YARN] Throw an exception when application is not successful so that the exit code wil be set to 1 (for branch-1.1)
zsxwing <zsxwing@gmail.com>
2014-10-22 15:08:28 -0700
Commit: eb62094, github.com/apache/spark/pull/2748
[SPARK-4010][Web UI]Spark UI returns 500 in yarn-client mode
GuoQiang Li <witgo@qq.com>
2014-10-20 11:01:26 -0700
Commit: 457ef59, github.com/apache/spark/pull/2858
[SPARK-3948][Shuffle]Fix stream corruption bug in sort-based shuffle
jerryshao <saisai.shao@intel.com>
2014-10-20 10:20:21 -0700
Commit: 12a61d8, github.com/apache/spark/pull/2824
[SPARK-2546] Clone JobConf for each task (branch-1.0 / 1.1 backport)
Josh Rosen <joshrosen@apache.org>
2014-10-19 00:31:06 -0700
Commit: 2cd40db, github.com/apache/spark/pull/2684
SPARK-3926 [CORE] Result of JavaRDD.collectAsMap() is not Serializable
Sean Owen <sowen@cloudera.com>
2014-10-18 12:33:20 -0700
Commit: 327404d, github.com/apache/spark/pull/2805
[SPARK-3606] [yarn] Correctly configure AmIpFilter for Yarn HA (1.1 vers...
Marcelo Vanzin <vanzin@cloudera.com>
2014-10-17 00:53:15 -0700
Commit: 0d958f1, github.com/apache/spark/pull/2497
[SPARK-3067] JobProgressPage could not show Fair Scheduler Pools section sometimes
yantangzhai <tyz0303@163.com>, YanTangZhai <hakeemzhai@tencent.com>
2014-10-16 19:25:37 -0700
Commit: 35875e9, github.com/apache/spark/pull/1966
[SPARK-3890][Docs]remove redundant spark.executor.memory in doc
WangTaoTheTonic <barneystinson@aliyun.com>, WangTao <barneystinson@aliyun.com>
2014-10-16 19:12:39 -0700
Commit: 2c41170, github.com/apache/spark/pull/2745
[SQL]typo in HiveFromSpark
Kun Li <jacky.likun@gmail.com>
2014-10-16 19:00:10 -0700
Commit: 61e5903, github.com/apache/spark/pull/2809
SPARK-3807: SparkSql does not work for tables created using custom serde
chirag <chirag.aggarwal@guavus.com>
2014-10-13 13:47:26 -0700
Commit: 925e22d, github.com/apache/spark/pull/2674
[SPARK-3899][Doc]fix wrong links in streaming doc
w00228970 <wangfei1@huawei.com>, wangfei <wangfei1@huawei.com>
2014-10-12 23:35:50 -0700
Commit: 4fc6638, github.com/apache/spark/pull/2749
[SPARK-3905][Web UI]The keys for sorting the columns of Executor page ,Stage page Storage page are incorrect
GuoQiang Li <witgo@qq.com>
2014-10-12 22:48:54 -0700
Commit: a36116c, github.com/apache/spark/pull/2763
[SPARK-3121] Wrong implementation of implicit bytesWritableConverter
Jakub Dubovský <james64@inMail.sk>, Dubovsky Jakub <dubovsky@avast.com>
2014-10-12 22:03:26 -0700
Commit: 0e32579, github.com/apache/spark/pull/2712
SPARK-3716 [GraphX] Update Analytics.scala for partitionStrategy assignment
NamelessAnalyst <NamelessAnalyst@users.noreply.github.com>
2014-10-12 14:18:55 -0700
Commit: 5a21e3e, github.com/apache/spark/pull/2569
[SPARK-3711][SQL] Optimize where in clause filter queries
Yash Datta <Yash.Datta@guavus.com>
2014-10-09 12:59:14 -0700
Commit: 18ef22a, github.com/apache/spark/pull/2561
[SPARK-3844][UI] Truncate appName in WebUI if it is too long
Xiangrui Meng <meng@databricks.com>
2014-10-09 00:00:24 -0700
Commit: 09d6a81, github.com/apache/spark/pull/2707
[SPARK-3788] [yarn] Fix compareFs to do the right thing for HDFS namespaces (1.1 version).
Marcelo Vanzin <vanzin@cloudera.com>
2014-10-08 08:51:17 -0500
Commit: a44af73, github.com/apache/spark/pull/2650
[SPARK-3829] Make Spark logo image on the header of HistoryPage as a link to HistoryPage's page #1
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-10-07 16:54:49 -0700
Commit: a1f833f, github.com/apache/spark/pull/2690
[SPARK-3777] Display "Executor ID" for Tasks in Stage page
zsxwing <zsxwing@gmail.com>
2014-10-07 16:00:22 -0700
Commit: e8afb73, github.com/apache/spark/pull/2642
[SPARK-3731] [PySpark] fix memory leak in PythonRDD
Davies Liu <davies.liu@gmail.com>
2014-10-07 12:20:12 -0700
Commit: 5531830, github.com/apache/spark/pull/2668
[SPARK-3825] Log more detail when unrolling a block fails
Andrew Or <andrewor14@gmail.com>
2014-10-07 12:52:10 -0700
Commit: 267c7be, github.com/apache/spark/pull/2688
[SPARK-3808] PySpark fails to start in Windows
Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
2014-10-07 11:53:22 -0700
Commit: 3a7875d, github.com/apache/spark/pull/2669
[SPARK-3827] Very long RDD names are not rendered properly in web UI
Hossein <hossein@databricks.com>
2014-10-07 11:46:26 -0700
Commit: 82ab4a7, github.com/apache/spark/pull/2687
[SPARK-3792][SQL] Enable JavaHiveQLSuite
scwf <wangfei1@huawei.com>
2014-10-05 17:47:20 -0700
Commit: 964e3aa, github.com/apache/spark/pull/2652
SPARK-1656: Fix potential resource leaks
zsxwing <zsxwing@gmail.com>
2014-10-05 09:55:17 -0700
Commit: c068d90, github.com/apache/spark/pull/577
[SPARK-3597][Mesos] Implement `killTask`.
Brenden Matthews <brenden@diddyinc.com>
2014-10-05 09:49:24 -0700
Commit: d9cf4d0, github.com/apache/spark/pull/2453
[SPARK-3774] typo comment in bin/utils.sh
Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
2014-10-03 13:12:37 -0700
Commit: e4ddede, github.com/apache/spark/pull/2639
[SPARK-3775] Not suitable error message in spark-shell.cmd
Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
2014-10-03 13:09:48 -0700
Commit: f130256, github.com/apache/spark/pull/2640
[SPARK-3535][Mesos] Fix resource handling.
Brenden Matthews <brenden@diddyinc.com>
2014-10-03 12:58:04 -0700
Commit: 6f15097, github.com/apache/spark/pull/2401
[SPARK-3696]Do not override the user-difined conf_dir
WangTaoTheTonic <barneystinson@aliyun.com>
2014-10-03 10:42:41 -0700
Commit: d5af9e1, github.com/apache/spark/pull/2541
SPARK-2058: Overriding SPARK_HOME/conf with SPARK_CONF_DIR
EugenCepoi <cepoi.eugen@gmail.com>
2014-10-03 10:03:15 -0700
Commit: 5d991db, github.com/apache/spark/pull/2481
[DEPLOY] SPARK-3759: Return the exit code of the driver process
Eric Eijkelenboom <ee@userreport.com>
2014-10-02 18:04:38 -0700
Commit: 699af62, github.com/apache/spark/pull/2628
[SPARK-3755][Core] avoid trying privileged port when request a non-privileged port
scwf <wangfei1@huawei.com>
2014-10-02 17:47:56 -0700
Commit: 16789f6, github.com/apache/spark/pull/2623
[SQL][Docs] Update the output of printSchema and fix a typo in SQL programming guide.
Yin Huai <huai@cse.ohio-state.edu>
2014-10-02 11:37:24 -0700
Commit: 6869351, github.com/apache/spark/pull/2630
SPARK-3638 | Forced a compatible version of http client in kinesis-asl profile
aniketbhatnagar <aniket.bhatnagar@gmail.com>
2014-10-01 18:31:18 -0700
Commit: c52c231, github.com/apache/spark/pull/2535
Typo error in KafkaWordCount example
Gaspar Munoz <munozs.88@gmail.com>
2014-10-01 13:47:22 -0700
Commit: 24ee616, github.com/apache/spark/pull/2614
[SPARK-3756] [Core]check exception is caused by an address-port collision properly
scwf <wangfei1@huawei.com>
2014-10-01 11:51:30 -0700
Commit: b4f690d, github.com/apache/spark/pull/2611
SPARK-2626 [DOCS] Stop SparkContext in all examples
Sean Owen <sowen@cloudera.com>
2014-10-01 11:28:22 -0700
Commit: 13f33cf, github.com/apache/spark/pull/2575
[SPARK-3755][Core] Do not bind port 1 - 1024 to server in spark
scwf <wangfei1@huawei.com>
2014-10-01 11:30:29 -0700
Commit: c8c3b49, github.com/apache/spark/pull/2610
[SPARK-3747] TaskResultGetter could incorrectly abort a stage if it cannot get result for a specific task
Reynold Xin <rxin@apache.org>
2014-10-01 00:29:14 -0700
Commit: a7d2df4, github.com/apache/spark/pull/2599
SPARK-3745 - fix check-license to properly download and check jar
shane knapp <incomplete@gmail.com>
2014-09-30 13:11:25 -0700
Commit: 06b96d4, github.com/apache/spark/pull/2596
[SPARK-3709] Executors don't always report broadcast block removal properly back to the driver (for branch-1.1)
Reynold Xin <rxin@apache.org>
2014-09-30 12:24:58 -0700
Commit: a8c6e82, github.com/apache/spark/pull/2591
[SPARK-3734] DriverRunner should not read SPARK_HOME from submitter's environment
Josh Rosen <joshrosen@apache.org>
2014-09-29 23:36:10 -0700
Commit: 48be657, github.com/apache/spark/pull/2586
Fixed the condition in StronglyConnectedComponents Issue: SPARK-3635
oded <oded@HP-DV6.c4internal.c4-security.com>
2014-09-29 18:05:53 -0700
Commit: 85dd513, github.com/apache/spark/pull/2486
[graphX] GraphOps: random pick vertex bug
yingjieMiao <yingjie@42go.com>
2014-09-29 18:01:27 -0700
Commit: e5ab113, github.com/apache/spark/pull/2553
[SPARK-3032][Shuffle] Fix key comparison integer overflow introduced sorting exception
jerryshao <saisai.shao@intel.com>
2014-09-29 11:25:32 -0700
Commit: df5a62f, github.com/apache/spark/pull/2514
[CORE] Bugfix: LogErr format in DAGScheduler.scala
Zhang, Liye <liye.zhang@intel.com>
2014-09-29 01:13:15 -0700
Commit: 7d88471, github.com/apache/spark/pull/2572
[SPARK-3715][Docs]minor typo
WangTaoTheTonic <barneystinson@aliyun.com>
2014-09-28 18:30:13 -0700
Commit: 004b6fa, github.com/apache/spark/pull/2567
Docs : use "--total-executor-cores" rather than "--cores" after spark-shell
CrazyJvm <crazyjvm@gmail.com>
2014-09-27 09:41:04 -0700
Commit: d9d94e0, github.com/apache/spark/pull/2540
SPARK-3639 | Removed settings master in examples
aniketbhatnagar <aniket.bhatnagar@gmail.com>
2014-09-26 09:47:58 -0700
Commit: d6ed5ab, github.com/apache/spark/pull/2536
[SPARK-1853] Show Streaming application code context (file, line number) in Spark Stages UI
Mubarak Seyed <mubarak.seyed@gmail.com>, Tathagata Das <tathagata.das1565@gmail.com>
2014-09-23 15:09:12 -0700
Commit: 505ed6b, github.com/apache/spark/pull/2464
[SPARK-3653] Respect SPARK_*_MEMORY for cluster mode
Andrew Or <andrewor14@gmail.com>
2014-09-23 14:00:33 -0700
Commit: 5bbc621, github.com/apache/spark/pull/2500
SPARK-3612. Executor shouldn't quit if heartbeat message fails to reach ...
Sandy Ryza <sandy@cloudera.com>
2014-09-23 13:44:18 -0700
Commit: ffd97be, github.com/apache/spark/pull/2487
Update docs to use jsonRDD instead of wrong jsonRdd.
Grega Kespret <grega.kespret@gmail.com>
2014-09-22 10:13:44 -0700
Commit: aab0a1d, github.com/apache/spark/pull/2479
[MLLib] Fix example code variable name misspelling in MLLib Feature Extraction guide
RJ Nowling <rnowling@gmail.com>
2014-09-22 09:10:41 -0700
Commit: 32bb97f, github.com/apache/spark/pull/2459
Revert "[SPARK-3595] Respect configured OutputCommitters when calling saveAsHadoopFile"
Patrick Wendell <pwendell@gmail.com>
2014-09-21 13:07:20 -0700
Commit: f5bf7de
[SPARK-3595] Respect configured OutputCommitters when calling saveAsHadoopFile
Ian Hummel <ian@themodernlife.net>
2014-09-21 13:04:36 -0700
Commit: 7a76657, github.com/apache/spark/pull/2450
[Docs] Fix outdated docs for standalone cluster
andrewor14 <andrewor14@gmail.com>, Andrew Or <andrewor14@gmail.com>
2014-09-19 16:02:38 -0700
Commit: fd88353, github.com/apache/spark/pull/2461
[SPARK-2062][GraphX] VertexRDD.apply does not use the mergeFunc
Larry Xiao <xiaodi@sjtu.edu.cn>, Blie Arkansol <xiaodi@sjtu.edu.cn>, Ankur Dave <ankurdave@gmail.com>
2014-09-18 23:32:32 -0700
Commit: 1687d6b, github.com/apache/spark/pull/1903
[Minor Hot Fix] Move a line in SparkSubmit to the right place
Andrew Or <andrewor14@gmail.com>
2014-09-18 17:49:28 -0700
Commit: cf15b22, github.com/apache/spark/pull/2452
[SPARK-3560] Fixed setting spark.jars system property in yarn-cluster mode
Victsm <victor.nju@gmail.com>, Min Shen <mshen@linkedin.com>
2014-09-18 15:58:14 -0700
Commit: 832dff6, github.com/apache/spark/pull/2449
[SPARK-3589][Minor]remove redundant code
WangTaoTheTonic <barneystinson@aliyun.com>
2014-09-18 12:07:24 -0700
Commit: 2b28692, github.com/apache/spark/pull/2445
[SPARK-3565]Fix configuration item not consistent with document
WangTaoTheTonic <barneystinson@aliyun.com>
2014-09-17 21:59:23 -0700
Commit: 32f2222, github.com/apache/spark/pull/2427
[SPARK-3564][WebUI] Display App ID on HistoryPage
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-09-17 16:31:58 -0700
Commit: 3f1f974, github.com/apache/spark/pull/2424
Docs: move HA subsections to a deeper indentation level
Andrew Ash <andrew@andrewash.com>
2014-09-17 15:07:57 -0700
Commit: 0690410, github.com/apache/spark/pull/2402
[SQL][DOCS] Improve table caching section
Michael Armbrust <michael@databricks.com>
2014-09-17 12:41:49 -0700
Commit: 85e7c52, github.com/apache/spark/pull/2434
[SPARK-3490] Disable SparkUI for tests (backport into 1.1)
Andrew Or <andrewor14@gmail.com>
2014-09-16 18:23:28 -0700
Commit: 937de93, github.com/apache/spark/pull/2415
[SPARK-3555] Fix UISuite race condition
Andrew Or <andrewor14@gmail.com>
2014-09-16 16:03:20 -0700
Commit: 856156b, github.com/apache/spark/pull/2418
[SQL][DOCS] Improve section on thrift-server
Michael Armbrust <michael@databricks.com>
2014-09-16 11:51:46 -0700
Commit: 75158a7, github.com/apache/spark/pull/2384
[SPARK-3518] Remove wasted statement in JsonProtocol
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-09-15 16:11:41 -0700
Commit: 99a6c5e, github.com/apache/spark/pull/2380
SPARK-3039: Allow spark to be built using avro-mapred for hadoop2
Bertrand Bossy <bertrandbossy@gmail.com>
2014-09-14 21:10:17 -0700
Commit: 78887f9, github.com/apache/spark/pull/1945
[SQL] [Docs] typo fixes
Nicholas Chammas <nicholas.chammas@gmail.com>
2014-09-13 12:34:20 -0700
Commit: 70f93d5, github.com/apache/spark/pull/2367
[SPARK-3515][SQL] Moves test suite setup code to beforeAll rather than in constructor
Cheng Lian <lian.cs.zju@gmail.com>
2014-09-12 20:14:09 -0700
Commit: 44e534e, github.com/apache/spark/pull/2375
[SPARK-3500] [SQL] use JavaSchemaRDD as SchemaRDD._jschema_rdd
Davies Liu <davies.liu@gmail.com>
2014-09-12 19:05:39 -0700
Commit: 9c06c72, github.com/apache/spark/pull/2369
[SPARK-3481] [SQL] Eliminate the error log in local Hive comparison test
Cheng Hao <hao.cheng@intel.com>
2014-09-12 11:29:30 -0700
Commit: 6cbf83c, github.com/apache/spark/pull/2352
Revert "[Spark-3490] Disable SparkUI for tests"
Andrew Or <andrewor14@gmail.com>
2014-09-12 10:40:03 -0700
Commit: f17b795
[SPARK-3465] fix task metrics aggregation in local mode
Davies Liu <davies.liu@gmail.com>
2014-09-11 18:53:26 -0700
Commit: e69deb8, github.com/apache/spark/pull/2338
[SPARK-3429] Don't include the empty string "" as a defaultAclUser
Andrew Ash <andrew@andrewash.com>
2014-09-11 17:28:36 -0700
Commit: 4245404, github.com/apache/spark/pull/2286
[Spark-3490] Disable SparkUI for tests
Andrew Or <andrewor14@gmail.com>
2014-09-11 17:18:46 -0700
Commit: 2ffc798, github.com/apache/spark/pull/2363
[SPARK-2140] Updating heap memory calculation for YARN stable and alpha.
Chris Cope <ccope@resilientscience.com>
2014-09-11 08:13:07 -0500
Commit: 06fb2d0, github.com/apache/spark/pull/2253
HOTFIX: Changing color on doc menu
Patrick Wendell <pwendell@gmail.com>
2014-09-10 22:14:55 -0700
Commit: e51ce9a
[SPARK-1919] Fix Windows spark-shell --jars
Andrew Or <andrewor14@gmail.com>
2014-09-02 10:47:05 -0700
Commit: 359cd59, github.com/apache/spark/pull/2211
[SPARK-3061] Fix Maven build under Windows
Josh Rosen <joshrosen@apache.org>, Josh Rosen <rosenville@gmail.com>, Josh Rosen <joshrosen@databricks.com>
2014-09-02 10:45:14 -0700
Commit: 23fd3e8, github.com/apache/spark/pull/2165
[SPARK-3345] Do correct parameters for ShuffleFileGroup
Liang-Chi Hsieh <viirya@gmail.com>
2014-09-03 17:04:53 -0700
Commit: e5f77ae, github.com/apache/spark/pull/2235
[SPARK-3193]output errer info when Process exit code is not zero in test suite
scwf <wangfei1@huawei.com>
2014-09-09 11:57:01 -0700
Commit: 2426268, github.com/apache/spark/pull/2108
SPARK-2425 Don't kill a still-running Application because of some misbehaving Executors
Mark Hamstra <markhamstra@gmail.com>
2014-09-08 20:51:56 -0700
Commit: e884805, github.com/apache/spark/pull/1360
[SQL] Minor edits to sql programming guide.
Henry Cook <hcook@eecs.berkeley.edu>
2014-09-08 14:56:37 -0700
Commit: 7a236dc, github.com/apache/spark/pull/2316
[SPARK-938][doc] Add OpenStack Swift support
Reynold Xin <rxin@apache.org>, Gil Vernik <gilv@il.ibm.com>
2014-09-07 20:56:04 -0700
Commit: 8c6306a, github.com/apache/spark/pull/is
Fixed typos in make-distribution.sh
Cheng Lian <lian.cs.zju@gmail.com>
2014-09-07 20:38:32 -0700
Commit: e45bfa8, github.com/apache/spark/pull/2121
[SPARK-3408] Fixed Limit operator so it works with sort-based shuffle.
Reynold Xin <rxin@apache.org>
2014-09-07 18:42:24 -0700
Commit: d555c2e, github.com/apache/spark/pull/2281
[SQL] Update SQL Programming Guide
Michael Armbrust <michael@databricks.com>, Yin Huai <huai@cse.ohio-state.edu>
2014-09-07 21:34:46 -0400
Commit: 65dae63, github.com/apache/spark/pull/2258
[SPARK-3394] [SQL] Fix crash in TakeOrdered when limit is 0
Eric Liang <ekl@google.com>
2014-09-07 17:57:59 -0700
Commit: c5d8d82, github.com/apache/spark/pull/2264
[SPARK-2419][Streaming][Docs] More updates to the streaming programming guide
Tathagata Das <tathagata.das1565@gmail.com>, Chris Fregly <chris@fregly.com>
2014-09-06 14:46:43 -0700
Commit: ce4053c, github.com/apache/spark/pull/2307
SPARK-3211 .take() is OOM-prone with empty partitions
Andrew Ash <andrew@andrewash.com>
2014-09-05 18:52:05 -0700
Commit: 28ce67b, github.com/apache/spark/pull/2117
[Docs] fix minor MLlib case typo
Nicholas Chammas <nicholas.chammas@gmail.com>
2014-09-04 23:37:06 -0700
Commit: 6b128be, github.com/apache/spark/pull/2278
[SPARK-3401][PySpark] Wrong usage of tee command in python/run-tests
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-09-04 10:29:11 -0700
Commit: dbf8120, github.com/apache/spark/pull/2272
[HOTFIX] [SPARK-3400] Revert 9b225ac "fix GraphX EdgeRDD zipPartitions"
Ankur Dave <ankurdave@gmail.com>
2014-09-03 23:49:47 -0700
Commit: 8c40ab5, github.com/apache/spark/pull/2271
[SPARK-3372] [MLlib] MLlib doesn't pass maven build / checkstyle due to multi-byte character contained in Gradient.scala
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-09-03 20:47:00 -0700
Commit: f41c45a, github.com/apache/spark/pull/2248
[SPARK-2419][Streaming][Docs] Updates to the streaming programming guide
Tathagata Das <tathagata.das1565@gmail.com>, Jacek Laskowski <jacek@japila.pl>
2014-09-03 17:38:01 -0700
Commit: 3111501, github.com/apache/spark/pull/2254
Release 1.1.0
[SPARK-3320][SQL] Made batched in-memory column buffer building work for SchemaRDDs with empty partitions
Cheng Lian <lian.cs.zju@gmail.com>
2014-08-29 18:16:47 -0700
Commit: aa9364a, github.com/apache/spark/pull/2213
[SPARK-3296][mllib] spark-example should be run-example in head notation of DenseKMeans and SparseNaiveBayes
wangfei <wangfei_hello@126.com>
2014-08-29 17:37:15 -0700
Commit: b0facb5, github.com/apache/spark/pull/2193
[SPARK-3291][SQL]TestcaseName in createQueryTest should not contain ":"
qiping.lqp <qiping.lqp@alibaba-inc.com>
2014-08-29 15:37:43 -0700
Commit: c1333b8, github.com/apache/spark/pull/2191
[SPARK-3269][SQL] Decreases initial buffer size for row set to prevent OOM
Cheng Lian <lian.cs.zju@gmail.com>
2014-08-29 15:36:04 -0700
Commit: 9bae345, github.com/apache/spark/pull/2171
[SPARK-3234][Build] Fixed environment variables that rely on deprecated command line options in make-distribution.sh
Cheng Lian <lian.cs.zju@gmail.com>
2014-08-29 15:29:43 -0700
Commit: cf049ef, github.com/apache/spark/pull/2208
[Docs] SQL doc formatting and typo fixes
Nicholas Chammas <nicholas.chammas@gmail.com>, nchammas <nicholas.chammas@gmail.com>
2014-08-29 15:23:32 -0700
Commit: bfa2dc9, github.com/apache/spark/pull/2201
[SPARK-3307] [PySpark] Fix doc string of SparkContext.broadcast()
Davies Liu <davies.liu@gmail.com>
2014-08-29 11:47:49 -0700
Commit: 98d0716, github.com/apache/spark/pull/2202
HOTFIX: Bump spark-ec2 version to 1.1.0
Patrick Wendell <pwendell@gmail.com>
2014-08-29 11:20:45 -0700
Commit: c71b5c6
Adding new CHANGES.txt
Patrick Wendell <pwendell@gmail.com>
2014-08-28 17:17:30 -0700
Commit: 7db87b3
[SPARK-3277] Fix external spilling with LZ4 assertion error
Andrew Or <andrewor14@gmail.com>, Patrick Wendell <pwendell@gmail.com>
2014-08-28 17:05:21 -0700
Commit: fe4df34, github.com/apache/spark/pull/2187
SPARK-3082. yarn.Client.logClusterResourceDetails throws NPE if requeste...
Sandy Ryza <sandy@cloudera.com>
2014-08-28 16:18:50 -0700
Commit: f4cbf5e, github.com/apache/spark/pull/1984
[SPARK-3190] Avoid overflow in VertexRDD.count()
Ankur Dave <ankurdave@gmail.com>
2014-08-28 15:17:01 -0700
Commit: 0b9718a, github.com/apache/spark/pull/2106
[SPARK-3264] Allow users to set executor Spark home in Mesos
Andrew Or <andrewor14@gmail.com>
2014-08-28 11:05:44 -0700
Commit: 069ecfe, github.com/apache/spark/pull/2166
[SPARK-3150] Fix NullPointerException in in Spark recovery: Add initializing default values in DriverInfo.init()
Tatiana Borisova <tanyatik@yandex.ru>
2014-08-28 10:36:36 -0700
Commit: fd98020, github.com/apache/spark/pull/2062
Additional CHANGES.txt
Patrick Wendell <pwendell@gmail.com>
2014-08-28 00:19:03 -0700
Commit: a9df703
[SPARK-3230][SQL] Fix udfs that return structs
Michael Armbrust <michael@databricks.com>
2014-08-28 00:15:23 -0700
Commit: 2e8ad99, github.com/apache/spark/pull/2133
[SQL] Fixed 2 comment typos in SQLConf
Cheng Lian <lian.cs.zju@gmail.com>
2014-08-28 00:08:09 -0700
Commit: c0e3bc1, github.com/apache/spark/pull/2172
HOTFIX: Don't build with YARN support for Mapr3
Patrick Wendell <pwendell@gmail.com>
2014-08-27 15:40:40 -0700
Commit: ad0fab2
[HOTFIX][SQL] Remove cleaning of UDFs
Michael Armbrust <michael@databricks.com>
2014-08-27 23:05:34 -0700
Commit: 233c283, github.com/apache/spark/pull/2174
[HOTFIX] Wait for EOF only for the PySpark shell
Andrew Or <andrewor14@gmail.com>
2014-08-27 23:03:46 -0700
Commit: 54ccd93, github.com/apache/spark/pull/2170
BUILD: Updating CHANGES.txt for Spark 1.1
Patrick Wendell <pwendell@gmail.com>
2014-08-27 15:55:59 -0700
Commit: 8597e9c
Add line continuation for script to work w/ py2.7.5
Matthew Farrellee <matt@redhat.com>
2014-08-27 15:50:30 -0700
Commit: d4cf7a0, github.com/apache/spark/pull/2139
[SPARK-3235][SQL] Ensure in-memory tables don't always broadcast.
Michael Armbrust <michael@databricks.com>
2014-08-27 15:14:08 -0700
Commit: 9a62cf3, github.com/apache/spark/pull/2147
[SPARK-3065][SQL] Add locale setting to fix results do not match for udf_unix_timestamp format "yyyy MMM dd h:mm:ss a" run with not "America/Los_Angeles" TimeZone in HiveCompatibilitySuite
luogankun <luogankun@gmail.com>
2014-08-27 15:08:22 -0700
Commit: 5ea260e, github.com/apache/spark/pull/1968
[SQL] [SPARK-3236] Reading Parquet tables from Metastore mangles location
Aaron Davidson <aaron@databricks.com>
2014-08-27 15:05:47 -0700
Commit: 7711687, github.com/apache/spark/pull/2150
[SPARK-3252][SQL] Add missing condition for test
viirya <viirya@gmail.com>
2014-08-27 14:55:05 -0700
Commit: b3d763b, github.com/apache/spark/pull/2159
[SPARK-3243] Don't use stale spark-driver.* system properties
Andrew Or <andrewor14@gmail.com>
2014-08-27 14:46:56 -0700
Commit: c1ffa3e, github.com/apache/spark/pull/2154
Spark-3213 Fixes issue with spark-ec2 not detecting slaves created with "Launch More like this"
Vida Ha <vida@databricks.com>
2014-08-27 14:26:06 -0700
Commit: 3cb4e17, github.com/apache/spark/pull/2163
[SPARK-3138][SQL] sqlContext.parquetFile should be able to take a single file as parameter
chutium <teng.qiu@gmail.com>
2014-08-27 13:13:04 -0700
Commit: 90f8f3e, github.com/apache/spark/pull/2044
[SPARK-3197] [SQL] Reduce the Expression tree object creations for aggregation function (min/max)
Cheng Hao <hao.cheng@intel.com>
2014-08-27 12:50:47 -0700
Commit: 4c7f082, github.com/apache/spark/pull/2113
[SPARK-3118][SQL]add "SHOW TBLPROPERTIES tblname;" and "SHOW COLUMNS (FROM|IN) table_name [(FROM|IN) db_name]" support
u0jing <u9jing@gmail.com>
2014-08-27 12:47:14 -0700
Commit: 19cda07, github.com/apache/spark/pull/2034
SPARK-3259 - User data should be given to the master
Allan Douglas R. de Oliveira <allan@chaordicsystems.com>
2014-08-27 12:43:22 -0700
Commit: 0c94a5b, github.com/apache/spark/pull/2162
[SPARK-2608][Core] Fixed command line option passing issue over Mesos via SPARK_EXECUTOR_OPTS
Cheng Lian <lian.cs.zju@gmail.com>
2014-08-27 12:39:21 -0700
Commit: 935bffe, github.com/apache/spark/pull/2161
[SPARK-3239] [PySpark] randomize the dirs for each process
Davies Liu <davies.liu@gmail.com>
2014-08-27 10:40:35 -0700
Commit: 092121e, github.com/apache/spark/pull/2152
[SPARK-3170][CORE][BUG]:RDD info loss in "StorageTab" and "ExecutorTab"
uncleGen <hustyugm@gmail.com>
2014-08-27 10:32:13 -0700
Commit: 8f8e2a4, github.com/apache/spark/pull/2131
[SPARK-3154][STREAMING] Make FlumePollingInputDStream shutdown cleaner.
Hari Shreedharan <hshreedharan@apache.org>
2014-08-27 02:39:02 -0700
Commit: 1d468df, github.com/apache/spark/pull/2065
[SPARK-3227] [mllib] Added migration guide for v1.0 to v1.1
Joseph K. Bradley <joseph.kurata.bradley@gmail.com>
2014-08-27 01:45:59 -0700
Commit: 7286d57, github.com/apache/spark/pull/2146
[SPARK-2830][MLLIB] doc update for 1.1
Xiangrui Meng <meng@databricks.com>
2014-08-27 01:19:48 -0700
Commit: 7401247, github.com/apache/spark/pull/2151
[SPARK-3237][SQL] Fix parquet filters with UDFs
Michael Armbrust <michael@databricks.com>
2014-08-27 00:59:23 -0700
Commit: ca01de1, github.com/apache/spark/pull/2153
[SPARK-3139] Made ContextCleaner to not block on shuffles
Tathagata Das <tathagata.das1565@gmail.com>
2014-08-27 00:13:38 -0700
Commit: 5cf1e44, github.com/apache/spark/pull/2143
HOTFIX: Minor typo in conf template
Patrick Wendell <pwendell@gmail.com>
2014-08-26 23:40:50 -0700
Commit: 6f82a4b
[SPARK-3167] Handle special driver configs in Windows (Branch 1.1)
Andrew Or <andrewor14@gmail.com>
2014-08-26 23:06:11 -0700
Commit: e7672f1, github.com/apache/spark/pull/2156
[SPARK-3224] FetchFailed reduce stages should only show up once in failed stages (in UI)
Reynold Xin <rxin@apache.org>, Kay Ousterhout <kayousterhout@gmail.com>
2014-08-26 21:59:48 -0700
Commit: 2381e90, github.com/apache/spark/pull/2127
Fix unclosed HTML tag in Yarn docs.
Josh Rosen <joshrosen@apache.org>
2014-08-26 18:55:00 -0700
Commit: 7726e56
[SPARK-3036][SPARK-3037][SQL] Add MapType/ArrayType containing null value support to Parquet.
Takuya UESHIN <ueshin@happy-camper.st>
2014-08-26 18:28:41 -0700
Commit: 8b5af6f, github.com/apache/spark/pull/2032
[Docs] Run tests like in contributing guide
nchammas <nicholas.chammas@gmail.com>
2014-08-26 17:50:04 -0700
Commit: 0d97233, github.com/apache/spark/pull/2149
[SPARK-2964] [SQL] Remove duplicated code from spark-sql and start-thriftserver.sh
Cheng Lian <lian.cs.zju@gmail.com>, Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-08-26 17:33:40 -0700
Commit: c0e1f99, github.com/apache/spark/pull/1886
[SPARK-3194][SQL] Add AttributeSet to fix bugs with invalid comparisons of AttributeReferences
Michael Armbrust <michael@databricks.com>
2014-08-26 16:29:14 -0700
Commit: a308a16, github.com/apache/spark/pull/2109
[SPARK-2839][MLlib] Stats Toolkit documentation updated
Burak <brkyvz@gmail.com>
2014-08-26 15:18:42 -0700
Commit: 2715eb7, github.com/apache/spark/pull/2130
[SPARK-3226][MLLIB] doc update for native libraries
Xiangrui Meng <meng@databricks.com>
2014-08-26 15:12:27 -0700
Commit: 5ff9000, github.com/apache/spark/pull/2128
[SPARK-3063][SQL] ExistingRdd should convert Map to catalyst Map.
Takuya UESHIN <ueshin@happy-camper.st>
2014-08-26 15:04:08 -0700
Commit: 5d981a4, github.com/apache/spark/pull/1963
[SPARK-2969][SQL] Make ScalaReflection be able to handle ArrayType.containsNull and MapType.valueContainsNull.
Takuya UESHIN <ueshin@happy-camper.st>
2014-08-26 13:22:55 -0700
Commit: 35a5853, github.com/apache/spark/pull/1889
[SPARK-2871] [PySpark] add histgram() API
Davies Liu <davies.liu@gmail.com>
2014-08-26 13:04:30 -0700
Commit: 83d2730, github.com/apache/spark/pull/2091
[SPARK-3131][SQL] Allow user to set parquet compression codec for writing ParquetFile in SQLContext
chutium <teng.qiu@gmail.com>
2014-08-26 11:51:26 -0700
Commit: 3a9d874, github.com/apache/spark/pull/2039
[SPARK-2886] Use more specific actor system name than "spark"
Andrew Or <andrewor14@gmail.com>
2014-08-25 23:36:09 -0700
Commit: 0f947f1, github.com/apache/spark/pull/1810
[Spark-3222] [SQL] Cross join support in HiveQL
Daoyuan Wang <daoyuan.wang@intel.com>, adrian-wang <daoyuanwong@gmail.com>
2014-08-25 22:56:35 -0700
Commit: 48a0749, github.com/apache/spark/pull/2124
SPARK-2481: The environment variables SPARK_HISTORY_OPTS is covered in spark-env.sh
witgo <witgo@qq.com>, GuoQiang Li <witgo@qq.com>
2014-08-25 19:22:27 -0700
Commit: 4d6a0e9, github.com/apache/spark/pull/1341
[SPARK-3011][SQL] _temporary directory should be filtered out by sqlContext.parquetFile
Chia-Yung Su <chiayung@appier.com>
2014-08-25 18:20:19 -0700
Commit: b5dc9b4, github.com/apache/spark/pull/1959
[SQL] logWarning should be logInfo in getResultSetSchema
wangfei <wangfei_hello@126.com>
2014-08-25 17:46:43 -0700
Commit: 957b356, github.com/apache/spark/pull/1939
[SPARK-3058] [SQL] Support EXTENDED for EXPLAIN
Cheng Hao <hao.cheng@intel.com>
2014-08-25 17:43:56 -0700
Commit: f8ac8ed, github.com/apache/spark/pull/1962
[SPARK-2929][SQL] Refactored Thrift server and CLI suites
Cheng Lian <lian.cs.zju@gmail.com>
2014-08-25 16:29:59 -0700
Commit: 292f28d, github.com/apache/spark/pull/1856
[SPARK-3204][SQL] MaxOf would be foldable if both left and right are foldable.
Takuya UESHIN <ueshin@happy-camper.st>
2014-08-25 16:27:00 -0700
Commit: 19b01d6, github.com/apache/spark/pull/2116
Fixed a typo in docs/running-on-mesos.md
Cheng Lian <lian.cs.zju@gmail.com>
2014-08-25 14:56:51 -0700
Commit: 8d33a6d, github.com/apache/spark/pull/2119
[FIX] fix error message in sendMessageReliably
Xiangrui Meng <meng@databricks.com>
2014-08-25 14:55:20 -0700
Commit: d892062, github.com/apache/spark/pull/2120
SPARK-2798 [BUILD] Correct several small errors in Flume module pom.xml files
Sean Owen <sowen@cloudera.com>
2014-08-25 13:29:07 -0700
Commit: ff616fd, github.com/apache/spark/pull/1726
[SPARK-2495][MLLIB] make KMeans constructor public
Xiangrui Meng <meng@databricks.com>
2014-08-25 12:30:02 -0700
Commit: 69a17f1, github.com/apache/spark/pull/2112
[SPARK-2871] [PySpark] add zipWithIndex() and zipWithUniqueId()
Davies Liu <davies.liu@gmail.com>
2014-08-24 21:16:05 -0700
Commit: b82da3d, github.com/apache/spark/pull/2092
[MLlib][SPARK-2997] Update SVD documentation to reflect roughly square
Reza Zadeh <rizlar@gmail.com>
2014-08-24 17:35:54 -0700
Commit: 749bddc, github.com/apache/spark/pull/2070
[SPARK-2841][MLlib] Documentation for feature transformations
DB Tsai <dbtsai@alpinenow.com>
2014-08-24 17:33:33 -0700
Commit: a4db81a, github.com/apache/spark/pull/2068
[SPARK-3192] Some scripts have 2 space indentation but other scripts have 4 space indentation.
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-08-24 09:43:44 -0700
Commit: ce14cd1, github.com/apache/spark/pull/2104
[SPARK-2967][SQL] Follow-up: Also copy hash expressions in sort based shuffle fix.
Michael Armbrust <michael@databricks.com>
2014-08-23 16:21:08 -0700
Commit: e23f0bc, github.com/apache/spark/pull/2072
[SPARK-2554][SQL] CountDistinct partial aggregation and object allocation improvements
Michael Armbrust <michael@databricks.com>, Gregory Owen <greowen@gmail.com>
2014-08-23 16:19:10 -0700
Commit: 7112da8, github.com/apache/spark/pull/1935
[SQL] Make functionRegistry in HiveContext transient.
Yin Huai <huaiyin.thu@gmail.com>
2014-08-23 12:46:41 -0700
Commit: 9309786, github.com/apache/spark/pull/2074
[SPARK-2963] REGRESSION - The description about how to build for using CLI and Thrift JDBC server is absent in proper document -
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-08-22 22:28:05 -0700
Commit: 5689660, github.com/apache/spark/pull/2080
[SPARK-3169] Removed dependency on spark streaming test from spark flume sink
Tathagata Das <tathagata.das1565@gmail.com>
2014-08-22 21:34:48 -0700
Commit: cd73631, github.com/apache/spark/pull/2101
Revert "HOTFIX:Temporarily removing flume sink test in 1.1 branch"
Patrick Wendell <pwendell@gmail.com>
2014-08-22 21:31:52 -0700
Commit: 385c4f2
[SPARK-2840] [mllib] DecisionTree doc update (Java, Python examples)
Joseph K. Bradley <joseph.kurata.bradley@gmail.com>
2014-08-21 00:17:29 -0700
Commit: 1e5d9cb, github.com/apache/spark/pull/2063
BUILD: Bump Hadoop versions in the release build.
Patrick Wendell <pwendell@gmail.com>
2014-08-20 12:18:41 -0700
Commit: da0a701
HOTFIX:Temporarily removing flume sink test in 1.1 branch
Patrick Wendell <pwendell@gmail.com>
2014-08-20 22:24:22 -0700
Commit: 1d5e84a
[HOTFIX][STREAMING] Allow the JVM/Netty to decide which port to bind to in Flume Polling Tests.
Hari Shreedharan <harishreedharan@gmail.com>
2014-08-17 19:50:31 -0700
Commit: 4485665, github.com/apache/spark/pull/1820
[HOTFIX][Streaming] Handle port collisions in flume polling test
Andrew Or <andrewor14@gmail.com>
2014-08-06 16:34:53 -0700
Commit: 3f91e9d, github.com/apache/spark/pull/1803
[SPARK-2843][MLLIB] add a section about regularization parameter in ALS
Xiangrui Meng <meng@databricks.com>
2014-08-20 17:47:39 -0700
Commit: eba399b, github.com/apache/spark/pull/2064
[SPARK-3143][MLLIB] add tf-idf user guide
Xiangrui Meng <meng@databricks.com>
2014-08-20 17:41:36 -0700
Commit: 1af68ca, github.com/apache/spark/pull/2061
[SPARK-3140] Clarify confusing PySpark exception message
Andrew Or <andrewor14@gmail.com>
2014-08-20 17:07:39 -0700
Commit: f8bcb12, github.com/apache/spark/pull/2067
[SPARK-2298] Encode stage attempt in SparkListener & UI.
Reynold Xin <rxin@apache.org>
2014-08-20 15:37:27 -0700
Commit: dc05282, github.com/apache/spark/pull/1545
[SPARK-2169] Don't copy appName / basePath everywhere.
Marcelo Vanzin <vanzin@cloudera.com>
2014-08-18 13:25:30 -0700
Commit: 2c1683e, github.com/apache/spark/pull/1252
[SPARK-2846][SQL] Add configureInputJobPropertiesForStorageHandler to initialization of job conf
Alex Liu <alex_liu68@yahoo.com>
2014-08-20 16:14:06 -0700
Commit: 64e136a, github.com/apache/spark/pull/1927
SPARK_LOGFILE and SPARK_ROOT_LOGGER no longer need in spark-daemon.sh
wangfei <wangfei_hello@126.com>
2014-08-20 16:00:46 -0700
Commit: 5f72d7b, github.com/apache/spark/pull/2057
[SPARK-2967][SQL] Fix sort based shuffle for spark sql.
Michael Armbrust <michael@databricks.com>
2014-08-20 15:51:14 -0700
Commit: 311831d, github.com/apache/spark/pull/2066
[SPARK-2849] Handle driver configs separately in client mode
Andrew Or <andrewor14@gmail.com>
2014-08-20 15:01:47 -0700
Commit: beb705a, github.com/apache/spark/pull/1845
[SPARK-3149] Connection establishment information is not enough.
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-08-20 14:04:39 -0700
Commit: 25b01fd, github.com/apache/spark/pull/2060
[SPARK-3062] [SPARK-2970] [SQL] spark-sql script ends with IOException when EventLogging is enabled
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-08-20 13:26:11 -0700
Commit: 5095851, github.com/apache/spark/pull/1970
[SPARK-3126][SPARK-3127][SQL] Fixed HiveThriftServer2Suite
Cheng Lian <lian.cs.zju@gmail.com>
2014-08-20 12:57:39 -0700
Commit: 99ca704, github.com/apache/spark/pull/2036
SPARK-3092 [SQL]: Always include the thriftserver when -Phive is enabled.
Patrick Wendell <pwendell@gmail.com>
2014-08-20 12:13:31 -0700
Commit: ca7322d, github.com/apache/spark/pull/2006
[SPARK-3054][STREAMING] Add unit tests for Spark Sink.
Hari Shreedharan <hshreedharan@apache.org>, Hari Shreedharan <hshreedharan@cloudera.com>
2014-08-20 04:09:54 -0700
Commit: 9b29099, github.com/apache/spark/pull/1958
[SPARK-3141] [PySpark] fix sortByKey() with take()
Davies Liu <davies.liu@gmail.com>
2014-08-19 22:43:49 -0700
Commit: 5b22ebf, github.com/apache/spark/pull/2045
[DOCS] Fixed wrong links
Ken Takagiwa <ugw.gi.world@gmail.com>
2014-08-19 22:43:22 -0700
Commit: f8c908e, github.com/apache/spark/pull/2042
[SPARK-2974] [SPARK-2975] Fix two bugs related to spark.local.dirs
Josh Rosen <joshrosen@apache.org>
2014-08-19 22:42:50 -0700
Commit: 5d1a878, github.com/apache/spark/pull/2002
[SPARK-3142][MLLIB] output shuffle data directly in Word2Vec
Xiangrui Meng <meng@databricks.com>
2014-08-19 22:16:22 -0700
Commit: a5bc9c6, github.com/apache/spark/pull/2049
[SPARK-3119] Re-implementation of TorrentBroadcast.
Reynold Xin <rxin@apache.org>
2014-08-19 22:11:13 -0700
Commit: 08c9973, github.com/apache/spark/pull/2030
[HOTFIX][Streaming][MLlib] use temp folder for checkpoint
Xiangrui Meng <meng@databricks.com>
2014-08-19 22:05:29 -0700
Commit: d5db95b, github.com/apache/spark/pull/2046
[SPARK-3130][MLLIB] detect negative values in naive Bayes
Xiangrui Meng <meng@databricks.com>
2014-08-19 21:01:23 -0700
Commit: 148e45b, github.com/apache/spark/pull/2038
[SQL] add note of use synchronizedMap in SQLConf
wangfei <wangfei_hello@126.com>, scwf <wangfei1@huawei.com>
2014-08-19 19:37:02 -0700
Commit: 607735c, github.com/apache/spark/pull/1996
[SPARK-3112][MLLIB] Add documentation and example for StreamingLR
freeman <the.freeman.lab@gmail.com>
2014-08-19 18:07:42 -0700
Commit: d75464d, github.com/apache/spark/pull/2047
[MLLIB] minor update to word2vec
Xiangrui Meng <meng@databricks.com>
2014-08-19 17:41:37 -0700
Commit: 023ed7c, github.com/apache/spark/pull/2043
[SPARK-2468] Netty based block server / client module
Reynold Xin <rxin@apache.org>
2014-08-19 17:40:35 -0700
Commit: 66b4c81, github.com/apache/spark/pull/1971
[SPARK-3136][MLLIB] Create Java-friendly methods in RandomRDDs
Xiangrui Meng <meng@databricks.com>
2014-08-19 16:06:48 -0700
Commit: d371c71, github.com/apache/spark/pull/2041
[SPARK-2790] [PySpark] fix zip with serializers which have different batch sizes.
Davies Liu <davies.liu@gmail.com>
2014-08-19 14:46:32 -0700
Commit: 3540d4b, github.com/apache/spark/pull/1894
Move a bracket in validateSettings of SparkConf
hzw19900416 <carlmartinmax@gmail.com>
2014-08-19 14:04:49 -0700
Commit: f6b4ab8, github.com/apache/spark/pull/2012
SPARK-2333 - spark_ec2 script should allow option for existing security group
Vida Ha <vida@databricks.com>
2014-08-19 13:35:05 -0700
Commit: c3952b0, github.com/apache/spark/pull/1899
[SPARK-3128][MLLIB] Use streaming test suite for StreamingLR
freeman <the.freeman.lab@gmail.com>
2014-08-19 13:28:57 -0700
Commit: 04a3208, github.com/apache/spark/pull/2037
[SPARK-3089] Fix meaningless error message in ConnectionManager
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-08-19 10:15:11 -0700
Commit: 5d895ad, github.com/apache/spark/pull/2000
[SPARK-3072] YARN - Exit when reach max number failed executors
Thomas Graves <tgraves@apache.org>
2014-08-19 09:40:31 -0500
Commit: 1418893, github.com/apache/spark/pull/2022
Fix typo in decision tree docs
Matt Forbes <matt@tellapart.com>
2014-08-18 21:43:32 -0700
Commit: f3b0f34, github.com/apache/spark/pull/1837
[SPARK-3116] Remove the excessive lockings in TorrentBroadcast
Reynold Xin <rxin@apache.org>
2014-08-18 20:51:41 -0700
Commit: b6d8e66, github.com/apache/spark/pull/2028
[SPARK-3114] [PySpark] Fix Python UDFs in Spark SQL.
Josh Rosen <joshrosen@apache.org>, Davies Liu <davies.liu@gmail.com>
2014-08-18 20:42:19 -0700
Commit: 3a03259, github.com/apache/spark/pull/2026.
[SPARK-3108][MLLIB] add predictOnValues to StreamingLR and fix predictOn
Xiangrui Meng <meng@databricks.com>
2014-08-18 18:20:54 -0700
Commit: 7d069bf, github.com/apache/spark/pull/2023
[SPARK-2850] [SPARK-2626] [mllib] MLlib stats examples + small fixes
Joseph K. Bradley <joseph.kurata.bradley@gmail.com>
2014-08-18 18:01:39 -0700
Commit: e3f89e9, github.com/apache/spark/pull/1878
[mllib] DecisionTree: treeAggregate + Python example bug fix
Joseph K. Bradley <joseph.kurata.bradley@gmail.com>
2014-08-18 14:40:05 -0700
Commit: 98778ff, github.com/apache/spark/pull/2015
[SPARK-2718] [yarn] Handle quotes and other characters in user args.
Marcelo Vanzin <vanzin@cloudera.com>
2014-08-18 14:10:10 -0700
Commit: 25cabd7, github.com/apache/spark/pull/1724
[SPARK-3103] [PySpark] fix saveAsTextFile() with utf-8
Davies Liu <davies.liu@gmail.com>
2014-08-18 13:58:35 -0700
Commit: e083334, github.com/apache/spark/pull/2018
[SPARK-2406][SQL] Initial support for using ParquetTableScan to read HiveMetaStore tables.
Michael Armbrust <michael@databricks.com>, Yin Huai <huai@cse.ohio-state.edu>
2014-08-18 13:17:10 -0700
Commit: cc4015d, github.com/apache/spark/pull/1819
[SPARK-3091] [SQL] Add support for caching metadata on Parquet files
Matei Zaharia <matei@databricks.com>
2014-08-18 11:00:10 -0700
Commit: 2ae2857, github.com/apache/spark/pull/2005
SPARK-3025 [SQL]: Allow JDBC clients to set a fair scheduler pool
Patrick Wendell <pwendell@gmail.com>
2014-08-18 10:52:20 -0700
Commit: 496f62d, github.com/apache/spark/pull/1937
[SPARK-3085] [SQL] Use compact data structures in SQL joins
Matei Zaharia <matei@databricks.com>
2014-08-18 10:45:24 -0700
Commit: 4da76fc, github.com/apache/spark/pull/1993
[SPARK-3084] [SQL] Collect broadcasted tables in parallel in joins
Matei Zaharia <matei@databricks.com>
2014-08-18 10:05:52 -0700
Commit: 55e9dd6, github.com/apache/spark/pull/1990
SPARK-3096: Include parquet hive serde by default in build
Patrick Wendell <pwendell@gmail.com>
2014-08-18 10:00:46 -0700
Commit: ec0b91e, github.com/apache/spark/pull/2009
[SPARK-2862] histogram method fails on some choices of bucketCount
Chandan Kumar <chandan.kumar@imaginea.com>
2014-08-18 09:52:25 -0700
Commit: 12f16ba, github.com/apache/spark/pull/1787
[MLlib] Remove transform(dataset: RDD[String]) from Word2Vec public API
Liquan Pei <liquanpei@gmail.com>
2014-08-18 01:15:45 -0700
Commit: e0bc333, github.com/apache/spark/pull/2010
[SPARK-2842][MLlib]Word2Vec documentation
Liquan Pei <liquanpei@gmail.com>
2014-08-17 23:30:47 -0700
Commit: 518258f, github.com/apache/spark/pull/2003
[SPARK-3097][MLlib] Word2Vec performance improvement
Liquan Pei <liquanpei@gmail.com>
2014-08-17 23:29:44 -0700
Commit: 708cde9, github.com/apache/spark/pull/1932
SPARK-2900. aggregate inputBytes per stage
Sandy Ryza <sandy@cloudera.com>
2014-08-17 22:39:06 -0700
Commit: 0506539, github.com/apache/spark/pull/1826
SPARK-2884: Create binary builds in parallel with release script.
Patrick Wendell <pwendell@gmail.com>
2014-08-17 22:29:58 -0700
Commit: a5ae720
[SPARK-3087][MLLIB] fix col indexing bug in chi-square and add a check for number of distinct values
Xiangrui Meng <meng@databricks.com>
2014-08-17 20:53:18 -0700
Commit: 8438daf, github.com/apache/spark/pull/1997
[SPARK-1981] updated streaming-kinesis.md
Chris Fregly <chris@fregly.com>
2014-08-17 19:33:15 -0700
Commit: 8263567, github.com/apache/spark/pull/1757
[SQL] Improve debug logging and toStrings.
Michael Armbrust <michael@databricks.com>
2014-08-17 19:00:38 -0700
Commit: 4f776df, github.com/apache/spark/pull/2004
Revert "[SPARK-2970] [SQL] spark-sql script ends with IOException when EventLogging is enabled"
Michael Armbrust <michael@databricks.com>
2014-08-17 18:10:45 -0700
Commit: c6a0091, github.com/apache/spark/pull/2007
SPARK-2881: Upgrade to Snappy 1.0.5.3 to avoid SPARK-2881.
Patrick Wendell <pwendell@gmail.com>
2014-08-17 15:48:39 -0700
Commit: d411f41, github.com/apache/spark/pull/1999
[SPARK-3042] [mllib] DecisionTree Filter top-down instead of bottom-up
Joseph K. Bradley <joseph.kurata.bradley@gmail.com>
2014-08-16 23:53:14 -0700
Commit: 91af120, github.com/apache/spark/pull/1975
[SPARK-3077][MLLIB] fix some chisq-test
Xiangrui Meng <meng@databricks.com>
2014-08-16 21:16:27 -0700
Commit: 413a329, github.com/apache/spark/pull/1982
In the stop method of ConnectionManager to cancel the ackTimeoutMonitor
GuoQiang Li <witgo@qq.com>
2014-08-16 20:05:55 -0700
Commit: f02e327, github.com/apache/spark/pull/1989
[SPARK-1065] [PySpark] improve supporting for large broadcast
Davies Liu <davies.liu@gmail.com>
2014-08-16 16:59:34 -0700
Commit: 5dd571c, github.com/apache/spark/pull/1912
[SPARK-3035] Wrong example with SparkContext.addFile
iAmGhost <kdh7807@gmail.com>
2014-08-16 16:48:38 -0700
Commit: 721f2fd, github.com/apache/spark/pull/1942
[SPARK-3081][MLLIB] rename RandomRDDGenerators to RandomRDDs
Xiangrui Meng <meng@databricks.com>
2014-08-16 15:14:43 -0700
Commit: a12d3ae, github.com/apache/spark/pull/1979
[SPARK-3048][MLLIB] add LabeledPoint.parse and remove loadStreamingLabeledPoints
Xiangrui Meng <meng@databricks.com>
2014-08-16 15:13:34 -0700
Commit: 0b354be, github.com/apache/spark/pull/1952
[SPARK-2677] BasicBlockFetchIterator#next can wait forever
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-08-16 14:15:58 -0700
Commit: bd3ce2f, github.com/apache/spark/pull/1632
[SQL] Using safe floating-point numbers in doctest
Cheng Lian <lian.cs.zju@gmail.com>
2014-08-16 11:26:51 -0700
Commit: 8c79574, github.com/apache/spark/pull/1925
[SPARK-2977] Ensure ShuffleManager is created before ShuffleBlockManager
Josh Rosen <joshrosen@apache.org>
2014-08-16 00:04:55 -0700
Commit: 0e0ec2e, github.com/apache/spark/pull/1976
[SPARK-3045] Make Serializer interface Java friendly
Reynold Xin <rxin@apache.org>
2014-08-15 23:12:34 -0700
Commit: fcf30cd, github.com/apache/spark/pull/1948
[SPARK-3015] Block on cleaning tasks to prevent Akka timeouts
Andrew Or <andrewor14@gmail.com>
2014-08-15 22:55:32 -0700
Commit: 2541537, github.com/apache/spark/pull/1931
[SPARK-3001][MLLIB] Improve Spearman's correlation
Xiangrui Meng <meng@databricks.com>
2014-08-15 21:07:55 -0700
Commit: ce06d7f, github.com/apache/spark/pull/1917
[SPARK-3078][MLLIB] Make LRWithLBFGS API consistent with others
Xiangrui Meng <meng@databricks.com>
2014-08-15 21:04:29 -0700
Commit: c085011, github.com/apache/spark/pull/1973
[SPARK-3046] use executor's class loader as the default serializer classloader
Reynold Xin <rxin@apache.org>
2014-08-15 17:04:15 -0700
Commit: 077213b, github.com/apache/spark/pull/1972
[SPARK-3022] [SPARK-3041] [mllib] Call findBins once per level + unordered feature bug fix
Joseph K. Bradley <joseph.kurata.bradley@gmail.com>
2014-08-15 14:50:10 -0700
Commit: 407ea9f, github.com/apache/spark/pull/1950
SPARK-3028. sparkEventToJson should support SparkListenerExecutorMetrics...
Sandy Ryza <sandy@cloudera.com>
2014-08-15 11:35:08 -0700
Commit: 63376a0, github.com/apache/spark/pull/1961
Revert "[SPARK-2468] Netty based block server / client module"
Patrick Wendell <pwendell@gmail.com>
2014-08-15 09:01:35 -0700
Commit: b066af4
[SPARK-2924] remove default args to overloaded methods
Anand Avati <avati@redhat.com>
2014-08-15 08:53:52 -0700
Commit: debb3e3, github.com/apache/spark/pull/1704
[SPARK-2468] Netty based block server / client module
Reynold Xin <rxin@apache.org>
2014-08-14 19:01:33 -0700
Commit: 3f23d2a, github.com/apache/spark/pull/1907
[SPARK-2936] Migrate Netty network module from Java to Scala
Reynold Xin <rxin@apache.org>
2014-08-10 20:36:54 -0700
Commit: d3cce58, github.com/apache/spark/pull/1865
[SPARK-2736] PySpark converter and example script for reading Avro files
Kan Zhang <kzhang@apache.org>
2014-08-14 19:03:51 -0700
Commit: 72e730e, github.com/apache/spark/pull/1916
[SPARK-3027] TaskContext: tighten visibility and provide Java friendly callback API
Reynold Xin <rxin@apache.org>
2014-08-14 18:37:02 -0700
Commit: f99e4fc, github.com/apache/spark/pull/1938
Make dev/mima runnable on Mac OS X.
Reynold Xin <rxin@apache.org>
2014-08-14 16:27:11 -0700
Commit: 475a35b, github.com/apache/spark/pull/1953
SPARK-3009: Reverted readObject method in ApplicationInfo so that Applic...
Jacek Lewandowski <lewandowski.jacek@gmail.com>
2014-08-14 15:01:39 -0700
Commit: f5d9176, github.com/apache/spark/pull/1947
Revert [SPARK-3011][SQL] _temporary directory should be filtered out by sqlContext.parquetFile
Michael Armbrust <michael@databricks.com>
2014-08-14 13:00:21 -0700
Commit: c39a3f3, github.com/apache/spark/pull/1949
[SPARK-2979][MLlib] Improve the convergence rate by minimizing the condition number
DB Tsai <dbtsai@alpinenow.com>
2014-08-14 11:56:13 -0700
Commit: dc8ef93, github.com/apache/spark/pull/1897
Minor cleanup of metrics.Source
Reynold Xin <rxin@apache.org>
2014-08-14 11:22:41 -0700
Commit: a3dc54f, github.com/apache/spark/pull/1943
[SPARK-2925] [sql]fix spark-sql and start-thriftserver shell bugs when set --driver-java-options
wangfei <wangfei_hello@126.com>, wangfei <wangfei1@huawei.com>
2014-08-14 10:55:51 -0700
Commit: df25acd, github.com/apache/spark/pull/1851
[SQL] Python JsonRDD UTF8 Encoding Fix
Ahir Reddy <ahirreddy@gmail.com>
2014-08-14 10:48:52 -0700
Commit: 850abaa, github.com/apache/spark/pull/1914
[SPARK-2927][SQL] Add a conf to configure if we always read Binary columns stored in Parquet as String columns
Yin Huai <huai@cse.ohio-state.edu>
2014-08-14 10:46:33 -0700
Commit: de501e1, github.com/apache/spark/pull/1855
[SPARK-3011][SQL] _temporary directory should be filtered out by sqlContext.parquetFile
Chia-Yung Su <chiayung@appier.com>
2014-08-14 10:43:08 -0700
Commit: 221c84e, github.com/apache/spark/pull/1924
SPARK-2893: Do not swallow Exceptions when running a custom kryo registrator
Graham Dennis <graham.dennis@gmail.com>
2014-08-14 02:24:18 -0700
Commit: af809de, github.com/apache/spark/pull/1827
[SPARK-3029] Disable local execution of Spark jobs by default
Aaron Davidson <aaron@databricks.com>
2014-08-14 01:37:38 -0700
Commit: 0cb2b82, github.com/apache/spark/pull/1321
[SPARK-2995][MLLIB] add ALS.setIntermediateRDDStorageLevel
Xiangrui Meng <meng@databricks.com>
2014-08-13 23:53:44 -0700
Commit: 1baf06f, github.com/apache/spark/pull/1913
[Docs] Add missing <code> tags (minor)
Andrew Or <andrewor14@gmail.com>
2014-08-13 23:24:23 -0700
Commit: bf7c6e1, github.com/apache/spark/pull/1936
[SPARK-3006] Failed to execute spark-shell in Windows OS
Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
2014-08-13 22:17:07 -0700
Commit: dcd99c3, github.com/apache/spark/pull/1918
SPARK-3020: Print completed indices rather than tasks in web UI
Patrick Wendell <pwendell@gmail.com>
2014-08-13 18:08:38 -0700
Commit: c6cb55a, github.com/apache/spark/pull/1933
[SPARK-2986] [SQL] fixed: setting properties does not effect
guowei <guowei@upyoo.com>
2014-08-13 17:45:24 -0700
Commit: a8d2649, github.com/apache/spark/pull/1904
[SPARK-2970] [SQL] spark-sql script ends with IOException when EventLogging is enabled
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-08-13 17:42:38 -0700
Commit: b5b632c, github.com/apache/spark/pull/1891
[SPARK-2935][SQL]Fix parquet predicate push down bug
Michael Armbrust <michael@databricks.com>
2014-08-13 17:40:59 -0700
Commit: e8e7f17, github.com/apache/spark/pull/1863
[SPARK-2650][SQL] More precise initial buffer size estimation for in-memory column buffer
Cheng Lian <lian.cs.zju@gmail.com>
2014-08-13 17:37:55 -0700
Commit: ee7d2cc, github.com/apache/spark/pull/1901
[SPARK-2994][SQL] Support for udfs that take complex types
Michael Armbrust <michael@databricks.com>
2014-08-13 17:35:38 -0700
Commit: 71b8408, github.com/apache/spark/pull/1915
[SPARK-2817] [SQL] add "show create table" support
tianyi <tianyi@asiainfo-linkage.com>, tianyi <tianyi@asiainfo.com>, tianyi <tianyi.asiainfo@gmail.com>
2014-08-13 16:50:02 -0700
Commit: 0fb1198, github.com/apache/spark/pull/1760
[SPARK-3004][SQL] Added null checking when retrieving row set
Cheng Lian <lian.cs.zju@gmail.com>
2014-08-13 16:27:50 -0700
Commit: 8732375, github.com/apache/spark/pull/1920
[MLLIB] use Iterator.fill instead of Array.fill
Xiangrui Meng <meng@databricks.com>
2014-08-13 16:20:49 -0700
Commit: e63bf87, github.com/apache/spark/pull/1930
[SPARK-2983] [PySpark] improve performance of sortByKey()
Davies Liu <davies.liu@gmail.com>
2014-08-13 14:57:12 -0700
Commit: a7bc21c, github.com/apache/spark/pull/1898
[SPARK-3013] [SQL] [PySpark] convert array into list
Davies Liu <davies.liu@gmail.com>
2014-08-13 14:56:11 -0700
Commit: 9936020, github.com/apache/spark/pull/1928
[SPARK-2963] [SQL] There no documentation about building to use HiveServer and CLI for SparkSQL
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-08-13 14:42:57 -0700
Commit: 78f2f99, github.com/apache/spark/pull/1885
[SPARK-2993] [MLLib] colStats (wrapper around MultivariateStatisticalSummary) in Statistics
Doris Xin <doris.s.xin@gmail.com>
2014-08-12 23:47:42 -0700
Commit: 5ebeb3f, github.com/apache/spark/pull/1911
[SPARK-1777 (partial)] bugfix: make size of requested memory correctly
Zhang, Liye <liye.zhang@intel.com>
2014-08-12 23:43:36 -0700
Commit: ec5e2b0, github.com/apache/spark/pull/1892
Use transferTo when copy merge files in ExternalSorter
Raymond Liu <raymond.liu@intel.com>
2014-08-12 23:19:35 -0700
Commit: be674b3, github.com/apache/spark/pull/1884
[SPARK-2953] Allow using short names for io compression codecs
Reynold Xin <rxin@apache.org>
2014-08-12 22:50:29 -0700
Commit: 837bf60, github.com/apache/spark/pull/1873
SPARK-2830 [MLlib]: re-organize mllib documentation
Ameet Talwalkar <atalwalkar@gmail.com>
2014-08-12 17:15:21 -0700
Commit: cffd9bb, github.com/apache/spark/pull/1908
fix flaky tests
Davies Liu <davies.liu@gmail.com>
2014-08-12 16:26:01 -0700
Commit: b5f8083, github.com/apache/spark/pull/1910
[MLlib] Correctly set vectorSize and alpha
Liquan Pei <liquanpei@gmail.com>
2014-08-12 00:28:00 -0700
Commit: 2a8117a, github.com/apache/spark/pull/1900
[SPARK-2923][MLLIB] Implement some basic BLAS routines
Xiangrui Meng <meng@databricks.com>
2014-08-11 22:33:45 -0700
Commit: 872c170, github.com/apache/spark/pull/1849
[SQL] [SPARK-2826] Reduce the memory copy while building the hashmap for HashOuterJoin
Cheng Hao <hao.cheng@intel.com>
2014-08-11 20:45:14 -0700
Commit: f66f260, github.com/apache/spark/pull/1765
[SPARK-2650][SQL] Build column buffers in smaller batches
Michael Armbrust <michael@databricks.com>
2014-08-11 20:21:56 -0700
Commit: 779d1eb, github.com/apache/spark/pull/1880
[SPARK-2968][SQL] Fix nullabilities of Explode.
Takuya UESHIN <ueshin@happy-camper.st>
2014-08-11 20:18:03 -0700
Commit: 54b387f, github.com/apache/spark/pull/1888
[SPARK-2965][SQL] Fix HashOuterJoin output nullabilities.
Takuya UESHIN <ueshin@happy-camper.st>
2014-08-11 20:15:01 -0700
Commit: dcbf079, github.com/apache/spark/pull/1887
[SQL] A tiny refactoring in HiveContext#analyze
Yin Huai <huaiyin.thu@gmail.com>
2014-08-11 20:11:29 -0700
Commit: fd8173f, github.com/apache/spark/pull/1881
[sql]use SparkSQLEnv.stop() in ShutdownHook
wangfei <wangfei1@huawei.com>
2014-08-11 20:10:13 -0700
Commit: 6d0af52, github.com/apache/spark/pull/1852
[SPARK-2590][SQL] Added option to handle incremental collection, disabled by default
Cheng Lian <lian.cs.zju@gmail.com>
2014-08-11 20:08:06 -0700
Commit: cf2f807, github.com/apache/spark/pull/1853
[SPARK-2844][SQL] Correctly set JVM HiveContext if it is passed into Python HiveContext constructor
Ahir Reddy <ahirreddy@gmail.com>
2014-08-11 20:06:06 -0700
Commit: 8cb4e5b, github.com/apache/spark/pull/1768
[SPARK-2934][MLlib] Adding LogisticRegressionWithLBFGS Interface
DB Tsai <dbtsai@alpinenow.com>
2014-08-11 19:49:29 -0700
Commit: 8f6e2e9, github.com/apache/spark/pull/1862
[SPARK-2515][mllib] Chi Squared test
Doris Xin <doris.s.xin@gmail.com>
2014-08-11 19:22:14 -0700
Commit: 7e31f7c, github.com/apache/spark/pull/1733
[SPARK-2931] In TaskSetManager, reset currentLocalityIndex after recomputing locality levels
Josh Rosen <joshrosen@apache.org>
2014-08-11 19:15:01 -0700
Commit: 6c64d57, github.com/apache/spark/pull/1896
[SPARK-2952] Enable logging actor messages at DEBUG level
Reynold Xin <rxin@apache.org>
2014-08-11 15:25:21 -0700
Commit: 6ec1374, github.com/apache/spark/pull/1870
[PySpark] [SPARK-2954] [SPARK-2948] [SPARK-2910] [SPARK-2101] Python 2.6 Fixes
Josh Rosen <joshrosen@apache.org>
2014-08-11 11:54:09 -0700
Commit: 09b8a3c, github.com/apache/spark/pull/1868.
[SPARK-2937] Separate out samplyByKeyExact as its own API in PairRDDFunction
Doris Xin <doris.s.xin@gmail.com>, Xiangrui Meng <meng@databricks.com>
2014-08-10 16:31:07 -0700
Commit: 3def842, github.com/apache/spark/pull/1866
[SPARK-2898] [PySpark] fix bugs in deamon.py
Davies Liu <davies.liu@gmail.com>
2014-08-10 13:00:38 -0700
Commit: 92daffe, github.com/apache/spark/pull/1842
Remove extra semicolon in Task.scala
GuoQiang Li <witgo@qq.com>
2014-08-10 12:12:22 -0700
Commit: bb23b11, github.com/apache/spark/pull/1876
Turn UpdateBlockInfo into case class.
Reynold Xin <rxin@apache.org>
2014-08-09 23:06:54 -0700
Commit: 076ddda, github.com/apache/spark/pull/1872
Updated Spark SQL README to include the hive-thriftserver module
Reynold Xin <rxin@apache.org>
2014-08-09 22:05:36 -0700
Commit: e8f8e5f, github.com/apache/spark/pull/1867
[SPARK-2894] spark-shell doesn't accept flags
Kousuke Saruta <sarutak@oss.nttdata.co.jp>, Cheng Lian <lian.cs.zju@gmail.com>
2014-08-09 21:10:43 -0700
Commit: ba223b8, github.com/apache/spark/pull/1715,
[SPARK-1766] sorted functions to meet pedantic requirements
Chris Cope <ccope@resilientscience.com>
2014-08-09 20:58:56 -0700
Commit: 4a7f3ef, github.com/apache/spark/pull/1859
[SPARK-2861] Fix Doc comment of histogram method
Chandan Kumar <chandan.kumar@imaginea.com>
2014-08-09 00:45:54 -0700
Commit: 71fcd2e, github.com/apache/spark/pull/1786
[SPARK-2635] Fix race condition at SchedulerBackend.isReady in standalone mode
li-zhihui <zhihui.li@intel.com>, Li Zhihui <zhihui.li@intel.com>
2014-08-08 22:52:56 -0700
Commit: 3311da2, github.com/apache/spark/pull/1525
[SPARK-2897][SPARK-2920]TorrentBroadcast does use the serializer class specified in the spark option "spark.serializer"
GuoQiang Li <witgo@qq.com>
2014-08-08 16:57:26 -0700
Commit: dd11e4e, github.com/apache/spark/pull/1836
[SPARK-1997][MLLIB] update breeze to 0.9
Xiangrui Meng <meng@databricks.com>
2014-08-08 15:07:31 -0700
Commit: 8fba6de, github.com/apache/spark/pull/1749
[SPARK-2700] [SQL] Hidden files (such as .impala_insert_staging) should be filtered out by sqlContext.parquetFile
chutium <teng.qiu@gmail.com>
2014-08-08 13:31:08 -0700
Commit: e264503, github.com/apache/spark/pull/1691
[SPARK-2919] [SQL] Basic support for analyze command in HiveQl
Yin Huai <huai@cse.ohio-state.edu>
2014-08-08 11:23:58 -0700
Commit: daa090f, github.com/apache/spark/pull/1848
[SPARK-2877] [SQL] MetastoreRelation should use SparkClassLoader when creating the tableDesc
Yin Huai <huai@cse.ohio-state.edu>
2014-08-08 11:15:16 -0700
Commit: 8b0188b, github.com/apache/spark/pull/1806
[SPARK-2908] [SQL] JsonRDD.nullTypeToStringType does not convert all NullType to StringType
Yin Huai <huai@cse.ohio-state.edu>
2014-08-08 11:10:11 -0700
Commit: 544a909, github.com/apache/spark/pull/1840
[SPARK-2888] [SQL] Fix addColumnMetadataToConf in HiveTableScan
Yin Huai <huai@cse.ohio-state.edu>
2014-08-08 11:01:51 -0700
Commit: 3eb5dd0, github.com/apache/spark/pull/1817
[SPARK-2904] Remove non-used local variable in SparkSubmitArguments
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-08-07 18:53:15 -0700
Commit: a54b5d9, github.com/apache/spark/pull/1834
SPARK-2565. Update ShuffleReadMetrics as blocks are fetched
Sandy Ryza <sandy@cloudera.com>
2014-08-07 18:09:03 -0700
Commit: aab7735, github.com/apache/spark/pull/1507
SPARK-2787: Make sort-based shuffle write files directly when there's no sorting/aggregation and # partitions is small
Matei Zaharia <matei@databricks.com>
2014-08-07 18:04:49 -0700
Commit: 0f2274f, github.com/apache/spark/pull/1799
SPARK-2899 Doc generation is back to working in new SBT Build.
Prashant Sharma <prashant.s@imaginea.com>
2014-08-07 16:24:22 -0700
Commit: 30369b8, github.com/apache/spark/pull/1830
SPARK-2905 Fixed path sbin => bin
Oleg Danilov <oleg.danilov@wandisco.com>
2014-08-07 15:48:44 -0700
Commit: c65c810, github.com/apache/spark/pull/1835
[SPARK-2852][MLLIB] Separate model from IDF/StandardScaler algorithms
Xiangrui Meng <meng@databricks.com>
2014-08-07 11:28:12 -0700
Commit: f705c1d, github.com/apache/spark/pull/1814
[mllib] DecisionTree Strategy parameter checks
Joseph K. Bradley <joseph.kurata.bradley@gmail.com>
2014-08-07 00:20:38 -0700
Commit: c089429, github.com/apache/spark/pull/1821
SPARK-2879 part 2 [BUILD] Use HTTPS to access Maven Central and other repos
Sean Owen <srowen@gmail.com>
2014-08-07 00:04:18 -0700
Commit: d6cd6fd0, github.com/apache/spark/pull/1828
[SPARK-2851] [mllib] DecisionTree Python consistency update
Joseph K. Bradley <joseph.kurata.bradley@gmail.com>
2014-08-06 22:58:59 -0700
Commit: c9f0944, github.com/apache/spark/pull/1798
[SPARK-2887] fix bug of countApproxDistinct() when have more than one partition
Davies Liu <davies.liu@gmail.com>
2014-08-06 21:22:13 -0700
Commit: cc8a7e9, github.com/apache/spark/pull/1812
Updating versions for Spark 1.1.0
Patrick Wendell <pwendell@gmail.com>
2014-08-06 19:11:39 -0700
Commit: cf35b56
HOTFIX: Support custom Java 7 location
Patrick Wendell <pwendell@gmail.com>
2014-08-06 18:45:03 -0700
Commit: 53fa048
SPARK-2879 [BUILD] Use HTTPS to access Maven Central and other repos
Sean Owen <srowen@gmail.com>
2014-08-06 18:13:35 -0700
Commit: 40284a9, github.com/apache/spark/pull/1805
[SPARK-2583] ConnectionManager error reporting
Kousuke Saruta <sarutak@oss.nttdata.co.jp>, Josh Rosen <joshrosen@apache.org>
2014-08-06 17:27:55 -0700
Commit: 3f92ce4, github.com/apache/spark/pull/1758
SPARK-2882: Spark build now checks local maven cache for dependencies
Gregory Owen <greowen@gmail.com>
2014-08-06 16:52:00 -0700
Commit: c2ae0b0, github.com/apache/spark/pull/1818
[PySpark] Add blanklines to Python docstrings so example code renders correctly
RJ Nowling <rnowling@gmail.com>
2014-08-06 14:12:21 -0700
Commit: a314e29, github.com/apache/spark/pull/1808
[SPARK-2852][MLLIB] API consistency for `mllib.feature`
Xiangrui Meng <meng@databricks.com>
2014-08-06 14:07:51 -0700
Commit: e654cfd, github.com/apache/spark/pull/1807
SPARK-2566. Update ShuffleWriteMetrics incrementally
Sandy Ryza <sandy@cloudera.com>
2014-08-06 13:10:33 -0700
Commit: a65c9ac, github.com/apache/spark/pull/1481
[SPARK-2627] [PySpark] have the build enforce PEP 8 automatically
Nicholas Chammas <nicholas.chammas@gmail.com>, nchammas <nicholas.chammas@gmail.com>
2014-08-06 12:58:24 -0700
Commit: 4c19614, github.com/apache/spark/pull/1744
[SPARK-2678][Core][SQL] A workaround for SPARK-2678
Cheng Lian <lian.cs.zju@gmail.com>
2014-08-06 12:28:35 -0700
Commit: cf8e7fd, github.com/apache/spark/pull/1801
[SPARK-2875] [PySpark] [SQL] handle null in schemaRDD()
Davies Liu <davies.liu@gmail.com>
2014-08-06 11:08:12 -0700
Commit: 27a8d4c, github.com/apache/spark/pull/1802
[SPARK-2157] Enable tight firewall rules for Spark
Andrew Or <andrewor14@gmail.com>, Andrew Ash <andrew@andrewash.com>
2014-08-06 00:07:40 -0700
Commit: 31090e4, github.com/apache/spark/pull/1777
[SPARK-1022][Streaming][HOTFIX] Fixed zookeeper dependency of Kafka
Tathagata Das <tathagata.das1565@gmail.com>
2014-08-05 23:41:34 -0700
Commit: 5b4bc84, github.com/apache/spark/pull/1797
[MLlib] Use this.type as return type in k-means' builder pattern
DB Tsai <dbtsai@alpinenow.com>
2014-08-05 23:32:29 -0700
Commit: aec217a, github.com/apache/spark/pull/1796
SPARK-2294: fix locality inversion bug in TaskManager
CodingCat <zhunansjtu@gmail.com>
2014-08-05 23:02:58 -0700
Commit: 1da2fdf, github.com/apache/spark/pull/1313
[SQL] Fix logging warn -> debug
Michael Armbrust <michael@databricks.com>
2014-08-05 22:30:32 -0700
Commit: 0482055, github.com/apache/spark/pull/1800
[SQL] Tighten the visibility of various SQLConf methods and renamed setter/getters
Reynold Xin <rxin@apache.org>
2014-08-05 22:29:19 -0700
Commit: 4f0b4f4, github.com/apache/spark/pull/1794
[SPARK-2806] core - upgrade to json4s-jackson 3.2.10
Anand Avati <avati@redhat.com>
2014-08-05 21:59:10 -0700
Commit: 6da8f41, github.com/apache/spark/pull/1702
[SPARK-2866][SQL] Support attributes in ORDER BY that aren't in SELECT
Michael Armbrust <michael@databricks.com>
2014-08-05 20:55:02 -0700
Commit: 936f61e, github.com/apache/spark/pull/1795
[SPARK-2854][SQL] Finalize _acceptable_types in pyspark.sql
Yin Huai <huai@cse.ohio-state.edu>
2014-08-05 18:56:10 -0700
Commit: a10e1b0, github.com/apache/spark/pull/1793
[SPARK-2650][SQL] Try to partially fix SPARK-2650 by adjusting initial buffer size and reducing memory allocation
Cheng Lian <lian.cs.zju@gmail.com>
2014-08-05 18:50:37 -0700
Commit: 4233b02, github.com/apache/spark/pull/1769
[sql] rename project name in pom.xml of hive-thriftserver module
wangfei <wangfei1@huawei.com>
2014-08-05 18:30:02 -0700
Commit: 152e36c, github.com/apache/spark/pull/1789
SPARK-2869 - Fix tiny bug in JdbcRdd for closing jdbc connection
Stephen Boesch <javadba>, Stephen Boesch <javadba@gmail.com>
2014-08-05 18:18:08 -0700
Commit: 58247a8, github.com/apache/spark/pull/1792
[SPARK-2550][MLLIB][APACHE SPARK] Support regularization and intercept in pyspark's linear methods
Michael Giannakopoulos <miccagiann@gmail.com>
2014-08-05 16:30:32 -0700
Commit: 672904e, github.com/apache/spark/pull/1775
[SPARK-2503] Lower shuffle output buffer (spark.shuffle.file.buffer.kb) to 32KB.
Reynold Xin <rxin@apache.org>
2014-08-05 16:24:50 -0700
Commit: 0172277, github.com/apache/spark/pull/1781
[SPARK-2856] Decrease initial buffer size for Kryo to 64KB.
Reynold Xin <rxin@apache.org>
2014-08-05 01:30:46 -0700
Commit: 5081b0d, github.com/apache/spark/pull/1780
[SPARK-2864][MLLIB] fix random seed in word2vec; move model to local
Xiangrui Meng <meng@databricks.com>
2014-08-05 16:22:41 -0700
Commit: e77fa81, github.com/apache/spark/pull/1790
SPARK-1680: use configs for specifying environment variables on YARN
Thomas Graves <tgraves@apache.org>
2014-08-05 15:57:32 -0500
Commit: 7b798e1, github.com/apache/spark/pull/1512
SPARK-2380: Support displaying accumulator values in the web UI
Patrick Wendell <pwendell@gmail.com>
2014-08-05 13:08:23 -0700
Commit: 46b6983, github.com/apache/spark/pull/1309
[SPARK-2859] Update url of Kryo project in related docs
Guancheng (G.C.) Chen <chenguancheng@gmail.com>
2014-08-05 11:50:08 -0700
Commit: 0f541ab, github.com/apache/spark/pull/1782
[SPARK-2860][SQL] Fix coercion of CASE WHEN.
Michael Armbrust <michael@databricks.com>
2014-08-05 11:17:50 -0700
Commit: 388ab53, github.com/apache/spark/pull/1785
SPARK-1890 and SPARK-1891- add admin and modify acls
Thomas Graves <tgraves@apache.org>
2014-08-05 12:52:52 -0500
Commit: e3fe657, github.com/apache/spark/pull/1196
SPARK-1528 - spark on yarn, add support for accessing remote HDFS
Thomas Graves <tgraves@apache.org>
2014-08-05 12:48:26 -0500
Commit: 6c0c65f, github.com/apache/spark/pull/1159
[SPARK-1022][Streaming] Add Kafka real unit test
jerryshao <saisai.shao@intel.com>
2014-08-05 10:40:28 -0700
Commit: b92a450, github.com/apache/spark/pull/1751
[SPARK-1779] Throw an exception if memory fractions are not between 0 and 1
wangfei <scnbwf@yeah.net>, wangfei <wangfei1@huawei.com>
2014-08-05 00:51:07 -0700
Commit: 075ba67, github.com/apache/spark/pull/714
[SPARK-2857] Correct properties to set Master / Worker ports
Andrew Or <andrewor14@gmail.com>
2014-08-05 00:39:07 -0700
Commit: 12f99cf, github.com/apache/spark/pull/1779
SPARK-2711. Create a ShuffleMemoryManager to track memory for all spilling collections
Matei Zaharia <matei@databricks.com>
2014-08-04 23:41:03 -0700
Commit: d13d253, github.com/apache/spark/pull/1707
SPARK-2685. Update ExternalAppendOnlyMap to avoid buffer.remove()
Matei Zaharia <matei@databricks.com>
2014-08-04 23:27:53 -0700
Commit: a092285, github.com/apache/spark/pull/1773
[SPARK-2323] Exception in accumulator update should not crash DAGScheduler & SparkContext
Reynold Xin <rxin@apache.org>
2014-08-04 20:39:18 -0700
Commit: 4ed7b5a, github.com/apache/spark/pull/1772
[SPARK-1687] [PySpark] fix unit tests related to pickable namedtuple
Davies Liu <davies.liu@gmail.com>
2014-08-04 15:54:52 -0700
Commit: 2225d18, github.com/apache/spark/pull/1771
SPARK-2792. Fix reading too much or too little data from each stream in ExternalMap / Sorter
Matei Zaharia <matei@databricks.com>
2014-08-04 12:59:18 -0700
Commit: aa7a48e, github.com/apache/spark/pull/1722
[SPARK-1687] [PySpark] pickable namedtuple
Davies Liu <davies.liu@gmail.com>
2014-08-04 12:13:41 -0700
Commit: bfd2f39, github.com/apache/spark/pull/1623
[MLlib] [SPARK-2510]Word2Vec: Distributed Representation of Words
Liquan Pei <lpei@gopivotal.com>, Xiangrui Meng <meng@databricks.com>, Liquan Pei <liquanpei@gmail.com>
2014-08-03 23:55:58 -0700
Commit: 3823f6d, github.com/apache/spark/pull/1719
SPARK-2272 [MLlib] Feature scaling which standardizes the range of independent variables or features of data
DB Tsai <dbtsai@alpinenow.com>
2014-08-03 21:39:21 -0700
Commit: 9aa1459, github.com/apache/spark/pull/1207
Fix some bugs with spaces in directory name.
Sarah Gerweck <sarah.a180@gmail.com>
2014-08-03 19:47:05 -0700
Commit: 2152e24, github.com/apache/spark/pull/1756
[SPARK-2810] upgrade to scala-maven-plugin 3.2.0
Anand Avati <avati@redhat.com>
2014-08-03 17:47:49 -0700
Commit: 4784d24, github.com/apache/spark/pull/1711
[SPARK-1740] [PySpark] kill the python worker
Davies Liu <davies.liu@gmail.com>
2014-08-03 15:52:00 -0700
Commit: a4cdb77, github.com/apache/spark/pull/1643
[SPARK-2783][SQL] Basic support for analyze in HiveContext
Yin Huai <huai@cse.ohio-state.edu>
2014-08-03 14:54:41 -0700
Commit: 7c6afda, github.com/apache/spark/pull/1741
[SPARK-2814][SQL] HiveThriftServer2 throws NPE when executing native commands
Cheng Lian <lian.cs.zju@gmail.com>
2014-08-03 12:34:46 -0700
Commit: 6ffdcc6, github.com/apache/spark/pull/1753
[SPARK-2784][SQL] Deprecate hql() method in favor of a config option, 'spark.sql.dialect'
Michael Armbrust <michael@databricks.com>
2014-08-03 12:28:29 -0700
Commit: c5ed1de, github.com/apache/spark/pull/1746
[SPARK-2197] [mllib] Java DecisionTree bug fix and easy-of-use
Joseph K. Bradley <joseph.kurata.bradley@gmail.com>
2014-08-03 10:36:52 -0700
Commit: eaa9355, github.com/apache/spark/pull/1740
SPARK-2246: Add user-data option to EC2 scripts
Allan Douglas R. de Oliveira <allan@chaordicsystems.com>
2014-08-03 10:25:59 -0700
Commit: 162fc95, github.com/apache/spark/pull/1186
SPARK-2712 - Add a small note to maven doc that mvn package must happen ...
Stephen Boesch <javadba@gmail.com>
2014-08-03 10:19:04 -0700
Commit: 1992175, github.com/apache/spark/pull/1615
[Minor] Fixes on top of #1679
Andrew Or <andrewor14@gmail.com>
2014-08-02 22:00:46 -0700
Commit: fb2a207, github.com/apache/spark/pull/1736
SPARK-2414 [BUILD] Add LICENSE entry for jquery
Sean Owen <srowen@gmail.com>
2014-08-02 21:55:56 -0700
Commit: c137928, github.com/apache/spark/pull/1748
SPARK-2602 [BUILD] Tests steal focus under Java 6
Sean Owen <srowen@gmail.com>
2014-08-02 21:44:19 -0700
Commit: 0d47bb6, github.com/apache/spark/pull/1747
[SPARK-2739][SQL] Rename registerAsTable to registerTempTable
Michael Armbrust <michael@databricks.com>
2014-08-02 18:27:04 -0700
Commit: 5b30e00, github.com/apache/spark/pull/1743
[SPARK-2797] [SQL] SchemaRDDs don't support unpersist()
Yin Huai <huai@cse.ohio-state.edu>
2014-08-02 17:55:22 -0700
Commit: 5ef8282, github.com/apache/spark/pull/1745
[SPARK-2729][SQL] Added test case for SPARK-2729
Cheng Lian <lian.cs.zju@gmail.com>
2014-08-02 17:12:49 -0700
Commit: 460fad8, github.com/apache/spark/pull/1738
[SPARK-2785][SQL] Remove assertions that throw when users try unsupported Hive commands.
Michael Armbrust <michael@databricks.com>
2014-08-02 16:48:07 -0700
Commit: 4230df4, github.com/apache/spark/pull/1742
[SPARK-2097][SQL] UDF Support
Michael Armbrust <michael@databricks.com>
2014-08-02 16:33:48 -0700
Commit: 3b9f25f, github.com/apache/spark/pull/1063
SPARK-2804: Remove scalalogging-slf4j dependency
GuoQiang Li <witgo@qq.com>
2014-08-02 13:55:28 -0700
Commit: 7924d72, github.com/apache/spark/pull/also
[SPARK-1981] Add AWS Kinesis streaming support
Chris Fregly <chris@fregly.com>
2014-08-02 13:35:35 -0700
Commit: bb0ac6d, github.com/apache/spark/pull/1434
[SQL] Set outputPartitioning of BroadcastHashJoin correctly.
Yin Huai <huai@cse.ohio-state.edu>
2014-08-02 13:16:41 -0700
Commit: 91de0dc, github.com/apache/spark/pull/1735
[SPARK-2478] [mllib] DecisionTree Python API
Joseph K. Bradley <joseph.kurata.bradley@gmail.com>
2014-08-02 13:07:17 -0700
Commit: 8d6ac2b, github.com/apache/spark/pull/1727
[HOTFIX] Do not throw NPE if spark.test.home is not set
Andrew Or <andrewor14@gmail.com>
2014-08-02 12:11:50 -0700
Commit: e221108, github.com/apache/spark/pull/1739
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <pwendell@gmail.com>
2014-08-02 01:26:16 -0700
Commit: 87738bf, github.com/apache/spark/pull/706
HOTFIX: Fix concurrency issue in FlumePollingStreamSuite.
Patrick Wendell <pwendell@gmail.com>
2014-08-02 01:11:03 -0700
Commit: 44460ba
HOTFIX: Fixing test error in maven for flume-sink.
Patrick Wendell <pwendell@gmail.com>
2014-08-02 00:57:47 -0700
Commit: 25cad6a
[SPARK-1812] sql/catalyst - Provide explicit type information
Anand Avati <avati@redhat.com>
2014-08-02 00:48:17 -0700
Commit: 08c095b, github.com/apache/spark/pull/1709
[SPARK-2454] Do not ship spark home to Workers
Andrew Or <andrewor14@gmail.com>
2014-08-02 00:45:38 -0700
Commit: 148af60, github.com/apache/spark/pull/1734
[SPARK-2316] Avoid O(blocks) operations in listeners
Andrew Or <andrewor14@gmail.com>
2014-08-01 23:56:24 -0700
Commit: d934801, github.com/apache/spark/pull/1679
Revert "[SPARK-1470][SPARK-1842] Use the scala-logging wrapper instead of the directly sfl4j api"
Patrick Wendell <pwendell@gmail.com>
2014-08-01 23:55:30 -0700
Commit: dab3796
[SPARK-1470][SPARK-1842] Use the scala-logging wrapper instead of the directly sfl4j api
GuoQiang Li <witgo@qq.com>
2014-08-01 23:55:11 -0700
Commit: adc8303, github.com/apache/spark/pull/1369
StatCounter on NumPy arrays [PYSPARK][SPARK-2012]
Jeremy Freeman <the.freeman.lab@gmail.com>
2014-08-01 22:33:25 -0700
Commit: 4bc3bb2, github.com/apache/spark/pull/1725
[SPARK-2801][MLlib]: DistributionGenerator renamed to RandomDataGenerator. RandomRDD is now of generic type
Burak <brkyvz@gmail.com>
2014-08-01 22:32:12 -0700
Commit: fda4759, github.com/apache/spark/pull/1732
[SPARK-1580][MLLIB] Estimate ALS communication and computation costs.
Tor Myklebust <tmyklebu@gmail.com>, Xiangrui Meng <meng@databricks.com>
2014-08-01 21:25:02 -0700
Commit: e25ec06, github.com/apache/spark/pull/493
[SPARK-2550][MLLIB][APACHE SPARK] Support regularization and intercept in pyspark's linear methods.
Michael Giannakopoulos <miccagiann@gmail.com>
2014-08-01 21:00:31 -0700
Commit: c281189, github.com/apache/spark/pull/1624
Streaming mllib [SPARK-2438][MLLIB]
Jeremy Freeman <the.freeman.lab@gmail.com>, freeman <the.freeman.lab@gmail.com>
2014-08-01 20:10:26 -0700
Commit: f6a1899, github.com/apache/spark/pull/1361
[SPARK-2764] Simplify daemon.py process structure
Josh Rosen <joshrosen@apache.org>
2014-08-01 19:38:21 -0700
Commit: e8e0fd6, github.com/apache/spark/pull/1680
[SPARK-2800]: Exclude scalastyle-output.xml Apache RAT checks
GuoQiang Li <witgo@qq.com>
2014-08-01 19:35:16 -0700
Commit: a38d3c9, github.com/apache/spark/pull/1729
[SPARK-2116] Load spark-defaults.conf from SPARK_CONF_DIR if set
Albert Chu <chu11@llnl.gov>
2014-08-01 19:00:38 -0700
Commit: 0da07da, github.com/apache/spark/pull/1059
[SPARK-2212][SQL] Hash Outer Join (follow-up bug fix).
Yin Huai <huai@cse.ohio-state.edu>
2014-08-01 18:52:01 -0700
Commit: 3822f33, github.com/apache/spark/pull/1721
[SPARK-2010] [PySpark] [SQL] support nested structure in SchemaRDD
Davies Liu <davies.liu@gmail.com>
2014-08-01 18:47:41 -0700
Commit: 880eabe, github.com/apache/spark/pull/1598
[SPARK-2796] [mllib] DecisionTree bug fix: ordered categorical features
Joseph K. Bradley <joseph.kurata.bradley@gmail.com>
2014-08-01 15:52:21 -0700
Commit: 7058a53, github.com/apache/spark/pull/1720
[SPARK-2786][mllib] Python correlations
Doris Xin <doris.s.xin@gmail.com>
2014-08-01 15:02:17 -0700
Commit: d88e695, github.com/apache/spark/pull/1713
SPARK-2791: Fix committing, reverting and state tracking in shuffle file consolidation
Aaron Davidson <aaron@databricks.com>
2014-08-01 13:57:19 -0700
Commit: 78f2af5, github.com/apache/spark/pull/1678
[SPARK-2379] Fix the bug that streaming's receiver may fall into a dead loop
joyyoj <sunshch@gmail.com>
2014-08-01 13:41:55 -0700
Commit: b270309, github.com/apache/spark/pull/1694
SPARK-1612: Fix potential resource leaks
zsxwing <zsxwing@gmail.com>
2014-08-01 13:25:04 -0700
Commit: f5d9bea, github.com/apache/spark/pull/535
[SPARK-2490] Change recursive visiting on RDD dependencies to iterative approach
Liang-Chi Hsieh <viirya@gmail.com>
2014-08-01 12:12:30 -0700
Commit: baf9ce1, github.com/apache/spark/pull/1418
[SPARK-695] In DAGScheduler's getPreferredLocs, track set of visited partitions.
Aaron Staple <aaron.staple@gmail.com>
2014-08-01 12:04:04 -0700
Commit: eb5bdca, github.com/apache/spark/pull/1362
[SQL] Documentation: Explain cacheTable command
CrazyJvm <crazyjvm@gmail.com>
2014-08-01 11:46:13 -0700
Commit: c82fe47, github.com/apache/spark/pull/1681
[SPARK-2767] [SQL] SparkSQL CLI doens't output error message if query failed.
Cheng Hao <hao.cheng@intel.com>
2014-08-01 11:42:05 -0700
Commit: c0b47ba, github.com/apache/spark/pull/1686
[SPARK-2729] [SQL] Forgot to match Timestamp type in ColumnBuilder
chutium <teng.qiu@gmail.com>
2014-08-01 11:31:44 -0700
Commit: 580c701, github.com/apache/spark/pull/1636
[SQL][SPARK-2212]Hash Outer Join
Cheng Hao <hao.cheng@intel.com>
2014-08-01 11:27:12 -0700
Commit: 4415722, github.com/apache/spark/pull/1147
[SPARK-2179][SQL] A minor refactoring Java data type APIs (2179 follow-up).
Yin Huai <huai@cse.ohio-state.edu>
2014-08-01 11:14:53 -0700
Commit: c41fdf0, github.com/apache/spark/pull/1712
SPARK-2099. Report progress while task is running.
Sandy Ryza <sandy@cloudera.com>
2014-08-01 11:08:39 -0700
Commit: 8d338f6, github.com/apache/spark/pull/1056
[HOTFIX] downgrade breeze version to 0.7
Xiangrui Meng <meng@databricks.com>
2014-08-01 10:00:46 -0700
Commit: 5328c0a, github.com/apache/spark/pull/1718
[SPARK-1997] update breeze to version 0.8.1
witgo <witgo@qq.com>
2014-08-01 07:47:44 -0700
Commit: 0dacb1a, github.com/apache/spark/pull/940
SPARK-2768 [MLLIB] Add product, user recommend method to MatrixFactorizationModel
Sean Owen <srowen@gmail.com>
2014-08-01 07:32:53 -0700
Commit: 82d209d, github.com/apache/spark/pull/1687
[SPARK-2103][Streaming] Change to ClassTag for KafkaInputDStream and fix reflection issue
jerryshao <saisai.shao@intel.com>
2014-08-01 04:32:46 -0700
Commit: a32f0fb, github.com/apache/spark/pull/1508
[Spark 2557] fix LOCAL_N_REGEX in createTaskScheduler and make local-n and local-n-failures consistent
Ye Xianjin <advancedxy@gmail.com>
2014-08-01 00:34:39 -0700
Commit: 284771e, github.com/apache/spark/pull/1464
SPARK-2134: Report metrics before application finishes
Rahul Singhal <rahul.singhal@guavus.com>
2014-08-01 00:33:15 -0700
Commit: f1957e1, github.com/apache/spark/pull/1076
SPARK-983. Support external sorting in sortByKey()
Matei Zaharia <matei@databricks.com>
2014-08-01 00:16:18 -0700
Commit: 72e3369, github.com/apache/spark/pull/931
[SPARK-2670] FetchFailedException should be thrown when local fetch has failed
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-08-01 00:01:30 -0700
Commit: 8ff4417, github.com/apache/spark/pull/1578
SPARK-2738. Remove redundant imports in BlockManagerSuite
Sandy Ryza <sandy@cloudera.com>
2014-07-31 23:12:38 -0700
Commit: cb9e7d5, github.com/apache/spark/pull/1642
SPARK-2632, SPARK-2576. Fixed by only importing what is necessary during class definition.
Prashant Sharma <scrapcodes@gmail.com>, Yin Huai <huai@cse.ohio-state.edu>, Prashant Sharma <prashant.s@imaginea.com>
2014-07-31 22:57:13 -0700
Commit: 1499101, github.com/apache/spark/pull/1635
[SPARK-2702][Core] Upgrade Tachyon dependency to 0.5.0
Haoyuan Li <haoyuan@cs.berkeley.edu>
2014-07-31 22:53:42 -0700
Commit: 2cdc3e5, github.com/apache/spark/pull/1651
[SPARK-2782][mllib] Bug fix for getRanks in SpearmanCorrelation
Doris Xin <doris.s.xin@gmail.com>
2014-07-31 21:23:35 -0700
Commit: c475540, github.com/apache/spark/pull/1710
[SPARK-2777][MLLIB] change ALS factors storage level to MEMORY_AND_DISK
Xiangrui Meng <meng@databricks.com>
2014-07-31 21:14:08 -0700
Commit: b190083, github.com/apache/spark/pull/1700
SPARK-2766: ScalaReflectionSuite throw an llegalArgumentException in JDK 6
GuoQiang Li <witgo@qq.com>
2014-07-31 21:06:57 -0700
Commit: 9998efa, github.com/apache/spark/pull/1683
[SPARK-2779] [SQL] asInstanceOf[Map[...]] should use scala.collection.Map instead of scala.collection.immutable.Map
Yin Huai <huai@cse.ohio-state.edu>
2014-07-31 21:02:11 -0700
Commit: 9632719, github.com/apache/spark/pull/1705
[SPARK-2756] [mllib] Decision tree bug fixes
Joseph K. Bradley <joseph.kurata.bradley@gmail.com>
2014-07-31 20:51:48 -0700
Commit: b124de5, github.com/apache/spark/pull/1673
[SPARK-2724] Python version of RandomRDDGenerators
Doris Xin <doris.s.xin@gmail.com>
2014-07-31 20:32:57 -0700
Commit: d843014, github.com/apache/spark/pull/1628
[SPARK-2531 & SPARK-2436] [SQL] Optimize the BuildSide when planning BroadcastNestedLoopJoin.
Zongheng Yang <zongheng.y@gmail.com>
2014-07-31 19:32:16 -0700
Commit: 8f51491, github.com/apache/spark/pull/1448
SPARK-2282: Reuse Socket for sending accumulator updates to Pyspark
Aaron Davidson <aaron@databricks.com>
2014-07-31 15:31:53 -0700
Commit: ef4ff00, github.com/apache/spark/pull/1503
SPARK-2740: allow user to specify ascending and numPartitions for sortBy...
Rui Li <rui.li@intel.com>
2014-07-31 15:07:26 -0700
Commit: 492a195, github.com/apache/spark/pull/1645
Docs: monitoring, streaming programming guide
kballou <kballou@devnulllabs.io>
2014-07-31 14:58:52 -0700
Commit: cc82050, github.com/apache/spark/pull/1662
Improvements to merge_spark_pr.py
Josh Rosen <joshrosen@apache.org>
2014-07-31 14:35:09 -0700
Commit: e021362, github.com/apache/spark/pull/1668
[SPARK-2523] [SQL] Hadoop table scan bug fixing (fix failing Jenkins maven test)
Yin Huai <huai@cse.ohio-state.edu>
2014-07-31 13:05:24 -0700
Commit: 49b3612, github.com/apache/spark/pull/1669
[SPARK-2511][MLLIB] add HashingTF and IDF
Xiangrui Meng <meng@databricks.com>
2014-07-31 12:55:00 -0700
Commit: dc0865b, github.com/apache/spark/pull/1671
SPARK-2646. log4j initialization not quite compatible with log4j 2.x
Sean Owen <srowen@gmail.com>
2014-07-31 12:26:36 -0700
Commit: e5749a1, github.com/apache/spark/pull/1547
SPARK-2749 [BUILD] Part 2. Fix a follow-on scalastyle error
Sean Owen <srowen@gmail.com>
2014-07-31 12:18:40 -0700
Commit: 4dbabb3, github.com/apache/spark/pull/1690
SPARK-2664. Deal with `--conf` options in spark-submit that relate to fl...
Sandy Ryza <sandy@cloudera.com>
2014-07-31 11:51:20 -0700
Commit: f68105d, github.com/apache/spark/pull/1665
SPARK-2028: Expose mapPartitionsWithInputSplit in HadoopRDD
Aaron Davidson <aaron@databricks.com>
2014-07-31 11:35:38 -0700
Commit: f193312, github.com/apache/spark/pull/973
[SPARK-2397][SQL] Deprecate LocalHiveContext
Michael Armbrust <michael@databricks.com>
2014-07-31 11:26:43 -0700
Commit: 72cfb13, github.com/apache/spark/pull/1641
[SPARK-2743][SQL] Resolve original attributes in ParquetTableScan
Michael Armbrust <michael@databricks.com>
2014-07-31 11:15:25 -0700
Commit: 3072b96, github.com/apache/spark/pull/1647
[SPARK-2762] SparkILoop leaks memory in multi-repl configurations
Timothy Hunter <timhunter@databricks.com>
2014-07-31 10:25:40 -0700
Commit: 92ca910, github.com/apache/spark/pull/1674
automatically set master according to `spark.master` in `spark-defaults....
CrazyJvm <crazyjvm@gmail.com>
2014-07-30 23:37:25 -0700
Commit: 669e3f0, github.com/apache/spark/pull/1644
[SPARK-2497] Included checks for module symbols too.
Prashant Sharma <prashant.s@imaginea.com>
2014-07-30 22:46:30 -0700
Commit: 5a110da, github.com/apache/spark/pull/1463
[SPARK-2737] Add retag() method for changing RDDs' ClassTags.
Josh Rosen <joshrosen@apache.org>
2014-07-30 22:40:57 -0700
Commit: 4fb2593, github.com/apache/spark/pull/1639
[SPARK-2340] Resolve event logging and History Server paths properly
Andrew Or <andrewor14@gmail.com>
2014-07-30 21:57:32 -0700
Commit: a7c305b, github.com/apache/spark/pull/1280
Required AM memory is "amMem", not "args.amMemory"
derek ma <maji3@asiainfo-linkage.com>
2014-07-30 21:37:59 -0700
Commit: 118c1c4, github.com/apache/spark/pull/1494
[SPARK-2758] UnionRDD's UnionPartition should not reference parent RDDs
Reynold Xin <rxin@apache.org>
2014-07-30 21:30:13 -0700
Commit: 894d48f, github.com/apache/spark/pull/1675
SPARK-2045 Sort-based shuffle
Matei Zaharia <matei@databricks.com>
2014-07-30 18:07:59 -0700
Commit: e966284, github.com/apache/spark/pull/1499
Update DecisionTreeRunner.scala
strat0sphere <stratos.dimopoulos@gmail.com>
2014-07-30 17:57:50 -0700
Commit: da50176, github.com/apache/spark/pull/1676
SPARK-2341 [MLLIB] loadLibSVMFile doesn't handle regression datasets
Sean Owen <srowen@gmail.com>
2014-07-30 17:34:32 -0700
Commit: e9b275b, github.com/apache/spark/pull/1663
[SPARK-2734][SQL] Remove tables from cache when DROP TABLE is run.
Michael Armbrust <michael@databricks.com>
2014-07-30 17:30:51 -0700
Commit: 88a519d, github.com/apache/spark/pull/1650
SPARK-2741 - Publish version of spark assembly which does not contain Hive
Brock Noland <brock@apache.org>
2014-07-30 17:04:30 -0700
Commit: 2ac37db, github.com/apache/spark/pull/1667
SPARK-2749 [BUILD]. Spark SQL Java tests aren't compiling in Jenkins' Maven builds; missing junit:junit dep
Sean Owen <srowen@gmail.com>
2014-07-30 15:04:33 -0700
Commit: 6ab96a6, github.com/apache/spark/pull/1660
Properly pass SBT_MAVEN_PROFILES into sbt.
Reynold Xin <rxin@apache.org>
2014-07-30 14:31:20 -0700
Commit: 2f4b170
Set AMPLAB_JENKINS_BUILD_PROFILE.
Reynold Xin <rxin@apache.org>
2014-07-30 14:08:24 -0700
Commit: 1097327
Wrap JAR_DL in dev/check-license.
Reynold Xin <rxin@apache.org>
2014-07-30 13:42:43 -0700
Commit: 7c7ce54
[SPARK-2024] Add saveAsSequenceFile to PySpark
Kan Zhang <kzhang@apache.org>
2014-07-30 13:19:05 -0700
Commit: 94d1f46, github.com/apache/spark/pull/1338
dev/check-license wrap folders in quotes.
Reynold Xin <rxin@apache.org>
2014-07-30 13:17:14 -0700
Commit: 437dc8c
[SQL] Fix compiling of catalyst docs.
Michael Armbrust <michael@databricks.com>
2014-07-30 13:11:09 -0700
Commit: 2248891, github.com/apache/spark/pull/1653
More wrapping FWDIR in quotes.
Reynold Xin <rxin@apache.org>
2014-07-30 13:04:20 -0700
Commit: 0feb349
Wrap FWDIR in quotes in dev/check-license.
Reynold Xin <rxin@apache.org>
2014-07-30 12:33:42 -0700
Commit: 95cf203
Wrap FWDIR in quotes.
Reynold Xin <rxin@apache.org>
2014-07-30 12:24:35 -0700
Commit: f2eb84f
[SPARK-2746] Set SBT_MAVEN_PROFILES only when it is not set explicitly by the user.
Reynold Xin <rxin@apache.org>
2014-07-30 11:45:24 -0700
Commit: ff511ba, github.com/apache/spark/pull/1655
[SPARK-2544][MLLIB] Improve ALS algorithm resource usage
GuoQiang Li <witgo@qq.com>, witgo <witgo@qq.com>
2014-07-30 11:00:11 -0700
Commit: fc47bb6, github.com/apache/spark/pull/929
Avoid numerical instability
Naftali Harris <naftaliharris@gmail.com>
2014-07-30 09:56:59 -0700
Commit: e3d85b7, github.com/apache/spark/pull/1652
[SPARK-2747] git diff --dirstat can miss sql changes and not run Hive tests
Reynold Xin <rxin@apache.org>
2014-07-30 09:28:53 -0700
Commit: 3bc3f18, github.com/apache/spark/pull/1656
[SPARK-2521] Broadcast RDD object (instead of sending it along with every task)
Reynold Xin <rxin@apache.org>
2014-07-30 09:27:43 -0700
Commit: 774142f, github.com/apache/spark/pull/1498
SPARK-2748 [MLLIB] [GRAPHX] Loss of precision for small arguments to Math.exp, Math.log
Sean Owen <srowen@gmail.com>
2014-07-30 08:55:15 -0700
Commit: ee07541, github.com/apache/spark/pull/1659
SPARK-2543: Allow user to set maximum Kryo buffer size
Koert Kuipers <koert@tresata.com>
2014-07-30 00:18:59 -0700
Commit: 7c5fc28, github.com/apache/spark/pull/735
[SPARK-2179][SQL] Public API for DataTypes and Schema
Yin Huai <huai@cse.ohio-state.edu>
2014-07-30 00:15:31 -0700
Commit: 7003c16, github.com/apache/spark/pull/1346
[SPARK-2260] Fix standalone-cluster mode, which was broken
Andrew Or <andrewor14@gmail.com>
2014-07-29 23:52:09 -0700
Commit: 4ce92cc, github.com/apache/spark/pull/1538
[SQL] Handle null values in debug()
Michael Armbrust <michael@databricks.com>
2014-07-29 22:42:54 -0700
Commit: 077f633, github.com/apache/spark/pull/1646
[SPARK-2568] RangePartitioner should run only one job if data is balanced
Xiangrui Meng <meng@databricks.com>, Reynold Xin <rxin@apache.org>
2014-07-29 22:16:20 -0700
Commit: 2e6efca, github.com/apache/spark/pull/1562
[SPARK-2054][SQL] Code Generation for Expression Evaluation
Michael Armbrust <michael@databricks.com>
2014-07-29 20:58:05 -0700
Commit: 8446746, github.com/apache/spark/pull/993
[SPARK-2305] [PySpark] Update Py4J to version 0.8.2.1
Josh Rosen <joshrosen@apache.org>
2014-07-29 19:02:06 -0700
Commit: 22649b6, github.com/apache/spark/pull/1626
[SPARK-2631][SQL] Use SQLConf to configure in-memory columnar caching
Michael Armbrust <michael@databricks.com>
2014-07-29 18:20:51 -0700
Commit: 86534d0, github.com/apache/spark/pull/1638
[SPARK-2716][SQL] Don't check resolved for having filters.
Michael Armbrust <michael@databricks.com>
2014-07-29 18:14:20 -0700
Commit: 39b8193, github.com/apache/spark/pull/1640
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <pwendell@gmail.com>
2014-07-29 17:52:48 -0700
Commit: 2c35666, github.com/apache/spark/pull/740
[SPARK-2393][SQL] Cost estimation optimization framework for Catalyst logical plans & sample usage.
Zongheng Yang <zongheng.y@gmail.com>
2014-07-29 15:32:50 -0700
Commit: c7db274, github.com/apache/spark/pull/1238
[SPARK-2082] stratified sampling in PairRDDFunctions that guarantees exact sample size
Doris Xin <doris.s.xin@gmail.com>, Xiangrui Meng <meng@databricks.com>
2014-07-29 12:49:44 -0700
Commit: dc96536, github.com/apache/spark/pull/1025
[SPARK-2674] [SQL] [PySpark] support datetime type for SchemaRDD
Davies Liu <davies.liu@gmail.com>
2014-07-29 12:31:39 -0700
Commit: f0d880e, github.com/apache/spark/pull/1601
[SPARK-2730][SQL] When retrieving a value from a Map, GetItem evaluates key twice
Yin Huai <huai@cse.ohio-state.edu>
2014-07-29 12:23:34 -0700
Commit: e364348, github.com/apache/spark/pull/1637
[SQL]change some test lists
Daoyuan <daoyuan.wang@intel.com>
2014-07-29 12:22:48 -0700
Commit: 0c5c6a6, github.com/apache/spark/pull/1634
[STREAMING] SPARK-1729. Make Flume pull data from source, rather than the current pu...
Hari Shreedharan <harishreedharan@gmail.com>, Hari Shreedharan <hshreedharan@apache.org>, Tathagata Das <tathagata.das1565@gmail.com>, harishreedharan <hshreedharan@cloudera.com>
2014-07-29 11:11:29 -0700
Commit: 800ecff, github.com/apache/spark/pull/807
Minor indentation and comment typo fixes.
Aaron Staple <astaple@gmail.com>
2014-07-29 01:35:26 -0700
Commit: fc4d057, github.com/apache/spark/pull/1630
[SPARK-2174][MLLIB] treeReduce and treeAggregate
Xiangrui Meng <meng@databricks.com>
2014-07-29 01:16:41 -0700
Commit: 20424da, github.com/apache/spark/pull/1110
[SPARK-2726] and [SPARK-2727] Remove SortOrder and do in-place sort.
Reynold Xin <rxin@apache.org>
2014-07-29 01:12:44 -0700
Commit: 96ba04b, github.com/apache/spark/pull/1631
[SPARK-791] [PySpark] fix pickle itemgetter with cloudpickle
Davies Liu <davies.liu@gmail.com>
2014-07-29 01:02:18 -0700
Commit: 92ef026, github.com/apache/spark/pull/1627
[SPARK-2580] [PySpark] keep silent in worker if JVM close the socket
Davies Liu <davies.liu@gmail.com>
2014-07-29 00:15:45 -0700
Commit: ccd5ab5, github.com/apache/spark/pull/1625
Excess judgment
Yadong Qi <qiyadong2010@gmail.com>
2014-07-28 21:39:02 -0700
Commit: 16ef4d1, github.com/apache/spark/pull/1629
Use commons-lang3 in SignalLogger rather than commons-lang
Aaron Davidson <aaron@databricks.com>
2014-07-28 13:37:44 -0700
Commit: 39ab87b, github.com/apache/spark/pull/1621
[SPARK-2410][SQL] Merging Hive Thrift/JDBC server (with Maven profile fix)
Cheng Lian <lian.cs.zju@gmail.com>
2014-07-28 12:07:30 -0700
Commit: a7a9d14, github.com/apache/spark/pull/1620
[SPARK-2479][MLlib] Comparing floating-point numbers using relative error in UnitTests
DB Tsai <dbtsai@alpinenow.com>
2014-07-28 11:34:19 -0700
Commit: 255b56f, github.com/apache/spark/pull/1425
[SPARK-2523] [SQL] Hadoop table scan bug fixing
Cheng Hao <hao.cheng@intel.com>
2014-07-28 10:59:53 -0700
Commit: 2b8d89e, github.com/apache/spark/pull/1439
[SPARK-1550] [PySpark] Allow SparkContext creation after failed attempts
Josh Rosen <joshrosen@apache.org>
2014-07-27 22:54:43 -0700
Commit: a7d145e, github.com/apache/spark/pull/1606
SPARK-2651: Add maven scalastyle plugin
Rahul Singhal <rahul.singhal@guavus.com>
2014-07-27 18:50:32 -0700
Commit: d7eac4c, github.com/apache/spark/pull/1550
Revert "[SPARK-2410][SQL] Merging Hive Thrift/JDBC server"
Patrick Wendell <pwendell@gmail.com>
2014-07-27 18:46:58 -0700
Commit: e5bbce9
[SPARK-2514] [mllib] Random RDD generator
Doris Xin <doris.s.xin@gmail.com>
2014-07-27 16:16:39 -0700
Commit: 81fcdd2, github.com/apache/spark/pull/1520
[SPARK-1777] Prevent OOMs from single partitions
Andrew Or <andrewor14@gmail.com>
2014-07-27 16:08:16 -0700
Commit: ecf30ee, github.com/apache/spark/pull/1165
[SPARK-2410][SQL] Merging Hive Thrift/JDBC server
Cheng Lian <lian.cs.zju@gmail.com>
2014-07-27 13:03:38 -0700
Commit: f6ff2a6, github.com/apache/spark/pull/1600
[SPARK-2705][CORE] Fixed stage description in stage info page
Cheng Lian <lian.cs.zju@gmail.com>
2014-07-27 12:35:21 -0700
Commit: 2bbf235, github.com/apache/spark/pull/1524
SPARK-2684: Update ExternalAppendOnlyMap to take an iterator as input
Matei Zaharia <matei@databricks.com>
2014-07-27 11:20:20 -0700
Commit: 9857053, github.com/apache/spark/pull/1607
[SPARK-2679] [MLLib] Ser/De for Double
Doris Xin <doris.s.xin@gmail.com>
2014-07-27 07:21:07 -0700
Commit: 3a69c72, github.com/apache/spark/pull/1581
[SPARK-2361][MLLIB] Use broadcast instead of serializing data directly into task closure
Xiangrui Meng <meng@databricks.com>
2014-07-26 22:56:07 -0700
Commit: aaf2b73, github.com/apache/spark/pull/1427
SPARK-2680: Lower spark.shuffle.memoryFraction to 0.2 by default
Matei Zaharia <matei@databricks.com>
2014-07-26 22:44:17 -0700
Commit: b547f69, github.com/apache/spark/pull/1593
[SPARK-2601] [PySpark] Fix Py4J error when transforming pickleFiles
Josh Rosen <joshrosen@apache.org>
2014-07-26 17:37:05 -0700
Commit: ba46bbe, github.com/apache/spark/pull/1605
[SPARK-2704] Name threads in ConnectionManager and mark them as daemon.
Reynold Xin <rxin@apache.org>
2014-07-26 15:00:32 -0700
Commit: 1290164, github.com/apache/spark/pull/1604
[SPARK-2279] Added emptyRDD method to Java API
bpaulin <bob@bobpaulin.com>
2014-07-26 10:27:09 -0700
Commit: c183b92, github.com/apache/spark/pull/1597
[SPARK-2652] [PySpark] Turning some default configs for PySpark
Davies Liu <davies.liu@gmail.com>
2014-07-26 01:07:08 -0700
Commit: 75663b5, github.com/apache/spark/pull/1568
[SPARK-2696] Reduce default value of spark.serializer.objectStreamReset
Hossein <hossein@databricks.com>
2014-07-26 01:04:56 -0700
Commit: 66f26a4, github.com/apache/spark/pull/1595
[SPARK-1458] [PySpark] Expose sc.version in Java and PySpark
Josh Rosen <joshrosen@apache.org>
2014-07-26 00:54:05 -0700
Commit: cf3e9fd, github.com/apache/spark/pull/1596
[SPARK-2659][SQL] Fix division semantics for hive
Michael Armbrust <michael@databricks.com>
2014-07-25 19:17:49 -0700
Commit: 8904791, github.com/apache/spark/pull/1557
Part of [SPARK-2456] Removed some HashMaps from DAGScheduler by storing information in Stage.
Reynold Xin <rxin@apache.org>
2014-07-25 18:45:02 -0700
Commit: 9d8666c, github.com/apache/spark/pull/1561
Revert "[SPARK-2410][SQL] Merging Hive Thrift/JDBC server"
Michael Armbrust <michael@databricks.com>
2014-07-25 15:36:57 -0700
Commit: afd757a, github.com/apache/spark/pull/1594
[SPARK-1726] [SPARK-2567] Eliminate zombie stages in UI.
Kay Ousterhout <kayousterhout@gmail.com>
2014-07-25 15:14:13 -0700
Commit: 37ad3b7, github.com/apache/spark/pull/1566
[SPARK-2125] Add sort flag and move sort into shuffle implementations
jerryshao <saisai.shao@intel.com>
2014-07-25 14:34:38 -0700
Commit: 47b6b38, github.com/apache/spark/pull/1210
[SQL]Update HiveMetastoreCatalog.scala
baishuo(白硕) <vc_java@hotmail.com>
2014-07-25 13:59:45 -0700
Commit: ab3c6a4, github.com/apache/spark/pull/1569
[SPARK-2682] Javadoc generated from Scala source code is not in javadoc's index
Yin Huai <huai@cse.ohio-state.edu>
2014-07-25 13:00:13 -0700
Commit: a19d8c8, github.com/apache/spark/pull/1584
[SPARK-2410][SQL] Merging Hive Thrift/JDBC server
Cheng Lian <lian.cs.zju@gmail.com>
2014-07-25 12:20:49 -0700
Commit: 06dc0d2, github.com/apache/spark/pull/1399
[SPARK-2683] unidoc failed because org.apache.spark.util.CallSite uses Java keywords as value names
Yin Huai <huai@cse.ohio-state.edu>
2014-07-25 11:14:51 -0700
Commit: 32bcf9a, github.com/apache/spark/pull/1585
replace println to log4j
fireflyc <fireflyc@126.com>
2014-07-25 10:47:52 -0700
Commit: a2715cc, github.com/apache/spark/pull/1372
[SPARK-2665] [SQL] Add EqualNS & Unit Tests
Cheng Hao <hao.cheng@intel.com>
2014-07-25 01:30:22 -0700
Commit: 184aa1c, github.com/apache/spark/pull/1570
[SPARK-2529] Clean closures in foreach and foreachPartition.
Reynold Xin <rxin@apache.org>
2014-07-25 01:10:05 -0700
Commit: eb82abd, github.com/apache/spark/pull/1583
SPARK-2657 Use more compact data structures than ArrayBuffer in groupBy & cogroup
Matei Zaharia <matei@databricks.com>
2014-07-25 00:32:32 -0700
Commit: 8529ced, github.com/apache/spark/pull/1555
[SPARK-2656] Python version of stratified sampling
Doris Xin <doris.s.xin@gmail.com>
2014-07-24 23:42:08 -0700
Commit: 2f75a4a, github.com/apache/spark/pull/1554
[SPARK-2538] [PySpark] Hash based disk spilling aggregation
Davies Liu <davies.liu@gmail.com>
2014-07-24 22:53:47 -0700
Commit: 14174ab, github.com/apache/spark/pull/1460
[SPARK-2014] Make PySpark store RDDs in MEMORY_ONLY_SER with compression by default
Prashant Sharma <prashant.s@imaginea.com>
2014-07-24 18:15:37 -0700
Commit: eff9714, github.com/apache/spark/pull/1051
[SPARK-2464][Streaming] Fixed Twitter stream stopping bug
Tathagata Das <tathagata.das1565@gmail.com>
2014-07-24 15:59:09 -0700
Commit: a45d548, github.com/apache/spark/pull/1577
SPARK-2250: show stage RDDs in UI
Neville Li <neville@spotify.com>
2014-07-24 14:13:00 -0700
Commit: fec641b, github.com/apache/spark/pull/1188
[SPARK-2037]: yarn client mode doesn't support spark.yarn.max.executor.failures
GuoQiang Li <witgo@qq.com>
2014-07-24 14:46:10 -0500
Commit: 323a83c, github.com/apache/spark/pull/1180
[SPARK-2479 (partial)][MLLIB] fix binary metrics unit tests
Xiangrui Meng <meng@databricks.com>
2014-07-24 12:37:02 -0700
Commit: c960b50, github.com/apache/spark/pull/1576
[SPARK-2603][SQL] Remove unnecessary toMap and toList in converting Java collections to Scala collections JsonRDD.scala
Yin Huai <huai@cse.ohio-state.edu>
2014-07-24 11:19:19 -0700
Commit: b352ef1, github.com/apache/spark/pull/1504
[Build] SPARK-2619: Configurable filemode for the spark/bin folder in debian package
tzolov <christian.tzolov@gmail.com>
2014-07-24 11:12:25 -0700
Commit: 9fd1414, github.com/apache/spark/pull/1531
SPARK-2150: Provide direct link to finished application UI in yarn resou...
Rahul Singhal <rahul.singhal@guavus.com>
2014-07-24 09:31:04 -0500
Commit: 46e224a, github.com/apache/spark/pull/1094
[SPARK-2661][bagel]unpersist old processed rdd
Daoyuan <daoyuan.wang@intel.com>
2014-07-24 00:09:36 -0700
Commit: 42dfab7, github.com/apache/spark/pull/1519
SPARK-2310. Support arbitrary Spark properties on the command line with ...
Sandy Ryza <sandy@cloudera.com>
2014-07-23 23:09:25 -0700
Commit: e34922a, github.com/apache/spark/pull/1253
[SPARK-2658][SQL] Add rule for true = 1.
Michael Armbrust <michael@databricks.com>
2014-07-23 22:52:49 -0700
Commit: 78d18fd, github.com/apache/spark/pull/1556
SPARK-2662: Fix NPE for JsonProtocol
GuoQiang Li <witgo@qq.com>
2014-07-23 22:50:39 -0700
Commit: 9e7725c, github.com/apache/spark/pull/1511
Replace RoutingTableMessage with pair
Ankur Dave <ankurdave@gmail.com>
2014-07-23 20:11:28 -0700
Commit: 2d25e34, github.com/apache/spark/pull/1553
[SPARK-2484][SQL] Build should not run hivecompatibility tests by default.
witgo <witgo@qq.com>
2014-07-23 18:17:05 -0700
Commit: 60f0ae3, github.com/apache/spark/pull/1403
[SPARK-2549] Functions defined inside of other functions trigger failures
Prashant Sharma <prashant.s@imaginea.com>
2014-07-23 17:12:28 -0700
Commit: 9b76332, github.com/apache/spark/pull/1510
[SPARK-2102][SQL][CORE] Add option for kryo registration required and use a resource pool in Spark SQL for Kryo instances.
Ian O Connell <ioconnell@twitter.com>
2014-07-23 16:30:06 -0700
Commit: efdaeb1, github.com/apache/spark/pull/1377
[SPARK-2569][SQL] Fix shipping of TEMPORARY hive UDFs.
Michael Armbrust <michael@databricks.com>
2014-07-23 16:26:55 -0700
Commit: 1871574, github.com/apache/spark/pull/1552
SPARK-2226: [SQL] transform HAVING clauses with aggregate expressions that aren't in the aggregation list
William Benton <willb@redhat.com>
2014-07-23 16:25:32 -0700
Commit: e060d3e, github.com/apache/spark/pull/1497
SPARK-2277: clear host->rack info properly
Rui Li <rui.li@intel.com>
2014-07-23 16:23:24 -0700
Commit: 91903e0, github.com/apache/spark/pull/1454
[SPARK-2588][SQL] Add some more DSLs.
Takuya UESHIN <ueshin@happy-camper.st>
2014-07-23 14:47:23 -0700
Commit: 1b790cf, github.com/apache/spark/pull/1491
[CORE] SPARK-2640: In "local[N]", free cores of the only executor should be touched by "spark.task.cpus" for every finish/start-up of tasks.
woshilaiceshide <woshilaiceshide@qq.com>
2014-07-23 11:05:41 -0700
Commit: f776bc9, github.com/apache/spark/pull/1544
[SPARK-2609] Log thread ID when spilling ExternalAppendOnlyMap
Andrew Or <andrewor14@gmail.com>
2014-07-23 10:31:45 -0700
Commit: 2592111, github.com/apache/spark/pull/1517
[SPARK-2617] Correct doc and usages of preservesPartitioning
Xiangrui Meng <meng@databricks.com>
2014-07-23 00:58:55 -0700
Commit: 4c7243e, github.com/apache/spark/pull/1526
Remove GraphX MessageToPartition for compatibility with sort-based shuffle
Ankur Dave <ankurdave@gmail.com>
2014-07-22 22:18:30 -0700
Commit: 6c2be93, github.com/apache/spark/pull/1537
[YARN] SPARK-2577: File upload to viewfs is broken due to mount point re...
Gera Shegalov <gera@twitter.com>
2014-07-22 21:05:12 -0500
Commit: 02e4572, github.com/apache/spark/pull/1483
[YARN][SPARK-2606]:In some cases,the spark UI pages display incorrect
GuoQiang Li <witgo@qq.com>
2014-07-22 20:34:40 -0500
Commit: ddadf1b, github.com/apache/spark/pull/1501
Graphx example
CrazyJvm <crazyjvm@gmail.com>
2014-07-22 18:14:44 -0700
Commit: 5f7b991, github.com/apache/spark/pull/1523
[SPARK-2615] [SQL] Add Equal Sign "==" Support for HiveQl
Cheng Hao <hao.cheng@intel.com>
2014-07-22 18:13:28 -0700
Commit: 79fe763, github.com/apache/spark/pull/1522
SPARK-2047: Introduce an in-mem Sorter, and use it to reduce mem usage
Aaron Davidson <aaron@databricks.com>
2014-07-22 11:58:53 -0700
Commit: 85d3596, github.com/apache/spark/pull/1502
[MLLIB] make Mima ignore updateFeatures (private) in ALS
Xiangrui Meng <meng@databricks.com>
2014-07-22 11:45:37 -0700
Commit: 1407871, github.com/apache/spark/pull/1533
[SPARK-2612] [mllib] Fix data skew in ALS
peng.zhang <peng.zhang@xiaomi.com>
2014-07-22 02:39:07 -0700
Commit: 75db174, github.com/apache/spark/pull/1521
[SPARK-2452] Create a new valid for each instead of using lineId.
Prashant Sharma <prashant@apache.org>
2014-07-22 00:38:26 -0700
Commit: 81fec99, github.com/apache/spark/pull/1441
[SPARK-2470] PEP8 fixes to PySpark
Nicholas Chammas <nicholas.chammas@gmail.com>, nchammas <nicholas.chammas@gmail.com>
2014-07-21 22:30:53 -0700
Commit: 5d16d5b, github.com/apache/spark/pull/1505
[SPARK-2086] Improve output of toDebugString to make shuffle boundaries more clear
Gregory Owen <greowen@gmail.com>
2014-07-21 18:55:01 -0700
Commit: c3462c6, github.com/apache/spark/pull/1364
[SPARK-2561][SQL] Fix apply schema
Michael Armbrust <michael@databricks.com>
2014-07-21 18:18:17 -0700
Commit: 511a731, github.com/apache/spark/pull/1470
[SPARK-2434][MLlib]: Warning messages that point users to original MLlib implementations added to Examples
Burak <brkyvz@gmail.com>
2014-07-21 17:03:40 -0700
Commit: a4d6020, github.com/apache/spark/pull/1515
Fix flakey HiveQuerySuite test
Aaron Davidson <aaron@databricks.com>
2014-07-21 14:35:15 -0700
Commit: abeacff, github.com/apache/spark/pull/1514
[SPARK-2494] [PySpark] make hash of None consistant cross machines
Davies Liu <davies.liu@gmail.com>
2014-07-21 11:59:54 -0700
Commit: 872538c, github.com/apache/spark/pull/1371
SPARK-1707. Remove unnecessary 3 second sleep in YarnClusterScheduler
Sandy Ryza <sandy@cloudera.com>
2014-07-21 13:15:46 -0500
Commit: f89cf65, github.com/apache/spark/pull/634
[SPARK-2190][SQL] Specialized ColumnType for Timestamp
Cheng Lian <lian.cs.zju@gmail.com>
2014-07-21 00:46:28 -0700
Commit: cd273a2, github.com/apache/spark/pull/1440
[SPARK-1945][MLLIB] Documentation Improvements for Spark 1.0
Michael Giannakopoulos <miccagiann@gmail.com>
2014-07-20 20:48:44 -0700
Commit: db56f2d, github.com/apache/spark/pull/1311
Improve scheduler delay tooltip.
Kay Ousterhout <kayousterhout@gmail.com>
2014-07-20 20:18:18 -0700
Commit: f6e7302, github.com/apache/spark/pull/1488
[SPARK-2552][MLLIB] stabilize logistic function in pyspark
Xiangrui Meng <meng@databricks.com>
2014-07-20 18:40:36 -0700
Commit: b86db51, github.com/apache/spark/pull/1493
SPARK-2564. ShuffleReadMetrics.totalBlocksRead is redundant
Sandy Ryza <sandy@cloudera.com>
2014-07-20 14:45:34 -0700
Commit: 9564f85, github.com/apache/spark/pull/1474
[SPARK-2495][MLLIB] remove private[mllib] from linear models' constructors
Xiangrui Meng <meng@databricks.com>
2014-07-20 13:04:59 -0700
Commit: 1b10b81, github.com/apache/spark/pull/1492
[SPARK-2598] RangePartitioner's binary search does not use the given Ordering
Reynold Xin <rxin@apache.org>
2014-07-20 11:06:06 -0700
Commit: fa51b0f, github.com/apache/spark/pull/1500
SPARK-2519 part 2. Remove pattern matching on Tuple2 in critical section...
Sandy Ryza <sandy@cloudera.com>
2014-07-20 01:24:32 -0700
Commit: 98ab411, github.com/apache/spark/pull/1447
[SPARK-2524] missing document about spark.deploy.retainedDrivers
lianhuiwang <lianhuiwang09@gmail.com>, Wang Lianhui <lianhuiwang09@gmail.com>, unknown <Administrator@taguswang-PC1.tencent.com>
2014-07-19 20:46:59 -0700
Commit: 4da01e3, github.com/apache/spark/pull/1443
SPARK-2587: Fix error message in make-distribution.sh
Mark Wagner <mwagner@mwagner-ld.linkedin.biz>
2014-07-19 20:24:13 -0700
Commit: c119498, github.com/apache/spark/pull/1489
Typo fix to the programming guide in the docs
Cesar Arevalo <cesar@zephyrhealthinc.com>
2014-07-19 20:20:07 -0700
Commit: 0d01e85, github.com/apache/spark/pull/1495
SPARK-2596 HOTFIX: Deal with non-existent JIRAs.
Patrick Wendell <pwendell@gmail.com>
2014-07-19 18:24:21 -0700
Commit: d39e3b9
SPARK-2596 A tool for mirroring github pull requests on JIRA.
Patrick Wendell <pwendell@gmail.com>
2014-07-19 18:19:08 -0700
Commit: 49e4727, github.com/apache/spark/pull/1496
Revert "[SPARK-2521] Broadcast RDD object (instead of sending it along with every task)."
Reynold Xin <rxin@apache.org>
2014-07-19 16:56:22 -0700
Commit: 1efb369
SPARK-2407: Added Parser of SQL SUBSTR()
chutium <teng.qiu@gmail.com>
2014-07-19 11:04:41 -0500
Commit: 2a73211, github.com/apache/spark/pull/1442
put 'curRequestSize = 0' after 'logDebug' it
Lijie Xu <csxulijie@gmail.com>
2014-07-19 01:27:26 -0700
Commit: 805f329, github.com/apache/spark/pull/1477
[SPARK-2521] Broadcast RDD object (instead of sending it along with every task).
Reynold Xin <rxin@apache.org>
2014-07-18 23:52:47 -0700
Commit: 7b8cd17, github.com/apache/spark/pull/1452
[SPARK-2359][MLlib] Correlations
Doris Xin <doris.s.xin@gmail.com>
2014-07-18 17:25:32 -0700
Commit: a243364, github.com/apache/spark/pull/1367
[SPARK-2571] Correctly report shuffle read metrics.
Kay Ousterhout <kayousterhout@gmail.com>
2014-07-18 14:40:32 -0700
Commit: 7b971b9, github.com/apache/spark/pull/1476
[SPARK-2540] [SQL] Add HiveDecimal & HiveVarchar support in unwrapping data
Cheng Hao <hao.cheng@intel.com>
2014-07-18 16:38:11 -0500
Commit: 7f17208, github.com/apache/spark/pull/1436
[SPARK-2535][SQL] Add StringComparison case to NullPropagation.
Takuya UESHIN <ueshin@happy-camper.st>
2014-07-18 16:24:00 -0500
Commit: 3a1709f, github.com/apache/spark/pull/1451
[MLlib] SPARK-1536: multiclass classification support for decision tree
Manish Amde <manish9ue@gmail.com>, manishamde <manish9ue@gmail.com>, Evan Sparks <sparks@cs.berkeley.edu>
2014-07-18 14:00:13 -0700
Commit: d88f6be, github.com/apache/spark/pull/886
Reservoir sampling implementation.
Reynold Xin <rxin@apache.org>
2014-07-18 12:41:50 -0700
Commit: 586e716, github.com/apache/spark/pull/1478
Added t2 instance types
Basit Mustafa <basitmustafa@computes-things-for-basit.local>
2014-07-18 12:23:47 -0700
Commit: 7f87ab9, github.com/apache/spark/pull/1446
SPARK-2553. Fix compile error
Sandy Ryza <sandy@cloudera.com>
2014-07-18 00:47:43 -0700
Commit: 30b8d36, github.com/apache/spark/pull/1479
SPARK-2553. CoGroupedRDD unnecessarily allocates a Tuple2 per dependency...
Sandy Ryza <sandy@cloudera.com>
2014-07-17 23:57:08 -0700
Commit: e52b871, github.com/apache/spark/pull/1461
[SPARK-2570] [SQL] Fix the bug of ClassCastException
Cheng Hao <hao.cheng@intel.com>
2014-07-17 23:25:01 -0700
Commit: 29809a6, github.com/apache/spark/pull/1475
[SPARK-2411] Add a history-not-found page to standalone Master
Andrew Or <andrewor14@gmail.com>
2014-07-17 19:45:59 -0700
Commit: 6afca2d, github.com/apache/spark/pull/1336
[SPARK-2299] Consolidate various stageIdTo* hash maps in JobProgressListener
Reynold Xin <rxin@apache.org>
2014-07-17 18:58:48 -0700
Commit: 72e9021, github.com/apache/spark/pull/1262
SPARK-1215 [MLLIB]: Clustering: Index out of bounds error (2)
Joseph K. Bradley <joseph.kurata.bradley@gmail.com>
2014-07-17 15:05:02 -0700
Commit: 935fe65, github.com/apache/spark/pull/1468
SPARK-1478.2 Fix incorrect NioServerSocketChannelFactory constructor call
Sean Owen <srowen@gmail.com>
2014-07-17 12:20:48 -0700
Commit: 1fcd5dc, github.com/apache/spark/pull/1466
[SPARK-2534] Avoid pulling in the entire RDD in various operators
Reynold Xin <rxin@apache.org>
2014-07-17 10:54:53 -0700
Commit: d988d34, github.com/apache/spark/pull/1450
[SPARK-2423] Clean up SparkSubmit for readability
Andrew Or <andrewor14@gmail.com>
2014-07-17 01:13:32 -0700
Commit: 9c73822, github.com/apache/spark/pull/1349
SPARK-2526: Simplify options in make-distribution.sh
Patrick Wendell <pwendell@gmail.com>
2014-07-17 01:02:35 -0700
Commit: d0ea496, github.com/apache/spark/pull/1445
[SPARK-2412] CoalescedRDD throws exception with certain pref locs
Aaron Davidson <aaron@databricks.com>
2014-07-17 01:01:14 -0700
Commit: 7c23c0d, github.com/apache/spark/pull/1337
[SPARK-2154] Schedule next Driver when one completes (standalone mode)
Aaron Davidson <aaron@databricks.com>
2014-07-16 14:16:48 -0700
Commit: 9c24974, github.com/apache/spark/pull/1405
SPARK-1097: Do not introduce deadlock while fixing concurrency bug
Aaron Davidson <aaron@databricks.com>
2014-07-16 14:10:17 -0700
Commit: 8867cd0, github.com/apache/spark/pull/1409
[SPARK-2317] Improve task logging.
Reynold Xin <rxin@apache.org>
2014-07-16 11:50:49 -0700
Commit: 7c8d123, github.com/apache/spark/pull/1259
fix compile error of streaming project
James Z.M. Gao <gaozhm@mediav.com>
2014-07-16 11:35:21 -0700
Commit: caa163f, github.com/apache/spark/pull/153
[SPARK-2522] set default broadcast factory to torrent
Xiangrui Meng <meng@databricks.com>
2014-07-16 11:27:51 -0700
Commit: 96f28c9, github.com/apache/spark/pull/1437
[SPARK-2517] Remove some compiler warnings.
Reynold Xin <rxin@apache.org>
2014-07-16 11:15:07 -0700
Commit: ef48222, github.com/apache/spark/pull/1433
[SPARK-2518][SQL] Fix foldability of Substring expression.
Takuya UESHIN <ueshin@happy-camper.st>
2014-07-16 11:13:38 -0700
Commit: cc965ee, github.com/apache/spark/pull/1432
SPARK-2519. Eliminate pattern-matching on Tuple2 in performance-critical...
Sandy Ryza <sandy@cloudera.com>
2014-07-16 11:07:16 -0700
Commit: fc7edc9, github.com/apache/spark/pull/1435
[SQL] Cleaned up ConstantFolding slightly.
Reynold Xin <rxin@apache.org>
2014-07-16 10:55:47 -0700
Commit: 1c5739f, github.com/apache/spark/pull/1430
[SPARK-2525][SQL] Remove as many compilation warning messages as possible in Spark SQL
Yin Huai <huai@cse.ohio-state.edu>
2014-07-16 10:53:59 -0700
Commit: df95d82, github.com/apache/spark/pull/1444
Tightening visibility for various Broadcast related classes.
Reynold Xin <rxin@apache.org>
2014-07-16 10:44:54 -0700
Commit: efe2a8b, github.com/apache/spark/pull/1438
SPARK-2277: make TaskScheduler track hosts on rack
Rui Li <rui.li@intel.com>
2014-07-16 22:53:37 +0530
Commit: 33e64ec, github.com/apache/spark/pull/1212
[SPARK-2119][SQL] Improved Parquet performance when reading off S3
Cheng Lian <lian.cs.zju@gmail.com>
2014-07-16 12:44:51 -0400
Commit: efc452a, github.com/apache/spark/pull/1370
[SPARK-2504][SQL] Fix nullability of Substring expression.
Takuya UESHIN <ueshin@happy-camper.st>
2014-07-15 22:43:48 -0700
Commit: 632fb3d, github.com/apache/spark/pull/1426
[SPARK-2509][SQL] Add optimization for Substring.
Takuya UESHIN <ueshin@happy-camper.st>
2014-07-15 22:35:34 -0700
Commit: 9b38b7c, github.com/apache/spark/pull/1428
[SPARK-2314][SQL] Override collect and take in JavaSchemaRDD, forwarding to SchemaRDD implementations.
Aaron Staple <aaron.staple@gmail.com>
2014-07-15 21:35:36 -0700
Commit: 90ca532, github.com/apache/spark/pull/1421
follow pep8 None should be compared using is or is not
Ken Takagiwa <ken@Kens-MacBook-Pro.local>
2014-07-15 21:34:05 -0700
Commit: 563acf5, github.com/apache/spark/pull/1422
[SPARK-2500] Move the logInfo for registering BlockManager to BlockManagerMasterActor.register method
Henry Saputra <henry.saputra@gmail.com>
2014-07-15 21:21:52 -0700
Commit: 9c12de5, github.com/apache/spark/pull/1424
[SPARK-2469] Use Snappy (instead of LZF) for default shuffle compression codec
Reynold Xin <rxin@apache.org>
2014-07-15 18:47:39 -0700
Commit: 4576d80, github.com/apache/spark/pull/1415
[SPARK-2498] [SQL] Synchronize on a lock when using scala reflection inside data type objects.
Zongheng Yang <zongheng.y@gmail.com>
2014-07-15 17:58:28 -0700
Commit: c2048a5, github.com/apache/spark/pull/1423
[SQL] Attribute equality comparisons should be done by exprId.
Michael Armbrust <michael@databricks.com>
2014-07-15 17:56:17 -0700
Commit: 502f907, github.com/apache/spark/pull/1414
SPARK-2407: Added internal implementation of SQL SUBSTR()
William Benton <willb@redhat.com>
2014-07-15 14:11:57 -0700
Commit: 61de65b, github.com/apache/spark/pull/1359
[SPARK-2474][SQL] For a registered table in OverrideCatalog, the Analyzer failed to resolve references in the format of "tableName.fieldName"
Yin Huai <huai@cse.ohio-state.edu>
2014-07-15 14:06:45 -0700
Commit: 8af46d5, github.com/apache/spark/pull/1406
[SQL] Whitelist more Hive tests.
Michael Armbrust <michael@databricks.com>
2014-07-15 14:04:01 -0700
Commit: bcd0c30, github.com/apache/spark/pull/1396
[SPARK-2483][SQL] Fix parsing of repeated, nested data access.
Michael Armbrust <michael@databricks.com>
2014-07-15 14:01:48 -0700
Commit: 0f98ef1, github.com/apache/spark/pull/1411
[SPARK-2471] remove runtime scope for jets3t
Xiangrui Meng <meng@databricks.com>
2014-07-15 14:00:54 -0700
Commit: a21f9a7, github.com/apache/spark/pull/1402
Added LZ4 to compression codec in configuration page.
Reynold Xin <rxin@apache.org>
2014-07-15 13:13:33 -0700
Commit: e7ec815, github.com/apache/spark/pull/1417
SPARK-1291: Link the spark UI to RM ui in yarn-client mode
witgo <witgo@qq.com>
2014-07-15 13:52:56 -0500
Commit: 72ea56d, github.com/apache/spark/pull/1112
SPARK-2480: Resolve sbt warnings "NOTE: SPARK_YARN is deprecated, please use -Pyarn flag"
witgo <witgo@qq.com>
2014-07-15 10:46:17 -0700
Commit: 9dd635e, github.com/apache/spark/pull/1404
Reformat multi-line closure argument.
William Benton <willb@redhat.com>
2014-07-15 09:13:39 -0700
Commit: cb09e93, github.com/apache/spark/pull/1419
[MLLIB] [SPARK-2222] Add multiclass evaluation metrics
Alexander Ulanov <nashb@yandex.ru>, unknown <ulanov@ULANOV1.emea.hpqcorp.net>, Xiangrui Meng <meng@databricks.com>
2014-07-15 08:40:22 -0700
Commit: 04b01bb, github.com/apache/spark/pull/1155
README update: added "for Big Data".
Reynold Xin <rxin@apache.org>
2014-07-15 02:20:01 -0700
Commit: 6555618
Update README.md to include a slightly more informative project description.
Reynold Xin <rxin@apache.org>
2014-07-15 02:15:29 -0700
Commit: 8f1d422
[SPARK-2477][MLlib] Using appendBias for adding intercept in GeneralizedLinearAlgorithm
DB Tsai <dbtsai@alpinenow.com>
2014-07-15 02:14:58 -0700
Commit: 52beb20, github.com/apache/spark/pull/1410
[SPARK-2399] Add support for LZ4 compression.
Reynold Xin <rxin@apache.org>
2014-07-15 01:46:57 -0700
Commit: dd95aba, github.com/apache/spark/pull/1416
discarded exceeded completedDrivers
lianhuiwang <lianhuiwang09@gmail.com>
2014-07-15 00:22:06 -0700
Commit: 7446f5f, github.com/apache/spark/pull/1114
[SPARK-2485][SQL] Lock usage of hive client.
Michael Armbrust <michael@databricks.com>
2014-07-15 00:13:51 -0700
Commit: c7c7ac8, github.com/apache/spark/pull/1412
[SPARK-2390] Files in staging directory cannot be deleted and wastes the space of HDFS
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-07-14 23:55:39 -0700
Commit: c6d7574, github.com/apache/spark/pull/1326
Add/increase severity of warning in documentation of groupBy()
Aaron Davidson <aaron@databricks.com>
2014-07-14 23:38:12 -0700
Commit: a2aa7be, github.com/apache/spark/pull/1380
SPARK-2486: Utils.getCallSite is now resilient to bogus frames
William Benton <willb@redhat.com>
2014-07-14 23:09:13 -0700
Commit: 1f99fea, github.com/apache/spark/pull/1413
[SPARK-2467] Revert SparkBuild to publish-local to both .m2 and .ivy2.
Takuya UESHIN <ueshin@happy-camper.st>
2014-07-14 23:06:35 -0700
Commit: e2255e4, github.com/apache/spark/pull/1398
[SPARK-2446][SQL] Add BinaryType support to Parquet I/O.
Takuya UESHIN <ueshin@happy-camper.st>
2014-07-14 15:42:28 -0700
Commit: 9fe693b, github.com/apache/spark/pull/1373
[SPARK-1946] Submit tasks after (configured ratio) executors have been registered
li-zhihui <zhihui.li@intel.com>
2014-07-14 15:32:49 -0500
Commit: 3dd8af7, github.com/apache/spark/pull/900
[SPARK-2443][SQL] Fix slow read from partitioned tables
Zongheng Yang <zongheng.y@gmail.com>
2014-07-14 13:22:24 -0700
Commit: d60b09b, github.com/apache/spark/pull/1408
move some test file to match src code
Daoyuan <daoyuan.wang@intel.com>
2014-07-14 10:40:44 -0700
Commit: 38ccd6e, github.com/apache/spark/pull/1401
Made rdd.py pep8 complaint by using Autopep8 and a little manual editing.
Prashant Sharma <prashant.s@imaginea.com>
2014-07-14 00:42:59 -0700
Commit: aab5349, github.com/apache/spark/pull/1354
SPARK-2363. Clean MLlib's sample data files
Sean Owen <sowen@cloudera.com>
2014-07-13 19:27:43 -0700
Commit: 635888c, github.com/apache/spark/pull/1394
SPARK-2462. Make Vector.apply public.
Sandy Ryza <sandy@cloudera.com>
2014-07-12 16:55:15 -0700
Commit: 4c8be64, github.com/apache/spark/pull/1389
[SPARK-2405][SQL] Reusue same byte buffers when creating new instance of InMemoryRelation
Michael Armbrust <michael@databricks.com>
2014-07-12 12:13:32 -0700
Commit: 1a7d7cc, github.com/apache/spark/pull/1332
[SPARK-2441][SQL] Add more efficient distinct operator.
Michael Armbrust <michael@databricks.com>
2014-07-12 12:07:27 -0700
Commit: 7e26b57, github.com/apache/spark/pull/1366
[SPARK-2455] Mark (Shippable)VertexPartition serializable
Ankur Dave <ankurdave@gmail.com>
2014-07-12 12:05:34 -0700
Commit: 7a01352, github.com/apache/spark/pull/1376
Use the Executor's ClassLoader in sc.objectFile().
Daniel Darabos <darabos.daniel@gmail.com>
2014-07-12 00:07:42 -0700
Commit: 2245c87, github.com/apache/spark/pull/181
use specialized axpy in RowMatrix for SVD
Li Pu <lpu@twitter.com>, Xiangrui Meng <meng@databricks.com>, Li Pu <li.pu@outlook.com>
2014-07-11 23:26:47 -0700
Commit: d38887b, github.com/apache/spark/pull/1378
[SPARK-1969][MLlib] Online summarizer APIs for mean, variance, min, and max
DB Tsai <dbtsai@dbtsai.com>
2014-07-11 23:04:43 -0700
Commit: 5596086, github.com/apache/spark/pull/955
[SPARK-2457] Inconsistent description in README about build option
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-07-11 21:10:26 -0700
Commit: cbff187, github.com/apache/spark/pull/1382
[SPARK-2437] Rename MAVEN_PROFILES to SBT_MAVEN_PROFILES and add SBT_MAVEN_PROPERTIES
Prashant Sharma <prashant.s@imaginea.com>
2014-07-11 11:52:35 -0700
Commit: b23e9c3, github.com/apache/spark/pull/1374
[Minor] Remove unused val in Master
Andrew Or <andrewor14@gmail.com>
2014-07-11 00:21:16 -0700
Commit: f4f46de, github.com/apache/spark/pull/1365
fix Graph partitionStrategy comment
CrazyJvm <crazyjvm@gmail.com>
2014-07-11 00:02:24 -0700
Commit: 282cca0, github.com/apache/spark/pull/1368
[SPARK-2358][MLLIB] Add an option to include native BLAS/LAPACK loader in the build
Xiangrui Meng <meng@databricks.com>
2014-07-10 21:57:54 -0700
Commit: 2f59ce7, github.com/apache/spark/pull/1295
[SPARK-2428][SQL] Add except and intersect methods to SchemaRDD.
Takuya UESHIN <ueshin@happy-camper.st>
2014-07-10 19:27:24 -0700
Commit: 10b59ba, github.com/apache/spark/pull/1355
[SPARK-2415] [SQL] RowWriteSupport should handle empty ArrayType correctly.
Takuya UESHIN <ueshin@happy-camper.st>
2014-07-10 19:23:44 -0700
Commit: f5abd27, github.com/apache/spark/pull/1339
[SPARK-2431][SQL] Refine StringComparison and related codes.
Takuya UESHIN <ueshin@happy-camper.st>
2014-07-10 19:20:00 -0700
Commit: f62c427, github.com/apache/spark/pull/1357
SPARK-2427: Fix Scala examples that use the wrong command line arguments index
Artjom-Metro <Artjom-Metro@users.noreply.github.com>, Artjom-Metro <artjom31415@googlemail.com>
2014-07-10 16:03:30 -0700
Commit: ae8ca4d, github.com/apache/spark/pull/1353
[SPARK-1341] [Streaming] Throttle BlockGenerator to limit rate of data consumption.
Issac Buenrostro <buenrostro@ooyala.com>
2014-07-10 16:01:08 -0700
Commit: 2dd6724, github.com/apache/spark/pull/945
[SPARK-1478].3: Upgrade FlumeInputDStream's FlumeReceiver to support FLUME-1915
tmalaska <ted.malaska@cloudera.com>, Tathagata Das <tathagata.das1565@gmail.com>
2014-07-10 13:15:02 -0700
Commit: 40a8fef, github.com/apache/spark/pull/1347
name ec2 instances and security groups consistently
Nicholas Chammas <nicholas.chammas@gmail.com>, nchammas <nicholas.chammas@gmail.com>
2014-07-10 12:56:00 -0700
Commit: 369aa84, github.com/apache/spark/pull/1344
HOTFIX: Minor doc update for sbt change
Patrick Wendell <pwendell@gmail.com>
2014-07-10 11:10:43 -0700
Commit: 88006a6
[SPARK-1776] Have Spark's SBT build read dependencies from Maven.
Prashant Sharma <prashant.s@imaginea.com>, Patrick Wendell <pwendell@gmail.com>
2014-07-10 11:03:37 -0700
Commit: 628932b, github.com/apache/spark/pull/772
SPARK-2115: Stage kill link is too close to stage details link
Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
2014-07-10 01:18:37 -0700
Commit: c2babc0, github.com/apache/spark/pull/1350
Clean up SparkKMeans example's code
Raymond Liu <raymond.liu@intel.com>
2014-07-09 23:39:29 -0700
Commit: 2b18ea9, github.com/apache/spark/pull/1352
HOTFIX: Remove persistently failing test in master.
Patrick Wendell <pwendell@gmail.com>
2014-07-09 19:44:24 -0700
Commit: 553c578
Revert "[HOTFIX] Synchronize on SQLContext.settings in tests."
Patrick Wendell <pwendell@gmail.com>
2014-07-09 19:36:38 -0700
Commit: dd22bc2
SPARK-2416: Allow richer reporting of unit test results
Patrick Wendell <pwendell@gmail.com>
2014-07-09 19:26:16 -0700
Commit: 2e0a037, github.com/apache/spark/pull/1340
SPARK-1782: svd for sparse matrix using ARPACK
Li Pu <lpu@twitter.com>, Xiangrui Meng <meng@databricks.com>, Li Pu <li.pu@outlook.com>
2014-07-09 12:15:08 -0700
Commit: 1f33e1f, github.com/apache/spark/pull/964
[SPARK-2417][MLlib] Fix DecisionTree tests
johnnywalleye <jsondag@gmail.com>
2014-07-09 11:06:34 -0700
Commit: d35e3db, github.com/apache/spark/pull/1343
[STREAMING] SPARK-2343: Fix QueueInputDStream with oneAtATime false
Manuel Laflamme <manuel.laflamme@gmail.com>
2014-07-09 10:45:45 -0700
Commit: 0eb1152, github.com/apache/spark/pull/1285
[SPARK-2384] Add tooltips to UI.
Kay Ousterhout <kayousterhout@gmail.com>
2014-07-08 22:57:21 -0700
Commit: 339441f, github.com/apache/spark/pull/1314
[SPARK-2152][MLlib] fix bin offset in DecisionTree node aggregations (also resolves SPARK-2160)
johnnywalleye <jsondag@gmail.com>
2014-07-08 19:17:26 -0700
Commit: 1114207, github.com/apache/spark/pull/1316
[SPARK-2413] Upgrade junit_xml_listener to 0.5.1
DB Tsai <dbtsai@alpinenow.com>
2014-07-08 17:50:36 -0700
Commit: ac9cdc1, github.com/apache/spark/pull/1333
[SPARK-2392] Executors should not start their own HTTP servers
Andrew Or <andrewor14@gmail.com>
2014-07-08 17:35:31 -0700
Commit: bf04a39, github.com/apache/spark/pull/1335
[SPARK-2362] Fix for newFilesOnly logic in file DStream
Gabriele Nizzoli <mail@nizzoli.net>
2014-07-08 14:23:38 -0700
Commit: e6f7bfc, github.com/apache/spark/pull/1077
[SPARK-2409] Make SQLConf thread safe.
Reynold Xin <rxin@apache.org>
2014-07-08 14:00:47 -0700
Commit: 32516f8, github.com/apache/spark/pull/1334
SPARK-2400 : fix spark.yarn.max.executor.failures explaination
CrazyJvm <crazyjvm@gmail.com>
2014-07-08 13:55:42 -0500
Commit: b520b64, github.com/apache/spark/pull/1282
[SPARK-2403] Catch all errors during serialization in DAGScheduler
Daniel Darabos <darabos.daniel@gmail.com>
2014-07-08 10:43:46 -0700
Commit: c8a2313, github.com/apache/spark/pull/1329
[SPARK-2395][SQL] Optimize common LIKE patterns.
Michael Armbrust <michael@databricks.com>
2014-07-08 10:36:18 -0700
Commit: cc3e0a1, github.com/apache/spark/pull/1325
[EC2] Add default history server port to ec2 script
Andrew Or <andrewor14@gmail.com>
2014-07-08 16:49:31 +0900
Commit: 56e009d, github.com/apache/spark/pull/1296
[SPARK-2391][SQL] Custom take() for LIMIT queries.
Michael Armbrust <michael@databricks.com>
2014-07-08 00:41:46 -0700
Commit: 5a40636, github.com/apache/spark/pull/1318
Resolve sbt warnings during build â…¡
witgo <witgo@qq.com>
2014-07-08 00:31:42 -0700
Commit: 3cd5029, github.com/apache/spark/pull/1153
Updated programming-guide.md
Rishi Verma <riverma@apache.org>
2014-07-08 00:29:23 -0700
Commit: 0128905, github.com/apache/spark/pull/1324
[SPARK-2235][SQL]Spark SQL basicOperator add Intersect operator
Yanjie Gao <gaoyanjie55@163.com>, YanjieGao <396154235@qq.com>
2014-07-07 19:40:04 -0700
Commit: 50561f4, github.com/apache/spark/pull/1150
[SPARK-2376][SQL] Selecting list values inside nested JSON objects raises java.lang.IllegalArgumentException
Yin Huai <huai@cse.ohio-state.edu>
2014-07-07 18:37:38 -0700
Commit: 4352a2f, github.com/apache/spark/pull/1320
[SPARK-2375][SQL] JSON schema inference may not resolve type conflicts correctly for a field inside an array of structs
Yin Huai <huaiyin.thu@gmail.com>
2014-07-07 17:05:59 -0700
Commit: f0496ee, github.com/apache/spark/pull/1308
[SPARK-2386] [SQL] RowWriteSupport should use the exact types to cast.
Takuya UESHIN <ueshin@happy-camper.st>
2014-07-07 17:04:02 -0700
Commit: 4deeed1, github.com/apache/spark/pull/1315
[SPARK-2339][SQL] SQL parser in sql-core is case sensitive, but a table alias is converted to lower case when we create Subquery
Yin Huai <huai@cse.ohio-state.edu>
2014-07-07 17:01:44 -0700
Commit: c0b4cf0, github.com/apache/spark/pull/1317
[SPARK-1977][MLLIB] register mutable BitSet in MovieLenseALS
Neville Li <neville@spotify.com>
2014-07-07 15:06:14 -0700
Commit: f7ce1b3, github.com/apache/spark/pull/1319
[SPARK-2327] [SQL] Fix nullabilities of Join/Generate/Aggregate.
Takuya UESHIN <ueshin@happy-camper.st>
2014-07-05 11:51:48 -0700
Commit: 9d5ecf8, github.com/apache/spark/pull/1266
[SPARK-2366] [SQL] Add column pruning for the right side of LeftSemi join.
Takuya UESHIN <ueshin@happy-camper.st>
2014-07-05 11:48:08 -0700
Commit: 3da8df9, github.com/apache/spark/pull/1301
[SPARK-2306]:BoundedPriorityQueue is private and not registered with Kry...
ankit.bhardwaj <ankit.bhardwaj@guavus.com>
2014-07-04 22:06:10 -0700
Commit: 42f3abd, github.com/apache/spark/pull/1299
[SPARK-2370][SQL] Decrease metadata retrieved for partitioned hive queries.
Michael Armbrust <michael@databricks.com>
2014-07-04 19:15:48 -0700
Commit: 9d006c9, github.com/apache/spark/pull/1305
Added SignalLogger to HistoryServer.
Reynold Xin <rxin@apache.org>
2014-07-04 17:33:07 -0700
Commit: 0db5d5a, github.com/apache/spark/pull/1300
HOTFIX: Clean before building docs during release.
Patrick Wendell <pwendell@gmail.com>
2014-07-04 10:01:19 -0700
Commit: fc71658
[SPARK-2234][SQL]Spark SQL basicOperators add Except operator
Yanjie Gao <gaoyanjie55@163.com>, YanjieGao <396154235@qq.com>, root <root@node4.(none)>, gaoyanjie <gaoyanjie55@163.com>
2014-07-04 02:43:57 -0700
Commit: 5dadda8, github.com/apache/spark/pull/1151
[SPARK-2059][SQL] Add analysis checks
Reynold Xin <rxin@apache.org>, Michael Armbrust <michael@databricks.com>
2014-07-04 00:53:41 -0700
Commit: b3e768e, github.com/apache/spark/pull/1265
Update SQLConf.scala
baishuo(白硕) <vc_java@hotmail.com>
2014-07-04 00:25:31 -0700
Commit: 0bbe612, github.com/apache/spark/pull/1272
[SPARK-1199][REPL] Remove VALId and use the original import style for defined classes.
Prashant Sharma <prashant.s@imaginea.com>
2014-07-04 00:05:27 -0700
Commit: d434150, github.com/apache/spark/pull/1179
[SPARK-2059][SQL] Don't throw TreeNodeException in `execution.ExplainCommand`
Cheng Lian <lian.cs.zju@gmail.com>
2014-07-03 23:41:54 -0700
Commit: 5448804, github.com/apache/spark/pull/1294
SPARK-2282: Reuse PySpark Accumulator sockets to avoid crashing Spark
Aaron Davidson <aaron@databricks.com>
2014-07-03 23:02:36 -0700
Commit: 97a0bfe, github.com/apache/spark/pull/1220
[SPARK-2307][Reprise] Correctly report RDD blocks on SparkUI
Andrew Or <andrewor14@gmail.com>
2014-07-03 22:48:23 -0700
Commit: 3894a49, github.com/apache/spark/pull/1255
[SPARK-2350] Don't NPE while launching drivers
Aaron Davidson <aaron@databricks.com>
2014-07-03 22:31:41 -0700
Commit: 586feb5, github.com/apache/spark/pull/1289
[SPARK-1097] Workaround Hadoop conf ConcurrentModification issue
Raymond Liu <raymond.liu@intel.com>
2014-07-03 19:24:22 -0700
Commit: 5fa0a05, github.com/apache/spark/pull/1273
Streaming programming guide typos
Clément MATHIEU <clement@unportant.info>
2014-07-03 18:31:18 -0700
Commit: fdc4c11, github.com/apache/spark/pull/1286
[HOTFIX] Synchronize on SQLContext.settings in tests.
Zongheng Yang <zongheng.y@gmail.com>
2014-07-03 17:37:53 -0700
Commit: d4c30cd, github.com/apache/spark/pull/1277
[SPARK-2109] Setting SPARK_MEM for bin/pyspark does not work.
Prashant Sharma <prashant.s@imaginea.com>
2014-07-03 15:06:58 -0700
Commit: 731f683b, github.com/apache/spark/pull/1050
[SPARK-2342] Evaluation helper's output type doesn't conform to input ty...
Yijie Shen <henry.yijieshen@gmail.com>
2014-07-03 13:22:13 -0700
Commit: a9b52e5, github.com/apache/spark/pull/1283
SPARK-1675. Make clear whether computePrincipalComponents requires centered data
Sean Owen <sowen@cloudera.com>
2014-07-03 11:54:51 -0700
Commit: 2b36344, github.com/apache/spark/pull/1171
[SPARK] Fix NPE for ExternalAppendOnlyMap
Andrew Or <andrewor14@gmail.com>
2014-07-03 10:26:50 -0700
Commit: c480537, github.com/apache/spark/pull/1288
[SPARK-2324] SparkContext should not exit directly when spark.local.dir is a list of multiple paths and one of them has error
yantangzhai <tyz0303@163.com>
2014-07-03 10:14:35 -0700
Commit: 3bbeca6, github.com/apache/spark/pull/1274
[SPARK-2287] [SQL] Make ScalaReflection be able to handle Generic case classes.
Takuya UESHIN <ueshin@happy-camper.st>
2014-07-02 10:10:36 -0700
Commit: bc7041a, github.com/apache/spark/pull/1226
[SPARK-2328] [SQL] Add execution of `SHOW TABLES` before `TestHive.reset()`.
Takuya UESHIN <ueshin@happy-camper.st>
2014-07-02 10:07:01 -0700
Commit: 1e2c26c, github.com/apache/spark/pull/1268
SPARK-2186: Spark SQL DSL support for simple aggregations such as SUM and AVG
Ximo Guanter Gonzalbez <ximo@tid.es>
2014-07-02 10:03:44 -0700
Commit: 5c6ec94, github.com/apache/spark/pull/1211
update the comments in SqlParser
CodingCat <zhunansjtu@gmail.com>
2014-07-01 20:37:10 -0700
Commit: 6596392, github.com/apache/spark/pull/1275
[SPARK-2185] Emit warning when task size exceeds a threshold.
Kay Ousterhout <kayousterhout@gmail.com>
2014-07-01 01:56:51 -0700
Commit: 05c3d90, github.com/apache/spark/pull/1149
SPARK-2332 [build] add exclusion for old servlet-api on hadoop-client in core
Peter MacKinnon <pmackinn@redhat.com>
2014-07-01 00:28:38 -0700
Commit: 3319a3e, github.com/apache/spark/pull/1271
SPARK-2293. Replace RDD.zip usage by map with predict inside.
Sean Owen <sowen@cloudera.com>
2014-06-30 16:03:38 -0700
Commit: 04fa122, github.com/apache/spark/pull/1250
[SPARK-2318] When exiting on a signal, print the signal name first.
Reynold Xin <rxin@apache.org>
2014-06-30 15:12:38 -0700
Commit: 5fccb56, github.com/apache/spark/pull/1260
[SPARK-2322] Exception in resultHandler should NOT crash DAGScheduler and shutdown SparkContext.
Reynold Xin <rxin@apache.org>
2014-06-30 11:50:22 -0700
Commit: 358ae15, github.com/apache/spark/pull/1264
SPARK-2077 Log serializer that actually ends up being used
Andrew Ash <andrew@andrewash.com>
2014-06-29 23:29:05 -0700
Commit: 6803642, github.com/apache/spark/pull/1017
SPARK-897: preemptively serialize closures
William Benton <willb@redhat.com>
2014-06-29 23:27:34 -0700
Commit: a484030, github.com/apache/spark/pull/143
[SPARK-2104] Fix task serializing issues when sort with Java non serializable class
jerryshao <saisai.shao@intel.com>
2014-06-29 23:00:00 -0700
Commit: 66135a3, github.com/apache/spark/pull/1245
[SPARK-1683] Track task read metrics.
Kay Ousterhout <kayousterhout@gmail.com>
2014-06-29 22:01:42 -0700
Commit: 7b71a0e, github.com/apache/spark/pull/962
[SPARK-2320] Reduce exception/code block font size in web ui
Reynold Xin <rxin@apache.org>
2014-06-29 16:46:28 -0700
Commit: cdf613f, github.com/apache/spark/pull/1261
Improve MapOutputTracker error logging.
Reynold Xin <rxin@apache.org>
2014-06-28 21:05:03 -0700
Commit: 2053d79, github.com/apache/spark/pull/1258
[SPARK-1394] Remove SIGCHLD handler in worker subprocess
Matthew Farrellee <matt@redhat.com>
2014-06-28 18:39:27 -0700
Commit: 3c104c7, github.com/apache/spark/pull/1247
[SPARK-2233] make-distribution script should list the git hash in the RELEASE file
Guillaume Ballet <gballet@gmail.com>
2014-06-28 13:07:12 -0700
Commit: b8f2e13, github.com/apache/spark/pull/1216
[SPARK-2003] Fix python SparkContext example
Matthew Farrellee <matt@redhat.com>
2014-06-27 18:20:33 -0700
Commit: 0e0686d, github.com/apache/spark/pull/1246
[SPARK-2259] Fix highly misleading docs on cluster / client deploy modes
Andrew Or <andrewor14@gmail.com>
2014-06-27 16:11:31 -0700
Commit: f17510e, github.com/apache/spark/pull/1200
[SPARK-2307] SparkUI - storage tab displays incorrect RDDs
Andrew Or <andrewor14@gmail.com>
2014-06-27 15:23:25 -0700
Commit: 21e0f77, github.com/apache/spark/pull/1249
SPARK-2181:The keys for sorting the columns of Executor page in SparkUI are incorrect
witgo <witgo@qq.com>
2014-06-26 21:59:21 -0700
Commit: 18f29b9, github.com/apache/spark/pull/1135
[SPARK-2251] fix concurrency issues in random sampler
Xiangrui Meng <meng@databricks.com>
2014-06-26 21:46:55 -0700
Commit: c23f5db, github.com/apache/spark/pull/1229
[SPARK-2297][UI] Make task attempt and speculation more explicit in UI.
Reynold Xin <rxin@apache.org>
2014-06-26 21:13:26 -0700
Commit: d1636dd, github.com/apache/spark/pull/1236
Removed throwable field from FetchFailedException and added MetadataFetchFailedException
Reynold Xin <rxin@apache.org>
2014-06-26 21:12:16 -0700
Commit: bf578de, github.com/apache/spark/pull/1227
[SQL]Extract the joinkeys from join condition
Cheng Hao <hao.cheng@intel.com>
2014-06-26 19:18:11 -0700
Commit: 981bde9, github.com/apache/spark/pull/1190
Strip '@' symbols when merging pull requests.
Patrick Wendell <pwendell@gmail.com>
2014-06-26 17:09:24 -0700
Commit: f1f7385, github.com/apache/spark/pull/1239
Fixing AWS instance type information based upon current EC2 data
Zichuan Ye <jerry@tangentds.com>
2014-06-26 15:21:29 -0700
Commit: 62d4a0f, github.com/apache/spark/pull/1156
[SPARK-2286][UI] Report exception/errors for failed tasks that are not ExceptionFailure
Reynold Xin <rxin@apache.org>
2014-06-26 14:00:45 -0700
Commit: 6587ef7, github.com/apache/spark/pull/1225
[SPARK-2295] [SQL] Make JavaBeans nullability stricter.
Takuya UESHIN <ueshin@happy-camper.st>
2014-06-26 13:37:19 -0700
Commit: 32a1ad7, github.com/apache/spark/pull/1235
Remove use of spark.worker.instances
Kay Ousterhout <kayousterhout@gmail.com>
2014-06-26 08:20:27 -0500
Commit: 48a82a8, github.com/apache/spark/pull/1214
[SPARK-2254] [SQL] ScalaRefection should mark primitive types as non-nullable.
Takuya UESHIN <ueshin@happy-camper.st>
2014-06-25 23:55:31 -0700
Commit: e4899a2, github.com/apache/spark/pull/1193
[SPARK-2172] PySpark cannot import mllib modules in YARN-client mode
Szul, Piotr <Piotr.Szul@csiro.au>
2014-06-25 21:55:49 -0700
Commit: 441cdcc, github.com/apache/spark/pull/1223
[SPARK-2284][UI] Mark all failed tasks as failures.
Reynold Xin <rxin@apache.org>
2014-06-25 22:35:03 -0700
Commit: 4a346e2, github.com/apache/spark/pull/1224
[SPARK-1749] Job cancellation when SchedulerBackend does not implement killTask
Mark Hamstra <markhamstra@gmail.com>, Kay Ousterhout <kayousterhout@gmail.com>
2014-06-25 20:57:48 -0700
Commit: b88a59a, github.com/apache/spark/pull/1219
[SPARK-2283][SQL] Reset test environment before running PruningSuite
Cheng Lian <lian.cs.zju@gmail.com>
2014-06-25 18:41:47 -0700
Commit: 7f196b0, github.com/apache/spark/pull/1221
[SQL] SPARK-1800 Add broadcast hash join operator & associated hints.
Zongheng Yang <zongheng.y@gmail.com>, Michael Armbrust <michael@databricks.com>
2014-06-25 18:06:33 -0700
Commit: 9d824fe, github.com/apache/spark/pull/1163
[SPARK-2204] Launch tasks on the proper executors in mesos fine-grained mode
Sebastien Rainville <sebastien@hopper.com>
2014-06-25 13:21:18 -0700
Commit: 1132e47, github.com/apache/spark/pull/1140
[SPARK-2270] Kryo cannot serialize results returned by asJavaIterable
Reynold Xin <rxin@apache.org>
2014-06-25 12:43:22 -0700
Commit: 7ff2c75, github.com/apache/spark/pull/1206
[SPARK-2258 / 2266] Fix a few worker UI bugs
Andrew Or <andrewor14@gmail.com>
2014-06-25 12:23:08 -0700
Commit: 9aa6032, github.com/apache/spark/pull/1213
[SPARK-2242] HOTFIX: pyspark shell hangs on simple job
Andrew Or <andrewor14@gmail.com>
2014-06-25 10:47:22 -0700
Commit: 5603e4c, github.com/apache/spark/pull/1178
Replace doc reference to Shark with Spark SQL.
Reynold Xin <rxin@apache.org>
2014-06-25 01:01:23 -0700
Commit: ac06a85
SPARK-2038: rename "conf" parameters in the saveAsHadoop functions with source-compatibility
CodingCat <zhunansjtu@gmail.com>
2014-06-25 00:23:32 -0700
Commit: acc01ab, github.com/apache/spark/pull/1137
[BUGFIX][SQL] Should match java.math.BigDecimal when wnrapping Hive output
Cheng Lian <lian.cs.zju@gmail.com>
2014-06-25 00:17:28 -0700
Commit: 22036ae, github.com/apache/spark/pull/1199
[SPARK-2263][SQL] Support inserting MAP<K, V> to Hive tables
Cheng Lian <lian.cs.zju@gmail.com>
2014-06-25 00:14:34 -0700
Commit: 8fade89, github.com/apache/spark/pull/1205
SPARK-2248: spark.default.parallelism does not apply in local mode
witgo <witgo@qq.com>
2014-06-24 19:44:37 -0700
Commit: b6b4485, github.com/apache/spark/pull/1194
Fix possible null pointer in acumulator toString
Michael Armbrust <michael@databricks.com>
2014-06-24 19:39:19 -0700
Commit: 2714968, github.com/apache/spark/pull/1204
Autodetect JAVA_HOME on RPM-based systems
Matthew Farrellee <matt@redhat.com>
2014-06-24 19:31:20 -0700
Commit: 54055fb, github.com/apache/spark/pull/1185
[SQL]Add base row updating methods for JoinedRow
Cheng Hao <hao.cheng@intel.com>
2014-06-24 19:07:02 -0700
Commit: 133495d, github.com/apache/spark/pull/1187
[SPARK-1112, 2156] Bootstrap to fetch the driver's Spark properties.
Xiangrui Meng <meng@databricks.com>
2014-06-24 19:06:07 -0700
Commit: 8ca4176, github.com/apache/spark/pull/1132
[SPARK-2264][SQL] Fix failing CachedTableSuite
Michael Armbrust <michael@databricks.com>
2014-06-24 19:04:29 -0700
Commit: a162c9b, github.com/apache/spark/pull/1201
Fix broken Json tests.
Kay Ousterhout <kayousterhout@gmail.com>
2014-06-24 16:54:50 -0700
Commit: 1978a90, github.com/apache/spark/pull/1198
HOTFIX: Disabling tests per SPARK-2264
Patrick Wendell <pwendell@gmail.com>
2014-06-24 15:09:30 -0700
Commit: 221909e
SPARK-1937: fix issue with task locality
Rui Li <rui.li@intel.com>, lirui-intel <rui.li@intel.com>
2014-06-24 11:40:37 -0700
Commit: 924b708, github.com/apache/spark/pull/892
[SPARK-2252] Fix MathJax for HTTPs.
Reynold Xin <rxin@apache.org>
2014-06-23 23:18:47 -0700
Commit: 420c1c3, github.com/apache/spark/pull/1189
[SPARK-2124] Move aggregation into shuffle implementations
jerryshao <saisai.shao@intel.com>
2014-06-23 20:25:46 -0700
Commit: 56eb8af, github.com/apache/spark/pull/1064
[SPARK-2227] Support dfs command in SQL.
Reynold Xin <rxin@apache.org>
2014-06-23 18:34:54 -0700
Commit: 51c8168, github.com/apache/spark/pull/1167
Cleanup on Connection, ConnectionManagerId, ConnectionManager classes part 2
Henry Saputra <henry.saputra@gmail.com>
2014-06-23 17:13:26 -0700
Commit: 383bf72, github.com/apache/spark/pull/1157
[SPARK-1768] History server enhancements.
Marcelo Vanzin <vanzin@cloudera.com>
2014-06-23 13:53:44 -0700
Commit: 21ddd7d, github.com/apache/spark/pull/718
[SPARK-2118] spark class should complain if tools jar is missing.
Prashant Sharma <prashant.s@imaginea.com>
2014-06-23 13:35:09 -0700
Commit: 6dc6722, github.com/apache/spark/pull/1068
[SPARK-1669][SQL] Made cacheTable idempotent
Cheng Lian <lian.cs.zju@gmail.com>
2014-06-23 13:24:33 -0700
Commit: a4bc442, github.com/apache/spark/pull/1183
Fix mvn detection
Matthew Farrellee <matt@redhat.com>
2014-06-23 11:24:05 -0700
Commit: 853a2b9, github.com/apache/spark/pull/1181
Fixed small running on YARN docs typo
Vlad <frolvlad@gmail.com>
2014-06-23 10:55:49 -0500
Commit: b88238f, github.com/apache/spark/pull/1158
[SPARK-1395] Fix "local:" URI support in Yarn mode (again).
Marcelo Vanzin <vanzin@cloudera.com>
2014-06-23 08:51:11 -0500
Commit: e380767, github.com/apache/spark/pull/560
SPARK-2166 - Listing of instances to be terminated before the prompt
Jean-Martin Archer <jeanmartin.archer@pulseenergy.com>
2014-06-22 20:52:02 -0700
Commit: 9cb64b2, github.com/apache/spark/pull/270
SPARK-2241: quote command line args in ec2 script
Ori Kremer <ori.kremer@gmail.com>
2014-06-22 20:21:23 -0700
Commit: 9fc373e, github.com/apache/spark/pull/1169
SPARK-2229: FileAppender throw an llegalArgumentException in jdk6
witgo <witgo@qq.com>
2014-06-22 18:25:16 -0700
Commit: 409d24e, github.com/apache/spark/pull/1174
SPARK-1316. Remove use of Commons IO
Sean Owen <sowen@cloudera.com>
2014-06-22 11:47:49 -0700
Commit: 9fe28c3, github.com/apache/spark/pull/1173
SPARK-2034. KafkaInputDStream doesn't close resources and may prevent JVM shutdown
Sean Owen <sowen@cloudera.com>
2014-06-22 01:12:15 -0700
Commit: 476581e, github.com/apache/spark/pull/980
SPARK-2231: dev/run-tests should include YARN and use a recent Hadoop version
Patrick Wendell <pwendell@gmail.com>
2014-06-22 00:55:27 -0700
Commit: 58b32f3, github.com/apache/spark/pull/1175
SPARK-1996. Remove use of special Maven repo for Akka
Sean Owen <sowen@cloudera.com>
2014-06-21 23:29:57 -0700
Commit: 1db9cbc, github.com/apache/spark/pull/1170
HOTFIX: Add excludes for new MIMA files
Patrick Wendell <pwendell@gmail.com>
2014-06-21 15:20:15 -0700
Commit: 3e0b078
HOTFIX: Fix missing MIMA ignore
Patrick Wendell <pwendell@gmail.com>
2014-06-21 13:02:49 -0700
Commit: 0a432d6
[SQL] Break hiveOperators.scala into multiple files.
Reynold Xin <rxin@apache.org>
2014-06-21 12:04:18 -0700
Commit: ec935ab, github.com/apache/spark/pull/1166
[SQL] Pass SQLContext instead of SparkContext into physical operators.
Reynold Xin <rxin@apache.org>
2014-06-20 22:49:48 -0700
Commit: ca5d8b5, github.com/apache/spark/pull/1164
Fix some tests.
Marcelo Vanzin <vanzin@cloudera.com>
2014-06-20 20:05:12 -0700
Commit: 648553d, github.com/apache/spark/pull/917
[SPARK-2061] Made splits deprecated in JavaRDDLike
Anant <anant.asty@gmail.com>
2014-06-20 18:54:00 -0700
Commit: 010c460, github.com/apache/spark/pull/1062
HOTFIX: Fixing style error introduced by 08d0ac
Patrick Wendell <pwendell@gmail.com>
2014-06-20 18:44:54 -0700
Commit: a678642
[SPARK-1970] Update unit test in XORShiftRandomSuite to use ChiSquareTest from commons-math3
Doris Xin <doris.s.xin@gmail.com>
2014-06-20 18:42:02 -0700
Commit: e99903b, github.com/apache/spark/pull/1073
SPARK-1902 Silence stacktrace from logs when doing port failover to port n+1
Andrew Ash <andrew@andrewash.com>
2014-06-20 18:25:33 -0700
Commit: 08d0aca, github.com/apache/spark/pull/1019
[SQL] Use hive.SessionState, not the thread local SessionState
Aaron Davidson <aaron@databricks.com>
2014-06-20 17:55:54 -0700
Commit: 2044784, github.com/apache/spark/pull/1148
Move ScriptTransformation into the appropriate place.
Reynold Xin <rxin@apache.org>
2014-06-20 17:16:56 -0700
Commit: d4c7572, github.com/apache/spark/pull/1162
Clean up CacheManager et al.
Andrew Or <andrewor14@gmail.com>
2014-06-20 17:14:33 -0700
Commit: 01125a1, github.com/apache/spark/pull/1083
[SPARK-2225] Turn HAVING without GROUP BY into WHERE.
Reynold Xin <rxin@apache.org>
2014-06-20 15:38:02 -0700
Commit: 0ac71d1, github.com/apache/spark/pull/1161
SPARK-2180: support HAVING clauses in Hive queries
William Benton <willb@redhat.com>
2014-06-20 13:41:38 -0700
Commit: 171ebb3, github.com/apache/spark/pull/1136
SPARK-1868: Users should be allowed to cogroup at least 4 RDDs
Allan Douglas R. de Oliveira <allandouglas@gmail.com>
2014-06-20 11:03:03 -0700
Commit: 6a224c3, github.com/apache/spark/pull/813
[SPARK-2163] class LBFGS optimize with Double tolerance instead of Int
Gang Bai <me@baigang.net>
2014-06-20 08:52:20 -0700
Commit: d484dde, github.com/apache/spark/pull/1104
[SPARK-2218] rename Equals to EqualTo in Spark SQL expressions.
Reynold Xin <rxin@apache.org>
2014-06-20 00:34:59 -0700
Commit: 2f6a835, github.com/apache/spark/pull/1146
[SPARK-2196] [SQL] Fix nullability of CaseWhen.
Takuya UESHIN <ueshin@happy-camper.st>
2014-06-20 00:12:52 -0700
Commit: 3249528, github.com/apache/spark/pull/1133
SPARK-2203: PySpark defaults to use same num reduce partitions as map side
Aaron Davidson <aaron@databricks.com>
2014-06-20 00:06:57 -0700
Commit: f46e02f, github.com/apache/spark/pull/1138
[SPARK-2209][SQL] Cast shouldn't do null check twice.
Reynold Xin <rxin@apache.org>
2014-06-20 00:01:19 -0700
Commit: c55bbb4, github.com/apache/spark/pull/1143
[SPARK-2210] cast to boolean on boolean value gets turned into NOT((boolean_condition) = 0)
Reynold Xin <rxin@apache.org>
2014-06-19 23:58:23 -0700
Commit: 6175640, github.com/apache/spark/pull/1144
SPARK-1293 [SQL] Parquet support for nested types
Andre Schumacher <andre.schumacher@iki.fi>, Michael Armbrust <michael@databricks.com>
2014-06-19 23:47:45 -0700
Commit: f479cf3, github.com/apache/spark/pull/360
[SPARK-2177][SQL] describe table result contains only one column
Yin Huai <huai@cse.ohio-state.edu>
2014-06-19 23:41:38 -0700
Commit: f397e92, github.com/apache/spark/pull/1118
[SQL] Improve Speed of InsertIntoHiveTable
Michael Armbrust <michael@databricks.com>
2014-06-19 23:39:03 -0700
Commit: d3b7671, github.com/apache/spark/pull/1130
More minor scaladoc cleanup for Spark SQL.
Reynold Xin <rxin@apache.org>
2014-06-19 22:34:21 -0700
Commit: 278ec8a, github.com/apache/spark/pull/1142
HOTFIX: SPARK-2208 local metrics tests can fail on fast machines
Patrick Wendell <pwendell@gmail.com>
2014-06-19 21:06:28 -0700
Commit: e551479, github.com/apache/spark/pull/1141
A few minor Spark SQL Scaladoc fixes.
Reynold Xin <rxin@apache.org>
2014-06-19 18:24:05 -0700
Commit: 5464e79, github.com/apache/spark/pull/1139
[SPARK-2151] Recognize memory format for spark-submit
nravi <nravi@c1704.halxg.cloudera.com>
2014-06-19 17:11:06 -0700
Commit: f14b00a, github.com/apache/spark/pull/1095
[SPARK-2191][SQL] Make sure InsertIntoHiveTable doesn't execute more than once.
Michael Armbrust <michael@databricks.com>
2014-06-19 14:14:03 -0700
Commit: 777c595, github.com/apache/spark/pull/1129
[SPARK-2051]In yarn.ClientBase spark.yarn.dist.* do not work
witgo <witgo@qq.com>
2014-06-19 12:11:26 -0500
Commit: bce0897, github.com/apache/spark/pull/969
Minor fix
WangTao <barneystinson@aliyun.com>
2014-06-18 23:24:57 -0700
Commit: 67fca18, github.com/apache/spark/pull/1105
[SPARK-2187] Explain should not run the optimizer twice.
Reynold Xin <rxin@apache.org>
2014-06-18 22:44:12 -0700
Commit: 640c294, github.com/apache/spark/pull/1123
Squishing a typo bug before it causes real harm
Doris Xin <doris.s.xin@gmail.com>
2014-06-18 22:19:06 -0700
Commit: 566f70f, github.com/apache/spark/pull/1125
[SPARK-2184][SQL] AddExchange isn't idempotent
Michael Armbrust <michael@databricks.com>
2014-06-18 17:52:42 -0700
Commit: 5ff75c7, github.com/apache/spark/pull/1122
Remove unicode operator from RDD.scala
Doris Xin <doris.s.xin@gmail.com>
2014-06-18 15:01:29 -0700
Commit: 45a95f8, github.com/apache/spark/pull/1119
SPARK-2158 Clean up core/stdout file from FileAppenderSuite
Mark Hamstra <markhamstra@gmail.com>
2014-06-18 14:56:41 -0700
Commit: 4cbeea8, github.com/apache/spark/pull/1100
[SPARK-1466] Raise exception if pyspark Gateway process doesn't start.
Kay Ousterhout <kayousterhout@gmail.com>
2014-06-18 13:16:26 -0700
Commit: 3870248, github.com/apache/spark/pull/383
Updated the comment for SPARK-2162.
Reynold Xin <rxin@apache.org>
2014-06-18 12:48:58 -0700
Commit: dd96fcd, github.com/apache/spark/pull/1117
[SPARK-2162] Double check in doGetLocal to avoid read on removed block.
Raymond Liu <raymond.liu@intel.com>
2014-06-18 10:57:45 -0700
Commit: 5ad5e34, github.com/apache/spark/pull/1103
[SPARK-2176][SQL] Extra unnecessary exchange operator in the result of an explain command
Yin Huai <huai@cse.ohio-state.edu>
2014-06-18 10:51:32 -0700
Commit: 587d320, github.com/apache/spark/pull/1116
[STREAMING] SPARK-2009 Key not found exception when slow receiver starts
Vadim Chekan <kot.begemot@gmail.com>
2014-06-17 22:03:50 -0700
Commit: 889f7b7, github.com/apache/spark/pull/961
Revert "SPARK-2038: rename "conf" parameters in the saveAsHadoop functions"
Patrick Wendell <pwendell@gmail.com>
2014-06-17 19:34:17 -0700
Commit: 9e4b4bd
[SPARK-2060][SQL] Querying JSON Datasets with SQL and DSL in Spark SQL
Yin Huai <huai@cse.ohio-state.edu>
2014-06-17 19:14:59 -0700
Commit: d2f4f30, github.com/apache/spark/pull/999
HOTFIX: bug caused by #941
Patrick Wendell <pwendell@gmail.com>
2014-06-17 15:09:24 -0700
Commit: b2ebf42, github.com/apache/spark/pull/1108
[SPARK-2147 / 2161] Show removed executors on the UI
Andrew Or <andrewor14@gmail.com>
2014-06-17 12:25:55 -0700
Commit: a14807e, github.com/apache/spark/pull/1102
SPARK-2038: rename "conf" parameters in the saveAsHadoop functions
CodingCat <zhunansjtu@gmail.com>
2014-06-17 12:17:48 -0700
Commit: 443f5e1, github.com/apache/spark/pull/1087
SPARK-2146. Fix takeOrdered doc
Sandy Ryza <sandy@cloudera.com>
2014-06-17 12:03:22 -0700
Commit: 2794990, github.com/apache/spark/pull/1086
SPARK-1063 Add .sortBy(f) method on RDD
Andrew Ash <andrew@andrewash.com>
2014-06-17 11:47:48 -0700
Commit: b92d16b, github.com/apache/spark/pull/369
[SPARK-2053][SQL] Add Catalyst expressions for CASE WHEN.
Zongheng Yang <zongheng.y@gmail.com>
2014-06-17 13:30:17 +0200
Commit: e243c5f, github.com/apache/spark/pull/1055
[SPARK-2164][SQL] Allow Hive UDF on columns of type struct
Xi Liu <xil@conviva.com>
2014-06-17 13:14:40 +0200
Commit: f5a4049, github.com/apache/spark/pull/796
[SPARK-2144] ExecutorsPage reports incorrect # of RDD blocks
Andrew Or <andrewor14@gmail.com>
2014-06-17 01:28:22 -0700
Commit: 09deb3e, github.com/apache/spark/pull/1080
SPARK-2035: Store call stack for stages, display it on the UI.
Daniel Darabos <darabos.daniel@gmail.com>, Patrick Wendell <pwendell@gmail.com>
2014-06-17 00:08:05 -0700
Commit: 23a12ce, github.com/apache/spark/pull/981
SPARK-1990: added compatibility for python 2.6 for ssh_read command
Anant <anant.asty@gmail.com>
2014-06-16 23:42:27 -0700
Commit: 8cd04c3, github.com/apache/spark/pull/941
[SPARK-2130] End-user friendly String repr for StorageLevel in Python
Kan Zhang <kzhang@apache.org>
2014-06-16 23:31:31 -0700
Commit: d81c08b, github.com/apache/spark/pull/1096
MLlib documentation fix
Anatoli Fomenko <fa@apache.org>
2014-06-16 23:10:36 -0700
Commit: 7afa912, github.com/apache/spark/pull/1098
Minor fix: made "EXPLAIN" output to play well with JDBC output format
Cheng Lian <lian.cs.zju@gmail.com>
2014-06-16 16:42:17 -0700
Commit: 237b96b, github.com/apache/spark/pull/1097
[SQL][SPARK-2094] Follow up of PR #1071 for Java API
Cheng Lian <lian.cs.zju@gmail.com>
2014-06-16 21:30:29 +0200
Commit: 273afcb, github.com/apache/spark/pull/1085
[SPARK-1930] The Container is running beyond physical memory limits, so as to be killed
witgo <witgo@qq.com>
2014-06-16 14:27:31 -0500
Commit: cdf2b04, github.com/apache/spark/pull/894
[SPARK-2010] Support for nested data in PySpark SQL
Kan Zhang <kzhang@apache.org>
2014-06-16 11:11:29 -0700
Commit: 4fdb491, github.com/apache/spark/pull/1041
SPARK-2039: apply output dir existence checking for all output formats
CodingCat <zhunansjtu@gmail.com>
2014-06-15 23:47:58 -0700
Commit: 716c88a, github.com/apache/spark/pull/1088
Updating docs to include missing information about reducers and clarify ...
Ali Ghodsi <alig@cs.berkeley.edu>
2014-06-15 23:44:30 -0700
Commit: 119b06a, github.com/apache/spark/pull/1089
SPARK-2148 Add link to requirements for custom equals() and hashcode() methods
Andrew Ash <andrew@andrewash.com>
2014-06-15 23:32:55 -0700
Commit: 9672ee0, github.com/apache/spark/pull/1092
SPARK-1999: StorageLevel in storage tab and RDD Storage Info never changes
CrazyJvm <crazyjvm@gmail.com>
2014-06-15 23:23:26 -0700
Commit: a63aa1a, github.com/apache/spark/pull/968
[SPARK-937] adding EXITED executor state and not relaunching cleanly exited executors
Kan Zhang <kzhang@apache.org>
2014-06-15 14:55:34 -0700
Commit: ca5d9d4, github.com/apache/spark/pull/306
[SQL] Support transforming TreeNodes with Option children.
Michael Armbrust <michael@databricks.com>, Zongheng Yang <zongheng.y@gmail.com>
2014-06-15 11:28:34 +0200
Commit: 269fc62, github.com/apache/spark/pull/1074
[SPARK-1837] NumericRange should be partitioned in the same way as other...
Kan Zhang <kzhang@apache.org>
2014-06-14 14:31:28 -0700
Commit: 7dd9fc6, github.com/apache/spark/pull/776
[SPARK-2013] Documentation for saveAsPickleFile and pickleFile in Python
Kan Zhang <kzhang@apache.org>
2014-06-14 13:22:30 -0700
Commit: b52603b, github.com/apache/spark/pull/983
[SPARK-2079] Support batching when serializing SchemaRDD to Python
Kan Zhang <kzhang@apache.org>
2014-06-14 13:17:22 -0700
Commit: 2550533, github.com/apache/spark/pull/1023
[Spark-2137][SQL] Timestamp UDFs broken
Yin Huai <huai@cse.ohio-state.edu>
2014-06-13 23:28:57 -0700
Commit: 8919685, github.com/apache/spark/pull/1081
Small correction in Streaming Programming Guide doc
akkomar <ak.komar@gmail.com>
2014-06-13 15:37:26 -0700
Commit: edb1f0e, github.com/apache/spark/pull/1079
[SPARK-2094][SQL] "Exactly once" semantics for DDL and command statements
Cheng Lian <lian.cs.zju@gmail.com>
2014-06-13 12:59:48 -0700
Commit: ac96d96, github.com/apache/spark/pull/1071
[SPARK-1964][SQL] Add timestamp to HiveMetastoreTypes.toMetastoreType
Michael Armbrust <michael@databricks.com>
2014-06-13 12:55:15 -0700
Commit: 1c2fd01, github.com/apache/spark/pull/1061
Workaround in Spark for ConcurrentModification issue (JIRA Hadoop-10456, Spark-1097)
nravi <nravi@c1704.halxg.cloudera.com>
2014-06-13 10:52:21 -0700
Commit: 70c8116, github.com/apache/spark/pull/1000
[HOTFIX] add math3 version to pom
Xiangrui Meng <meng@databricks.com>
2014-06-13 02:59:38 -0700
Commit: b3736e3, github.com/apache/spark/pull/1075
[SPARK-2135][SQL] Use planner for in-memory scans
Michael Armbrust <michael@databricks.com>
2014-06-12 23:09:41 -0700
Commit: 13f8cfd, github.com/apache/spark/pull/1072
[SPARK-1516]Throw exception in yarn client instead of run system.exit directly.
John Zhao <jzhao@alpinenow.com>
2014-06-12 21:39:00 -0700
Commit: f95ac68, github.com/apache/spark/pull/490
[Minor] Fix style, formatting and naming in BlockManager etc.
Andrew Or <andrewor14@gmail.com>
2014-06-12 20:40:58 -0700
Commit: 44daec5, github.com/apache/spark/pull/1058
SPARK-1939 Refactor takeSample method in RDD to use ScaSRS
Doris Xin <doris.s.xin@gmail.com>, dorx <doris.s.xin@gmail.com>, Xiangrui Meng <meng@databricks.com>
2014-06-12 19:44:27 -0700
Commit: 1de1d70, github.com/apache/spark/pull/916
document laziness of parallelize
Ariel Rabkin <asrabkin@cs.princeton.edu>
2014-06-12 17:51:33 -0700
Commit: 0154587, github.com/apache/spark/pull/1070
SPARK-2085: [MLlib] Apply user-specific regularization instead of uniform regularization in ALS
Shuo Xiang <sxiang@twitter.com>
2014-06-12 17:37:06 -0700
Commit: a6e0afd, github.com/apache/spark/pull/1026
SPARK-1843: Replace assemble-deps with env variable.
Patrick Wendell <pwendell@gmail.com>
2014-06-12 15:43:32 -0700
Commit: 1c04652, github.com/apache/spark/pull/877
[SPARK-2080] Yarn: report HS URL in client mode, correct user in cluster mode.
Marcelo Vanzin <vanzin@cloudera.com>
2014-06-12 16:19:36 -0500
Commit: ecde5b8, github.com/apache/spark/pull/1002
[SPARK-2088] fix NPE in toString
Doris Xin <doris.s.xin@gmail.com>
2014-06-12 12:53:07 -0700
Commit: 83c226d, github.com/apache/spark/pull/1028
SPARK-554. Add aggregateByKey.
Sandy Ryza <sandy@cloudera.com>
2014-06-12 08:14:25 -0700
Commit: ce92a9c, github.com/apache/spark/pull/705
fixed typo in docstring for min()
Jeff Thompson <jeffreykeatingthompson@gmail.com>
2014-06-12 08:10:51 -0700
Commit: 43d53d5, github.com/apache/spark/pull/1065
Cleanup on Connection and ConnectionManager
Henry Saputra <henry.saputra@gmail.com>
2014-06-11 23:17:51 -0700
Commit: 4d8ae70, github.com/apache/spark/pull/1060
'killFuture' is never used
Yadong <qiyadong2010@gmail.com>
2014-06-11 20:58:39 -0700
Commit: e056320, github.com/apache/spark/pull/1052
[SPARK-2044] Pluggable interface for shuffles
Matei Zaharia <matei@databricks.com>
2014-06-11 20:45:29 -0700
Commit: 508fd37, github.com/apache/spark/pull/1009
[SPARK-1672][MLLIB] Separate user and product partitioning in ALS
Tor Myklebust <tmyklebu@gmail.com>, Xiangrui Meng <meng@databricks.com>
2014-06-11 18:16:33 -0700
Commit: d920335, github.com/apache/spark/pull/1014
[SPARK-2052] [SQL] Add optimization for CaseConversionExpression's.
Takuya UESHIN <ueshin@happy-camper.st>
2014-06-11 17:58:35 -0700
Commit: 9a2448d, github.com/apache/spark/pull/990
HOTFIX: Forgot to remove false change in previous commit
Patrick Wendell <pwendell@gmail.com>
2014-06-11 15:55:41 -0700
Commit: d45e0c6
HOTFIX: PySpark tests should be order insensitive.
Patrick Wendell <pwendell@gmail.com>
2014-06-11 15:54:41 -0700
Commit: 14e6dc9, github.com/apache/spark/pull/1054
HOTFIX: A few PySpark tests were not actually run
Andrew Or <andrewor14@gmail.com>
2014-06-11 12:11:46 -0700
Commit: fe78b8b, github.com/apache/spark/pull/1053
[SQL] Code Cleanup: Left Semi Hash Join
Daoyuan <daoyuan.wang@intel.com>
2014-06-11 12:08:28 -0700
Commit: ce6deb1, github.com/apache/spark/pull/1049
[SPARK-2042] Prevent unnecessary shuffle triggered by take()
Sameer Agarwal <sameer@databricks.com>
2014-06-11 12:01:04 -0700
Commit: 4107cce, github.com/apache/spark/pull/1048
SPARK-2113: awaitTermination() after stop() will hang in Spark Stremaing
Lars Albertsson <lalle@spotify.com>
2014-06-11 10:54:42 -0700
Commit: 4d5c12a, github.com/apache/spark/pull/1001
[SPARK-2108] Mark SparkContext methods that return block information as developer API's
Prashant Sharma <prashant.s@imaginea.com>
2014-06-11 10:49:34 -0700
Commit: e508f59, github.com/apache/spark/pull/1047
[SPARK-2069] MIMA false positives
Prashant Sharma <prashant.s@imaginea.com>
2014-06-11 10:47:06 -0700
Commit: 5b754b4, github.com/apache/spark/pull/1021
SPARK-1639. Tidy up some Spark on YARN code
Sandy Ryza <sandy@cloudera.com>
2014-06-11 07:57:28 -0500
Commit: 2a4225d, github.com/apache/spark/pull/561
SPARK-2107: FilterPushdownSuite doesn't need Junit jar.
Qiuzhuang.Lian <Qiuzhuang.Lian@gmail.com>
2014-06-11 00:36:06 -0700
Commit: 6e11930, github.com/apache/spark/pull/1046
[SPARK-2091][MLLIB] use numpy.dot instead of ndarray.dot
Xiangrui Meng <meng@databricks.com>
2014-06-11 00:22:40 -0700
Commit: 0f1dc3a, github.com/apache/spark/pull/1035
[SPARK-1968][SQL] SQL/HiveQL command for caching/uncaching tables
Cheng Lian <lian.cs.zju@gmail.com>
2014-06-11 00:06:50 -0700
Commit: 0266a0c, github.com/apache/spark/pull/1038
[SPARK-2093] [SQL] NullPropagation should use exact type value.
Takuya UESHIN <ueshin@happy-camper.st>
2014-06-10 23:13:48 -0700
Commit: 0402bd7, github.com/apache/spark/pull/1034
HOTFIX: clear() configs in SQLConf-related unit tests.
Zongheng Yang <zongheng.y@gmail.com>
2014-06-10 21:59:01 -0700
Commit: 601032f, github.com/apache/spark/pull/1040
[SPARK-2065] give launched instances names
Nicholas Chammas <nicholas.chammas@gmail.com>, nchammas <nicholas.chammas@gmail.com>
2014-06-10 21:49:08 -0700
Commit: a2052a4, github.com/apache/spark/pull/1043
Resolve scalatest warnings during build
witgo <witgo@qq.com>
2014-06-10 20:24:05 -0700
Commit: c48b622, github.com/apache/spark/pull/1032
[SPARK-1940] Enabling rolling of executor logs, and automatic cleanup of old executor logs
Tathagata Das <tathagata.das1565@gmail.com>
2014-06-10 20:22:02 -0700
Commit: 4823bf4, github.com/apache/spark/pull/895
[SPARK-1998] SparkFlumeEvent with body bigger than 1020 bytes are not re...
joyyoj <sunshch@gmail.com>
2014-06-10 17:26:17 -0700
Commit: 2966044, github.com/apache/spark/pull/951
[SQL] Add average overflow test case from #978
egraldlo <egraldlo@gmail.com>, Michael Armbrust <michael@databricks.com>
2014-06-10 14:07:55 -0700
Commit: 1abbde0, github.com/apache/spark/pull/1033
HOTFIX: Increase time limit for Bagel test
Ankur Dave <ankurdave@gmail.com>
2014-06-10 13:15:06 -0700
Commit: 55a0e87, github.com/apache/spark/pull/1037
HOTFIX: Fix Python tests on Jenkins.
Patrick Wendell <pwendell@gmail.com>
2014-06-10 13:13:17 -0700
Commit: fb499be, github.com/apache/spark/pull/1036
[SPARK-2076][SQL] Pushdown the join filter & predication for outer join
Cheng Hao <hao.cheng@intel.com>
2014-06-10 12:59:52 -0700
Commit: db0c038, github.com/apache/spark/pull/1015
[SPARK-1978] In some cases, spark-yarn does not automatically restart the failed container
witgo <witgo@qq.com>
2014-06-10 10:34:57 -0500
Commit: 884ca71, github.com/apache/spark/pull/921
Moved hiveOperators.scala to the right package folder
Cheng Lian <lian.cs.zju@gmail.com>
2014-06-10 01:14:44 -0700
Commit: a9a461c, github.com/apache/spark/pull/1029
[SPARK-1508][SQL] Add SQLConf to SQLContext.
Zongheng Yang <zongheng.y@gmail.com>
2014-06-10 00:49:09 -0700
Commit: 08ed9ad, github.com/apache/spark/pull/956
SPARK-1416: PySpark support for SequenceFile and Hadoop InputFormats
Nick Pentreath <nick.pentreath@gmail.com>
2014-06-09 22:21:03 -0700
Commit: f971d6c, github.com/apache/spark/pull/455
Make sure that empty string is filtered out when we get the secondary jars from conf
DB Tsai <dbtsai@dbtsai.com>
2014-06-09 22:18:50 -0700
Commit: 6f2db8c, github.com/apache/spark/pull/1027
[SPARK-1704][SQL] Fully support EXPLAIN commands as SchemaRDD.
Zongheng Yang <zongheng.y@gmail.com>
2014-06-09 16:47:44 -0700
Commit: a9ec033, github.com/apache/spark/pull/1003
[SQL] Simple framework for debugging query execution
Michael Armbrust <michael@databricks.com>
2014-06-09 14:24:19 -0700
Commit: c6e041d, github.com/apache/spark/pull/1005
[SPARK-1522] : YARN ClientBase throws a NPE if there is no YARN Application CP
Bernardo Gomez Palacio <bernardo.gomezpalacio@gmail.com>
2014-06-09 16:14:54 -0500
Commit: e273447, github.com/apache/spark/pull/433
Added a TaskSetManager unit test.
Kay Ousterhout <kayousterhout@gmail.com>
2014-06-09 13:13:53 -0700
Commit: 6cf335d, github.com/apache/spark/pull/1024
[SPARK-1495][SQL]add support for left semi join
Daoyuan <daoyuan.wang@intel.com>, Michael Armbrust <michael@databricks.com>, Daoyuan Wang <daoyuan.wang@intel.com>
2014-06-09 11:31:36 -0700
Commit: 0cf6002, github.com/apache/spark/pull/837
SPARK-1944 Document --verbose in spark-shell -h
Andrew Ash <andrew@andrewash.com>
2014-06-09 10:21:21 -0700
Commit: 35630c8, github.com/apache/spark/pull/1020
[SPARK-1308] Add getNumPartitions to pyspark RDD
Syed Hashmi <shashmi@cloudera.com>
2014-06-09 00:08:40 -0700
Commit: 6113ac1, github.com/apache/spark/pull/995
Grammar: read -> reads
Andrew Ash <andrew@andrewash.com>
2014-06-08 23:20:10 -0700
Commit: 32ee9f0, github.com/apache/spark/pull/1016
[SPARK-2067] use relative path for Spark logo in UI
Neville Li <neville@spotify.com>
2014-06-08 23:18:27 -0700
Commit: 15ddbef, github.com/apache/spark/pull/1006
SPARK-1628 follow up: Improve RangePartitioner's documentation.
Reynold Xin <rxin@apache.org>
2014-06-08 18:39:57 -0700
Commit: 219dc00, github.com/apache/spark/pull/1012
Update run-example
maji2014 <maji3@asiainfo-linkage.com>
2014-06-08 15:14:27 -0700
Commit: e9261d0, github.com/apache/spark/pull/1011
SPARK-1628: Add missing hashCode methods in Partitioner subclasses
zsxwing <zsxwing@gmail.com>
2014-06-08 14:18:52 -0700
Commit: a71c6d1, github.com/apache/spark/pull/549
SPARK-1898: In deploy.yarn.Client, use YarnClient not YarnClientImpl
Colin Patrick McCabe <cmccabe@cloudera.com>
2014-06-08 12:27:34 -0700
Commit: ee96e94, github.com/apache/spark/pull/850
SPARK-2026: Maven Hadoop Profiles Should Set The Hadoop Version
Bernardo Gomez Palacio <bernardo.gomezpalacio@gmail.com>
2014-06-08 01:24:52 -0700
Commit: a338834, github.com/apache/spark/pull/998
SPARK-2056 Set RDD name to input path
Neville Li <neville@spotify.com>
2014-06-07 16:22:26 -0700
Commit: 7b877b2, github.com/apache/spark/pull/992
HOTFIX: Support empty body in merge script
Patrick Wendell <pwendell@gmail.com>
2014-06-07 16:16:37 -0700
Commit: 3ace10d, github.com/apache/spark/pull/1007
[SPARK-1994][SQL] Weird data corruption bug when running Spark SQL on data in HDFS
Michael Armbrust <michael@databricks.com>
2014-06-07 14:20:33 -0700
Commit: a6c72ab, github.com/apache/spark/pull/1004
[SPARK-1841]: update scalatest to version 2.1.5
witgo <witgo@qq.com>
2014-06-06 11:45:21 -0700
Commit: 41c4a33, github.com/apache/spark/pull/713
[SPARK-2050 - 2][SQL] DIV and BETWEEN should not be case sensitive.
Michael Armbrust <michael@databricks.com>
2014-06-06 11:31:37 -0700
Commit: 8d21056, github.com/apache/spark/pull/994
[SPARK-1552] Fix type comparison bug in {map,outerJoin}Vertices
Ankur Dave <ankurdave@gmail.com>
2014-06-05 23:33:12 -0700
Commit: 8d85359, github.com/apache/spark/pull/967
[SPARK-2050][SQL] LIKE, RLIKE and IN in HQL should not be case sensitive.
Michael Armbrust <michael@databricks.com>
2014-06-05 23:20:59 -0700
Commit: 41db44c, github.com/apache/spark/pull/989
SPARK-2043: ExternalAppendOnlyMap doesn't always find matching keys
Matei Zaharia <matei@databricks.com>
2014-06-05 23:01:48 -0700
Commit: b45c13e, github.com/apache/spark/pull/986
[SPARK-2025] Unpersist edges of previous graph in Pregel
Ankur Dave <ankurdave@gmail.com>
2014-06-05 17:45:38 -0700
Commit: 9bad0b7, github.com/apache/spark/pull/972
Use pluggable clock in DAGSheduler #SPARK-2031
CrazyJvm <crazyjvm@gmail.com>
2014-06-05 17:44:46 -0700
Commit: 3d3f8c8, github.com/apache/spark/pull/976
[SPARK-2041][SQL] Correctly analyze queries where columnName == tableName.
Michael Armbrust <michael@databricks.com>
2014-06-05 17:42:08 -0700
Commit: c7a183b, github.com/apache/spark/pull/985
Remove compile-scoped junit dependency.
Marcelo Vanzin <vanzin@cloudera.com>
2014-06-05 13:13:33 -0700
Commit: 668cb1d, github.com/apache/spark/pull/794
sbt 0.13.X should be using sbt-assembly 0.11.X
Kalpit Shah <shahkalpit84@gmail.com>
2014-06-05 13:07:26 -0700
Commit: 5473aa7, github.com/apache/spark/pull/555
HOTFIX: Remove generated-mima-excludes file after runing MIMA.
Patrick Wendell <pwendell@gmail.com>
2014-06-05 13:06:46 -0700
Commit: f6143f1, github.com/apache/spark/pull/971
[SPARK-2036] [SQL] CaseConversionExpression should check if the evaluated value is null.
Takuya UESHIN <ueshin@happy-camper.st>
2014-06-05 12:00:31 -0700
Commit: e4c11ee, github.com/apache/spark/pull/982
SPARK-1677: allow user to disable output dir existence checking
CodingCat <zhunansjtu@gmail.com>
2014-06-05 11:39:35 -0700
Commit: 89cdbb0, github.com/apache/spark/pull/947
[SPARK-2029] Bump pom.xml version number of master branch to 1.1.0-SNAPSHOT.
Takuya UESHIN <ueshin@happy-camper.st>
2014-06-05 11:27:33 -0700
Commit: 7c16029, github.com/apache/spark/pull/974
Fix issue in ReplSuite with hadoop-provided profile.
Marcelo Vanzin <vanzin@cloudera.com>
2014-06-04 22:56:49 -0700
Commit: b77c19b, github.com/apache/spark/pull/781
Minor: Fix documentation error from apache/spark#946
Ankur Dave <ankurdave@gmail.com>
2014-06-04 16:45:53 -0700
Commit: abea2d4, github.com/apache/spark/pull/970
SPARK-1790: Update EC2 scripts to support r3 instance types
Varakhedi Sujeet <svarakhedi@gopivotal.com>
2014-06-04 16:01:56 -0700
Commit: 11ded3f, github.com/apache/spark/pull/960
SPARK-1518: FileLogger: Fix compile against Hadoop trunk
Colin McCabe <cmccabe@cloudera.com>
2014-06-04 15:56:29 -0700
Commit: 1765c8d, github.com/apache/spark/pull/898
[SPARK-1752][MLLIB] Standardize text format for vectors and labeled points
Xiangrui Meng <meng@databricks.com>
2014-06-04 12:56:56 -0700
Commit: 189df16, github.com/apache/spark/pull/685
SPARK-1973. Add randomSplit to JavaRDD (with tests, and tidy Java tests)
Sean Owen <sowen@cloudera.com>, Xiangrui Meng <meng@databricks.com>
2014-06-04 11:27:08 -0700
Commit: d341b17, github.com/apache/spark/pull/919
[MLLIB] set RDD names in ALS
Neville Li <neville@spotify.com>
2014-06-04 01:51:34 -0700
Commit: b8d2580, github.com/apache/spark/pull/966
[SPARK-1817] RDD.zip() should verify partition sizes for each partition
Kan Zhang <kzhang@apache.org>
2014-06-03 22:47:18 -0700
Commit: c402a4a, github.com/apache/spark/pull/944
SPARK-1806 (addendum) Use non-deprecated methods in Mesos 0.18
Sean Owen <sowen@cloudera.com>
2014-06-03 22:37:20 -0700
Commit: 4ca0625, github.com/apache/spark/pull/920
Update spark-ec2 scripts for 1.0.0 on master
Aaron Davidson <aaron@databricks.com>
2014-06-03 22:33:04 -0700
Commit: ab7c62d, github.com/apache/spark/pull/938
Enable repartitioning of graph over different number of partitions
Joseph E. Gonzalez <joseph.e.gonzalez@gmail.com>
2014-06-03 20:49:14 -0700
Commit: 5284ca7, github.com/apache/spark/pull/719
use env default python in merge_spark_pr.py
Xiangrui Meng <meng@databricks.com>
2014-06-03 18:53:13 -0700
Commit: e8d93ee, github.com/apache/spark/pull/965
SPARK-1941: Update streamlib to 2.7.0 and use HyperLogLogPlus instead of HyperLogLog.
Reynold Xin <rxin@apache.org>
2014-06-03 18:37:40 -0700
Commit: 1faef14, github.com/apache/spark/pull/897
[SPARK-1161] Add saveAsPickleFile and SparkContext.pickleFile in Python
Kan Zhang <kzhang@apache.org>
2014-06-03 18:18:25 -0700
Commit: 21e40ed, github.com/apache/spark/pull/755
Fixed a typo
DB Tsai <dbtsai@dbtsai.com>
2014-06-03 18:10:58 -0700
Commit: f4dd665, github.com/apache/spark/pull/959
[SPARK-1991] Support custom storage levels for vertices and edges
Ankur Dave <ankurdave@gmail.com>
2014-06-03 14:54:26 -0700
Commit: b1feb60, github.com/apache/spark/pull/946
Synthetic GraphX Benchmark
Joseph E. Gonzalez <joseph.e.gonzalez@gmail.com>, Ankur Dave <ankurdave@gmail.com>
2014-06-03 14:14:48 -0700
Commit: 894ecde, github.com/apache/spark/pull/720
fix java.lang.ClassCastException
baishuo(白硕) <vc_java@hotmail.com>
2014-06-03 13:39:47 -0700
Commit: aa41a52, github.com/apache/spark/pull/949
[SPARK-1468] Modify the partition function used by partitionBy.
Erik Selin <erik.selin@jadedpixel.com>
2014-06-03 13:31:16 -0700
Commit: 8edc9d0, github.com/apache/spark/pull/371
Add support for Pivotal HD in the Maven build: SPARK-1992
tzolov <christian.tzolov@gmail.com>
2014-06-03 13:26:29 -0700
Commit: b1f2853, github.com/apache/spark/pull/942
[SPARK-1912] fix compress memory issue during reduce
Wenchen Fan(Cloud) <cloud0fan@gmail.com>
2014-06-03 13:18:20 -0700
Commit: 45e9bc8, github.com/apache/spark/pull/860
SPARK-2001 : Remove docs/spark-debugger.md from master
Henry Saputra <henry.saputra@gmail.com>
2014-06-03 13:03:51 -0700
Commit: 6c044ed, github.com/apache/spark/pull/953
[SPARK-1942] Stop clearing spark.driver.port in unit tests
Syed Hashmi <shashmi@cloudera.com>, CodingCat <zhunansjtu@gmail.com>
2014-06-03 12:04:47 -0700
Commit: 7782a30, github.com/apache/spark/pull/943
Avoid dynamic dispatching when unwrapping Hive data.
Cheng Lian <lian.cs.zju@gmail.com>
2014-06-02 19:20:23 -0700
Commit: 862283e, github.com/apache/spark/pull/935
[SPARK-1995][SQL] system function upper and lower can be supported
egraldlo <egraldlo@gmail.com>
2014-06-02 18:02:57 -0700
Commit: ec8be27, github.com/apache/spark/pull/936
[SPARK-1958] Calling .collect() on a SchemaRDD should call executeCollect() on the underlying query plan.
Cheng Lian <lian.cs.zju@gmail.com>
2014-06-02 12:09:43 -0700
Commit: d000ca9, github.com/apache/spark/pull/939
[SPARK-1553] Alternating nonnegative least-squares
Tor Myklebust <tmyklebu@gmail.com>
2014-06-02 11:48:09 -0700
Commit: 9a5d482, github.com/apache/spark/pull/460
Add landmark-based Shortest Path algorithm to graphx.lib
Ankur Dave <ankurdave@gmail.com>, Andres Perez <andres@tresata.com>
2014-06-02 00:00:24 -0700
Commit: 9535f40, github.com/apache/spark/pull/933
Better explanation for how to use MIMA excludes.
Patrick Wendell <pwendell@gmail.com>
2014-06-01 17:27:05 -0700
Commit: d17d221, github.com/apache/spark/pull/937
Made spark_ec2.py PEP8 compliant.
Reynold Xin <rxin@apache.org>
2014-06-01 15:39:04 -0700
Commit: eea3aab, github.com/apache/spark/pull/891
updated java code blocks in spark SQL guide such that ctx will refer to ...
Yadid Ayzenberg <yadid@media.mit.edu>
2014-05-31 19:44:13 -0700
Commit: 366c0c4, github.com/apache/spark/pull/932
SPARK-1917: fix PySpark import of scipy.special functions
Uri Laserson <laserson@cloudera.com>
2014-05-31 14:59:09 -0700
Commit: 5e98967, github.com/apache/spark/pull/866
Improve maven plugin configuration
witgo <witgo@qq.com>
2014-05-31 14:36:27 -0700
Commit: d8c005d, github.com/apache/spark/pull/786
SPARK-1839: PySpark RDD#take() shouldn't always read from driver
Aaron Davidson <aaron@databricks.com>
2014-05-31 13:04:57 -0700
Commit: 9909efc, github.com/apache/spark/pull/922
Super minor: Close inputStream in SparkSubmitArguments
Aaron Davidson <aaron@databricks.com>
2014-05-31 12:36:58 -0700
Commit: 7d52777, github.com/apache/spark/pull/914
[SQL] SPARK-1964 Add timestamp to hive metastore type parser.
Michael Armbrust <michael@databricks.com>
2014-05-31 12:34:22 -0700
Commit: 1a0da0e, github.com/apache/spark/pull/913
Optionally include Hive as a dependency of the REPL.
Michael Armbrust <michael@databricks.com>
2014-05-31 12:24:35 -0700
Commit: 7463cd2, github.com/apache/spark/pull/801
[SPARK-1947] [SQL] Child of SumDistinct or Average should be widened to prevent overflows the same as Sum.
Takuya UESHIN <ueshin@happy-camper.st>
2014-05-31 11:30:03 -0700
Commit: 3ce8149, github.com/apache/spark/pull/902
correct tiny comment error
Chen Chao <crazyjvm@gmail.com>
2014-05-31 00:06:49 -0700
Commit: 9ecc40d, github.com/apache/spark/pull/928
[SPARK-1959] String "NULL" shouldn't be interpreted as null value
Cheng Lian <lian.cs.zju@gmail.com>
2014-05-30 22:13:11 -0700
Commit: cf98960, github.com/apache/spark/pull/909
SPARK-1976: fix the misleading part in streaming docs
CodingCat <zhunansjtu@gmail.com>
2014-05-30 22:06:08 -0700
Commit: 41bfdda, github.com/apache/spark/pull/924
updated link to mailing list
nchammas <nicholas.chammas@gmail.com>
2014-05-30 22:04:57 -0700
Commit: 23ae366, github.com/apache/spark/pull/923
Typo: and -> an
Andrew Ash <andrew@andrewash.com>
2014-05-30 22:02:04 -0700
Commit: 9c1f204, github.com/apache/spark/pull/927
[SPARK-1901] worker should make sure executor has exited before updating executor's info
Zhen Peng <zhenpeng01@baidu.com>
2014-05-30 10:11:02 -0700
Commit: ff562b2, github.com/apache/spark/pull/854
[SPARK-1971] Update MIMA to compare against Spark 1.0.0
Prashant Sharma <prashant.s@imaginea.com>
2014-05-30 01:13:51 -0700
Commit: 79fa8fd, github.com/apache/spark/pull/910
[SPARK-1566] consolidate programming guide, and general doc updates
Matei Zaharia <matei@databricks.com>
2014-05-30 00:34:33 -0700
Commit: c8bf413, github.com/apache/spark/pull/896
[SPARK-1820] Make GenerateMimaIgnore @DeveloperApi annotation aware.
Prashant Sharma <prashant.s@imaginea.com>, nikhil7sh <nikhilsharmalnmiit@gmail.ccom>
2014-05-29 23:20:20 -0700
Commit: eeee978, github.com/apache/spark/pull/904
initial version of LPA
Ankur Dave <ankurdave@gmail.com>, haroldsultan <haroldsultan@gmail.com>, Harold Sultan <haroldsultan@gmail.com>
2014-05-29 15:39:25 -0700
Commit: b7e28fa, github.com/apache/spark/pull/905
[SPARK-1368][SQL] Optimized HiveTableScan
Cheng Lian <lian.cs.zju@gmail.com>
2014-05-29 15:24:03 -0700
Commit: 8f7141f, github.com/apache/spark/pull/758
SPARK-1935: Explicitly add commons-codec 1.5 as a dependency.
Yin Huai <huai@cse.ohio-state.edu>
2014-05-29 09:07:39 -0700
Commit: 60b89fe, github.com/apache/spark/pull/889
Added doctest and method description in context.py
Jyotiska NK <jyotiska123@gmail.com>
2014-05-28 23:08:39 -0700
Commit: 9cff1dd, github.com/apache/spark/pull/187
[SPARK-1712]: TaskDescription instance is too big causes Spark to hang
witgo <witgo@qq.com>
2014-05-28 15:57:05 -0700
Commit: 4dbb27b, github.com/apache/spark/pull/694
Spark 1916
David Lemieux <david.lemieux@radialpoint.com>
2014-05-28 15:50:35 -0700
Commit: 4312cf0, github.com/apache/spark/pull/865
Organize configuration docs
Patrick Wendell <pwendell@gmail.com>
2014-05-28 15:49:54 -0700
Commit: 7801d44, github.com/apache/spark/pull/880
Fix doc about NetworkWordCount/JavaNetworkWordCount usage of spark streaming
jmu <jmujmu@gmail.com>
2014-05-27 22:41:47 -0700
Commit: 82eadc3, github.com/apache/spark/pull/826
[SPARK-1938] [SQL] ApproxCountDistinctMergeFunction should return Int value.
Takuya UESHIN <ueshin@happy-camper.st>
2014-05-27 22:17:50 -0700
Commit: 9df8683, github.com/apache/spark/pull/893
[SQL] SPARK-1922
LY Lai <ly.lai@vpon.com>
2014-05-27 16:08:38 -0700
Commit: 0682567, github.com/apache/spark/pull/873
[SPARK-1915] [SQL] AverageFunction should not count if the evaluated value is null.
Takuya UESHIN <ueshin@happy-camper.st>
2014-05-27 14:55:23 -0700
Commit: 3b0baba, github.com/apache/spark/pull/862
[SPARK-1926] [SQL] Nullability of Max/Min/First should be true.
Takuya UESHIN <ueshin@happy-camper.st>
2014-05-27 14:53:57 -0700
Commit: d1375a2, github.com/apache/spark/pull/881
bugfix worker DriverStateChanged state should match DriverState.FAILED
lianhuiwang <lianhuiwang09@gmail.com>
2014-05-27 11:53:38 -0700
Commit: 95e4c9c, github.com/apache/spark/pull/864
SPARK-1932: Fix race conditions in onReceiveCallback and cachedPeers
zsxwing <zsxwing@gmail.com>
2014-05-26 23:17:39 -0700
Commit: 549830b, github.com/apache/spark/pull/887
SPARK-1933: Throw a more meaningful exception when a directory is passed to addJar/addFile.
Reynold Xin <rxin@apache.org>
2014-05-26 22:05:23 -0700
Commit: 90e281b, github.com/apache/spark/pull/888
Updated dev Python scripts to make them PEP8 compliant.
Reynold Xin <rxin@apache.org>
2014-05-26 21:40:52 -0700
Commit: 9ed3719, github.com/apache/spark/pull/875
Fixed the error message for OutOfMemoryError in DAGScheduler.
Reynold Xin <rxin@apache.org>
2014-05-26 21:31:27 -0700
Commit: ef690e1
SPARK-1929 DAGScheduler suspended by local task OOM
Zhen Peng <zhenpeng01@baidu.com>
2014-05-26 21:30:25 -0700
Commit: 8d271c9, github.com/apache/spark/pull/883
[SPARK-1931] Reconstruct routing tables in Graph.partitionBy
Ankur Dave <ankurdave@gmail.com>
2014-05-26 16:10:22 -0700
Commit: 56c771c, github.com/apache/spark/pull/885
SPARK-1925: Replace '&' with '&&'
zsxwing <zsxwing@gmail.com>
2014-05-26 14:34:58 -0700
Commit: cb7fe50, github.com/apache/spark/pull/879
Fix scalastyle warnings in yarn alpha
witgo <witgo@qq.com>
2014-05-26 13:16:35 -0700
Commit: bee6c4f, github.com/apache/spark/pull/884
[SPARK-1914] [SQL] Simplify CountFunction not to traverse to evaluate all child expressions.
Takuya UESHIN <ueshin@happy-camper.st>
2014-05-26 00:17:20 -0700
Commit: d6395d8, github.com/apache/spark/pull/861
HOTFIX: Add no-arg SparkContext constructor in Java
Patrick Wendell <pwendell@gmail.com>
2014-05-25 20:13:32 -0700
Commit: b6d22af, github.com/apache/spark/pull/878
[SQL] Minor: Introduce SchemaRDD#aggregate() for simple aggregations
Aaron Davidson <aaron@databricks.com>
2014-05-25 18:37:44 -0700
Commit: c3576ff, github.com/apache/spark/pull/874
SPARK-1903 Document Spark's network connections
Andrew Ash <andrew@andrewash.com>
2014-05-25 17:15:47 -0700
Commit: 0659529, github.com/apache/spark/pull/856
Fix PEP8 violations in Python mllib.
Reynold Xin <rxin@apache.org>
2014-05-25 17:15:01 -0700
Commit: d33d3c6, github.com/apache/spark/pull/871
Python docstring update for sql.py.
Reynold Xin <rxin@apache.org>
2014-05-25 16:04:17 -0700
Commit: 14f0358, github.com/apache/spark/pull/869
Fix PEP8 violations in examples/src/main/python.
Reynold Xin <rxin@apache.org>
2014-05-25 14:48:27 -0700
Commit: d79c2b2, github.com/apache/spark/pull/870
Added license header for tox.ini.
Reynold Xin <rxin@apache.org>
2014-05-25 01:47:08 -0700
Commit: 55fddf9
SPARK-1822: Some minor cleanup work on SchemaRDD.count()
Reynold Xin <rxin@apache.org>
2014-05-25 01:44:49 -0700
Commit: d66642e, github.com/apache/spark/pull/868
Added PEP8 style configuration file.
Reynold Xin <rxin@apache.org>
2014-05-25 01:32:15 -0700
Commit: 5c7faec, github.com/apache/spark/pull/872
[SPARK-1822] SchemaRDD.count() should use query optimizer
Kan Zhang <kzhang@apache.org>
2014-05-25 00:06:42 -0700
Commit: 6052db9, github.com/apache/spark/pull/841
spark-submit: add exec at the end of the script
Colin Patrick Mccabe <cmccabe@cloudera.com>
2014-05-24 22:39:27 -0700
Commit: 6e9fb63, github.com/apache/spark/pull/858
[SPARK-1913][SQL] Bug fix: column pruning error in Parquet support
Cheng Lian <lian.cs.zju@gmail.com>
2014-05-24 20:42:01 -0700
Commit: 5afe6af, github.com/apache/spark/pull/863
[SPARK-1886] check executor id existence when executor exit
Zhen Peng <zhenpeng01@baidu.com>
2014-05-24 20:40:19 -0700
Commit: 4e4831b, github.com/apache/spark/pull/827
SPARK-1911: Emphasize that Spark jars should be built with Java 6.
Patrick Wendell <pwendell@gmail.com>
2014-05-24 18:27:00 -0700
Commit: 75a0327, github.com/apache/spark/pull/859
[SPARK-1900 / 1918] PySpark on YARN is broken
Andrew Or <andrewor14@gmail.com>
2014-05-24 18:01:49 -0700
Commit: 5081a0a, github.com/apache/spark/pull/853
Update LBFGSSuite.scala
baishuo(白硕) <vc_java@hotmail.com>
2014-05-23 13:02:40 -0700
Commit: a08262d, github.com/apache/spark/pull/815
Updated scripts for auditing releases
Tathagata Das <tathagata.das1565@gmail.com>
2014-05-22 20:48:55 -0700
Commit: b2bdd0e, github.com/apache/spark/pull/844
[SPARK-1896] Respect spark.master (and --master) before MASTER in spark-shell
Andrew Or <andrewor14@gmail.com>
2014-05-22 20:32:27 -0700
Commit: cce7745, github.com/apache/spark/pull/846
[SPARK-1897] Respect spark.jars (and --jars) in spark-shell
Andrew Or <andrewor14@gmail.com>
2014-05-22 20:25:41 -0700
Commit: 8edbee7, github.com/apache/spark/pull/849
Fix UISuite unit test that fails under Jenkins contention
Aaron Davidson <aaron@databricks.com>
2014-05-22 15:11:05 -0700
Commit: f9f5fd5, github.com/apache/spark/pull/857
[SPARK-1870] Make spark-submit --jars work in yarn-cluster mode.
Xiangrui Meng <meng@databricks.com>
2014-05-22 01:52:50 -0700
Commit: dba3140, github.com/apache/spark/pull/848
Configuration documentation updates
Reynold Xin <rxin@apache.org>
2014-05-21 18:49:12 -0700
Commit: 2a948e7, github.com/apache/spark/pull/851
[SPARK-1889] [SQL] Apply splitConjunctivePredicates to join condition while finding join ke...
Takuya UESHIN <ueshin@happy-camper.st>
2014-05-21 15:37:47 -0700
Commit: bb88875, github.com/apache/spark/pull/836
[SPARK-1519] Support minPartitions param of wholeTextFiles() in PySpark
Kan Zhang <kzhang@apache.org>
2014-05-21 13:26:53 -0700
Commit: f18fd05, github.com/apache/spark/pull/697
[Typo] Stoped -> Stopped
Andrew Or <andrewor14@gmail.com>
2014-05-21 11:59:05 -0700
Commit: ba5d4a9, github.com/apache/spark/pull/847
[Minor] Move JdbcRDDSuite to the correct package
Andrew Or <andrewor14@gmail.com>
2014-05-21 01:25:10 -0700
Commit: 7c79ef7, github.com/apache/spark/pull/839
[Docs] Correct example of creating a new SparkConf
Andrew Or <andrewor14@gmail.com>
2014-05-21 01:23:34 -0700
Commit: 1014668, github.com/apache/spark/pull/842
[SPARK-1250] Fixed misleading comments in bin/pyspark, bin/spark-class
Sumedh Mungee <smungee@gmail.com>
2014-05-21 01:22:25 -0700
Commit: 6e33738, github.com/apache/spark/pull/843
[Hotfix] Blacklisted flaky HiveCompatibility test
Tathagata Das <tathagata.das1565@gmail.com>
2014-05-20 10:27:12 -0700
Commit: 7f0cfe4, github.com/apache/spark/pull/838
[Spark 1877] ClassNotFoundException when loading RDD with serialized objects
Tathagata Das <tathagata.das1565@gmail.com>, Ghidireac <bogdang@u448a5b0a73d45358d94a.ant.amazon.com>
2014-05-19 22:36:24 -0700
Commit: 52eb54d, github.com/apache/spark/pull/835
[SPARK-1874][MLLIB] Clean up MLlib sample data
Xiangrui Meng <meng@databricks.com>
2014-05-19 21:29:33 -0700
Commit: bcb9dce, github.com/apache/spark/pull/833
SPARK-1689: Spark application should die when removed by Master
Aaron Davidson <aaron@databricks.com>
2014-05-19 20:55:26 -0700
Commit: b0ce22e, github.com/apache/spark/pull/832
[SPARK-1875]NoClassDefFoundError: StringUtils when building with hadoop 1.x and hive
witgo <witgo@qq.com>
2014-05-19 19:40:29 -0700
Commit: 6a2c5c6, github.com/apache/spark/pull/824
SPARK-1879. Increase MaxPermSize since some of our builds have many classes
Matei Zaharia <matei@databricks.com>
2014-05-19 18:42:28 -0700
Commit: 5af99d7, github.com/apache/spark/pull/823
SPARK-1878: Fix the incorrect initialization order
zsxwing <zsxwing@gmail.com>
2014-05-19 16:41:31 -0700
Commit: 1811ba8, github.com/apache/spark/pull/822
[SPARK-1876] Windows fixes to deal with latest distribution layout changes
Matei Zaharia <matei@databricks.com>
2014-05-19 15:02:35 -0700
Commit: 7b70a70, github.com/apache/spark/pull/819
[WIP][SPARK-1871][MLLIB] Improve MLlib guide for v1.0
Xiangrui Meng <meng@databricks.com>
2014-05-18 17:00:57 -0700
Commit: df0aa83, github.com/apache/spark/pull/816
SPARK-1873: Add README.md file when making distributions
Patrick Wendell <pwendell@gmail.com>
2014-05-18 16:51:53 -0700
Commit: 4ce4793, github.com/apache/spark/pull/818
Fix spark-submit path in spark-shell & pyspark
Neville Li <neville@spotify.com>
2014-05-18 13:31:23 -0700
Commit: ebcd2d6, github.com/apache/spark/pull/812
Make deprecation warning less severe
Patrick Wendell <pwendell@gmail.com>
2014-05-16 22:58:47 -0700
Commit: 442808a, github.com/apache/spark/pull/810
[SPARK-1824] Remove <master> from Python examples
Andrew Or <andrewor14@gmail.com>
2014-05-16 22:36:23 -0700
Commit: cf6cbe9, github.com/apache/spark/pull/802
[SPARK-1808] Route bin/pyspark through Spark submit
Andrew Or <andrewor14@gmail.com>
2014-05-16 22:34:38 -0700
Commit: 4b8ec6f, github.com/apache/spark/pull/799
Version bump of spark-ec2 scripts
Patrick Wendell <pwendell@gmail.com>
2014-05-16 21:42:14 -0700
Commit: c0ab85d, github.com/apache/spark/pull/809
SPARK-1864 Look in spark conf instead of system properties when propagating configuration to executors.
Michael Armbrust <michael@databricks.com>
2014-05-16 20:25:10 -0700
Commit: a80a6a1, github.com/apache/spark/pull/808
Tweaks to Mesos docs
Matei Zaharia <matei@databricks.com>
2014-05-16 17:35:05 -0700
Commit: fed6303, github.com/apache/spark/pull/806
SPARK-1487 [SQL] Support record filtering via predicate pushdown in Parquet
Andre Schumacher <andre.schumacher@iki.fi>
2014-05-16 13:41:41 -0700
Commit: 40d6acd, github.com/apache/spark/pull/511
[SQL] Implement between in hql
Michael Armbrust <michael@databricks.com>
2014-05-16 11:47:00 -0700
Commit: 032d663, github.com/apache/spark/pull/804
bugfix: overflow of graphx Edge compare function
Zhen Peng <zhenpeng01@baidu.com>
2014-05-16 11:37:18 -0700
Commit: fa6de40, github.com/apache/spark/pull/769
HOTFIX: Duplication of hbase version
Patrick Wendell <pwendell@gmail.com>
2014-05-15 23:33:27 -0700
Commit: e304eb9
SPARK-1862: Support for MapR in the Maven build.
Patrick Wendell <pwendell@gmail.com>
2014-05-15 23:31:43 -0700
Commit: 17702e2, github.com/apache/spark/pull/803
[Spark-1461] Deferred Expression Evaluation (short-circuit evaluation)
Cheng Hao <hao.cheng@intel.com>
2014-05-15 22:12:34 -0700
Commit: a20fea9, github.com/apache/spark/pull/446
SPARK-1860: Do not cleanup application work/ directories by default
Aaron Davidson <aaron@databricks.com>
2014-05-15 21:37:58 -0700
Commit: bb98eca, github.com/apache/spark/pull/800
Typos in Spark
Huajian Mao <huajianmao@gmail.com>
2014-05-15 18:20:16 -0700
Commit: 94c5139, github.com/apache/spark/pull/798
Fixes a misplaced comment.
Prashant Sharma <prashant.s@imaginea.com>
2014-05-15 16:58:37 -0700
Commit: e1e3416, github.com/apache/spark/pull/788
[SQL] Fix tiny/small ints from HiveMetastore.
Michael Armbrust <michael@databricks.com>
2014-05-15 16:50:42 -0700
Commit: a4aafe5, github.com/apache/spark/pull/797
SPARK-1803 Replaced colon in filenames with a dash
Stevo Slavić <sslavic@gmail.com>, Stevo Slavic <sslavic@gmail.com>
2014-05-15 16:44:14 -0700
Commit: e66e31b, github.com/apache/spark/pull/739
SPARK-1851. Upgrade Avro dependency to 1.7.6 so Spark can read Avro file...
Sandy Ryza <sandy@cloudera.com>
2014-05-15 16:35:39 -0700
Commit: 08e7606, github.com/apache/spark/pull/795
[SPARK-1741][MLLIB] add predict(JavaRDD) to RegressionModel, ClassificationModel, and KMeans
Xiangrui Meng <meng@databricks.com>
2014-05-15 11:59:59 -0700
Commit: d52761d, github.com/apache/spark/pull/670
[SPARK-1819] [SQL] Fix GetField.nullable.
Takuya UESHIN <ueshin@happy-camper.st>
2014-05-15 11:21:33 -0700
Commit: 94c9d6f, github.com/apache/spark/pull/757
[SPARK-1845] [SQL] Use AllScalaRegistrar for SparkSqlSerializer to register serializers of ...
Takuya UESHIN <ueshin@happy-camper.st>
2014-05-15 11:20:21 -0700
Commit: db8cc6f, github.com/apache/spark/pull/790
SPARK-1846 Ignore logs directory in RAT checks
Andrew Ash <andrew@andrewash.com>
2014-05-15 11:05:39 -0700
Commit: 3abe2b7, github.com/apache/spark/pull/793
HOTFIX: Don't build Javadoc in Maven when creating releases.
Patrick Wendell <pwendell@gmail.com>
2014-05-14 23:48:03 -0700
Commit: 514157f
fix different versions of commons-lang dependency and apache/spark#746 addendum
witgo <witgo@qq.com>
2014-05-14 22:26:26 -0700
Commit: bae07e3, github.com/apache/spark/pull/754
Package docs
Prashant Sharma <prashant.s@imaginea.com>, Patrick Wendell <pwendell@gmail.com>
2014-05-14 22:24:41 -0700
Commit: 4632427, github.com/apache/spark/pull/785
Documentation: Encourage use of reduceByKey instead of groupByKey.
Patrick Wendell <pwendell@gmail.com>
2014-05-14 22:24:04 -0700
Commit: 21570b4, github.com/apache/spark/pull/784
Add language tabs and Python version to interactive part of quick-start
Matei Zaharia <matei@databricks.com>
2014-05-14 21:45:20 -0700
Commit: f10de04, github.com/apache/spark/pull/782
[SPARK-1840] SparkListenerBus prints out scary error message when terminated normally
Tathagata Das <tathagata.das1565@gmail.com>
2014-05-14 21:13:41 -0700
Commit: ad4e60e, github.com/apache/spark/pull/783
default task number misleading in several places
Chen Chao <crazyjvm@gmail.com>
2014-05-14 18:20:20 -0700
Commit: 2f63995, github.com/apache/spark/pull/766
[SPARK-1826] fix the head notation of package object dsl
wangfei <scnbwf@yeah.net>
2014-05-14 17:59:11 -0700
Commit: 44165fc, github.com/apache/spark/pull/765
[Typo] propertes -> properties
andrewor14 <andrewor14@gmail.com>
2014-05-14 17:54:53 -0700
Commit: 9ad096d, github.com/apache/spark/pull/780
[SPARK-1696][MLLIB] use alpha in dense dspr
Xiangrui Meng <meng@databricks.com>
2014-05-14 17:18:30 -0700
Commit: e3d72a7, github.com/apache/spark/pull/778
String interpolation + some other small changes
Jacek Laskowski <jacek@japila.pl>
2014-05-14 15:45:52 -0700
Commit: 601e371, github.com/apache/spark/pull/748
[FIX] do not load defaults when testing SparkConf in pyspark
Xiangrui Meng <meng@databricks.com>
2014-05-14 14:57:17 -0700
Commit: 94c6c06, github.com/apache/spark/pull/775
SPARK-1833 - Have an empty SparkContext constructor.
Patrick Wendell <pwendell@gmail.com>
2014-05-14 12:53:30 -0700
Commit: 65533c7, github.com/apache/spark/pull/774
SPARK-1829 Sub-second durations shouldn't round to "0 s"
Andrew Ash <andrew@andrewash.com>
2014-05-14 12:01:14 -0700
Commit: a3315d7, github.com/apache/spark/pull/768
Fix: sbt test throw an java.lang.OutOfMemoryError: PermGen space
witgo <witgo@qq.com>
2014-05-14 11:19:26 -0700
Commit: fde82c1, github.com/apache/spark/pull/773
[SPARK-1620] Handle uncaught exceptions in function run by Akka scheduler
Mark Hamstra <markhamstra@gmail.com>
2014-05-14 10:07:25 -0700
Commit: 17f3075, github.com/apache/spark/pull/622
SPARK-1828: Created forked version of hive-exec that doesn't bundle other dependencies
Patrick Wendell <pwendell@gmail.com>
2014-05-14 09:51:01 -0700
Commit: d58cb33, github.com/apache/spark/pull/767
SPARK-1818 Freshen Mesos documentation
Andrew Ash <andrew@andrewash.com>
2014-05-14 09:45:33 -0700
Commit: d1d41cc, github.com/apache/spark/pull/756
SPARK-1827. LICENSE and NOTICE files need a refresh to contain transitive dependency info
Sean Owen <sowen@cloudera.com>
2014-05-14 09:38:33 -0700
Commit: 2e5a7cd, github.com/apache/spark/pull/770
Fixed streaming examples docs to use run-example instead of spark-submit
Tathagata Das <tathagata.das1565@gmail.com>
2014-05-14 04:17:32 -0700
Commit: 68f28da, github.com/apache/spark/pull/722
[SPARK-1769] Executor loss causes NPE race condition
Andrew Or <andrewor14@gmail.com>
2014-05-14 00:54:33 -0700
Commit: 69f7502, github.com/apache/spark/pull/762
Fix dep exclusion: avro-ipc, not avro, depends on netty.
Marcelo Vanzin <vanzin@cloudera.com>
2014-05-14 00:37:57 -0700
Commit: 54ae832, github.com/apache/spark/pull/763
SPARK-1801. expose InterruptibleIterator and TaskKilledException in deve...
Koert Kuipers <koert@tresata.com>
2014-05-14 00:10:12 -0700
Commit: b22952f, github.com/apache/spark/pull/764
[SQL] Improve column pruning.
Michael Armbrust <michael@databricks.com>
2014-05-13 23:27:22 -0700
Commit: 6ce0884, github.com/apache/spark/pull/729
Revert "[SPARK-1784] Add a new partitioner to allow specifying # of keys per partition"
Patrick Wendell <pwendell@gmail.com>
2014-05-13 23:24:51 -0700
Commit: 7bb9a52
Implement ApproximateCountDistinct for SparkSql
larvaboy <larvaboy@gmail.com>
2014-05-13 21:26:08 -0700
Commit: c33b8dc, github.com/apache/spark/pull/737
[SPARK-1784] Add a new partitioner to allow specifying # of keys per partition
Syed Hashmi <shashmi@cloudera.com>
2014-05-13 21:24:23 -0700
Commit: 92cebad, github.com/apache/spark/pull/721
[SQL] Make it possible to create Java/Python SQLContexts from an existing Scala SQLContext.
Michael Armbrust <michael@databricks.com>
2014-05-13 21:23:51 -0700
Commit: 4423386, github.com/apache/spark/pull/761
[SPARK-1527] change rootDir*.getName to rootDir*.getAbsolutePath
Ye Xianjin <advancedxy@gmail.com>
2014-05-13 19:03:51 -0700
Commit: 753b04d, github.com/apache/spark/pull/436
[SPARK-1816] LiveListenerBus dies if a listener throws an exception
Andrew Or <andrewor14@gmail.com>
2014-05-13 18:32:32 -0700
Commit: 5c0dafc, github.com/apache/spark/pull/759
SPARK-1791 - SVM implementation does not use threshold parameter
Andrew Tulloch <andrew@tullo.ch>
2014-05-13 17:31:27 -0700
Commit: d1e4874, github.com/apache/spark/pull/725
SPARK-571: forbid return statements in cleaned closures
William Benton <willb@redhat.com>
2014-05-13 13:45:23 -0700
Commit: 16ffadc, github.com/apache/spark/pull/717
BUILD: Add more content to make-distribution.sh.
Patrick Wendell <pwendell@gmail.com>
2014-05-12 23:02:54 -0700
Commit: 52d9052
SPARK-1815. SparkContext should not be marked DeveloperApi
Sandy Ryza <sandy@cloudera.com>
2014-05-12 20:08:30 -0700
Commit: 2792bd0, github.com/apache/spark/pull/753
[SPARK-1753 / 1773 / 1814] Update outdated docs for spark-submit, YARN, standalone etc.
Andrew Or <andrewor14@gmail.com>
2014-05-12 19:44:14 -0700
Commit: 2ffd1ea, github.com/apache/spark/pull/701
[SPARK-1780] Non-existent SPARK_DAEMON_OPTS is lurking around
Andrew Or <andrewor14@gmail.com>
2014-05-12 19:42:35 -0700
Commit: ba96bb3, github.com/apache/spark/pull/751
SPARK-1757 Failing test for saving null primitives with .saveAsParquetFile()
Andrew Ash <andrew@andrewash.com>, Michael Armbrust <michael@databricks.com>
2014-05-12 19:23:39 -0700
Commit: 156df87, github.com/apache/spark/pull/690
Modify a typo in monitoring.md
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-05-12 19:21:06 -0700
Commit: 9cf9f18, github.com/apache/spark/pull/698
L-BFGS Documentation
DB Tsai <dbtsai@alpinenow.com>
2014-05-12 19:20:24 -0700
Commit: 5c2275d, github.com/apache/spark/pull/702
Typo: resond -> respond
Andrew Ash <andrew@andrewash.com>
2014-05-12 18:46:28 -0700
Commit: a5150d1, github.com/apache/spark/pull/743
[SQL] Make Hive Metastore conversion functions publicly visible.
Michael Armbrust <michael@databricks.com>
2014-05-12 18:40:30 -0700
Commit: 2f1a337, github.com/apache/spark/pull/750
Adding hadoop-2.2 profile to the build
Patrick Wendell <pwendell@gmail.com>
2014-05-12 15:40:48 -0700
Commit: 3e13b8c
[SPARK-1736] Spark submit for Windows
Andrew Or <andrewor14@gmail.com>
2014-05-12 17:39:40 -0700
Commit: beb9cba, github.com/apache/spark/pull/745
SPARK-1802. (Addendium) Audit dependency graph when Spark is built with -Pyarn
Sean Owen <sowen@cloudera.com>
2014-05-12 17:35:29 -0700
Commit: 4b31f4e, github.com/apache/spark/pull/746
SPARK-1623: Use File objects instead of String's in HTTPBroadcast
Patrick Wendell <pwendell@gmail.com>
2014-05-12 17:27:28 -0700
Commit: 925d8b2, github.com/apache/spark/pull/749
Rename testExecutorEnvs --> executorEnvs.
Patrick Wendell <pwendell@gmail.com>
2014-05-12 17:09:13 -0700
Commit: 3ce526b, github.com/apache/spark/pull/747
SPARK-1802. Audit dependency graph when Spark is built with -Phive
Sean Owen <sowen@cloudera.com>
2014-05-12 14:17:25 -0700
Commit: 8586bf5, github.com/apache/spark/pull/744
SPARK-1798. Tests should clean up temp files
Sean Owen <sowen@cloudera.com>
2014-05-12 14:16:19 -0700
Commit: 7120a29, github.com/apache/spark/pull/732
BUILD: Include Hive with default packages when creating a release
Patrick Wendell <pwendell@gmail.com>
2014-05-12 13:20:23 -0700
Commit: 1e4a65e
SPARK-1786: Reopening PR 724
Ankur Dave <ankurdave@gmail.com>, Joseph E. Gonzalez <joseph.e.gonzalez@gmail.com>
2014-05-12 13:05:24 -0700
Commit: 0e2bde2, github.com/apache/spark/pull/742
SPARK-1806: Upgrade Mesos dependency to 0.18.1
Bernardo Gomez Palacio <bernardo.gomezpalacio@gmail.com>
2014-05-12 11:10:28 -0700
Commit: d9c97ba, github.com/apache/spark/pull/741
SPARK-1772 Stop catching Throwable, let Executors die
Aaron Davidson <aaron@databricks.com>
2014-05-12 11:08:52 -0700
Commit: 3af1f38, github.com/apache/spark/pull/715
Revert "SPARK-1786: Edge Partition Serialization"
Patrick Wendell <pwendell@gmail.com>
2014-05-12 10:49:03 -0700
Commit: af15c82
SPARK-1786: Edge Partition Serialization
Ankur Dave <ankurdave@gmail.com>, Joseph E. Gonzalez <joseph.e.gonzalez@gmail.com>
2014-05-11 19:20:42 -0700
Commit: a6b02fb, github.com/apache/spark/pull/724
Fix error in 2d Graph Partitioner
Joseph E. Gonzalez <joseph.e.gonzalez@gmail.com>
2014-05-11 18:33:46 -0700
Commit: f938a15, github.com/apache/spark/pull/709
SPARK-1652: Set driver memory correctly in spark-submit.
Patrick Wendell <pwendell@gmail.com>
2014-05-11 18:17:34 -0700
Commit: 05c9aa9, github.com/apache/spark/pull/730
SPARK-1770: Load balance elements when repartitioning.
Patrick Wendell <pwendell@gmail.com>
2014-05-11 17:11:55 -0700
Commit: 7d9cc92, github.com/apache/spark/pull/727
remove outdated runtime Information scala home
witgo <witgo@qq.com>
2014-05-11 14:34:27 -0700
Commit: 6bee01d, github.com/apache/spark/pull/728
Enabled incremental build that comes with sbt 0.13.2
Prashant Sharma <prashant.s@imaginea.com>
2014-05-10 21:08:04 -0700
Commit: 70bcdef, github.com/apache/spark/pull/525
[SPARK-1774] Respect SparkSubmit --jars on YARN (client)
Andrew Or <andrewor14@gmail.com>
2014-05-10 20:58:02 -0700
Commit: 83e0424, github.com/apache/spark/pull/710
SPARK-1789. Multiple versions of Netty dependencies cause FlumeStreamSuite failure
Sean Owen <sowen@cloudera.com>
2014-05-10 20:50:40 -0700
Commit: 2b7bd29, github.com/apache/spark/pull/723
Unify GraphImpl RDDs + other graph load optimizations
Ankur Dave <ankurdave@gmail.com>
2014-05-10 14:48:07 -0700
Commit: 905173d, github.com/apache/spark/pull/497
[SPARK-1690] Tolerating empty elements when saving Python RDD to text files
Kan Zhang <kzhang@apache.org>
2014-05-10 14:01:08 -0700
Commit: 6c2691d, github.com/apache/spark/pull/644
Add Python includes to path before depickling broadcast values
Bouke van der Bijl <boukevanderbijl@gmail.com>
2014-05-10 13:02:13 -0700
Commit: 3776f2f, github.com/apache/spark/pull/656
fix broken in link in python docs
Andy Konwinski <andykonwinski@gmail.com>
2014-05-10 12:46:51 -0700
Commit: c05d11b, github.com/apache/spark/pull/650
SPARK-1708. Add a ClassTag on Serializer and things that depend on it
Matei Zaharia <matei@databricks.com>
2014-05-10 12:10:24 -0700
Commit: 7eefc9d, github.com/apache/spark/pull/700
[SPARK-1778] [SQL] Add 'limit' transformation to SchemaRDD.
Takuya UESHIN <ueshin@happy-camper.st>
2014-05-10 12:03:27 -0700
Commit: 8e94d27, github.com/apache/spark/pull/711
[SQL] Upgrade parquet library.
Michael Armbrust <michael@databricks.com>
2014-05-10 11:48:01 -0700
Commit: 4d60553, github.com/apache/spark/pull/684
[SPARK-1644] The org.datanucleus:* should not be packaged into spark-assembly-*.jar
witgo <witgo@qq.com>
2014-05-10 10:15:04 -0700
Commit: 5615108, github.com/apache/spark/pull/688
SPARK-1686: keep schedule() calling in the main thread
CodingCat <zhunansjtu@gmail.com>
2014-05-09 21:50:23 -0700
Commit: 2f452cb, github.com/apache/spark/pull/639
SPARK-1770: Revert accidental(?) fix
Aaron Davidson <aaron@databricks.com>
2014-05-09 14:51:34 -0700
Commit: 59577df, github.com/apache/spark/pull/716
[SPARK-1760]: fix building spark with maven documentation
witgo <witgo@qq.com>
2014-05-09 01:51:26 -0700
Commit: bd67551, github.com/apache/spark/pull/712
Converted bang to ask to avoid scary warning when a block is removed
Tathagata Das <tathagata.das1565@gmail.com>
2014-05-08 22:34:08 -0700
Commit: 32868f3, github.com/apache/spark/pull/708
MINOR: Removing dead code.
Patrick Wendell <pwendell@gmail.com>
2014-05-08 22:33:06 -0700
Commit: 4c60fd1
SPARK-1775: Unneeded lock in ShuffleMapTask.deserializeInfo
Sandeep <sandeep@techaddict.me>
2014-05-08 22:30:17 -0700
Commit: 7db47c4, github.com/apache/spark/pull/707
SPARK-1565 (Addendum): Replace `run-example` with `spark-submit`.
Patrick Wendell <pwendell@gmail.com>
2014-05-08 22:26:17 -0700
Commit: 06b15ba, github.com/apache/spark/pull/704
[SPARK-1631] Correctly set the Yarn app name when launching the AM.
Marcelo Vanzin <vanzin@cloudera.com>
2014-05-08 20:46:11 -0700
Commit: 3f779d8, github.com/apache/spark/pull/539
[SPARK-1755] Respect SparkSubmit --name on YARN
Andrew Or <andrewor14@gmail.com>
2014-05-08 20:45:29 -0700
Commit: 8b78412, github.com/apache/spark/pull/699
Include the sbin/spark-config.sh in spark-executor
Bouke van der Bijl <boukevanderbijl@gmail.com>
2014-05-08 20:43:37 -0700
Commit: 2fd2752, github.com/apache/spark/pull/651
Bug fix of sparse vector conversion
Funes <tianshaocun@gmail.com>, funes <tianshaocun@gmail.com>
2014-05-08 17:54:10 -0700
Commit: 191279c, github.com/apache/spark/pull/661
[SPARK-1157][MLlib] Bug fix: lossHistory should exclude rejection steps, and remove miniBatch
DB Tsai <dbtsai@alpinenow.com>
2014-05-08 17:53:22 -0700
Commit: 910a13b, github.com/apache/spark/pull/582
MLlib documentation fix
DB Tsai <dbtsai@alpinenow.com>
2014-05-08 17:52:32 -0700
Commit: d38febe, github.com/apache/spark/pull/703
[SPARK-1754] [SQL] Add missing arithmetic DSL operations.
Takuya UESHIN <ueshin@happy-camper.st>
2014-05-08 15:31:47 -0700
Commit: 322b180, github.com/apache/spark/pull/689
Fixing typo in als.py
Evan Sparks <evan.sparks@gmail.com>
2014-05-08 13:07:30 -0700
Commit: 5c5e7d5, github.com/apache/spark/pull/696
[SPARK-1745] Move interrupted flag from TaskContext constructor (minor)
Andrew Or <andrewor14@gmail.com>
2014-05-08 12:13:07 -0700
Commit: c3f8b78, github.com/apache/spark/pull/675
SPARK-1565, update examples to be used with spark-submit script.
Prashant Sharma <prashant.s@imaginea.com>
2014-05-08 10:23:05 -0700
Commit: 44dd57f, github.com/apache/spark/pull/552
[SQL] Improve SparkSQL Aggregates
Michael Armbrust <michael@databricks.com>
2014-05-08 01:08:43 -0400
Commit: 19c8fb0, github.com/apache/spark/pull/683
Use numpy directly for matrix multiply.
Evan Sparks <evan.sparks@gmail.com>
2014-05-08 00:24:36 -0400
Commit: 6ed7e2c, github.com/apache/spark/pull/687
SPARK-1668: Add implicit preference as an option to examples/MovieLensALS
Sandeep <sandeep@techaddict.me>
2014-05-08 00:15:05 -0400
Commit: 108c4c1, github.com/apache/spark/pull/597
SPARK-1544 Add support for deep decision trees.
Manish Amde <manish9ue@gmail.com>, manishamde <manish9ue@gmail.com>, Evan Sparks <sparks@cs.berkeley.edu>
2014-05-07 17:08:38 -0700
Commit: f269b01, github.com/apache/spark/pull/475
Update GradientDescentSuite.scala
baishuo(白硕) <vc_java@hotmail.com>
2014-05-07 16:02:55 -0700
Commit: 0c19bb1, github.com/apache/spark/pull/588
[SPARK-1743][MLLIB] add loadLibSVMFile and saveAsLibSVMFile to pyspark
Xiangrui Meng <meng@databricks.com>
2014-05-07 16:01:11 -0700
Commit: 3188553, github.com/apache/spark/pull/672
SPARK-1569 Spark on Yarn, authentication broken by pr299
Thomas Graves <tgraves@apache.org>
2014-05-07 15:51:53 -0700
Commit: 4bec84b, github.com/apache/spark/pull/649
[SPARK-1688] Propagate PySpark worker stderr to driver
Andrew Or <andrewor14@gmail.com>
2014-05-07 14:35:22 -0700
Commit: 5200872, github.com/apache/spark/pull/603
Typo fix: fetchting -> fetching
Andrew Ash <andrew@andrewash.com>
2014-05-07 17:24:49 -0400
Commit: d00981a, github.com/apache/spark/pull/680
Nicer logging for SecurityManager startup
Andrew Ash <andrew@andrewash.com>
2014-05-07 17:24:12 -0400
Commit: 7f6f4a1, github.com/apache/spark/pull/678
[SQL] Fix Performance Issue in data type casting
Cheng Hao <hao.cheng@intel.com>
2014-05-07 16:54:58 -0400
Commit: ca43186, github.com/apache/spark/pull/679
SPARK-1579: Clean up PythonRDD and avoid swallowing IOExceptions
Aaron Davidson <aaron@databricks.com>
2014-05-07 09:48:31 -0700
Commit: 3308722, github.com/apache/spark/pull/640
[SPARK-1460] Returning SchemaRDD instead of normal RDD on Set operations...
Kan Zhang <kzhang@apache.org>
2014-05-07 09:41:31 -0700
Commit: 967635a, github.com/apache/spark/pull/448
[WIP][Spark-SQL] Optimize the Constant Folding for Expression
Cheng Hao <hao.cheng@intel.com>, Michael Armbrust <michael@databricks.com>
2014-05-07 03:37:12 -0400
Commit: 3eb53bd, github.com/apache/spark/pull/482
SPARK-1746: Support setting SPARK_JAVA_OPTS on executors for backwards compatibility
Patrick Wendell <pwendell@gmail.com>
2014-05-07 00:11:05 -0700
Commit: 913a0a9, github.com/apache/spark/pull/676
[HOTFIX] SPARK-1637: There are some Streaming examples added after the PR #571 was last updated.
Sandeep <sandeep@techaddict.me>
2014-05-06 21:55:05 -0700
Commit: fdae095, github.com/apache/spark/pull/673
Proposal: clarify Scala programming guide on caching ...
Ethan Jewett <esjewett@gmail.com>
2014-05-06 20:50:08 -0700
Commit: 48ba3b8, github.com/apache/spark/pull/668
SPARK-1727. Correct small compile errors, typos, and markdown issues in (primarly) MLlib docs
Sean Owen <sowen@cloudera.com>
2014-05-06 20:07:22 -0700
Commit: 25ad8f9, github.com/apache/spark/pull/653
SPARK-1637: Clean up examples for 1.0
Sandeep <sandeep@techaddict.me>
2014-05-06 17:27:52 -0700
Commit: a000b5c, github.com/apache/spark/pull/571
SPARK-1737: Warn rather than fail when Java 7+ is used to create distributions
Patrick Wendell <pwendell@gmail.com>
2014-05-06 15:41:46 -0700
Commit: 39b8b14, github.com/apache/spark/pull/669
[SPARK-1549] Add Python support to spark-submit
Matei Zaharia <matei@databricks.com>
2014-05-06 15:12:35 -0700
Commit: 951a5d9, github.com/apache/spark/pull/664
SPARK-1734: spark-submit throws an exception: Exception in thread "main"...
witgo <witgo@qq.com>
2014-05-06 14:17:39 -0700
Commit: ec09acd, github.com/apache/spark/pull/665
[SPARK-1685] Cancel retryTimer on restart of Worker or AppClient
Mark Hamstra <markhamstra@gmail.com>
2014-05-06 12:53:39 -0700
Commit: fbfe69d, github.com/apache/spark/pull/602
Fix two download suggestions in the docs:
Patrick Wendell <pwendell@gmail.com>
2014-05-06 12:07:46 -0700
Commit: 7b978c1, github.com/apache/spark/pull/662
SPARK-1474: Spark on yarn assembly doesn't include AmIpFilter
Thomas Graves <tgraves@apache.org>
2014-05-06 12:00:09 -0700
Commit: 1e82990, github.com/apache/spark/pull/406
Update OpenHashSet.scala
ArcherShao <ArcherShao@users.noreply.github.com>
2014-05-06 10:12:59 -0700
Commit: 0a5a468, github.com/apache/spark/pull/667
[SQL] SPARK-1732 - Support for null primitive values.
Michael Armbrust <michael@databricks.com>
2014-05-05 22:59:42 -0700
Commit: 3c64750, github.com/apache/spark/pull/658
[SPARK-1735] Add the missing special profiles to make-distribution.sh
Andrew Or <andrewor14@gmail.com>
2014-05-05 22:14:47 -0700
Commit: a2262cd, github.com/apache/spark/pull/660
[SPARK-1678][SPARK-1679] In-memory compression bug fix and made compression configurable, disabled by default
Cheng Lian <lian.cs.zju@gmail.com>
2014-05-05 19:38:59 -0700
Commit: 6d721c5, github.com/apache/spark/pull/608
[SPARK-1594][MLLIB] Cleaning up MLlib APIs and guide
Xiangrui Meng <meng@databricks.com>
2014-05-05 18:32:54 -0700
Commit: 98750a7, github.com/apache/spark/pull/524
Expose SparkListeners and relevant classes as DeveloperApi
Andrew Or <andrewor14@gmail.com>
2014-05-05 18:32:14 -0700
Commit: ea10b31, github.com/apache/spark/pull/648
SPARK-1728. JavaRDDLike.mapPartitionsWithIndex requires ClassTag
Sandy Ryza <sandy@cloudera.com>
2014-05-05 18:26:34 -0700
Commit: 8e724dc, github.com/apache/spark/pull/657
[SPARK-1681] Include datanucleus jars in Spark Hive distribution
Andrew Or <andrewor14@gmail.com>
2014-05-05 16:28:07 -0700
Commit: cf0a8f0, github.com/apache/spark/pull/610
[SPARK-1504], [SPARK-1505], [SPARK-1558] Updated Spark Streaming guide
Tathagata Das <tathagata.das1565@gmail.com>
2014-05-05 15:28:19 -0700
Commit: a975a19, github.com/apache/spark/pull/652
SPARK-1721: Reset the thread classLoader in the Mesos Executor
Bouke van der Bijl <boukevanderbijl@gmail.com>
2014-05-05 11:19:35 -0700
Commit: 3292e2a, github.com/apache/spark/pull/620
SPARK-1556. jets3t dep doesn't update properly with newer Hadoop versions
Sean Owen <sowen@cloudera.com>
2014-05-05 10:33:49 -0700
Commit: 73b0cbc, github.com/apache/spark/pull/629
Updated doc for spark.closure.serializer to indicate only Java serializer work.
Reynold Xin <rxin@apache.org>
2014-05-05 00:52:06 -0700
Commit: f2eb070, github.com/apache/spark/pull/642
Address SPARK-1717
msiddalingaiah <madhu@madhu.com>
2014-05-04 21:59:10 -0700
Commit: bb2bb0c, github.com/apache/spark/pull/641
SPARK-1710: spark-submit should print better errors than "InvocationTargetException"
Sandeep <sandeep@techaddict.me>
2014-05-04 20:51:53 -0700
Commit: b48a55a, github.com/apache/spark/pull/630
EC2 script should exit with non-zero code on UsageError
Allan Douglas R. de Oliveira <allan@chaordicsystems.com>
2014-05-04 20:36:51 -0700
Commit: bcb9b7f, github.com/apache/spark/pull/638
SPARK-1693: Most of the tests throw a java.lang.SecurityException when s...
witgo <witgo@qq.com>
2014-05-04 17:48:52 -0700
Commit: d940e4c, github.com/apache/spark/pull/628
SPARK-1629. Addendum: Depend on commons lang3 (already used by tachyon) as it's used in ReplSuite, and return to use lang3 utility in Utils.scala
Sean Owen <sowen@cloudera.com>
2014-05-04 17:43:28 -0700
Commit: f504157, github.com/apache/spark/pull/635
SPARK-1703 Warn users if Spark is run on JRE6 but compiled with JDK7.
Patrick Wendell <pwendell@gmail.com>
2014-05-04 12:22:23 -0700
Commit: 0c98a8f, github.com/apache/spark/pull/627
SPARK-1663. (Addendum) Fix signature of one version of JavaPairRDDStream.reduceByKeyAndWindow()
Sean Owen <sowen@cloudera.com>
2014-05-04 11:55:29 -0700
Commit: 0088ced, github.com/apache/spark/pull/633
SPARK-1658: Correctly identify if maven is installed and working
Rahul Singhal <rahul.singhal@guavus.com>
2014-05-04 11:08:39 -0700
Commit: e97a2e6, github.com/apache/spark/pull/580
The default version of yarn is equal to the hadoop version
witgo <witgo@qq.com>
2014-05-03 23:32:12 -0700
Commit: fb05432, github.com/apache/spark/pull/626
Whitelist Hive Tests
Michael Armbrust <michael@databricks.com>
2014-05-03 23:13:51 -0700
Commit: 92b2902, github.com/apache/spark/pull/596
[SQL] Better logging when applying rules.
Michael Armbrust <michael@databricks.com>
2014-05-03 18:38:44 -0700
Commit: b295714, github.com/apache/spark/pull/616
EC2 configurable workers
Allan Douglas R. de Oliveira <allan@chaordicsystems.com>
2014-05-03 16:52:19 -0700
Commit: 4669a84, github.com/apache/spark/pull/612
SPARK-1689 AppClient should indicate app is dead() when removed
Aaron Davidson <aaron@databricks.com>
2014-05-03 13:27:10 -0700
Commit: 34719ba, github.com/apache/spark/pull/605
[Bugfix] Tachyon file cleanup logical error
Cheng Lian <lian.cs.zju@gmail.com>
2014-05-03 13:23:52 -0700
Commit: ce72c72, github.com/apache/spark/pull/575
SPARK-1663. Corrections for several compile errors in streaming code examples, and updates to follow API changes
Sean Owen <sowen@cloudera.com>
2014-05-03 12:31:31 -0700
Commit: 11d5494, github.com/apache/spark/pull/589
[WIP] SPARK-1676: Cache Hadoop UGIs by default to prevent FileSystem leak
Thomas Graves <tgraves@apache.org>
2014-05-03 10:59:05 -0700
Commit: 3d0a02d, github.com/apache/spark/pull/621
Update SchemaRDD.scala
ArcherShao <ArcherShao@users.noreply.github.com>
2014-05-03 00:17:36 -0700
Commit: 9347565, github.com/apache/spark/pull/619
SPARK-1700: Close socket file descriptors on task completion
Aaron Davidson <aaron@databricks.com>
2014-05-02 23:55:13 -0700
Commit: 0a14421, github.com/apache/spark/pull/623
SPARK-1492. Update Spark YARN docs to use spark-submit
Sandy Ryza <sandy@cloudera.com>
2014-05-02 21:42:31 -0700
Commit: 2b961d8, github.com/apache/spark/pull/601
delete no use var
wangfei <wangfei_hello@126.com>
2014-05-02 21:34:54 -0700
Commit: 4bf24f7, github.com/apache/spark/pull/613
SPARK-1695: java8-tests compiler error: package com.google.common.co...
witgo <witgo@qq.com>
2014-05-02 12:40:27 -0700
Commit: f25ebed, github.com/apache/spark/pull/611
Add tests for FileLogger, EventLoggingListener, and ReplayListenerBus
Andrew Or <andrewor14@gmail.com>
2014-05-01 21:42:06 -0700
Commit: 394d8cb, github.com/apache/spark/pull/591
SPARK-1659: improvements spark-submit usage
witgo <witgo@qq.com>
2014-05-01 21:39:40 -0700
Commit: 40cf6d3, github.com/apache/spark/pull/581
fix the spelling mistake
wangfei <wangfei_hello@126.com>
2014-05-01 21:37:22 -0700
Commit: 55c760f, github.com/apache/spark/pull/614
[SQL] SPARK-1661 - Fix regex_serde test
Michael Armbrust <michael@databricks.com>
2014-05-01 21:32:43 -0700
Commit: a43d9c1, github.com/apache/spark/pull/595
SPARK-1691: Support quoted arguments inside of spark-submit.
Patrick Wendell <pwendell@gmail.com>
2014-05-01 01:15:51 -0700
Commit: 98b6559, github.com/apache/spark/pull/609
Fix SPARK-1629: Spark should inline use of commons-lang `SystemUtils.IS_...
witgo <witgo@qq.com>
2014-04-30 09:49:45 -0700
Commit: 55100da, github.com/apache/spark/pull/569
SPARK-1004. PySpark on YARN
Sandy Ryza <sandy@cloudera.com>
2014-04-29 23:24:34 -0700
Commit: ff5be9a, github.com/apache/spark/pull/30
Handle the vals that never used
WangTao <barneystinson@aliyun.com>
2014-04-29 22:07:20 -0700
Commit: 7025dda, github.com/apache/spark/pull/565
Args for worker rather than master
Chen Chao <crazyjvm@gmail.com>
2014-04-29 22:05:40 -0700
Commit: b3d2ab6, github.com/apache/spark/pull/587
[SPARK-1646] Micro-optimisation of ALS
Tor Myklebust <tmyklebu@gmail.com>
2014-04-29 22:04:34 -0700
Commit: 5c0cd5c, github.com/apache/spark/pull/568
[SPARK-1674] fix interrupted system call error in pyspark's RDD.pipe
Xiangrui Meng <meng@databricks.com>
2014-04-29 18:06:45 -0700
Commit: d33df1c, github.com/apache/spark/pull/594
SPARK-1588. Restore SPARK_YARN_USER_ENV and SPARK_JAVA_OPTS for YARN.
Sandy Ryza <sandy@cloudera.com>
2014-04-29 12:54:02 -0700
Commit: bf8d0aa, github.com/apache/spark/pull/586
SPARK-1509: add zipWithIndex zipWithUniqueId methods to java api
witgo <witgo@qq.com>
2014-04-29 11:30:47 -0700
Commit: 7d15058, github.com/apache/spark/pull/423
SPARK-1557 Set permissions on event log files/directories
Thomas Graves <tgraves@apache.org>
2014-04-29 09:19:48 -0500
Commit: 8db0f7e, github.com/apache/spark/pull/538
HOTFIX: minor change to release script
Patrick Wendell <pwendell@gmail.com>
2014-04-29 00:59:38 -0700
Commit: 9a1184a
HOTFIX: minor change to release script
Patrick Wendell <pwendell@gmail.com>
2014-04-29 00:53:32 -0700
Commit: f04bcaf
[SPARK-1636][MLLIB] Move main methods to examples
Xiangrui Meng <meng@databricks.com>
2014-04-29 00:41:03 -0700
Commit: 3f38334, github.com/apache/spark/pull/584
Minor fix to python table caching API.
Michael Armbrust <michael@databricks.com>
2014-04-29 00:36:15 -0700
Commit: 497be3c, github.com/apache/spark/pull/585
HOTFIX: Bug in release script
Patrick Wendell <pwendell@gmail.com>
2014-04-29 00:10:17 -0700
Commit: 719c8bc
Improved build configuration
witgo <witgo@qq.com>
2014-04-28 22:50:51 -0700
Commit: 030f2c2, github.com/apache/spark/pull/480
SPARK-1652: Remove incorrect deprecation warning in spark-submit
Patrick Wendell <pwendell@gmail.com>
2014-04-28 18:14:59 -0700
Commit: 9f7a095, github.com/apache/spark/pull/578
SPARK-1654 and SPARK-1653: Fixes in spark-submit.
Patrick Wendell <pwendell@gmail.com>
2014-04-28 17:29:22 -0700
Commit: 949e393, github.com/apache/spark/pull/576
SPARK-1652: Spark submit should fail gracefully if YARN not enabled
Patrick Wendell <pwendell@gmail.com>
2014-04-28 17:26:57 -0700
Commit: cae054a, github.com/apache/spark/pull/579
Changes to dev release script
Patrick Wendell <pwendell@gmail.com>
2014-04-28 13:58:42 -0700
Commit: 8421034
[SPARK-1633][Streaming] Java API unit test and example for custom streaming receiver in Java
Tathagata Das <tathagata.das1565@gmail.com>
2014-04-28 13:58:09 -0700
Commit: 1d84964, github.com/apache/spark/pull/558
[SQL]Append some missing types for HiveUDF
Cheng Hao <hao.cheng@intel.com>
2014-04-27 23:59:42 -0700
Commit: f735884, github.com/apache/spark/pull/459
Update the import package name for TestHive in sbt shell
Cheng Hao <hao.cheng@intel.com>
2014-04-27 23:57:29 -0700
Commit: ea01aff, github.com/apache/spark/pull/574
Fix SPARK-1609: Executor fails to start when Command.extraJavaOptions contains multiple Java options
witgo <witgo@qq.com>
2014-04-27 19:41:02 -0700
Commit: 71f4d26, github.com/apache/spark/pull/547
SPARK-1145: Memory mapping with many small blocks can cause JVM allocation failures
Patrick Wendell <pwendell@gmail.com>
2014-04-27 17:40:56 -0700
Commit: 6b3c6e5, github.com/apache/spark/pull/43
HOTFIX: Minor patch to merge script.
Patrick Wendell <pwendell@gmail.com>
2014-04-27 15:45:17 -0700
Commit: 3d9fb09
SPARK-1651: Delete existing deployment directory
Rahul Singhal <rahul.singhal@guavus.com>
2014-04-27 15:50:48 -0700
Commit: eefb90d, github.com/apache/spark/pull/573
SPARK-1648 Support closing JIRA's as part of merge script.
Patrick Wendell <pwendell@gmail.com>
2014-04-27 15:41:57 -0700
Commit: fe65bee, github.com/apache/spark/pull/570
SPARK-1650: Correctly identify maven project version
Rahul Singhal <rahul.singhal@guavus.com>
2014-04-27 15:17:06 -0700
Commit: 7b2527d, github.com/apache/spark/pull/572
SPARK-1606: Infer user application arguments instead of requiring --arg.
Patrick Wendell <pwendell@gmail.com>
2014-04-26 19:24:29 -0700
Commit: aa9a7f5, github.com/apache/spark/pull/563
SPARK-1467: Make StorageLevel.apply() factory methods Developer APIs
Sandeep <sandeep@techaddict.me>
2014-04-26 19:04:33 -0700
Commit: 762af4e, github.com/apache/spark/pull/551
[SPARK-1608] [SQL] Fix Cast.nullable when cast from StringType to NumericType/TimestampType.
Takuya UESHIN <ueshin@happy-camper.st>
2014-04-26 14:39:54 -0700
Commit: 8e37ed6, github.com/apache/spark/pull/532
add note of how to support table with more than 22 fields
wangfei <wangfei1@huawei.com>
2014-04-26 14:38:42 -0700
Commit: e6e44e4, github.com/apache/spark/pull/564
[Spark-1382] Fix NPE in DStream.slice (updated version of #365)
zsxwing <zsxwing@gmail.com>, Tathagata Das <tathagata.das1565@gmail.com>
2014-04-25 19:04:34 -0700
Commit: 058797c, github.com/apache/spark/pull/562
SPARK-1632. Remove unnecessary boxing in compares in ExternalAppendOnlyM...
Sandy Ryza <sandy@cloudera.com>
2014-04-25 17:55:04 -0700
Commit: 87cf35c, github.com/apache/spark/pull/559
SPARK-1235: manage the DAGScheduler EventProcessActor with supervisor and refactor the DAGScheduler with Akka
CodingCat <zhunansjtu@gmail.com>, Xiangrui Meng <meng@databricks.com>, Nan Zhu <CodingCat@users.noreply.github.com>
2014-04-25 16:04:48 -0700
Commit: 027f1b8, github.com/apache/spark/pull/186
SPARK-1607. HOTFIX: Fix syntax adapting Int result to Short
Sean Owen <sowen@cloudera.com>
2014-04-25 14:17:38 -0700
Commit: df6d814, github.com/apache/spark/pull/556
Update KafkaWordCount.scala
baishuo(白硕) <vc_java@hotmail.com>
2014-04-25 13:18:49 -0700
Commit: 8aaef5c, github.com/apache/spark/pull/523
Delete the val that never used
WangTao <barneystinson@aliyun.com>
2014-04-25 11:47:01 -0700
Commit: 25a276d, github.com/apache/spark/pull/553
SPARK-1621 Upgrade Chill to 0.3.6
Matei Zaharia <matei@databricks.com>
2014-04-25 11:12:41 -0700
Commit: a24d918, github.com/apache/spark/pull/543
SPARK-1619 Launch spark-shell with spark-submit
Patrick Wendell <pwendell@gmail.com>
2014-04-24 23:59:16 -0700
Commit: dc3b640, github.com/apache/spark/pull/542
SPARK-1607. Replace octal literals, removed in Scala 2.11, with hex literals
Sean Owen <sowen@cloudera.com>
2014-04-24 23:34:00 -0700
Commit: 6e101f1, github.com/apache/spark/pull/529
Call correct stop().
Aaron Davidson <aaron@databricks.com>
2014-04-24 23:22:03 -0700
Commit: 45ad7f0, github.com/apache/spark/pull/527
SPARK-1242 Add aggregate to python rdd
Holden Karau <holden@pigscanfly.ca>
2014-04-24 23:07:54 -0700
Commit: e03bc37, github.com/apache/spark/pull/139
Fix [SPARK-1078]: Remove the Unnecessary lift-json dependency
Sandeep <sandeep@techaddict.me>
2014-04-24 21:51:52 -0700
Commit: 095b518, github.com/apache/spark/pull/536
[Typo] In the maven docs: chd -> cdh
Andrew Or <andrewor14@gmail.com>
2014-04-24 21:51:17 -0700
Commit: 06e82d9, github.com/apache/spark/pull/548
Generalize pattern for planning hash joins.
Michael Armbrust <michael@databricks.com>
2014-04-24 21:42:33 -0700
Commit: 86ff8b1, github.com/apache/spark/pull/418
[SPARK-1617] and [SPARK-1618] Improvements to streaming ui and bug fix to socket receiver
Tathagata Das <tathagata.das1565@gmail.com>
2014-04-24 21:34:37 -0700
Commit: cd12dd9, github.com/apache/spark/pull/540
SPARK-1586 Windows build fixes
Mridul Muralidharan <mridulm80@apache.org>
2014-04-24 20:48:33 -0700
Commit: 968c018, github.com/apache/spark/pull/505
SPARK-1584: Upgrade Flume dependency to 1.4.0
tmalaska <ted.malaska@cloudera.com>
2014-04-24 20:31:17 -0700
Commit: d5c6ae6, github.com/apache/spark/pull/507
[SPARK-986]: Job cancelation for PySpark
Ahir Reddy <ahirreddy@gmail.com>
2014-04-24 20:21:10 -0700
Commit: e53eb4f, github.com/apache/spark/pull/541
[SPARK-1615] Synchronize accesses to the LiveListenerBus' event queue
Andrew Or <andrewor14@gmail.com>
2014-04-24 20:18:15 -0700
Commit: ee6f7e2, github.com/apache/spark/pull/544
[SPARK-1510] Spark Streaming metrics source for metrics system
jerryshao <saisai.shao@intel.com>, Tathagata Das <tathagata.das1565@gmail.com>
2014-04-24 18:56:57 -0700
Commit: 80429f3, github.com/apache/spark/pull/545
Spark 1489 Fix the HistoryServer view acls
Thomas Graves <tgraves@apache.org>
2014-04-24 18:38:10 -0700
Commit: 44da5ab, github.com/apache/spark/pull/509
[SQL] Add support for parsing indexing into arrays in SQL.
Michael Armbrust <michael@databricks.com>
2014-04-24 18:21:00 -0700
Commit: 4660991, github.com/apache/spark/pull/518
[SPARK-1592][streaming] Automatically remove streaming input blocks
Tathagata Das <tathagata.das1565@gmail.com>
2014-04-24 18:18:22 -0700
Commit: 526a518, github.com/apache/spark/pull/512
SPARK-1438 RDD.sample() make seed param optional
Arun Ramakrishnan <smartnut007@gmail.com>
2014-04-24 17:27:16 -0700
Commit: 35e3d19, github.com/apache/spark/pull/477
SPARK-1104: kill Process in workerThread of ExecutorRunner
CodingCat <zhunansjtu@gmail.com>
2014-04-24 15:55:18 -0700
Commit: f99af85, github.com/apache/spark/pull/35
Fix Scala Style
Sandeep <sandeep@techaddict.me>
2014-04-24 15:07:23 -0700
Commit: a03ac22, github.com/apache/spark/pull/531
SPARK-1494 Don't initialize classes loaded by MIMA excludes, attempt 2
Michael Armbrust <michael@databricks.com>
2014-04-24 14:54:01 -0700
Commit: c5c1916, github.com/apache/spark/pull/526
Spark 1490 Add kerberos support to the HistoryServer
Thomas Graves <tgraves@apache.org>
2014-04-24 11:15:12 -0700
Commit: bd37509, github.com/apache/spark/pull/513
SPARK-1611: Fix incorrect initialization order in AppendOnlyMap
zsxwing <zsxwing@gmail.com>
2014-04-24 11:13:40 -0700
Commit: 78a49b2, github.com/apache/spark/pull/534
SPARK-1488. Squash more language feature warnings in new commits by importing implicitConversion
Sean Owen <sowen@cloudera.com>
2014-04-24 10:06:18 -0700
Commit: 6338a93, github.com/apache/spark/pull/528
Small changes to release script
Patrick Wendell <pwendell@gmail.com>
2014-04-24 09:59:44 -0700
Commit: faeb761
[SPARK-1610] [SQL] Fix Cast to use exact type value when cast from BooleanType to NumericTy...
Takuya UESHIN <ueshin@happy-camper.st>
2014-04-24 09:57:28 -0700
Commit: 27b2821, github.com/apache/spark/pull/533
SPARK-1601 & SPARK-1602: two bug fixes related to cancellation
Reynold Xin <rxin@apache.org>
2014-04-24 00:27:45 -0700
Commit: 1fdf659, github.com/apache/spark/pull/521
SPARK-1587 Fix thread leak
Mridul Muralidharan <mridulm80@apache.org>
2014-04-23 23:20:55 -0700
Commit: dd681f5, github.com/apache/spark/pull/504
[Fix #79] Replace Breakable For Loops By While Loops
Sandeep <sandeep@techaddict.me>
2014-04-23 22:47:59 -0700
Commit: bb68f47, github.com/apache/spark/pull/503
SPARK-1589: Fix the incorrect compare
zsxwing <zsxwing@gmail.com>
2014-04-23 22:36:02 -0700
Commit: 6ab7578, github.com/apache/spark/pull/508
Mark all fields of EdgePartition, Graph, and GraphOps transient
Ankur Dave <ankurdave@gmail.com>
2014-04-23 22:01:13 -0700
Commit: 1d6abe3, github.com/apache/spark/pull/520
Update Java api for setJobGroup with interruptOnCancel
Aaron Davidson <aaron@databricks.com>
2014-04-23 22:00:22 -0700
Commit: d485eec, github.com/apache/spark/pull/522
[Hot Fix #469] Fix flaky test in SparkListenerSuite
Andrew Or <andrewor14@gmail.com>
2014-04-23 21:59:33 -0700
Commit: 4b2bab1, github.com/apache/spark/pull/516
[SPARK-1540] Add an optional Ordering parameter to PairRDDFunctions.
Matei Zaharia <matei@databricks.com>
2014-04-23 17:03:54 -0700
Commit: 640f9a0, github.com/apache/spark/pull/487
SPARK-1582 Invoke Thread.interrupt() when cancelling jobs
Aaron Davidson <aaron@databricks.com>
2014-04-23 16:52:49 -0700
Commit: 432201c, github.com/apache/spark/pull/498
Honor default fs name when initializing event logger.
Marcelo Vanzin <vanzin@cloudera.com>
2014-04-23 14:47:38 -0700
Commit: dd1b7a6, github.com/apache/spark/pull/450
SPARK-1572 Don't kill Executor if PythonRDD fails while computing parent
Aaron Davidson <aaron@databricks.com>
2014-04-23 14:46:30 -0700
Commit: a967b00, github.com/apache/spark/pull/486
SPARK-1583: Fix a bug that using java.util.HashMap by mistake
zsxwing <zsxwing@gmail.com>
2014-04-23 14:12:20 -0700
Commit: a664606, github.com/apache/spark/pull/500
SPARK-1119 and other build improvements
Patrick Wendell <pwendell@gmail.com>
2014-04-23 10:19:32 -0700
Commit: cd4ed29, github.com/apache/spark/pull/502
[SQL] SPARK-1571 Mistake in java example code
Michael Armbrust <michael@databricks.com>
2014-04-22 22:19:32 -0700
Commit: 39f85e0, github.com/apache/spark/pull/496
SPARK-1494 Don't initialize classes loaded by MIMA excludes.
Michael Armbrust <michael@databricks.com>
2014-04-22 21:56:15 -0700
Commit: 8e95081, github.com/apache/spark/pull/494
SPARK-1562 Fix visibility / annotation of Spark SQL APIs
Michael Armbrust <michael@databricks.com>
2014-04-22 20:02:33 -0700
Commit: aa77f8a, github.com/apache/spark/pull/489
[FIX: SPARK-1376] use --arg instead of --args in SparkSubmit to avoid warning messages
Xiangrui Meng <meng@databricks.com>
2014-04-22 19:38:27 -0700
Commit: 662c860, github.com/apache/spark/pull/485
[streaming][SPARK-1578] Removed requirement for TTL in StreamingContext.
Tathagata Das <tathagata.das1565@gmail.com>
2014-04-22 19:35:13 -0700
Commit: f3d19a9, github.com/apache/spark/pull/491
[Spark-1538] Fix SparkUI incorrectly hiding persisted RDDs
Andrew Or <andrewor14@gmail.com>
2014-04-22 19:24:03 -0700
Commit: 2de5738, github.com/apache/spark/pull/469
Assorted clean-up for Spark-on-YARN.
Patrick Wendell <pwendell@gmail.com>
2014-04-22 19:22:06 -0700
Commit: 995fdc9, github.com/apache/spark/pull/488
[SPARK-1570] Fix classloading in JavaSQLContext.applySchema
Kan Zhang <kzhang@apache.org>
2014-04-22 15:05:12 -0700
Commit: ea8cea8, github.com/apache/spark/pull/484
Fix compilation on Hadoop 2.4.x.
Marcelo Vanzin <vanzin@cloudera.com>
2014-04-22 14:28:41 -0700
Commit: 0ea0b1a, github.com/apache/spark/pull/483
[Fix #204] Eliminate delay between binding and log checking
Andrew Or <andrewor14@gmail.com>
2014-04-22 14:27:49 -0700
Commit: 745e496, github.com/apache/spark/pull/441
[SPARK-1506][MLLIB] Documentation improvements for MLlib 1.0
Xiangrui Meng <meng@databricks.com>
2014-04-22 11:20:47 -0700
Commit: 26d35f3, github.com/apache/spark/pull/422
[SPARK-1281] Improve partitioning in ALS
Tor Myklebust <tmyklebu@gmail.com>
2014-04-22 11:07:30 -0700
Commit: bf9d49b, github.com/apache/spark/pull/407
fix bugs of dot in python
Xusen Yin <yinxusen@gmail.com>
2014-04-22 11:06:18 -0700
Commit: c919798, github.com/apache/spark/pull/463
[SPARK-1560]: Updated Pyrolite Dependency to be Java 6 compatible
Ahir Reddy <ahirreddy@gmail.com>
2014-04-22 09:44:41 -0700
Commit: 0f87e6a, github.com/apache/spark/pull/479
[HOTFIX] SPARK-1399: remove outdated comments
CodingCat <zhunansjtu@gmail.com>
2014-04-22 09:43:13 -0700
Commit: 87de290, github.com/apache/spark/pull/474
SPARK-1496: Have jarOfClass return Option[String]
Patrick Wendell <pwendell@gmail.com>
2014-04-22 00:42:16 -0700
Commit: 83084d3, github.com/apache/spark/pull/438
[SPARK-1459] Use local path (and not complete URL) when opening local lo...
Marcelo Vanzin <vanzin@cloudera.com>
2014-04-21 23:10:53 -0700
Commit: ac164b7, github.com/apache/spark/pull/375
[Fix #274] Document + fix annotation usages
Andrew Or <andrewor14@gmail.com>
2014-04-21 22:24:44 -0700
Commit: b3e5366, github.com/apache/spark/pull/470
[SPARK-1439, SPARK-1440] Generate unified Scaladoc across projects and Javadocs
Matei Zaharia <matei@databricks.com>
2014-04-21 21:57:40 -0700
Commit: fc78384, github.com/apache/spark/pull/457
[SPARK-1332] Improve Spark Streaming's Network Receiver and InputDStream API [WIP]
Tathagata Das <tathagata.das1565@gmail.com>
2014-04-21 19:04:49 -0700
Commit: 04c37b6, github.com/apache/spark/pull/300
Dev script: include RC name in git tag
Patrick Wendell <pwendell@gmail.com>
2014-04-21 14:21:17 -0700
Commit: 5a5b334
SPARK-1399: show stage failure reason in UI
CodingCat <zhunansjtu@gmail.com>, Nan Zhu <CodingCat@users.noreply.github.com>
2014-04-21 14:10:23 -0700
Commit: 43e4a29, github.com/apache/spark/pull/421
SPARK-1539: RDDPage.scala contains RddPage class
Xiangrui Meng <meng@databricks.com>
2014-04-21 12:48:02 -0700
Commit: b7df31e, github.com/apache/spark/pull/454
[Hot Fix] Ignore org.apache.spark.ui.UISuite tests
Andrew Or <andrewor14@gmail.com>
2014-04-21 12:37:43 -0700
Commit: af46f1f, github.com/apache/spark/pull/466
Clean up and simplify Spark configuration
Patrick Wendell <pwendell@gmail.com>
2014-04-21 10:26:33 -0700
Commit: fb98488, github.com/apache/spark/pull/299
REPL cleanup.
Michael Armbrust <michael@databricks.com>
2014-04-19 17:32:24 -0700
Commit: 3a390bf, github.com/apache/spark/pull/451
[SPARK-1535] ALS: Avoid the garbage-creating ctor of DoubleMatrix
Tor Myklebust <tmyklebu@gmail.com>
2014-04-19 15:10:18 -0700
Commit: 25fc318, github.com/apache/spark/pull/442
Add insertInto and saveAsTable to Python API.
Michael Armbrust <michael@databricks.com>
2014-04-19 15:08:54 -0700
Commit: 10d0421, github.com/apache/spark/pull/447
Use scala deprecation instead of java.
Michael Armbrust <michael@databricks.com>
2014-04-19 15:06:04 -0700
Commit: 5d0f58b, github.com/apache/spark/pull/452
README update
Reynold Xin <rxin@apache.org>
2014-04-18 22:34:39 -0700
Commit: 28238c8, github.com/apache/spark/pull/443
SPARK-1482: Fix potential resource leaks in saveAsHadoopDataset and save...
zsxwing <zsxwing@gmail.com>
2014-04-18 17:49:22 -0700
Commit: 2089e0e, github.com/apache/spark/pull/400
SPARK-1456 Remove view bounds on Ordered in favor of a context bound on Ordering.
Michael Armbrust <michael@databricks.com>
2014-04-18 12:04:13 -0700
Commit: c399baa, github.com/apache/spark/pull/410
Fixed broken pyspark shell.
Reynold Xin <rxin@apache.org>
2014-04-18 10:10:13 -0700
Commit: 81a152c, github.com/apache/spark/pull/444
SPARK-1523: improve the readability of code in AkkaUtil
CodingCat <zhunansjtu@gmail.com>
2014-04-18 10:05:00 -0700
Commit: 3c7a9ba, github.com/apache/spark/pull/434
SPARK-1357 (addendum). More Experimental items in MLlib
Sean Owen <sowen@cloudera.com>
2014-04-18 10:04:02 -0700
Commit: 8aa1f4c, github.com/apache/spark/pull/372
[SPARK-1520] remove fastutil from dependencies
Xiangrui Meng <meng@databricks.com>
2014-04-18 10:03:15 -0700
Commit: aa17f02, github.com/apache/spark/pull/437
Reuses Row object in ExistingRdd.productToRowRdd()
Cheng Lian <lian.cs.zju@gmail.com>
2014-04-18 10:02:27 -0700
Commit: 89f4743, github.com/apache/spark/pull/432
SPARK-1483: Rename minSplits to minPartitions in public APIs
CodingCat <zhunansjtu@gmail.com>
2014-04-18 10:01:16 -0700
Commit: e31c8ff, github.com/apache/spark/pull/430
HOTFIX: Ignore streaming UI test
Patrick Wendell <pwendell@gmail.com>
2014-04-17 17:33:24 -0700
Commit: 7863ecc, github.com/apache/spark/pull/440
FIX: Don't build Hive in assembly unless running Hive tests.
Patrick Wendell <pwendell@gmail.com>
2014-04-17 17:24:00 -0700
Commit: 6c746ba, github.com/apache/spark/pull/439
SPARK-1408 Modify Spark on Yarn to point to the history server when app ...
Thomas Graves <tgraves@apache.org>
2014-04-17 16:36:37 -0500
Commit: 0058b5d, github.com/apache/spark/pull/362
[SPARK-1395] Allow "local:" URIs to work on Yarn.
Marcelo Vanzin <vanzin@cloudera.com>
2014-04-17 10:29:38 -0500
Commit: 6904750, github.com/apache/spark/pull/303
[python alternative] pyspark require Python2, failing if system default is Py3 from shell.py
AbhishekKr <abhikumar163@gmail.com>
2014-04-16 19:05:40 -0700
Commit: bb76eae, github.com/apache/spark/pull/399
SPARK-1462: Examples of ML algorithms are using deprecated APIs
Sandeep <sandeep@techaddict.me>
2014-04-16 18:23:07 -0700
Commit: 6ad4c54, github.com/apache/spark/pull/416
Include stack trace for exceptions thrown by user code.
Michael Armbrust <michael@databricks.com>
2014-04-16 18:12:56 -0700
Commit: d4916a8, github.com/apache/spark/pull/409
Update ReducedWindowedDStream.scala
baishuo(白硕) <vc_java@hotmail.com>
2014-04-16 18:08:11 -0700
Commit: 07b7ad3, github.com/apache/spark/pull/425
misleading task number of groupByKey
Chen Chao <crazyjvm@gmail.com>
2014-04-16 17:58:42 -0700
Commit: 9c40b9e, github.com/apache/spark/pull/403
Fixing a race condition in event listener unit test
Kan Zhang <kzhang@apache.org>
2014-04-16 17:39:11 -0700
Commit: 38877cc, github.com/apache/spark/pull/401
remove unnecessary brace and semicolon in 'putBlockInfo.synchronize' block
Chen Chao <crazyjvm@gmail.com>
2014-04-16 17:30:01 -0700
Commit: 016a877, github.com/apache/spark/pull/411
SPARK-1329: Create pid2vid with correct number of partitions
Ankur Dave <ankurdave@gmail.com>
2014-04-16 17:16:55 -0700
Commit: 17d3234, github.com/apache/spark/pull/368
Rebuild routing table after Graph.reverse
Ankur Dave <ankurdave@gmail.com>
2014-04-16 17:15:50 -0700
Commit: 235a47c, github.com/apache/spark/pull/431
Add clean to build
Patrick Wendell <pwendell@gmail.com>
2014-04-16 16:32:34 -0700
Commit: 987760e
[SPARK-1511] use Files.move instead of renameTo in TestUtils.scala
Ye Xianjin <advancedxy@gmail.com>
2014-04-16 14:56:22 -0700
Commit: 10b1c59, github.com/apache/spark/pull/427
SPARK-1465: Spark compilation is broken with the latest hadoop-2.4.0 release
xuan <xuan@MacBook-Pro.local>, xuan <xuan@macbook-pro.home>
2014-04-16 14:41:22 -0500
Commit: 725925c, github.com/apache/spark/pull/396
SPARK-1469: Scheduler mode should accept lower-case definitions and have...
Sandeep <sandeep@techaddict.me>
2014-04-16 09:58:57 -0700
Commit: e269c24, github.com/apache/spark/pull/388
Minor addition to SPARK-1497
Patrick Wendell <pwendell@gmail.com>
2014-04-16 09:43:17 -0700
Commit: 82349fb
SPARK-1497. Fix scalastyle warnings in YARN, Hive code
Sean Owen <sowen@cloudera.com>
2014-04-16 09:34:59 -0700
Commit: 77f8367, github.com/apache/spark/pull/413
SPARK-1310: Start adding k-fold cross validation to MLLib [adds kFold to MLUtils & fixes bug in BernoulliSampler]
Holden Karau <holden@pigscanfly.ca>
2014-04-16 09:33:27 -0700
Commit: c3527a3, github.com/apache/spark/pull/18
update spark.default.parallelism
Chen Chao <crazyjvm@gmail.com>
2014-04-16 09:14:18 -0700
Commit: 9edd887, github.com/apache/spark/pull/389
Loads test tables when running "sbt hive/console" without HIVE_DEV_HOME
Cheng Lian <lian.cs.zju@gmail.com>
2014-04-16 08:54:34 -0700
Commit: fec462c, github.com/apache/spark/pull/417
Make "spark logo" link refer to "/".
Marcelo Vanzin <vanzin@cloudera.com>
2014-04-16 08:53:01 -0700
Commit: c0273d8, github.com/apache/spark/pull/408
[SPARK-959] Updated SBT from 0.13.1 to 0.13.2
Cheng Lian <lian.cs.zju@gmail.com>
2014-04-16 08:52:14 -0700
Commit: 6a10d80, github.com/apache/spark/pull/426
[SQL] SPARK-1424 Generalize insertIntoTable functions on SchemaRDDs
Michael Armbrust <michael@databricks.com>
2014-04-15 20:40:40 -0700
Commit: 273c2fd, github.com/apache/spark/pull/354
[WIP] SPARK-1430: Support sparse data in Python MLlib
Matei Zaharia <matei@databricks.com>
2014-04-15 20:33:24 -0700
Commit: 63ca581, github.com/apache/spark/pull/341
[FIX] update sbt-idea to version 1.6.0
Xiangrui Meng <meng@databricks.com>
2014-04-15 19:37:32 -0700
Commit: 8517911, github.com/apache/spark/pull/419
SPARK-1455: Better isolation for unit tests.
Patrick Wendell <pwendell@gmail.com>
2014-04-15 19:34:39 -0700
Commit: 5aaf983, github.com/apache/spark/pull/420
Decision Tree documentation for MLlib programming guide
Manish Amde <manish9ue@gmail.com>
2014-04-15 11:14:28 -0700
Commit: 07d72fe, github.com/apache/spark/pull/402
[SPARK-1157][MLlib] L-BFGS Optimizer based on Breeze's implementation.
DB Tsai <dbtsai@alpinenow.com>
2014-04-15 11:12:47 -0700
Commit: 6843d63, github.com/apache/spark/pull/353
SPARK-1501: Ensure assertions in Graph.apply are asserted.
William Benton <willb@redhat.com>
2014-04-15 10:38:42 -0700
Commit: 2580a3b, github.com/apache/spark/pull/415
SPARK-1426: Make MLlib work with NumPy versions older than 1.7
Sandeep <sandeep@techaddict.me>
2014-04-15 00:19:43 -0700
Commit: df36091, github.com/apache/spark/pull/391
SPARK-1374: PySpark API for SparkSQL
Ahir Reddy <ahirreddy@gmail.com>, Michael Armbrust <michael@databricks.com>
2014-04-15 00:07:55 -0700
Commit: c99bcb7f, github.com/apache/spark/pull/363
SPARK-1488. Resolve scalac feature warnings during build
Sean Owen <sowen@cloudera.com>
2014-04-14 19:50:00 -0700
Commit: 0247b5c, github.com/apache/spark/pull/404
HOTFIX: Use file name and not paths for excludes
Patrick Wendell <pwendell@gmail.com>
2014-04-14 15:51:54 -0700
Commit: 268b535
[BUGFIX] In-memory columnar storage bug fixes
Cheng Lian <lian.cs.zju@gmail.com>, Michael Armbrust <michael@databricks.com>
2014-04-14 15:22:43 -0700
Commit: 7dbca68, github.com/apache/spark/pull/374
[SPARK-1415] Hadoop min split for wholeTextFiles()
Xusen Yin <yinxusen@gmail.com>
2014-04-13 13:18:52 -0700
Commit: 037fe4d, github.com/apache/spark/pull/376
SPARK-1480: Clean up use of classloaders
Patrick Wendell <pwendell@gmail.com>
2014-04-13 08:58:37 -0700
Commit: 4bc07ee, github.com/apache/spark/pull/398
[SPARK-1403] Move the class loader creation back to where it was in 0.9.0
Bharath Bhushan <manku.timma@outlook.com>
2014-04-12 20:52:29 -0700
Commit: ca11919, github.com/apache/spark/pull/322
[Fix #204] Update out-dated comments
Andrew Or <andrewor14@gmail.com>
2014-04-12 16:33:38 -0700
Commit: c2d160f, github.com/apache/spark/pull/381
[SPARK-1386] Web UI for Spark Streaming
Tathagata Das <tathagata.das1565@gmail.com>, Andrew Or <andrewor14@gmail.com>
2014-04-11 23:33:49 -0700
Commit: 6aa08c3, github.com/apache/spark/pull/290
SPARK-1057 (alternative) Remove fastutil
Sean Owen <sowen@cloudera.com>
2014-04-11 22:46:47 -0700
Commit: 165e06a, github.com/apache/spark/pull/266
Update WindowedDStream.scala
baishuo(白硕) <vc_java@hotmail.com>
2014-04-11 20:33:42 -0700
Commit: aa8bb11, github.com/apache/spark/pull/390
[WIP] [SPARK-1328] Add vector statistics
Xusen Yin <yinxusen@gmail.com>, Xiangrui Meng <meng@databricks.com>
2014-04-11 19:43:22 -0700
Commit: fdfb45e, github.com/apache/spark/pull/268
[FIX] make coalesce test deterministic in RDDSuite
Xiangrui Meng <meng@databricks.com>
2014-04-11 19:41:40 -0700
Commit: 7038b00, github.com/apache/spark/pull/387
HOTFIX: Ignore python metastore files in RAT checks.
Patrick Wendell <pwendell@gmail.com>
2014-04-11 13:23:21 -0700
Commit: 6a0f8e3, github.com/apache/spark/pull/393
[SPARK-1225, 1241] [MLLIB] Add AreaUnderCurve and BinaryClassificationMetrics
Xiangrui Meng <meng@databricks.com>
2014-04-11 12:06:13 -0700
Commit: f5ace8d, github.com/apache/spark/pull/364
Some clean up in build/docs
Patrick Wendell <pwendell@gmail.com>
2014-04-11 10:45:27 -0700
Commit: 98225a6, github.com/apache/spark/pull/382
SPARK-1417: Spark on Yarn - spark UI link from resourcemanager is broken
Thomas Graves <tgraves@apache.org>
2014-04-11 13:17:48 +0530
Commit: 446bb34, github.com/apache/spark/pull/344
SPARK-1202: Improvements to task killing in the UI.
Patrick Wendell <pwendell@gmail.com>
2014-04-10 20:43:56 -0700
Commit: 44f654e, github.com/apache/spark/pull/386
Add Spark v0.9.1 to ec2 launch script and use it as the default
Harvey Feng <hyfeng224@gmail.com>
2014-04-10 18:25:54 -0700
Commit: 7b4203a, github.com/apache/spark/pull/385
Set spark.executor.uri from environment variable (needed by Mesos)
Ivan Wick <ivanwick+github@gmail.com>
2014-04-10 17:49:30 -0700
Commit: 5cd11d5, github.com/apache/spark/pull/311
SPARK-1202 - Add a "cancel" button in the UI for stages
Sundeep Narravula <sundeepn@superduel.local>, Sundeep Narravula <sundeepn@dhcpx-204-110.corp.yahoo.com>
2014-04-10 17:10:11 -0700
Commit: 2c55783, github.com/apache/spark/pull/246
[SQL] Improve column pruning in the optimizer.
Michael Armbrust <michael@databricks.com>
2014-04-10 16:20:33 -0700
Commit: f99401a, github.com/apache/spark/pull/378
Remove Unnecessary Whitespace's
Sandeep <sandeep@techaddict.me>
2014-04-10 15:04:13 -0700
Commit: 930b70f, github.com/apache/spark/pull/380
Update tuning.md
Andrew Ash <andrew@andrewash.com>
2014-04-10 14:59:58 -0700
Commit: f046662, github.com/apache/spark/pull/384
Revert "SPARK-1433: Upgrade Mesos dependency to 0.17.0"
Patrick Wendell <pwendell@gmail.com>
2014-04-10 14:43:29 -0700
Commit: 7b52b66
SPARK-1428: MLlib should convert non-float64 NumPy arrays to float64 instead of complaining
Sandeep <sandeep@techaddict.me>
2014-04-10 11:17:41 -0700
Commit: 3bd3129, github.com/apache/spark/pull/356
[SPARK-1276] Add a HistoryServer to render persisted UI
Andrew Or <andrewor14@gmail.com>
2014-04-10 10:39:34 -0700
Commit: 79820fe, github.com/apache/spark/pull/204
Release 1.0.0
HOTFIX: Add no-arg SparkContext constructor in Java
Patrick Wendell <pwendell@gmail.com>
2014-05-25 20:13:32 -0700
Commit: 18c77cb, github.com/apache/spark/pull/878
[SQL] Minor: Introduce SchemaRDD#aggregate() for simple aggregations
Aaron Davidson <aaron@databricks.com>
2014-05-25 18:37:44 -0700
Commit: a3976a2, github.com/apache/spark/pull/874
SPARK-1903 Document Spark's network connections
Andrew Ash <andrew@andrewash.com>
2014-05-25 17:15:47 -0700
Commit: 5107a6f, github.com/apache/spark/pull/856
Fix PEP8 violations in Python mllib.
Reynold Xin <rxin@apache.org>
2014-05-25 17:15:01 -0700
Commit: 07f34ca, github.com/apache/spark/pull/871
Python docstring update for sql.py.
Reynold Xin <rxin@apache.org>
2014-05-25 16:04:17 -0700
Commit: 8891495, github.com/apache/spark/pull/869
Fix PEP8 violations in examples/src/main/python.
Reynold Xin <rxin@apache.org>
2014-05-25 14:48:27 -0700
Commit: 3368397, github.com/apache/spark/pull/870
Added license header for tox.ini.
Reynold Xin <rxin@apache.org>
2014-05-25 01:47:08 -0700
Commit: 7273bfc
SPARK-1822: Some minor cleanup work on SchemaRDD.count()
Reynold Xin <rxin@apache.org>
2014-05-25 01:44:49 -0700
Commit: aeffc20, github.com/apache/spark/pull/868
Added PEP8 style configuration file.
Reynold Xin <rxin@apache.org>
2014-05-25 01:32:15 -0700
Commit: 291567d, github.com/apache/spark/pull/872
[SPARK-1822] SchemaRDD.count() should use query optimizer
Kan Zhang <kzhang@apache.org>
2014-05-25 00:06:42 -0700
Commit: 64d0fb5, github.com/apache/spark/pull/841
spark-submit: add exec at the end of the script
Colin Patrick Mccabe <cmccabe@cloudera.com>
2014-05-24 22:39:27 -0700
Commit: 7e59335, github.com/apache/spark/pull/858
[SPARK-1886] check executor id existence when executor exit
Zhen Peng <zhenpeng01@baidu.com>
2014-05-24 20:40:19 -0700
Commit: b5e9686, github.com/apache/spark/pull/827
Updated CHANGES.txt
Tathagata Das <tathagata.das1565@gmail.com>
2014-05-25 02:20:13 +0000
Commit: 8406092
SPARK-1911: Emphasize that Spark jars should be built with Java 6.
Patrick Wendell <pwendell@gmail.com>
2014-05-24 18:27:00 -0700
Commit: 217bd56, github.com/apache/spark/pull/859
[SPARK-1900 / 1918] PySpark on YARN is broken
Andrew Or <andrewor14@gmail.com>
2014-05-24 18:01:49 -0700
Commit: 12f5ecc, github.com/apache/spark/pull/853
Update LBFGSSuite.scala
baishuo(白硕) <vc_java@hotmail.com>
2014-05-23 13:02:40 -0700
Commit: 9be103a, github.com/apache/spark/pull/815
Updated scripts for auditing releases
Tathagata Das <tathagata.das1565@gmail.com>
2014-05-22 20:48:55 -0700
Commit: 6541ca2, github.com/apache/spark/pull/844
[SPARK-1896] Respect spark.master (and --master) before MASTER in spark-shell
Andrew Or <andrewor14@gmail.com>
2014-05-22 20:32:27 -0700
Commit: c3b4065, github.com/apache/spark/pull/846
[SPARK-1897] Respect spark.jars (and --jars) in spark-shell
Andrew Or <andrewor14@gmail.com>
2014-05-22 20:25:41 -0700
Commit: 23cc40e, github.com/apache/spark/pull/849
Fix UISuite unit test that fails under Jenkins contention
Aaron Davidson <aaron@databricks.com>
2014-05-22 15:11:05 -0700
Commit: a566216, github.com/apache/spark/pull/857
[SPARK-1870] Make spark-submit --jars work in yarn-cluster mode.
Xiangrui Meng <meng@databricks.com>
2014-05-22 01:52:50 -0700
Commit: 79cd26c, github.com/apache/spark/pull/848
Configuration documentation updates
Reynold Xin <rxin@apache.org>
2014-05-21 18:49:12 -0700
Commit: 75af8bd, github.com/apache/spark/pull/851
[SPARK-1889] [SQL] Apply splitConjunctivePredicates to join condition while finding join ke...
Takuya UESHIN <ueshin@happy-camper.st>
2014-05-21 15:37:47 -0700
Commit: 6e7934e, github.com/apache/spark/pull/836
[SPARK-1519] Support minPartitions param of wholeTextFiles() in PySpark
Kan Zhang <kzhang@apache.org>
2014-05-21 13:26:53 -0700
Commit: 30d1df5, github.com/apache/spark/pull/697
[Typo] Stoped -> Stopped
Andrew Or <andrewor14@gmail.com>
2014-05-21 11:59:05 -0700
Commit: 9b8f772, github.com/apache/spark/pull/847
[Minor] Move JdbcRDDSuite to the correct package
Andrew Or <andrewor14@gmail.com>
2014-05-21 01:25:10 -0700
Commit: bc6bbfa, github.com/apache/spark/pull/839
[Docs] Correct example of creating a new SparkConf
Andrew Or <andrewor14@gmail.com>
2014-05-21 01:23:34 -0700
Commit: 7295dd9, github.com/apache/spark/pull/842
[SPARK-1250] Fixed misleading comments in bin/pyspark, bin/spark-class
Sumedh Mungee <smungee@gmail.com>
2014-05-21 01:22:25 -0700
Commit: 364c14a, github.com/apache/spark/pull/843
[Hotfix] Blacklisted flaky HiveCompatibility test
Tathagata Das <tathagata.das1565@gmail.com>
2014-05-20 10:27:12 -0700
Commit: b4d93d3, github.com/apache/spark/pull/838
Updated CHANGES.txt
Tathagata Das <tathagata.das1565@gmail.com>
2014-05-19 23:12:24 -0700
Commit: 1c00f2a
[Spark 1877] ClassNotFoundException when loading RDD with serialized objects
Tathagata Das <tathagata.das1565@gmail.com>, Ghidireac <bogdang@u448a5b0a73d45358d94a.ant.amazon.com>
2014-05-19 22:36:24 -0700
Commit: 6cbe2a3, github.com/apache/spark/pull/835
[SPARK-1874][MLLIB] Clean up MLlib sample data
Xiangrui Meng <meng@databricks.com>
2014-05-19 21:29:33 -0700
Commit: 1c6c8b5, github.com/apache/spark/pull/833
SPARK-1689: Spark application should die when removed by Master
Aaron Davidson <aaron@databricks.com>
2014-05-19 20:55:26 -0700
Commit: 78b6e6f, github.com/apache/spark/pull/832
[SPARK-1875]NoClassDefFoundError: StringUtils when building with hadoop 1.x and hive
witgo <witgo@qq.com>
2014-05-19 19:40:29 -0700
Commit: 875c54f, github.com/apache/spark/pull/824
SPARK-1879. Increase MaxPermSize since some of our builds have many classes
Matei Zaharia <matei@databricks.com>
2014-05-19 18:42:28 -0700
Commit: 00563e1, github.com/apache/spark/pull/823
SPARK-1878: Fix the incorrect initialization order
zsxwing <zsxwing@gmail.com>
2014-05-19 16:41:31 -0700
Commit: 901102c, github.com/apache/spark/pull/822
[SPARK-1876] Windows fixes to deal with latest distribution layout changes
Matei Zaharia <matei@databricks.com>
2014-05-19 15:02:35 -0700
Commit: 111c121, github.com/apache/spark/pull/819
[WIP][SPARK-1871][MLLIB] Improve MLlib guide for v1.0
Xiangrui Meng <meng@databricks.com>
2014-05-18 17:00:57 -0700
Commit: ecab8a2, github.com/apache/spark/pull/816
SPARK-1873: Add README.md file when making distributions
Patrick Wendell <pwendell@gmail.com>
2014-05-18 16:51:53 -0700
Commit: 8e8b351, github.com/apache/spark/pull/818
Fix spark-submit path in spark-shell & pyspark
Neville Li <neville@spotify.com>
2014-05-18 13:31:23 -0700
Commit: e06e4b0, github.com/apache/spark/pull/812
Make deprecation warning less severe
Patrick Wendell <pwendell@gmail.com>
2014-05-16 22:58:47 -0700
Commit: 3b3d7c8, github.com/apache/spark/pull/810
[SPARK-1824] Remove <master> from Python examples
Andrew Or <andrewor14@gmail.com>
2014-05-16 22:36:23 -0700
Commit: 03b4242, github.com/apache/spark/pull/802
[SPARK-1808] Route bin/pyspark through Spark submit
Andrew Or <andrewor14@gmail.com>
2014-05-16 22:34:38 -0700
Commit: 318739a, github.com/apache/spark/pull/799
Version bump of spark-ec2 scripts
Patrick Wendell <pwendell@gmail.com>
2014-05-16 21:42:14 -0700
Commit: 9cd12f3, github.com/apache/spark/pull/809
SPARK-1864 Look in spark conf instead of system properties when propagating configuration to executors.
Michael Armbrust <michael@databricks.com>
2014-05-16 20:25:10 -0700
Commit: a16a19f, github.com/apache/spark/pull/808
Tweaks to Mesos docs
Matei Zaharia <matei@databricks.com>
2014-05-16 17:35:05 -0700
Commit: 2ba6711, github.com/apache/spark/pull/806
[SQL] Implement between in hql
Michael Armbrust <michael@databricks.com>
2014-05-16 11:47:00 -0700
Commit: 386b31c, github.com/apache/spark/pull/804
bugfix: overflow of graphx Edge compare function
Zhen Peng <zhenpeng01@baidu.com>
2014-05-16 11:37:18 -0700
Commit: ff47cdc, github.com/apache/spark/pull/769
SPARK-1862: Support for MapR in the Maven build.
Patrick Wendell <pwendell@gmail.com>
2014-05-15 23:31:43 -0700
Commit: eec4dd8, github.com/apache/spark/pull/803
[Spark-1461] Deferred Expression Evaluation (short-circuit evaluation)
Cheng Hao <hao.cheng@intel.com>
2014-05-15 22:12:34 -0700
Commit: eac4ee8, github.com/apache/spark/pull/446
SPARK-1860: Do not cleanup application work/ directories by default
Aaron Davidson <aaron@databricks.com>
2014-05-15 21:37:58 -0700
Commit: 5441471, github.com/apache/spark/pull/800
Typos in Spark
Huajian Mao <huajianmao@gmail.com>
2014-05-15 18:20:16 -0700
Commit: a2742d8, github.com/apache/spark/pull/798
Fixes a misplaced comment.
Prashant Sharma <prashant.s@imaginea.com>
2014-05-15 16:58:37 -0700
Commit: 2e418f5, github.com/apache/spark/pull/788
[SQL] Fix tiny/small ints from HiveMetastore.
Michael Armbrust <michael@databricks.com>
2014-05-15 16:50:42 -0700
Commit: ffa9c49, github.com/apache/spark/pull/797
SPARK-1803 Replaced colon in filenames with a dash
Stevo Slavić <sslavic@gmail.com>, Stevo Slavic <sslavic@gmail.com>
2014-05-15 16:44:14 -0700
Commit: 22f261a, github.com/apache/spark/pull/739
SPARK-1851. Upgrade Avro dependency to 1.7.6 so Spark can read Avro file...
Sandy Ryza <sandy@cloudera.com>
2014-05-15 16:35:39 -0700
Commit: 3587057, github.com/apache/spark/pull/795
[SPARK-1741][MLLIB] add predict(JavaRDD) to RegressionModel, ClassificationModel, and KMeans
Xiangrui Meng <meng@databricks.com>
2014-05-15 11:59:59 -0700
Commit: bc9a96e, github.com/apache/spark/pull/670
[SPARK-1819] [SQL] Fix GetField.nullable.
Takuya UESHIN <ueshin@happy-camper.st>
2014-05-15 11:21:33 -0700
Commit: f9eeddc, github.com/apache/spark/pull/757
[SPARK-1845] [SQL] Use AllScalaRegistrar for SparkSqlSerializer to register serializers of ...
Takuya UESHIN <ueshin@happy-camper.st>
2014-05-15 11:20:21 -0700
Commit: 7515367, github.com/apache/spark/pull/790
SPARK-1846 Ignore logs directory in RAT checks
Andrew Ash <andrew@andrewash.com>
2014-05-15 11:05:39 -0700
Commit: aa5f989, github.com/apache/spark/pull/793
HOTFIX: Don't build Javadoc in Maven when creating releases.
Patrick Wendell <pwendell@gmail.com>
2014-05-14 23:48:03 -0700
Commit: 88f1da3
fix different versions of commons-lang dependency and apache/spark#746 addendum
witgo <witgo@qq.com>
2014-05-14 22:26:26 -0700
Commit: 31b853c, github.com/apache/spark/pull/754
Package docs
Prashant Sharma <prashant.s@imaginea.com>, Patrick Wendell <pwendell@gmail.com>
2014-05-14 22:24:41 -0700
Commit: c02d614, github.com/apache/spark/pull/785
Documentation: Encourage use of reduceByKey instead of groupByKey.
Patrick Wendell <pwendell@gmail.com>
2014-05-14 22:24:04 -0700
Commit: f2faa37, github.com/apache/spark/pull/784
Add language tabs and Python version to interactive part of quick-start
Matei Zaharia <matei@databricks.com>
2014-05-14 21:45:20 -0700
Commit: 976784b, github.com/apache/spark/pull/782
[SPARK-1840] SparkListenerBus prints out scary error message when terminated normally
Tathagata Das <tathagata.das1565@gmail.com>
2014-05-14 21:13:41 -0700
Commit: ba87123, github.com/apache/spark/pull/783
default task number misleading in several places
Chen Chao <crazyjvm@gmail.com>
2014-05-14 18:20:20 -0700
Commit: 9f0f2ec, github.com/apache/spark/pull/766
[SPARK-1826] fix the head notation of package object dsl
wangfei <scnbwf@yeah.net>
2014-05-14 17:59:11 -0700
Commit: fdf9717, github.com/apache/spark/pull/765
[Typo] propertes -> properties
andrewor14 <andrewor14@gmail.com>
2014-05-14 17:54:53 -0700
Commit: 5ca3096, github.com/apache/spark/pull/780
[SPARK-1696][MLLIB] use alpha in dense dspr
Xiangrui Meng <meng@databricks.com>
2014-05-14 17:18:30 -0700
Commit: d6f1a75, github.com/apache/spark/pull/778
[FIX] do not load defaults when testing SparkConf in pyspark
Xiangrui Meng <meng@databricks.com>
2014-05-14 14:57:17 -0700
Commit: 31faec7, github.com/apache/spark/pull/775
SPARK-1833 - Have an empty SparkContext constructor.
Patrick Wendell <pwendell@gmail.com>
2014-05-14 12:53:30 -0700
Commit: 8e13ab2, github.com/apache/spark/pull/774
SPARK-1829 Sub-second durations shouldn't round to "0 s"
Andrew Ash <andrew@andrewash.com>
2014-05-14 12:01:14 -0700
Commit: 530bdf7, github.com/apache/spark/pull/768
Fix: sbt test throw an java.lang.OutOfMemoryError: PermGen space
witgo <witgo@qq.com>
2014-05-14 11:19:26 -0700
Commit: 379f733, github.com/apache/spark/pull/773
Adding back hive support
Patrick Wendell <pwendell@gmail.com>
2014-05-14 10:21:27 -0700
Commit: e8ca397
[SPARK-1620] Handle uncaught exceptions in function run by Akka scheduler
Mark Hamstra <markhamstra@gmail.com>
2014-05-14 10:07:25 -0700
Commit: 9ff9078, github.com/apache/spark/pull/622
SPARK-1828: Created forked version of hive-exec that doesn't bundle other dependencies
Patrick Wendell <pwendell@gmail.com>
2014-05-14 09:51:01 -0700
Commit: 34f6fa9, github.com/apache/spark/pull/767
SPARK-1818 Freshen Mesos documentation
Andrew Ash <andrew@andrewash.com>
2014-05-14 09:45:33 -0700
Commit: fc6b652, github.com/apache/spark/pull/756
SPARK-1827. LICENSE and NOTICE files need a refresh to contain transitive dependency info
Sean Owen <sowen@cloudera.com>
2014-05-14 09:38:33 -0700
Commit: 7083282, github.com/apache/spark/pull/770
Fixed streaming examples docs to use run-example instead of spark-submit
Tathagata Das <tathagata.das1565@gmail.com>
2014-05-14 04:17:32 -0700
Commit: c7571d8, github.com/apache/spark/pull/722
[SPARK-1769] Executor loss causes NPE race condition
Andrew Or <andrewor14@gmail.com>
2014-05-14 00:54:33 -0700
Commit: 69ec314, github.com/apache/spark/pull/762
Fix dep exclusion: avro-ipc, not avro, depends on netty.
Marcelo Vanzin <vanzin@cloudera.com>
2014-05-14 00:37:57 -0700
Commit: b3d9878, github.com/apache/spark/pull/763
SPARK-1801. expose InterruptibleIterator and TaskKilledException in deve...
Koert Kuipers <koert@tresata.com>
2014-05-14 00:10:12 -0700
Commit: 7da80a3, github.com/apache/spark/pull/764
[SQL] Improve column pruning.
Michael Armbrust <michael@databricks.com>
2014-05-13 23:27:22 -0700
Commit: f66f766, github.com/apache/spark/pull/729
Revert "[SPARK-1784] Add a new partitioner to allow specifying # of keys per partition"
Patrick Wendell <pwendell@gmail.com>
2014-05-13 23:25:19 -0700
Commit: 721194b
Implement ApproximateCountDistinct for SparkSql
larvaboy <larvaboy@gmail.com>
2014-05-13 21:26:08 -0700
Commit: 92b0ec9, github.com/apache/spark/pull/737
[SPARK-1784] Add a new partitioner to allow specifying # of keys per partition
Syed Hashmi <shashmi@cloudera.com>
2014-05-13 21:24:23 -0700
Commit: 66fe479, github.com/apache/spark/pull/721
[SQL] Make it possible to create Java/Python SQLContexts from an existing Scala SQLContext.
Michael Armbrust <michael@databricks.com>
2014-05-13 21:23:51 -0700
Commit: 618b3e6, github.com/apache/spark/pull/761
[SPARK-1527] change rootDir*.getName to rootDir*.getAbsolutePath
Ye Xianjin <advancedxy@gmail.com>
2014-05-13 19:03:51 -0700
Commit: ef5e9d7, github.com/apache/spark/pull/436
[SPARK-1816] LiveListenerBus dies if a listener throws an exception
Andrew Or <andrewor14@gmail.com>
2014-05-13 18:32:32 -0700
Commit: 3892ec5, github.com/apache/spark/pull/759
SPARK-1791 - SVM implementation does not use threshold parameter
Andrew Tulloch <andrew@tullo.ch>
2014-05-13 17:31:27 -0700
Commit: d6994f4, github.com/apache/spark/pull/725
BUILD: Add more content to make-distribution.sh.
Patrick Wendell <pwendell@gmail.com>
2014-05-12 23:02:54 -0700
Commit: 716462c
Adding CHANGES.txt file and removing YARN support for now
Patrick Wendell <pwendell@gmail.com>
2014-05-12 20:21:23 -0700
Commit: fa2d4d8
SPARK-1815. SparkContext should not be marked DeveloperApi
Sandy Ryza <sandy@cloudera.com>
2014-05-12 20:08:30 -0700
Commit: 31d54c0, github.com/apache/spark/pull/753
[SPARK-1753 / 1773 / 1814] Update outdated docs for spark-submit, YARN, standalone etc.
Andrew Or <andrewor14@gmail.com>
2014-05-12 19:44:14 -0700
Commit: b9e41f4, github.com/apache/spark/pull/701
[SPARK-1780] Non-existent SPARK_DAEMON_OPTS is lurking around
Andrew Or <andrewor14@gmail.com>
2014-05-12 19:42:35 -0700
Commit: 5ef24a0, github.com/apache/spark/pull/751
SPARK-1757 Failing test for saving null primitives with .saveAsParquetFile()
Andrew Ash <andrew@andrewash.com>, Michael Armbrust <michael@databricks.com>
2014-05-12 19:23:39 -0700
Commit: b52ac0e, github.com/apache/spark/pull/690
Modify a typo in monitoring.md
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-05-12 19:21:06 -0700
Commit: 89b56d7, github.com/apache/spark/pull/698
L-BFGS Documentation
DB Tsai <dbtsai@alpinenow.com>
2014-05-12 19:20:24 -0700
Commit: bad4c9d, github.com/apache/spark/pull/702
Typo: resond -> respond
Andrew Ash <andrew@andrewash.com>
2014-05-12 18:46:28 -0700
Commit: 1fbebca, github.com/apache/spark/pull/743
[SQL] Make Hive Metastore conversion functions publicly visible.
Michael Armbrust <michael@databricks.com>
2014-05-12 18:40:30 -0700
Commit: 24cc933, github.com/apache/spark/pull/750
[SPARK-1736] Spark submit for Windows
Andrew Or <andrewor14@gmail.com>
2014-05-12 17:39:40 -0700
Commit: 59695b3, github.com/apache/spark/pull/745
SPARK-1802. (Addendium) Audit dependency graph when Spark is built with -Pyarn
Sean Owen <sowen@cloudera.com>
2014-05-12 17:35:29 -0700
Commit: 02caa7e, github.com/apache/spark/pull/746
SPARK-1623: Use File objects instead of String's in HTTPBroadcast
Patrick Wendell <pwendell@gmail.com>
2014-05-12 17:27:28 -0700
Commit: c294f37, github.com/apache/spark/pull/749
Rename testExecutorEnvs --> executorEnvs.
Patrick Wendell <pwendell@gmail.com>
2014-05-12 17:09:13 -0700
Commit: e9d602d, github.com/apache/spark/pull/747
Adding hadoop-2.2 profile to the build
Patrick Wendell <pwendell@gmail.com>
2014-05-12 15:40:48 -0700
Commit: b66051e
Rollback versions for 1.0.0-rc4
Patrick Wendell <pwendell@gmail.com>
2014-05-12 15:23:53 -0700
Commit: 51142b7
SPARK-1802. Audit dependency graph when Spark is built with -Phive
Sean Owen <sowen@cloudera.com>
2014-05-12 14:17:25 -0700
Commit: e185281, github.com/apache/spark/pull/744
SPARK-1798. Tests should clean up temp files
Sean Owen <sowen@cloudera.com>
2014-05-12 14:16:19 -0700
Commit: 14515b4, github.com/apache/spark/pull/732
BUILD: Include Hive with default packages when creating a release
Patrick Wendell <pwendell@gmail.com>
2014-05-12 13:20:23 -0700
Commit: 722ecaa
SPARK-1786: Reopening PR 724
Ankur Dave <ankurdave@gmail.com>, Joseph E. Gonzalez <joseph.e.gonzalez@gmail.com>
2014-05-12 13:05:24 -0700
Commit: 642ad49, github.com/apache/spark/pull/742
SPARK-1806: Upgrade Mesos dependency to 0.18.1
Bernardo Gomez Palacio <bernardo.gomezpalacio@gmail.com>
2014-05-12 11:10:28 -0700
Commit: 0be8b45, github.com/apache/spark/pull/741
SPARK-1772 Stop catching Throwable, let Executors die
Aaron Davidson <aaron@databricks.com>
2014-05-12 11:08:52 -0700
Commit: c88adbb, github.com/apache/spark/pull/715
Revert "SPARK-1786: Edge Partition Serialization"
Patrick Wendell <pwendell@gmail.com>
2014-05-12 10:51:01 -0700
Commit: 19ccf20
SPARK-1786: Edge Partition Serialization
Ankur Dave <ankurdave@gmail.com>, Joseph E. Gonzalez <joseph.e.gonzalez@gmail.com>
2014-05-11 19:20:42 -0700
Commit: 09e7aa4, github.com/apache/spark/pull/724
Fix error in 2d Graph Partitioner
Joseph E. Gonzalez <joseph.e.gonzalez@gmail.com>
2014-05-11 18:33:46 -0700
Commit: f84b798, github.com/apache/spark/pull/709
SPARK-1652: Set driver memory correctly in spark-submit.
Patrick Wendell <pwendell@gmail.com>
2014-05-11 18:17:34 -0700
Commit: 2eea663, github.com/apache/spark/pull/730
SPARK-1770: Load balance elements when repartitioning.
Patrick Wendell <pwendell@gmail.com>
2014-05-11 17:11:55 -0700
Commit: 5d69699, github.com/apache/spark/pull/727
remove outdated runtime Information scala home
witgo <witgo@qq.com>
2014-05-11 14:34:27 -0700
Commit: 8097bb2, github.com/apache/spark/pull/728
Revert "Enabled incremental build that comes with sbt 0.13.2"
Patrick Wendell <pwendell@gmail.com>
2014-05-10 21:08:53 -0700
Commit: 758e543
Enabled incremental build that comes with sbt 0.13.2
Prashant Sharma <prashant.s@imaginea.com>
2014-05-10 21:08:04 -0700
Commit: 71ce7eb, github.com/apache/spark/pull/525
[SPARK-1774] Respect SparkSubmit --jars on YARN (client)
Andrew Or <andrewor14@gmail.com>
2014-05-10 20:58:02 -0700
Commit: 012f904, github.com/apache/spark/pull/710
SPARK-1789. Multiple versions of Netty dependencies cause FlumeStreamSuite failure
Sean Owen <sowen@cloudera.com>
2014-05-10 20:50:40 -0700
Commit: c7253da, github.com/apache/spark/pull/723
Unify GraphImpl RDDs + other graph load optimizations
Ankur Dave <ankurdave@gmail.com>
2014-05-10 14:48:07 -0700
Commit: 4e9a0cb, github.com/apache/spark/pull/497
[SPARK-1690] Tolerating empty elements when saving Python RDD to text files
Kan Zhang <kzhang@apache.org>
2014-05-10 14:01:08 -0700
Commit: ac86af8, github.com/apache/spark/pull/644
Add Python includes to path before depickling broadcast values
Bouke van der Bijl <boukevanderbijl@gmail.com>
2014-05-10 13:02:13 -0700
Commit: 2a669a7, github.com/apache/spark/pull/656
fix broken in link in python docs
Andy Konwinski <andykonwinski@gmail.com>
2014-05-10 12:46:51 -0700
Commit: 71ad53f, github.com/apache/spark/pull/650
SPARK-1708. Add a ClassTag on Serializer and things that depend on it
Matei Zaharia <matei@databricks.com>
2014-05-10 12:10:24 -0700
Commit: 9fbb22c, github.com/apache/spark/pull/700
[SPARK-1778] [SQL] Add 'limit' transformation to SchemaRDD.
Takuya UESHIN <ueshin@happy-camper.st>
2014-05-10 12:03:27 -0700
Commit: 7486474, github.com/apache/spark/pull/711
[SQL] Upgrade parquet library.
Michael Armbrust <michael@databricks.com>
2014-05-10 11:48:01 -0700
Commit: a61b71c, github.com/apache/spark/pull/684
[SPARK-1644] The org.datanucleus:* should not be packaged into spark-assembly-*.jar
witgo <witgo@qq.com>
2014-05-10 10:15:04 -0700
Commit: 2a878da, github.com/apache/spark/pull/688
SPARK-1686: keep schedule() calling in the main thread
CodingCat <zhunansjtu@gmail.com>
2014-05-09 21:50:23 -0700
Commit: adf8cdd, github.com/apache/spark/pull/639
SPARK-1770: Revert accidental(?) fix
Aaron Davidson <aaron@databricks.com>
2014-05-09 14:51:34 -0700
Commit: 8202276, github.com/apache/spark/pull/716
[SPARK-1760]: fix building spark with maven documentation
witgo <witgo@qq.com>
2014-05-09 01:51:26 -0700
Commit: 80f292a, github.com/apache/spark/pull/712
Converted bang to ask to avoid scary warning when a block is removed
Tathagata Das <tathagata.das1565@gmail.com>
2014-05-08 22:34:08 -0700
Commit: b8c17e3, github.com/apache/spark/pull/708
MINOR: Removing dead code.
Patrick Wendell <pwendell@gmail.com>
2014-05-08 22:33:06 -0700
Commit: 1d56cd5
SPARK-1775: Unneeded lock in ShuffleMapTask.deserializeInfo
Sandeep <sandeep@techaddict.me>
2014-05-08 22:30:17 -0700
Commit: 5c8e8de, github.com/apache/spark/pull/707
SPARK-1565 (Addendum): Replace `run-example` with `spark-submit`.
Patrick Wendell <pwendell@gmail.com>
2014-05-08 22:26:17 -0700
Commit: f6323eb, github.com/apache/spark/pull/704
[SPARK-1631] Correctly set the Yarn app name when launching the AM.
Marcelo Vanzin <vanzin@cloudera.com>
2014-05-08 20:46:11 -0700
Commit: 7e19334, github.com/apache/spark/pull/539
[SPARK-1755] Respect SparkSubmit --name on YARN
Andrew Or <andrewor14@gmail.com>
2014-05-08 20:45:29 -0700
Commit: 666bebe, github.com/apache/spark/pull/699
Include the sbin/spark-config.sh in spark-executor
Bouke van der Bijl <boukevanderbijl@gmail.com>
2014-05-08 20:43:37 -0700
Commit: ab91227, github.com/apache/spark/pull/651
Bug fix of sparse vector conversion
Funes <tianshaocun@gmail.com>, funes <tianshaocun@gmail.com>
2014-05-08 17:54:10 -0700
Commit: 9ed17ff, github.com/apache/spark/pull/661
[SPARK-1157][MLlib] Bug fix: lossHistory should exclude rejection steps, and remove miniBatch
DB Tsai <dbtsai@alpinenow.com>
2014-05-08 17:53:22 -0700
Commit: 3452997, github.com/apache/spark/pull/582
MLlib documentation fix
DB Tsai <dbtsai@alpinenow.com>
2014-05-08 17:52:32 -0700
Commit: d81d626, github.com/apache/spark/pull/703
Fixing typo in als.py
Evan Sparks <evan.sparks@gmail.com>
2014-05-08 13:07:30 -0700
Commit: 98944a9, github.com/apache/spark/pull/696
[SPARK-1754] [SQL] Add missing arithmetic DSL operations.
Takuya UESHIN <ueshin@happy-camper.st>
2014-05-08 15:31:47 -0700
Commit: 6f701ff, github.com/apache/spark/pull/689
[SPARK-1745] Move interrupted flag from TaskContext constructor (minor)
Andrew Or <andrewor14@gmail.com>
2014-05-08 12:13:07 -0700
Commit: ee63321, github.com/apache/spark/pull/675
SPARK-1565, update examples to be used with spark-submit script.
Prashant Sharma <prashant.s@imaginea.com>
2014-05-08 10:23:05 -0700
Commit: 30cfa8d, github.com/apache/spark/pull/552
[SQL] Improve SparkSQL Aggregates
Michael Armbrust <michael@databricks.com>
2014-05-08 01:08:43 -0400
Commit: 8f3b925, github.com/apache/spark/pull/683
Use numpy directly for matrix multiply.
Evan Sparks <evan.sparks@gmail.com>
2014-05-08 00:24:36 -0400
Commit: 010040f, github.com/apache/spark/pull/687
SPARK-1668: Add implicit preference as an option to examples/MovieLensALS
Sandeep <sandeep@techaddict.me>
2014-05-08 00:15:05 -0400
Commit: 35aa244, github.com/apache/spark/pull/597
SPARK-1544 Add support for deep decision trees.
Manish Amde <manish9ue@gmail.com>, manishamde <manish9ue@gmail.com>, Evan Sparks <sparks@cs.berkeley.edu>
2014-05-07 17:08:38 -0700
Commit: c7b2704, github.com/apache/spark/pull/475
Update GradientDescentSuite.scala
baishuo(白硕) <vc_java@hotmail.com>
2014-05-07 16:02:55 -0700
Commit: 0972b62, github.com/apache/spark/pull/588
[SPARK-1743][MLLIB] add loadLibSVMFile and saveAsLibSVMFile to pyspark
Xiangrui Meng <meng@databricks.com>
2014-05-07 16:01:11 -0700
Commit: bb90e87, github.com/apache/spark/pull/672
SPARK-1569 Spark on Yarn, authentication broken by pr299
Thomas Graves <tgraves@apache.org>
2014-05-07 15:51:53 -0700
Commit: 879eeee, github.com/apache/spark/pull/649
[SPARK-1688] Propagate PySpark worker stderr to driver
Andrew Or <andrewor14@gmail.com>
2014-05-07 14:35:22 -0700
Commit: 82c8e89, github.com/apache/spark/pull/603
Typo fix: fetchting -> fetching
Andrew Ash <andrew@andrewash.com>
2014-05-07 17:24:49 -0400
Commit: 0759ee7, github.com/apache/spark/pull/680
Nicer logging for SecurityManager startup
Andrew Ash <andrew@andrewash.com>
2014-05-07 17:24:12 -0400
Commit: 69e2726, github.com/apache/spark/pull/678
[SQL] Fix Performance Issue in data type casting
Cheng Hao <hao.cheng@intel.com>
2014-05-07 16:54:58 -0400
Commit: 82ceda2, github.com/apache/spark/pull/679
SPARK-1579: Clean up PythonRDD and avoid swallowing IOExceptions
Aaron Davidson <aaron@databricks.com>
2014-05-07 09:48:31 -0700
Commit: 18caa8c, github.com/apache/spark/pull/640
[SPARK-1460] Returning SchemaRDD instead of normal RDD on Set operations...
Kan Zhang <kzhang@apache.org>
2014-05-07 09:41:31 -0700
Commit: da9f9e0, github.com/apache/spark/pull/448
[WIP][Spark-SQL] Optimize the Constant Folding for Expression
Cheng Hao <hao.cheng@intel.com>, Michael Armbrust <michael@databricks.com>
2014-05-07 03:37:12 -0400
Commit: 756c969, github.com/apache/spark/pull/482
SPARK-1746: Support setting SPARK_JAVA_OPTS on executors for backwards compatibility
Patrick Wendell <pwendell@gmail.com>
2014-05-07 00:11:05 -0700
Commit: 00fac73, github.com/apache/spark/pull/676
[HOTFIX] SPARK-1637: There are some Streaming examples added after the PR #571 was last updated.
Sandeep <sandeep@techaddict.me>
2014-05-06 21:55:05 -0700
Commit: ade4756, github.com/apache/spark/pull/673
Proposal: clarify Scala programming guide on caching ...
Ethan Jewett <esjewett@gmail.com>
2014-05-06 20:50:08 -0700
Commit: 51e2775, github.com/apache/spark/pull/668
SPARK-1727. Correct small compile errors, typos, and markdown issues in (primarly) MLlib docs
Sean Owen <sowen@cloudera.com>
2014-05-06 20:07:22 -0700
Commit: 514ee93, github.com/apache/spark/pull/653
SPARK-1637: Clean up examples for 1.0
Sandeep <sandeep@techaddict.me>
2014-05-06 17:27:52 -0700
Commit: 8cfebf5, github.com/apache/spark/pull/571
SPARK-1737: Warn rather than fail when Java 7+ is used to create distributions
Patrick Wendell <pwendell@gmail.com>
2014-05-06 15:41:46 -0700
Commit: d8f1b33, github.com/apache/spark/pull/669
[SPARK-1549] Add Python support to spark-submit
Matei Zaharia <matei@databricks.com>
2014-05-06 15:12:35 -0700
Commit: d7ddb26, github.com/apache/spark/pull/664
SPARK-1734: spark-submit throws an exception: Exception in thread "main"...
witgo <witgo@qq.com>
2014-05-06 14:17:39 -0700
Commit: 48cc9a9, github.com/apache/spark/pull/665
[SPARK-1685] Cancel retryTimer on restart of Worker or AppClient
Mark Hamstra <markhamstra@gmail.com>
2014-05-06 12:53:39 -0700
Commit: 0aaa2c6, github.com/apache/spark/pull/602
Fix two download suggestions in the docs:
Patrick Wendell <pwendell@gmail.com>
2014-05-06 12:07:46 -0700
Commit: 1083f2b, github.com/apache/spark/pull/662
SPARK-1474: Spark on yarn assembly doesn't include AmIpFilter
Thomas Graves <tgraves@apache.org>
2014-05-06 12:00:09 -0700
Commit: 0c3e415, github.com/apache/spark/pull/406
Update OpenHashSet.scala
ArcherShao <ArcherShao@users.noreply.github.com>
2014-05-06 10:12:59 -0700
Commit: 4ff3929, github.com/apache/spark/pull/667
[SQL] SPARK-1732 - Support for null primitive values.
Michael Armbrust <michael@databricks.com>
2014-05-05 22:59:42 -0700
Commit: 39ac62d, github.com/apache/spark/pull/658
[SPARK-1735] Add the missing special profiles to make-distribution.sh
Andrew Or <andrewor14@gmail.com>
2014-05-05 22:14:47 -0700
Commit: 4708eff, github.com/apache/spark/pull/660
[SPARK-1678][SPARK-1679] In-memory compression bug fix and made compression configurable, disabled by default
Cheng Lian <lian.cs.zju@gmail.com>
2014-05-05 19:38:59 -0700
Commit: 2853e56, github.com/apache/spark/pull/608
[SPARK-1594][MLLIB] Cleaning up MLlib APIs and guide
Xiangrui Meng <meng@databricks.com>
2014-05-05 18:32:54 -0700
Commit: 32c960a, github.com/apache/spark/pull/524
Expose SparkListeners and relevant classes as DeveloperApi
Andrew Or <andrewor14@gmail.com>
2014-05-05 18:32:14 -0700
Commit: a5f765c, github.com/apache/spark/pull/648
SPARK-1728. JavaRDDLike.mapPartitionsWithIndex requires ClassTag
Sandy Ryza <sandy@cloudera.com>
2014-05-05 18:26:34 -0700
Commit: 01e3ff0, github.com/apache/spark/pull/657
[SPARK-1681] Include datanucleus jars in Spark Hive distribution
Andrew Or <andrewor14@gmail.com>
2014-05-05 16:28:07 -0700
Commit: 4d0dd50, github.com/apache/spark/pull/610
[SPARK-1504], [SPARK-1505], [SPARK-1558] Updated Spark Streaming guide
Tathagata Das <tathagata.das1565@gmail.com>
2014-05-05 15:28:19 -0700
Commit: 1fac4ec, github.com/apache/spark/pull/652
SPARK-1721: Reset the thread classLoader in the Mesos Executor
Bouke van der Bijl <boukevanderbijl@gmail.com>
2014-05-05 11:19:35 -0700
Commit: 80f4360, github.com/apache/spark/pull/620
SPARK-1556. jets3t dep doesn't update properly with newer Hadoop versions
Sean Owen <sowen@cloudera.com>
2014-05-05 10:33:49 -0700
Commit: 5d72283, github.com/apache/spark/pull/629
Updated doc for spark.closure.serializer to indicate only Java serializer work.
Reynold Xin <rxin@apache.org>
2014-05-05 00:52:06 -0700
Commit: 6be7226, github.com/apache/spark/pull/642
Address SPARK-1717
msiddalingaiah <madhu@madhu.com>
2014-05-04 21:59:10 -0700
Commit: b5c62c8, github.com/apache/spark/pull/641
SPARK-1710: spark-submit should print better errors than "InvocationTargetException"
Sandeep <sandeep@techaddict.me>
2014-05-04 20:51:53 -0700
Commit: afcb9ae, github.com/apache/spark/pull/630
EC2 script should exit with non-zero code on UsageError
Allan Douglas R. de Oliveira <allan@chaordicsystems.com>
2014-05-04 20:36:51 -0700
Commit: 7305278, github.com/apache/spark/pull/638
SPARK-1693: Most of the tests throw a java.lang.SecurityException when s...
witgo <witgo@qq.com>
2014-05-04 17:48:52 -0700
Commit: ec0bce1, github.com/apache/spark/pull/628
SPARK-1629. Addendum: Depend on commons lang3 (already used by tachyon) as it's used in ReplSuite, and return to use lang3 utility in Utils.scala
Sean Owen <sowen@cloudera.com>
2014-05-04 17:43:28 -0700
Commit: 4505bc2, github.com/apache/spark/pull/635
SPARK-1703 Warn users if Spark is run on JRE6 but compiled with JDK7.
Patrick Wendell <pwendell@gmail.com>
2014-05-04 12:22:23 -0700
Commit: 2f091d5, github.com/apache/spark/pull/627
SPARK-1663. (Addendum) Fix signature of one version of JavaPairRDDStream.reduceByKeyAndWindow()
Sean Owen <sowen@cloudera.com>
2014-05-04 11:55:29 -0700
Commit: 40d05a4, github.com/apache/spark/pull/633
SPARK-1658: Correctly identify if maven is installed and working
Rahul Singhal <rahul.singhal@guavus.com>
2014-05-04 11:08:39 -0700
Commit: 2ee5f04, github.com/apache/spark/pull/580
The default version of yarn is equal to the hadoop version
witgo <witgo@qq.com>
2014-05-03 23:32:12 -0700
Commit: acbf307, github.com/apache/spark/pull/626
Whitelist Hive Tests
Michael Armbrust <michael@databricks.com>
2014-05-03 23:13:51 -0700
Commit: e88a636, github.com/apache/spark/pull/596
[SQL] Better logging when applying rules.
Michael Armbrust <michael@databricks.com>
2014-05-03 18:38:44 -0700
Commit: e24d5cd, github.com/apache/spark/pull/616
EC2 configurable workers
Allan Douglas R. de Oliveira <allan@chaordicsystems.com>
2014-05-03 16:52:19 -0700
Commit: 8406ac4, github.com/apache/spark/pull/612
SPARK-1689 AppClient should indicate app is dead() when removed
Aaron Davidson <aaron@databricks.com>
2014-05-03 13:27:10 -0700
Commit: 36e687d, github.com/apache/spark/pull/605
[Bugfix] Tachyon file cleanup logical error
Cheng Lian <lian.cs.zju@gmail.com>
2014-05-03 13:23:52 -0700
Commit: bc3bfea, github.com/apache/spark/pull/575
SPARK-1663. Corrections for several compile errors in streaming code examples, and updates to follow API changes
Sean Owen <sowen@cloudera.com>
2014-05-03 12:31:31 -0700
Commit: 08c4d11, github.com/apache/spark/pull/589
[WIP] SPARK-1676: Cache Hadoop UGIs by default to prevent FileSystem leak
Thomas Graves <tgraves@apache.org>
2014-05-03 10:59:05 -0700
Commit: 0441515, github.com/apache/spark/pull/621
Update SchemaRDD.scala
ArcherShao <ArcherShao@users.noreply.github.com>
2014-05-03 00:17:36 -0700
Commit: 34f22bc, github.com/apache/spark/pull/619
SPARK-1700: Close socket file descriptors on task completion
Aaron Davidson <aaron@databricks.com>
2014-05-02 23:55:13 -0700
Commit: d2cbd3d, github.com/apache/spark/pull/623
SPARK-1492. Update Spark YARN docs to use spark-submit
Sandy Ryza <sandy@cloudera.com>
2014-05-02 21:42:31 -0700
Commit: a314342, github.com/apache/spark/pull/601
delete no use var
wangfei <wangfei_hello@126.com>
2014-05-02 21:34:54 -0700
Commit: b65def7, github.com/apache/spark/pull/613
SPARK-1695: java8-tests compiler error: package com.google.common.co...
witgo <witgo@qq.com>
2014-05-02 12:40:27 -0700
Commit: d28c058, github.com/apache/spark/pull/611
Add tests for FileLogger, EventLoggingListener, and ReplayListenerBus
Andrew Or <andrewor14@gmail.com>
2014-05-01 21:42:06 -0700
Commit: d4c8af8, github.com/apache/spark/pull/591
SPARK-1659: improvements spark-submit usage
witgo <witgo@qq.com>
2014-05-01 21:39:40 -0700
Commit: 18595dd, github.com/apache/spark/pull/581
fix the spelling mistake
wangfei <wangfei_hello@126.com>
2014-05-01 21:37:22 -0700
Commit: 35ca6c5, github.com/apache/spark/pull/614
[SQL] SPARK-1661 - Fix regex_serde test
Michael Armbrust <michael@databricks.com>
2014-05-01 21:32:43 -0700
Commit: d339b33, github.com/apache/spark/pull/595
SPARK-1691: Support quoted arguments inside of spark-submit.
Patrick Wendell <pwendell@gmail.com>
2014-05-01 01:15:51 -0700
Commit: dd601bf, github.com/apache/spark/pull/609
Fix SPARK-1629: Spark should inline use of commons-lang `SystemUtils.IS_...
witgo <witgo@qq.com>
2014-04-30 09:49:45 -0700
Commit: 74bb88b, github.com/apache/spark/pull/569
SPARK-1004. PySpark on YARN
Sandy Ryza <sandy@cloudera.com>
2014-04-29 23:24:34 -0700
Commit: 177361c, github.com/apache/spark/pull/30
Handle the vals that never used
WangTao <barneystinson@aliyun.com>
2014-04-29 22:07:20 -0700
Commit: b0ded1f, github.com/apache/spark/pull/565
Args for worker rather than master
Chen Chao <crazyjvm@gmail.com>
2014-04-29 22:05:40 -0700
Commit: 775020f, github.com/apache/spark/pull/587
[SPARK-1646] Micro-optimisation of ALS
Tor Myklebust <tmyklebu@gmail.com>
2014-04-29 22:04:34 -0700
Commit: 92269f9, github.com/apache/spark/pull/568
[SPARK-1674] fix interrupted system call error in pyspark's RDD.pipe
Xiangrui Meng <meng@databricks.com>
2014-04-29 18:06:45 -0700
Commit: 919ed31, github.com/apache/spark/pull/594
SPARK-1588. Restore SPARK_YARN_USER_ENV and SPARK_JAVA_OPTS for YARN.
Sandy Ryza <sandy@cloudera.com>
2014-04-29 12:54:02 -0700
Commit: 5f48721, github.com/apache/spark/pull/586
SPARK-1509: add zipWithIndex zipWithUniqueId methods to java api
witgo <witgo@qq.com>
2014-04-29 11:30:47 -0700
Commit: 9754d1b, github.com/apache/spark/pull/423
SPARK-1557 Set permissions on event log files/directories
Thomas Graves <tgraves@apache.org>
2014-04-29 09:19:48 -0500
Commit: bccd13e, github.com/apache/spark/pull/538
HOTFIX: minor change to release script
Patrick Wendell <pwendell@gmail.com>
2014-04-29 00:59:38 -0700
Commit: c27ce2b
HOTFIX: minor change to release script
Patrick Wendell <pwendell@gmail.com>
2014-04-29 00:53:32 -0700
Commit: 838cb0e
[SPARK-1636][MLLIB] Move main methods to examples
Xiangrui Meng <meng@databricks.com>
2014-04-29 00:41:03 -0700
Commit: aa519e3, github.com/apache/spark/pull/584
Minor fix to python table caching API.
Michael Armbrust <michael@databricks.com>
2014-04-29 00:36:15 -0700
Commit: 0995787, github.com/apache/spark/pull/585
HOTFIX: Bug in release script
Patrick Wendell <pwendell@gmail.com>
2014-04-29 00:10:17 -0700
Commit: 4ed58aa
Manual revert of rc2 version changes.
Patrick Wendell <pwendell@gmail.com>
2014-04-28 22:59:09 -0700
Commit: 2863344
Improved build configuration
witgo <witgo@qq.com>
2014-04-28 22:50:51 -0700
Commit: ee96460, github.com/apache/spark/pull/480
SPARK-1652: Remove incorrect deprecation warning in spark-submit
Patrick Wendell <pwendell@gmail.com>
2014-04-28 18:14:59 -0700
Commit: 42ba706, github.com/apache/spark/pull/578
SPARK-1654 and SPARK-1653: Fixes in spark-submit.
Patrick Wendell <pwendell@gmail.com>
2014-04-28 17:29:22 -0700
Commit: 2c9ce20, github.com/apache/spark/pull/576
SPARK-1652: Spark submit should fail gracefully if YARN not enabled
Patrick Wendell <pwendell@gmail.com>
2014-04-28 17:26:57 -0700
Commit: 38bf23e, github.com/apache/spark/pull/579
Changes to dev release script
Patrick Wendell <pwendell@gmail.com>
2014-04-28 13:58:42 -0700
Commit: 32d9db3
[SPARK-1633][Streaming] Java API unit test and example for custom streaming receiver in Java
Tathagata Das <tathagata.das1565@gmail.com>
2014-04-28 13:58:09 -0700
Commit: 6d89faf, github.com/apache/spark/pull/558
[SQL]Append some missing types for HiveUDF
Cheng Hao <hao.cheng@intel.com>
2014-04-27 23:59:42 -0700
Commit: 42cb3b4, github.com/apache/spark/pull/459
Update the import package name for TestHive in sbt shell
Cheng Hao <hao.cheng@intel.com>
2014-04-27 23:57:29 -0700
Commit: eb9308e, github.com/apache/spark/pull/574
Fix SPARK-1609: Executor fails to start when Command.extraJavaOptions contains multiple Java options
witgo <witgo@qq.com>
2014-04-27 19:41:02 -0700
Commit: 7bbf313, github.com/apache/spark/pull/547
SPARK-1145: Memory mapping with many small blocks can cause JVM allocation failures
Patrick Wendell <pwendell@gmail.com>
2014-04-27 17:40:56 -0700
Commit: 2f24159, github.com/apache/spark/pull/43
HOTFIX: Minor patch to merge script.
Patrick Wendell <pwendell@gmail.com>
2014-04-27 15:45:17 -0700
Commit: 99285d0
SPARK-1651: Delete existing deployment directory
Rahul Singhal <rahul.singhal@guavus.com>
2014-04-27 15:50:48 -0700
Commit: 3c6c6c2, github.com/apache/spark/pull/573
SPARK-1648 Support closing JIRA's as part of merge script.
Patrick Wendell <pwendell@gmail.com>
2014-04-27 15:41:57 -0700
Commit: da26f9b, github.com/apache/spark/pull/570
SPARK-1650: Correctly identify maven project version
Rahul Singhal <rahul.singhal@guavus.com>
2014-04-27 15:17:06 -0700
Commit: 98b13e0, github.com/apache/spark/pull/572
SPARK-1606: Infer user application arguments instead of requiring --arg.
Patrick Wendell <pwendell@gmail.com>
2014-04-26 19:24:29 -0700
Commit: ce57624, github.com/apache/spark/pull/563
SPARK-1467: Make StorageLevel.apply() factory methods Developer APIs
Sandeep <sandeep@techaddict.me>
2014-04-26 19:04:33 -0700
Commit: 18ecc63, github.com/apache/spark/pull/551
[SPARK-1608] [SQL] Fix Cast.nullable when cast from StringType to NumericType/TimestampType.
Takuya UESHIN <ueshin@happy-camper.st>
2014-04-26 14:39:54 -0700
Commit: dcea67f, github.com/apache/spark/pull/532
add note of how to support table with more than 22 fields
wangfei <wangfei1@huawei.com>
2014-04-26 14:38:42 -0700
Commit: a020686, github.com/apache/spark/pull/564
[Spark-1382] Fix NPE in DStream.slice (updated version of #365)
zsxwing <zsxwing@gmail.com>, Tathagata Das <tathagata.das1565@gmail.com>
2014-04-25 19:04:34 -0700
Commit: f85c681, github.com/apache/spark/pull/562
SPARK-1632. Remove unnecessary boxing in compares in ExternalAppendOnlyM...
Sandy Ryza <sandy@cloudera.com>
2014-04-25 17:55:04 -0700
Commit: 94c71e0, github.com/apache/spark/pull/559
SPARK-1235: manage the DAGScheduler EventProcessActor with supervisor and refactor the DAGScheduler with Akka
CodingCat <zhunansjtu@gmail.com>, Xiangrui Meng <meng@databricks.com>, Nan Zhu <CodingCat@users.noreply.github.com>
2014-04-25 16:04:48 -0700
Commit: 5673c1e, github.com/apache/spark/pull/186
SPARK-1607. HOTFIX: Fix syntax adapting Int result to Short
Sean Owen <sowen@cloudera.com>
2014-04-25 14:17:38 -0700
Commit: 784b2a6, github.com/apache/spark/pull/556
Update KafkaWordCount.scala
baishuo(白硕) <vc_java@hotmail.com>
2014-04-25 13:18:49 -0700
Commit: a0912a8, github.com/apache/spark/pull/523
Delete the val that never used
WangTao <barneystinson@aliyun.com>
2014-04-25 11:47:01 -0700
Commit: 97bfeda, github.com/apache/spark/pull/553
SPARK-1621 Upgrade Chill to 0.3.6
Matei Zaharia <matei@databricks.com>
2014-04-25 11:12:41 -0700
Commit: 2c8dfd4, github.com/apache/spark/pull/543
SPARK-1619 Launch spark-shell with spark-submit
Patrick Wendell <pwendell@gmail.com>
2014-04-24 23:59:16 -0700
Commit: 8ba7f40, github.com/apache/spark/pull/542
SPARK-1607. Replace octal literals, removed in Scala 2.11, with hex literals
Sean Owen <sowen@cloudera.com>
2014-04-24 23:34:00 -0700
Commit: 7493ca9, github.com/apache/spark/pull/529
Call correct stop().
Aaron Davidson <aaron@databricks.com>
2014-04-24 23:22:03 -0700
Commit: 3eba9bd, github.com/apache/spark/pull/527
SPARK-1242 Add aggregate to python rdd
Holden Karau <holden@pigscanfly.ca>
2014-04-24 23:07:54 -0700
Commit: f09a2c0, github.com/apache/spark/pull/139
Fix [SPARK-1078]: Remove the Unnecessary lift-json dependency
Sandeep <sandeep@techaddict.me>
2014-04-24 21:51:52 -0700
Commit: 496b9ae, github.com/apache/spark/pull/536
[Typo] In the maven docs: chd -> cdh
Andrew Or <andrewor14@gmail.com>
2014-04-24 21:51:17 -0700
Commit: db69841, github.com/apache/spark/pull/548
Generalize pattern for planning hash joins.
Michael Armbrust <michael@databricks.com>
2014-04-24 21:42:33 -0700
Commit: ab131ab, github.com/apache/spark/pull/418
[SPARK-1617] and [SPARK-1618] Improvements to streaming ui and bug fix to socket receiver
Tathagata Das <tathagata.das1565@gmail.com>
2014-04-24 21:34:37 -0700
Commit: d933c71, github.com/apache/spark/pull/540
SPARK-1584: Upgrade Flume dependency to 1.4.0
tmalaska <ted.malaska@cloudera.com>
2014-04-24 20:31:17 -0700
Commit: 777a9a5, github.com/apache/spark/pull/507
SPARK-1586 Windows build fixes
Mridul Muralidharan <mridulm80@apache.org>
2014-04-24 20:48:33 -0700
Commit: 51a387a, github.com/apache/spark/pull/505
[SPARK-986]: Job cancelation for PySpark
Ahir Reddy <ahirreddy@gmail.com>
2014-04-24 20:21:10 -0700
Commit: 7b6d774, github.com/apache/spark/pull/541
[SPARK-1615] Synchronize accesses to the LiveListenerBus' event queue
Andrew Or <andrewor14@gmail.com>
2014-04-24 20:18:15 -0700
Commit: 963046c, github.com/apache/spark/pull/544
[SPARK-1510] Spark Streaming metrics source for metrics system
jerryshao <saisai.shao@intel.com>, Tathagata Das <tathagata.das1565@gmail.com>
2014-04-24 18:56:57 -0700
Commit: 0bc0f36, github.com/apache/spark/pull/545
Spark 1489 Fix the HistoryServer view acls
Thomas Graves <tgraves@apache.org>
2014-04-24 18:38:10 -0700
Commit: c8dd132, github.com/apache/spark/pull/509
[SQL] Add support for parsing indexing into arrays in SQL.
Michael Armbrust <michael@databricks.com>
2014-04-24 18:21:00 -0700
Commit: 2a35fba, github.com/apache/spark/pull/518
[SPARK-1592][streaming] Automatically remove streaming input blocks
Tathagata Das <tathagata.das1565@gmail.com>
2014-04-24 18:18:22 -0700
Commit: a3b6d85, github.com/apache/spark/pull/512
SPARK-1438 RDD.sample() make seed param optional
Arun Ramakrishnan <smartnut007@gmail.com>
2014-04-24 17:27:16 -0700
Commit: 521d435, github.com/apache/spark/pull/477
SPARK-1104: kill Process in workerThread of ExecutorRunner
CodingCat <zhunansjtu@gmail.com>
2014-04-24 15:55:18 -0700
Commit: a1f8779, github.com/apache/spark/pull/35
Fix Scala Style
Sandeep <sandeep@techaddict.me>
2014-04-24 15:07:23 -0700
Commit: 2250c7a, github.com/apache/spark/pull/531
SPARK-1494 Don't initialize classes loaded by MIMA excludes, attempt 2
Michael Armbrust <michael@databricks.com>
2014-04-24 14:54:01 -0700
Commit: 5ca01f6, github.com/apache/spark/pull/526
Spark 1490 Add kerberos support to the HistoryServer
Thomas Graves <tgraves@apache.org>
2014-04-24 11:15:12 -0700
Commit: 866b03e, github.com/apache/spark/pull/513
SPARK-1611: Fix incorrect initialization order in AppendOnlyMap
zsxwing <zsxwing@gmail.com>
2014-04-24 11:13:40 -0700
Commit: 00a3ccc, github.com/apache/spark/pull/534
SPARK-1488. Squash more language feature warnings in new commits by importing implicitConversion
Sean Owen <sowen@cloudera.com>
2014-04-24 10:06:18 -0700
Commit: 8d92d93, github.com/apache/spark/pull/528
Small changes to release script
Patrick Wendell <pwendell@gmail.com>
2014-04-24 09:59:44 -0700
Commit: 563be2f
[SPARK-1610] [SQL] Fix Cast to use exact type value when cast from BooleanType to NumericTy...
Takuya UESHIN <ueshin@happy-camper.st>
2014-04-24 09:57:28 -0700
Commit: 8f8e051, github.com/apache/spark/pull/533
SPARK-1601 & SPARK-1602: two bug fixes related to cancellation
Reynold Xin <rxin@apache.org>
2014-04-24 00:27:45 -0700
Commit: f98aac9, github.com/apache/spark/pull/521
SPARK-1587 Fix thread leak
Mridul Muralidharan <mridulm80@apache.org>
2014-04-23 23:20:55 -0700
Commit: 8684a15, github.com/apache/spark/pull/504
[Fix #79] Replace Breakable For Loops By While Loops
Sandeep <sandeep@techaddict.me>
2014-04-23 22:47:59 -0700
Commit: e890771, github.com/apache/spark/pull/503
SPARK-1589: Fix the incorrect compare
zsxwing <zsxwing@gmail.com>
2014-04-23 22:36:02 -0700
Commit: 9716a72, github.com/apache/spark/pull/508
Mark all fields of EdgePartition, Graph, and GraphOps transient
Ankur Dave <ankurdave@gmail.com>
2014-04-23 22:01:13 -0700
Commit: bdd2691, github.com/apache/spark/pull/520
Update Java api for setJobGroup with interruptOnCancel
Aaron Davidson <aaron@databricks.com>
2014-04-23 22:00:22 -0700
Commit: 36511ea, github.com/apache/spark/pull/522
[Hot Fix #469] Fix flaky test in SparkListenerSuite
Andrew Or <andrewor14@gmail.com>
2014-04-23 21:59:33 -0700
Commit: 99c0c33, github.com/apache/spark/pull/516
[SPARK-1540] Add an optional Ordering parameter to PairRDDFunctions.
Matei Zaharia <matei@databricks.com>
2014-04-23 17:03:54 -0700
Commit: 31c7d83, github.com/apache/spark/pull/487
SPARK-1582 Invoke Thread.interrupt() when cancelling jobs
Aaron Davidson <aaron@databricks.com>
2014-04-23 16:52:49 -0700
Commit: 55e6bea, github.com/apache/spark/pull/498
Honor default fs name when initializing event logger.
Marcelo Vanzin <vanzin@cloudera.com>
2014-04-23 14:47:38 -0700
Commit: 46b30f9, github.com/apache/spark/pull/450
SPARK-1572 Don't kill Executor if PythonRDD fails while computing parent
Aaron Davidson <aaron@databricks.com>
2014-04-23 14:46:30 -0700
Commit: be8f26f, github.com/apache/spark/pull/486
SPARK-1583: Fix a bug that using java.util.HashMap by mistake
zsxwing <zsxwing@gmail.com>
2014-04-23 14:12:20 -0700
Commit: 19ef78f, github.com/apache/spark/pull/500
SPARK-1119 and other build improvements
Patrick Wendell <pwendell@gmail.com>
2014-04-23 10:19:32 -0700
Commit: d36d75c, github.com/apache/spark/pull/502
[SQL] SPARK-1571 Mistake in java example code
Michael Armbrust <michael@databricks.com>
2014-04-22 22:19:32 -0700
Commit: b0d8793, github.com/apache/spark/pull/496
SPARK-1494 Don't initialize classes loaded by MIMA excludes.
Michael Armbrust <michael@databricks.com>
2014-04-22 21:56:15 -0700
Commit: 18b1867, github.com/apache/spark/pull/494
SPARK-1562 Fix visibility / annotation of Spark SQL APIs
Michael Armbrust <michael@databricks.com>
2014-04-22 20:02:33 -0700
Commit: 0e03e6a, github.com/apache/spark/pull/489
[FIX: SPARK-1376] use --arg instead of --args in SparkSubmit to avoid warning messages
Xiangrui Meng <meng@databricks.com>
2014-04-22 19:38:27 -0700
Commit: 61930bd, github.com/apache/spark/pull/485
[streaming][SPARK-1578] Removed requirement for TTL in StreamingContext.
Tathagata Das <tathagata.das1565@gmail.com>
2014-04-22 19:35:13 -0700
Commit: bf47559, github.com/apache/spark/pull/491
[Spark-1538] Fix SparkUI incorrectly hiding persisted RDDs
Andrew Or <andrewor14@gmail.com>
2014-04-22 19:24:03 -0700
Commit: 104590c, github.com/apache/spark/pull/469
Assorted clean-up for Spark-on-YARN.
Patrick Wendell <pwendell@gmail.com>
2014-04-22 19:22:06 -0700
Commit: f764f47, github.com/apache/spark/pull/488
[SPARK-1570] Fix classloading in JavaSQLContext.applySchema
Kan Zhang <kzhang@apache.org>
2014-04-22 15:05:12 -0700
Commit: f9734e2, github.com/apache/spark/pull/484
Fix compilation on Hadoop 2.4.x.
Marcelo Vanzin <vanzin@cloudera.com>
2014-04-22 14:28:41 -0700
Commit: b6ba546, github.com/apache/spark/pull/483
[Fix #204] Eliminate delay between binding and log checking
Andrew Or <andrewor14@gmail.com>
2014-04-22 14:27:49 -0700
Commit: 54c96c2, github.com/apache/spark/pull/441
[Fix #274] Document + fix annotation usages
Andrew Or <andrewor14@gmail.com>
2014-04-21 22:24:44 -0700
Commit: 898fc34, github.com/apache/spark/pull/470
[HOTFIX] SPARK-1399: remove outdated comments
CodingCat <zhunansjtu@gmail.com>
2014-04-22 09:43:13 -0700
Commit: 61d7401, github.com/apache/spark/pull/474
[SPARK-1281] Improve partitioning in ALS
Tor Myklebust <tmyklebu@gmail.com>
2014-04-22 11:07:30 -0700
Commit: 4834adf, github.com/apache/spark/pull/407
fix bugs of dot in python
Xusen Yin <yinxusen@gmail.com>
2014-04-22 11:06:18 -0700
Commit: 4f2f093, github.com/apache/spark/pull/463
[SPARK-1506][MLLIB] Documentation improvements for MLlib 1.0
Xiangrui Meng <meng@databricks.com>
2014-04-22 11:20:47 -0700
Commit: 3f708f5, github.com/apache/spark/pull/422
[SPARK-1560]: Updated Pyrolite Dependency to be Java 6 compatible
Ahir Reddy <ahirreddy@gmail.com>
2014-04-22 09:44:41 -0700
Commit: 798d93f, github.com/apache/spark/pull/479
SPARK-1496: Have jarOfClass return Option[String]
Patrick Wendell <pwendell@gmail.com>
2014-04-22 00:42:16 -0700
Commit: 72aa131, github.com/apache/spark/pull/438
[SPARK-1459] Use local path (and not complete URL) when opening local lo...
Marcelo Vanzin <vanzin@cloudera.com>
2014-04-21 23:10:53 -0700
Commit: 0a73103, github.com/apache/spark/pull/375
[SPARK-1439, SPARK-1440] Generate unified Scaladoc across projects and Javadocs
Matei Zaharia <matei@databricks.com>
2014-04-21 21:57:40 -0700
Commit: b0d70e4, github.com/apache/spark/pull/457
[SPARK-1332] Improve Spark Streaming's Network Receiver and InputDStream API [WIP]
Tathagata Das <tathagata.das1565@gmail.com>
2014-04-21 19:04:49 -0700
Commit: 94cbe23, github.com/apache/spark/pull/300
Updating versions for Spark 1.0
Patrick Wendell <pwendell@gmail.com>
2014-04-21 16:59:38 -0700
Commit: a34e6fd
Dev script: include RC name in git tag
Patrick Wendell <pwendell@gmail.com>
2014-04-21 14:21:17 -0700
Commit: 1532af9
SPARK-1399: show stage failure reason in UI
CodingCat <zhunansjtu@gmail.com>, Nan Zhu <CodingCat@users.noreply.github.com>
2014-04-21 14:10:23 -0700
Commit: 4b9220d, github.com/apache/spark/pull/421
SPARK-1539: RDDPage.scala contains RddPage class
Xiangrui Meng <meng@databricks.com>
2014-04-21 12:48:02 -0700
Commit: 8aa3860, github.com/apache/spark/pull/454
[Hot Fix] Ignore org.apache.spark.ui.UISuite tests
Andrew Or <andrewor14@gmail.com>
2014-04-21 12:37:43 -0700
Commit: d62ce6d, github.com/apache/spark/pull/466
REPL cleanup.
Michael Armbrust <michael@databricks.com>
2014-04-19 17:32:24 -0700
Commit: 8e1e7ec, github.com/apache/spark/pull/451
Clean up and simplify Spark configuration
Patrick Wendell <pwendell@gmail.com>
2014-04-21 10:26:33 -0700
Commit: 29ee101, github.com/apache/spark/pull/299
[SPARK-1535] ALS: Avoid the garbage-creating ctor of DoubleMatrix
Tor Myklebust <tmyklebu@gmail.com>
2014-04-19 15:10:18 -0700
Commit: 9ce6ed4, github.com/apache/spark/pull/442
Add insertInto and saveAsTable to Python API.
Michael Armbrust <michael@databricks.com>
2014-04-19 15:08:54 -0700
Commit: 6ab0719, github.com/apache/spark/pull/447
Use scala deprecation instead of java.
Michael Armbrust <michael@databricks.com>
2014-04-19 15:06:04 -0700
Commit: bfb09c6, github.com/apache/spark/pull/452
README update
Reynold Xin <rxin@apache.org>
2014-04-18 22:34:39 -0700
Commit: 2fe6b18, github.com/apache/spark/pull/443
SPARK-1482: Fix potential resource leaks in saveAsHadoopDataset and save...
zsxwing <zsxwing@gmail.com>
2014-04-18 17:49:22 -0700
Commit: ea17460, github.com/apache/spark/pull/400
SPARK-1456 Remove view bounds on Ordered in favor of a context bound on Ordering.
Michael Armbrust <michael@databricks.com>
2014-04-18 12:04:13 -0700
Commit: 9e21b97, github.com/apache/spark/pull/410
Fixed broken pyspark shell.
Reynold Xin <rxin@apache.org>
2014-04-18 10:10:13 -0700
Commit: d8767c4, github.com/apache/spark/pull/444
SPARK-1523: improve the readability of code in AkkaUtil
CodingCat <zhunansjtu@gmail.com>
2014-04-18 10:05:00 -0700
Commit: 171cea8, github.com/apache/spark/pull/434
SPARK-1357 (addendum). More Experimental items in MLlib
Sean Owen <sowen@cloudera.com>
2014-04-18 10:04:02 -0700
Commit: 1a30429, github.com/apache/spark/pull/372
[SPARK-1520] remove fastutil from dependencies
Xiangrui Meng <meng@databricks.com>
2014-04-18 10:03:15 -0700
Commit: c40eec8, github.com/apache/spark/pull/437
Reuses Row object in ExistingRdd.productToRowRdd()
Cheng Lian <lian.cs.zju@gmail.com>
2014-04-18 10:02:27 -0700
Commit: 977467e, github.com/apache/spark/pull/432
SPARK-1483: Rename minSplits to minPartitions in public APIs
CodingCat <zhunansjtu@gmail.com>
2014-04-18 10:01:16 -0700
Commit: 969a075, github.com/apache/spark/pull/430
HOTFIX: Ignore streaming UI test
Patrick Wendell <pwendell@gmail.com>
2014-04-17 17:33:24 -0700
Commit: 1c0dc37, github.com/apache/spark/pull/440
FIX: Don't build Hive in assembly unless running Hive tests.
Patrick Wendell <pwendell@gmail.com>
2014-04-17 17:24:00 -0700
Commit: 3225272, github.com/apache/spark/pull/439
Add clean to build
Patrick Wendell <pwendell@gmail.com>
2014-04-16 16:32:34 -0700
Commit: 67d01d8
HOTFIX: Use file name and not paths for excludes
Patrick Wendell <pwendell@gmail.com>
2014-04-14 15:51:54 -0700
Commit: 5349fab
SPARK-1408 Modify Spark on Yarn to point to the history server when app ...
Thomas Graves <tgraves@apache.org>
2014-04-17 16:36:37 -0500
Commit: 6195fb8, github.com/apache/spark/pull/362
[SPARK-1395] Allow "local:" URIs to work on Yarn.
Marcelo Vanzin <vanzin@cloudera.com>
2014-04-17 10:29:38 -0500
Commit: a83a794, github.com/apache/spark/pull/303
[python alternative] pyspark require Python2, failing if system default is Py3 from shell.py
AbhishekKr <abhikumar163@gmail.com>
2014-04-16 19:05:40 -0700
Commit: b3ad707, github.com/apache/spark/pull/399
SPARK-1462: Examples of ML algorithms are using deprecated APIs
Sandeep <sandeep@techaddict.me>
2014-04-16 18:23:07 -0700
Commit: 13fb4c7, github.com/apache/spark/pull/416
Include stack trace for exceptions thrown by user code.
Michael Armbrust <michael@databricks.com>
2014-04-16 18:12:56 -0700
Commit: aef8a4a, github.com/apache/spark/pull/409
Update ReducedWindowedDStream.scala
baishuo(白硕) <vc_java@hotmail.com>
2014-04-16 18:08:11 -0700
Commit: 822353d, github.com/apache/spark/pull/425
misleading task number of groupByKey
Chen Chao <crazyjvm@gmail.com>
2014-04-16 17:58:42 -0700
Commit: 51c41da, github.com/apache/spark/pull/403
Fixing a race condition in event listener unit test
Kan Zhang <kzhang@apache.org>
2014-04-16 17:39:11 -0700
Commit: f0abf5f, github.com/apache/spark/pull/401
remove unnecessary brace and semicolon in 'putBlockInfo.synchronize' block
Chen Chao <crazyjvm@gmail.com>
2014-04-16 17:30:01 -0700
Commit: e43e31d, github.com/apache/spark/pull/411
SPARK-1329: Create pid2vid with correct number of partitions
Ankur Dave <ankurdave@gmail.com>
2014-04-16 17:16:55 -0700
Commit: b4ea3d9, github.com/apache/spark/pull/368
Rebuild routing table after Graph.reverse
Ankur Dave <ankurdave@gmail.com>
2014-04-16 17:15:50 -0700
Commit: 602b9ea, github.com/apache/spark/pull/431
[SPARK-1511] use Files.move instead of renameTo in TestUtils.scala
Ye Xianjin <advancedxy@gmail.com>
2014-04-16 14:56:22 -0700
Commit: 87a7c4f, github.com/apache/spark/pull/427
SPARK-1465: Spark compilation is broken with the latest hadoop-2.4.0 release
xuan <xuan@MacBook-Pro.local>, xuan <xuan@macbook-pro.home>
2014-04-16 14:41:22 -0500
Commit: d8fc4a4, github.com/apache/spark/pull/396
SPARK-1469: Scheduler mode should accept lower-case definitions and have...
Sandeep <sandeep@techaddict.me>
2014-04-16 09:58:57 -0700
Commit: b75301f, github.com/apache/spark/pull/388
Minor addition to SPARK-1497
Patrick Wendell <pwendell@gmail.com>
2014-04-16 09:43:17 -0700
Commit: 4479ecd
SPARK-1497. Fix scalastyle warnings in YARN, Hive code
Sean Owen <sowen@cloudera.com>
2014-04-16 09:34:59 -0700
Commit: c744d66, github.com/apache/spark/pull/413
SPARK-1310: Start adding k-fold cross validation to MLLib [adds kFold to MLUtils & fixes bug in BernoulliSampler]
Holden Karau <holden@pigscanfly.ca>
2014-04-16 09:33:27 -0700
Commit: 8efec04, github.com/apache/spark/pull/18
update spark.default.parallelism
Chen Chao <crazyjvm@gmail.com>
2014-04-16 09:14:18 -0700
Commit: e4f5577, github.com/apache/spark/pull/389
Loads test tables when running "sbt hive/console" without HIVE_DEV_HOME
Cheng Lian <lian.cs.zju@gmail.com>
2014-04-16 08:54:34 -0700
Commit: 9e908ab, github.com/apache/spark/pull/417
Make "spark logo" link refer to "/".
Marcelo Vanzin <vanzin@cloudera.com>
2014-04-16 08:53:01 -0700
Commit: 5fe18a7, github.com/apache/spark/pull/408
[SPARK-959] Updated SBT from 0.13.1 to 0.13.2
Cheng Lian <lian.cs.zju@gmail.com>
2014-04-16 08:52:14 -0700
Commit: 1ea9a21, github.com/apache/spark/pull/426
[SQL] SPARK-1424 Generalize insertIntoTable functions on SchemaRDDs
Michael Armbrust <michael@databricks.com>
2014-04-15 20:40:40 -0700
Commit: e5130d9, github.com/apache/spark/pull/354
[WIP] SPARK-1430: Support sparse data in Python MLlib
Matei Zaharia <matei@databricks.com>
2014-04-15 20:33:24 -0700
Commit: 95647fa, github.com/apache/spark/pull/341
[FIX] update sbt-idea to version 1.6.0
Xiangrui Meng <meng@databricks.com>
2014-04-15 19:37:32 -0700
Commit: 33d6e37, github.com/apache/spark/pull/419
SPARK-1455: Better isolation for unit tests.
Patrick Wendell <pwendell@gmail.com>
2014-04-15 19:34:39 -0700
Commit: 110e825, github.com/apache/spark/pull/420
Decision Tree documentation for MLlib programming guide
Manish Amde <manish9ue@gmail.com>
2014-04-15 11:14:28 -0700
Commit: 194ed06, github.com/apache/spark/pull/402
[SPARK-1157][MLlib] L-BFGS Optimizer based on Breeze's implementation.
DB Tsai <dbtsai@alpinenow.com>
2014-04-15 11:12:47 -0700
Commit: 5812472, github.com/apache/spark/pull/353
SPARK-1501: Ensure assertions in Graph.apply are asserted.
William Benton <willb@redhat.com>
2014-04-15 10:38:42 -0700
Commit: 692dd69, github.com/apache/spark/pull/415
SPARK-1426: Make MLlib work with NumPy versions older than 1.7
Sandeep <sandeep@techaddict.me>
2014-04-15 00:19:43 -0700
Commit: 1491b2a, github.com/apache/spark/pull/391
SPARK-1374: PySpark API for SparkSQL
Ahir Reddy <ahirreddy@gmail.com>, Michael Armbrust <michael@databricks.com>
2014-04-15 00:07:55 -0700
Commit: 7433f64, github.com/apache/spark/pull/363
SPARK-1488. Resolve scalac feature warnings during build
Sean Owen <sowen@cloudera.com>
2014-04-14 19:50:00 -0700
Commit: 7471828, github.com/apache/spark/pull/404
[BUGFIX] In-memory columnar storage bug fixes
Cheng Lian <lian.cs.zju@gmail.com>, Michael Armbrust <michael@databricks.com>
2014-04-14 15:22:43 -0700
Commit: fdebb69, github.com/apache/spark/pull/374
[SPARK-1415] Hadoop min split for wholeTextFiles()
Xusen Yin <yinxusen@gmail.com>
2014-04-13 13:18:52 -0700
Commit: 1cf565f, github.com/apache/spark/pull/376
SPARK-1480: Clean up use of classloaders
Patrick Wendell <pwendell@gmail.com>
2014-04-13 08:58:37 -0700
Commit: 3537e25, github.com/apache/spark/pull/398
[SPARK-1403] Move the class loader creation back to where it was in 0.9.0
Bharath Bhushan <manku.timma@outlook.com>
2014-04-12 20:52:29 -0700
Commit: c970d86, github.com/apache/spark/pull/322
[Fix #204] Update out-dated comments
Andrew Or <andrewor14@gmail.com>
2014-04-12 16:33:38 -0700
Commit: 52d401b, github.com/apache/spark/pull/381
[SPARK-1386] Web UI for Spark Streaming
Tathagata Das <tathagata.das1565@gmail.com>, Andrew Or <andrewor14@gmail.com>
2014-04-11 23:33:49 -0700
Commit: f36dc3f, github.com/apache/spark/pull/290
SPARK-1057 (alternative) Remove fastutil
Sean Owen <sowen@cloudera.com>
2014-04-11 22:46:47 -0700
Commit: 4dfcb38, github.com/apache/spark/pull/266
Update WindowedDStream.scala
baishuo(白硕) <vc_java@hotmail.com>
2014-04-11 20:33:42 -0700
Commit: dac6240, github.com/apache/spark/pull/390
[WIP] [SPARK-1328] Add vector statistics
Xusen Yin <yinxusen@gmail.com>, Xiangrui Meng <meng@databricks.com>
2014-04-11 19:43:22 -0700
Commit: ce0ce3d, github.com/apache/spark/pull/268
[FIX] make coalesce test deterministic in RDDSuite
Xiangrui Meng <meng@databricks.com>
2014-04-11 19:41:40 -0700
Commit: 9afaeed, github.com/apache/spark/pull/387
HOTFIX: Ignore python metastore files in RAT checks.
Patrick Wendell <pwendell@gmail.com>
2014-04-11 13:23:21 -0700
Commit: 79eb276, github.com/apache/spark/pull/393
[SPARK-1225, 1241] [MLLIB] Add AreaUnderCurve and BinaryClassificationMetrics
Xiangrui Meng <meng@databricks.com>
2014-04-11 12:06:13 -0700
Commit: e6128b5, github.com/apache/spark/pull/364
Some clean up in build/docs
Patrick Wendell <pwendell@gmail.com>
2014-04-11 10:45:27 -0700
Commit: 170b09d, github.com/apache/spark/pull/382
SPARK-1417: Spark on Yarn - spark UI link from resourcemanager is broken
Thomas Graves <tgraves@apache.org>
2014-04-11 13:17:48 +0530
Commit: 9e90c46, github.com/apache/spark/pull/344
SPARK-1202: Improvements to task killing in the UI.
Patrick Wendell <pwendell@gmail.com>
2014-04-10 20:43:56 -0700
Commit: a1bb4c6, github.com/apache/spark/pull/386
Add Spark v0.9.1 to ec2 launch script and use it as the default
Harvey Feng <hyfeng224@gmail.com>
2014-04-10 18:25:54 -0700
Commit: 59de39b, github.com/apache/spark/pull/385
Set spark.executor.uri from environment variable (needed by Mesos)
Ivan Wick <ivanwick+github@gmail.com>
2014-04-10 17:49:30 -0700
Commit: 41df293, github.com/apache/spark/pull/311
SPARK-1202 - Add a "cancel" button in the UI for stages
Sundeep Narravula <sundeepn@superduel.local>, Sundeep Narravula <sundeepn@dhcpx-204-110.corp.yahoo.com>
2014-04-10 17:10:11 -0700
Commit: 211f974, github.com/apache/spark/pull/246
[SQL] Improve column pruning in the optimizer.
Michael Armbrust <michael@databricks.com>
2014-04-10 16:20:33 -0700
Commit: 4843ef0, github.com/apache/spark/pull/378
Remove Unnecessary Whitespace's
Sandeep <sandeep@techaddict.me>
2014-04-10 15:04:13 -0700
Commit: 09bf14b, github.com/apache/spark/pull/380
Update tuning.md
Andrew Ash <andrew@andrewash.com>
2014-04-10 14:59:58 -0700
Commit: 4c9906d, github.com/apache/spark/pull/384
Revert "SPARK-1433: Upgrade Mesos dependency to 0.17.0"
Patrick Wendell <pwendell@gmail.com>
2014-04-10 14:43:29 -0700
Commit: 1e2cdbc
SPARK-1428: MLlib should convert non-float64 NumPy arrays to float64 instead of complaining
Sandeep <sandeep@techaddict.me>
2014-04-10 11:17:41 -0700
Commit: 2ac43ad, github.com/apache/spark/pull/356
[SPARK-1276] Add a HistoryServer to render persisted UI
Andrew Or <andrewor14@gmail.com>
2014-04-10 10:39:34 -0700
Commit: 9ae80bf, github.com/apache/spark/pull/204
Fix SPARK-1413: Parquet messes up stdout and stdin when used in Spark REPL
witgo <witgo@qq.com>
2014-04-10 10:35:24 -0700
Commit: a74fbbb, github.com/apache/spark/pull/325
Revert "SPARK-729: Closures not always serialized at capture time"
Patrick Wendell <pwendell@gmail.com>
2014-04-10 02:10:40 -0700
Commit: e6d4a74
SPARK-1446: Spark examples should not do a System.exit
Sandeep <sandeep@techaddict.me>
2014-04-10 00:37:21 -0700
Commit: e55cc4b, github.com/apache/spark/pull/370
SPARK-729: Closures not always serialized at capture time
William Benton <willb@redhat.com>
2014-04-09 18:56:27 -0700
Commit: 8ca3b2b, github.com/apache/spark/pull/189
[SPARK-1357 (fix)] remove empty line after :: DeveloperApi/Experimental ::
Xiangrui Meng <meng@databricks.com>
2014-04-09 17:08:17 -0700
Commit: 0adc932, github.com/apache/spark/pull/373
SPARK-1407 drain event queue before stopping event logger
Kan Zhang <kzhang@apache.org>
2014-04-09 15:24:33 -0700
Commit: eb5f2b6, github.com/apache/spark/pull/366
[SPARK-1357] [MLLIB] Annotate developer and experimental APIs
Xiangrui Meng <meng@databricks.com>
2014-04-09 02:21:15 -0700
Commit: bde9cc1, github.com/apache/spark/pull/298
SPARK-1093: Annotate developer and experimental API's
Patrick Wendell <pwendell@gmail.com>, Andrew Or <andrewor14@gmail.com>
2014-04-09 01:14:46 -0700
Commit: 87bd1f9, github.com/apache/spark/pull/274
[SPARK-1390] Refactoring of matrices backed by RDDs
Xiangrui Meng <meng@databricks.com>
2014-04-08 23:01:15 -0700
Commit: 9689b66, github.com/apache/spark/pull/296
Spark-939: allow user jars to take precedence over spark jars
Holden Karau <holden@pigscanfly.ca>
2014-04-08 22:29:21 -0700
Commit: fa0524f, github.com/apache/spark/pull/217
[SPARK-1434] [MLLIB] change labelParser from anonymous function to trait
Xiangrui Meng <meng@databricks.com>
2014-04-08 20:37:01 -0700
Commit: b9e0c93, github.com/apache/spark/pull/345
Spark 1271: Co-Group and Group-By should pass Iterable[X]
Holden Karau <holden@pigscanfly.ca>
2014-04-08 18:15:52 -0700
Commit: ce8ec54, github.com/apache/spark/pull/242
SPARK-1433: Upgrade Mesos dependency to 0.17.0
Sandeep <sandeep@techaddict.me>
2014-04-08 16:19:22 -0700
Commit: 12c077d, github.com/apache/spark/pull/355
[SPARK-1397] Notify SparkListeners when stages fail or are cancelled.
Kay Ousterhout <kayousterhout@gmail.com>
2014-04-08 14:42:02 -0700
Commit: fac6085, github.com/apache/spark/pull/309
SPARK-1445: compute-classpath should not print error if lib_managed not found
Aaron Davidson <aaron@databricks.com>
2014-04-08 14:40:20 -0700
Commit: e25b593, github.com/apache/spark/pull/361
SPARK-1348 binding Master, Worker, and App Web UI to all interfaces
Kan Zhang <kzhang@apache.org>
2014-04-08 14:30:24 -0700
Commit: a8d86b0, github.com/apache/spark/pull/318
Remove extra semicolon in import statement and unused import in ApplicationMaster
Henry Saputra <hsaputra@apache.org>
2014-04-08 14:23:16 -0700
Commit: 3bc0548, github.com/apache/spark/pull/358
[SPARK-1396] Properly cleanup DAGScheduler on job cancellation.
Kay Ousterhout <kayousterhout@gmail.com>
2014-04-08 01:03:33 -0700
Commit: 6dc5f58, github.com/apache/spark/pull/305
[SPARK-1331] Added graceful shutdown to Spark Streaming
Tathagata Das <tathagata.das1565@gmail.com>
2014-04-08 00:00:17 -0700
Commit: 83ac9a4, github.com/apache/spark/pull/247
[SPARK-1103] Automatic garbage collection of RDD, shuffle and broadcast data
Tathagata Das <tathagata.das1565@gmail.com>, Andrew Or <andrewor14@gmail.com>, Roman Pastukhov <ignatich@mail.ru>
2014-04-07 23:40:21 -0700
Commit: 11eabbe, github.com/apache/spark/pull/126
[SPARK-1402] Added 3 more compression schemes
Cheng Lian <lian.cs.zju@gmail.com>
2014-04-07 22:24:12 -0700
Commit: 0d0493f, github.com/apache/spark/pull/330
Change timestamp cast semantics. When cast to numeric types, return the unix time in seconds (instead of millis).
Reynold Xin <rxin@apache.org>
2014-04-07 19:28:24 -0700
Commit: f27e56a, github.com/apache/spark/pull/352
Added eval for Rand (without any support for user-defined seed).
Reynold Xin <rxin@apache.org>
2014-04-07 18:40:08 -0700
Commit: 31e6fff, github.com/apache/spark/pull/349
Removed the default eval implementation from Expression, and added a bunch of override's in classes I touched.
Reynold Xin <rxin@apache.org>
2014-04-07 18:39:18 -0700
Commit: 55dfd5d, github.com/apache/spark/pull/350
[sql] Rename execution/aggregates.scala Aggregate.scala, and added a bunch of private[this] to variables.
Reynold Xin <rxin@apache.org>
2014-04-07 18:38:44 -0700
Commit: 14c9238, github.com/apache/spark/pull/348
SPARK-1099: Introduce local[*] mode to infer number of cores
Aaron Davidson <aaron@databricks.com>
2014-04-07 13:06:30 -0700
Commit: 0307db0, github.com/apache/spark/pull/182
HOTFIX: Disable actor input stream test.
Patrick Wendell <pwendell@gmail.com>
2014-04-07 12:47:27 -0700
Commit: 2a2ca48, github.com/apache/spark/pull/347
SPARK-1252. On YARN, use container-log4j.properties for executors
Sandy Ryza <sandy@cloudera.com>
2014-04-07 13:28:14 -0500
Commit: 9dd8b91, github.com/apache/spark/pull/148
[sql] Rename Expression.apply to eval for better readability.
Reynold Xin <rxin@apache.org>
2014-04-07 10:45:31 -0700
Commit: 83f2a2f, github.com/apache/spark/pull/340
SPARK-1432: Make sure that all metadata fields are properly cleaned
Davis Shepherd <davis@conviva.com>
2014-04-07 10:02:00 -0700
Commit: a3c51c6, github.com/apache/spark/pull/338
[SQL] SPARK-1427 Fix toString for SchemaRDD NativeCommands.
Michael Armbrust <michael@databricks.com>
2014-04-07 01:46:50 -0700
Commit: b5bae84, github.com/apache/spark/pull/343
[SQL] SPARK-1371 Hash Aggregation Improvements
Michael Armbrust <michael@databricks.com>
2014-04-07 00:14:00 -0700
Commit: accd099, github.com/apache/spark/pull/295
SPARK-1431: Allow merging conflicting pull requests
Patrick Wendell <pwendell@gmail.com>
2014-04-06 21:04:45 -0700
Commit: 87d0928, github.com/apache/spark/pull/342
SPARK-1154: Clean up app folders in worker nodes
Evan Chan <ev@ooyala.com>, Kelvin Chu <kelvinkwchu@yahoo.com>
2014-04-06 19:17:33 -0700
Commit: 1440154, github.com/apache/spark/pull/288
SPARK-1314: Use SPARK_HIVE to determine if we include Hive in packaging
Aaron Davidson <aaron@databricks.com>
2014-04-06 17:48:41 -0700
Commit: 4106558, github.com/apache/spark/pull/237
SPARK-1349: spark-shell gets its own command history
Aaron Davidson <aaron@databricks.com>
2014-04-06 17:43:44 -0700
Commit: 7ce52c4, github.com/apache/spark/pull/267
SPARK-1387. Update build plugins, avoid plugin version warning, centralize versions
Sean Owen <sowen@cloudera.com>
2014-04-06 17:40:37 -0700
Commit: 856c50f, github.com/apache/spark/pull/291
[SPARK-1259] Make RDD locally iterable
Egor Pakhomov <pahomov.egor@gmail.com>
2014-04-06 16:41:23 -0700
Commit: e258e50, github.com/apache/spark/pull/156
Fix SPARK-1420 The maven build error for Spark Catalyst
witgo <witgo@qq.com>
2014-04-06 16:03:06 -0700
Commit: 7012ffa, github.com/apache/spark/pull/333
SPARK-1421. Make MLlib work on Python 2.6
Matei Zaharia <matei@databricks.com>
2014-04-05 20:52:05 -0700
Commit: 0b85516, github.com/apache/spark/pull/335
Fix for PR #195 for Java 6
Sean Owen <sowen@cloudera.com>
2014-04-05 19:08:24 -0700
Commit: 890d63b, github.com/apache/spark/pull/334
[SPARK-1371] fix computePreferredLocations signature to not depend on underlying implementation
Mridul Muralidharan <mridulm80@apache.org>
2014-04-05 15:23:37 -0700
Commit: 6e88583, github.com/apache/spark/pull/302
Remove the getStageInfo() method from SparkContext.
Kay Ousterhout <kayousterhout@gmail.com>
2014-04-05 15:17:50 -0700
Commit: 2d0150c, github.com/apache/spark/pull/308
HOTFIX for broken CI, by SPARK-1336
Prashant Sharma <prashant.s@imaginea.com>, Prashant Sharma <scrapcodes@gmail.com>
2014-04-04 22:49:19 -0700
Commit: 7c18428, github.com/apache/spark/pull/321
small fix ( proogram -> program )
Prabeesh K <prabsmails@gmail.com>
2014-04-04 21:32:00 -0700
Commit: 0acc7a0, github.com/apache/spark/pull/331
[SQL] SPARK-1366 Consistent sql function across different types of SQLContexts
Michael Armbrust <michael@databricks.com>
2014-04-04 21:15:33 -0700
Commit: 8de038e, github.com/apache/spark/pull/319
SPARK-1305: Support persisting RDD's directly to Tachyon
Haoyuan Li <haoyuan@cs.berkeley.edu>, RongGu <gurongwalker@gmail.com>
2014-04-04 20:36:24 -0700
Commit: b50ddfd, github.com/apache/spark/pull/158
[SPARK-1419] Bumped parent POM to apache 14
Mark Hamstra <markhamstra@gmail.com>
2014-04-04 19:19:48 -0700
Commit: 1347ebd, github.com/apache/spark/pull/328
Add test utility for generating Jar files with compiled classes.
Patrick Wendell <pwendell@gmail.com>
2014-04-04 19:15:15 -0700
Commit: 5f3c1bb, github.com/apache/spark/pull/326
SPARK-1414. Python API for SparkContext.wholeTextFiles
Matei Zaharia <matei@databricks.com>
2014-04-04 17:29:29 -0700
Commit: 60e18ce, github.com/apache/spark/pull/327
[SQL] Minor fixes.
Michael Armbrust <michael@databricks.com>
2014-04-04 17:23:17 -0700
Commit: d956cc2, github.com/apache/spark/pull/315
[SPARK-1198] Allow pipes tasks to run in different sub-directories
Thomas Graves <tgraves@apache.org>
2014-04-04 17:16:31 -0700
Commit: 198892f, github.com/apache/spark/pull/128
Don't create SparkContext in JobProgressListenerSuite.
Patrick Wendell <pwendell@gmail.com>
2014-04-04 14:46:32 -0700
Commit: a02b535, github.com/apache/spark/pull/324
SPARK-1375. Additional spark-submit cleanup
Sandy Ryza <sandy@cloudera.com>
2014-04-04 13:28:42 -0700
Commit: 16b8308, github.com/apache/spark/pull/278
[SPARK-1133] Add whole text files reader in MLlib
Xusen Yin <yinxusen@gmail.com>
2014-04-04 11:12:47 -0700
Commit: f1fa617, github.com/apache/spark/pull/252
SPARK-1404: Always upgrade spark-env.sh vars to environment vars
Aaron Davidson <aaron@databricks.com>
2014-04-04 09:50:24 -0700
Commit: 01cf4c4, github.com/apache/spark/pull/310
SPARK-1350. Always use JAVA_HOME to run executor container JVMs.
Sandy Ryza <sandy@cloudera.com>
2014-04-04 08:54:04 -0500
Commit: 7f32fd4, github.com/apache/spark/pull/313
SPARK-1337: Application web UI garbage collects newest stages
Patrick Wendell <pwendell@gmail.com>
2014-04-03 22:13:56 -0700
Commit: ee6e9e7, github.com/apache/spark/pull/320
Revert "[SPARK-1398] Removed findbugs jsr305 dependency"
Patrick Wendell <pwendell@gmail.com>
2014-04-03 17:00:06 -0700
Commit: 33e6361
Fix jenkins from giving the green light to builds that don't compile.
Michael Armbrust <michael@databricks.com>
2014-04-03 16:53:35 -0700
Commit: 9231b01, github.com/apache/spark/pull/317
[BUILD FIX] Fix compilation of Spark SQL Java API.
Michael Armbrust <michael@databricks.com>
2014-04-03 16:12:08 -0700
Commit: d94826b, github.com/apache/spark/pull/316
[SPARK-1134] Fix and document passing of arguments to IPython
Diana Carroll <dcarroll@cloudera.com>
2014-04-03 15:48:42 -0700
Commit: a599e43, github.com/apache/spark/pull/294
[SQL] SPARK-1333 First draft of java API
Michael Armbrust <michael@databricks.com>
2014-04-03 15:45:34 -0700
Commit: b8f5341, github.com/apache/spark/pull/248
Spark 1162 Implemented takeOrdered in pyspark.
Prashant Sharma <prashant.s@imaginea.com>
2014-04-03 15:42:17 -0700
Commit: c1ea3af, github.com/apache/spark/pull/97
[SPARK-1360] Add Timestamp Support for SQL
Cheng Hao <hao.cheng@intel.com>
2014-04-03 15:33:17 -0700
Commit: 5d1feda, github.com/apache/spark/pull/275
Spark parquet improvements
Andre Schumacher <andre.schumacher@iki.fi>
2014-04-03 15:31:47 -0700
Commit: fbebaed, github.com/apache/spark/pull/195
[SPARK-1398] Removed findbugs jsr305 dependency
Mark Hamstra <markhamstra@gmail.com>
2014-04-03 14:08:47 -0700
Commit: 92a86b2, github.com/apache/spark/pull/307
[SQL] SPARK-1364 Improve datatype and test coverage for ScalaReflection schema inference.
Michael Armbrust <michael@databricks.com>
2014-04-02 18:14:31 -0700
Commit: 47ebea5, github.com/apache/spark/pull/293
[SPARK-1212, Part II] Support sparse data in MLlib
Xiangrui Meng <meng@databricks.com>
2014-04-02 14:01:12 -0700
Commit: 9c65fa7, github.com/apache/spark/pull/245
StopAfter / TopK related changes
Reynold Xin <rxin@apache.org>, Michael Armbrust <michael@databricks.com>
2014-04-02 12:48:04 -0700
Commit: ed730c9, github.com/apache/spark/pull/233
[SPARK-1371][WIP] Compression support for Spark SQL in-memory columnar storage
Cheng Lian <lian.cs.zju@gmail.com>
2014-04-02 12:47:22 -0700
Commit: 1faa579, github.com/apache/spark/pull/285
Do not re-use objects in the EdgePartition/EdgeTriplet iterators.
Daniel Darabos <darabos.daniel@gmail.com>
2014-04-02 12:27:37 -0700
Commit: 7823633, github.com/apache/spark/pull/276
[SPARK-1385] Use existing code for JSON de/serialization of BlockId
Andrew Or <andrewor14@gmail.com>
2014-04-02 10:43:09 -0700
Commit: de8eefa, github.com/apache/spark/pull/289
Renamed stageIdToActiveJob to jobIdToActiveJob.
Kay Ousterhout <kayousterhout@gmail.com>
2014-04-02 10:35:52 -0700
Commit: 11973a7, github.com/apache/spark/pull/301
Remove * from test case golden filename.
Michael Armbrust <michael@databricks.com>
2014-04-01 23:54:38 -0700
Commit: ea9de65, github.com/apache/spark/pull/297
MLI-1 Decision Trees
Manish Amde <manish9ue@gmail.com>, manishamde <manish9ue@gmail.com>, Xiangrui Meng <meng@databricks.com>
2014-04-01 21:40:49 -0700
Commit: 8b3045c, github.com/apache/spark/pull/79
Revert "[Spark-1134] only call ipython if no arguments are given; remove IPYTHONOPTS from call"
Matei Zaharia <matei@databricks.com>
2014-04-01 19:31:50 -0700
Commit: 45df912
[Spark-1134] only call ipython if no arguments are given; remove IPYTHONOPTS from call
Diana Carroll <dcarroll@cloudera.com>
2014-04-01 19:29:26 -0700
Commit: afb5ea6, github.com/apache/spark/pull/227
[SPARK-1342] Scala 2.10.4
Mark Hamstra <markhamstra@gmail.com>
2014-04-01 18:35:50 -0700
Commit: 764353d, github.com/apache/spark/pull/259
[SQL] SPARK-1372 Support for caching and uncaching tables in a SQLContext.
Michael Armbrust <michael@databricks.com>
2014-04-01 14:45:44 -0700
Commit: f5c418d, github.com/apache/spark/pull/282
[Hot Fix #42] Persisted RDD disappears on storage page if re-used
Andrew Or <andrewor14@gmail.com>
2014-03-31 23:01:14 -0700
Commit: ada310a, github.com/apache/spark/pull/281
[SPARK-1377] Upgrade Jetty to 8.1.14v20131031
Andrew Or <andrewor14@gmail.com>
2014-03-31 21:42:36 -0700
Commit: 94fe7fd, github.com/apache/spark/pull/280
SPARK-1376. In the yarn-cluster submitter, rename "args" option to "arg"
Sandy Ryza <sandy@cloudera.com>
2014-04-01 08:26:31 +0530
Commit: 564f1c1, github.com/apache/spark/pull/279
SPARK-1365 [HOTFIX] Fix RateLimitedOutputStream test
Patrick Wendell <pwendell@gmail.com>
2014-03-31 16:25:43 -0700
Commit: 33b3c2a, github.com/apache/spark/pull/277
[SQL] Rewrite join implementation to allow streaming of one relation.
Michael Armbrust <michael@databricks.com>
2014-03-31 15:23:46 -0700
Commit: 5731af5, github.com/apache/spark/pull/250
SPARK-1352: Improve robustness of spark-submit script
Patrick Wendell <pwendell@gmail.com>
2014-03-31 12:07:14 -0700
Commit: 841721e, github.com/apache/spark/pull/271
SPARK-1352 - Comment style single space before ending */ check.
Prashant Sharma <prashant.s@imaginea.com>
2014-03-30 10:06:56 -0700
Commit: d666053, github.com/apache/spark/pull/261
[SPARK-1354][SQL] Add tableName as a qualifier for SimpleCatelogy
jerryshao <saisai.shao@intel.com>
2014-03-30 10:03:58 -0700
Commit: 95d7d2a, github.com/apache/spark/pull/272
SPARK-1336 Reducing the output of run-tests script.
Prashant Sharma <prashant.s@imaginea.com>, Prashant Sharma <scrapcodes@gmail.com>
2014-03-29 23:03:03 -0700
Commit: df1b9f7, github.com/apache/spark/pull/262
[SQL] SPARK-1354 Fix self-joins of parquet relations
Michael Armbrust <michael@databricks.com>
2014-03-29 22:02:53 -0700
Commit: 2861b07, github.com/apache/spark/pull/269
Don't swallow all kryo errors, only those that indicate we are out of data.
Michael Armbrust <michael@databricks.com>
2014-03-29 22:01:29 -0700
Commit: 92b8395, github.com/apache/spark/pull/142
[SPARK-1186] : Enrich the Spark Shell to support additional arguments.
Bernardo Gomez Palacio <bernardo.gomezpalacio@gmail.com>
2014-03-29 19:49:22 -0700
Commit: fda86d8, github.com/apache/spark/pull/116
Implement the RLike & Like in catalyst
Cheng Hao <hao.cheng@intel.com>
2014-03-29 15:12:43 -0700
Commit: af3746c, github.com/apache/spark/pull/224
SPARK-1126. spark-app preliminary
Sandy Ryza <sandy@cloudera.com>
2014-03-29 14:41:36 -0700
Commit: 1617816, github.com/apache/spark/pull/86
SPARK-1345 adding missing dependency on avro for hadoop 0.23 to the new ...
Thomas Graves <tgraves@apache.org>
2014-03-28 23:09:29 -0700
Commit: 3738f24, github.com/apache/spark/pull/263
fix path for jar, make sed actually work on OSX
Nick Lanham <nick@afternight.org>
2014-03-28 13:33:35 -0700
Commit: 75d46be, github.com/apache/spark/pull/264
SPARK-1096, a space after comment start style checker.
Prashant Sharma <prashant.s@imaginea.com>
2014-03-28 00:21:49 -0700
Commit: 60abc25, github.com/apache/spark/pull/124
Make sed do -i '' on OSX
Nick Lanham <nick@afternight.org>
2014-03-27 22:45:00 -0700
Commit: 632c322, github.com/apache/spark/pull/258
[SPARK-1210] Prevent ContextClassLoader of Actor from becoming ClassLoader of Executo...
Takuya UESHIN <ueshin@happy-camper.st>
2014-03-27 22:17:15 -0700
Commit: 3d89043, github.com/apache/spark/pull/15
[SPARK-1268] Adding XOR and AND-NOT operations to spark.util.collection.BitSet
Petko Nikolov <nikolov@soundcloud.com>
2014-03-27 15:49:07 -0700
Commit: 6f986f0, github.com/apache/spark/pull/172
SPARK-1335. Also increase perm gen / code cache for scalatest when invoked via Maven build
Sean Owen <sowen@cloudera.com>
2014-03-27 11:49:11 -0700
Commit: 53953d0, github.com/apache/spark/pull/253
SPARK-1330 removed extra echo from comput_classpath.sh
Thomas Graves <tgraves@apache.org>
2014-03-27 11:54:43 -0500
Commit: 426042a, github.com/apache/spark/pull/241
Cut down the granularity of travis tests.
Michael Armbrust <michael@databricks.com>
2014-03-27 08:53:42 -0700
Commit: 5b2d863, github.com/apache/spark/pull/255
[SPARK-1327] GLM needs to check addIntercept for intercept and weights
Xiangrui Meng <meng@databricks.com>
2014-03-26 19:30:20 -0700
Commit: d679843, github.com/apache/spark/pull/236
SPARK-1325. The maven build error for Spark Tools
Sean Owen <sowen@cloudera.com>, witgo <witgo@qq.com>
2014-03-26 18:31:52 -0700
Commit: 1fa48d9, github.com/apache/spark/pull/240
Spark 1095 : Adding explicit return types to all public methods
NirmalReddy <nirmal_reddy2000@yahoo.com>, NirmalReddy <nirmal.reddy@imaginea.com>
2014-03-26 18:24:55 -0700
Commit: 3e63d98, github.com/apache/spark/pull/168
SPARK-1324: SparkUI Should Not Bind to SPARK_PUBLIC_DNS
Patrick Wendell <pwendell@gmail.com>
2014-03-26 18:22:15 -0700
Commit: be6d96c, github.com/apache/spark/pull/231
[SQL] Add a custom serializer for maps since they do not have a no-arg constructor.
Michael Armbrust <michael@databricks.com>
2014-03-26 18:19:49 -0700
Commit: e15e574, github.com/apache/spark/pull/243
[SQL] Un-ignore a test that is now passing.
Michael Armbrust <michael@databricks.com>
2014-03-26 18:19:15 -0700
Commit: 32cbdfd, github.com/apache/spark/pull/244
Unified package definition format in Spark SQL
Cheng Lian <lian.cs.zju@gmail.com>
2014-03-26 15:36:18 -0700
Commit: 345825d, github.com/apache/spark/pull/225
SPARK-1322, top in pyspark should sort result in descending order.
Prashant Sharma <prashant.s@imaginea.com>
2014-03-26 09:16:37 -0700
Commit: a0853a3, github.com/apache/spark/pull/235
SPARK-1321 Use Guava's top k implementation rather than our BoundedPriorityQueue based implementation
Reynold Xin <rxin@apache.org>
2014-03-26 00:09:44 -0700
Commit: b859853, github.com/apache/spark/pull/229
Initial experimentation with Travis CI configuration
Michael Armbrust <michael@databricks.com>
2014-03-25 19:01:18 -0700
Commit: 4f7d547, github.com/apache/spark/pull/230
Avoid Option while generating call site
witgo <witgo@qq.com>, Aaron Davidson <aaron@databricks.com>
2014-03-25 13:28:13 -0700
Commit: 8237df8, github.com/apache/spark/pull/222
SPARK-1319: Fix scheduler to account for tasks using > 1 CPUs.
Shivaram Venkataraman <shivaram@eecs.berkeley.edu>
2014-03-25 13:05:30 -0700
Commit: f8111ea, github.com/apache/spark/pull/219
SPARK-1316. Remove use of Commons IO
Sean Owen <sowen@cloudera.com>
2014-03-25 10:21:25 -0700
Commit: 71d4ed2, github.com/apache/spark/pull/226
Add more hive compatability tests to whitelist
Michael Armbrust <michael@databricks.com>
2014-03-25 09:57:26 -0700
Commit: 134ace7, github.com/apache/spark/pull/220
SPARK-1286: Make usage of spark-env.sh idempotent
Aaron Davidson <aaron@databricks.com>
2014-03-24 22:24:21 -0700
Commit: 007a733, github.com/apache/spark/pull/184
Unify the logic for column pruning, projection, and filtering of table scans.
Michael Armbrust <michael@databricks.com>
2014-03-24 22:15:51 -0700
Commit: b637f2d, github.com/apache/spark/pull/213
SPARK-1128: set hadoop task properties when constructing HadoopRDD
CodingCat <zhunansjtu@gmail.com>, Nan Zhu <CodingCat@users.noreply.github.com>
2014-03-24 21:55:03 -0700
Commit: 5140598, github.com/apache/spark/pull/101
SPARK-1094 Support MiMa for reporting binary compatibility accross versions.
Patrick Wendell <pwendell@gmail.com>, Prashant Sharma <prashant.s@imaginea.com>, Prashant Sharma <scrapcodes@gmail.com>
2014-03-24 21:20:23 -0700
Commit: dc126f2, github.com/apache/spark/pull/207
SPARK-1294 Fix resolution of uppercase field names using a HiveContext.
Michael Armbrust <michael@databricks.com>
2014-03-24 19:24:22 -0700
Commit: 8043b7b, github.com/apache/spark/pull/202
HOT FIX: Exclude test files from RAT
Patrick Wendell <pwendell@gmail.com>
2014-03-24 13:38:07 -0700
Commit: 56db8a2
SPARK-1144 Added license and RAT to check licenses.
Prashant Sharma <prashant.s@imaginea.com>
2014-03-24 08:44:12 -0700
Commit: 21109fb, github.com/apache/spark/pull/125
[SPARK-1212] Adding sparse data support and update KMeans
Xiangrui Meng <meng@databricks.com>
2014-03-23 17:34:02 -0700
Commit: 80c2968, github.com/apache/spark/pull/117
Fixed coding style issues in Spark SQL
Cheng Lian <lian.cs.zju@gmail.com>
2014-03-23 15:21:40 -0700
Commit: 8265dc7, github.com/apache/spark/pull/208
[SPARK-1292] In-memory columnar representation for Spark SQL
Cheng Lian <lian@databricks.com>, Cheng Lian <lian.cs.zju@gmail.com>
2014-03-23 12:08:55 -0700
Commit: 57a4379, github.com/apache/spark/pull/205
SPARK-1254. Supplemental fix for HTTPS on Maven Central
Sean Owen <sowen@cloudera.com>
2014-03-23 10:57:01 -0700
Commit: abf6714, github.com/apache/spark/pull/209
Fix to Stage UI to display numbers on progress bar
Emtiaz Ahmed <emtiazahmed@gmail.com>
2014-03-21 18:05:53 -0700
Commit: 646e554, github.com/apache/spark/pull/201
Add asCode function for dumping raw tree representations.
Michael Armbrust <michael@databricks.com>
2014-03-21 16:54:06 -0700
Commit: d780983, github.com/apache/spark/pull/200
Make SQL keywords case-insensitive
Matei Zaharia <matei@databricks.com>
2014-03-21 16:53:18 -0700
Commit: dab5439, github.com/apache/spark/pull/193
SPARK-1279: Fix improper use of SimpleDateFormat
zsxwing <zsxwing@gmail.com>
2014-03-21 16:07:22 -0700
Commit: 2c0aa22, github.com/apache/spark/pull/179
Add hive test files to repository. Remove download script.
Michael Armbrust <michael@databricks.com>
2014-03-21 15:05:45 -0700
Commit: 7e17fe6, github.com/apache/spark/pull/199
Fix maven jenkins: Add explicit init for required tables in SQLQuerySuite
Michael Armbrust <michael@databricks.com>
2014-03-20 22:31:11 -0700
Commit: e09139d, github.com/apache/spark/pull/191
SPARK-1251 Support for optimizing and executing structured queries
Michael Armbrust <michael@databricks.com>, Yin Huai <huaiyin.thu@gmail.com>, Reynold Xin <rxin@apache.org>, Lian, Cheng <rhythm.mail@gmail.com>, Andre Schumacher <andre.schumacher@iki.fi>, Yin Huai <huai@cse.ohio-state.edu>, Timothy Chen <tnachen@gmail.com>, Cheng Lian <lian.cs.zju@gmail.com>, Timothy Chen <tnachen@apache.org>, Henry Cook <henry.m.cook+github@gmail.com>, Mark Hamstra <markhamstra@gmail.com>
2014-03-20 18:03:20 -0700
Commit: 9aadcff, github.com/apache/spark/pull/146
[Hot Fix #42] Do not stop SparkUI if bind() is not called
Andrew Or <andrewor14@gmail.com>
2014-03-20 14:13:16 -0700
Commit: ca76423, github.com/apache/spark/pull/188
Principal Component Analysis
Reza Zadeh <rizlar@gmail.com>
2014-03-20 10:39:20 -0700
Commit: 66a03e5, github.com/apache/spark/pull/88
Revert "SPARK-1099:Spark's local mode should probably respect spark.cores.max by default"
Aaron Davidson <aaron@databricks.com>
2014-03-19 17:56:48 -0700
Commit: ffe272d
SPARK-1099:Spark's local mode should probably respect spark.cores.max by default
qqsun8819 <jin.oyj@alibaba-inc.com>
2014-03-19 16:33:54 -0700
Commit: 1678931, github.com/apache/spark/pull/110
Added doctest for map function in rdd.py
Jyotiska NK <jyotiska123@gmail.com>
2014-03-19 14:04:45 -0700
Commit: 67fa71c, github.com/apache/spark/pull/177
[SPARK-1132] Persisting Web UI through refactoring the SparkListener interface
Andrew Or <andrewor14@gmail.com>, andrewor14 <andrewor14@gmail.com>
2014-03-19 13:17:01 -0700
Commit: 79d07d6, github.com/apache/spark/pull/42
Bugfixes/improvements to scheduler
Mridul Muralidharan <mridul@gmail.com>
2014-03-19 12:46:55 -0700
Commit: ab747d3, github.com/apache/spark/pull/159
SPARK-1203 fix saving to hdfs from yarn
Thomas Graves <tgraves@apache.org>
2014-03-19 08:09:20 -0500
Commit: 6112270, github.com/apache/spark/pull/173
bugfix: Wrong "Duration" in "Active Stages" in stages page
shiyun.wxm <shiyun.wxm@taobao.com>
2014-03-19 01:42:34 -0700
Commit: d55ec86, github.com/apache/spark/pull/170
Bundle tachyon: SPARK-1269
Nick Lanham <nick@afternight.org>
2014-03-18 22:04:57 -0700
Commit: a18ea00, github.com/apache/spark/pull/137
Fix SPARK-1256: Master web UI and Worker web UI returns a 404 error
witgo <witgo@qq.com>
2014-03-18 21:57:47 -0700
Commit: cc2655a, github.com/apache/spark/pull/150
[SPARK-1266] persist factors in implicit ALS
Xiangrui Meng <meng@databricks.com>
2014-03-18 17:20:42 -0700
Commit: f9d8a83, github.com/apache/spark/pull/165
[SPARK-1260]: faster construction of features with intercept
Xiangrui Meng <meng@databricks.com>
2014-03-18 15:14:13 -0700
Commit: e108b9a, github.com/apache/spark/pull/161
Update copyright year in NOTICE to 2014
Matei Zaharia <matei@databricks.com>
2014-03-18 14:34:31 -0700
Commit: 79e547f, github.com/apache/spark/pull/174
SPARK-1102: Create a saveAsNewAPIHadoopDataset method
CodingCat <zhunansjtu@gmail.com>
2014-03-18 11:06:18 -0700
Commit: 2fa26ec, github.com/apache/spark/pull/12
Revert "SPARK-1236 - Upgrade Jetty to 9.1.3.v20140225."
Patrick Wendell <pwendell@gmail.com>
2014-03-18 00:46:03 -0700
Commit: e7423d4, github.com/apache/spark/pull/167
Spark 1246 add min max to stat counter
Dan McClary <dan.mcclary@gmail.com>
2014-03-18 00:45:47 -0700
Commit: e3681f2, github.com/apache/spark/pull/144
[Spark-1261] add instructions for running python examples to doc overview page
Diana Carroll <dcarroll@cloudera.com>
2014-03-17 17:35:51 -0700
Commit: 087eedc, github.com/apache/spark/pull/162
SPARK-1244: Throw exception if map output status exceeds frame size
Patrick Wendell <pwendell@gmail.com>, Andrew Or <andrewor14@gmail.com>
2014-03-17 14:03:32 -0700
Commit: 796977a, github.com/apache/spark/pull/152
SPARK-1240: handle the case of empty RDD when takeSample
CodingCat <zhunansjtu@gmail.com>
2014-03-16 22:14:59 -0700
Commit: dc96546, github.com/apache/spark/pull/135
SPARK-1255: Allow user to pass Serializer object instead of class name for shuffle.
Reynold Xin <rxin@apache.org>
2014-03-16 09:57:21 -0700
Commit: f5486e9, github.com/apache/spark/pull/149
SPARK-1254. Consolidate, order, and harmonize repository declarations in Maven/SBT builds
Sean Owen <sowen@cloudera.com>
2014-03-15 16:43:27 -0700
Commit: 97e4459, github.com/apache/spark/pull/145
Fix serialization of MutablePair. Also provide an interface for easy updating.
Michael Armbrust <michael@databricks.com>
2014-03-14 11:40:26 -0700
Commit: e19044c, github.com/apache/spark/pull/141
[bugfix] wrong client arg, should use executor-cores
Tianshuo Deng <tdeng@twitter.com>
2014-03-13 20:27:36 -0700
Commit: 181b130, github.com/apache/spark/pull/138
SPARK-1236 - Upgrade Jetty to 9.1.3.v20140225.
Reynold Xin <rxin@apache.org>
2014-03-13 12:16:04 -0700
Commit: ca4bf8c, github.com/apache/spark/pull/113
SPARK-1183. Don't use "worker" to mean executor
Sandy Ryza <sandy@cloudera.com>
2014-03-13 12:11:33 -0700
Commit: 6983732, github.com/apache/spark/pull/120
[SPARK-1237, 1238] Improve the computation of YtY for implicit ALS
Xiangrui Meng <meng@databricks.com>
2014-03-13 00:43:19 -0700
Commit: e4e8d8f, github.com/apache/spark/pull/131
SPARK-1019: pyspark RDD take() throws an NPE
Patrick Wendell <pwendell@gmail.com>
2014-03-12 23:16:59 -0700
Commit: 4ea23db, github.com/apache/spark/pull/112
hot fix for PR105 - change to Java annotation
CodingCat <zhunansjtu@gmail.com>
2014-03-12 19:49:18 -0700
Commit: 6bd2eaa, github.com/apache/spark/pull/133
Fix example bug: compile error
jianghan <jianghan@xiaomi.com>
2014-03-12 19:46:12 -0700
Commit: 31a7040, github.com/apache/spark/pull/132
SPARK-1160: Deprecate toArray in RDD
CodingCat <zhunansjtu@gmail.com>
2014-03-12 17:43:12 -0700
Commit: 9032f7c, github.com/apache/spark/pull/105
SPARK-1162 Added top in python.
Prashant Sharma <prashant.s@imaginea.com>
2014-03-12 15:57:44 -0700
Commit: b8afe30, github.com/apache/spark/pull/93
Fix #SPARK-1149 Bad partitioners can cause Spark to hang
liguoqiang <liguoqiang@rd.tuan800.com>
2014-03-12 12:59:51 -0700
Commit: 5d1ec64, github.com/apache/spark/pull/44
[SPARK-1233] Fix running hadoop 0.23 due to java.lang.NoSuchFieldException: DEFAULT_M...
Thomas Graves <tgraves@apache.org>
2014-03-12 11:25:41 -0700
Commit: b5162f4, github.com/apache/spark/pull/129
[SPARK-1232] Fix the hadoop 0.23 yarn build
Thomas Graves <tgraves@apache.org>
2014-03-12 10:32:01 -0700
Commit: c8c59b3, github.com/apache/spark/pull/127
Spark-1163, Added missing Python RDD functions
prabinb <prabin.banka@imaginea.com>
2014-03-11 23:57:05 -0700
Commit: af7f2f1, github.com/apache/spark/pull/92
SPARK-1064
Sandy Ryza <sandy@cloudera.com>
2014-03-11 22:39:17 -0700
Commit: 2409af9, github.com/apache/spark/pull/102
SPARK-1167: Remove metrics-ganglia from default build due to LGPL issues...
Patrick Wendell <pwendell@gmail.com>
2014-03-11 11:16:59 -0700
Commit: 16788a6, github.com/apache/spark/pull/108
SPARK-1211. In ApplicationMaster, set spark.master system property to "y...
Sandy Ryza <sandy@cloudera.com>
2014-03-10 17:42:33 -0700
Commit: 2a2c964, github.com/apache/spark/pull/118
SPARK-1205: Clean up callSite/origin/generator.
Patrick Wendell <pwendell@gmail.com>
2014-03-10 16:28:41 -0700
Commit: 2a51617, github.com/apache/spark/pull/106
SPARK-1168, Added foldByKey to pyspark.
Prashant Sharma <prashant.s@imaginea.com>
2014-03-10 13:37:11 -0700
Commit: a59419c, github.com/apache/spark/pull/115
[SPARK-972] Added detailed callsite info for ValueError in context.py (resubmitted)
jyotiska <jyotiska123@gmail.com>
2014-03-10 13:34:49 -0700
Commit: f551898, github.com/apache/spark/pull/34
SPARK-977 Added Python RDD.zip function
Prabin Banka <prabin.banka@imaginea.com>
2014-03-10 13:27:00 -0700
Commit: e1e09e0, github.com/apache/spark/pull/76
maintain arbitrary state data for each key
Chen Chao <crazyjvm@gmail.com>
2014-03-09 22:42:12 -0700
Commit: 5d98cfc, github.com/apache/spark/pull/114
SPARK-782 Clean up for ASM dependency.
Patrick Wendell <pwendell@gmail.com>
2014-03-09 13:17:07 -0700
Commit: b9be160, github.com/apache/spark/pull/100
Fix markup errors introduced in #33 (SPARK-1189)
Patrick Wendell <pwendell@gmail.com>
2014-03-09 11:57:06 -0700
Commit: faf4cad, github.com/apache/spark/pull/111
Add timeout for fetch file
Jiacheng Guo <guojc03@gmail.com>
2014-03-09 11:37:44 -0700
Commit: f6f9d02, github.com/apache/spark/pull/98
SPARK-929: Fully deprecate usage of SPARK_MEM
Aaron Davidson <aaron@databricks.com>
2014-03-09 11:08:39 -0700
Commit: 52834d7, github.com/apache/spark/pull/99
SPARK-1190: Do not initialize log4j if slf4j log4j backend is not being used
Patrick Wendell <pwendell@gmail.com>
2014-03-08 16:02:42 -0800
Commit: e59a3b6, github.com/apache/spark/pull/107
Update junitxml plugin to the latest version to avoid recompilation in every SBT command.
Reynold Xin <rxin@apache.org>
2014-03-08 12:40:26 -0800
Commit: c2834ec, github.com/apache/spark/pull/104
[SPARK-1194] Fix the same-RDD rule for cache replacement
Cheng Lian <lian.cs.zju@gmail.com>
2014-03-07 23:26:46 -0800
Commit: 0b7b7fd, github.com/apache/spark/pull/96
Allow sbt to use more than 1G of heap.
Reynold Xin <rxin@apache.org>
2014-03-07 23:23:59 -0800
Commit: 8ad486a, github.com/apache/spark/pull/103
SPARK-1193. Fix indentation in pom.xmls
Sandy Ryza <sandy@cloudera.com>
2014-03-07 23:10:35 -0800
Commit: a99fb37, github.com/apache/spark/pull/91
Spark 1165 rdd.intersection in python and java
Prashant Sharma <prashant.s@imaginea.com>, Prashant Sharma <scrapcodes@gmail.com>
2014-03-07 18:48:07 -0800
Commit: 6e730ed, github.com/apache/spark/pull/80
SPARK-1195: set map_input_file environment variable in PipedRDD
Thomas Graves <tgraves@apache.org>
2014-03-07 10:36:55 -0800
Commit: b7cd9e9, github.com/apache/spark/pull/94
SPARK-1136: Fix FaultToleranceTest for Docker 0.8.1
Aaron Davidson <aaron@databricks.com>
2014-03-07 10:22:27 -0800
Commit: dabeb6f, github.com/apache/spark/pull/5
Small clean-up to flatmap tests
Patrick Wendell <pwendell@gmail.com>
2014-03-06 17:57:31 -0800
Commit: 33baf14
Example for cassandra CQL read/write from spark
anitatailor <tailor.anita@gmail.com>
2014-03-06 17:46:43 -0800
Commit: 9ae919c, github.com/apache/spark/pull/87
SPARK-1197. Change yarn-standalone to yarn-cluster and fix up running on YARN docs
Sandy Ryza <sandy@cloudera.com>
2014-03-06 17:12:58 -0800
Commit: 328c73d, github.com/apache/spark/pull/95
SPARK-1189: Add Security to Spark - Akka, Http, ConnectionManager, UI use servlets
Thomas Graves <tgraves@apache.org>
2014-03-06 18:27:50 -0600
Commit: 7edbea4, github.com/apache/spark/pull/33
SPARK-942: Do not materialize partitions when DISK_ONLY storage level is used
Kyle Ellrott <kellrott@gmail.com>
2014-03-06 14:51:00 -0800
Commit: 40566e1, github.com/apache/spark/pull/50
SPARK-1187, Added missing Python APIs
Prabin Banka <prabin.banka@imaginea.com>
2014-03-06 12:45:27 -0800
Commit: 3d3acef, github.com/apache/spark/pull/75
SPARK-1156: allow user to login into a cluster without slaves
CodingCat <zhunansjtu@gmail.com>
2014-03-05 21:47:34 -0800
Commit: 3eb009f, github.com/apache/spark/pull/58
SPARK-1184: Update the distribution tar.gz to include spark-assembly jar
Mark Grover <mark@apache.org>
2014-03-05 16:52:58 -0800
Commit: cda381f, github.com/apache/spark/pull/78
Improve building with maven docs
liguoqiang <liguoqiang@rd.tuan800.com>
2014-03-05 16:38:43 -0800
Commit: 51ca7bd, github.com/apache/spark/pull/70
SPARK-1171: when executor is removed, we should minus totalCores instead of just freeCores on that executor
CodingCat <zhunansjtu@gmail.com>, Nan Zhu <CodingCat@users.noreply.github.com>
2014-03-05 14:00:28 -0800
Commit: a3da508, github.com/apache/spark/pull/63
SPARK-1109 wrong API docs for pyspark map function
Prashant Sharma <prashant.s@imaginea.com>
2014-03-04 15:32:43 -0800
Commit: 0283665, github.com/apache/spark/pull/73
SPARK-1178: missing document of spark.scheduler.revive.interval
CodingCat <zhunansjtu@gmail.com>
2014-03-04 10:28:17 -0800
Commit: 1865dd6, github.com/apache/spark/pull/74
SPARK-1164 Deprecated reduceByKeyToDriver as it is an alias for reduceByKeyLocally
Prashant Sharma <prashant.s@imaginea.com>
2014-03-04 10:27:02 -0800
Commit: 2d8e0a0, github.com/apache/spark/pull/72
[java8API] SPARK-964 Investigate the potential for using JDK 8 lambda expressions for the Java/Scala APIs
Prashant Sharma <prashant.s@imaginea.com>, Patrick Wendell <pwendell@gmail.com>
2014-03-03 22:31:30 -0800
Commit: 181ec50, github.com/apache/spark/pull/17
Remove broken/unused Connection.getChunkFIFO method.
Kay Ousterhout <kayousterhout@gmail.com>
2014-03-03 21:27:18 -0800
Commit: b14ede7, github.com/apache/spark/pull/69
SPARK-1158: Fix flaky RateLimitedOutputStreamSuite.
Reynold Xin <rxin@apache.org>
2014-03-03 21:24:19 -0800
Commit: f5ae38a, github.com/apache/spark/pull/55
Added a unit test for PairRDDFunctions.lookup
Bryn Keller <bryn.keller@intel.com>
2014-03-03 16:38:57 -0800
Commit: 923dba5, github.com/apache/spark/pull/36
Remove the remoteFetchTime metric.
Kay Ousterhout <kayousterhout@gmail.com>
2014-03-03 16:12:00 -0800
Commit: b55cade, github.com/apache/spark/pull/62
update proportion of memory
Chen Chao <crazyjvm@gmail.com>
2014-03-03 14:41:25 -0800
Commit: 9d225a9, github.com/apache/spark/pull/66
Removed accidentally checked in comment
Kay Ousterhout <kayousterhout@gmail.com>
2014-03-03 14:39:49 -0800
Commit: 369aad6, github.com/apache/spark/pull/61
SPARK-1173. (#2) Fix typo in Java streaming example.
Aaron Kimball <aaron@magnify.io>
2014-03-02 23:48:48 -0800
Commit: f65c1f3, github.com/apache/spark/pull/65
SPARK-1173. Improve scala streaming docs.
Aaron Kimball <aaron@magnify.io>
2014-03-02 23:26:47 -0800
Commit: 2b53447, github.com/apache/spark/pull/64
Add Jekyll tag to isolate "production-only" doc components.
Patrick Wendell <pwendell@gmail.com>
2014-03-02 18:19:01 -0800
Commit: 55a4f11, github.com/apache/spark/pull/56
SPARK-1121: Include avro for yarn-alpha builds
Patrick Wendell <pwendell@gmail.com>
2014-03-02 15:18:19 -0800
Commit: c3f5e07, github.com/apache/spark/pull/49
SPARK-1084.2 (resubmitted)
Sean Owen <sowen@cloudera.com>
2014-03-02 14:27:53 -0800
Commit: fd31adb, github.com/apache/spark/pull/32
Ignore RateLimitedOutputStreamSuite for now.
Reynold Xin <rxin@apache.org>
2014-03-02 14:27:19 -0800
Commit: 353ac6b, github.com/apache/spark/pull/54
SPARK-1137: Make ZK PersistenceEngine not crash for wrong serialVersionUID
Aaron Davidson <aaron@databricks.com>
2014-03-02 01:00:42 -0800
Commit: 46bcb95, github.com/apache/spark/pull/4
Remove remaining references to incubation
Patrick Wendell <pwendell@gmail.com>
2014-03-02 01:00:16 -0800
Commit: 1fd2bfd, github.com/apache/spark/pull/51
Update io.netty from 4.0.13 Final to 4.0.17.Final
Binh Nguyen <ngbinh@gmail.com>, Binh Nguyen <ngbinh@gmail.com>
2014-03-02 00:48:50 -0800
Commit: b70823c, github.com/apache/spark/pull/41
Merge the old sbt-launch-lib.bash with the new sbt-launcher jar downloading logic.
Michael Armbrust <michael@databricks.com>
2014-03-02 00:35:23 -0800
Commit: 012bd5f, github.com/apache/spark/pull/14
Initialized the regVal for first iteration in SGD optimizer
DB Tsai <dbtsai@alpinenow.com>
2014-03-02 00:31:59 -0800
Commit: 6fc76e4, github.com/apache/spark/pull/40
[SPARK-1100] prevent Spark from overwriting directory silently
CodingCat <zhunansjtu@gmail.com>
2014-03-01 17:27:54 -0800
Commit: 3a8b698, github.com/apache/spark/pull/11
[SPARK-1150] fix repo location in create script (re-open)
CodingCat <zhunansjtu@gmail.com>
2014-03-01 17:24:53 -0800
Commit: fe195ae, github.com/apache/spark/pull/52
Revert "[SPARK-1150] fix repo location in create script"
Patrick Wendell <pwendell@gmail.com>
2014-03-01 17:15:38 -0800
Commit: ec992e1
[SPARK-1150] fix repo location in create script
Mark Grover <mark@apache.org>
2014-03-01 16:21:22 -0800
Commit: 9aa0957, github.com/apache/spark/pull/48
[SPARK-979] Randomize order of offers.
Kay Ousterhout <kayousterhout@gmail.com>
2014-03-01 11:24:22 -0800
Commit: 556c566, github.com/apache/spark/pull/27
SPARK-1151: Update dev merge script to use spark.git instead of incubator-spark
Thomas Graves <tgraves@apache.org>
2014-02-28 18:28:33 -0800
Commit: 4ba3f70, github.com/apache/spark/pull/47
SPARK-1051. On YARN, executors don't doAs submitting user
Sandy Ryza <sandy@cloudera.com>
2014-02-28 12:43:01 -0600
Commit: 46dff34, github.com/apache/spark/pull/29
SPARK-1032. If Yarn app fails before registering, app master stays aroun...
Sandy Ryza <sandy@cloudera.com>
2014-02-28 09:40:47 -0600
Commit: 5f419bf, github.com/apache/spark/pull/28
Remote BlockFetchTracker trait
Kay Ousterhout <kayousterhout@gmail.com>
2014-02-27 21:52:55 -0800
Commit: edf8a56, github.com/apache/spark/pull/39
Removed reference to incubation in Spark user docs.
Reynold Xin <rxin@apache.org>
2014-02-27 21:13:22 -0800
Commit: 40e080a, github.com/apache/spark/pull/2
[HOTFIX] Patching maven build after #6 (SPARK-1121).
Patrick Wendell <pwendell@gmail.com>
2014-02-27 15:06:20 -0800
Commit: c42557b, github.com/apache/spark/pull/37
SPARK 1084.1 (resubmitted)
Sean Owen <sowen@cloudera.com>
2014-02-27 11:12:21 -0800
Commit: 12bbca2, github.com/apache/spark/pull/31
Show Master status on UI page
Raymond Liu <raymond.liu@intel.com>
2014-02-26 23:51:32 -0800
Commit: aace2c0, github.com/apache/spark/pull/24
[SPARK-1089] fix the regression problem on ADD_JARS in 0.9
CodingCat <zhunansjtu@gmail.com>
2014-02-26 23:42:15 -0800
Commit: 345df5f, github.com/apache/spark/pull/13
SPARK-1121 Only add avro if the build is for Hadoop 0.23.X and SPARK_YARN is set
Prashant Sharma <prashant.s@imaginea.com>
2014-02-26 23:40:49 -0800
Commit: 6ccd6c5, github.com/apache/spark/pull/6
SPARK-1129: use a predefined seed when seed is zero in XORShiftRandom
Xiangrui Meng <meng@databricks.com>
2014-02-26 23:22:30 -0800
Commit: 5a3ad10, github.com/apache/spark/pull/645
Remove references to ClusterScheduler (SPARK-1140)
Kay Ousterhout <kayousterhout@gmail.com>
2014-02-26 22:52:42 -0800
Commit: 71f69d6, github.com/apache/spark/pull/9
Updated link for pyspark examples in docs
Jyotiska NK <jyotiska123@gmail.com>
2014-02-26 21:37:04 -0800
Commit: 2645035, github.com/apache/spark/pull/22
Deprecated and added a few java api methods for corresponding scala api.
Prashant Sharma <prashant.s@imaginea.com>
2014-02-26 21:17:44 -0800
Commit: 0e40e2b, github.com/apache/spark/pull/19
Removed reference to incubation in README.md.
Reynold Xin <rxin@apache.org>
2014-02-26 16:52:26 -0800
Commit: 84f7ca1, github.com/apache/spark/pull/1
SPARK-1115: Catch depickling errors
Bouke van der Bijl <boukevanderbijl@gmail.com>
2014-02-26 14:50:37 -0800
Commit: 12738c1, github.com/apache/spark/pull/644
SPARK-1135: fix broken anchors in docs
Matei Zaharia <matei@databricks.com>
2014-02-26 11:20:16 -0800
Commit: c86eec5, github.com/apache/spark/pull/3
SPARK-1078: Replace lift-json with json4s-jackson.
William Benton <willb@redhat.com>
2014-02-26 10:09:50 -0800
Commit: fbedc8e, github.com/apache/spark/pull/582
SPARK-1053. Don't require SPARK_YARN_APP_JAR
Sandy Ryza <sandy@cloudera.com>
2014-02-26 10:00:02 -0600
Commit: b8a1871, github.com/apache/spark/pull/553
For SPARK-1082, Use Curator for ZK interaction in standalone cluster
Raymond Liu <raymond.liu@intel.com>
2014-02-24 23:20:38 -0800
Commit: c852201, github.com/apache/incubator-spark/pull/611
Graph primitives2
Semih Salihoglu <semihsalihoglu@gmail.com>
2014-02-24 22:42:30 -0800
Commit: 1f4c7f7, github.com/apache/incubator-spark/pull/580
Include reference to twitter/chill in tuning docs
Andrew Ash <andrew@andrewash.com>
2014-02-24 21:13:38 -0800
Commit: a4f4fbc, github.com/apache/incubator-spark/pull/647
For outputformats that are Configurable, call setConf before sending data to them.
Bryn Keller <bryn.keller@intel.com>
2014-02-24 17:35:22 -0800
Commit: 4d88030, github.com/apache/incubator-spark/pull/638
d8d190e 2014-02-24 16:58:57 -0800
Merge pull request #641 from mateiz/spark-1124-master
[SPARK-1124: Fix infinite retries of reduce stage when a map stage failed]
Fix removal from shuffleToMapStage to search for a key-value pair with our stage instead of using our shuffleID.
Matei Zaharia <matei@databricks.com>
2014-02-24 13:14:56 -0800
Commit: 0187cef
SPARK-1124: Fix infinite retries of reduce stage when a map stage failed
Matei Zaharia <matei@databricks.com>
2014-02-23 23:45:48 -0800
Commit: cd32d5e
SPARK-1071: Tidy logging strategy and use of log4j
Sean Owen <sowen@cloudera.com>
2014-02-23 11:40:55 -0800
Commit: c0ef3af, github.com/apache/incubator-spark/pull/570
[SPARK-1041] remove dead code in start script, remind user to set that in spark-env.sh
CodingCat <zhunansjtu@gmail.com>
2014-02-22 20:21:15 -0800
Commit: 437b62f, github.com/apache/incubator-spark/pull/588
Migrate Java code to Scala or move it to src/main/java
Punya Biswal <pbiswal@palantir.com>
2014-02-22 17:53:48 -0800
Commit: 29ac7ea, github.com/apache/incubator-spark/pull/605
[SPARK-1055] fix the SCALA_VERSION and SPARK_VERSION in docker file
CodingCat <zhunansjtu@gmail.com>, Nan Zhu <CodingCat@users.noreply.github.com>
2014-02-22 15:39:25 -0800
Commit: 1aa4f8a, github.com/apache/incubator-spark/pull/634
doctest updated for mapValues, flatMapValues in rdd.py
jyotiska <jyotiska123@gmail.com>
2014-02-22 15:10:31 -0800
Commit: 722199f, github.com/apache/incubator-spark/pull/621
Fixed minor typo in worker.py
jyotiska <jyotiska123@gmail.com>
2014-02-22 10:09:50 -0800
Commit: 3ff077d, github.com/apache/incubator-spark/pull/630
SPARK-1117: update accumulator docs
Xiangrui Meng <meng@databricks.com>
2014-02-21 22:44:45 -0800
Commit: aaec7d4, github.com/apache/incubator-spark/pull/631
[SPARK-1113] External spilling - fix Int.MaxValue hash code collision bug
Andrew Or <andrewor14@gmail.com>
2014-02-21 20:05:39 -0800
Commit: fefd22f, github.com/apache/incubator-spark/pull/624
MLLIB-25: Implicit ALS runs out of memory for moderately large numbers of features
Sean Owen <sowen@cloudera.com>
2014-02-21 12:46:12 -0800
Commit: c8a4c9b, github.com/apache/incubator-spark/pull/629
SPARK-1111: URL Validation Throws Error for HDFS URL's
Patrick Wendell <pwendell@gmail.com>
2014-02-21 11:11:55 -0800
Commit: 45b15e2, github.com/apache/incubator-spark/pull/625
SPARK-1114: Allow PySpark to use existing JVM and Gateway
Ahir Reddy <ahirreddy@gmail.com>
2014-02-20 21:20:39 -0800
Commit: 59b1379, github.com/apache/incubator-spark/pull/622
Super minor: Add require for mergeCombiners in combineByKey
Aaron Davidson <aaron@databricks.com>
2014-02-20 16:46:13 -0800
Commit: 3fede48, github.com/apache/incubator-spark/pull/623
MLLIB-22. Support negative implicit input in ALS
Sean Owen <sowen@cloudera.com>
2014-02-19 23:44:53 -0800
Commit: 9e63f80, github.com/apache/incubator-spark/pull/500
MLLIB-24: url of "Collaborative Filtering for Implicit Feedback Datasets" in ALS is invalid now
Chen Chao <crazyjvm@gmail.com>
2014-02-19 22:06:35 -0800
Commit: f9b7d64, github.com/apache/incubator-spark/pull/619
[SPARK-1105] fix site scala version error in docs
CodingCat <zhunansjtu@gmail.com>
2014-02-19 15:54:03 -0800
Commit: 7b012c9, github.com/apache/incubator-spark/pull/618
SPARK-1106: check key name and identity file before launch a cluster
Xiangrui Meng <meng@databricks.com>
2014-02-18 18:30:02 -0800
Commit: b61435c, github.com/apache/incubator-spark/pull/617
Revert "[SPARK-1105] fix site scala version error in doc"
Patrick Wendell <pwendell@gmail.com>
2014-02-18 17:46:47 -0800
Commit: d9bb32a
[SPARK-1105] fix site scala version error in doc
CodingCat <zhunansjtu@gmail.com>
2014-02-18 16:29:23 -0800
Commit: d99773d, github.com/apache/incubator-spark/pull/616
Optimized imports
NirmalReddy <nirmal.reddy@imaginea.com>, NirmalReddy <nirmal_reddy2000@yahoo.com>
2014-02-18 14:44:36 -0800
Commit: ccb327a, github.com/apache/incubator-spark/pull/613
SPARK-1098: Minor cleanup of ClassTag usage in Java API
Aaron Davidson <aaron@databricks.com>
2014-02-17 19:23:27 -0800
Commit: f74ae0e, github.com/apache/incubator-spark/pull/604
[SPARK-1090] improvement on spark_shell (help information, configure memory)
CodingCat <zhunansjtu@gmail.com>
2014-02-17 15:12:52 -0800
Commit: e0d49ad, github.com/apache/incubator-spark/pull/599
Fix typos in Spark Streaming programming guide
Andrew Or <andrewor14@gmail.com>
2014-02-17 10:59:02 -0800
Commit: 767e3ae, github.com/apache/incubator-spark/pull/536
Worker registration logging fix
Andrew Ash <andrew@andrewash.com>
2014-02-17 09:51:55 -0800
Commit: c0795cf, github.com/apache/incubator-spark/pull/608
Add subtractByKey to the JavaPairRDD wrapper
Punya Biswal <pbiswal@palantir.com>
2014-02-16 18:55:59 -0800
Commit: 5af4477, github.com/apache/incubator-spark/pull/600
fix for https://spark-project.atlassian.net/browse/SPARK-1052
Bijay Bisht <bijay.bisht@gmail.com>
2014-02-16 16:52:57 -0800
Commit: 73cfdcf, github.com/apache/incubator-spark/pull/568
[SPARK-1092] print warning information if user use SPARK_MEM to regulate executor memory usage
CodingCat <zhunansjtu@gmail.com>
2014-02-16 12:25:38 -0800
Commit: 1cad381, github.com/apache/incubator-spark/pull/602
Typo: Standlone -> Standalone
Andrew Ash <andrew@andrewash.com>
2014-02-14 10:01:01 -0800
Commit: eec4bd1, github.com/apache/incubator-spark/pull/601
2414ed3 2014-02-13 14:26:06 -0800
Merge pull request #598 from shivaram/master.
[Update spark_ec2 to use 0.9.0 by default]
Add c3 instance types to Spark EC2
Christian Lundgren <christian.lundgren@gameanalytics.com>
2014-02-13 12:44:21 -0800
Commit: 5fa53c0, github.com/apache/incubator-spark/pull/595
Ported hadoopClient jar for < 1.0.1 fix
Bijay Bisht <bijay.bisht@gmail.com>
2014-02-12 23:42:10 -0800
Commit: a3bb861, github.com/apache/incubator-spark/pull/584
SPARK-1073 Keep GitHub pull request title as commit summary
Andrew Ash <andrew@andrewash.com>
2014-02-12 23:23:06 -0800
Commit: 6ee0ad8, github.com/apache/incubator-spark/pull/574
7fe7a55 2014-02-12 22:35:09 -0800
Merge pull request #592 from rxin/test.
[SPARK-1088: Create a script for running tests so we can have version specific testing on Jenkins.]
7e29e02 2014-02-12 16:26:25 -0800
Merge pull request #591 from mengxr/transient-new.
[SPARK-1076: [Fix #578] add @transient to some vals]
2bea070 2014-02-12 10:47:52 -0800
Merge pull request #589 from mengxr/index.
[SPARK-1076: Convert Int to Long to avoid overflow]
e733d65 2014-02-12 00:42:42 -0800
Merge pull request #578 from mengxr/rank.
[SPARK-1076: zipWithIndex and zipWithUniqueId to RDD]
68b2c0d 2014-02-11 22:39:48 -0800
Merge pull request #583 from colorant/zookeeper.
[Minor fix for ZooKeeperPersistenceEngine to use configured working dir]
b0dab1b 2014-02-11 14:48:59 -0800
Merge pull request #571 from holdenk/switchtobinarysearch.
[SPARK-1072 Use binary search when needed in RangePartioner]
ba38d98 2014-02-11 14:46:22 -0800
Merge pull request #577 from hsaputra/fix_simple_streaming_doc.
[SPARK-1075 Fix doc in the Spark Streaming custom receiver closing bracket in the class constructor]
4afe6cc 2014-02-10 22:28:39 -0800
Merge pull request #579 from CrazyJvm/patch-1.
["in the source DStream" rather than "int the source DStream"]
d6a9bdc 2014-02-09 23:35:06 -0800
Revert "Merge pull request #560 from pwendell/logging. Closes #560."
[This reverts commit b6d40b782327188a25ded5b22790552121e5271f.]
919bd7f 2014-02-09 22:17:52 -0800
Merge pull request #567 from ScrapCodes/style2.
[SPARK-1058, Fix Style Errors and Add Scala Style to Spark Build. Pt 2]
2182aa3 2014-02-09 15:19:50 -0800
Merge pull request #566 from martinjaggi/copy-MLlib-d.
[new MLlib documentation for optimization, regression and classification]
afc8f3c 2014-02-09 13:57:29 -0800
Merge pull request #551 from qqsun8819/json-protocol.
[[SPARK-1038] Add more fields in JsonProtocol and add tests that verify the JSON itself]
94ccf86 2014-02-09 13:54:27 -0800
Merge pull request #569 from pwendell/merge-fixes.
[Fixes bug where merges won't close associated pull request.]
b69f8b2 2014-02-09 10:09:19 -0800
Merge pull request #557 from ScrapCodes/style. Closes #557.
[SPARK-1058, Fix Style Errors and Add Scala Style to Spark Build.]
b6dba10 2014-02-08 23:39:17 -0800
Merge pull request #556 from CodingCat/JettyUtil. Closes #556.
[[SPARK-1060] startJettyServer should explicitly use IP information]
2ef37c9 2014-02-08 23:36:48 -0800
Merge pull request #562 from jyotiska/master. Closes #562.
[Added example Python code for sort]
b6d40b7 2014-02-08 23:35:31 -0800
Merge pull request #560 from pwendell/logging. Closes #560.
[[WIP] SPARK-1067: Default log4j initialization causes errors for those not using log4j]
f892da8 2014-02-08 23:13:34 -0800
Merge pull request #565 from pwendell/dev-scripts. Closes #565.
[SPARK-1066: Add developer scripts to repository.]
c2341c9 2014-02-08 16:00:43 -0800
Merge pull request #542 from markhamstra/versionBump. Closes #542.
[Version number to 1.0.0-SNAPSHOT]
f0ce736 2014-02-08 12:59:48 -0800
Merge pull request #561 from Qiuzhuang/master. Closes #561.
[Kill drivers in postStop() for Worker.]
7805080 2014-02-08 12:24:08 -0800
Merge pull request #454 from jey/atomic-sbt-download. Closes #454.
[Make sbt download an atomic operation]
fabf174 2014-02-08 11:39:13 -0800
Merge pull request #552 from martinjaggi/master. Closes #552.
[tex formulas in the documentation]
3a9d82c 2014-02-06 22:38:36 -0800
Merge pull request #506 from ash211/intersection. Closes #506.
[SPARK-1062 Add rdd.intersection(otherRdd) method]
1896c6e 2014-02-06 22:05:53 -0800
Merge pull request #533 from andrewor14/master. Closes #533.
[External spilling - generalize batching logic]
0b448df 2014-02-06 16:15:24 -0800
Merge pull request #450 from kayousterhout/fetch_failures. Closes #450.
[Only run ResubmitFailedStages event after a fetch fails]
18ad59e 2014-02-06 16:10:48 -0800
Merge pull request #321 from kayousterhout/ui_kill_fix. Closes #321.
[Inform DAG scheduler about all started/finished tasks.]
446403b 2014-02-06 15:41:16 -0800
Merge pull request #554 from sryza/sandy-spark-1056. Closes #554.
[SPARK-1056. Fix header comment in Executor to not imply that it's only u...]
084839b 2014-02-06 14:58:35 -0800
Merge pull request #498 from ScrapCodes/python-api. Closes #498.
[Python api additions]
79c9552 2014-02-05 23:38:12 -0800
Merge pull request #545 from kayousterhout/fix_progress. Closes #545.
[Fix off-by-one error with task progress info log.]
3802096 2014-02-05 23:37:07 -0800
Merge pull request #526 from tgravescs/yarn_client_stop_am_fix. Closes #526.
[spark on yarn - yarn-client mode doesn't always exit immediately]
18c4ee7 2014-02-05 22:08:47 -0800
Merge pull request #549 from CodingCat/deadcode_master. Closes #549.
[remove actorToWorker in master.scala, which is actually not used]
cc14ba9 2014-02-05 12:44:24 -0800
Merge pull request #544 from kayousterhout/fix_test_warnings. Closes #544.
[Fixed warnings in test compilation.]
f7fd80d 2014-02-05 10:29:45 -0800
Merge pull request #540 from sslavic/patch-3. Closes #540.
[Fix line end character stripping for Windows]
9209287 2014-02-04 09:47:11 -0800
Merge pull request #534 from sslavic/patch-1. Closes #534.
[Fixed wrong path to compute-classpath.cmd]
0c05cd3 2014-02-04 09:45:46 -0800
Merge pull request #535 from sslavic/patch-2. Closes #535.
[Fixed typo in scaladoc]
23af00f 2014-02-03 13:02:09 -0800
Merge pull request #528 from mengxr/sample. Closes #528.
[ Refactor RDD sampling and add randomSplit to RDD (update)]
1625d8c 2014-02-03 11:25:39 -0800
Merge pull request #530 from aarondav/cleanup. Closes #530.
[Remove explicit conversion to PairRDDFunctions in cogroup()]
0386f42 2014-02-02 21:51:17 -0800
Merge pull request #529 from hsaputra/cleanup_right_arrowop_scala
[Change the ⇒ character (maybe from scalariform) to => in Scala code for style consistency]
a8cf3ec 2014-01-31 16:52:02 -0800
Merge pull request #527 from ankurdave/graphx-assembly-pom
[Add GraphX to assembly/pom.xml]
ac712e4 2014-01-30 09:33:18 -0800
Merge pull request #524 from rxin/doc
[Added spark.shuffle.file.buffer.kb to configuration doc.]
0ff38c2 2014-01-29 12:44:54 -0800
Merge pull request #494 from tyro89/worker_registration_issue
[Issue with failed worker registrations]
7930209 2014-01-28 21:51:05 -0800
Merge pull request #497 from tdas/docs-update
[Updated Spark Streaming Programming Guide]
f8c742c 2014-01-28 21:30:20 -0800
Merge pull request #523 from JoshRosen/SPARK-1043
[Switch from MUTF8 to UTF8 in PySpark serializers.]
Switch from MUTF8 to UTF8 in PySpark serializers.
Josh Rosen <joshrosen@apache.org>
2014-01-28 19:50:26 -0800
Commit: 1381fc7
84670f2 2014-01-27 17:08:35 -0800
Merge pull request #466 from liyinan926/file-overwrite-new
[Allow files added through SparkContext.addFile() to be overwritten]
3d5c03e 2014-01-27 16:27:01 -0800
Merge pull request #516 from sarutak/master
[modified SparkPluginBuild.scala to use https protocol for accessing gith...]
f16c21e 2014-01-27 14:24:06 -0800
Merge pull request #490 from hsaputra/modify_checkoption_with_isdefined
[Replace the check for None Option with isDefined and isEmpty in Scala code]
f67ce3e 2014-01-27 11:15:51 -0800
Merge pull request #460 from srowen/RandomInitialALSVectors
[Choose initial user/item vectors uniformly on the unit sphere]
modified SparkPluginBuild.scala to use https protocol for accessing github.
sarutak <sarutak@oss.nttdata.co.jp>
2014-01-27 17:00:26 +0900
Commit: 6a5af7b
c40619d 2014-01-25 22:41:30 -0800
Merge pull request #504 from JoshRosen/SPARK-1025
[Fix PySpark hang when input files are deleted (SPARK-1025)]
c66a2ef 2014-01-25 22:36:07 -0800
Merge pull request #511 from JoshRosen/SPARK-1040
[Fix ClassCastException in JavaPairRDD.collectAsMap() (SPARK-1040)]
Fix ClassCastException in JavaPairRDD.collectAsMap() (SPARK-1040)
Josh Rosen <joshrosen@apache.org>
2014-01-25 16:39:20 -0800
Commit: 740e865
Increase JUnit test verbosity under SBT.
Josh Rosen <joshrosen@apache.org>
2014-01-25 16:32:44 -0800
Commit: 531d9d7
05be704 2014-01-23 20:53:18 -0800
Merge pull request #505 from JoshRosen/SPARK-1026
[Deprecate mapPartitionsWithSplit in PySpark (SPARK-1026)]
Deprecate mapPartitionsWithSplit in PySpark.
Josh Rosen <joshrosen@apache.org>
2014-01-23 20:01:36 -0800
Commit: 4cebb79
3d6e754 2014-01-23 19:47:00 -0800
Merge pull request #503 from pwendell/master
[Fix bug on read-side of external sort when using Snappy.]
Minor fix
Patrick Wendell <pwendell@gmail.com>
2014-01-23 19:23:12 -0800
Commit: ff44732
c319617 2014-01-23 19:11:59 -0800
Merge pull request #502 from pwendell/clone-1
[Remove Hadoop object cloning and warn users making Hadoop RDD's.]
cad3002 2014-01-23 19:08:34 -0800
Merge pull request #501 from JoshRosen/cartesian-rdd-fixes
[Fix two bugs in PySpark cartesian(): SPARK-978 and SPARK-1034]
Minor changes after auditing diff from earlier version
Patrick Wendell <pwendell@gmail.com>
2014-01-23 18:30:11 -0800
Commit: 268ecbd
Fix for SPARK-1025: PySpark hang on missing files.
Josh Rosen <joshrosen@apache.org>
2014-01-23 18:10:16 -0800
Commit: f830684
Response to Matei's review
Patrick Wendell <pwendell@gmail.com>
2014-01-23 18:12:40 -0800
Commit: c58d4ea
Fix bug on read-side of external sort when using Snappy.
Patrick Wendell <pwendell@gmail.com>
2014-01-23 17:59:42 -0800
Commit: 0213b40
Remove Hadoop object cloning and warn users making Hadoop RDD's.
Patrick Wendell <pwendell@gmail.com>
2014-01-23 13:30:54 -0800
Commit: 7101017
Fix SPARK-978: ClassCastException in PySpark cartesian.
Josh Rosen <joshrosen@apache.org>
2014-01-23 15:09:19 -0800
Commit: 6156990
Fix SPARK-1034: Py4JException on PySpark Cartesian Result
Josh Rosen <joshrosen@apache.org>
2014-01-23 13:05:59 -0800
Commit: 0035dbb
fad6aac 2014-01-23 11:14:15 -0800
Merge pull request #406 from eklavya/master
[Extending Java API coverage]
a2b47da 2014-01-23 10:48:26 -0800
Merge pull request #499 from jianpingjwang/dev1
[Replace commons-math with jblas in SVDPlusPlus]
fixed ClassTag in mapPartitions
eklavya <sr.eklavya@gmail.com>
2014-01-23 17:40:36 +0530
Commit: 60e7457
Add jblas dependency
Jianping J Wang <jianping.j.wang@gmail.com>
2014-01-23 19:54:01 +0800
Commit: 19a01c1
Add jblas dependency
Jianping J Wang <jianping.j.wang@gmail.com>
2014-01-23 19:48:39 +0800
Commit: a5a513e
Replace commons-math with jblas
Jianping J Wang <jianping.j.wang@gmail.com>
2014-01-23 19:44:30 +0800
Commit: cc0fd33
a1cd185 2014-01-22 19:37:29 -0800
Merge pull request #496 from pwendell/master
[Fix bug in worker clean-up in UI]
034dce2 2014-01-22 18:58:02 -0800
Merge pull request #447 from CodingCat/SPARK-1027
[fix for SPARK-1027]
Fix bug in worker clean-up in UI
Patrick Wendell <pwendell@gmail.com>
2014-01-22 18:19:46 -0800
Commit: 6285513
refactor sparkHome to val
CodingCat <zhunansjtu@gmail.com>
2014-01-22 19:32:51 -0500
Commit: 2b3c461
3184fac 2014-01-22 15:45:04 -0800
Merge pull request #495 from srowen/GraphXCommonsMathDependency
[Fix graphx Commons Math dependency]
Also add graphx commons-math3 dependeny in sbt build
Sean Owen <sowen@cloudera.com>
2014-01-22 22:40:41 +0000
Commit: 4476398
a1238bb 2014-01-22 14:32:59 -0800
Merge pull request #492 from skicavs/master
[fixed job name and usage information for the JavaSparkPi example]
Depend on Commons Math explicitly instead of accidentally getting it from Hadoop (which stops working in 2.2.x) and also use the newer commons-math3
Sean Owen <sowen@cloudera.com>
2014-01-22 22:25:49 +0000
Commit: fd0c5b8
576c4a4 2014-01-22 14:10:07 -0800
Merge pull request #478 from sryza/sandy-spark-1033
[SPARK-1033. Ask for cores in Yarn container requests]
5bcfd79 2014-01-22 14:05:48 -0800
Merge pull request #493 from kayousterhout/double_add
[Fixed bug where task set managers are added to queue twice]
d009b17 2014-01-22 14:01:30 -0800
Merge pull request #315 from rezazadeh/sparsesvd
[Sparse SVD]
Fixed bug where task set managers are added to queue twice
Kay Ousterhout <kayousterhout@gmail.com>
2014-01-22 09:49:31 -0800
Commit: 19da82c
fixed job name and usage information for the JavaSparkPi example
Kevin Mader <kevinmader@gmail.com>
2014-01-22 15:58:23 +0100
Commit: 36f9a64
Replace the code to check for Option != None with Option.isDefined call in Scala code.
Henry Saputra <hsaputra@apache.org>
2014-01-21 23:22:10 -0800
Commit: 90ea9d5
749f842 2014-01-21 14:53:49 -0800
Merge pull request #489 from ash211/patch-6
[Clarify spark.default.parallelism]
Clarify spark.default.parallelism
Andrew Ash <andrew@andrewash.com>
2014-01-21 14:49:35 -0800
Commit: 069bb94
f854498 2014-01-21 10:49:54 -0800
Merge pull request #469 from ajtulloch/use-local-spark-context-in-tests-for-mllib
[[MLlib] Use a LocalSparkContext trait in test suites]
Fixed import order
Andrew Tulloch <andrew@tullo.ch>
2014-01-21 13:34:59 +0000
Commit: 3a067b4
Incorporate Tom's comments - update doc and code to reflect that core requests may not always be honored
Sandy Ryza <sandy@cloudera.com>
2014-01-21 00:38:02 -0800
Commit: adf4261
77b986f 2014-01-21 00:09:42 -0800
Merge pull request #480 from pwendell/0.9-fixes
[Handful of 0.9 fixes]
Style clean-up
Patrick Wendell <pwendell@gmail.com>
2014-01-20 23:42:24 -0800
Commit: a9bcc98
c67d3d8 2014-01-20 23:34:35 -0800
Merge pull request #484 from tdas/run-example-fix
[Made run-example respect SPARK_JAVA_OPTS and SPARK_MEM.]
Removed SPARK_MEM from run-examples.
Tathagata Das <tathagata.das1565@gmail.com>
2014-01-20 23:15:28 -0800
Commit: 65869f8
Adding small code comment
Patrick Wendell <pwendell@gmail.com>
2014-01-20 23:11:45 -0800
Commit: a917a87
6b4eed7 2014-01-20 22:35:45 -0800
Merge pull request #449 from CrazyJvm/master
[SPARK-1028 : fix "set MASTER automatically fails" bug.]
0367981 2014-01-20 22:25:50 -0800
Merge pull request #482 from tdas/streaming-example-fix
[Added StreamingContext.awaitTermination to streaming examples]
7373ffb 2014-01-20 21:44:29 -0800
Merge pull request #483 from pwendell/gitignore
[Restricting /lib to top level directory in .gitignore]
Made run-example respect SPARK_JAVA_OPTS and SPARK_MEM.
Tathagata Das <tathagata.das1565@gmail.com>
2014-01-20 20:48:59 -0800
Commit: e0b741d
Restricting /lib to top level directory in .gitignore
Patrick Wendell <pwendell@gmail.com>
2014-01-20 20:39:10 -0800
Commit: e437069
Added StreamingContext.awaitTermination to streaming examples.
Tathagata Das <tathagata.das1565@gmail.com>
2014-01-20 20:25:04 -0800
Commit: 2e95174
Avoid matching attempt files in the checkpoint
Patrick Wendell <pwendell@gmail.com>
2014-01-20 20:02:02 -0800
Commit: d46df96
Remove shuffle files if they are still present on a machine.
Patrick Wendell <pwendell@gmail.com>
2014-01-20 19:11:22 -0800
Commit: de526ad
Fixing speculation bug
Patrick Wendell <pwendell@gmail.com>
2014-01-20 19:05:03 -0800
Commit: f84400e
Force use of LZF when spilling data
Patrick Wendell <pwendell@gmail.com>
2014-01-20 19:00:48 -0800
Commit: c324ac1
Bug fix for reporting of spill output
Patrick Wendell <pwendell@gmail.com>
2014-01-20 18:34:00 -0800
Commit: 1b29914
Minor fixes
Patrick Wendell <pwendell@gmail.com>
2014-01-20 18:33:21 -0800
Commit: 54867e9
Removing docs on akka options
Patrick Wendell <pwendell@gmail.com>
2014-01-20 16:35:26 -0800
Commit: cdb003e
SPARK-1033. Ask for cores in Yarn container requests
Sandy Ryza <sandy@cloudera.com>
2014-01-19 10:16:25 -0800
Commit: 3e85b87
fix for SPARK-1027
CodingCat <zhunansjtu@gmail.com>
2014-01-15 20:46:14 -0500
Commit: 29f4b6a
executor creation failed should not make the worker restart
CodingCat <zhunansjtu@gmail.com>
2014-01-15 19:32:50 -0500
Commit: f9a95d6
792d908 2014-01-19 11:33:11 -0800
Merge pull request #470 from tgravescs/fix_spark_examples_yarn
[Only log error on missing jar to allow spark examples to jar.]
256a355 2014-01-19 10:29:54 -0800
Merge pull request #458 from tdas/docs-update
[Updated java API docs for streaming, along with very minor changes in the code examples.]
update comment
Thomas Graves <tgraves@apache.org>
2014-01-19 12:21:39 -0600
Commit: dd56b21
Only log error on missing jar to allow spark examples to jar.
Thomas Graves <tgraves@apache.org>
2014-01-19 12:16:58 -0600
Commit: ceb79a3
LocalSparkContext for MLlib
Andrew Tulloch <andrew@tullo.ch>
2014-01-19 17:51:00 +0000
Commit: 720836a
Addressed comments from Reynold
Yinan Li <liyinan926@gmail.com>
2014-01-18 21:28:17 -0800
Commit: 584323c
fe8a354 2014-01-18 16:29:23 -0800
Merge pull request #459 from srowen/UpdaterL2Regularization
[Correct L2 regularized weight update with canonical form]
73dfd42 2014-01-18 16:23:56 -0800
Merge pull request #437 from mridulm/master
[Minor api usability changes]
4c16f79 2014-01-18 16:21:43 -0800
Merge pull request #426 from mateiz/py-ml-tests
[Re-enable Python MLlib tests (require Python 2.7 and NumPy 1.7+)]
bf56995 2014-01-18 16:17:34 -0800
Merge pull request #462 from mateiz/conf-file-fix
[Remove Typesafe Config usage and conf files to fix nested property names]
Allow files added through SparkContext.addFile() to be overwritten
Yinan Li <liyinan926@gmail.com>
2014-01-17 17:27:25 -0800
Commit: fd833e7
aa981e4 2014-01-18 12:49:21 -0800
Merge pull request #461 from pwendell/master
[Use renamed shuffle spill config in CoGroupedRDD.scala]
Use renamed shuffle spill config in CoGroupedRDD.scala
Patrick Wendell <pwendell@gmail.com>
2014-01-18 11:55:10 -0800
Commit: 5316bca
Correct L2 regularized weight update with canonical form
Sean Owen <sowen@cloudera.com>
2014-01-18 12:53:01 +0000
Commit: e91ad3f
rename to MatrixSVD
Reza Zadeh <rizlar@gmail.com>
2014-01-17 14:40:51 -0800
Commit: 85b95d0
rename to MatrixSVD
Reza Zadeh <rizlar@gmail.com>
2014-01-17 14:39:30 -0800
Commit: fa32998
Merge remote-tracking branch 'upstream/master' into sparsesvd
Reza Zadeh <rizlar@gmail.com>
2014-01-17 14:34:03 -0800
Commit: caf97a2
make example 0-indexed
Reza Zadeh <rizlar@gmail.com>
2014-01-17 14:33:03 -0800
Commit: 4e96757
0index docs
Reza Zadeh <rizlar@gmail.com>
2014-01-17 14:31:39 -0800
Commit: 5c639d7
prettify
Reza Zadeh <rizlar@gmail.com>
2014-01-17 14:14:29 -0800
Commit: c9b4845
add rename computeSVD
Reza Zadeh <rizlar@gmail.com>
2014-01-17 13:59:05 -0800
Commit: dbec69b
replace this.type with SVD
Reza Zadeh <rizlar@gmail.com>
2014-01-17 13:57:27 -0800
Commit: eb2d8c4
use 0-indexing
Reza Zadeh <rizlar@gmail.com>
2014-01-17 13:55:42 -0800
Commit: cb13b15
changes from PR
Reza Zadeh <rizlar@gmail.com>
2014-01-17 13:39:40 -0800
Commit: d28bf41
Address review comment
Mridul Muralidharan <mridul@gmail.com>
2014-01-17 18:28:55 +0530
Commit: b690e11
d749d47 2014-01-16 23:18:15 -0800
Merge pull request #451 from Qiuzhuang/master
[Fixed Window spark shell launch script error.]
d4fd89e 2014-01-16 23:17:30 -0800
Merge pull request #438 from ScrapCodes/clone-records-java-api
[Clone records java api]
adding clone records field to equivaled java apis
Prashant Sharma <scrapcodes@gmail.com>
2014-01-14 20:13:55 +0530
Commit: fcb4fc6
Updated java API docs for streaming, along with very minor changes in the code examples.
Tathagata Das <tathagata.das1565@gmail.com>
2014-01-16 14:44:02 -0800
Commit: 11e6534
Use method, not variable
Mridul Muralidharan <mridul@gmail.com>
2014-01-16 17:26:42 +0530
Commit: edd82c5
Address review comments
Mridul Muralidharan <mridul@gmail.com>
2014-01-16 17:23:25 +0530
Commit: 1a0da89
Fixed Window spark shell launch script error. JIRA SPARK-1029:https://spark-project.atlassian.net/browse/SPARK-1029
Qiuzhuang Lian <Qiuzhuang.Lian@gmail.com>
2014-01-16 16:09:10 +0800
Commit: 4e510b0
c06a307 2014-01-15 23:47:25 -0800
Merge pull request #445 from kayousterhout/exec_lost
[Fail rather than hanging if a task crashes the JVM.]
Updated unit test comment
Kay Ousterhout <kayousterhout@gmail.com>
2014-01-15 23:46:14 -0800
Commit: 718a13c
84595ea 2014-01-15 20:15:29 -0800
Merge pull request #414 from soulmachine/code-style
[Code clean up for mllib]
fix some format problem.
CrazyJvm <crazyjvm@gmail.com>
2014-01-16 11:57:46 +0800
Commit: 8400536
fix "set MASTER automatically fails" bug.
CrazyJvm <crazyjvm@gmail.com>
2014-01-16 11:45:02 +0800
Commit: 7a0c5b5
0675ca5 2014-01-15 16:09:03 -0800
Merge pull request #439 from CrazyJvm/master
[SPARK-1024 Remove "-XX:+UseCompressedStrings" option from tuning guide]
Fail rather than hanging if a task crashes the JVM.
Kay Ousterhout <kayousterhout@gmail.com>
2014-01-15 16:03:40 -0800
Commit: a268d63
4f0c361 2014-01-15 14:25:45 -0800
Merge pull request #444 from mateiz/py-version
[Clarify that Python 2.7 is only needed for MLlib]
Clarify that Python 2.7 is only needed for MLlib
Matei Zaharia <matei@databricks.com>
2014-01-15 14:20:39 -0800
Commit: 2ffdaef
59f475c 2014-01-15 13:55:14 -0800
Merge pull request #442 from pwendell/standalone
[Workers should use working directory as spark home if it's not specified]
2a05403 2014-01-15 13:54:45 -0800
Merge pull request #443 from tdas/filestream-fix
[Made some classes private[stremaing] and deprecated a method in JavaStreamingContext.]
Made some classes private[stremaing] and deprecated a method in JavaStreamingContext.
Tathagata Das <tathagata.das1565@gmail.com>
2014-01-15 12:15:46 -0800
Commit: 9e63753
5fecd25 2014-01-15 11:15:07 -0800
Merge pull request #441 from pwendell/graphx-build
[GraphX shouldn't list Spark as provided.]
Workers should use working directory as spark home if it's not specified
Patrick Wendell <pwendell@gmail.com>
2014-01-15 10:58:02 -0800
Commit: 00a3f7e
GraphX shouldn't list Spark as provided
Patrick Wendell <pwendell@gmail.com>
2014-01-15 10:44:17 -0800
Commit: 9259d70
494d3c0 2014-01-15 10:00:50 -0800
Merge pull request #433 from markhamstra/debFix
[Updated Debian packaging]
cef2af9 2014-01-15 10:06:17 -0600
Merge pull request #366 from colorant/yarn-dev
[More yarn code refactor]
remove "-XX:+UseCompressedStrings" option
CrazyJvm <crazyjvm@gmail.com>
2014-01-15 22:26:15 +0800
Commit: 263933d
3d9e66d 2014-01-14 23:17:05 -0800
Merge pull request #436 from ankurdave/VertexId-case
[Rename VertexID -> VertexId in GraphX]
Expose method and class - so that we can use it from user code (particularly since checkpoint directory is autogenerated now
Mridul Muralidharan <mridul@gmail.com>
2014-01-15 12:44:44 +0530
Commit: 0aea33d
139c24e 2014-01-14 23:07:55 -0800
Merge pull request #435 from tdas/filestream-fix
[Fixed the flaky tests by making SparkConf not serializable]
087487e 2014-01-14 22:50:36 -0800
Merge pull request #434 from rxin/graphxmaven
[Fixed SVDPlusPlusSuite in Maven build.]
Merge remote-tracking branch 'apache/master' into filestream-fix
Tathagata Das <tathagata.das1565@gmail.com>
2014-01-14 22:21:20 -0800
Commit: 0e15bd7
Changed SparkConf to not be serializable. And also fixed unit-test log paths in log4j.properties of external modules.
Tathagata Das <tathagata.das1565@gmail.com>
2014-01-14 22:20:14 -0800
Commit: 1f4718c
Fixed SVDPlusPlusSuite in Maven build.
Reynold Xin <rxin@apache.org>
2014-01-14 22:18:43 -0800
Commit: dfb1524
Removed repl-bin and updated maven build doc.
Mark Hamstra <markhamstra@gmail.com>
2014-01-14 21:36:58 -0800
Commit: 147a943
VertexID -> VertexId
Ankur Dave <ankurdave@gmail.com>
2014-01-14 22:17:18 -0800
Commit: f4d9019
Add deb profile to assembly/pom.xml
Mark Hamstra <markhamstra@gmail.com>
2014-01-14 21:23:09 -0800
Commit: 148757e
3a386e2 2014-01-14 21:52:50 -0800
Merge pull request #424 from jegonzal/GraphXProgrammingGuide
[Additional edits for clarity in the graphx programming guide.]
ad294db 2014-01-14 21:51:06 -0800
Merge pull request #431 from ankurdave/graphx-caching-doc
[Describe caching and uncaching in GraphX programming guide]
Describe GraphX caching and uncaching in guide
Ankur Dave <ankurdave@gmail.com>
2014-01-14 17:24:25 -0800
Commit: 1210ec2
74b46ac 2014-01-14 14:59:13 -0800
Merge pull request #428 from pwendell/writeable-objects
[Don't clone records for text files]
193a075 2014-01-14 14:53:24 -0800
Merge pull request #429 from ankurdave/graphx-examples-pom.xml
[Add GraphX dependency to examples/pom.xml]
d601a76 2014-01-14 14:52:24 -0800
Merge pull request #427 from pwendell/deprecate-aggregator
[Deprecate rather than remove old combineValuesByKey function]
Add GraphX dependency to examples/pom.xml
Ankur Dave <ankurdave@gmail.com>
2014-01-14 13:57:51 -0800
Commit: 8ea056d
Style fix
Patrick Wendell <pwendell@gmail.com>
2014-01-14 13:56:27 -0800
Commit: b1b22b7
Adding fix covering combineCombinersByKey as well
Patrick Wendell <pwendell@gmail.com>
2014-01-14 13:52:23 -0800
Commit: 8ea2cd5
2ce23a5 2014-01-14 13:28:44 -0800
Merge pull request #425 from rxin/scaladoc
[API doc update & make Broadcast public]
Complain if Python and NumPy versions are too old for MLlib
Matei Zaharia <matei@databricks.com>
2014-01-14 12:27:58 -0800
Commit: 5b3a3e2
Deprecate rather than remove old combineValuesByKey function
Patrick Wendell <pwendell@gmail.com>
2014-01-14 12:15:10 -0800
Commit: b683608
Re-enable Python MLlib tests (require Python 2.7 and NumPy 1.7+)
Matei Zaharia <matei@databricks.com>
2014-01-14 12:14:48 -0800
Commit: 938e4a0
Don't clone records for text files
Patrick Wendell <pwendell@gmail.com>
2014-01-14 11:57:53 -0800
Commit: 6f965a4
Fixed a typo in JavaSparkContext's API doc.
Reynold Xin <rxin@apache.org>
2014-01-14 11:42:28 -0800
Commit: f12e506
Maintain Serializable API compatibility by reverting back to java.io.Serializable for Broadcast and Accumulator.
Reynold Xin <rxin@apache.org>
2014-01-14 11:30:59 -0800
Commit: 1b5623f
Added license header for package.scala in the Java API package.
Reynold Xin <rxin@apache.org>
2014-01-14 11:20:12 -0800
Commit: 55db774
Added package doc for the Java API.
Reynold Xin <rxin@apache.org>
2014-01-14 11:16:25 -0800
Commit: f8c12e9
Updated API doc for Accumulable and Accumulator.
Reynold Xin <rxin@apache.org>
2014-01-14 11:16:08 -0800
Commit: 6a12b9e
Broadcast variable visibility change & doc update.
Reynold Xin <rxin@apache.org>
2014-01-14 11:15:21 -0800
Commit: 71b3007
Additional edits for clarity in the graphx programming guide.
Joseph E. Gonzalez <joseph.e.gonzalez@gmail.com>
2014-01-14 10:31:43 -0800
Commit: 0bba773
3fcc68b 2014-01-14 09:44:43 -0800
Merge pull request #423 from jegonzal/GraphXProgrammingGuide
[Improving the graphx-programming-guide]
Improving the graphx-programming-guide.
Joseph E. Gonzalez <joseph.e.gonzalez@gmail.com>
2014-01-14 09:40:06 -0800
Commit: 486f37c
Added parentheses for that getDouble() also has side effect
Frank Dai <soulmachine@gmail.com>
2014-01-14 18:56:11 +0800
Commit: 57fcfc7
fa75e5e 2014-01-14 01:18:34 -0800
Merge pull request #420 from pwendell/header-files
[Add missing header files]
Add missing header files
Patrick Wendell <pwendell@gmail.com>
2014-01-14 01:14:20 -0800
Commit: 2303479
Modifications as suggested in PR feedback-
Saurabh Rawat <sr.eklavya@gmail.com>
2014-01-14 14:19:02 +0530
Commit: 1442cd5
Merge remote-tracking branch 'upstream/master' into sparsesvd
Reza Zadeh <rizlar@gmail.com>
2014-01-13 23:52:34 -0800
Commit: 845e568
Merge remote-tracking branch 'upstream/master' into code-style
Frank Dai <soulmachine@gmail.com>
2014-01-14 15:29:17 +0800
Commit: a3da468
Indent two spaces
Frank Dai <soulmachine@gmail.com>
2014-01-14 14:59:01 +0800
Commit: c2852cf
Since getLong() and getInt() have side effect, get back parentheses, and remove an empty line
Frank Dai <soulmachine@gmail.com>
2014-01-14 14:53:10 +0800
Commit: 12386b3
Code clean up for mllib
Frank Dai <soulmachine@gmail.com>
2014-01-14 14:37:26 +0800
Commit: 0d94d74
Address comments to fix code formats
Raymond Liu <raymond.liu@intel.com>
2014-01-10 09:44:44 +0800
Commit: 4c22c55
Yarn workerRunnable refactor
Raymond Liu <raymond.liu@intel.com>
2014-01-09 14:16:07 +0800
Commit: 161ab93
Yarn Client refactor
Raymond Liu <raymond.liu@intel.com>
2014-01-09 09:53:50 +0800
Commit: 79a5ba3
Modifications as suggested in PR feedback-
Saurabh Rawat <sr.eklavya@gmail.com>
2014-01-13 23:40:04 +0530
Commit: e922973
Remove default param from mapPartitions
eklavya <sr.eklavya@gmail.com>
2014-01-13 18:13:22 +0530
Commit: fa42951
Remove classtag from mapPartitions.
eklavya <sr.eklavya@gmail.com>
2014-01-13 18:09:58 +0530
Commit: 8fe562c
Added foreachPartition method to JavaRDD.
eklavya <sr.eklavya@gmail.com>
2014-01-13 17:56:47 +0530
Commit: 6a65fee
Added mapPartitions method to JavaRDD.
eklavya <sr.eklavya@gmail.com>
2014-01-13 17:56:10 +0530
Commit: dbadc6b
Added setter method setGenerator to JavaRDD.
eklavya <sr.eklavya@gmail.com>
2014-01-13 17:53:35 +0530
Commit: aae8a01
Merge remote-tracking branch 'upstream/master' into sparsesvd
Reza Zadeh <rizlar@gmail.com>
2014-01-11 13:27:15 -0800
Commit: f324d53
add dimension parameters to example
Reza Zadeh <rizlar@gmail.com>
2014-01-10 21:30:54 -0800
Commit: 1afdeae
Merge remote-tracking branch 'upstream/master' into sparsesvd
Reza Zadeh <rizlar@gmail.com>
2014-01-09 22:45:32 -0800
Commit: 21c8a54
fix example
Reza Zadeh <rizlar@gmail.com>
2014-01-09 22:39:41 -0800
Commit: cf5bd4a
documentation for sparsematrix
Reza Zadeh <rizlar@gmail.com>
2014-01-07 17:19:28 -0800
Commit: 4f38b6f
More sparse matrix usage.
Reza Zadeh <rizlar@gmail.com>
2014-01-07 17:16:17 -0800
Commit: 7d7490b
fix docs to use SparseMatrix
Reza Zadeh <rizlar@gmail.com>
2014-01-05 18:03:57 -0800
Commit: 746148b
use SparseMatrix everywhere
Reza Zadeh <rizlar@gmail.com>
2014-01-04 14:28:07 -0800
Commit: 06c0f76
prettify
Reza Zadeh <rizlar@gmail.com>
2014-01-04 12:44:04 -0800
Commit: cdff9fc
new example file
Reza Zadeh <rizlar@gmail.com>
2014-01-04 12:33:22 -0800
Commit: e9bd6cb
fix tests
Reza Zadeh <rizlar@gmail.com>
2014-01-04 11:52:42 -0800
Commit: 8bfcce1
set methods
Reza Zadeh <rizlar@gmail.com>
2014-01-04 11:30:36 -0800
Commit: 35adc72
add k parameter
Reza Zadeh <rizlar@gmail.com>
2014-01-04 01:52:28 -0800
Commit: 73daa70
using decomposed matrix struct now
Reza Zadeh <rizlar@gmail.com>
2014-01-04 00:38:53 -0800
Commit: 26a74f0
new return struct
Reza Zadeh <rizlar@gmail.com>
2014-01-04 00:15:04 -0800
Commit: d2d5e5e
start using matrixentry
Reza Zadeh <rizlar@gmail.com>
2014-01-03 22:17:24 -0800
Commit: 7f631dd
rename sparsesvd.scala
Reza Zadeh <rizlar@gmail.com>
2014-01-03 21:55:38 -0800
Commit: 6bcdb76
New matrix entry file
Reza Zadeh <rizlar@gmail.com>
2014-01-03 21:54:57 -0800
Commit: b059a2a
fix error message
Reza Zadeh <rizlar@gmail.com>
2014-01-02 01:51:38 -0800
Commit: e617ae2
Merge remote-tracking branch 'upstream/master' into sparsesvd
Reza Zadeh <rizlar@gmail.com>
2014-01-02 01:50:30 -0800
Commit: 6140578
more docs yay
Reza Zadeh <rizlar@gmail.com>
2014-01-01 20:22:29 -0800
Commit: 2612164
javadoc for sparsesvd
Reza Zadeh <rizlar@gmail.com>
2014-01-01 20:20:16 -0800
Commit: 915d53f
old version of spark_ec2
Reza Zadeh <rizlar@gmail.com>
2014-01-01 20:08:01 -0800
Commit: c868d71
remove accidental changes to ec2 script
Reza Zadeh <rizlar@gmail.com>
2014-01-01 20:05:03 -0800
Commit: 0c3797d
doc tweaks
Reza Zadeh <rizlar@gmail.com>
2014-01-01 20:03:47 -0800
Commit: 53ccf65
doc tweak
Reza Zadeh <rizlar@gmail.com>
2014-01-01 20:02:37 -0800
Commit: 97dc527
doc tweaks
Reza Zadeh <rizlar@gmail.com>
2014-01-01 20:01:13 -0800
Commit: b941b6f
tweaks to docs
Reza Zadeh <rizlar@gmail.com>
2014-01-01 19:53:14 -0800
Commit: 185c882
New documentation
Reza Zadeh <rizlar@gmail.com>
2014-01-01 19:53:04 -0800
Commit: dd0d3f0
Merge remote-tracking branch 'upstream/master' into sparsesvd
Reza Zadeh <rizlar@gmail.com>
2014-01-01 18:12:35 -0800
Commit: 7c04b31
large scale considerations
Reza Zadeh <rizlar@gmail.com>
2013-12-27 04:15:13 -0500
Commit: ae5102a
initial large scale testing begin
Reza Zadeh <rizlar@gmail.com>
2013-12-27 01:51:19 -0500
Commit: 642ab5c
cleanup documentation
Reza Zadeh <rizlar@gmail.com>
2013-12-27 00:41:46 -0500
Commit: 3369c2d
add all tests
Reza Zadeh <rizlar@gmail.com>
2013-12-27 00:36:41 -0500
Commit: bdb5037
test for truncated svd
Reza Zadeh <rizlar@gmail.com>
2013-12-27 00:34:59 -0500
Commit: fa1e8d8
full rank matrix test added
Reza Zadeh <rizlar@gmail.com>
2013-12-26 23:21:57 -0500
Commit: 16de526
Main method added for svd
Reza Zadeh <rizlar@gmail.com>
2013-12-26 18:13:21 -0500
Commit: fe1a132
new main file
Reza Zadeh <rizlar@gmail.com>
2013-12-26 18:09:33 -0500
Commit: 1a21ba2
Object to hold the svd methods
Reza Zadeh <rizlar@gmail.com>
2013-12-26 17:39:25 -0500
Commit: 6c3674c
Some documentation
Reza Zadeh <rizlar@gmail.com>
2013-12-26 16:12:40 -0500
Commit: 6e740cc
Initial files - no tests
Reza Zadeh <rizlar@gmail.com>
2013-12-26 15:01:03 -0500
Commit: 1a173f0
Release 0.9.1
Revert "[maven-release-plugin] prepare release v0.9.1-rc2"
Tathagata Das <tathagata.das1565@gmail.com>
2014-03-26 21:53:07 -0700
Commit: ea5da04
Revert "[maven-release-plugin] prepare for next development iteration"
Tathagata Das <tathagata.das1565@gmail.com>
2014-03-26 21:51:40 -0700
Commit: d16e863
[SPARK-1327] GLM needs to check addIntercept for intercept and weights
Xiangrui Meng <meng@databricks.com>
2014-03-26 19:30:57 -0700
Commit: 4901604, github.com/apache/spark/pull/236
SPARK-1322, top in pyspark should sort result in descending order.
Prashant Sharma <prashant.s@imaginea.com>
2014-03-26 11:15:02 -0700
Commit: 2f90dc5, github.com/apache/spark/pull/235
[maven-release-plugin] prepare for next development iteration
Ubuntu <ubuntu@ip-172-31-18-245.us-west-2.compute.internal>
2014-03-26 09:26:45 +0000
Commit: 1f785d4
[maven-release-plugin] prepare release v0.9.1-rc2
Ubuntu <ubuntu@ip-172-31-18-245.us-west-2.compute.internal>
2014-03-26 09:26:40 +0000
Commit: 1197280
Updated CHANGES.txt
Tathagata Das <tathagata.das1565@gmail.com>
2014-03-26 02:10:57 -0700
Commit: 7495dba
[SPARK-782] Made Spark use existing shaded ASM and removed Spark's ASM dependency
Tathagata Das <tathagata.das1565@gmail.com>
2014-03-25 21:35:36 -0700
Commit: da87240, github.com/apache/spark/pull/232
Revert "[maven-release-plugin] prepare release v0.9.1-rc1"
Tathagata Das <tathagata.das1565@gmail.com>
2014-03-25 15:01:52 -0700
Commit: 55abe72
Revert "[maven-release-plugin] prepare for next development iteration"
Tathagata Das <tathagata.das1565@gmail.com>
2014-03-25 15:01:36 -0700
Commit: b94f997
[maven-release-plugin] prepare for next development iteration
Ubuntu <ubuntu@ip-172-31-18-245.us-west-2.compute.internal>
2014-03-24 06:56:16 +0000
Commit: 12e237e
[maven-release-plugin] prepare release v0.9.1-rc1
Ubuntu <ubuntu@ip-172-31-18-245.us-west-2.compute.internal>
2014-03-24 06:56:10 +0000
Commit: 81c6a06
Removed all occurences of incubator from all the pom.xml.
Tathagata Das <tathagata.das1565@gmail.com>
2014-03-23 23:31:59 -0700
Commit: 60ddb34
Updated CHANGES.txt file.
Tathagata Das <tathagata.das1565@gmail.com>
2014-03-23 13:16:50 -0700
Commit: f176b03
Fix to Stage UI to display numbers on progress bar
Emtiaz Ahmed <emtiazahmed@gmail.com>
2014-03-21 18:07:05 -0700
Commit: 5e7ac0d, github.com/apache/spark/pull/201
SPARK-1284: Fix improper use of SimpleDateFormat
zsxwing <zsxwing@gmail.com>
2014-03-21 16:39:23 -0700
Commit: 8856076, github.com/apache/spark/pull/179
[SPARK-1273] use doi links in mllib-guide
Xiangrui Meng <meng@databricks.com>
2014-03-21 14:35:32 -0700
Commit: d68549e, github.com/apache/spark/pull/198
Removed incubating from Spark version in all the pom.xml.
Tathagata Das <tathagata.das1565@gmail.com>
2014-03-20 18:02:55 -0700
Commit: 8b1e793
Bumped versions to Spark 0.9.1
Tathagata Das <tathagata.das1565@gmail.com>
2014-03-20 16:55:35 -0700
Commit: 8a882ef, github.com/apache/spark/pull/190
SPARK-1032. If Yarn app fails before registering, app master stays aroun...
Sandy Ryza <sandy@cloudera.com>
2014-03-20 16:50:44 -0500
Commit: c6630d3, github.com/apache/spark/pull/28
SPARK-1051. On YARN, executors don't doAs submitting user
Sandy Ryza <sandy@cloudera.com>
2014-03-20 14:48:05 -0500
Commit: 748f002, github.com/apache/spark/pull/29
[SPARK-1285] Backporting updates to streaming docs to branch 0.9
Aaron Kimball <aaron@magnify.io>, Tathagata Das <tathagata.das1565@gmail.com>, Chen Chao <crazyjvm@gmail.com>, Andrew Or <andrewor14@gmail.com>
2014-03-20 12:27:47 -0700
Commit: 1e36690, github.com/apache/spark/pull/183
[SPARK-1273] MLlib bug fixes, improvements, and doc updates for v0.9.1
Xiangrui Meng <meng@databricks.com>, Sean Owen <sowen@cloudera.com>, Andrew Tulloch <andrew@tullo.ch>, Chen Chao <crazyjvm@gmail.com>
2014-03-19 19:05:26 -0700
Commit: 1cc979e, github.com/apache/spark/pull/175
[SPARK-1275] Made dev/run-tests executable.
Tathagata Das <tathagata.das1565@gmail.com>
2014-03-19 16:10:45 -0700
Commit: a4eef65, github.com/apache/spark/pull/178
Update the yarn alpha version to 0.9.1-incubating-SNAPSHOT
Thomas Graves <tgraves@apache.org>
2014-03-19 12:41:11 -0500
Commit: 72875b2
SPARK-1203 fix saving to hdfs from yarn
Thomas Graves <tgraves@apache.org>
2014-03-19 08:19:47 -0500
Commit: 250ec27, github.com/apache/spark/pull/173
bugfix: Wrong "Duration" in "Active Stages" in stages page
shiyun.wxm <shiyun.wxm@taobao.com>
2014-03-19 01:42:42 -0700
Commit: d385b5a, github.com/apache/spark/pull/170
[SPARK-1274] Add dev scripts to merge PRs and create releases from master to branch-0.9
Tathagata Das <tathagata.das1565@gmail.com>
2014-03-18 22:09:16 -0700
Commit: 7ec78bc, github.com/apache/spark/pull/176
Bundle tachyon: SPARK-1269
Nick Lanham <nick@afternight.org>
2014-03-18 22:05:18 -0700
Commit: 0183ddd, github.com/apache/spark/pull/137
[Spark-1261] add instructions for running python examples to doc overview page
Diana Carroll <dcarroll@cloudera.com>
2014-03-17 17:37:03 -0700
Commit: 20d9458, github.com/apache/spark/pull/162
SPARK-1244: Throw exception if map output status exceeds frame size
Patrick Wendell <pwendell@gmail.com>, Andrew Or <andrewor14@gmail.com>
2014-03-17 14:06:28 -0700
Commit: 4562140, github.com/apache/spark/pull/152
SPARK-1240: handle the case of empty RDD when takeSample
CodingCat <zhunansjtu@gmail.com>
2014-03-16 22:40:22 -0700
Commit: af7e8b1, github.com/apache/spark/pull/135
SPARK-977 Added Python RDD.zip function
Prabin Banka <prabin.banka@imaginea.com>
2014-03-16 22:16:17 -0700
Commit: 1dc1e98, github.com/apache/spark/pull/76
Spark-1163, Added missing Python RDD functions
prabinb <prabin.banka@imaginea.com>
2014-03-16 22:14:53 -0700
Commit: 249930a, github.com/apache/spark/pull/92
SPARK-1168, Added foldByKey to pyspark.
Prashant Sharma <prashant.s@imaginea.com>
2014-03-16 22:13:33 -0700
Commit: 4480505, github.com/apache/spark/pull/115
Updated link for pyspark examples in docs
Jyotiska NK <jyotiska123@gmail.com>
2014-03-16 22:12:51 -0700
Commit: e74e79a, github.com/apache/spark/pull/22
SPARK-1019: pyspark RDD take() throws an NPE
Patrick Wendell <pwendell@gmail.com>
2014-03-12 23:17:17 -0700
Commit: ef74e44, github.com/apache/spark/pull/112
Fix example bug: compile error
jianghan <jianghan@xiaomi.com>
2014-03-12 19:46:48 -0700
Commit: 87e4dd5, github.com/apache/spark/pull/132
SPARK-1162 Added top in python.
Prashant Sharma <prashant.s@imaginea.com>
2014-03-12 15:57:54 -0700
Commit: 51a77e9, github.com/apache/spark/pull/93
Version fix in pom file
Patrick Wendell <pwendell@gmail.com>
2014-03-11 14:48:01 -0700
Commit: 7049164
Log4j build fix on 0.9 branch
Patrick Wendell <pwendell@gmail.com>
2014-03-11 11:53:29 -0700
Commit: 6cbd580
SPARK-1167: Remove metrics-ganglia from default build due to LGPL issues...
Patrick Wendell <pwendell@gmail.com>
2014-03-11 11:24:21 -0700
Commit: 0c91927, github.com/apache/spark/pull/108
For outputformats that are Configurable, call setConf before sending data to them.
Bryn Keller <bryn.keller@intel.com>
2014-03-09 17:47:46 -0700
Commit: 6f0db0a, github.com/apache/spark/pull/638
SPARK-1190: Do not initialize log4j if slf4j log4j backend is not being used
Patrick Wendell <pwendell@gmail.com>
2014-03-08 16:02:56 -0800
Commit: 0f0d044, github.com/apache/spark/pull/107
SPARK-1184: Update the distribution tar.gz to include spark-assembly jar
Mark Grover <mark@apache.org>
2014-03-05 16:55:36 -0800
Commit: 0fc0fdb, github.com/apache/spark/pull/78
SPARK-1109 wrong API docs for pyspark map function
Prashant Sharma <prashant.s@imaginea.com>
2014-03-04 15:33:00 -0800
Commit: 7ea89ec, github.com/apache/spark/pull/73
Add Jekyll tag to isolate "production-only" doc components. (0.9 version)
Patrick Wendell <pwendell@gmail.com>
2014-03-02 18:18:44 -0800
Commit: 267d96c, github.com/apache/spark/pull/57
Removed reference to incubation in Spark user docs.
Reynold Xin <rxin@apache.org>
2014-02-27 21:14:18 -0800
Commit: f2bf44a, github.com/apache/spark/pull/2
[SPARK-1089] fix the regression problem on ADD_JARS in 0.9
CodingCat <zhunansjtu@gmail.com>
2014-02-26 23:42:53 -0800
Commit: bc5e7d7, github.com/apache/spark/pull/13
Removed reference to incubation in README.md.
Reynold Xin <rxin@apache.org>
2014-02-26 16:53:56 -0800
Commit: 349764d, github.com/apache/spark/pull/1
SPARK-1115: Catch depickling errors
Bouke van der Bijl <boukevanderbijl@gmail.com>
2014-02-26 14:53:30 -0800
Commit: 886a466, github.com/apache/incubator-spark/pull/644
SPARK-1135: fix broken anchors in docs
Matei Zaharia <matei@databricks.com>
2014-02-26 11:56:12 -0800
Commit: 6fe72dd, github.com/apache/spark/pull/3
Fix removal from shuffleToMapStage to search for a key-value pair with our stage instead of using our shuffleID.
Matei Zaharia <matei@databricks.com>
2014-02-24 17:01:21 -0800
Commit: 0661cdc
SPARK-1124: Fix infinite retries of reduce stage when a map stage failed
Matei Zaharia <matei@databricks.com>
2014-02-24 17:00:47 -0800
Commit: 5e74b8e
[SPARK-1055] fix the SCALA_VERSION and SPARK_VERSION in docker file
CodingCat <zhunansjtu@gmail.com>, Nan Zhu <CodingCat@users.noreply.github.com>
2014-02-22 15:39:41 -0800
Commit: 00db30c, github.com/apache/incubator-spark/pull/634
SPARK-1117: update accumulator docs
Xiangrui Meng <meng@databricks.com>
2014-02-21 22:44:59 -0800
Commit: ed58742, github.com/apache/incubator-spark/pull/631
[SPARK-1113] External spilling - fix Int.MaxValue hash code collision bug
Andrew Or <andrewor14@gmail.com>
2014-02-21 20:06:09 -0800
Commit: 84131fe, github.com/apache/incubator-spark/pull/624
MLLIB-25: Implicit ALS runs out of memory for moderately large numbers of features
Sean Owen <sowen@cloudera.com>
2014-02-21 13:39:17 -0800
Commit: 998abae, github.com/apache/incubator-spark/pull/629
SPARK-1111: URL Validation Throws Error for HDFS URL's
Patrick Wendell <pwendell@gmail.com>
2014-02-21 11:12:38 -0800
Commit: b3fff96, github.com/apache/incubator-spark/pull/625
Super minor: Add require for mergeCombiners in combineByKey
Aaron Davidson <aaron@databricks.com>
2014-02-20 16:46:29 -0800
Commit: 3c44ff4, github.com/apache/incubator-spark/pull/623
[SPARK-1105] fix site scala version error in docs
CodingCat <zhunansjtu@gmail.com>
2014-02-19 15:56:24 -0800
Commit: 289d761, github.com/apache/incubator-spark/pull/618
Revert "[SPARK-1105] fix site scala version error in doc"
Patrick Wendell <pwendell@gmail.com>
2014-02-18 17:47:34 -0800
Commit: 7bde72e
[SPARK-1105] fix site scala version error in doc
CodingCat <zhunansjtu@gmail.com>
2014-02-18 16:33:41 -0800
Commit: 0f0395c, github.com/apache/incubator-spark/pull/616
Worker registration logging fix
Andrew Ash <andrew@andrewash.com>
2014-02-17 09:52:16 -0800
Commit: b0b5288, github.com/apache/incubator-spark/pull/608
fix for https://spark-project.atlassian.net/browse/SPARK-1052
Bijay Bisht <bijay.bisht@gmail.com>
2014-02-16 16:52:57 -0800
Commit: e797c1a, github.com/apache/incubator-spark/pull/568
Add c3 instance types to Spark EC2
Christian Lundgren <christian.lundgren@gameanalytics.com>
2014-02-13 12:44:21 -0800
Commit: 19b4bb2, github.com/apache/incubator-spark/pull/595
SPARK-1088: Create a script for running tests so we can have version specific testing on Jenkins (branch-0.9)
Reynold Xin <rxin@apache.org>
2014-02-12 23:42:58 -0800
Commit: e5b86b1, github.com/apache/incubator-spark/pull/593
Ported hadoopClient jar for < 1.0.1 fix
Bijay Bisht <bijay.bisht@gmail.com>
2014-02-12 23:42:10 -0800
Commit: 8093de1, github.com/apache/incubator-spark/pull/584
754bc18 2014-02-12 14:26:39 -0800
Merge pull request #590 from rxin/scalastyle.
[SPARK-1085: Fix Jenkins pull request builder for branch-0.9 (scalastyle command not found)]
28f88c5 2014-02-11 22:43:09 -0800
Merge pull request #583 from colorant/zookeeper.
[Minor fix for ZooKeeperPersistenceEngine to use configured working dir]
e70690f 2014-02-09 23:33:35 -0800
Revert "Merge pull request #560 from pwendell/logging. Closes #560."
[This reverts commit 2e3d1c31db55c7f961e559e47bb497ae15cb74d7.]
de22abc 2014-02-08 23:37:05 -0800
Merge pull request #562 from jyotiska/master. Closes #562.
[Added example Python code for sort]
2e3d1c3 2014-02-08 23:35:41 -0800
Merge pull request #560 from pwendell/logging. Closes #560.
[[WIP] SPARK-1067: Default log4j initialization causes errors for those not using log4j]
22e0a3b 2014-02-08 13:00:07 -0800
Merge pull request #561 from Qiuzhuang/master. Closes #561.
[Kill drivers in postStop() for Worker.]
ce179f6 2014-02-06 22:06:30 -0800
Merge pull request #533 from andrewor14/master. Closes #533.
[External spilling - generalize batching logic]
24e5298 2014-02-06 16:15:36 -0800
Merge pull request #450 from kayousterhout/fetch_failures. Closes #450.
[Only run ResubmitFailedStages event after a fetch fails]
94896bb 2014-02-06 16:13:10 -0800
Merge pull request #321 from kayousterhout/ui_kill_fix. Closes #321.
[Inform DAG scheduler about all started/finished tasks.]
44a2b03 2014-02-05 23:38:25 -0800
Merge pull request #545 from kayousterhout/fix_progress. Closes #545.
[Fix off-by-one error with task progress info log.]
b044b0b 2014-02-05 23:37:38 -0800
Merge pull request #526 from tgravescs/yarn_client_stop_am_fix. Closes #526.
[spark on yarn - yarn-client mode doesn't always exit immediately]
d815cfa 2014-02-04 09:47:25 -0800
Merge pull request #534 from sslavic/patch-1. Closes #534.
[Fixed wrong path to compute-classpath.cmd]
f3cba2d 2014-02-04 09:46:00 -0800
Merge pull request #535 from sslavic/patch-2. Closes #535.
[Fixed typo in scaladoc]
5f63f32 2014-02-03 22:44:30 -0800
Merge pull request #449 from CrazyJvm/master
[SPARK-1028 : fix "set MASTER automatically fails" bug.]
6e4d089 2014-02-03 22:42:43 -0800
Merge pull request #414 from soulmachine/code-style
[Code clean up for mllib]
0021ef9 2014-02-03 22:42:01 -0800
Merge pull request #445 from kayousterhout/exec_lost
[Fail rather than hanging if a task crashes the JVM.]
dc8adf1 2014-02-03 22:41:30 -0800
Merge pull request #489 from ash211/patch-6
[Clarify spark.default.parallelism]
574741f 2014-02-03 22:40:55 -0800
Merge pull request #493 from kayousterhout/double_add
[Fixed bug where task set managers are added to queue twice]
1280e8a 2014-02-03 22:40:29 -0800
Merge pull request #511 from JoshRosen/SPARK-1040
[Fix ClassCastException in JavaPairRDD.collectAsMap() (SPARK-1040)]
2c6c9b9 2014-02-03 22:39:59 -0800
Merge pull request #504 from JoshRosen/SPARK-1025
[Fix PySpark hang when input files are deleted (SPARK-1025)]
b10f607 2014-02-03 22:39:10 -0800
Merge pull request #516 from sarutak/master
[modified SparkPluginBuild.scala to use https protocol for accessing gith...]
18520f5 2014-02-03 22:37:38 -0800
Merge pull request #490 from hsaputra/modify_checkoption_with_isdefined
[Replace the check for None Option with isDefined and isEmpty in Scala code]
a414071 2014-01-31 16:54:33 -0800
Merge pull request #524 from rxin/doc
[Added spark.shuffle.file.buffer.kb to configuration doc.]
a41a83c 2014-01-31 16:53:26 -0800
Merge pull request #527 from ankurdave/graphx-assembly-pom
[Add GraphX to assembly/pom.xml]
d18fe1f 2014-01-28 21:55:15 -0800
Merge pull request #497 from tdas/docs-update
[Updated Spark Streaming Programming Guide]
5edbd17 2014-01-28 21:32:58 -0800
Merge pull request #523 from JoshRosen/SPARK-1043
[Switch from MUTF8 to UTF8 in PySpark serializers.]
[maven-release-plugin] prepare for next development iteration
Ubuntu <ubuntu@ip-10-109-132-81.ec2.internal>
2014-01-24 06:15:15 +0000
Commit: 0f60ef2
Release 0.9.0-incubating
d0a105d Thu Jan 23 20:53:31 2014 -0800
Merge pull request #505 from JoshRosen/SPARK-1026
[Deprecate mapPartitionsWithSplit in PySpark (SPARK-1026)]
e66d4c2 Thu Jan 23 19:47:16 2014 -0800
Merge pull request #503 from pwendell/master
[Fix bug on read-side of external sort when using Snappy.]
e8d3f2b Thu Jan 23 19:20:22 2014 -0800
Merge pull request #502 from pwendell/clone-1
[Remove Hadoop object cloning and warn users making Hadoop RDD's.]
7a62353 Thu Jan 23 19:09:25 2014 -0800
Merge pull request #501 from JoshRosen/cartesian-rdd-fixes
[Fix two bugs in PySpark cartesian(): SPARK-978 and SPARK-1034]
51960b8 Wed Jan 22 19:37:50 2014 -0800
Merge pull request #496 from pwendell/master
[Fix bug in worker clean-up in UI]
828f7b4 Wed Jan 22 15:45:18 2014 -0800
Merge pull request #495 from srowen/GraphXCommonsMathDependency
[Fix graphx Commons Math dependency]
dc5857a Wed Jan 22 14:33:25 2014 -0800
Merge pull request #492 from skicavs/master
[fixed job name and usage information for the JavaSparkPi example]
dd533c9 Wed Jan 22 14:15:58 2014 -0800
Merge pull request #478 from sryza/sandy-spark-1033
[SPARK-1033. Ask for cores in Yarn container requests]
b6fd3cd Tue Jan 21 00:12:01 2014 -0800
Merge pull request #480 from pwendell/0.9-fixes
[Handful of 0.9 fixes]
e5f8917 Mon Jan 20 23:35:07 2014 -0800
Merge pull request #484 from tdas/run-example-fix
[Made run-example respect SPARK_JAVA_OPTS and SPARK_MEM.]
410ba06 Mon Jan 20 22:26:14 2014 -0800
Merge pull request #482 from tdas/streaming-example-fix
[Added StreamingContext.awaitTermination to streaming examples]
f137947 Mon Jan 20 22:24:07 2014 -0800
Merge pull request #483 from pwendell/gitignore
[Restricting /lib to top level directory in .gitignore]
94ae25d Sun Jan 19 11:33:51 2014 -0800
Merge pull request #470 from tgravescs/fix_spark_examples_yarn
[Only log error on missing jar to allow spark examples to jar.]
0f077b5 Sun Jan 19 10:30:29 2014 -0800
Merge pull request #458 from tdas/docs-update
[Updated java API docs for streaming, along with very minor changes in the code examples.]
03019d1 Sat Jan 18 16:29:43 2014 -0800
Merge pull request #459 from srowen/UpdaterL2Regularization
[Correct L2 regularized weight update with canonical form]
76147a2 Sat Jan 18 16:24:16 2014 -0800
Merge pull request #437 from mridulm/master
[Minor api usability changes]
4ac8cab Sat Jan 18 16:22:46 2014 -0800
Merge pull request #426 from mateiz/py-ml-tests
[Re-enable Python MLlib tests (require Python 2.7 and NumPy 1.7+)]
34e911c Sat Jan 18 16:17:34 2014 -0800
Merge pull request #462 from mateiz/conf-file-fix
[Remove Typesafe Config usage and conf files to fix nested property names]
ff7201c Sat Jan 18 12:50:02 2014 -0800
Merge pull request #461 from pwendell/master
[Use renamed shuffle spill config in CoGroupedRDD.scala]
7b0d5a5 Thu Jan 16 23:18:48 2014 -0800
Merge pull request #451 from Qiuzhuang/master
[Fixed Window spark shell launch script error.]
4ccedb3 Wed Jan 15 14:26:48 2014 -0800
Merge pull request #444 from mateiz/py-version
[Clarify that Python 2.7 is only needed for MLlib]
e3fa36f Wed Jan 15 13:56:04 2014 -0800
Merge pull request #442 from pwendell/standalone
[Workers should use working directory as spark home if it's not specified]
29c76d9 Wed Jan 15 13:55:48 2014 -0800
Merge pull request #443 from tdas/filestream-fix
[Made some classes private[stremaing] and deprecated a method in JavaStreamingContext.]
aca40aa Wed Jan 15 11:15:47 2014 -0800
Merge pull request #441 from pwendell/graphx-build
[GraphX shouldn't list Spark as provided.]
e12c374 Wed Jan 15 10:01:43 2014 -0800
Merge pull request #433 from markhamstra/debFix
[Updated Debian packaging]
2f015c2 Tue Jan 14 23:17:28 2014 -0800
Merge pull request #436 from ankurdave/VertexId-case
[Rename VertexID -> VertexId in GraphX]
2859cab Tue Jan 14 23:08:19 2014 -0800
Merge pull request #435 from tdas/filestream-fix
[Fixed the flaky tests by making SparkConf not serializable]
fbfbb33 Tue Jan 14 23:06:29 2014 -0800
Merge pull request #434 from rxin/graphxmaven
[Fixed SVDPlusPlusSuite in Maven build.]
2c6c07f Tue Jan 14 21:53:05 2014 -0800
Merge pull request #424 from jegonzal/GraphXProgrammingGuide
[Additional edits for clarity in the graphx programming guide.]
6fa4e02 Tue Jan 14 21:51:25 2014 -0800
Merge pull request #431 from ankurdave/graphx-caching-doc
[Describe caching and uncaching in GraphX programming guide]
2f930d5 Tue Jan 14 15:00:11 2014 -0800
Merge pull request #428 from pwendell/writeable-objects
[Don't clone records for text files]
329c9df Tue Jan 14 14:53:36 2014 -0800
Merge pull request #429 from ankurdave/graphx-examples-pom.xml
[Add GraphX dependency to examples/pom.xml]
a14933d Tue Jan 14 14:52:42 2014 -0800
Merge pull request #427 from pwendell/deprecate-aggregator
[Deprecate rather than remove old combineValuesByKey function]
119b6c5 Tue Jan 14 13:29:08 2014 -0800
Merge pull request #425 from rxin/scaladoc
[API doc update & make Broadcast public]
bf3b150 Tue Jan 14 09:45:22 2014 -0800
Merge pull request #423 from jegonzal/GraphXProgrammingGuide
[Improving the graphx-programming-guide]
1b4adc2 Tue Jan 14 01:19:24 2014 -0800
Merge pull request #420 from pwendell/header-files
[Add missing header files]
b60840e Tue Jan 14 00:48:34 2014 -0800
Merge pull request #418 from pwendell/0.9-versions
[Version changes for release 0.9.0.]
980250b Tue Jan 14 00:05:37 2014 -0800
Merge pull request #416 from tdas/filestream-fix
[Removed unnecessary DStream operations and updated docs]
055be5c Mon Jan 13 23:26:44 2014 -0800
Merge pull request #415 from pwendell/shuffle-compress
[Enable compression by default for spills]
fdaabdc Mon Jan 13 23:08:26 2014 -0800
Merge pull request #380 from mateiz/py-bayes
[Add Naive Bayes to Python MLlib, and some API fixes]
4a805af Mon Jan 13 22:58:38 2014 -0800
Merge pull request #367 from ankurdave/graphx
[GraphX: Unifying Graphs and Tables]
945fe7a Mon Jan 13 22:56:12 2014 -0800
Merge pull request #408 from pwendell/external-serializers
[Improvements to external sorting]
68641bc Mon Jan 13 22:54:13 2014 -0800
Merge pull request #413 from rxin/scaladoc
[Adjusted visibility of various components and documentation for 0.9.0 release.]
0ca0d4d Mon Jan 13 22:32:21 2014 -0800
Merge pull request #401 from andrewor14/master
[External sorting - Add number of bytes spilled to Web UI]
08b9fec Mon Jan 13 22:29:03 2014 -0800
Merge pull request #409 from tdas/unpersist
[Automatically unpersisting RDDs that have been cleaned up from DStreams]
b07bc02 Mon Jan 13 20:45:22 2014 -0800
Merge pull request #412 from harveyfeng/master
[Add default value for HadoopRDD's `cloneRecords` constructor arg]
a2fee38 Mon Jan 13 19:45:26 2014 -0800
Merge pull request #411 from tdas/filestream-fix
[Improved logic of finding new files in FileInputDStream]
01c0d72 Mon Jan 13 16:24:30 2014 -0800
Merge pull request #410 from rxin/scaladoc1
[Updated JavaStreamingContext to make scaladoc compile.]
8038da2 Mon Jan 13 14:59:30 2014 -0800
Merge pull request #2 from jegonzal/GraphXCCIssue
[Improving documentation and identifying potential bug in CC calculation.]
b93f9d4 Mon Jan 13 12:18:05 2014 -0800
Merge pull request #400 from tdas/dstream-move
[Moved DStream and PairDSream to org.apache.spark.streaming.dstream]
e6ed13f Sun Jan 12 22:35:14 2014 -0800
Merge pull request #397 from pwendell/host-port
[Remove now un-needed hostPort option]
0b96d85 Sun Jan 12 21:31:43 2014 -0800
Merge pull request #399 from pwendell/consolidate-off
[Disable shuffle file consolidation by default]
0ab505a Sun Jan 12 21:31:04 2014 -0800
Merge pull request #395 from hsaputra/remove_simpleredundantreturn_scala
[Remove simple redundant return statements for Scala methods/functions]
405bfe8 Sun Jan 12 20:04:21 2014 -0800
Merge pull request #394 from tdas/error-handling
[Better error handling in Spark Streaming and more API cleanup]
28a6b0c Sun Jan 12 19:49:36 2014 -0800
Merge pull request #398 from pwendell/streaming-api
[Rename DStream.foreach to DStream.foreachRDD]
074f502 Sun Jan 12 17:01:13 2014 -0800
Merge pull request #396 from pwendell/executor-env
[Setting load defaults to true in executor]
82e2b92 Sun Jan 12 16:55:11 2014 -0800
Merge pull request #392 from rxin/listenerbus
[Stop SparkListenerBus daemon thread when DAGScheduler is stopped.]
288a878 Sat Jan 11 21:53:19 2014 -0800
Merge pull request #389 from rxin/clone-writables
[Minor update for clone writables and more documentation.]
dbc11df Sat Jan 11 18:07:13 2014 -0800
Merge pull request #388 from pwendell/master
[Fix UI bug introduced in #244.]
409866b Sat Jan 11 17:12:06 2014 -0800
Merge pull request #393 from pwendell/revert-381
[Revert PR 381]
6510f04 Sat Jan 11 12:48:26 2014 -0800
Merge pull request #387 from jerryshao/conf-fix
[Fix configure didn't work small problem in ALS]
ee6e7f9 Sat Jan 11 12:07:55 2014 -0800
Merge pull request #359 from ScrapCodes/clone-writables
[We clone hadoop key and values by default and reuse objects if asked to.]
4216178 Sat Jan 11 09:46:48 2014 -0800
Merge pull request #373 from jerryshao/kafka-upgrade
[Upgrade Kafka dependecy to 0.8.0 release version]
92ad18b Fri Jan 10 23:25:15 2014 -0800
Merge pull request #376 from prabeesh/master
[Change clientId to random clientId]
0b5ce7a Fri Jan 10 23:23:21 2014 -0800
Merge pull request #386 from pwendell/typo-fix
[Small typo fix]
1d7bef0 Fri Jan 10 18:53:03 2014 -0800
Merge pull request #381 from mateiz/default-ttl
[Fix default TTL for metadata cleaner]
44d6a8e Fri Jan 10 17:51:50 2014 -0800
Merge pull request #382 from RongGu/master
[Fix a type error in comment lines]
88faa30 Fri Jan 10 17:14:22 2014 -0800
Merge pull request #385 from shivaram/add-i2-instances
[Add i2 instance types to Spark EC2.]
f265531 Fri Jan 10 16:25:44 2014 -0800
Merge pull request #383 from tdas/driver-test
[API for automatic driver recovery for streaming programs and other bug fixes]
d37408f Fri Jan 10 16:25:01 2014 -0800
Merge pull request #377 from andrewor14/master
[External Sorting for Aggregator and CoGroupedRDDs (Revisited)]
0eaf01c Fri Jan 10 15:32:19 2014 -0800
Merge pull request #369 from pillis/master
[SPARK-961 Add a Vector.random() method]
7cef843 Fri Jan 10 15:34:15 2014 -0600
Merge pull request #371 from tgravescs/yarn_client_addjar_misc_fixes
[Yarn client addjar and misc fixes]
7b58f11 Fri Jan 10 12:47:46 2014 -0800
Merge pull request #384 from pwendell/debug-logs
[Make DEBUG-level logs consummable.]
23d2995 Fri Jan 10 10:20:02 2014 -0800
Merge pull request #1 from jegonzal/graphx
[ProgrammingGuide]
0ebc973 Thu Jan 9 23:58:49 2014 -0800
Merge pull request #375 from mateiz/option-fix
[Fix bug added when we changed AppDescription.maxCores to an Option]
dd03cea Thu Jan 9 23:38:03 2014 -0800
Merge pull request #378 from pwendell/consolidate_on
[Enable shuffle consolidation by default.]
997c830 Thu Jan 9 22:22:20 2014 -0800
Merge pull request #363 from pwendell/streaming-logs
[Set default logging to WARN for Spark streaming examples.]
300eaa9 Thu Jan 9 20:29:51 2014 -0800
Merge pull request #353 from pwendell/ipython-simplify
[Simplify and fix pyspark script.]
4b074fa Thu Jan 9 19:03:55 2014 -0800
Merge pull request #374 from mateiz/completeness
[Add some missing Java API methods]
a9d5333 Thu Jan 9 18:46:46 2014 -0800
Merge pull request #294 from RongGu/master
[Bug fixes for updating the RDD block's memory and disk usage information]
d86a85e Thu Jan 9 18:37:52 2014 -0800
Merge pull request #293 from pwendell/standalone-driver
[SPARK-998: Support Launching Driver Inside of Standalone Mode]
26cdb5f Thu Jan 9 17:16:34 2014 -0800
Merge pull request #372 from pwendell/log4j-fix-1
[Send logs to stderr by default (instead of stdout).]
12f414e Thu Jan 9 15:31:30 2014 -0800
Merge pull request #362 from mateiz/conf-getters
[Use typed getters for configuration settings]
365cac9 Thu Jan 9 00:56:16 2014 -0800
Merge pull request #361 from rxin/clean
[Minor style cleanup. Mostly on indenting & line width changes.]
73c724e Thu Jan 9 00:32:19 2014 -0800
Merge pull request #368 from pwendell/sbt-fix
[Don't delegate to users `sbt`.]
dceedb4 Wed Jan 8 23:19:28 2014 -0800
Merge pull request #364 from pwendell/fix
[Fixing config option "retained_stages" => "retainedStages".]
04d83fc Wed Jan 8 11:55:37 2014 -0800
Merge pull request #360 from witgo/master
[fix make-distribution.sh show version: command not found]
56ebfea Wed Jan 8 11:50:06 2014 -0800
Merge pull request #357 from hsaputra/set_boolean_paramname
[Set boolean param name for call to SparkHadoopMapReduceUtil.newTaskAttemptID]
bdeaeaf Wed Jan 8 11:48:39 2014 -0800
Merge pull request #358 from pwendell/add-cdh
[Add CDH Repository to Maven Build]
5cae05f Wed Jan 8 11:47:28 2014 -0800
Merge pull request #356 from hsaputra/remove_deprecated_cleanup_method
[Remove calls to deprecated mapred's OutputCommitter.cleanupJob]
6eef78d Wed Jan 8 08:49:20 2014 -0600
Merge pull request #345 from colorant/yarn
[support distributing extra files to worker for yarn client mode]
bb6a39a Tue Jan 7 22:32:18 2014 -0800
Merge pull request #322 from falaki/MLLibDocumentationImprovement
[SPARK-1009 Updated MLlib docs to show how to use it in Python]
cb1b927 Tue Jan 7 22:26:28 2014 -0800
Merge pull request #355 from ScrapCodes/patch-1
[Update README.md]
c0f0155 Tue Jan 7 22:21:52 2014 -0800
Merge pull request #313 from tdas/project-refactor
[Refactored the streaming project to separate external libraries like Twitter, Kafka, Flume, etc.]
f5f12dc Tue Jan 7 21:56:35 2014 -0800
Merge pull request #336 from liancheng/akka-remote-lookup
[Get rid of `Either[ActorRef, ActorSelection]']
11891e6 Wed Jan 8 00:32:18 2014 -0500
Merge pull request #327 from lucarosellini/master
[Added ‘-i’ command line option to Spark REPL]
7d0aac9 Wed Jan 8 00:30:45 2014 -0500
Merge pull request #354 from hsaputra/addasfheadertosbt
[Add ASF header to the new sbt script.]
d75dc42 Wed Jan 8 00:30:03 2014 -0500
Merge pull request #350 from mateiz/standalone-limit
[Add way to limit default # of cores used by apps in standalone mode]
61674bc Tue Jan 7 18:32:13 2014 -0800
Merge pull request #352 from markhamstra/oldArch
[Don't leave os.arch unset after BlockManagerSuite]
b2e690f Tue Jan 7 16:57:08 2014 -0800
Merge pull request #328 from falaki/MatrixFactorizationModel-fix
[SPARK-1012: DAGScheduler Exception Fix]
6ccf8ce Tue Jan 7 15:49:14 2014 -0800
Merge pull request #351 from pwendell/maven-fix
[Add log4j exclusion rule to maven.]
7d5fa17 Tue Jan 7 11:31:34 2014 -0800
Merge pull request #337 from yinxusen/mllib-16-bugfix
[Mllib 16 bugfix]
71fc113 Tue Jan 7 11:30:35 2014 -0800
Merge pull request #349 from CodingCat/support-worker_dir
[add the comments about SPARK_WORKER_DIR]
15d9534 Tue Jan 7 08:10:02 2014 -0800
Merge pull request #318 from srowen/master
[Suggested small changes to Java code for slightly more standard style, encapsulation and in some cases performance]
468af0f Tue Jan 7 08:09:01 2014 -0800
Merge pull request #348 from prabeesh/master
[spark -> org.apache.spark]
c3cf047 Tue Jan 7 00:54:25 2014 -0800
Merge pull request #339 from ScrapCodes/conf-improvements
[Conf improvements]
a862caf Tue Jan 7 00:18:20 2014 -0800
Merge pull request #331 from holdenk/master
[Add a script to download sbt if not present on the system]
b97ef21 Mon Jan 6 20:12:57 2014 -0800
Merge pull request #346 from sproblvem/patch-1
[Update stop-slaves.sh]
7210257 Mon Jan 6 18:25:44 2014 -0800
Merge pull request #128 from adamnovak/master
[Fix failing "sbt/sbt publish-local" by adding a no-argument PrimitiveKeyOpenHashMap constructor ]
e4d6057 Mon Jan 6 14:56:54 2014 -0800
Merge pull request #343 from pwendell/build-fix
[Fix test breaking downstream builds]
93bf962 Mon Jan 6 11:42:41 2014 -0800
Merge pull request #340 from ScrapCodes/sbt-fixes
[Made java options to be applied during tests so that they become self explanatory.]
60edeb3 Mon Jan 6 11:40:32 2014 -0800
Merge pull request #338 from ScrapCodes/ning-upgrade
[SPARK-1005 Ning upgrade]
c708e81 Mon Jan 6 11:35:48 2014 -0800
Merge pull request #341 from ash211/patch-5
[Clarify spark.cores.max in docs]
33fcb91 Mon Jan 6 11:19:23 2014 -0800
Merge pull request #342 from tgravescs/fix_maven_protobuf
[Change protobuf version for yarn alpha back to 2.4.1]
357083c Mon Jan 6 10:29:04 2014 -0800
Merge pull request #330 from tgravescs/fix_addjars_null_handling
[Fix handling of empty SPARK_EXAMPLES_JAR]
a2e7e04 Sun Jan 5 22:37:36 2014 -0800
Merge pull request #333 from pwendell/logging-silence
[Quiet ERROR-level Akka Logs]
5b0986a Sun Jan 5 19:25:09 2014 -0800
Merge pull request #334 from pwendell/examples-fix
[Removing SPARK_EXAMPLES_JAR in the code]
f4b924f Sun Jan 5 17:11:47 2014 -0800
Merge pull request #335 from rxin/ser
[Fall back to zero-arg constructor for Serializer initialization if there is no constructor that accepts SparkConf.]
d43ad3e Sat Jan 4 16:29:30 2014 -0800
Merge pull request #292 from soulmachine/naive-bayes
[standard Naive Bayes classifier]
86404da Sat Jan 4 14:55:54 2014 -0800
Merge pull request #127 from jegonzal/MapByPartition
[Adding mapEdges and mapTriplets by Partition]
e68cdb1 Sat Jan 4 13:46:02 2014 -0800
Merge pull request #124 from jianpingjwang/master
[refactor and bug fix]
280ddf6 Sat Jan 4 12:54:41 2014 -0800
Merge pull request #121 from ankurdave/more-simplify
[Simplify GraphImpl internals further]
10fe23b Fri Jan 3 23:50:14 2014 -0800
Merge pull request #329 from pwendell/remove-binaries
[SPARK-1002: Remove Binaries from Spark Source]
c4d6145 Fri Jan 3 16:30:53 2014 -0800
Merge pull request #325 from witgo/master
[Modify spark on yarn to create SparkConf process]
4ae101f Fri Jan 3 11:24:35 2014 -0800
Merge pull request #317 from ScrapCodes/spark-915-segregate-scripts
[Spark-915 segregate scripts]
87248bd Fri Jan 3 00:45:31 2014 -0800
Merge pull request #1 from apache/master
[Merge latest Spark changes]
30b9db0 Thu Jan 2 23:15:55 2014 -0800
Merge pull request #285 from colorant/yarn-refactor
[Yarn refactor]
498a5f0 Thu Jan 2 19:06:40 2014 -0800
Merge pull request #323 from tgravescs/sparkconf_yarn_fix
[fix spark on yarn after the sparkConf changes]
0475ca8 Thu Jan 2 15:17:08 2014 -0800
Merge pull request #320 from kayousterhout/erroneous_failed_msg
[Remove erroneous FAILED state for killed tasks.]
588a169 Thu Jan 2 13:20:54 2014 -0800
Merge pull request #297 from tdas/window-improvement
[Improvements to DStream window ops and refactoring of Spark's CheckpointSuite]
5e67cdc Thu Jan 2 12:56:28 2014 -0800
Merge pull request #319 from kayousterhout/remove_error_method
[Removed redundant TaskSetManager.error() function.]
ca67909 Thu Jan 2 15:54:54 2014 -0500
Merge pull request #311 from tmyklebu/master
[SPARK-991: Report information gleaned from a Python stacktrace in the UI]
3713f81 Wed Jan 1 21:29:12 2014 -0800
Merge pull request #309 from mateiz/conf2
[SPARK-544. Migrate configuration to a SparkConf class]
c1d928a Wed Jan 1 17:03:48 2014 -0800
Merge pull request #312 from pwendell/log4j-fix-2
[SPARK-1008: Logging improvments]
dc9cb83 Wed Jan 1 13:28:34 2014 -0800
Merge pull request #126 from jegonzal/FixingPersist
[Fixing Persist Behavior]
9a0ff72 Tue Dec 31 21:50:24 2013 -0800
Merge pull request #314 from witgo/master
[restore core/pom.xml file modification]
8b8e70e Tue Dec 31 17:48:24 2013 -0800
Merge pull request #73 from falaki/ApproximateDistinctCount
[Approximate distinct count]
63b411d Tue Dec 31 14:31:28 2013 -0800
Merge pull request #238 from ngbinh/upgradeNetty
[upgrade Netty from 4.0.0.Beta2 to 4.0.13.Final]
32d6ae9 Tue Dec 31 13:51:07 2013 -0800
Merge pull request #120 from ankurdave/subgraph-reuses-view
[Reuse VTableReplicated in GraphImpl.subgraph]
55b7e2f Tue Dec 31 10:12:51 2013 -0800
Merge pull request #289 from tdas/filestream-fix
[Bug fixes for file input stream and checkpointing]
2b71ab9 Mon Dec 30 11:01:30 2013 -0800
Merge pull request from aarondav: Utilize DiskBlockManager pathway for temp file writing
[This gives us a couple advantages:]
50e3b8e Mon Dec 30 07:44:26 2013 -0800
Merge pull request #308 from kayousterhout/stage_naming
[Changed naming of StageCompleted event to be consistent]
72a17b6 Sat Dec 28 21:25:40 2013 -1000
Revert "Merge pull request #310 from jyunfan/master"
[This reverts commit 79b20e4dbe3dcd8559ec8316784d3334bb55868b, reversing]
79b20e4 Sat Dec 28 21:13:36 2013 -1000
Merge pull request #310 from jyunfan/master
[Fix typo in the Accumulators section]
7375047 Sat Dec 28 13:25:06 2013 -0800
Merge pull request #304 from kayousterhout/remove_unused
[Removed unused failed and causeOfFailure variables (in TaskSetManager)]
ad3dfd1 Fri Dec 27 22:10:14 2013 -0500
Merge pull request #307 from kayousterhout/other_failure
[Removed unused OtherFailure TaskEndReason.]
b579b83 Fri Dec 27 22:09:04 2013 -0500
Merge pull request #306 from kayousterhout/remove_pending
[Remove unused hasPendingTasks methods]
19672dc Fri Dec 27 13:37:10 2013 -0800
Merge pull request #305 from kayousterhout/line_spacing
[Fixed >100char lines in DAGScheduler.scala]
7be1e57 Thu Dec 26 23:41:40 2013 -1000
Merge pull request #298 from aarondav/minor
[Minor: Decrease margin of left side of Log page]
7d811ba Thu Dec 26 23:39:58 2013 -1000
Merge pull request #302 from pwendell/SPARK-1007
[SPARK-1007: spark-class2.cmd should change SCALA_VERSION to be 2.10]
5e69fc5 Thu Dec 26 19:10:39 2013 -0500
Merge pull request #295 from markhamstra/JobProgressListenerNPE
[Avoid a lump of coal (NPE) in JobProgressListener's stocking.]
da20270 Thu Dec 26 12:11:52 2013 -0800
Merge pull request #1 from aarondav/driver
[Refactor DriverClient to be more Actor-based]
e240bad Thu Dec 26 12:30:48 2013 -0500
Merge pull request #296 from witgo/master
[Renamed ClusterScheduler to TaskSchedulerImpl for yarn and new-yarn package]
c344ed0 Thu Dec 26 01:31:06 2013 -0500
Merge pull request #283 from tmyklebu/master
[Python bindings for mllib]
56094bc Wed Dec 25 13:14:33 2013 -0500
Merge pull request #290 from ash211/patch-3
[Typo: avaiable -> available]
4842a07 Wed Dec 25 01:52:15 2013 -0800
Merge pull request #287 from azuryyu/master
[Fixed job name in the java streaming example.]
85a344b Tue Dec 24 16:35:06 2013 -0800
Merge pull request #127 from kayousterhout/consolidate_schedulers
[Deduplicate Local and Cluster schedulers.]
c2dd6bc Tue Dec 24 14:36:47 2013 -0800
Merge pull request #279 from aarondav/shuffle-cleanup0
[Clean up shuffle files once their metadata is gone]
3bf7c70 Tue Dec 24 16:37:13 2013 -0500
Merge pull request #275 from ueshin/wip/changeclasspathorder
[Change the order of CLASSPATH.]
d63856c Mon Dec 23 22:07:26 2013 -0800
Merge pull request #286 from rxin/build
[Show full stack trace and time taken in unit tests.]
23a9ae6 Tue Dec 24 00:08:48 2013 -0500
Merge pull request #277 from tdas/scheduler-update
[Refactored the streaming scheduler and added StreamingListener interface]
11107c9 Mon Dec 23 10:38:20 2013 -0800
Merge pull request #244 from leftnoteasy/master
[Added SPARK-968 implementation for review]
44e4205 Sun Dec 22 11:44:18 2013 -0800
Merge pull request #116 from jianpingjwang/master
[remove unused variables and fix a bug]
4797c22 Fri Dec 20 13:30:39 2013 -0800
Merge pull request #118 from ankurdave/VertexPartitionSuite
[Test VertexPartition and fix bugs]
0bc57c5 Fri Dec 20 11:56:54 2013 -0800
Merge pull request #280 from aarondav/minor
[Minor cleanup for standalone scheduler]
ac70b8f Fri Dec 20 10:56:10 2013 -0800
Merge pull request #117 from ankurdave/more-tests
[More tests]
45310d4 Thu Dec 19 22:08:20 2013 -0800
Merge pull request #115 from ankurdave/test-reorg
[Reorganize unit tests; improve GraphSuite test coverage]
9228ec8 Thu Dec 19 21:37:15 2013 -0800
Merge pull request #1 from aarondav/127
[Merge master into 127]
eca68d4 Thu Dec 19 18:12:22 2013 -0800
Merge pull request #272 from tmyklebu/master
[Track and report task result serialisation time.]
7990c56 Thu Dec 19 13:35:09 2013 -0800
Merge pull request #276 from shivaram/collectPartition
[Add collectPartition to JavaRDD interface.]
440e531 Thu Dec 19 10:38:56 2013 -0800
Merge pull request #278 from MLnick/java-python-tostring
[Add toString to Java RDD, and __repr__ to Python RDD]
d8d3f3e Thu Dec 19 00:06:43 2013 -0800
Merge pull request #183 from aarondav/spark-959
[[SPARK-959] Explicitly depend on org.eclipse.jetty.orbit jar]
bfba532 Wed Dec 18 22:22:21 2013 -0800
Merge pull request #247 from aarondav/minor
[Increase spark.akka.askTimeout default to 30 seconds]
da301b5 Wed Dec 18 20:03:29 2013 -0800
Merge pull request #112 from amatsukawa/scc
[Strongly connected component algorithm]
c64a53a Wed Dec 18 16:56:26 2013 -0800
Merge pull request #267 from JoshRosen/cygwin
[Fix Cygwin support in several scripts.]
a645ef6 Wed Dec 18 16:07:52 2013 -0800
Merge pull request #48 from amatsukawa/add_project_to_graph
[Add mask operation on graph and filter graph primitive]
d7ebff0 Wed Dec 18 15:38:48 2013 -0800
Merge pull request #1 from ankurdave/add_project_to_graph
[Merge current master and reimplement Graph.mask using innerJoin]
5ea1872 Wed Dec 18 15:27:24 2013 -0800
Merge pull request #274 from azuryy/master
[Fixed the example link in the Scala programing guid.]
3fd2e09 Wed Dec 18 12:52:36 2013 -0800
Merge pull request #104 from jianpingjwang/master
[SVD++ demo]
f4effb3 Tue Dec 17 22:26:21 2013 -0800
Merge pull request #273 from rxin/top
[Fixed a performance problem in RDD.top and BoundedPriorityQueue]
1b5eacb Tue Dec 17 13:49:17 2013 -0800
Merge pull request #102 from ankurdave/clustered-edge-index
[Add clustered index on edges by source vertex]
7a8169b Mon Dec 16 22:42:21 2013 -0800
Merge pull request #268 from pwendell/shaded-protobuf
[Add support for 2.2. to master (via shaded jars)]
0476c84 Mon Dec 16 17:19:25 2013 -0800
Merge pull request #100 from ankurdave/mrTriplets-active-set
[Support activeSet option in mapReduceTriplets]
964a3b6 Mon Dec 16 15:23:51 2013 -0800
Merge pull request #270 from ewencp/really-force-ssh-pseudo-tty-master
[Force pseudo-tty allocation in spark-ec2 script.]
5192ef3 Mon Dec 16 15:08:08 2013 -0800
Merge pull request #94 from ankurdave/load-edges-columnar
[Load edges in columnar format]
883e034 Mon Dec 16 14:16:02 2013 -0800
Merge pull request #245 from gregakespret/task-maxfailures-fix
[Fix for spark.task.maxFailures not enforced correctly.]
a51f340 Sun Dec 15 22:02:30 2013 -0800
Merge pull request #265 from markhamstra/scala.binary.version
[DRY out the POMs with scala.binary.version]
ded10ce Sun Dec 15 17:25:33 2013 -0800
Merge pull request #103 from amplab/optimizations
[Optimizations cherry-picked from SIGMOD branches]
d2ced6d Sun Dec 15 14:11:34 2013 -0800
Merge pull request #256 from MLnick/master
[Fix 'IPYTHON=1 ./pyspark' throwing ValueError]
c55e698 Sun Dec 15 12:49:02 2013 -0800
Merge pull request #257 from tgravescs/sparkYarnFixName
[Fix the --name option for Spark on Yarn]
ab85f88 Sun Dec 15 12:48:32 2013 -0800
Merge pull request #264 from shivaram/spark-class-fix
[Use CoarseGrainedExecutorBackend in spark-class]
8a56c1f Sat Dec 14 16:29:24 2013 -0800
Merge pull request #84 from amatsukawa/graphlab_enhancements
[GraphLab bug fix & set start vertex]
7db9165 Sat Dec 14 14:16:34 2013 -0800
Merge pull request #251 from pwendell/master
[Fix list rendering in YARN markdown docs.]
2fd781d Sat Dec 14 12:59:37 2013 -0800
Merge pull request #249 from ngbinh/partitionInJavaSortByKey
[Expose numPartitions parameter in JavaPairRDD.sortByKey()]
9bf192b Sat Dec 14 12:52:18 2013 -0800
Merge pull request #91 from amplab/standalone-pagerank
[Standalone PageRank]
840af5e Sat Dec 14 12:51:51 2013 -0800
Merge pull request #99 from ankurdave/only-dynamic-pregel
[Remove static Pregel; take maxIterations in dynamic Pregel]
97ac060 Sat Dec 14 00:22:45 2013 -0800
Merge pull request #259 from pwendell/scala-2.10
[Migration to Scala 2.10]
7ac944f Fri Dec 13 23:22:08 2013 -0800
Merge pull request #262 from pwendell/mvn-fix
[Fix maven build issues in 2.10 branch]
6defb06 Fri Dec 13 21:18:57 2013 -0800
Merge pull request #261 from ScrapCodes/scala-2.10
[Added a comment about ActorRef and ActorSelection difference.]
76566b1 Fri Dec 13 10:11:02 2013 -0800
Merge pull request #260 from ScrapCodes/scala-2.10
[Review comments on the PR for scala 2.10 migration.]
0aeb182 Thu Dec 12 21:14:42 2013 -0800
Merge pull request #255 from ScrapCodes/scala-2.10
[Disabled yarn 2.2 in sbt and mvn build and added a message in the sbt build.]
2e89398 Wed Dec 11 23:10:53 2013 -0800
Merge pull request #254 from ScrapCodes/scala-2.10
[Scala 2.10 migration]
ce6ca4e Wed Dec 11 22:30:54 2013 -0800
Merge pull request #97 from dcrankshaw/fix-rddtop
[Added BoundedPriorityQueue kryo registrator. Fixes top issue.]
d2efe13 Tue Dec 10 13:01:26 2013 -0800
Merge pull request #250 from pwendell/master
[README incorrectly suggests build sources spark-env.sh]
6169fe1 Mon Dec 9 16:51:36 2013 -0800
Merge pull request #246 from pwendell/master
[Add missing license headers]
d992ec6 Sun Dec 8 20:49:20 2013 -0800
Merge pull request #195 from dhardy92/fix_DebScriptPackage
[[Deb] fix package of Spark classes adding org.apache prefix in scripts embeded in .deb]
1f4a4bc Sat Dec 7 22:34:34 2013 -0800
Merge pull request #242 from pwendell/master
[Update broken links and add HDP 2.0 version string]
6494d62 Sat Dec 7 11:56:16 2013 -0800
Merge pull request #240 from pwendell/master
[SPARK-917 Improve API links in nav bar]
f466f79 Sat Dec 7 11:51:52 2013 -0800
Merge pull request #239 from aarondav/nit
[Correct spellling error in configuration.md]
3abfbfb Sat Dec 7 11:24:19 2013 -0800
Merge pull request #92 from ankurdave/rdd-names
[Set RDD names for easy debugging]
31e8a14 Fri Dec 6 21:49:55 2013 -0800
Merge pull request #90 from amplab/pregel-replicate-changed
[Replicate only changed vertices]
10c3c0c Fri Dec 6 20:29:45 2013 -0800
Merge pull request #237 from pwendell/formatting-fix
[Formatting fix]
1b38f5f Fri Dec 6 20:16:15 2013 -0800
Merge pull request #236 from pwendell/shuffle-docs
[Adding disclaimer for shuffle file consolidation]
e5d5728 Fri Dec 6 20:14:56 2013 -0800
Merge pull request #235 from pwendell/master
[Minor doc fixes and updating README]
241336a Fri Dec 6 17:29:03 2013 -0800
Merge pull request #234 from alig/master
[Updated documentation about the YARN v2.2 build process]
e039234 Fri Dec 6 11:49:59 2013 -0800
Merge pull request #190 from markhamstra/Stages4Jobs
[stageId <--> jobId mapping in DAGScheduler]
bfa6860 Fri Dec 6 11:04:03 2013 -0800
Merge pull request #233 from hsaputra/changecontexttobackend
[Change the name of input argument in ClusterScheduler#initialize from context to backend.]
3fb302c Fri Dec 6 11:03:32 2013 -0800
Merge pull request #205 from kayousterhout/logging
[Added logging of scheduler delays to UI]
87676a6 Fri Dec 6 11:01:42 2013 -0800
Merge pull request #220 from rxin/zippart
[Memoize preferred locations in ZippedPartitionsBaseRDD]
0780498 Thu Dec 5 23:29:42 2013 -0800
Merge pull request #232 from markhamstra/FiniteWait
[jobWaiter.synchronized before jobWaiter.wait]
1c8500e Thu Dec 5 16:25:44 2013 -0800
Merge pull request #88 from amplab/varenc
[Fixed a bug that variable encoding doesn't work for ints that use all 64 bits.]
e0bcaa0 Thu Dec 5 12:37:02 2013 -0800
Merge pull request #86 from ankurdave/vid-varenc
[Finish work on #85]
5d46025 Thu Dec 5 12:31:24 2013 -0800
Merge pull request #228 from pwendell/master
[Document missing configs and set shuffle consolidation to false.]
3e96b9a Thu Dec 5 12:07:36 2013 -0800
Merge pull request #85 from ankurdave/vid-varenc
[Always write Vids using variable encoding]
72b6961 Wed Dec 4 23:33:04 2013 -0800
Merge pull request #199 from harveyfeng/yarn-2.2
[Hadoop 2.2 migration]
e0347ba Wed Dec 4 17:38:06 2013 -0800
Merge pull request #83 from ankurdave/fix-tests
[Fix compile errors in GraphSuite and SerializerSuite]
182f9ba Wed Dec 4 15:52:07 2013 -0800
Merge pull request #227 from pwendell/master
[Fix small bug in web UI and minor clean-up.]
cbd3b75 Wed Dec 4 15:35:26 2013 -0800
Merge pull request #81 from amplab/clean1
[Codebase refactoring]
b9e7609 Wed Dec 4 14:42:09 2013 -0800
Merge pull request #225 from ash211/patch-3
[Add missing space after "Serialized" in StorageLevel]
055462c Wed Dec 4 14:02:11 2013 -0800
Merge pull request #226 from ash211/patch-4
[Typo: applicaton]
d6e5473 Wed Dec 4 10:28:50 2013 -0800
Merge pull request #223 from rxin/transient
[Mark partitioner, name, and generator field in RDD as @transient.]
8a3475a Tue Dec 3 14:21:40 2013 -0800
Merge pull request #218 from JoshRosen/spark-970-pyspark-unicode-error
[Fix UnicodeEncodeError in PySpark saveAsTextFile() (SPARK-970)]
46b87b8 Tue Dec 3 00:41:11 2013 -0800
Merge pull request #2 from colorant/yarn-client-2.2
[Fix pom.xml for maven build]
58d9bbc Mon Dec 2 21:58:53 2013 -0800
Merge pull request #217 from aarondav/mesos-urls
[Re-enable zk:// urls for Mesos SparkContexts]
740922f Sun Dec 1 12:46:58 2013 -0800
Merge pull request #219 from sundeepn/schedulerexception
[Scheduler quits when newStage fails]
60e23a5 Sat Nov 30 23:38:49 2013 -0800
Merge pull request #216 from liancheng/fix-spark-966
[Bugfix: SPARK-965 & SPARK-966]
34ee814 Sat Nov 30 15:10:30 2013 -0800
Merged Ankur's pull request #80 and fixed subgraph.
[]
743a31a Wed Nov 27 18:24:39 2013 -0800
Merge pull request #210 from haitaoyao/http-timeout
[add http timeout for httpbroadcast]
993e293 Wed Nov 27 00:57:54 2013 -0800
Merge pull request #1 from colorant/yarn-client-2.2
[Port yarn-client mode for new-yarn]
fb6875d Tue Nov 26 20:55:40 2013 -0800
Merge pull request #146 from JoshRosen/pyspark-custom-serializers
[Custom Serializers for PySpark]
330ada1 Tue Nov 26 19:08:33 2013 -0800
Merge pull request #207 from henrydavidge/master
[Log a warning if a task's serialized size is very big]
615213f Tue Nov 26 19:07:20 2013 -0800
Merge pull request #212 from markhamstra/SPARK-963
[[SPARK-963] Fixed races in JobLoggerSuite]
cb976df Tue Nov 26 10:23:19 2013 -0800
Merge pull request #209 from pwendell/better-docs
[Improve docs for shuffle instrumentation]
18d6df0 Tue Nov 26 00:00:07 2013 -0800
Merge pull request #86 from holdenk/master
[Add histogram functionality to DoubleRDDFunctions]
0e2109d Mon Nov 25 20:48:37 2013 -0800
Merge pull request #204 from rxin/hash
[OpenHashSet fixes]
c46067f Mon Nov 25 19:09:31 2013 -0800
Merge pull request #206 from ash211/patch-2
[Update tuning.md]
14bb465 Mon Nov 25 18:50:18 2013 -0800
Merge pull request #201 from rxin/mappartitions
[Use the proper partition index in mapPartitionsWIthIndex]
eb4296c Mon Nov 25 15:25:29 2013 -0800
Merge pull request #101 from colorant/yarn-client-scheduler
[For SPARK-527, Support spark-shell when running on YARN]
466fd06 Mon Nov 25 18:27:26 2013 +0800
Incorporated ideas from pull request #200. - Use Murmur Hash 3 finalization step to scramble the bits of HashCode instead of the simpler version in java.util.HashMap; the latter one had trouble with ranges of consecutive integers. Murmur Hash 3 is used by fastutil.
[- Don't check keys for equality when re-inserting due to growing the]
088995f Mon Nov 25 00:57:51 2013 -0800
Merge pull request #77 from amplab/upgrade
[Sync with Spark master]
62889c4 Mon Nov 25 11:27:45 2013 +0800
Merge pull request #203 from witgo/master
[ Fix Maven build for metrics-graphite]
6af03ed Sun Nov 24 16:42:37 2013 -0800
Merge pull request #76 from dcrankshaw/fix_partitioners
[Actually use partitioner command line args in Analytics.]
859d62d Sun Nov 24 16:19:51 2013 -0800
Merge pull request #151 from russellcardullo/add-graphite-sink
[Add graphite sink for metrics]
65de73c Sun Nov 24 15:52:33 2013 -0800
Merge pull request #185 from mkolod/random-number-generator
[XORShift RNG with unit tests and benchmark]
972171b Mon Nov 25 07:50:46 2013 +0800
Merge pull request #197 from aarondav/patrick-fix
[Fix 'timeWriting' stat for shuffle files]
a1a7e36 Sun Nov 24 05:15:09 2013 -0800
Merge pull request #75 from amplab/simplify
[Simplify GraphImpl internals]
718cc80 Sun Nov 24 11:02:02 2013 +0800
Merge pull request #200 from mateiz/hash-fix
[AppendOnlyMap fixes]
51aa9d6 Sat Nov 23 19:46:46 2013 +0800
Merge pull request #198 from ankurdave/zipPartitions-preservesPartitioning
[Support preservesPartitioning in RDD.zipPartitions]
18ce7e9 Fri Nov 22 17:02:40 2013 -0800
Merge pull request #73 from jegonzal/TriangleCount
[Triangle count]
086b097 Fri Nov 22 10:26:39 2013 +0800
Merge pull request #193 from aoiwelle/patch-1
[Fix Kryo Serializer buffer documentation inconsistency]
f20093c Fri Nov 22 10:12:13 2013 +0800
Merge pull request #196 from pwendell/master
[TimeTrackingOutputStream should pass on calls to close() and flush().]
4b89501 Wed Nov 20 10:36:10 2013 -0800
Merge pull request #191 from hsaputra/removesemicolonscala
[Cleanup to remove semicolons (;) from Scala code]
202f8e6 Wed Nov 20 03:26:08 2013 -0800
Merge pull request #74 from dcrankshaw/remove_sleep
[Removed sleep from pagerank in Analytics]
74ade9e Tue Nov 19 16:53:58 2013 -0800
Merge pull request #62 from dcrankshaw/partitioners
[Allow user to choose a partitioner at runtime]
f568912 Tue Nov 19 16:11:31 2013 -0800
Merge pull request #181 from BlackNiuza/fix_tasks_number
[correct number of tasks in ExecutorsUI]
aa638ed Tue Nov 19 16:05:44 2013 -0800
Merge pull request #189 from tgravescs/sparkYarnErrorHandling
[Impove Spark on Yarn Error handling]
5592580 Tue Nov 19 16:04:01 2013 -0800
Merge pull request #187 from aarondav/example-bcast-test
[Enable the Broadcast examples to work in a cluster setting]
99cfe89 Mon Nov 18 22:00:36 2013 -0500
Updates to reflect pull request code review
[]
e2ebc3a Sun Nov 17 18:42:18 2013 -0800
Merge pull request #182 from rxin/vector
[Slightly enhanced PrimitiveVector:]
26f616d Sun Nov 17 18:18:16 2013 -0800
Merge pull request #3 from aarondav/pv-test
[Add PrimitiveVectorSuite and fix bug in resize()]
1b5b358 Sat Nov 16 11:44:10 2013 -0800
Merge pull request #178 from hsaputra/simplecleanupcode
[Simple cleanup on Spark's Scala code]
62a2a71 Fri Nov 15 13:12:07 2013 -0800
Merge pull request #65 from amplab/varenc
[Use variable encoding for ints, longs, and doubles in the specialized serializers.]
f6b2e59 Thu Nov 14 23:04:55 2013 -0800
Merge pull request #1 from aarondav/scala210-master
[Various merge corrections]
96e0fb4 Thu Nov 14 22:29:28 2013 -0800
Merge pull request #173 from kayousterhout/scheduler_hang
[Fix bug where scheduler could hang after task failure.]
dfd40e9 Thu Nov 14 19:44:50 2013 -0800
Merge pull request #175 from kayousterhout/no_retry_not_serializable
[Don't retry tasks when they fail due to a NotSerializableException]
ed25105 Thu Nov 14 19:43:55 2013 -0800
Merge pull request #174 from ahirreddy/master
[Write Spark UI url to driver file on HDFS]
1a4cfbe Thu Nov 14 10:32:11 2013 -0800
Merge pull request #169 from kayousterhout/mesos_fix
[Don't ignore spark.cores.max when using Mesos Coarse mode]
5a4f483 Thu Nov 14 10:30:36 2013 -0800
Merge pull request #170 from liancheng/hadooprdd-doc-typo
[Fixed a scaladoc typo in HadoopRDD.scala]
d76f520 Thu Nov 14 10:25:48 2013 -0800
Merge pull request #171 from RIA-pierre-borckmans/master
[Fixed typos in the CDH4 distributions version codes.]
2c39d80 Wed Nov 13 23:28:01 2013 -0800
Merge pull request #69 from jegonzal/MissingVertices
[Addressing issue in Graph creation]
33b2dea Wed Nov 13 17:55:58 2013 -0800
Merge pull request #1 from ankurdave/MissingVertices
[During graph creation, create eTable earlier]
2054c61 Wed Nov 13 16:49:55 2013 -0800
Merge pull request #159 from liancheng/dagscheduler-actor-refine
[Migrate the daemon thread started by DAGScheduler to Akka actor]
9290e5b Wed Nov 13 16:48:44 2013 -0800
Merge pull request #165 from NathanHowell/kerberos-master
[spark-assembly.jar fails to authenticate with YARN ResourceManager]
a81fcb7 Wed Nov 13 10:41:01 2013 -0800
Merge pull request #68 from jegonzal/BitSetSetUntilBug
[Addressing bug in BitSet.setUntil(ind)]
39af914 Wed Nov 13 08:39:05 2013 -0800
Merge pull request #166 from ahirreddy/simr-spark-ui
[SIMR Backend Scheduler will now write Spark UI URL to HDFS, which is to ...]
f49ea28 Tue Nov 12 19:13:39 2013 -0800
Merge pull request #137 from tgravescs/sparkYarnJarsHdfsRebase
[Allow spark on yarn to be run from HDFS.]
87f2f4e Tue Nov 12 16:26:09 2013 -0800
Merge pull request #153 from ankurdave/stop-spot-cluster
[Enable stopping and starting a spot cluster]
b8bf04a Tue Nov 12 16:19:50 2013 -0800
Merge pull request #160 from xiajunluan/JIRA-923
[Fix bug JIRA-923]
dfd1ebc Tue Nov 12 09:10:05 2013 -0800
Merge pull request #164 from tdas/kafka-fix
[Made block generator thread safe to fix Kafka bug.]
2e8d450 Mon Nov 11 17:34:09 2013 -0800
Merge pull request #63 from jegonzal/VertexSetCleanup
[Cleanup of VertexSetRDD]
b8e294a Mon Nov 11 16:25:42 2013 -0800
Merge pull request #61 from ankurdave/pid2vid
[Shuffle replicated vertex attributes efficiently in columnar format]
3d7277c Mon Nov 11 15:49:28 2013 -0800
Merge pull request #55 from ankurdave/aggregateNeighbors-variants
[Specialize mapReduceTriplets for accessing subsets of vertex attributes]
23b53ef Mon Nov 11 12:30:02 2013 -0800
Merge pull request #156 from haoyuan/master
[add tachyon module]
1a06f70 Sun Nov 10 10:54:44 2013 -0800
Merge pull request #60 from amplab/rxin
[Looks good to me.]
58d4f6c Sun Nov 10 09:23:56 2013 -0800
Merge pull request #157 from rxin/kryo
[3 Kryo related changes.]
3efc019 Sat Nov 9 17:53:49 2013 -0800
Merge pull request #147 from JoshRosen/fix-java-api-completeness-checker
[Add spark-tools assembly to spark-class'ss classpath]
87954d4 Sat Nov 9 17:53:25 2013 -0800
Merge pull request #154 from soulmachine/ClusterScheduler
[Replace the thread inside ClusterScheduler.start() with an Akka scheduler]
f6c9462 Sat Nov 9 16:14:45 2013 -0800
Merge pull request #58 from jegonzal/KryoMessages
[Kryo messages]
83bf192 Sat Nov 9 15:40:29 2013 -0800
Merge pull request #155 from rxin/jobgroup
[Don't reset job group when a new job description is set.]
8af99f2 Sat Nov 9 13:48:00 2013 -0800
Merge pull request #149 from tgravescs/fixSecureHdfsAccess
[Fix secure hdfs access for spark on yarn]
72a601e Sat Nov 9 11:55:16 2013 -0800
Merge pull request #152 from rxin/repl
[Propagate SparkContext local properties from spark-repl caller thread to the repl execution thread.]
6ee05be Thu Nov 7 19:12:41 2013 -0800
Merge pull request #49 from jegonzal/graphxshell
[GraphX Console with Logo Text]
a9f96b5 Thu Nov 7 18:56:56 2013 -0800
Merge pull request #56 from jegonzal/PregelAPIChanges
[Changing Pregel API to use mapReduceTriplets instead of aggregateNeighbors]
5907137 Thu Nov 7 16:58:31 2013 -0800
Merge pull request #54 from amplab/rxin
[Converted for loops to while loops in EdgePartition.]
edf4164 Thu Nov 7 16:22:43 2013 -0800
Merge pull request #53 from amplab/rxin
[Added GraphX to classpath.]
c379e10 Thu Nov 7 16:01:47 2013 -0800
Merge pull request #51 from jegonzal/VertexSetRDD
[Reverting to Array based (materialized) output in VertexSetRDD]
3d4ad84 Thu Nov 7 11:08:27 2013 -0800
Merge pull request #148 from squito/include_appId
[Include appId in executor cmd line args]
be7e8da Wed Nov 6 23:22:47 2013 -0800
Merge pull request #23 from jerryshao/multi-user
[Add Spark multi-user support for standalone mode and Mesos]
aadeda5 Wed Nov 6 13:27:47 2013 -0800
Merge pull request #144 from liancheng/runjob-clean
[Removed unused return value in SparkContext.runJob]
951024f Wed Nov 6 09:36:14 2013 -0800
Merge pull request #145 from aarondav/sls-fix
[Attempt to fix SparkListenerSuite breakage]
bf4e613 Tue Nov 5 23:14:09 2013 -0800
Merge pull request #143 from rxin/scheduler-hang
[Ignore a task update status if the executor doesn't exist anymore.]
9f7b9bb Tue Nov 5 10:42:19 2013 -0800
Merge pull request #142 from liancheng/dagscheduler-pattern-matching
[Using case class deep match to simplify code in DAGScheduler.processEvent]
ca44b51 Tue Nov 5 01:32:55 2013 -0800
Merge pull request #50 from amplab/mergemerge
[Merge Spark master into graphx]
8106532 Mon Nov 4 20:47:14 2013 -0800
Merge pull request #139 from aarondav/shuffle-next
[Never store shuffle blocks in BlockManager]
0b26a39 Mon Nov 4 18:22:06 2013 -0800
Merge pull request #128 from shimingfei/joblogger-doc
[add javadoc to JobLogger, and some small fix]
7a26104 Mon Nov 4 17:54:06 2013 -0800
Merge pull request #130 from aarondav/shuffle
[Memory-optimized shuffle file consolidation]
b5dc339 Sun Nov 3 20:43:15 2013 -0800
Merge pull request #70 from rxin/hash1
[Fast, memory-efficient hash set, hash table implementations optimized for primitive data types.]
41ead7a Sat Nov 2 14:41:50 2013 -0700
Merge pull request #133 from Mistobaan/link_fix
[update default github]
d407c07 Sat Nov 2 14:36:37 2013 -0700
Merge pull request #134 from rxin/readme
[Fixed a typo in Hadoop version in README.]
e7c7b80 Fri Nov 1 17:58:10 2013 -0700
Merge pull request #132 from Mistobaan/doc_fix
[fix persistent-hdfs]
d6d11c2 Fri Nov 1 15:40:33 2013 -0700
Merge pull request #129 from velvia/2013-11/document-local-uris
[Document & finish support for local: URIs]
99bfcc9 Thu Oct 31 21:38:10 2013 -0700
Merge pull request #46 from jegonzal/VertexSetWithHashSet
[Switched VertexSetRDD and GraphImpl to use OpenHashSet]
fcaaf86 Thu Oct 31 18:27:30 2013 -0700
Merge pull request #44 from jegonzal/rxinBitSet
[Switching to VertexSetRDD to use @rxin BitSet and OpenHash ]
3f3c727 Thu Oct 31 09:52:25 2013 -0700
Merge pull request #41 from jegonzal/LineageTracking
[Optimizing Graph Lineage]
944f6b8 Thu Oct 31 09:40:35 2013 -0700
Merge pull request #43 from amplab/FixBitSetCastException
[Fix BitSet cast exception]
8f1098a Wed Oct 30 20:11:48 2013 -0700
Merge pull request #117 from stephenh/avoid_concurrent_modification_exception
[Handle ConcurrentModificationExceptions in SparkContext init.]
dc9ce16 Wed Oct 30 17:01:56 2013 -0700
Merge pull request #126 from kayousterhout/local_fix
[Fixed incorrect log message in local scheduler]
33de11c Wed Oct 30 16:58:27 2013 -0700
Merge pull request #124 from tgravescs/sparkHadoopUtilFix
[Pull SparkHadoopUtil out of SparkEnv (jira SPARK-886)]
a0c86c3 Wed Oct 30 15:34:39 2013 -0700
Merge pull request #38 from jegonzal/Documentation
[Improving Documentation]
618c1f6 Wed Oct 30 12:03:44 2013 -0700
Merge pull request #125 from velvia/2013-10/local-jar-uri
[Add support for local:// URI scheme for addJars()]
745dc42 Tue Oct 29 23:47:10 2013 -0700
Merge pull request #118 from JoshRosen/blockinfo-memory-usage
[Reduce the memory footprint of BlockInfo objects]
06adf63 Tue Oct 29 16:43:46 2013 -0700
Merge pull request #33 from kellrott/master
[Fixing graph/pom.xml]
098768e Tue Oct 29 15:08:36 2013 -0700
Merge pull request #37 from jegonzal/AnalyticsCleanup
[Updated Connected Components and Pregel Docs]
f0e23a0 Tue Oct 29 01:41:44 2013 -0400
Merge pull request #119 from soulmachine/master
[A little revise for the document]
aec9bf9 Sun Oct 27 19:32:00 2013 -0700
Merge pull request #112 from kayousterhout/ui_task_attempt_id
[Display both task ID and task attempt ID in UI, and rename taskId to taskAttemptId]
d4df474 Sun Oct 27 22:11:21 2013 -0400
Merge pull request #115 from aarondav/shuffle-fix
[Eliminate extra memory usage when shuffle file consolidation is disabled]
e018f2d Sat Oct 26 11:39:15 2013 -0700
Merge pull request #113 from pwendell/master
[Improve error message when multiple assembly jars are present.]
662ee9f Sat Oct 26 11:35:59 2013 -0700
Merge pull request #114 from soulmachine/master
[A little revise for the document]
bab496c Fri Oct 25 18:28:43 2013 -0700
Merge pull request #108 from alig/master
[Changes to enable executing by using HDFS as a synchronization point between driver and executors, as well as ensuring executors exit properly.]
d307db6 Fri Oct 25 17:26:06 2013 -0700
Merge pull request #102 from tdas/transform
[Added new Spark Streaming operations]
85e2cab Fri Oct 25 14:46:06 2013 -0700
Merge pull request #111 from kayousterhout/ui_name
[Properly display the name of a stage in the UI.]
ab35ec4 Fri Oct 25 10:16:18 2013 -0700
Merge pull request #110 from pwendell/master
[Exclude jopt from kafka dependency.]
4f2c943 Thu Oct 24 22:32:02 2013 -0700
Merge pull request #109 from pwendell/master
[Adding Java/Java Streaming versions of `repartition` with associated tests]
99ad4a6 Thu Oct 24 17:08:39 2013 -0700
Merge pull request #106 from pwendell/master
[Add a `repartition` operator.]
5429d62 Thu Oct 24 11:15:55 2013 -0700
Merge pull request #107 from ScrapCodes/scala-2.10
[Updating to latest akka 2.2.3, which fixes our only failing test Driver Suite.]
6f82c42 Thu Oct 24 11:09:46 2013 -0700
Merge pull request #34 from jegonzal/AnalyticsCleanup
[Analytics Cleanup]
1dc776b Wed Oct 23 22:05:52 2013 -0700
Merge pull request #93 from kayousterhout/ui_new_state
[Show "GETTING_RESULTS" state in UI.]
c4b187d Wed Oct 23 21:56:18 2013 -0700
Merge pull request #105 from pwendell/doc-fix
[Fixing broken links in programming guide]
a098438 Wed Oct 23 18:03:08 2013 -0700
Merge pull request #103 from JoshRosen/unpersist-fix
[Add unpersist() to JavaDoubleRDD and JavaPairRDD.]
dd65964 Wed Oct 23 15:07:59 2013 -0700
Merge pull request #64 from prabeesh/master
[MQTT Adapter for Spark Streaming]
452aa36 Tue Oct 22 23:15:33 2013 -0700
Merge pull request #97 from ewencp/pyspark-system-properties
[Add classmethod to SparkContext to set system properties.]
9dfcf53 Tue Oct 22 16:01:42 2013 -0700
Merge pull request #100 from JoshRosen/spark-902
[Remove redundant Java Function call() definitions]
49d5cda Tue Oct 22 15:38:02 2013 -0700
Merge pull request #30 from jegonzal/VertexSetRDD_Tests
[Testing and Documenting VertexSetRDD]
97184de Tue Oct 22 13:10:14 2013 -0700
Merge pull request #99 from pwendell/master
[Use correct formatting for comments in StoragePerfTester]
c404adb Tue Oct 22 11:30:19 2013 -0700
Merge pull request #90 from pwendell/master
[SPARK-940: Do not directly pass Stage objects to SparkListener.]
aa9019f Tue Oct 22 10:30:02 2013 -0700
Merge pull request #98 from aarondav/docs
[Docs: Fix links to RDD API documentation]
a0e08f0 Tue Oct 22 10:20:43 2013 -0700
Merge pull request #82 from JoshRosen/map-output-tracker-refactoring
[Split MapOutputTracker into Master/Worker classes]
b84193c Mon Oct 21 23:35:13 2013 -0700
Merge pull request #92 from tgravescs/sparkYarnFixClasspath
[Fix the Worker to use CoarseGrainedExecutorBackend and modify classpath ...]
731c94e Mon Oct 21 23:31:38 2013 -0700
Merge pull request #56 from jerryshao/kafka-0.8-dev
[Upgrade Kafka 0.7.2 to Kafka 0.8.0-beta1 for Spark Streaming]
48952d6 Mon Oct 21 22:45:00 2013 -0700
Merge pull request #87 from aarondav/shuffle-base
[Basic shuffle file consolidation]
a51359c Mon Oct 21 20:33:29 2013 -0700
Merge pull request #95 from aarondav/perftest
[Minor: Put StoragePerfTester in org/apache/]
39d2e9b Mon Oct 21 18:58:48 2013 -0700
Merge pull request #94 from aarondav/mesos-fix
[Fix mesos urls]
aa61bfd Mon Oct 21 11:57:05 2013 -0700
Merge pull request #88 from rxin/clean
[Made the following traits/interfaces/classes non-public:]
35886f3 Sun Oct 20 22:20:32 2013 -0700
Merge pull request #41 from pwendell/shuffle-benchmark
[Provide Instrumentation for Shuffle Write Performance]
5b9380e Sun Oct 20 21:03:51 2013 -0700
Merge pull request #89 from rxin/executor
[Don't setup the uncaught exception handler in local mode.]
261bcf2 Sun Oct 20 17:59:51 2013 -0700
Merge pull request #80 from rxin/build
[Exclusion rules for Maven build files.]
edc5e3f Sun Oct 20 17:18:06 2013 -0700
Merge pull request #75 from JoshRosen/block-manager-cleanup
[Code de-duplication in BlockManager]
2a7ae17 Sun Oct 20 11:45:21 2013 -0700
Merge pull request #84 from rxin/kill1
[Added documentation for setJobGroup. Also some minor cleanup in SparkContext.]
e4abb75 Sun Oct 20 09:38:37 2013 -0700
Merge pull request #85 from rxin/clean
[Moved the top level spark package object from spark to org.apache.spark]
136b9b3 Sun Oct 20 02:58:26 2013 -0700
Basic shuffle file consolidation
[The Spark shuffle phase can produce a large number of files, as one file is created]
747f538 Sat Oct 19 23:40:40 2013 -0700
Merge pull request #83 from ewencp/pyspark-accumulator-add-method
[Add an add() method to pyspark accumulators.]
6511bbe Sat Oct 19 11:34:56 2013 -0700
Merge pull request #78 from mosharaf/master
[Removed BitTorrentBroadcast and TreeBroadcast.]
f628804 Fri Oct 18 23:19:42 2013 -0700
Merge pull request #76 from pwendell/master
[Clarify compression property.]
599dcb0 Fri Oct 18 22:49:00 2013 -0700
Merge pull request #74 from rxin/kill
[Job cancellation via job group id.]
9cf43cf Fri Oct 18 22:07:21 2013 -0700
Merge pull request #28 from jegonzal/VertexSetRDD
[Refactoring IndexedRDD to VertexSetRDD.]
f888a5b Fri Oct 18 22:06:58 2013 -0700
Merge pull request #29 from ankurdave/unit-tests
[Unit tests for Graph and GraphOps]
8de9706 Fri Oct 18 20:32:39 2013 -0700
Merge pull request #66 from shivaram/sbt-assembly-deps
[Add SBT target to assemble dependencies]
e5316d0 Fri Oct 18 20:30:56 2013 -0700
Merge pull request #68 from mosharaf/master
[Faster and stable/reliable broadcast]
8d528af Fri Oct 18 20:24:10 2013 -0700
Merge pull request #71 from aarondav/scdefaults
[Spark shell exits if it cannot create SparkContext]
0794bd7 Fri Oct 18 18:59:58 2013 -0700
Merge pull request #27 from jegonzal/removed_indexedrdd_from_core
[Removing IndexedRDD changes for spark/core]
099977f Thu Oct 17 14:17:08 2013 -0700
Merge pull request #26 from ankurdave/split-vTableReplicated
[Great work!]
fc26e5b Thu Oct 17 13:21:07 2013 -0700
Merge pull request #69 from KarthikTunga/master
[Fix for issue SPARK-627. Implementing --config argument in the scripts.]
cf64f63 Thu Oct 17 11:12:28 2013 -0700
Merge pull request #67 from kayousterhout/remove_tsl
[Removed TaskSchedulerListener interface.]
f9973ca Wed Oct 16 15:58:41 2013 -0700
Merge pull request #65 from tgravescs/fixYarn
[Fix yarn build]
28e9c2a Tue Oct 15 23:59:56 2013 -0700
Merge pull request #63 from pwendell/master
[Fixing spark streaming example and a bug in examples build.]
4e46fde Tue Oct 15 23:14:27 2013 -0700
Merge pull request #62 from harveyfeng/master
[Make TaskContext's stageId publicly accessible.]
b534606 Tue Oct 15 21:25:03 2013 -0700
Merge pull request #8 from vchekan/checkpoint-ttl-restore
[Serialize and restore spark.cleaner.ttl to savepoint]
6dbd220 Tue Oct 15 19:02:57 2013 -0700
Merge pull request #34 from kayousterhout/rename
[Renamed StandaloneX to CoarseGrainedX.]
983b83f Tue Oct 15 19:02:46 2013 -0700
Merge pull request #61 from kayousterhout/daemon_thread
[Unified daemon thread pools]
3249e0e Tue Oct 15 14:12:33 2013 -0700
Merge pull request #59 from rxin/warning
[Bump up logging level to warning for failed tasks.]
678dec6 Tue Oct 15 10:51:46 2013 -0700
Merge pull request #58 from hsaputra/update-pom-asf
[Update pom.xml to use version 13 of the ASF parent pom]
e33b183 Mon Oct 14 22:25:47 2013 -0700
Merge pull request #29 from rxin/kill
[Job killing]
3b11f43 Mon Oct 14 14:20:01 2013 -0700
Merge pull request #57 from aarondav/bid
[Refactor BlockId into an actual type]
9979690 Sat Oct 12 21:23:26 2013 -0700
Merge pull request #52 from harveyfeng/hadoop-closure
[Add an optional closure parameter to HadoopRDD instantiation to use when creating local JobConfs.]
dca8009 Fri Oct 11 16:08:15 2013 -0700
Merge pull request #54 from aoiwelle/remove_unused_imports
[Remove unnecessary mutable imports]
0e5052b Fri Oct 11 15:45:16 2013 -0700
Merge pull request #51 from ScrapCodes/scala-2.10
[Scala 2.10]
fb25f32 Fri Oct 11 15:44:43 2013 -0700
Merge pull request #53 from witgo/master
[Add a zookeeper compile dependency to fix build in maven]
d6ead47 Fri Oct 11 15:43:01 2013 -0700
Merge pull request #32 from mridulm/master
[Address review comments, move to incubator spark]
c71499b Thu Oct 10 17:16:42 2013 -0700
Merge pull request #19 from aarondav/master-zk
[Standalone Scheduler fault tolerance using ZooKeeper]
5867a82 Thu Oct 10 14:02:37 2013 -0700
Merge pull request #19 from dcrankshaw/master
[Merge canonical 2d partitioner and group edges into benchmarks]
cd08f73 Thu Oct 10 13:55:47 2013 -0700
Merge pull request #44 from mateiz/fast-map
[A fast and low-memory append-only map for shuffle operations]
4b46d51 Thu Oct 10 13:35:36 2013 -0700
Merge pull request #17 from amplab/product2
[product 2 change]
320418f Wed Oct 9 16:55:30 2013 -0700
Merge pull request #49 from mateiz/kryo-fix-2
[Fix Chill serialization of Range objects]
215238c Wed Oct 9 16:49:44 2013 -0700
Merge pull request #50 from kayousterhout/SPARK-908
[Fix race condition in SparkListenerSuite (fixes SPARK-908).]
7827efc Wed Oct 9 15:07:25 2013 -0700
Merge pull request #46 from mateiz/py-sort-update
[Fix PySpark docs and an overly long line of code after #38]
7b3ae04 Wed Oct 9 12:14:19 2013 -0700
Merge pull request #45 from pwendell/metrics_units
[Use standard abbreviation in metrics description (MBytes -> MB)]
b4fa11f Wed Oct 9 11:59:47 2013 -0700
Merge pull request #38 from AndreSchumacher/pyspark_sorting
[SPARK-705: implement sortByKey() in PySpark]
19d445d Wed Oct 9 11:08:34 2013 -0700
Merge pull request #22 from GraceH/metrics-naming
[SPARK-900 Use coarser grained naming for metrics]
7d50f9f Wed Oct 9 10:32:42 2013 -0700
Merge pull request #35 from MartinWeindel/scala-2.10
[Fixing inconsistencies and warnings on Scala 2.10 branch]
3218fa7 Tue Oct 8 23:44:55 2013 -0700
Merge pull request #4 from MLnick/implicit-als
[Adding algorithm for implicit feedback data to ALS]
e67d5b9 Tue Oct 8 22:57:38 2013 -0700
Merge pull request #43 from mateiz/kryo-fix
[Don't allocate Kryo buffers unless needed]
ea34c52 Mon Oct 7 20:45:58 2013 -0700
Merge pull request #42 from pwendell/shuffle-read-perf
[Fix inconsistent and incorrect log messages in shuffle read path]
02f37ee Mon Oct 7 15:48:52 2013 -0700
Merge pull request #39 from pwendell/master
[Adding Shark 0.7.1 to EC2 scripts]
213b70a Mon Oct 7 10:54:22 2013 -0700
Merge pull request #31 from sundeepn/branch-0.8
[Resolving package conflicts with hadoop 0.23.9]
d585613 Sat Oct 5 22:57:05 2013 -0700
Merge pull request #37 from pwendell/merge-0.8
[merge in remaining changes from `branch-0.8`]
4a25b11 Sat Oct 5 19:28:55 2013 -0700
Merge pull request #20 from harveyfeng/hadoop-config-cache
[Allow users to pass broadcasted Configurations and cache InputFormats across Hadoop file reads.]
8fc68d0 Sat Oct 5 17:24:35 2013 -0700
Merge pull request #36 from pwendell/versions
[Bumping EC2 default version in master to .]
100222b Sat Oct 5 13:38:59 2013 -0700
Merge pull request #27 from davidmccauley/master
[SPARK-920/921 - JSON endpoint updates]
0864193 Sat Oct 5 13:25:18 2013 -0700
Merge pull request #33 from AndreSchumacher/pyspark_partition_key_change
[Fixing SPARK-602: PythonPartitioner]
61ffcde Fri Oct 4 10:52:17 2013 -0700
Merge pull request #15 from dcrankshaw/master
[Add synthetic generators]
3fe12cc Fri Oct 4 10:51:28 2013 -0700
Merge pull request #946 from ScrapCodes/scala-2.10
[Fixed non termination of Executor backend, when sc.stop is not called and system.exit instead.]
232765f Thu Oct 3 12:00:48 2013 -0700
Merge pull request #26 from Du-Li/master
[fixed a wildcard bug in make-distribution.sh; ask sbt to check local]
405e69b Thu Oct 3 10:52:41 2013 -0700
Merge pull request #25 from CruncherBigData/master
[Update README: updated the link]
49dbfcc Thu Oct 3 10:52:06 2013 -0700
Merge pull request #28 from tgravescs/sparYarnAppName
[Allow users to set the application name for Spark on Yarn]
e597ea3 Wed Oct 2 21:14:24 2013 -0700
Merge pull request #10 from kayousterhout/results_through-bm
[Send Task results through the block manager when larger than Akka frame size (fixes SPARK-669).]
714fdab Thu Sep 26 14:28:55 2013 -0700
Merge pull request #17 from rxin/optimize
[Remove -optimize flag]
13eced7 Thu Sep 26 14:18:19 2013 -0700
Merge pull request #16 from pwendell/master
[Bug fix in master build]
70a0b99 Thu Sep 26 14:11:54 2013 -0700
Merge pull request #14 from kayousterhout/untangle_scheduler
[Improved organization of scheduling packages.]
afd03b2 Thu Sep 26 14:09:55 2013 -0700
Merge pull request #943 from ScrapCodes/scala-2.10
[Scala 2.10 with akka 2.2]
76677b8 Thu Sep 26 14:03:46 2013 -0700
Merge pull request #670 from jey/ec2-ssh-improvements
[EC2 SSH improvements]
c514cd1 Thu Sep 26 13:48:20 2013 -0700
Merge pull request #930 from holdenk/master
[Add mapPartitionsWithIndex]
560ee5c Thu Sep 26 11:27:34 2013 -0700
Merge pull request #7 from wannabeast/memorystore-fixes
[some minor fixes to MemoryStore]
6566a19 Thu Sep 26 08:01:04 2013 -0700
Merge pull request #9 from rxin/limit
[Smarter take/limit implementation.]
834686b Sun Sep 22 15:06:48 2013 -0700
Merge pull request #928 from jerryshao/fairscheduler-refactor
[Refactor FairSchedulableBuilder]
a2ea069 Sat Sep 21 23:04:42 2013 -0700
Merge pull request #937 from jerryshao/localProperties-fix
[Fix PR926 local properties issues in Spark Streaming like scenarios]
f06f2da Sat Sep 21 22:43:34 2013 -0700
Merge pull request #941 from ilikerps/master
[Add "org.apache." prefix to packages in spark-class]
7bb12a2 Sat Sep 21 22:42:46 2013 -0700
Merge pull request #940 from ankurdave/clear-port-properties-after-tests
[After unit tests, clear port properties unconditionally]
a00317b Fri Sep 20 11:29:31 2013 -0700
Merge pull request #1 from ankurdave/aggregateNeighbors-returns-graph
[Return Graph from Graph.aggregateNeighbors]
6a5e665 Thu Sep 19 22:41:44 2013 -0700
Merge pull request #3 from ankurdave/clear-port-properties-after-tests
[After unit tests, clear port properties unconditionally ]
68ad33a Thu Sep 19 21:30:27 2013 -0700
Merge pull request #2 from ankurdave/package-fixes
[Package fixes (spark.graph -> org.apache.spark.graph)]
cd7222c Thu Sep 19 14:21:24 2013 -0700
Merge pull request #938 from ilikerps/master
[Fix issue with spark_ec2 seeing empty security groups]
e0dd24d Sat Aug 31 17:54:15 2013 -0700
Merge pull request #879 from AndreSchumacher/scala-2.10
[PySpark: replacing class manifest by class tag for Scala 2.10.2 in rdd.py]
ad61349 Thu Jul 18 13:53:48 2013 -0700
Merge pull request #709 from ScrapCodes/scala-2.10
[Fixed warnings in scala 2.10 branch.]
a289ded Mon Jul 15 15:59:43 2013 -0700
Merge pull request #700 from ScrapCodes/scala-2.10
[Scala 2.10 ]
1044a95 Fri Jun 14 20:04:24 2013 -0700
Merge pull request #652 from ScrapCodes/scala-2.10
[Fixed maven build without netty fix]
4b57f83 Sat Apr 20 10:40:07 2013 -0700
Merge pull request #535 from ScrapCodes/scala-2.10-repl-port
[porting of repl to scala-2.10]
73b3fee Sun Jan 20 10:11:49 2013 -0800
Merge pull request #388 from folone/master
[Updated maven build configuration for Scala 2.10]
20adf27 Tue Jan 15 11:03:49 2013 -0800
Merge pull request #371 from folone/master
[Scala 2.10.0]
Release 0.8.0-incubating
2aff798 Sun Sep 15 14:05:04 2013 -0700
Merge pull request #933 from jey/yarn-typo-fix
[Fix typo in Maven build docs]
dbd2c4f Sun Sep 15 13:20:41 2013 -0700
Merge pull request #932 from pwendell/mesos-version
[Bumping Mesos version to 0.13.0]
9fb0b9d Sun Sep 15 13:02:53 2013 -0700
Merge pull request #931 from pwendell/yarn-docs
[Explain yarn.version in Maven build docs]
c4c1db2 Fri Sep 13 19:52:12 2013 -0700
Merge pull request #929 from pwendell/master
[Use different Hadoop version for YARN artifacts.]
a310de6 Wed Sep 11 19:36:11 2013 -0700
Merge pull request #926 from kayousterhout/dynamic
[Changed localProperties to use ThreadLocal (not DynamicVariable).]
58c7d8b Wed Sep 11 17:33:42 2013 -0700
Merge pull request #927 from benh/mesos-docs
[Updated Spark on Mesos documentation.]
91a59e6 Wed Sep 11 10:21:48 2013 -0700
Merge pull request #919 from mateiz/jets3t
[Add explicit jets3t dependency, which is excluded in hadoop-client]
b9128d3 Wed Sep 11 10:03:06 2013 -0700
Merge pull request #922 from pwendell/port-change
[Change default port number from 3030 to 4030.]
e07eef8 Wed Sep 11 07:35:39 2013 -0700
Merge pull request #925 from davidmccauley/master
[SPARK-894 - Not all WebUI fields delivered VIA JSON]
8432f27 Tue Sep 10 23:19:53 2013 -0700
Merge pull request #923 from haoyuan/master
[fix run-example script]
d40f140 Tue Sep 10 23:05:29 2013 -0700
Merge pull request #921 from pwendell/master
[Fix HDFS access bug with assembly build.]
0a6c051 Mon Sep 9 23:37:57 2013 -0700
Merge pull request #918 from pwendell/branch-0.8
[Update versions for 0.8.0 release.]
8c14f4b Mon Sep 9 22:07:58 2013 -0700
Merge pull request #917 from pwendell/master
[Document libgfortran dependency for MLBase]
c81377b Mon Sep 9 20:16:19 2013 -0700
Merge pull request #915 from ooyala/master
[Get rid of / improve ugly NPE when Utils.deleteRecursively() fails]
61d2a01 Mon Sep 9 18:21:01 2013 -0700
Merge pull request #916 from mateiz/mkdist-fix
[Fix copy issue in https://github.com/mesos/spark/pull/899]
a85758c Mon Sep 9 13:45:40 2013 -0700
Merge pull request #907 from stephenh/document_coalesce_shuffle
[Add better docs for coalesce.]
084fc36 Mon Sep 9 12:01:35 2013 -0700
Merge pull request #912 from tgravescs/ganglia-pom
[Add metrics-ganglia to core pom file]
0456384 Mon Sep 9 09:57:54 2013 -0700
Merge pull request #911 from pwendell/ganglia-sink
[Adding Manen dependency for Ganglia]
bf984e2 Sun Sep 8 23:50:24 2013 -0700
Merge pull request #890 from mridulm/master
[Fix hash bug]
e9d4f44 Sun Sep 8 23:36:48 2013 -0700
Merge pull request #909 from mateiz/exec-id-fix
[Fix an instance where full standalone mode executor IDs were passed to]
2447b1c Sun Sep 8 22:27:49 2013 -0700
Merge pull request #910 from mateiz/ml-doc-tweaks
[Small tweaks to MLlib docs]
7d3204b Sun Sep 8 21:39:12 2013 -0700
Merge pull request #905 from mateiz/docs2
[Job scheduling and cluster mode docs]
f1f8371 Sun Sep 8 21:26:11 2013 -0700
Merge pull request #896 from atalwalkar/master
[updated content]
f68848d Sun Sep 8 18:32:16 2013 -0700
Merge pull request #906 from pwendell/ganglia-sink
[Clean-up of Metrics Code/Docs and Add Ganglia Sink]
0b95799 Sun Sep 8 15:30:16 2013 -0700
Merge pull request #908 from pwendell/master
[Fix target JVM version in scala build]
04cfb3a Sun Sep 8 10:33:20 2013 -0700
Merge pull request #898 from ilikerps/660
[SPARK-660: Add StorageLevel support in Python]
38488ac Sun Sep 8 00:28:53 2013 -0700
Merge pull request #900 from pwendell/cdh-docs
[Provide docs to describe running on CDH/HDP cluster.]
a8e376e Sat Sep 7 21:16:01 2013 -0700
Merge pull request #904 from pwendell/master
[Adding Apache license to two files]
cfde85e Sat Sep 7 13:53:08 2013 -0700
Merge pull request #901 from ooyala/2013-09/0.8-doc-changes
[0.8 Doc changes for make-distribution.sh]
4a7813a Sat Sep 7 13:52:24 2013 -0700
Merge pull request #903 from rxin/resulttask
[Fixed the bug that ResultTask was not properly deserializing outputId.]
afe46ba Sat Sep 7 07:28:51 2013 -0700
Merge pull request #892 from jey/fix-yarn-assembly
[YARN build fixes]
2eebeff Fri Sep 6 15:25:22 2013 -0700
Merge pull request #897 from pwendell/master
[Docs describing Spark monitoring and instrumentation]
ddcb9d3 Thu Sep 5 23:54:09 2013 -0700
Merge pull request #895 from ilikerps/821
[SPARK-821: Don't cache results when action run locally on driver]
699c331 Thu Sep 5 20:21:53 2013 -0700
Merge pull request #891 from xiajunluan/SPARK-864
[[SPARK-864]DAGScheduler Exception if we delete Worker and StandaloneExecutorBackend then add Worker]
5c7494d Wed Sep 4 22:47:03 2013 -0700
Merge pull request #893 from ilikerps/master
[SPARK-884: Add unit test to validate Spark JSON output]
a547866 Wed Sep 4 21:11:56 2013 -0700
Merge pull request #894 from c0s/master
[Updating assembly README to reflect recent changes in the build.]
19f7027 Tue Sep 3 14:29:10 2013 -0700
Merge pull request #878 from tgravescs/yarnUILink
[Link the Spark UI up to the Yarn UI ]
68df246 Tue Sep 3 13:01:17 2013 -0700
Merge pull request #889 from alig/master
[Return the port the WebUI is bound to (useful if port 0 was used)]
d3dd48f Mon Sep 2 16:44:54 2013 -0700
Merge pull request #887 from mateiz/misc-fixes
[Miscellaneous fixes for 0.8]
636fc0c Mon Sep 2 11:20:39 2013 -0700
Merge pull request #886 from mateiz/codec
[Fix spark.io.compression.codec and change default codec to LZF]
d9a53b9 Sun Sep 1 22:12:30 2013 -0700
Merge pull request #885 from mateiz/win-py
[Allow PySpark to run on Windows]
3c520fe Sun Sep 1 17:26:55 2013 -0700
Merge pull request #884 from mateiz/win-fixes
[Run script fixes for Windows after package & assembly change]
f957c26 Sun Sep 1 14:53:57 2013 -0700
Merge pull request #882 from mateiz/package-rename
[Rename spark package to org.apache.spark]
a30fac1 Sun Sep 1 12:27:50 2013 -0700
Merge pull request #883 from alig/master
[Don't require the spark home environment variable to be set for standalone mode (change needed by SIMR)]
03cc765 Sun Sep 1 10:20:56 2013 -0700
Merge pull request #881 from pwendell/master
[Extend QuickStart to include next steps]
0e9565a Sat Aug 31 18:55:41 2013 -0700
Merge pull request #880 from mateiz/ui-tweaks
[Various UI tweaks]
2b29a1d Sat Aug 31 17:49:45 2013 -0700
Merge pull request #877 from mateiz/docs
[Doc improvements for 0.8]
6edef9c Sat Aug 31 13:39:24 2013 -0700
Merge pull request #861 from AndreSchumacher/pyspark_sampling_function
[Pyspark sampling function]
fd89835 Sat Aug 31 13:18:12 2013 -0700
Merge pull request #870 from JoshRosen/spark-885
[Don't send SIGINT / ctrl-c to Py4J gateway subprocess]
618f0ec Fri Aug 30 18:17:13 2013 -0700
Merge pull request #869 from AndreSchumacher/subtract
[PySpark: implementing subtractByKey(), subtract() and keyBy()]
94bb7fd Fri Aug 30 12:05:13 2013 -0700
Merge pull request #876 from mbautin/master_hadoop_rdd_conf
[Make HadoopRDD's configuration accessible]
9e17e45 Fri Aug 30 00:22:53 2013 -0700
Merge pull request #875 from shivaram/build-fix
[Fix broken build by removing addIntercept]
016787d Thu Aug 29 22:15:14 2013 -0700
Merge pull request #863 from shivaram/etrain-ridge
[Adding linear regression and refactoring Ridge regression to use SGD]
852d810 Thu Aug 29 22:13:15 2013 -0700
Merge pull request #819 from shivaram/sgd-cleanup
[Change SVM to use {0,1} labels]
ca71620 Thu Aug 29 21:51:14 2013 -0700
Merge pull request #857 from mateiz/assembly
[Change build and run instructions to use assemblies]
1528776 Thu Aug 29 21:30:47 2013 -0700
Merge pull request #874 from jerryshao/fix-report-bug
[Fix removed block zero size log reporting]
abdbacf Wed Aug 28 21:11:31 2013 -0700
Merge pull request #871 from pwendell/expose-local
[Expose `isLocal` in SparkContext.]
afcade3 Wed Aug 28 20:15:40 2013 -0700
Merge pull request #873 from pwendell/master
[Hot fix for command runner]
baa84e7 Wed Aug 28 12:44:46 2013 -0700
Merge pull request #865 from tgravescs/fixtmpdir
[Spark on Yarn should use yarn approved directories for spark.local.dir and tmp]
cd043cf Tue Aug 27 19:50:32 2013 -0700
Merge pull request #867 from tgravescs/yarnenvconfigs
[Spark on Yarn allow users to specify environment variables ]
898da7e Mon Aug 26 20:40:49 2013 -0700
Merge pull request #859 from ianbuss/sbt_opts
[Pass SBT_OPTS environment through to sbt_launcher]
17bafea Mon Aug 26 11:59:32 2013 -0700
Merge pull request #864 from rxin/json1
[Revert json library change]
f9fc5c1 Sat Aug 24 15:19:56 2013 -0700
Merge pull request #603 from pwendell/ec2-updates
[Several Improvements to EC2 Scripts]
d282c1e Fri Aug 23 11:20:20 2013 -0700
Merge pull request #860 from jey/sbt-ide-fixes
[Fix IDE project generation under SBT]
5a6ac12 Thu Aug 22 22:08:03 2013 -0700
Merge pull request #701 from ScrapCodes/documentation-suggestions
[Documentation suggestions for spark streaming.]
46ea0c1 Thu Aug 22 15:57:28 2013 -0700
Merge pull request #814 from holdenk/master
[Create less instances of the random class during ALS initialization.]
9ac3d62 Thu Aug 22 15:51:10 2013 -0700
Merge pull request #856 from jey/sbt-fix-hadoop-0.23.9
[Re-add removed dependency to fix build under Hadoop 0.23.9]
ae8ba83 Thu Aug 22 10:14:54 2013 -0700
Merge pull request #855 from jey/update-build-docs
[Update build docs]
8a36fd0 Thu Aug 22 10:13:35 2013 -0700
Merge pull request #854 from markhamstra/pomUpdate
[Synced sbt and maven builds to use the same dependencies, etc.]
c2d00f1 Thu Aug 22 10:13:03 2013 -0700
Merge pull request #832 from alig/coalesce
[Coalesced RDD with locality]
e6d66c8 Wed Aug 21 17:44:31 2013 -0700
Merge pull request #853 from AndreSchumacher/double_rdd
[Implementing SPARK-838: Add DoubleRDDFunctions methods to PySpark]
2905611 Tue Aug 20 17:36:14 2013 -0700
Merge pull request #851 from markhamstra/MutablePairTE
[Removed meaningless types]
d61337f Tue Aug 20 10:06:06 2013 -0700
Merge pull request #844 from markhamstra/priorityRename
[Renamed 'priority' to 'jobId' and assorted minor changes]
8cae72e Mon Aug 19 23:40:04 2013 -0700
Merge pull request #828 from mateiz/sched-improvements
[Scheduler fixes and improvements]
efeb142 Mon Aug 19 19:23:50 2013 -0700
Merge pull request #849 from mateiz/web-fixes
[Small fixes to web UI]
abdc1f8 Mon Aug 19 18:30:56 2013 -0700
Merge pull request #847 from rxin/rdd
[Allow subclasses of Product2 in all key-value related classes]
8fa0747 Sun Aug 18 17:02:54 2013 -0700
Merge pull request #840 from AndreSchumacher/zipegg
[Implementing SPARK-878 for PySpark: adding zip and egg files to context ...]
1e137a5 Sat Aug 17 22:22:32 2013 -0700
Merge pull request #846 from rxin/rdd
[Two minor RDD refactoring]
e89ffc7 Fri Aug 16 14:02:34 2013 -0700
Merge pull request #839 from jegonzal/zip_partitions
[Currying RDD.zipPartitions ]
1fb1b09 Thu Aug 15 22:15:05 2013 -0700
Merge pull request #841 from rxin/json
[Use the JSON formatter from Scala library and removed dependency on lift-json.]
c69c489 Thu Aug 15 20:55:09 2013 -0700
Merge pull request #843 from Reinvigorate/bug-879
[fixing typo in conf/slaves]
230ab27 Thu Aug 15 17:45:17 2013 -0700
Merge pull request #834 from Daemoen/master
[Updated json output to allow for display of worker state]
659553b Thu Aug 15 16:56:31 2013 -0700
Merge pull request #836 from pwendell/rename
[Rename `memoryBytesToString` and `memoryMegabytesToString`]
28369ff Thu Aug 15 16:44:02 2013 -0700
Merge pull request #829 from JoshRosen/pyspark-unit-tests-python-2.6
[Fix PySpark unit tests on Python 2.6]
1a13460 Thu Aug 15 15:50:44 2013 -0700
Merge pull request #833 from rxin/ui
[Various UI improvements.]
044a088 Wed Aug 14 20:43:49 2013 -0700
Merge pull request #831 from rxin/scheduler
[A few small scheduler / job description changes.]
839f2d4 Wed Aug 14 16:17:23 2013 -0700
Merge pull request #822 from pwendell/ui-features
[Adding GC Stats to TaskMetrics (and three small fixes)]
63446f9 Wed Aug 14 00:17:07 2013 -0700
Merge pull request #826 from kayousterhout/ui_fix
[Fixed 2 bugs in executor UI (incl. SPARK-877)]
3f14cba Tue Aug 13 20:09:51 2013 -0700
Merge pull request #825 from shivaram/maven-repl-fix
[Set SPARK_CLASSPATH for maven repl tests]
596adc6 Tue Aug 13 19:41:34 2013 -0700
Merge pull request #824 from mateiz/mesos-0.12.1
[Update to Mesos 0.12.1]
d316af9 Tue Aug 13 15:31:01 2013 -0700
Merge pull request #821 from pwendell/print-launch-command
[Print run command to stderr rather than stdout]
1f79d21 Tue Aug 13 15:23:54 2013 -0700
Merge pull request #818 from kayousterhout/killed_fix
[Properly account for killed tasks.]
622f83c Tue Aug 13 09:58:52 2013 -0700
Merge pull request #817 from pwendell/pr_784
[Minor clean-up in metrics servlet code]
a0133bf Tue Aug 13 09:28:18 2013 -0700
Merge pull request #784 from jerryshao/dev-metrics-servlet
[Add MetricsServlet for Spark metrics system]
e2fdac6 Mon Aug 12 21:26:59 2013 -0700
Merge pull request #802 from stayhf/SPARK-760-Python
[Simple PageRank algorithm implementation in Python for SPARK-760]
d3525ba Mon Aug 12 21:02:39 2013 -0700
Merge pull request #813 from AndreSchumacher/add_files_pyspark
[Implementing SPARK-865: Add the equivalent of ADD_JARS to PySpark]
9e02da2 Mon Aug 12 20:22:27 2013 -0700
Merge pull request #812 from shivaram/maven-mllib-tests
[Create SparkContext in beforeAll for MLLib tests]
65d0d91 Mon Aug 12 19:00:57 2013 -0700
Merge pull request #807 from JoshRosen/guava-optional
[Change scala.Option to Guava Optional in Java APIs]
4346f0a Mon Aug 12 12:12:12 2013 -0700
Merge pull request #809 from shivaram/sgd-cleanup
[Clean up scaladoc in ML Lib.]
ea1b4ba Mon Aug 12 08:09:58 2013 -0700
Merge pull request #806 from apivovarov/yarn-205
[Changed yarn.version to 2.0.5 in pom.xml]
2a39d2c Sun Aug 11 20:35:09 2013 -0700
Merge pull request #810 from pwendell/dead_doc_code
[Remove now dead code inside of docs]
e5b9ed2 Sun Aug 11 17:22:47 2013 -0700
Merge pull request #808 from pwendell/ui_compressed_bytes
[Report compressed bytes read when calculating TaskMetrics]
3796486 Sun Aug 11 14:51:47 2013 -0700
Merge pull request #805 from woggle/hadoop-rdd-jobconf
[Use new Configuration() instead of slower new JobConf() in SerializableWritable]
ff9ebfa Sun Aug 11 10:52:55 2013 -0700
Merge pull request #762 from shivaram/sgd-cleanup
[Refactor SGD options into a new class.]
95c62ca Sun Aug 11 10:30:52 2013 -0700
Merge pull request #804 from apivovarov/master
[Fixed path to JavaALS.java and JavaKMeans.java, fixed hadoop2-yarn profi...]
06e4f2a Sat Aug 10 18:06:23 2013 -0700
Merge pull request #789 from MLnick/master
[Adding Scala version of PageRank example]
71c63de Sat Aug 10 10:21:20 2013 -0700
Merge pull request #795 from mridulm/master
[Fix bug reported in PR 791 : a race condition in ConnectionManager and Connection]
d17eeb9 Sat Aug 10 09:02:27 2013 -0700
Merge pull request #785 from anfeng/master
[expose HDFS file system stats via Executor metrics]
dce5e47 Fri Aug 9 21:53:45 2013 -0700
Merge pull request #800 from dlyubimov/HBASE_VERSION
[Pull HBASE_VERSION in the head of sbt build]
cd247ba Fri Aug 9 20:41:13 2013 -0700
Merge pull request #786 from shivaram/mllib-java
[Java fixes, tests and examples for ALS, KMeans]
b09d4b7 Fri Aug 9 13:17:08 2013 -0700
Merge pull request #799 from woggle/sync-fix
[Remove extra synchronization in ResultTask]
0bc63bf Fri Aug 9 13:16:25 2013 -0700
Merge pull request #801 from pwendell/print-launch-command
[Print launch command [Branch 0.8 version]]
cc6b92e Fri Aug 9 13:00:33 2013 -0700
Merge pull request #775 from pwendell/print-launch-command
[Log the launch command for Spark daemons]
f94fc75 Fri Aug 9 10:04:03 2013 -0700
Merge pull request #788 from shane-huang/sparkjavaopts
[For standalone mode, add worker local env setting of SPARK_JAVA_OPTS as ...]
63b6e02 Thu Aug 8 14:02:02 2013 -0700
Merge pull request #797 from mateiz/chill-0.3.1
[Update to Chill 0.3.1]
9955e5a Thu Aug 8 11:03:38 2013 -0700
Merge pull request #796 from pwendell/bootstrap-design
[Bootstrap re-design]
5133e4b Wed Aug 7 15:50:45 2013 -0700
Merge pull request #790 from kayousterhout/fix_throughput
[Fixed issue in UI that decreased scheduler throughput by 5x or more]
3c8478e Tue Aug 6 23:25:03 2013 -0700
Merge pull request #747 from mateiz/improved-lr
[Update the Python logistic regression example]
6b043a6 Tue Aug 6 22:31:02 2013 -0700
Merge pull request #724 from dlyubimov/SPARK-826
[SPARK-826: fold(), reduce(), collect() always attempt to use java serialization]
de6c4c9 Tue Aug 6 17:09:50 2013 -0700
Merge pull request #787 from ash211/master
[Update spark-standalone.md]
df4d10d Tue Aug 6 15:44:05 2013 -0700
Merge pull request #779 from adatao/adatao-global-SparkEnv
[[HOTFIX] Extend thread safety for SparkEnv.get()]
d2b0f0c Tue Aug 6 14:49:39 2013 -0700
Merge pull request #770 from stayhf/SPARK-760-Java
[Simple PageRank algorithm implementation in Java for SPARK-760]
d031f73 Mon Aug 5 22:33:00 2013 -0700
Merge pull request #782 from WANdisco/master
[SHARK-94 Log the files computed by HadoopRDD and NewHadoopRDD]
1b63dea Mon Aug 5 22:21:26 2013 -0700
Merge pull request #769 from markhamstra/NegativeCores
[SPARK-847 + SPARK-845: Zombie workers and negative cores]
828aff7 Mon Aug 5 21:37:33 2013 -0700
Merge pull request #776 from gingsmith/master
[adding matrix factorization data generator]
8b27789 Mon Aug 5 19:14:52 2013 -0700
Merge pull request #774 from pwendell/job-description
[Show user-defined job name in UI]
550b0cf Mon Aug 5 12:10:32 2013 -0700
Merge pull request #780 from cybermaster/master
[SPARK-850]
22abbc1 Fri Aug 2 16:37:59 2013 -0700
Merge pull request #772 from karenfeng/ui-843
[Show app duration]
9d7dfd2 Thu Aug 1 17:41:58 2013 -0700
Merge pull request #743 from pwendell/app-metrics
[Add application metrics to standalone master]
6d7afd7 Thu Aug 1 17:13:28 2013 -0700
Merge pull request #768 from pwendell/pr-695
[Minor clean-up of fair scheduler UI]
5e7b38f Thu Aug 1 14:59:33 2013 -0700
Merge pull request #695 from xiajunluan/pool_ui
[Enhance job ui in spark ui system with adding pool information]
0a96493 Thu Aug 1 11:27:17 2013 -0700
Merge pull request #760 from karenfeng/heading-update
[Clean up web UI page headers]
cb7dd86 Thu Aug 1 11:06:10 2013 -0700
Merge pull request #758 from pwendell/master-json
[Add JSON path to master index page]
58756b7 Wed Jul 31 23:45:41 2013 -0700
Merge pull request #761 from mateiz/kmeans-generator
[Add data generator for K-means]
ecab635 Wed Jul 31 18:16:55 2013 -0700
Merge pull request #763 from c0s/assembly
[SPARK-842. Maven assembly is including examples libs and dependencies]
39c75f3 Wed Jul 31 15:52:36 2013 -0700
Merge pull request #757 from BlackNiuza/result_task_generation
[Bug fix: SPARK-837]
b2b86c2 Wed Jul 31 15:51:39 2013 -0700
Merge pull request #753 from shivaram/glm-refactor
[Build changes for ML lib]
14bf2fe Wed Jul 31 14:18:16 2013 -0700
Merge pull request #749 from benh/spark-executor-uri
[Added property 'spark.executor.uri' for launching on Mesos.]
4ba4c3f Wed Jul 31 13:14:49 2013 -0700
Merge pull request #759 from mateiz/split-fix
[Use the Char version of split() instead of the String one in MLUtils]
a386ced Wed Jul 31 11:22:50 2013 -0700
Merge pull request #754 from rxin/compression
[Compression codec change]
0be071a Wed Jul 31 11:11:59 2013 -0700
Merge pull request #756 from cdshines/patch-1
[Refactored Vector.apply(length, initializer) replacing excessive code with library method]
d4556f4 Wed Jul 31 08:48:14 2013 -0700
Merge pull request #751 from cdshines/master
[Cleaned Partitioner & PythonPartitioner source by taking out non-related logic to Utils]
29b8cd3 Tue Jul 30 21:30:33 2013 -0700
Merge pull request #755 from jerryshao/add-apache-header
[Add Apache license header to metrics system]
e87de03 Tue Jul 30 15:00:08 2013 -0700
Merge pull request #744 from karenfeng/bootstrap-update
[Use Bootstrap progress bars in web UI]
ae57020 Tue Jul 30 14:56:41 2013 -0700
Merge pull request #752 from rxin/master
[Minor mllib cleanup]
8aee118 Tue Jul 30 10:27:54 2013 -0700
Merge pull request #748 from atalwalkar/master
[made SimpleUpdater consistent with other updaters]
468a36c Mon Jul 29 19:44:33 2013 -0700
Merge pull request #746 from rxin/cleanup
[Internal cleanup]
1e1ffb1 Mon Jul 29 19:26:19 2013 -0700
Merge pull request #745 from shivaram/loss-update-fix
[Remove duplicate loss history in Gradient Descent]
c99b674 Mon Jul 29 16:32:55 2013 -0700
Merge pull request #735 from karenfeng/ui-807
[Totals for shuffle data and CPU time]
fe7298b Mon Jul 29 14:01:00 2013 -0700
Merge pull request #741 from pwendell/usability
[Fix two small usability issues]
c34c0f6 Mon Jul 29 13:18:10 2013 -0700
Merge pull request #731 from pxinghao/master
[Adding SVM and Lasso]
f3d72ff Fri Jul 26 17:19:27 2013 -0700
Merge pull request #739 from markhamstra/toolsPom
[Missing tools/pom.xml scalatest dependency]
cb36677 Fri Jul 26 16:59:30 2013 -0700
Merge pull request #738 from harsha2010/pruning
[Fix bug in Partition Pruning.]
f3cf094 Thu Jul 25 14:53:21 2013 -0700
Merge pull request #734 from woggle/executor-env2
[Get more env vars from driver rather than worker]
51c2427 Thu Jul 25 00:03:11 2013 -0700
Merge pull request #732 from ryanlecompte/master
[Refactor Kryo serializer support to use chill/chill-java]
52723b9 Wed Jul 24 14:33:02 2013 -0700
Merge pull request #728 from jey/examples-jar-env
[Fix setting of SPARK_EXAMPLES_JAR]
20338c2 Wed Jul 24 14:32:24 2013 -0700
Merge pull request #729 from karenfeng/ui-811
[Stage Page updates]
5584ebc Wed Jul 24 11:46:46 2013 -0700
Merge pull request #675 from c0s/assembly
[Building spark assembly for further consumption of the Spark project with a deployed cluster]
a73f3ee Wed Jul 24 08:59:14 2013 -0700
Merge pull request #671 from jerryshao/master
[Add metrics system for Spark]
b011329 Tue Jul 23 22:50:09 2013 -0700
Merge pull request #727 from rxin/scheduler
[Scheduler code style cleanup.]
876125b Tue Jul 23 22:28:21 2013 -0700
Merge pull request #726 from rxin/spark-826
[SPARK-829: scheduler shouldn't hang if a task contains unserializable objects in its closure]
2f1736c Tue Jul 23 15:53:30 2013 -0700
Merge pull request #725 from karenfeng/task-start
[Creates task start events]
5364f64 Tue Jul 23 13:40:34 2013 -0700
Merge pull request #723 from rxin/mllib
[Made RegressionModel serializable and added unit tests to make sure predict methods would work.]
f369e0e Tue Jul 23 13:22:27 2013 -0700
Merge pull request #720 from ooyala/2013-07/persistent-rdds-api
[Add a public method getCachedRdds to SparkContext]
401aac8 Mon Jul 22 16:57:16 2013 -0700
Merge pull request #719 from karenfeng/ui-808
[Creates Executors tab for Jobs UI]
8ae1436 Mon Jul 22 16:03:04 2013 -0700
Merge pull request #722 from JoshRosen/spark-825
[Fix bug: DoubleRDDFunctions.sampleStdev() computed non-sample stdev()]
15fb394 Sun Jul 21 10:33:38 2013 -0700
Merge pull request #716 from c0s/webui-port
[Regression: default webui-port can't be set via command line "--webui-port" anymore]
c40f0f2 Fri Jul 19 13:33:04 2013 -0700
Merge pull request #711 from shivaram/ml-generators
[Move ML lib data generator files to util/]
413b841 Fri Jul 19 13:31:38 2013 -0700
Merge pull request #717 from viirya/dev1
[Do not copy local jars given to SparkContext in yarn mode]
0d0a47c Thu Jul 18 12:06:37 2013 -0700
Merge pull request #710 from shivaram/ml-updates
[Updates to LogisticRegression]
c6235b5 Thu Jul 18 11:43:48 2013 -0700
Merge pull request #714 from adatao/master
[[BUGFIX] Fix for sbt/sbt script SPARK_HOME setting]
009c79e Thu Jul 18 11:41:52 2013 -0700
Merge pull request #715 from viirya/dev1
[fix a bug in build process that pulls in two versions of ASM.]
985a9e3 Wed Jul 17 22:27:19 2013 -0700
Merge pull request #712 from stayhf/SPARK-817
[Consistently invoke bash with /usr/bin/env bash in scripts to make code ...]
cad48ed Tue Jul 16 21:41:28 2013 -0700
Merge pull request #708 from ScrapCodes/dependencies-upgrade
[Dependency upgrade Akka 2.0.3 -> 2.0.5]
8a8a8f2 Mon Jul 15 23:09:21 2013 -0700
Merge pull request #705 from rxin/errormessages
[Throw a more meaningful message when runJob is called to launch tasks on non-existent partitions.]
ed8415b Mon Jul 15 16:41:04 2013 -0700
Merge pull request #703 from karenfeng/ui-802
[Link to job UI from standalone deploy cluster web UI]
e3d3e6f Mon Jul 15 14:59:44 2013 -0700
Merge pull request #702 from karenfeng/ui-fixes
[Adds app name in HTML page titles on job web UI]
c7877d5 Sun Jul 14 12:58:13 2013 -0700
Merge pull request #689 from BlackNiuza/application_status
[Bug fix: SPARK-796]
10c0593 Sun Jul 14 11:45:18 2013 -0700
Merge pull request #699 from pwendell/ui-env
[Add `Environment` tab to SparkUI.]
89e8549 Sat Jul 13 16:11:08 2013 -0700
Merge pull request #698 from Reinvigorate/sm-deps-change
[changing com.google.code.findbugs maven coordinates]
77c69ae Fri Jul 12 23:05:21 2013 -0700
Merge pull request #697 from pwendell/block-locations
[Show block locations in Web UI.]
5a7835c Fri Jul 12 20:28:21 2013 -0700
Merge pull request #691 from karenfeng/logpaging
[Create log pages]
71ccca0 Fri Jul 12 20:25:06 2013 -0700
Merge pull request #696 from woggle/executor-env
[Pass executor env vars (e.g. SPARK_CLASSPATH) to compute-classpath.sh]
90fc3f3 Fri Jul 12 20:23:36 2013 -0700
Merge pull request #692 from Reinvigorate/takeOrdered
[adding takeOrdered() to RDD]
018d04c Thu Jul 11 12:48:37 2013 -0700
Merge pull request #684 from woggle/mesos-classloader
[Explicitly set class loader for MesosSchedulerDriver callbacks.]
bc19477 Wed Jul 10 22:29:41 2013 -0700
Merge pull request #693 from c0s/readme
[Updating README to reflect Scala 2.9.3 requirements]
7dcda9a Mon Jul 8 23:24:23 2013 -0700
Merge pull request #688 from markhamstra/scalaDependencies
[Fixed SPARK-795 with explicit dependencies]
638927b Mon Jul 8 22:58:50 2013 -0700
Merge pull request #683 from shivaram/sbt-test-fix
[Remove some stack traces from sbt test output]
3c13178 Mon Jul 8 14:50:34 2013 -0700
Merge pull request #687 from atalwalkar/master
[Added "Labeled" to util functions for labeled data]
744da8e Sun Jul 7 17:42:25 2013 -0700
Merge pull request #679 from ryanlecompte/master
[Make binSearch method tail-recursive for RidgeRegression]
3cc6818 Sat Jul 6 19:51:20 2013 -0700
Merge pull request #668 from shimingfei/guava-14.0.1
[update guava version from 11.0.1 to 14.0.1]
2216188 Sat Jul 6 16:18:15 2013 -0700
Merge pull request #676 from c0s/asf-avro
[Use standard ASF published avro module instead of a proprietory built one]
94871e4 Sat Jul 6 15:26:19 2013 -0700
Merge pull request #655 from tgravescs/master
[Add support for running Spark on Yarn on a secure Hadoop Cluster]
3f918b3 Sat Jul 6 12:45:18 2013 -0700
Merge pull request #672 from holdenk/master
[s/ActorSystemImpl/ExtendedActorSystem/ as ActorSystemImpl results in a warning]
2a36e54 Sat Jul 6 12:43:21 2013 -0700
Merge pull request #673 from xiajunluan/master
[Add config template file for fair scheduler feature]
7ba7fa1 Sat Jul 6 11:45:08 2013 -0700
Merge pull request #674 from liancheng/master
[Bug fix: SPARK-789]
f4416a1 Sat Jul 6 11:41:58 2013 -0700
Merge pull request #681 from BlackNiuza/memory_leak
[Remove active job from idToActiveJob when job finished or aborted]
e063e29 Fri Jul 5 21:54:52 2013 -0700
Merge pull request #680 from tdas/master
[Fixed major performance bug in Network Receiver]
bf1311e Fri Jul 5 17:32:44 2013 -0700
Merge pull request #678 from mateiz/ml-examples
[Start of ML package]
6ad85d0 Thu Jul 4 21:32:29 2013 -0700
Merge pull request #677 from jerryshao/fix_stage_clean
[Clean StageToInfos periodically when spark.cleaner.ttl is enabled]
2e32fc8 Thu Jul 4 12:18:20 2013 -0700
Merge pull request #666 from c0s/master
[hbase dependency is missed in hadoop2-yarn profile of examples module
]
6d60fe5 Mon Jul 1 18:24:03 2013 -0700
Merge pull request #666 from c0s/master
[hbase dependency is missed in hadoop2-yarn profile of examples module]
ccfe953 Sat Jun 29 17:57:53 2013 -0700
Merge pull request #577 from skumargithub/master
[Example of cumulative counting using updateStateByKey]
50ca176 Thu Jun 27 22:24:52 2013 -0700
Merge pull request #664 from pwendell/test-fix
[Removing incorrect test statement]
e49bc8c Wed Jun 26 11:13:33 2013 -0700
Merge pull request #663 from stephenh/option_and_getenv
[Be cute with Option and getenv.]
f5e32ed Tue Jun 25 09:16:57 2013 -0700
Merge pull request #661 from mesos/streaming
[Kafka fixes and DStream.count fix for master]
1249e91 Mon Jun 24 21:46:33 2013 -0700
Merge pull request #572 from Reinvigorate/sm-block-interval
[Adding spark.streaming.blockInterval property]
cfcda95 Mon Jun 24 21:44:50 2013 -0700
Merge pull request #571 from Reinvigorate/sm-kafka-serializers
[Surfacing decoders on KafkaInputDStream]
575aff6 Mon Jun 24 21:35:50 2013 -0700
Merge pull request #567 from Reinvigorate/sm-count-fix
[Fixing count() in Spark Streaming]
3e61bef Sat Jun 22 16:22:47 2013 -0700
Merge pull request #648 from shivaram/netty-dbg
[Shuffle fixes and cleanup]
1ef5d0d Sat Jun 22 09:35:57 2013 -0700
Merge pull request #644 from shimingfei/joblogger
[add Joblogger to Spark (on new Spark code)]
7e4b266 Sat Jun 22 07:53:18 2013 -0700
Merge pull request #563 from jey/python-optimization
[Optimize PySpark worker invocation]
71030ba Wed Jun 19 15:21:03 2013 -0700
Merge pull request #654 from lyogavin/enhance_pipe
[fix typo and coding style in #638]
73f4c7d Tue Jun 18 04:21:17 2013 -0700
Merge pull request #605 from esjewett/SPARK-699
[Add hBase example (retry of pull request #596)]
9933836 Tue Jun 18 02:41:10 2013 -0700
Merge pull request #647 from jerryshao/master
[Reduce ZippedPartitionsRDD's getPreferredLocations complexity from O(2^2n) to O(2^n)]
db42451 Mon Jun 17 15:26:36 2013 -0700
Merge pull request #643 from adatao/master
[Bug fix: Zero-length partitions result in NaN for overall mean & variance]
e82a2ff Mon Jun 17 15:13:15 2013 -0700
Merge pull request #653 from rxin/logging
[SPARK-781: Log the temp directory path when Spark says "Failed to create temp directory."]
e6d1277 Mon Jun 17 12:56:25 2013 -0700
Merge pull request #638 from lyogavin/enhance_pipe
[Enhance pipe to support more features we can do in hadoop streaming]
f961aac Sat Jun 15 00:53:41 2013 -0700
Merge pull request #649 from ryanlecompte/master
[Add top K method to RDD using a bounded priority queue]
6602d94 Fri Jun 14 10:41:31 2013 -0700
Merge pull request #651 from rxin/groupbykey
[SPARK-772 / SPARK-774: groupByKey and cogroup should disable map side combine]
d93851a Thu Jun 13 13:38:45 2013 -0700
Merge pull request #645 from pwendell/compression
[Adding compression to Hadoop save functions]
f1da591 Wed Jun 12 17:55:08 2013 -0700
Merge pull request #646 from markhamstra/jvmArgs
[Fixed jvmArgs in maven build.]
0e94b73 Mon Jun 10 13:00:31 2013 -0700
Merge pull request #625 from stephenh/fix-start-slave
[Fix start-slave not passing instance number to spark-daemon.]
74b91d5 Sat Jun 8 01:19:40 2013 -0700
Merge pull request #629 from c0s/master
[Sometime Maven build runs out of PermGen space.]
c8fc423 Fri Jun 7 22:43:18 2013 -0700
Merge pull request #631 from jerryshao/master
[Fix block manager UI display issue when enable spark.cleaner.ttl]
1ae60bc Fri Jun 7 22:39:06 2013 -0700
Merge pull request #634 from xiajunluan/master
[[Spark-753] Fix ClusterSchedulSuite unit test failed ]
fff3728 Tue Jun 4 16:09:50 2013 -0700
Merge pull request #640 from pwendell/timeout-update
[Fixing bug in BlockManager timeout]
f420d4f Tue Jun 4 15:25:58 2013 -0700
Merge pull request #639 from pwendell/timeout-update
[Bump akka and blockmanager timeouts to 60 seconds]
84530ba Fri May 31 17:06:13 2013 -0700
Merge pull request #636 from rxin/unpersist
[Unpersist More block manager cleanup.]
ef77bb7 Thu May 30 14:50:06 2013 -0700
Merge pull request #627 from shivaram/master
[Netty and shuffle bug fixes]
8cb8178 Thu May 30 14:17:44 2013 -0700
Merge pull request #628 from shivaram/zero-block-size
[Skip fetching zero-sized blocks in NIO.]
6ed7139 Wed May 29 10:14:22 2013 -0700
Merge pull request #626 from stephenh/remove-add-if-no-port
[Remove unused addIfNoPort.]
41d230c Tue May 28 23:35:24 2013 -0700
Merge pull request #611 from squito/classloader
[Use default classloaders for akka & deserializing task results]
3db1e17 Mon May 27 21:31:43 2013 -0700
Merge pull request #620 from jerryshao/master
[Fix CheckpointRDD java.io.FileNotFoundException when calling getPreferredLocations]
3d4891d Sat May 25 23:38:05 2013 -0700
Merge pull request #621 from JoshRosen/spark-613
[Use ec2-metadata in start-slave.sh to detect if running on EC2]
e8d4b6c Sat May 25 21:09:03 2013 -0700
Merge pull request #529 from xiajunluan/master
[[SPARK-663]Implement Fair Scheduler in Spark Cluster Scheduler ]
9a3c344 Sat May 25 17:53:43 2013 -0700
Merge pull request #624 from rxin/master
[NonJavaSerializableClass should not be Java serializable...]
24e41aa Fri May 24 16:48:52 2013 -0700
Merge pull request #623 from rxin/master
[Automatically configure Netty port.]
69161f9 Fri May 24 14:42:13 2013 -0700
Merge pull request #622 from rxin/master
[bug fix: Shuffle block iterator is ignoring the shuffle serializer setting.]
dbbedfc Thu May 23 23:11:06 2013 -0700
Merge pull request #616 from jey/maven-netty-exclusion
[Exclude old versions of Netty from Maven-based build]
a2b0a79 Tue May 21 18:16:20 2013 -0700
Merge pull request #619 from woggling/adjust-sampling
[Use ARRAY_SAMPLE_SIZE constant instead of hard-coded 100.0 in SizeEstimator]
66dac44 Tue May 21 11:41:42 2013 -0700
Merge pull request #618 from woggling/dead-code-disttest
[DistributedSuite: remove dead code]
5912cc4 Fri May 17 19:58:40 2013 -0700
Merge pull request #610 from JoshRosen/spark-747
[Throw exception if TaskResult exceeds Akka frame size]
6c27c38 Thu May 16 17:33:56 2013 -0700
Merge pull request #615 from rxin/build-fix
[Maven build fix & two other small changes]
2f576ab Wed May 15 18:06:24 2013 -0700
Merge pull request #602 from rxin/shufflemerge
[Manual merge & cleanup of Shane's Shuffle Performance Optimization]
48c6f46 Wed May 15 10:47:19 2013 -0700
Merge pull request #612 from ash211/patch-4
[Docs: Mention spark shell's default for MASTER]
203d7b7 Wed May 15 00:47:20 2013 -0700
Merge pull request #593 from squito/driver_ui_link
[Master UI has link to Application UI]
016ac86 Mon May 13 21:45:36 2013 -0700
Merge pull request #601 from rxin/emptyrdd-master
[EmptyRDD (master branch 0.8)]
4b354e0 Mon May 13 17:39:19 2013 -0700
Merge pull request #589 from mridulm/master
[Add support for instance local scheduling]
5dbc9b2 Sun May 12 11:03:10 2013 -0700
Merge pull request #608 from pwendell/SPARK-738
[SPARK-738: Spark should detect and wrap nonserializable exceptions]
63e1999 Fri May 10 13:54:03 2013 -0700
Merge pull request #606 from markhamstra/foreachPartition_fix
[Actually use the cleaned closure in foreachPartition]
42bbe89 Wed May 8 22:30:31 2013 -0700
Merge pull request #599 from JoshRosen/spark-670
[Fix SPARK-670: EC2 'start' command should require -i option.]
0f1b7a0 Wed May 8 13:38:50 2013 -0700
Merge pull request #596 from esjewett/master
[hBase example]
7af92f2 Sat May 4 22:29:17 2013 -0700
Merge pull request #597 from JoshRosen/webui-fixes
[Two minor bug fixes for Spark Web UI]
c74ce60 Sat May 4 22:26:35 2013 -0700
Merge pull request #598 from rxin/blockmanager
[Fixed flaky unpersist test in DistributedSuite.]
3bf2c86 Fri May 3 18:27:30 2013 -0700
Merge pull request #594 from shivaram/master
[Add zip partitions to Java API]
2484ad7 Fri May 3 17:08:55 2013 -0700
Merge pull request #587 from rxin/blockmanager
[A set of shuffle map output related changes]
6fe9d4e Thu May 2 21:33:56 2013 -0700
Merge pull request #592 from woggling/localdir-fix
[Don't accept generated local directory names that can't be created]
538ee75 Thu May 2 09:01:42 2013 -0700
Merge pull request #581 from jerryshao/master
[fix [SPARK-740] block manage UI throws exception when enabling Spark Streaming]
9abcbcc Wed May 1 22:45:10 2013 -0700
Merge pull request #591 from rxin/removerdd
[RDD.unpersist: probably the most desired feature of Spark]
aa8fe1a Tue Apr 30 22:30:18 2013 -0700
Merge pull request #586 from mridulm/master
[Pull request to address issues Reynold Xin reported]
f708dda Tue Apr 30 07:51:40 2013 -0700
Merge pull request #585 from pwendell/listener-perf
[[Fix SPARK-742] Task Metrics should not employ per-record timing by default]
68c07ea Sun Apr 28 20:19:33 2013 -0700
Merge pull request #582 from shivaram/master
[Add zip partitions interface]
f6ee9a8 Sun Apr 28 15:36:04 2013 -0700
Merge pull request #583 from mridulm/master
[Fix issues with streaming test cases after yarn branch merge]
cf54b82 Thu Apr 25 11:45:58 2013 -0700
Merge pull request #580 from pwendell/quickstart
[SPARK-739 Have quickstart standlone job use README]
118a6c7 Wed Apr 24 08:42:30 2013 -0700
Merge pull request #575 from mridulm/master
[Manual merge of yarn branch to trunk]
5d8a71c Tue Apr 16 19:48:02 2013 -0700
Merge pull request #570 from jey/increase-codecache-size
[Increase ReservedCodeCacheSize for sbt]
ec5e553 Sun Apr 14 08:20:13 2013 -0700
Merge pull request #558 from ash211/patch-jackson-conflict
[Don't pull in old versions of Jackson via hadoop-core]
c1c219e Sun Apr 14 08:11:23 2013 -0700
Merge pull request #564 from maspotts/master
[Allow latest scala in PATH, with SCALA_HOME as override (instead of vice-versa)]
7c10b3e Fri Apr 12 20:55:22 2013 -0700
Merge pull request #565 from andyk/master
[Update wording of section on RDD operations in quick start guide in docs]
077ae0a Thu Apr 11 19:34:14 2013 -0700
Merge pull request #561 from ash211/patch-4
[Add details when BlockManager heartbeats time out]
c91ff8d Wed Apr 10 15:08:23 2013 -0700
Merge pull request #560 from ash211/patch-3
[Typos: cluser -> cluster]
7cd83bf Tue Apr 9 22:07:35 2013 -0700
Merge pull request #559 from ash211/patch-example-whitespace
[Uniform whitespace across scala examples]
271a4f3 Tue Apr 9 22:04:52 2013 -0700
Merge pull request #555 from holdenk/master
[Retry failed ssh commands in the ec2 python script.]
8ac9efb Tue Apr 9 13:50:50 2013 -0700
Merge pull request #527 from Reinvigorate/sm-kafka-cleanup
[KafkaInputDStream fixes and improvements]
eed54a2 Mon Apr 8 09:44:30 2013 -0700
Merge pull request #553 from pwendell/akka-standalone
[SPARK-724 - Have Akka logging enabled by default for standalone daemons]
b362df3 Sun Apr 7 17:17:52 2013 -0700
Merge pull request #552 from MLnick/master
[Bumping version for Twitter Algebird to latest]
4b30190 Sun Apr 7 17:15:10 2013 -0700
Merge pull request #554 from andyk/scala2.9.3
[Fixes SPARK-723 - Update build to Scala 2.9.3]
dfe98ca Tue Apr 2 19:24:12 2013 -0700
Merge pull request #550 from erikvanoosten/master
[corrected Algebird example]
b5d7830 Tue Apr 2 19:23:45 2013 -0700
Merge pull request #551 from jey/python-bugfixes
[Python bugfixes]
2be2295 Sun Mar 31 18:09:14 2013 -0700
Merge pull request #548 from markhamstra/getWritableClass_filter
[Fixed broken filter in getWritableClass[T]]
9831bc1 Fri Mar 29 22:16:22 2013 -0700
Merge pull request #539 from cgrothaus/fix-webui-workdirpath
[Bugfix: WorkerWebUI must respect workDirPath from Worker]
3cc8ab6 Fri Mar 29 22:14:07 2013 -0700
Merge pull request #541 from stephenh/shufflecoalesce
[Add a shuffle parameter to coalesce.]
cad507a Fri Mar 29 22:13:12 2013 -0700
Merge pull request #547 from jey/maven-streaming-tests-initialization-fix
[Move streaming test initialization into 'before' blocks]
a98996d Fri Mar 29 22:12:15 2013 -0700
Merge pull request #545 from ash211/patch-1
[Don't use deprecated Application in example]
104c694 Fri Mar 29 22:11:50 2013 -0700
Merge pull request #546 from ash211/patch-2
[Update tuning.md]
bc36ee4 Tue Mar 26 15:05:13 2013 -0700
Merge pull request #543 from holdenk/master
[Re-enable deprecation warnings and fix deprecated warning.]
b8949ca Sat Mar 23 07:19:34 2013 -0700
Merge pull request #505 from stephenh/volatile
[Make Executor fields volatile since they're read from the thread pool.]
fd53f2f Sat Mar 23 07:13:21 2013 -0700
Merge pull request #510 from markhamstra/WithThing
[mapWith, flatMapWith and filterWith]
4c5efcf Wed Mar 20 19:29:23 2013 -0700
Merge pull request #532 from andyk/master
[SPARK-715: Adds instructions for building with Maven to documentation]
3558849 Wed Mar 20 19:27:47 2013 -0700
Merge pull request #538 from rxin/cogroup
[Added mapSideCombine flag to CoGroupedRDD. Added unit test for CoGroupedRDD.]
ca4d083 Wed Mar 20 11:22:36 2013 -0700
Merge pull request #528 from MLnick/java-examples
[[SPARK-707] Adding Java versions of Pi, LogQuery and K-Means examples]
b812e6b Wed Mar 20 11:21:02 2013 -0700
Merge pull request #526 from markhamstra/foldByKey
[Add foldByKey]
945d1e7 Tue Mar 19 21:59:06 2013 -0700
Merge pull request #536 from sasurfer/master
[CoalescedRDD for many partitions]
1cbbe94 Tue Mar 19 21:34:34 2013 -0700
Merge pull request #534 from stephenh/removetrycatch
[Remove try/catch block that can't be hit.]
71e53f8 Tue Mar 19 21:31:41 2013 -0700
Merge pull request #537 from wishbear/configurableInputFormat
[call setConf from input format if it is Configurable]
c1e9cdc Sat Mar 16 11:47:45 2013 -0700
Merge pull request #525 from stephenh/subtractByKey
[Add PairRDDFunctions.subtractByKey.]
cdbfd1e Fri Mar 15 15:13:28 2013 -0700
Merge pull request #516 from squito/fix_local_metrics
[Fix local metrics]
f9fa2ad Fri Mar 15 15:12:43 2013 -0700
Merge pull request #530 from mbautin/master-update-log4j-and-make-compile-in-IntelliJ
[Add a log4j compile dependency to fix build in IntelliJ]
4032beb Wed Mar 13 19:29:46 2013 -0700
Merge pull request #521 from stephenh/earlyclose
[Close the reader in HadoopRDD as soon as iteration end.]
3c97276 Wed Mar 13 19:25:08 2013 -0700
Merge pull request #524 from andyk/master
[Fix broken link to YARN documentation]
1c3d981 Wed Mar 13 19:23:48 2013 -0700
Merge pull request #517 from Reinvigorate/sm-build-fixes
[Build fixes for streaming /w SBT]
2d477fd Wed Mar 13 06:49:16 2013 -0700
Merge pull request #523 from andyk/master
[Fix broken link in Quick Start]
00c4d23 Tue Mar 12 22:19:00 2013 -0700
Merge pull request #518 from woggling/long-bm-sizes
[Send block sizes as longs in BlockManager updates]
cbf8f0d Mon Mar 11 00:23:57 2013 -0700
Merge pull request #513 from MLnick/bagel-caching
[Adds choice of persistence level to Bagel.]
91a9d09 Sun Mar 10 15:48:23 2013 -0700
Merge pull request #512 from patelh/fix-kryo-serializer
[Fix reference bug in Kryo serializer, add test, update version]
557cfd0 Sun Mar 10 15:44:57 2013 -0700
Merge pull request #515 from woggling/deploy-app-death
[Notify standalone deploy client of application death.]
04fb81f Sun Mar 3 17:20:07 2013 -0800
Merge pull request #506 from rxin/spark-706
[Fixed SPARK-706: Failures in block manager put leads to read task hanging.]
6cf4be4 Sun Mar 3 17:16:22 2013 -0800
Merge pull request #462 from squito/stageInfo
[Track assorted metrics for each task, report summaries to user at stage completion]
6bfc7ca Sat Mar 2 22:14:49 2013 -0800
Merge pull request #504 from mosharaf/master
[Worker address was getting removed when removing an app.]
94b3db1 Sat Mar 2 22:13:52 2013 -0800
Merge pull request #508 from markhamstra/TestServerInUse
[Avoid bind failure in InputStreamsSuite]
25c71d3 Fri Mar 1 08:00:18 2013 -0800
Merge pull request #507 from markhamstra/poms271
[bump version to 0.7.1-SNAPSHOT in the subproject poms]