Skip to content

update #1

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 578 commits into from
Mar 16, 2015
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
578 commits
Select commit Hold shift + click to select a range
31efb39
[Minor] fix typo in SQL document
CodingCat Feb 17, 2015
4611de1
[SPARK-5862][SQL] Only transformUp the given plan once in HiveMetasto…
viirya Feb 17, 2015
ac506b7
[Minor][SQL] Use same function to check path parameter in JSONRelation
viirya Feb 17, 2015
9d281fa
[SQL] [Minor] Update the HiveContext Unittest
chenghao-intel Feb 17, 2015
de4836f
[SPARK-5868][SQL] Fix python UDFs in HiveContext and checks in SQLCon…
marmbrus Feb 17, 2015
445a755
[SPARK-4172] [PySpark] Progress API in Python
Feb 17, 2015
3df85dc
[SPARK-5871] output explain in Python
Feb 17, 2015
4d4cc76
[SPARK-5872] [SQL] create a sqlCtx in pyspark shell
Feb 17, 2015
117121a
[SPARK-5852][SQL]Fail to convert a newly created empty metastore parq…
yhuai Feb 17, 2015
c3d2b90
[SPARK-5785] [PySpark] narrow dependency for cogroup/join in PySpark
Feb 18, 2015
ae6cfb3
[SPARK-5811] Added documentation for maven coordinates and added Spar…
brkyvz Feb 18, 2015
d46d624
[SPARK-4454] Properly synchronize accesses to DAGScheduler cacheLocs map
JoshRosen Feb 18, 2015
a51fc7e
[SPARK-4454] Revert getOrElse() cleanup in DAGScheduler.getCacheLocs()
JoshRosen Feb 18, 2015
d5f12bf
[SPARK-5875][SQL]logical.Project should not be resolved if it contain…
yhuai Feb 18, 2015
e50934f
[SPARK-5723][SQL]Change the default file format to Parquet for CTAS s…
yhuai Feb 18, 2015
3912d33
[SPARK-5731][Streaming][Test] Fix incorrect test in DirectKafkaStream…
tdas Feb 18, 2015
61ab085
[Minor] [SQL] Cleans up DataFrame variable names and toDF() calls
liancheng Feb 18, 2015
de0dd6d
Avoid deprecation warnings in JDBCSuite.
tmyklebu Feb 18, 2015
c1b6fa9
[SPARK-5878] fix DataFrame.repartition() in Python
Feb 18, 2015
e79a7a6
SPARK-4610 addendum: [Minor] [MLlib] Minor doc fix in GBT classificat…
MechCoder Feb 18, 2015
82197ed
[SPARK-4949]shutdownCallback in SparkDeploySchedulerBackend should be…
sarutak Feb 18, 2015
5aecdcf
SPARK-5669 [BUILD] [HOTFIX] Spark assembly includes incompatibly lice…
srowen Feb 18, 2015
85e9d09
[SPARK-5519][MLLIB] add user guide with example code for fp-growth
mengxr Feb 18, 2015
a8eb92d
[SPARK-5507] Added documentation for BlockMatrix
brkyvz Feb 18, 2015
f0e3b71
[SPARK-5840][SQL] HiveContext cannot be serialized due to tuple extra…
rxin Feb 18, 2015
aa8f10e
[SPARK-5722] [SQL] [PySpark] infer int as LongType
Feb 18, 2015
d12d2ad
[SPARK-5879][MLLIB] update PIC user guide and add a Java example
mengxr Feb 19, 2015
e945aa6
[SPARK-5846] Correctly set job description and pool for SQL jobs
kayousterhout Feb 19, 2015
fb87f44
SPARK-5548: Fix for AkkaUtilsSuite failure - attempt 2
jacek-lewandowski Feb 19, 2015
38e624a
[SPARK-5816] Add huge compatibility warning in DriverWrapper
Feb 19, 2015
90095bf
[SPARK-5423][Core] Cleanup resources in DiskMapIterator.finalize to e…
zsxwing Feb 19, 2015
94cdb05
[SPARK-5825] [Spark Submit] Remove the double checking instance name …
chenghao-intel Feb 19, 2015
8ca3418
[SPARK-5904][SQL] DataFrame API fixes.
rxin Feb 19, 2015
a5fed34
[SPARK-5902] [ml] Made PipelineStage.transformSchema public instead o…
jkbradley Feb 19, 2015
ad6b169
[Spark-5889] Remove pid file after stopping service.
zhzhan Feb 19, 2015
34b7c35
SPARK-4682 [CORE] Consolidate various 'Clock' classes
srowen Feb 19, 2015
6bddc40
SPARK-5570: No docs stating that `new SparkConf().set("spark.driver.m…
Feb 19, 2015
0cfd2ce
[SPARK-5900][MLLIB] make PIC and FPGrowth Java-friendly
mengxr Feb 20, 2015
3be92cd
[SPARK-4808] Removing minimum number of elements read before spill check
mccheah Feb 20, 2015
70bfb5c
[SPARK-5909][SQL] Add a clearCache command to Spark SQL's cache manager
yhuai Feb 20, 2015
d3dfebe
SPARK-5744 [CORE] Take 2. RDD.isEmpty / take fails for (empty) RDD of…
srowen Feb 20, 2015
4a17eed
[SPARK-5867] [SPARK-5892] [doc] [ml] [mllib] Doc cleanups for 1.3 rel…
jkbradley Feb 20, 2015
5b0a42c
[SPARK-5898] [SPARK-5896] [SQL] [PySpark] create DataFrame from pand…
Feb 20, 2015
e155324
[MLlib] fix typo
jackylk Feb 21, 2015
d3cbd38
SPARK-5841 [CORE] [HOTFIX 2] Memory leak in DiskBlockManager
nishkamravi2 Feb 21, 2015
7138816
[SPARK-5937][YARN] Fix ClientSuite to set YARN mode, so that the corr…
harishreedharan Feb 21, 2015
7683982
[SPARK-5860][CORE] JdbcRDD: overflow on large range with high number …
hotou Feb 21, 2015
46462ff
MAINTENANCE: Automated closing of pull requests.
pwendell Feb 22, 2015
a7f9039
[DOCS] Fix typo in API for custom InputFormats based on the “new” Map…
Feb 22, 2015
275b1be
[DataFrame] [Typo] Fix the typo
chenghao-intel Feb 22, 2015
e4f9d03
[SPARK-911] allow efficient queries for a range if RDD is partitioned…
aaronjosephs Feb 23, 2015
95cd643
[SPARK-3885] Provide mechanism to remove accumulators once they are n…
Feb 23, 2015
9348767
[EXAMPLES] fix typo.
fukuo33 Feb 23, 2015
757b14b
[SPARK-5943][Streaming] Update the test to use new API to reduce the …
jerryshao Feb 23, 2015
242d495
[SPARK-5724] fix the misconfiguration in AkkaUtils
CodingCat Feb 23, 2015
651a1c0
[SPARK-5939][MLLib] make FPGrowth example app take parameters
Feb 23, 2015
28ccf5e
[MLLIB] SPARK-5912 Programming guide for feature selection
avulanov Feb 23, 2015
59536cc
[SPARK-5912] [docs] [mllib] Small fixes to ChiSqSelector docs
jkbradley Feb 24, 2015
48376bf
[SPARK-5935][SQL] Accept MapType in the schema provided to a JSON dat…
yhuai Feb 24, 2015
1ed5708
[SPARK-5873][SQL] Allow viewing of partially analyzed plans in queryE…
marmbrus Feb 24, 2015
cf2e416
[SPARK-5958][MLLIB][DOC] update block matrix user guide
mengxr Feb 24, 2015
8403331
[SPARK-5968] [SQL] Suppresses ParquetOutputCommitter WARN logs
liancheng Feb 24, 2015
0a59e45
[SPARK-5910][SQL] Support for as in selectExpr
marmbrus Feb 24, 2015
2012366
[SPARK-5532][SQL] Repartition should not use external rdd representation
marmbrus Feb 24, 2015
64d2c01
[Spark-5967] [UI] Correctly clean JobProgressListener.stageIdToActive…
tdas Feb 24, 2015
6d2caa5
[SPARK-5965] Standalone Worker UI displays {{USER_JAR}}
Feb 24, 2015
105791e
[MLLIB] Change x_i to y_i in Variance's user guide
mengxr Feb 24, 2015
c5ba975
[Spark-5708] Add Slf4jSink to Spark Metrics
judynash Feb 24, 2015
a2b9137
[SPARK-5952][SQL] Lock when using hive metastore client
marmbrus Feb 24, 2015
da505e5
[SPARK-5973] [PySpark] fix zip with two RDDs with AutoBatchedSerializer
Feb 24, 2015
2a0fe34
[SPARK-5436] [MLlib] Validate GradientBoostedTrees using runWithValid…
MechCoder Feb 24, 2015
f816e73
[SPARK-5751] [SQL] [WIP] Revamped HiveThriftServer2Suite for robustness
liancheng Feb 25, 2015
53a1ebf
[SPARK-5904][SQL] DataFrame Java API test suites.
rxin Feb 25, 2015
fba11c2
[SPARK-5985][SQL] DataFrame sortBy -> orderBy in Python.
rxin Feb 25, 2015
922b43b
[SPARK-5993][Streaming][Build] Fix assembly jar location of kafka-ass…
tdas Feb 25, 2015
769e092
[SPARK-5286][SQL] SPARK-5286 followup
yhuai Feb 25, 2015
d641fbb
[SPARK-5994] [SQL] Python DataFrame documentation fixes
Feb 25, 2015
d51ed26
[SPARK-5666][streaming][MQTT streaming] some trivial fixes
prabeesh Feb 25, 2015
5b8480e
[GraphX] fixing 3 typos in the graphx programming guide
1123 Feb 25, 2015
dd077ab
[SPARK-5771] Number of Cores in Completed Applications of Standalone …
Feb 25, 2015
f84c799
[SPARK-5996][SQL] Fix specialized outbound conversions
marmbrus Feb 25, 2015
7d8e6a2
SPARK-5930 [DOCS] Documented default of spark.shuffle.io.retryWait is…
srowen Feb 25, 2015
a777c65
[SPARK-5970][core] Register directory created in getOrCreateLocalRoot…
foxik Feb 25, 2015
9f603fc
[SPARK-1955][GraphX]: VertexRDD can incorrectly assume index sharing
Feb 25, 2015
838a480
[SPARK-5982] Remove incorrect Local Read Time Metric
kayousterhout Feb 25, 2015
f3f4c87
[SPARK-5944] [PySpark] fix version in Python API docs
Feb 25, 2015
e0fdd46
[SPARK-6010] [SQL] Merging compatible Parquet schemas before computin…
liancheng Feb 25, 2015
12dbf98
[SPARK-5999][SQL] Remove duplicate Literal matching block
viirya Feb 25, 2015
41e2e5a
[SPARK-5926] [SQL] make DataFrame.explain leverage queryExecution.log…
yanboliang Feb 25, 2015
46a044a
[SPARK-1182][Docs] Sort the configuration parameters in configuration.md
Feb 26, 2015
d20559b
[SPARK-5974] [SPARK-5980] [mllib] [python] [docs] Update ML guide wit…
jkbradley Feb 26, 2015
e43139f
[SPARK-5976][MLLIB] Add partitioner to factors returned by ALS
mengxr Feb 26, 2015
51a6f90
[SPARK-5914] to run spark-submit requiring only user perm on windows
judynash Feb 26, 2015
f02394d
[SPARK-6023][SQL] ParquetConversions fails to replace the destination…
yhuai Feb 26, 2015
192e42a
[SPARK-6016][SQL] Cannot read the parquet table after overwriting the…
yhuai Feb 26, 2015
df3d559
[SPARK-5801] [core] Avoid creating nested directories.
Feb 26, 2015
2358657
[SPARK-6007][SQL] Add numRows param in DataFrame.show()
Feb 26, 2015
cfff397
[SPARK-6004][MLlib] Pick the best model when training GradientBoosted…
viirya Feb 26, 2015
7fa960e
[SPARK-5363] Fix bug in PythonRDD: remove() inside iterator is not safe
Feb 26, 2015
cd5c8d7
SPARK-4704 [CORE] SparkSubmitDriverBootstrap doesn't flush output
srowen Feb 26, 2015
10094a5
Modify default value description for spark.scheduler.minRegisteredRes…
li-zhihui Feb 26, 2015
8942b52
[SPARK-3562]Periodic cleanup event logs
viper-kun Feb 26, 2015
aa63f63
[SPARK-6027][SPARK-5546] Fixed --jar and --packages not working for K…
tdas Feb 26, 2015
5f3238b
[SPARK-6018] [YARN] NoSuchMethodError in Spark app is swallowed by YA…
Feb 26, 2015
3fb53c0
SPARK-4300 [CORE] Race condition during SparkWorker shutdown
srowen Feb 26, 2015
c871e2d
Add a note for context termination for History server on Yarn
moutai Feb 26, 2015
b38dec2
[SPARK-5951][YARN] Remove unreachable driver memory properties in yar…
mohitgoyal557 Feb 26, 2015
e60ad2f
SPARK-6045 RecordWriter should be checked against null in PairRDDFunc…
tedyu Feb 26, 2015
fbc4694
SPARK-4579 [WEBUI] Scheduling Delay appears negative
srowen Feb 27, 2015
18f2098
[SPARK-5529][CORE]Add expireDeadHosts in HeartbeatReceiver
shenh062326 Feb 27, 2015
4ad5153
[SPARK-6037][SQL] Avoiding duplicate Parquet schema merging
viirya Feb 27, 2015
5e5ad65
[SPARK-6024][SQL] When a data source table has too many columns, it's…
yhuai Feb 27, 2015
12135e9
[SPARK-5771][UI][hotfix] Change Requested Cores into * if default cor…
jerryshao Feb 27, 2015
67595eb
[SPARK-5495][UI] Add app and driver kill function in master web UI
jerryshao Feb 27, 2015
4a8a0a8
SPARK-2168 [Spark core] Use relative URIs for the app links in the Hi…
elyast Feb 27, 2015
7c99a01
[SPARK-6046] Privatize SparkConf.translateConfKey
Feb 27, 2015
0375a41
fix spark-6033, clarify the spark.worker.cleanup behavior in standalo…
Feb 27, 2015
8cd1692
[SPARK-6036][CORE] avoid race condition between eventlogListener and …
liyezhang556520 Feb 27, 2015
e747e98
[SPARK-6058][Yarn] Log the user class exception in ApplicationMaster
zsxwing Feb 27, 2015
57566d0
[SPARK-6059][Yarn] Add volatile to ApplicationMaster's reporterThread…
zsxwing Feb 27, 2015
d17cb2b
[SPARK-4587] [mllib] [docs] Fixed save,load calls in ML guide examples
jkbradley Feb 27, 2015
5f7f3b9
[Streaming][Minor] Remove useless type signature of Java Kafka direct…
jerryshao Feb 27, 2015
8c468a6
[SPARK-5751] [SQL] Sets SPARK_HOME as SPARK_PID_DIR when running Thri…
liancheng Feb 28, 2015
e0e64ba
[SPARK-6055] [PySpark] fix incorrect __eq__ of DataType
Feb 28, 2015
dba08d1
[SPARK-6070] [yarn] Remove unneeded classes from shuffle service jar.
Feb 28, 2015
6d8e5fb
[SPARK-5979][SPARK-6032] Smaller safer --packages fix
brkyvz Feb 28, 2015
9168259
MAINTENANCE: Automated closing of pull requests.
pwendell Feb 28, 2015
e6003f0
[SPARK-5775] [SQL] BugFix: GenericRow cannot be cast to SpecificMutab…
liancheng Feb 28, 2015
b36b1bc
SPARK-6063 MLlib doesn't pass mvn scalastyle check due to UTF chars i…
msjgriffiths Feb 28, 2015
f91298e
SPARK-5983 [WEBUI] Don't respond to HTTP TRACE in HTTP-based UIs
srowen Feb 28, 2015
86fcdae
SPARK-1965 [WEBUI] Spark UI throws NPE on trying to load the app page…
srowen Feb 28, 2015
643300a
SPARK-5984: Fix TimSort bug causes ArrayOutOfBoundsException
hotou Mar 1, 2015
2df5f1f
[SPARK-6075] Fix bug in that caused lost accumulator updates: do not …
JoshRosen Mar 1, 2015
fd8d283
[SPARK-6074] [sql] Package pyspark sql bindings.
Mar 1, 2015
aedbbaa
[SPARK-6053][MLLIB] support save/load in PySpark's ALS
mengxr Mar 2, 2015
3f00bb3
[SPARK-6083] [MLLib] [DOC] Make Python API example consistent in Naiv…
MechCoder Mar 2, 2015
d8fb40e
[Streaming][Minor]Fix some error docs in streaming examples
jerryshao Mar 2, 2015
948c239
SPARK-3357 [CORE] Internal log messages should be set at DEBUG level …
srowen Mar 2, 2015
49c7a8f
[SPARK-6103][Graphx]remove unused class to import in EdgeRDDImpl
lianhuiwang Mar 2, 2015
39a54b4
[SPARK-6073][SQL] Need to refresh metastore cache after append data i…
yhuai Mar 2, 2015
3efd8bb
[SPARK-6052][SQL]In JSON schema inference, we should always set conta…
yhuai Mar 2, 2015
95ac68b
[SPARK-6111] Fixed usage string in documentation.
kennethmyers-ibm Mar 2, 2015
9ce12aa
[SPARK-5741][SQL] Support the path contains comma in HiveContext
watermen Mar 2, 2015
e7d8ae4
aggregateMessages example in graphX doc
Mar 2, 2015
af2effd
[SPARK-6080] [PySpark] correct LogisticRegressionWithLBFGS regType pa…
yanboliang Mar 2, 2015
d9a8bae
[DOCS] Refactored Dataframe join comment to use correct parameter ord…
Mar 2, 2015
0b472f6
SPARK-5390 [DOCS] Encourage users to post on Stack Overflow in Commun…
srowen Mar 2, 2015
3f9def8
[Minor] Fix doc typo for describing primitiveTerm effectiveness condi…
viirya Mar 2, 2015
582e5a2
[SPARK-6040][SQL] Fix the percent bug in tablesample
watermen Mar 2, 2015
6b348d9
[SPARK-6050] [yarn] Relax matching of vcore count in received contain…
Mar 2, 2015
26c1c56
[SPARK-5522] Accelerate the Histroty Server start
Mar 2, 2015
8223ce6
[SPARK-6114][SQL] Avoid metastore conversions before plan is resolved
marmbrus Mar 3, 2015
1a49496
[SPARK-6082] [SQL] Provides better error message for malformed rows w…
liancheng Mar 3, 2015
6776cb3
[SPARK-6066] Make event log format easier to parse
Mar 3, 2015
258d154
[SPARK-6048] SparkConf should not translate deprecated configs on set
Mar 3, 2015
e3a88d1
[SPARK-4777][CORE] Some block memory after unrollSafely not count int…
suyanNone Mar 3, 2015
2db6a85
[SPARK-6121][SQL][MLLIB] simpleString for UDT
mengxr Mar 3, 2015
9d6c5ae
[SPARK-5537] Add user guide for multinomial logistic regression
mengxr Mar 3, 2015
9eb22ec
[SPARK-6127][Streaming][Docs] Add Kafka to Python api docs
tdas Mar 3, 2015
1259994
[SPARK-5950][SQL]Insert array into a metastore table saved as parquet…
yhuai Mar 3, 2015
54d1968
[SPARK-5310][SQL] Fixes to Docs and Datasources API
rxin Mar 3, 2015
7e53a79
[SPARK-6097][MLLIB] Support tree model save/load in PySpark/MLlib
mengxr Mar 3, 2015
c2fe3a6
[SPARK-6120] [mllib] Warnings about memory in tree, ensemble model save
jkbradley Mar 3, 2015
b196056
[SPARK-5537][MLlib][Docs] Add user guide for multinomial logistic reg…
Mar 3, 2015
165ff36
HOTFIX: Bump HBase version in MapR profiles.
pwendell Mar 3, 2015
0c9a8ea
BUILD: Minor tweaks to internal build scripts
pwendell Mar 3, 2015
975643c
[SPARK-6118] making package name of deploy.worker.CommandUtils and de…
CodingCat Mar 3, 2015
e359794
[SPARK-6138][CORE][minor] enhance the `toArray` method in `SizeTracki…
cloud-fan Mar 3, 2015
9af0017
Revert "[SPARK-5423][Core] Cleanup resources in DiskMapIterator.final…
Mar 3, 2015
e750a6b
SPARK-1911 [DOCS] Warn users if their assembly jars are not built wit…
srowen Mar 3, 2015
fe63e82
[SPARK-6132] ContextCleaner race condition across SparkContexts
Mar 3, 2015
6c20f35
[SPARK-6133] Make sc.stop() idempotent
Mar 3, 2015
1f1fccc
[SPARK-5949] HighlyCompressedMapStatus needs more classes registered …
squito Mar 3, 2015
d334bfb
[SPARK-6132][HOTFIX] ContextCleaner InterruptedException should be quiet
Mar 4, 2015
76e20a0
[SPARK-6141][MLlib] Upgrade Breeze from 0.10 to 0.11 to fix convergen…
mengxr Mar 4, 2015
8d3e241
SPARK-6085 Increase default value for memory overhead
tedyu Mar 4, 2015
418f38d
[SPARK-3355][Core]: Allow running maven tests in run-tests
Mar 4, 2015
76b472f
[SPARK-6136] [SQL] Removed JDBC integration tests which depends on do…
liancheng Mar 4, 2015
aef8a84
[SPARK-6134][SQL] Fix wrong datatype for casting FloatType and defaul…
viirya Mar 4, 2015
f6773ed
[SPARK-6107][CORE] Display inprogress application information for eve…
liyezhang556520 Mar 4, 2015
3a35a0d
[SPARK-6144] [core] Fix addFile when source files are on "hdfs:"
Mar 4, 2015
1aa90e3
[SPARK-6149] [SQL] [Build] Excludes Guava 15 referenced by jackson-mo…
liancheng Mar 5, 2015
7ac072f
SPARK-5143 [BUILD] [WIP] spark-network-yarn 2.11 depends on spark-net…
srowen Mar 5, 2015
e06c7df
[SPARK-6153] [SQL] promote guava dep for hive-thriftserver
adrian-wang Mar 5, 2015
c9cfba0
SPARK-6182 [BUILD] spark-parent pom needs to be published for both 2.…
srowen Mar 5, 2015
0bfacd5
[SPARK-6090][MLLIB] add a basic BinaryClassificationMetrics to PySpar…
mengxr Mar 5, 2015
424a86a
[SPARK-6175] Fix standalone executor log links when ephemeral ports o…
JoshRosen Mar 5, 2015
5873c71
[SPARK-6145][SQL] fix ORDER BY on nested fields
cloud-fan Mar 5, 2015
1b4bb25
[SPARK-6163][SQL] jsonFile should be backed by the data source API
yhuai Mar 5, 2015
eb48fd6
[SQL] Make Strategies a public developer API
marmbrus Mar 5, 2015
d8b3da9
[CORE, DEPLOY][minor] align arguments order with docs of worker
liyezhang556520 Mar 6, 2015
cd7594c
[core] [minor] Don't pollute source directory when running UtilsSuite.
Mar 6, 2015
05cb6b3
[Minor] Resolve sbt warnings: postfix operator second should be enabled
witgo Mar 6, 2015
dba0b2e
[SPARK-6178][Shuffle] Removed unused imports
Mar 6, 2015
48a723c
Fix python typo (+ Scala, Java typos)
robertzk Mar 7, 2015
2646794
[EC2] Reorder print statements on termination
nchammas Mar 7, 2015
729c05b
[Minor]fix the wrong description
WangTaoTheTonic Mar 7, 2015
334c5bd
[SPARK-5641] [EC2] Allow spark_ec2.py to copy arbitrary files to cluster
florianverhein Mar 7, 2015
52ed7da
[SPARK-6193] [EC2] Push group filter up to EC2
nchammas Mar 8, 2015
f16b7b0
SPARK-6205 [CORE] UISeleniumSuite fails for Hadoop 2.x test with NoCl…
srowen Mar 8, 2015
55b1b32
[GraphX] Improve LiveJournalPageRank example
Mar 8, 2015
f7c7992
[EC2] [SPARK-6188] Instance types can be mislabeled when re-starting …
thvasilo Mar 9, 2015
70f8814
[Docs] Replace references to SchemaRDD with DataFrame
rxin Mar 9, 2015
3cac199
[SPARK-5310][Doc] Update SQL Programming Guide to include DataFrames.
rxin Mar 9, 2015
8767565
[SPARK-6194] [SPARK-677] [PySpark] fix memory leak in collect()
Mar 9, 2015
9a0272f
[SPARK-6177][MLlib]Add note in LDA example to remind possible coalesce
hhbyyh Mar 10, 2015
c4c4b07
[SPARK-6087][CORE] Provide actionable exception if Kryo buffer is not…
levkhomich Mar 10, 2015
d14df06
[SPARK-6191] [EC2] Generalize ability to download libs
nchammas Mar 10, 2015
7c7d2d5
[SPARK-6186] [EC2] Make Tachyon version configurable in EC2 deploymen…
Mar 10, 2015
74fb433
Minor doc: Remove the extra blank line in data types javadoc.
rxin Mar 11, 2015
2672374
[SPARK-5183][SQL] Update SQL Docs with JDBC and Migration Guide
marmbrus Mar 11, 2015
2d4e00e
[SPARK-5986][MLLib] Add save/load for k-means
yinxusen Mar 11, 2015
517975d
[SPARK-4924] Add a library for launching Spark jobs programmatically.
Mar 11, 2015
35b2564
[MINOR] [DOCS] Fix map -> mapToPair in Streaming Java example
srowen Mar 11, 2015
40f4979
[SQL][Minor] fix typo in comments
liuhb86 Mar 11, 2015
ec30c17
[SPARK-6279][Streaming]In KafkaRDD.scala, Miss expressions flag "s" a…
zzcclp Mar 11, 2015
6e94c4e
SPARK-6225 [CORE] [SQL] [STREAMING] Resolve most build warnings, 1.3.…
srowen Mar 11, 2015
5b335bd
[SPARK-6228] [network] Move SASL classes from network/shuffle to netw…
Mar 11, 2015
548643a
[SPARK-4423] Improve foreach() documentation to avoid confusion betwe…
Mar 11, 2015
2d87a41
SPARK-3642. Document the nuances of shared variables.
sryza Mar 11, 2015
55c4831
SPARK-6245 [SQL] jsonRDD() of empty RDD results in exception
srowen Mar 11, 2015
51a79a7
[SPARK-6274][Streaming][Examples] Added examples streaming + sql exam…
tdas Mar 11, 2015
cd3b68d
[SPARK-6128][Streaming][Documentation] Updates to Spark Streaming Pro…
tdas Mar 12, 2015
e921a66
BUILD: Adding more known contributor names
pwendell Mar 12, 2015
25b71d8
[SPARK-6296] [SQL] Added equals to Column
vlyubin Mar 12, 2015
712679a
[SPARK-6294] fix hang when call take() in JVM on PythonRDD
Mar 12, 2015
0cba802
[SPARK-5814][MLLIB][GRAPHX] Remove JBLAS from runtime
mengxr Mar 12, 2015
4e47d54
[docs] [SPARK-6306] Readme points to dead link
Mar 12, 2015
304366c
[SPARK-6275][Documentation]Miss toDF() function in docs/sql-programmi…
zzcclp Mar 12, 2015
8f1bc79
[build] [hotfix] Fix make-distribution.sh for Scala 2.11.
Mar 12, 2015
fb4787c
[SPARK-6268][MLlib] KMeans parameter getter methods
hhbyyh Mar 12, 2015
a4b2716
[SPARK-4588] ML Attributes
mengxr Mar 12, 2015
17c309c
[mllib] [python] Add LassoModel to __all__ in regression.py
jkbradley Mar 12, 2015
3980ebd
HOTFIX: Changes to release script.
pwendell Mar 13, 2015
0af9ea7
[SPARK-5845][Shuffle] Time to cleanup spilled shuffle files not inclu…
Mar 13, 2015
69ff8e8
[SPARK-5310] [SQL] [DOC] Parquet section for the SQL programming guide
liancheng Mar 13, 2015
9048e81
[SPARK-6197][CORE] handle json exception when hisotry file not finish…
liyezhang556520 Mar 13, 2015
ea3d2ee
[CORE][minor] remove unnecessary ClassTag in `DAGScheduler`
cloud-fan Mar 13, 2015
dc4abd4
[SPARK-6252] [mllib] Added getLambda to Scala NaiveBayes
jkbradley Mar 13, 2015
7f13434
[SPARK-6278][MLLIB] Mention the change of objective in linear regression
mengxr Mar 13, 2015
b943f5d
[SPARK-4600][GraphX]: org.apache.spark.graphx.VertexRDD.diff does not…
Mar 13, 2015
cdc34ed
[SPARK-6285] [SQL] Removes unused ParquetTestData and duplicated Test…
liancheng Mar 13, 2015
e360d5e
[SPARK-6317][SQL]Fixed HIVE console startup issue
vinodkc Mar 13, 2015
b38e073
[SPARK-6210] [SQL] use prettyString as column name in agg()
Mar 14, 2015
ee15404
[SQL]Delete some dupliate code in HiveThriftServer2
ArcherShao Mar 14, 2015
5be6b0e
[SPARK-6195] [SQL] Adds in-memory column type for fixed-precision dec…
liancheng Mar 14, 2015
127268b
[SPARK-6329][Docs]: Minor doc changes for Mesos and TOC
Mar 14, 2015
c49d156
[SPARK-5790][GraphX]: VertexRDD's won't zip properly for `diff` capab…
Mar 14, 2015
62ede53
[SPARK-6285][SQL]Remove ParquetTestData in SparkBuild.scala and in RE…
OopsOutOfMemory Mar 15, 2015
aa6536f
[SPARK-3619] Part 2. Upgrade to Mesos 0.21 to work around MESOS-1688
jongyoul Mar 15, 2015
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
1 change: 1 addition & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -6,6 +6,7 @@
*.iml
*.iws
*.pyc
*.pyo
.idea/
.idea_modules/
build/*.jar
Expand Down
2 changes: 2 additions & 0 deletions .rat-excludes
Original file line number Diff line number Diff line change
@@ -1,4 +1,5 @@
target
cache
.gitignore
.gitattributes
.project
Expand All @@ -18,6 +19,7 @@ fairscheduler.xml.template
spark-defaults.conf.template
log4j.properties
log4j.properties.template
metrics.properties
metrics.properties.template
slaves
slaves.template
Expand Down
16 changes: 16 additions & 0 deletions LICENSE
Original file line number Diff line number Diff line change
Expand Up @@ -771,6 +771,22 @@ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

========================================================================
For TestTimSort (core/src/test/java/org/apache/spark/util/collection/TestTimSort.java):
========================================================================
Copyright (C) 2015 Stijn de Gouw

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

========================================================================
For LimitedInputStream
Expand Down
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -26,7 +26,7 @@ To build Spark and its example programs, run:

(You do not need to do this if you downloaded a pre-built package.)
More detailed documentation is available from the project site, at
["Building Spark with Maven"](http://spark.apache.org/docs/latest/building-spark.html).
["Building Spark"](http://spark.apache.org/docs/latest/building-spark.html).

## Interactive Scala Shell

Expand Down Expand Up @@ -85,7 +85,7 @@ storage systems. Because the protocols have changed in different versions of
Hadoop, you must build Spark against the same version that your cluster runs.

Please refer to the build documentation at
["Specifying the Hadoop Version"](http://spark.apache.org/docs/latest/building-with-maven.html#specifying-the-hadoop-version)
["Specifying the Hadoop Version"](http://spark.apache.org/docs/latest/building-spark.html#specifying-the-hadoop-version)
for detailed guidance on building for a particular distribution of Hadoop, including
building for particular Hive and Hive Thriftserver distributions. See also
["Third Party Hadoop Distributions"](http://spark.apache.org/docs/latest/hadoop-third-party-distributions.html)
Expand Down
113 changes: 1 addition & 112 deletions assembly/pom.xml
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,7 @@
<modelVersion>4.0.0</modelVersion>
<parent>
<groupId>org.apache.spark</groupId>
<artifactId>spark-parent</artifactId>
<artifactId>spark-parent_2.10</artifactId>
<version>1.3.0-SNAPSHOT</version>
<relativePath>../pom.xml</relativePath>
</parent>
Expand All @@ -36,10 +36,6 @@
<spark.jar.dir>scala-${scala.binary.version}</spark.jar.dir>
<spark.jar.basename>spark-assembly-${project.version}-hadoop${hadoop.version}.jar</spark.jar.basename>
<spark.jar>${project.build.directory}/${spark.jar.dir}/${spark.jar.basename}</spark.jar>
<deb.pkg.name>spark</deb.pkg.name>
<deb.install.path>/usr/share/spark</deb.install.path>
<deb.user>root</deb.user>
<deb.bin.filemode>744</deb.bin.filemode>
</properties>

<dependencies>
Expand Down Expand Up @@ -217,113 +213,6 @@
</plugins>
</build>
</profile>
<profile>
<id>deb</id>
<build>
<plugins>
<plugin>
<groupId>org.codehaus.mojo</groupId>
<artifactId>buildnumber-maven-plugin</artifactId>
<version>1.2</version>
<executions>
<execution>
<phase>validate</phase>
<goals>
<goal>create</goal>
</goals>
<configuration>
<shortRevisionLength>8</shortRevisionLength>
</configuration>
</execution>
</executions>
</plugin>
<plugin>
<groupId>org.vafer</groupId>
<artifactId>jdeb</artifactId>
<version>0.11</version>
<executions>
<execution>
<phase>package</phase>
<goals>
<goal>jdeb</goal>
</goals>
<configuration>
<deb>${project.build.directory}/${deb.pkg.name}_${project.version}-${buildNumber}_all.deb</deb>
<attach>false</attach>
<compression>gzip</compression>
<dataSet>
<data>
<src>${spark.jar}</src>
<type>file</type>
<mapper>
<type>perm</type>
<user>${deb.user}</user>
<group>${deb.user}</group>
<prefix>${deb.install.path}/jars</prefix>
</mapper>
</data>
<data>
<src>${basedir}/src/deb/RELEASE</src>
<type>file</type>
<mapper>
<type>perm</type>
<user>${deb.user}</user>
<group>${deb.user}</group>
<prefix>${deb.install.path}</prefix>
</mapper>
</data>
<data>
<src>${basedir}/../conf</src>
<type>directory</type>
<mapper>
<type>perm</type>
<user>${deb.user}</user>
<group>${deb.user}</group>
<prefix>${deb.install.path}/conf</prefix>
<filemode>744</filemode>
</mapper>
</data>
<data>
<src>${basedir}/../bin</src>
<type>directory</type>
<mapper>
<type>perm</type>
<user>${deb.user}</user>
<group>${deb.user}</group>
<prefix>${deb.install.path}/bin</prefix>
<filemode>${deb.bin.filemode}</filemode>
</mapper>
</data>
<data>
<src>${basedir}/../sbin</src>
<type>directory</type>
<mapper>
<type>perm</type>
<user>${deb.user}</user>
<group>${deb.user}</group>
<prefix>${deb.install.path}/sbin</prefix>
<filemode>744</filemode>
</mapper>
</data>
<data>
<src>${basedir}/../python</src>
<type>directory</type>
<mapper>
<type>perm</type>
<user>${deb.user}</user>
<group>${deb.user}</group>
<prefix>${deb.install.path}/python</prefix>
<filemode>744</filemode>
</mapper>
</data>
</dataSet>
</configuration>
</execution>
</executions>
</plugin>
</plugins>
</build>
</profile>
<profile>
<id>kinesis-asl</id>
<dependencies>
Expand Down
2 changes: 0 additions & 2 deletions assembly/src/deb/RELEASE

This file was deleted.

8 changes: 0 additions & 8 deletions assembly/src/deb/control/control

This file was deleted.

2 changes: 1 addition & 1 deletion bagel/pom.xml
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,7 @@
<modelVersion>4.0.0</modelVersion>
<parent>
<groupId>org.apache.spark</groupId>
<artifactId>spark-parent</artifactId>
<artifactId>spark-parent_2.10</artifactId>
<version>1.3.0-SNAPSHOT</version>
<relativePath>../pom.xml</relativePath>
</parent>
Expand Down
124 changes: 0 additions & 124 deletions bin/compute-classpath.cmd

This file was deleted.

Loading