[SPARK-14579][SQL]Fix the race condition in StreamExecution.processAllAvailable again #12582

zsxwing · 2016-04-21T20:18:28Z

What changes were proposed in this pull request?

#12339 didn't fix the race condition. MemorySinkSuite is still flaky: https://amplab.cs.berkeley.edu/jenkins/job/spark-master-test-maven-hadoop-2.2/814/testReport/junit/org.apache.spark.sql.streaming/MemorySinkSuite/registering_as_a_table/

Here is an execution order to reproduce it.

| Time | Thread 1 | MicroBatchThread |
|:----:|:--------:|:----------------:|
| 1 | | `MemorySink.getOffset` |
| 2 | | availableOffsets ++= newData (availableOffsets is not changed here) |
| 3 | addData(newData) | |
| 4 | Set `noNewData` to `false` in processAllAvailable | |
| 5 | | `dataAvailable` returns `false` |
| 6 | | noNewData = true |
| 7 | `noNewData` is true so just return | |
| 8 | assert results and fail | |
| 9 | | `dataAvailable` returns true so process the new batch |

This PR expands the scope of `awaitBatchLock.synchronized` to eliminate the above race.
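To make the idea concrete, here is a minimal sketch of the widened critical section. It is a toy model, not the actual `StreamExecution` code: `ToySource`, `ToyExecution`, `availableOffset`, `committedOffset`, and the `notifyAll`/`wait(100)` wake-up scheme are all assumptions made for illustration. Only the names `awaitBatchLock`, `noNewData`, `processAllAvailable`, `addData`, and `getOffset` come from the description above. The point it tries to show is that fetching the offset and setting `noNewData = true` happen under the same lock, so a caller cannot reset `noNewData` between the two steps.

```scala
import scala.collection.mutable.ArrayBuffer

object ProcessAllAvailableSketch {

  // Hypothetical stand-in for a streaming source (not Spark's real class).
  final class ToySource {
    private val rows = ArrayBuffer.empty[Int]
    def addData(xs: Int*): Unit = synchronized { rows ++= xs }
    def getOffset: Int = synchronized { rows.size }
    def slice(from: Int, until: Int): Seq[Int] =
      synchronized { rows.slice(from, until).toList }
  }

  // Hypothetical, heavily simplified model of the micro-batch loop.
  final class ToyExecution(source: ToySource) {
    private val awaitBatchLock = new Object
    private var noNewData = false        // guarded by awaitBatchLock
    private var availableOffset = 0      // used only by the batch thread
    private var committedOffset = 0      // used only by the batch thread
    private val sink = ArrayBuffer.empty[Int] // guarded by sink itself
    @volatile private var running = true

    private val batchThread = new Thread(new Runnable {
      def run(): Unit = while (running) {
        // Widened critical section: the offset is fetched and noNewData is
        // set under the same lock, so steps 3-4 of the table cannot slip in
        // between the stale offset read and `noNewData = true`.
        val hasNewData = awaitBatchLock.synchronized {
          availableOffset = source.getOffset
          if (availableOffset > committedOffset) true
          else { noNewData = true; awaitBatchLock.notifyAll(); false }
        }
        if (hasNewData) {
          val batch = source.slice(committedOffset, availableOffset)
          sink.synchronized { sink ++= batch }
          committedOffset = availableOffset
        } else Thread.sleep(1)
      }
    })
    batchThread.setDaemon(true)

    def start(): Unit = batchThread.start()
    def stop(): Unit = { running = false; batchThread.join() }
    def results: Seq[Int] = sink.synchronized { sink.toList }

    // Blocks until the batch thread has observed "no new data" after this call.
    def processAllAvailable(): Unit = awaitBatchLock.synchronized {
      noNewData = false
      while (!noNewData) awaitBatchLock.wait(100)
    }
  }

  def main(args: Array[String]): Unit = {
    val source = new ToySource
    val exec = new ToyExecution(source)
    exec.start()
    for (i <- 1 to 200) {
      source.addData(i)
      exec.processAllAvailable()
      // Every row added before the call must already be in the sink.
      assert(exec.results == (1 to i).toList)
    }
    exec.stop()
  }
}
```

In this model, if the offset read were moved outside `awaitBatchLock.synchronized`, the interleaving in the table becomes possible again: the batch thread could read a stale offset, the caller could add data and reset `noNewData`, and the batch thread could then set `noNewData = true` without having seen the new rows.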

How was this patch tested?

Tested with `test("stress test")`, which always failed before this patch and passes with it applied. This test is ignored in the PR because it takes several minutes to finish.

zsxwing · 2016-04-21T20:18:46Z

cc @marmbrus

SparkQA · 2016-04-21T21:45:48Z

Test build #56583 has finished for PR 12582 at commit eddf9fd.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

marmbrus · 2016-04-21T23:36:46Z

You can add the test and mark it @Ignore.

zsxwing · 2016-04-22T00:04:31Z

Added the stress test

SparkQA · 2016-04-22T01:30:11Z

Test build #56612 has finished for PR 12582 at commit 6fadd0f.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

zsxwing · 2016-04-22T18:28:45Z

cc @tdas

zsxwing · 2016-05-02T17:04:15Z

ping @marmbrus

marmbrus · 2016-05-02T18:28:05Z

Thanks, merging to master and 2.0

…llAvailable again

Author: Shixiong Zhu <[email protected]>

Closes #12582 from zsxwing/SPARK-14579-2.

(cherry picked from commit a35a67a)
Signed-off-by: Michael Armbrust <[email protected]>

Fix the race condition in StreamExecution.processAllAvailable again

eddf9fd

Add stress test

6fadd0f

asfgit closed this in a35a67a May 2, 2016

zsxwing deleted the SPARK-14579-2 branch May 2, 2016 18:51
