WIP - [SPARK-10816][SS] Support session window natively #22482

HeartSaVioR · 2018-09-20T04:53:09Z

What changes were proposed in this pull request?

This patch proposes native support for session windows, similar to the support Spark already provides for time windows.

Please refer to the doc attached to SPARK-10816 for more details on the rationale, concepts, limitations, etc.

From the end user's point of view, the only change is the addition of a "session" SQL function. End users can define a query with a session window by replacing the "window" function with the "session" function, and the "window" column with the "session" column. After that, the patch provides the same experience as with time windows.

Internally, this patch changes the physical plan of aggregation a bit: if the session function is used in a query, it sorts the input rows by "grouping keys" + "session", and merges overlapping sessions into one while applying the aggregations. So it works like a sort-based aggregation, but the unit of grouping is grouping keys + session.
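To make the sort-and-merge idea concrete, here is a minimal sketch in plain Python. It is not the patch's code: the row shape `(key, event_time)`, the `merge_sessions` name, the count aggregate, and the rule that sessions merge only when they strictly overlap are all assumptions made for illustration.

```python
def merge_sessions(rows, gap):
    """Return (key, session_start, session_end, count) for each merged session.

    rows: iterable of (key, event_time); hypothetical shape for this sketch.
    gap:  session gap; each row starts as its own session [t, t + gap).
    """
    # Sort by grouping key, then by session start.
    sessions = sorted((key, t, t + gap) for key, t in rows)
    merged = []
    for key, start, end in sessions:
        last = merged[-1] if merged else None
        # Assumption: a session merges only if it starts before the previous ends.
        if last and last[0] == key and start < last[2]:
            last[2] = max(last[2], end)   # extend the current session
            last[3] += 1                  # fold the row into the aggregate (a count)
        else:
            merged.append([key, start, end, 1])
    return [tuple(m) for m in merged]
```

With a gap of 10, rows for key "a" at times 1 and 5 collapse into one session (1, 15), while a row at time 30 starts a new one.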

To handle late events, there are cases where multiple session windows that are not yet evicted co-exist per key. This patch handles such cases by borrowing the state implementation from streaming join, which can handle multiple values for a given key.
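As a rough illustration of keeping several open sessions per key, the sketch below uses an in-memory dictionary. The class name `MultiValueState`, its methods, and the eviction rule (a session is evicted once its end is at or before the watermark) are hypothetical and are not the state store API used by the patch.

```python
from collections import defaultdict


class MultiValueState:
    """Toy stand-in for a state store that keeps several sessions per key."""

    def __init__(self):
        # key -> list of (start, end, aggregate), kept sorted by start
        self._sessions = defaultdict(list)

    def put(self, key, session):
        self._sessions[key].append(session)
        self._sessions[key].sort()

    def get(self, key):
        return list(self._sessions.get(key, []))

    def evict(self, watermark):
        """Remove and return sessions that ended at or before the watermark."""
        evicted = []
        for key in list(self._sessions):
            closed = [s for s in self._sessions[key] if s[1] <= watermark]
            still_open = [s for s in self._sessions[key] if s[1] > watermark]
            evicted.extend((key, *s) for s in closed)
            if still_open:
                self._sessions[key] = still_open
            else:
                del self._sessions[key]
        return evicted
```

Two sessions for the same key can sit in state side by side until the watermark passes the end of the earlier one.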

How was this patch tested?

Many UTs are added to verify session window queries for both batch and streaming.

SparkQA · 2018-09-20T05:00:03Z

Test build #96321 has finished for PR 22482 at commit fb19879.

This patch fails Scala style tests.
This patch merges cleanly.
This patch adds no public classes.

HeartSaVioR · 2018-09-20T05:03:46Z

The patch is fairly large, so I'm not sure whether it would be better to squash the commits into one before reviewing.

Two TODOs are left, hence marking the patch as WIP, but it's close to being a complete patch:

Optimal implementation of state for session window.

It borrows the state implementation from streaming join since that fits the concept of state needed for session windows, but it may not be the optimal one, so I'm going to see whether we can have a better implementation.

Javadoc (Maybe structured streaming guide doc too?)

I haven't added javadoc yet, to speed up the POC and actual development, but to complete the patch I guess I need to write javadoc for the new classes and (maybe) their methods.

SparkQA · 2018-09-20T05:20:49Z

Test build #96323 has finished for PR 22482 at commit 0072ebe.

This patch fails Python style tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2018-09-20T05:31:18Z

Test build #96324 has finished for PR 22482 at commit 7d8371c.

This patch fails Python style tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2018-09-20T07:05:01Z

Test build #96328 has finished for PR 22482 at commit ad0b746.

This patch fails due to an unknown error code, -9.
This patch merges cleanly.
This patch adds no public classes.

HeartSaVioR · 2018-09-20T07:20:10Z

retest this, please

SparkQA · 2018-09-20T10:35:21Z

Test build #96336 has finished for PR 22482 at commit ad0b746.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

HeartSaVioR · 2018-09-20T11:33:27Z

retest this, please

HeartSaVioR · 2018-09-20T11:48:41Z

Please review the general approach and direction first. I'm planning to spend time rewriting the streaming part to integrate the logic tightly with state, so that state updates are minimized.

SparkQA · 2018-09-20T14:43:18Z

Test build #96348 has finished for PR 22482 at commit ad0b746.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

arunmahadevan · 2018-09-20T17:21:53Z

+1 for the idea to provide native session window support.

On the approach, it would be ideal if all windowing aggregations could be handled via a single plan and state store (vs. the separate plan and state store the patch proposes for session windows). The underlying steps are more or less the same for fixed, session and sliding windows. The sort/merge operations should be part of a window merge function rather than the plan itself.

K,Values -> AssignWindows (produces [k, v, timestamp, window]) -> GroupByKey (shuffle) -> MergeWindows (optional step) -> GroupWindows -> aggregate values.
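The pipeline above can be sketched in plain Python to show how one set of steps could serve both fixed and session windows. Everything here is an illustrative assumption rather than Spark code: the helper names `tumbling`, `session`, `merge_windows` and `run`, the `(key, value, ts)` record shape, and the use of a sum as the aggregate.

```python
from itertools import groupby


def tumbling(size):
    # Fixed window: each timestamp falls into exactly one aligned bucket.
    return lambda ts: [(ts - ts % size, ts - ts % size + size)]


def session(gap):
    # Session window: each event opens a provisional window [ts, ts + gap).
    return lambda ts: [(ts, ts + gap)]


def merge_windows(windows):
    # MergeWindows: collapse overlapping windows (input sorted by start).
    merged = []
    for start, end in windows:
        if merged and start < merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def run(records, assign, merge):
    """records: iterable of (key, value, ts). Returns {(key, start, end): sum}."""
    out = {}
    by_key = sorted(records, key=lambda r: r[0])           # stands in for the shuffle
    for key, rows in groupby(by_key, key=lambda r: r[0]):  # GroupByKey
        rows = list(rows)
        # AssignWindows: every row gets the window(s) chosen by `assign`.
        windows = sorted({w for _, _, ts in rows for w in assign(ts)})
        if merge:
            windows = merge_windows(windows)                # MergeWindows (optional)
        for start, end in windows:                          # GroupWindows + aggregate
            out[(key, start, end)] = sum(v for _, v, ts in rows if start <= ts < end)
    return out
```

Calling `run(records, session(10), merge=True)` exercises the optional merge step, while `run(records, tumbling(10), merge=False)` skips it; only the assigner and the merge flag differ between the two.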

Depending on how we want to approach it, this could be handled now or as a follow-up item (with major refactoring).

HeartSaVioR · 2018-09-20T19:28:30Z

@arunmahadevan
One thing we may want to be aware of is that the requirement is pretty different from other streaming frameworks like Flink, which normally set a long checkpoint interval and do a full snapshot (though Flink supports incremental checkpoints, which reduce the amount of data stored).

Here in Spark, we expect a smaller batch interval, and Spark deals with that requirement by storing the "delta" of state changes. This behavior raises concerns about the strategy for how we store and how we remove state.

Let's say we have 3 rows in a group in the batch result and there are also 3 rows in the same group in state, and we want to replace the state with the new batch result. With a full snapshot, removing 3 rows first and then putting 3 rows may not matter much, but with the delta approach we should compare them side by side and make fewer changes to state. (We also avoid having List[V] as the state value, and have two different states because of that. If you change any element of the List, Spark's state store treats it as a change to the whole List[V] and stores all of its elements in the delta.)
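A small sketch of the side-by-side comparison, assuming (purely for illustration) that the rows of a group are stored under positional indices. The `state_delta` function and the `("put", ...)` / `("remove", ...)` operation tuples are hypothetical and do not reflect the patch's actual state format.

```python
def state_delta(old_rows, new_rows):
    """Compare stored rows with new rows index by index; return only the changes."""
    ops = []
    for i in range(max(len(old_rows), len(new_rows))):
        if i >= len(new_rows):
            ops.append(("remove", i))             # slot no longer needed
        elif i >= len(old_rows) or old_rows[i] != new_rows[i]:
            ops.append(("put", i, new_rows[i]))   # new or changed slot
        # identical slots produce no operation, so nothing lands in the delta
    return ops
```

If only the second of three rows changed, this yields a single put, whereas removing and re-putting everything would write six operations to the delta.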

The difference is not trivial for session windows, because arbitrary changes are required: for example, two different sessions in state can be merged later when late events arrive between the two sessions, and then we ideally should overwrite one and remove the others. New sessions can be created alongside existing sessions, and we want to overwrite a session if the new output session originates from old state, and append a session if not. For other windows, it is just a "put" because there's no group and it is safe to simply put (overwrite if any, and without eviction there's no need to remove). These differing requirements of time windows and session windows are hard to combine into one.
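The overwrite / remove / append cases can be illustrated with a toy function that applies one event to the sessions stored for a single key. The function name `apply_event`, the operation tuples, and the strict-overlap rule are assumptions for this sketch, and aggregates are left out to keep it short.

```python
def apply_event(sessions, ts, gap):
    """Apply one event to the sorted, non-overlapping (start, end) sessions of a key.

    Returns (new_sessions, ops), where ops lists the state changes needed.
    """
    new = (ts, ts + gap)
    # Stored sessions that overlap the event's provisional session.
    hit = [s for s in sessions if s[0] < new[1] and new[0] < s[1]]
    if not hit:
        # Nothing in state is touched: the session did not originate from old state.
        return sorted(sessions + [new]), [("append", new)]
    merged = (min(new[0], hit[0][0]), max(new[1], hit[-1][1]))
    # Overwrite the first affected session in place; the rest are now redundant.
    ops = [("overwrite", hit[0], merged)] + [("remove", s) for s in hit[1:]]
    rest = [s for s in sessions if s not in hit]
    return sorted(rest + [merged]), ops
```

A late event at time 8 with a gap of 10 bridges stored sessions (0, 10) and (15, 25): the first is overwritten with (0, 25) and the second is removed.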

That's how I realized the difficulty of the state part for session windows, and that's why I feel I need to change the streaming part. (I'm not sure whether I'm worrying too much about optimizing state changes, but minimizing the delta was the core concept of #21733 and it really affects performance.) For the batch part, the current patch is doing OK.

Btw, we can regard AssignWindows as TimeWindowing and SessionWindowing, since we logically assign rows to individual windows. So unless we would like to support custom windows such as a dynamic-gap session window, I think we can address it later whenever needed.

HeartSaVioR · 2018-09-20T19:40:53Z

If we are fine with ignoring the optimal delta of state, or OK with addressing it in a follow-up issue (it should be addressed in the same release version to avoid having state V1, V2, etc.), I think the only TODOs are writing javadoc and deduplicating some code.

HeartSaVioR · 2018-09-21T02:18:54Z

Following the discussion on SPARK-10816, I'm holding off on further improvement and plan to continue the discussion in the JIRA issue. Anyone interested in this patch can still review it or try it out and share feedback.

SparkQA · 2018-10-02T07:05:01Z

Test build #96838 has finished for PR 22482 at commit 9a60cf3.

This patch fails due to an unknown error code, -9.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2018-10-08T11:35:22Z

Test build #97107 has finished for PR 22482 at commit 78fdd99.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2018-10-09T01:57:44Z

Test build #97135 has finished for PR 22482 at commit 94e9859.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

We can enable it, but there are lots of approaches to aggregation on the batch side:

* AggUtils.planAggregateWithoutDistinct
* AggUtils.planAggregateWithOneDistinct
* RewriteDistinctAggregates
* AggregateInPandasExec

So unless we are sure which things to support, just block them for now.

SparkQA · 2018-10-31T07:05:01Z

Test build #98289 has finished for PR 22482 at commit c03c946.

This patch fails due to an unknown error code, -9.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2018-11-01T07:14:32Z

Test build #98347 has finished for PR 22482 at commit 5de4075.

This patch fails Scala style tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2018-11-01T11:33:22Z

Test build #98348 has finished for PR 22482 at commit d1536b4.

This patch fails SparkR unit tests.
This patch merges cleanly.
This patch adds no public classes.

HeartSaVioR · 2018-11-02T01:43:50Z

retest this, please

SparkQA · 2018-11-02T04:15:22Z

Test build #98382 has finished for PR 22482 at commit 5a76383.

This patch fails Scala style tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2018-11-02T05:06:58Z

Test build #98380 has finished for PR 22482 at commit ee67bca.

This patch fails SparkR unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2018-11-02T05:11:09Z

Test build #98379 has finished for PR 22482 at commit ee67bca.

This patch fails SparkR unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2018-11-02T05:45:18Z

Test build #98383 has finished for PR 22482 at commit b6ccecd.

This patch fails to build.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2018-11-02T07:05:02Z

Test build #98385 has finished for PR 22482 at commit 75c7611.

This patch fails due to an unknown error code, -9.
This patch merges cleanly.
This patch adds no public classes.

HeartSaVioR · 2018-11-02T07:09:21Z

retest this, please

SparkQA · 2018-11-02T10:49:09Z

Test build #98387 has finished for PR 22482 at commit 75c7611.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

AmplabJenkins · 2019-09-16T18:19:06Z

Can one of the admins verify this patch?

github-actions · 2020-01-06T00:07:30Z

We're closing this PR because it hasn't been updated in a while.
This isn't a judgement on the merit of the PR in any way. It's just
a way of keeping the PR queue manageable.

If you'd like to revive this PR, please reopen it!

HeartSaVioR force-pushed the SPARK-10816 branch from 7d8371c to ad0b746 on September 20, 2018 05:58

HeartSaVioR force-pushed the SPARK-10816 branch from ad0b746 to 9a60cf3 on October 2, 2018 05:01

HeartSaVioR added 11 commits October 10, 2018 10:43

WIP nothing worked, just recording the progress

eabb65b

WIP not working yet... lots of implementations needed

c3076d2

WIP Finished implementing UpdatingSessionIterator

9d59c7a

WIP add verification on precondition "rows in iterator are sorted by …

b38f2b9

…key"

Rename SymmetricHashJoinStateManager to MultiValuesStateManager

668c1f5

* This will be also used from session window state as well

Move package of UpdatingSessionIterator

9f63a3c

WIP add MergingSortWithMultiValuesStateIterator, now integrating with…

5d17ac8

… stateful operators (WIP...)

WIP the first version of working one! Still have lots of TODOs and FI…

ec33265

…XMEs to go

Add more explanations

8b210d5

More works: majorly split out updating session to individual physical…

7b57fe5

… node * we will leverage such node for batch case if we want

HeartSaVioR added 7 commits October 25, 2018 18:33

WIP it works but a bit suboptimal

7bb0060

WIP optimized!

35c9712

WIP remove requirement on sort, add UT to test linked list state with…

ede078a

… numbers of randomized operations

WIP add code to print out information when task crashes with dangling…

f8e8ff6

… pointer

WIP fixed the issue with benchmark run

b05abc7

WIP optimize a bit on storing new sessions

17570f2

WIP Fixed critical bug which tasks don't respect preference on state …

958de31

…store

WIP Fix critical perf. issue: remove codegen on generating session ro…

ee67bca

…w for group key Also modify CodeGenerator to print out debug information when code generation takes too long

HeartSaVioR force-pushed the SPARK-10816 branch from 5de4075 to d1536b4 on November 1, 2018 07:53

HeartSaVioR force-pushed the SPARK-10816 branch from d1536b4 to ee67bca on November 2, 2018 01:42

HeartSaVioR added 2 commits November 2, 2018 14:12

WIP Rolling back unnecessary changes

8a0331e

WIP Apply removing codegen to UpdatingSessionIterator as well

b6ccecd

HeartSaVioR force-pushed the SPARK-10816 branch from 5a76383 to b6ccecd on November 2, 2018 05:37

WIP remove state version for now: it will be reintroduced when actual…

75c7611

… review is in progress

dongjoon-hyun added the STRUCTURED STREAMING label Jun 14, 2019

github-actions bot added the Stale label Jan 6, 2020

github-actions bot closed this Jan 7, 2020
