SHS-NG M4.3: Port StorageTab to the new backend. #33

vanzin · 2017-05-30T23:07:22Z

This required adding information about StreamBlockId to the store,
which is not available yet via the API. So an internal type was added
until there's a need to expose that information in the API.

The UI only lists RDDs that have cached partitions, and that information
wasn't being correctly captured in the listener, so that's also fixed,
along with some minor (internal) API adjustments so that the UI can
get the correct data.

This required adding information about StreamBlockId to the store, which is not available yet via the API. So an internal type was added until there's a need to expose that information in the API. The UI only lists RDDs that have cached partitions, and that information wasn't being correctly captured in the listener, so that's also fixed, along with some minor (internal) API adjustments so that the UI can get the correct data. Because of the way partitions are cached, some optimizations w.r.t. how often the data is flushed to the store could not be applied to this code; because of that, some different ways to make the code more performant were added to the data structures tracking RDD blocks, with the goal of avoiding expensive copies when lots of blocks are being updated.

## What changes were proposed in this pull request? This PR upgrade Janino version to 3.0.8. [Janino 3.0.8](https://janino-compiler.github.io/janino/changelog.html) includes an important fix to reduce the number of constant pool entries by using 'sipush' java bytecode. * SIPUSH bytecode is not used for short integer constant [#33](janino-compiler/janino#33). Please see detail in [this discussion thread](apache#19518 (comment)). ## How was this patch tested? Existing tests Author: Kazuaki Ishizaki <[email protected]> Closes apache#19890 from kiszk/SPARK-22688.

This PR upgrade Janino version to 3.0.8. [Janino 3.0.8](https://janino-compiler.github.io/janino/changelog.html) includes an important fix to reduce the number of constant pool entries by using 'sipush' java bytecode. * SIPUSH bytecode is not used for short integer constant [#33](janino-compiler/janino#33). Please see detail in [this discussion thread](apache#19518 (comment)). Existing tests Author: Kazuaki Ishizaki <[email protected]> Closes apache#19890 from kiszk/SPARK-22688. (cherry picked from commit 8ae004b) Signed-off-by: Sean Owen <[email protected]>

…nput of UDF as double in the failed test in udf-aggregate_part1.sql ## What changes were proposed in this pull request? It still can be flaky on certain environments due to float limitation described at apache#25110 . See apache#25110 (comment) - https://amplab.cs.berkeley.edu/jenkins/view/Spark%20QA%20Test%20(Dashboard)/job/spark-master-test-maven-hadoop-2.7/6584/testReport/org.apache.spark.sql/SQLQueryTestSuite/udf_pgSQL_udf_aggregates_part1_sql___Regular_Python_UDF/ ``` Expected "700000000000[6] 1", but got "700000000000[5] 1" Result did not match for query #33
SELECT CAST(avg(udf(CAST(x AS DOUBLE))) AS long), CAST(udf(var_pop(CAST(x AS DOUBLE))) AS decimal(10,3))
FROM (VALUES (7000000000005), (7000000000007)) v(x) ``` Here;s what's going on: apache#25110 (comment) ``` scala> Seq("7000000000004.999", "7000000000006.999").toDF().selectExpr("CAST(avg(value) AS long)").show() +--------------------------+ |CAST(avg(value) AS BIGINT)| +--------------------------+ | 7000000000005| +--------------------------+ ``` Therefore, this PR just avoid to cast in the specific test. This is a temp fix. We need more robust way to avoid such cases. ## How was this patch tested? It passes with Maven in my local before/after this PR. I believe the problem seems similarly the Python or OS installed in the machine. I should test this against PR builder with `test-maven` for sure.. Closes apache#25128 from HyukjinKwon/SPARK-28270-2. Authored-by: HyukjinKwon <[email protected]> Signed-off-by: HyukjinKwon <[email protected]>

vanzin force-pushed the shs-ng/M4.3 branch from c5a17fd to 940e95f Compare June 1, 2017 01:53

vanzin force-pushed the shs-ng/M4.2 branch from d66024c to 63a6ceb Compare June 1, 2017 01:53

vanzin force-pushed the shs-ng/M4.3 branch from 940e95f to 13d72a4 Compare June 1, 2017 20:15

vanzin force-pushed the shs-ng/M4.2 branch 2 times, most recently from 53a488a to 71f1794 Compare June 2, 2017 16:39

vanzin force-pushed the shs-ng/M4.3 branch from 13d72a4 to 7a670cf Compare June 2, 2017 16:39

vanzin force-pushed the shs-ng/M4.2 branch from 71f1794 to a6b742b Compare June 5, 2017 17:46

vanzin force-pushed the shs-ng/M4.3 branch from 7a670cf to 9cf8ea9 Compare June 5, 2017 17:46

vanzin force-pushed the shs-ng/M4.2 branch from a6b742b to 65db877 Compare June 6, 2017 17:21

vanzin force-pushed the shs-ng/M4.3 branch from 9cf8ea9 to 3a6587f Compare June 6, 2017 17:21

vanzin force-pushed the shs-ng/M4.2 branch from 65db877 to b2d3e52 Compare June 6, 2017 20:53

vanzin force-pushed the shs-ng/M4.3 branch from 3a6587f to b8de47a Compare June 6, 2017 20:53

vanzin force-pushed the shs-ng/M4.2 branch from b2d3e52 to 3b0ed8f Compare June 6, 2017 20:57

vanzin force-pushed the shs-ng/M4.3 branch from b8de47a to 75f391b Compare June 6, 2017 20:58

vanzin force-pushed the shs-ng/M4.2 branch from 3b0ed8f to 5ab636f Compare June 7, 2017 18:00

vanzin force-pushed the shs-ng/M4.3 branch 2 times, most recently from b9ca562 to 67724a5 Compare June 9, 2017 22:30

vanzin force-pushed the shs-ng/M4.2 branch from 5ab636f to 1f58877 Compare June 12, 2017 21:48

vanzin force-pushed the shs-ng/M4.3 branch from 67724a5 to 2f27e3d Compare June 12, 2017 21:48

vanzin closed this Aug 10, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

SHS-NG M4.3: Port StorageTab to the new backend. #33

SHS-NG M4.3: Port StorageTab to the new backend. #33

Uh oh!

vanzin commented May 30, 2017

Uh oh!

Uh oh!

SHS-NG M4.3: Port StorageTab to the new backend. #33

SHS-NG M4.3: Port StorageTab to the new backend. #33

Uh oh!

Conversation

vanzin commented May 30, 2017

Uh oh!

Uh oh!