Skip to content

[SPARK-28704][SQL][TEST] Add back Skiped HiveExternalCatalogVersionsSuite in HiveSparkSubmitSuite at JDK9+ #30451

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 9 commits into from

Conversation

AngersZhuuuu
Copy link
Contributor

What changes were proposed in this pull request?

We skip test HiveExternalCatalogVersionsSuite when testing with JAVA_9 or later because our previous version does not support JAVA_9 or later. We now add it back since we have a version supports JAVA_9 or later.

Why are the changes needed?

To recover test coverage.

Does this PR introduce any user-facing change?

No

How was this patch tested?

Check CI logs.

@github-actions github-actions bot added the SQL label Nov 21, 2020
@AngersZhuuuu
Copy link
Contributor Author

FYI @dongjoon-hyun

@SparkQA
Copy link

SparkQA commented Nov 21, 2020

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36065/

@SparkQA
Copy link

SparkQA commented Nov 21, 2020

Kubernetes integration test status failure
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36065/

@SparkQA
Copy link

SparkQA commented Nov 21, 2020

Test build #131459 has finished for PR 30451 at commit 9400d9a.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
  • case class GetShufflePushMergerLocations(numMergersNeeded: Int, hostsToFilter: Set[String])
  • case class RemoveShufflePushMergerLocation(host: String) extends ToBlockManagerMaster
  • abstract class LikeAllBase extends UnaryExpression with ImplicitCastInputTypes with NullIntolerant
  • case class LikeAll(child: Expression, patterns: Seq[UTF8String]) extends LikeAllBase
  • case class NotLikeAll(child: Expression, patterns: Seq[UTF8String]) extends LikeAllBase
  • case class ParseUrl(children: Seq[Expression], failOnError: Boolean = SQLConf.get.ansiEnabled)

@AngersZhuuuu
Copy link
Contributor Author

AngersZhuuuu commented Nov 21, 2020

I found that after this change , it download zookeeper test-jar

downloading https://repo1.maven.org/maven2/org/apache/zookeeper/zookeeper/3.4.6/zookeeper-3.4.6-tests.jar ...
2020-11-20 21:45:20.389 - stderr> 	[SUCCESSFUL ] org.apache.zookeeper#zookeeper;3.4.6!zookeeper.jar(test-jar) (11ms)

But. right case is

downloading https://repo1.maven.org/maven2/org/apache/zookeeper/zookeeper/3.4.3/zookeeper-3.4.3.jar ...
2020-11-19 20:25:25.892 - stderr> 	[SUCCESSFUL ] org.apache.zookeeper#zookeeper;3.4.3!zookeeper.jar (20ms)

See the logical of dowloadVersion

  def resolveDependencyPaths(
      artifacts: Array[AnyRef],
      cacheDirectory: File): String = {
    artifacts.map { ai =>
      val artifactInfo = ai.asInstanceOf[Artifact]
      val artifact = artifactInfo.getModuleRevisionId
      val testSuffix = if (artifactInfo.getType == "test-jar") "-tests" else ""
      cacheDirectory.getAbsolutePath + File.separator +
        s"${artifact.getOrganisation}_${artifact.getName}-${artifact.getRevision}${testSuffix}.jar"
    }.mkString(",")
  }

Seems some place artifact info is wrong?

@SparkQA
Copy link

SparkQA commented Nov 21, 2020

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36070/

@SparkQA
Copy link

SparkQA commented Nov 21, 2020

Test build #131465 has finished for PR 30451 at commit 053974a.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Nov 21, 2020

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36071/

@SparkQA
Copy link

SparkQA commented Nov 21, 2020

Kubernetes integration test status success
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36070/

@SparkQA
Copy link

SparkQA commented Nov 21, 2020

Kubernetes integration test status success
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36071/

@SparkQA
Copy link

SparkQA commented Nov 21, 2020

Test build #131464 has finished for PR 30451 at commit 67222ca.

  • This patch fails SparkR unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Nov 21, 2020

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36076/

@SparkQA
Copy link

SparkQA commented Nov 21, 2020

Kubernetes integration test status failure
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36076/

@dongjoon-hyun
Copy link
Member

Thank you for working on this, @AngersZhuuuu !

@@ -103,6 +103,7 @@ package object client {
// Since HIVE-14496, Hive materialized view need calcite-core.
// For spark, only VersionsSuite currently creates a hive materialized view for testing.
case object v2_3 extends HiveVersion("2.3.7",
extraDeps = Seq("org.apache.zookeeper:zookeeper:3.4.6"),
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Although we have this, the GitHub Action is stilling failing. Maybe, we have other issues?

Copy link
Contributor Author

@AngersZhuuuu AngersZhuuuu Nov 21, 2020

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Although we have this, the GitHub Action is stilling failing. Maybe, we have other issues?

Yes, with zookeeper dependency but ivy still. download zookeeper with it's test jars, after detail check I can't find code issue related to this problem. so strange.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Although we have this, the GitHub Action is stilling failing. Maybe, we have other issues?

Since default hive version is 2.3.7 and 2.3.7 can pass jdk9+, so. I changed hive version. in HiveExternalCatalogVersionSuite. It's reasonable.

So confused, still working to find why it download zookeeper(test-jar).

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In maven repo I noticed that only zokeeper-3.4.x have pom, but it doesn't seem like this is the problem
image

Copy link
Member

@dongjoon-hyun dongjoon-hyun Nov 21, 2020

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I agree with you that it's weird. Maybe, could you try this Spark Shell? If we starts without local caches (Maven/Ivy), I guess Spark Shell will fail with the same reason.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Then, we should fix it first as an independent JIRA issue.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I tried in feature #29966 to download zookeeper-3.4.6(after clean cache)

test("SPARK-33084: Add jar support ivy url") {
    sc = new SparkContext(new SparkConf().setAppName("test").setMaster("local-cluster[3, 1, 1024]"))
    sc.addJar("ivy://org.apache.zookeeper:zookeeper:3.4.6")
    println("+===================+")
    sc.listJars().foreach(println)
    assert(sc.listJars().find(_.contains("zookeeper-3.4.6")).nonEmpty)
  }

It really download right jar.
But with UT

 test("SPARK-33084: Add jar support ivy url") {
    sc = new SparkContext(new SparkConf().setAppName("test")
      .setMaster("local-cluster[3, 1, 1024]"))
    sc.addJar("ivy://org.apache.hive:hive-exec:2.3.7" +
      "?exclude=org.pentaho:pentaho-aggdesigner-algorithm&transitive=true")
    println("+===================+")
    sc.listJars().foreach(println)
  }

It download both zookeeper.jar and zookeeper-test.jar.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@dongjoon-hyun
This error happened when HiveExternalCatalogVersionsSuite.prepare(), So any change in current code won't work since it use downloaded compiled spark.
So change in current code won't change the result.

@SparkQA
Copy link

SparkQA commented Nov 21, 2020

Test build #131470 has finished for PR 30451 at commit 668db8b.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Nov 21, 2020

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36082/

@SparkQA
Copy link

SparkQA commented Nov 21, 2020

Kubernetes integration test status failure
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36082/

@SparkQA
Copy link

SparkQA commented Nov 21, 2020

Test build #131476 has finished for PR 30451 at commit b853f5e.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Nov 22, 2020

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36091/

@github-actions github-actions bot added the INFRA label Nov 22, 2020
@AngersZhuuuu
Copy link
Contributor Author

AngersZhuuuu commented Nov 22, 2020

@dongjoon-hyun
I tried download spark-3.0.1-hadoop-2.7 and start it with

./bin/spark-sql --conf spark.sql.hive.metastore.version=2.3.7 --conf spark.sql.hive.metastore.jars=maven

Will download zookeeper-3.4.6.jar and zookeeper-3.4.6-test.jar, and it works well.

After change hive version 1.2.1 when jdk < jdk9 and hive version 2.3.7 when jdk9+ can pass the GitAction.

@SparkQA
Copy link

SparkQA commented Nov 22, 2020

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36094/

@SparkQA
Copy link

SparkQA commented Nov 22, 2020

Kubernetes integration test status success
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36094/

@SparkQA
Copy link

SparkQA commented Nov 22, 2020

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36095/

@SparkQA
Copy link

SparkQA commented Nov 22, 2020

Kubernetes integration test status success
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36095/

@SparkQA
Copy link

SparkQA commented Nov 22, 2020

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36096/

@SparkQA
Copy link

SparkQA commented Nov 22, 2020

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36097/

@SparkQA
Copy link

SparkQA commented Nov 22, 2020

Test build #131490 has finished for PR 30451 at commit ac2afc0.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Nov 22, 2020

Kubernetes integration test status success
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36096/

@SparkQA
Copy link

SparkQA commented Nov 22, 2020

Test build #131493 has finished for PR 30451 at commit d21eb17.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AngersZhuuuu AngersZhuuuu changed the title [SPARK-28704][SQL][TEST] Add back Skiped HiveExternalCatalogVersionsSuite in HiveSparkSubmitSuite at JDK9+ [SPARK-28704][SQL][TEST][test-java11] Add back Skiped HiveExternalCatalogVersionsSuite in HiveSparkSubmitSuite at JDK9+ Nov 22, 2020
@AngersZhuuuu
Copy link
Contributor Author

retest this please

@SparkQA
Copy link

SparkQA commented Nov 22, 2020

Kubernetes integration test status failure
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36097/

@SparkQA
Copy link

SparkQA commented Nov 22, 2020

Test build #131491 has finished for PR 30451 at commit 4390fca.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Nov 22, 2020

Test build #131492 has finished for PR 30451 at commit d6f957c.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Nov 22, 2020

Test build #131494 has finished for PR 30451 at commit 568150a.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Nov 22, 2020

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36099/

@SparkQA
Copy link

SparkQA commented Nov 22, 2020

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36098/

@SparkQA
Copy link

SparkQA commented Nov 22, 2020

Kubernetes integration test status success
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36098/

@SparkQA
Copy link

SparkQA commented Nov 22, 2020

Kubernetes integration test status success
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36099/

@SparkQA
Copy link

SparkQA commented Nov 22, 2020

Test build #131495 has finished for PR 30451 at commit 568150a.

  • This patch fails due to an unknown error code, -9.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AngersZhuuuu
Copy link
Contributor Author

retest this please

@SparkQA
Copy link

SparkQA commented Nov 22, 2020

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36102/

@SparkQA
Copy link

SparkQA commented Nov 22, 2020

Kubernetes integration test status success
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/36102/

@SparkQA
Copy link

SparkQA commented Nov 22, 2020

Test build #131498 has finished for PR 30451 at commit 568150a.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@dongjoon-hyun dongjoon-hyun changed the title [SPARK-28704][SQL][TEST][test-java11] Add back Skiped HiveExternalCatalogVersionsSuite in HiveSparkSubmitSuite at JDK9+ [SPARK-28704][SQL][TEST] Add back Skiped HiveExternalCatalogVersionsSuite in HiveSparkSubmitSuite at JDK9+ Nov 22, 2020
@dongjoon-hyun
Copy link
Member

Thank you so much, @AngersZhuuuu . Merged to master.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants