[SPARK-6888][SQL] Export driver quirks #5498


Closed
wants to merge 4 commits

Conversation

rtreffer
Contributor

Make it possible to (temporarily) override the driver quirks. This
can be used to overcome problems with specific schemas or to
add new JDBC driver support on the fly.

A very simple implementation that dumps the type loading can be done like this (spark-shell):

```scala
class DumpQuirk extends org.apache.spark.sql.jdbc.DriverQuirks {
  def canHandle(url: String): Boolean = true
  def getCatalystType(sqlType: Int, typeName: String, size: Int, md: org.apache.spark.sql.types.MetadataBuilder): org.apache.spark.sql.types.DataType = {
    println("" + (sqlType, typeName, size, md))
    null
  }
  def getJDBCType(dt: org.apache.spark.sql.types.DataType): (String, Option[Int]) = (null, None)
}
org.apache.spark.sql.jdbc.DriverQuirks.registerQuirks(new DumpQuirk())
```

Note that this pull request is against 1.3; I could not create a distribution from the current master.

@srowen
Member

srowen commented Apr 13, 2015

@rtreffer
Contributor Author

Added a ticket: https://issues.apache.org/jira/browse/SPARK-6888
Will add that to the commit after some sleep .zZzZzZ

Make it possible to (temporary) overwrite the driver quirks. This
can be used to overcome problems with specific schemas or to
add new jdbc driver support on the fly.
@rtreffer rtreffer force-pushed the export-driver-quirks branch from dca9372 to 9ca66d9 Compare April 14, 2015 09:58
@rtreffer rtreffer changed the title Export driver quirks [SPARK-6888][SQL] Export driver quirks Apr 14, 2015
@liancheng
Contributor

ok to test.


```scala
private var quirks = List[DriverQuirks]()

def registerQuirks(quirk: DriverQuirks) {
```
Contributor


Please add return type explicitly for all public methods.
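Concretely, the fix for the `registerQuirks` method above might look like this (a sketch only; the prepend ordering is an assumption, not necessarily what the patch does):

```scala
// Sketch: explicit Unit return type on the public registration method.
// New quirks are prepended so they are consulted before built-in ones
// (an assumed design choice, not confirmed by this PR).
def registerQuirks(quirk: DriverQuirks): Unit = {
  quirks = quirk :: quirks
}
```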

@rtreffer
Contributor Author

@liancheng thank you, I will update the patch.
Just one question: should I squash/amend the fixes, or should I add a second commit?

```scala
  } else {
    r.getCatalystType(sqlType, typeName, size, md)
  }
)
```
Contributor


How about this:

```scala
quirks.map(_.getCatalystType(sqlType, typeName, size, md)).collectFirst {
  case dataType if dataType != null => dataType
}.orNull
```
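The suggested pattern can be tried in isolation; here is a minimal sketch with plain functions standing in for quirks (all names below are hypothetical stand-ins, not Spark API):

```scala
// Each stand-in "quirk" returns null when it cannot map the JDBC type,
// mirroring the DriverQuirks contract.
val quirks: List[Int => String] = List(
  sqlType => if (sqlType == java.sql.Types.VARCHAR) "StringType" else null,
  sqlType => if (sqlType == java.sql.Types.INTEGER) "IntegerType" else null
)

// Returns the first non-null mapping, or null if no quirk matches.
def resolve(sqlType: Int): String =
  quirks.map(_(sqlType)).collectFirst {
    case dataType if dataType != null => dataType
  }.orNull

println(resolve(java.sql.Types.INTEGER)) // IntegerType
println(resolve(java.sql.Types.BLOB))    // null
```

One design note: `.map` evaluates every quirk eagerly; `quirks.view.map(...)` would stop at the first non-null hit, which may matter if individual lookups ever become expensive.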

@liancheng
Contributor

You may just add new commits to this PR. Also, would you please add tests for this feature?

@SparkQA

SparkQA commented Apr 15, 2015

Test build #30332 has finished for PR 5498 at commit 9ca66d9.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@rtreffer
Contributor Author

I still have to write tests for AggregatedQuirks.

@marmbrus
Contributor

If we are going to make this a public API we should consider a clearer name, perhaps `JDBCTypeMapping`? You will also need to reopen the PR against master, as we don't want to add new APIs in a maintenance branch.

```diff
@@ -39,33 +39,68 @@ import java.sql.Types
  * if `getJDBCType` returns `(null, None)`, the default type handling is used
  * for the given Catalyst type.
  */
-private[sql] abstract class DriverQuirks {
+abstract class DriverQuirks {
   def canHandle(url : String): Boolean
```
Contributor


Add Scaladoc describing the contract for each of these methods.
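A sketch of what such Scaladoc could look like (the wording is illustrative only, not the final API documentation; `MetadataBuilder` and `DataType` are the `org.apache.spark.sql.types` classes already used above):

```scala
abstract class DriverQuirks {
  /**
   * Returns true if this set of quirks can handle the given JDBC URL,
   * e.g. a MySQL implementation might check url.startsWith("jdbc:mysql").
   */
  def canHandle(url: String): Boolean

  /**
   * Maps a JDBC type to a Catalyst type, or returns null if the default
   * mapping should be used for this (sqlType, typeName, size) combination.
   */
  def getCatalystType(
      sqlType: Int, typeName: String, size: Int, md: MetadataBuilder): DataType

  /**
   * Maps a Catalyst type to a database-specific column type name and JDBC
   * type, or returns (null, None) to fall back to the default handling.
   */
  def getJDBCType(dt: DataType): (String, Option[Int])
}
```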

@SparkQA

SparkQA commented Apr 15, 2015

Test build #30375 has finished for PR 5498 at commit 7f23484.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@rtreffer
Contributor Author

@marmbrus thank you, I'll fix those issues and open a new one when done.

Regarding naming/api:

It is quite common to have one class per SQL/JDBC dialect, often named exactly that way (e.g. MySQLDialect in Hibernate). I've found quite a few projects that use the same naming (via GitHub search).
In that case I'd like to match those names and add a default implementation per method (returning the neutral element).

On the other hand, it currently only does type mapping, so JDBCTypeMapping would be a very valid name too. It would restrict the use case more (which can be good or bad).

I guess you know better what would suit Spark :-)

@marmbrus
Contributor

Dialect seems reasonable to me.

@marmbrus
Contributor

We will also want to mark all of these @DeveloperApi

@SparkQA

SparkQA commented Apr 17, 2015

Test build #30463 has finished for PR 5498 at commit 22d65ca.

  • This patch fails Scala style tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@rtreffer
Contributor Author

Replaced by #5555
