Skip to content

[SPARK-7738] [SQL] [PySpark] add reader and writer API in Python #6238

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 6 commits into from

Conversation

davies
Copy link
Contributor

@davies davies commented May 18, 2015

cc @rxin, please take a quick look, I'm working on tests.

@AmplabJenkins
Copy link

Merged build triggered.

@AmplabJenkins
Copy link

Merged build started.

@SparkQA
Copy link

SparkQA commented May 18, 2015

Test build #33011 has started for PR 6238 at commit bcc6668.

@SparkQA
Copy link

SparkQA commented May 18, 2015

Test build #33011 has finished for PR 6238 at commit bcc6668.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • class DataFrameReader(object):
    • class DataFrameWriter(object):

@AmplabJenkins
Copy link

Merged build finished. Test FAILed.

@AmplabJenkins
Copy link

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/33011/
Test FAILed.

@andrewor14
Copy link
Contributor

retest this please

@AmplabJenkins
Copy link

Merged build triggered.

@AmplabJenkins
Copy link

Merged build started.

@SparkQA
Copy link

SparkQA commented May 19, 2015

Test build #33035 has started for PR 6238 at commit bcc6668.

@@ -50,8 +50,15 @@ private[spark] object PythonUtils {
/**
* Convert list of T into seq of T (for calling API with varargs)
*/
def toSeq[T](cols: JList[T]): Seq[T] = {
cols.toList.toSeq
def toSeq[T](vs: JList[T]): Seq[T] = {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

what does vs mean?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

value -> v, values => vs :)

@SparkQA
Copy link

SparkQA commented May 19, 2015

Test build #33035 has finished for PR 6238 at commit bcc6668.

  • This patch fails PySpark unit tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • class DataFrameReader(object):
    • class DataFrameWriter(object):

@AmplabJenkins
Copy link

Merged build finished. Test FAILed.

@AmplabJenkins
Copy link

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/33035/
Test FAILed.

@AmplabJenkins
Copy link

Merged build triggered.

@AmplabJenkins
Copy link

Merged build started.

@SparkQA
Copy link

SparkQA commented May 19, 2015

Test build #33054 has started for PR 6238 at commit f0c5a04.

@AmplabJenkins
Copy link

Merged build triggered.

@AmplabJenkins
Copy link

Merged build started.

@SparkQA
Copy link

SparkQA commented May 19, 2015

Test build #33055 has started for PR 6238 at commit 9cbf01b.

@SparkQA
Copy link

SparkQA commented May 19, 2015

Test build #33054 has finished for PR 6238 at commit f0c5a04.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • class DataFrameReader(object):
    • class DataFrameWriter(object):

@AmplabJenkins
Copy link

Merged build finished. Test PASSed.

@AmplabJenkins
Copy link

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/33054/
Test PASSed.

@SparkQA
Copy link

SparkQA commented May 19, 2015

Test build #33055 has finished for PR 6238 at commit 9cbf01b.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • class DataFrameReader(object):
    • class DataFrameWriter(object):

@AmplabJenkins
Copy link

Merged build finished. Test PASSed.

@AmplabJenkins
Copy link

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/33055/
Test PASSed.

else:
jwrite.save(path)

def saveAsTable(self, name, format=None, mode="error", **options):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

are there any test cases covering this?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's covered by df.saveAsTable(), had updated the tests to use new API.

@davies davies changed the title [WIP] [SQL] [PySpark] add reader amd writer API in Python [SPARK-7338] [SQL] [PySpark] add reader amd writer API in Python May 19, 2015
@davies
Copy link
Contributor Author

davies commented May 19, 2015

@rxin @yhai How do we test JDBC?

@jkbradley
Copy link
Member

@davies I think this has the wrong JIRA number in the title.

@davies davies changed the title [SPARK-7338] [SQL] [PySpark] add reader amd writer API in Python [SPARK-7738] [SQL] [PySpark] add reader amd writer API in Python May 19, 2015
@davies
Copy link
Contributor Author

davies commented May 19, 2015

@jkbradley, thanks, corrected.

@AmplabJenkins
Copy link

Merged build triggered.

@AmplabJenkins
Copy link

Merged build started.

@SparkQA
Copy link

SparkQA commented May 19, 2015

Test build #33091 has started for PR 6238 at commit c7200eb.

@SparkQA
Copy link

SparkQA commented May 19, 2015

Test build #33091 has finished for PR 6238 at commit c7200eb.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • class DataFrameReader(object):
    • class DataFrameWriter(object):

@AmplabJenkins
Copy link

Merged build finished. Test PASSed.

@AmplabJenkins
Copy link

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/33091/
Test PASSed.

@davies davies changed the title [SPARK-7738] [SQL] [PySpark] add reader amd writer API in Python [SPARK-7738] [SQL] [PySpark] add reader and writer API in Python May 19, 2015
asfgit pushed a commit that referenced this pull request May 19, 2015
cc rxin, please take a quick look, I'm working on tests.

Author: Davies Liu <[email protected]>

Closes #6238 from davies/readwrite and squashes the following commits:

c7200eb [Davies Liu] update tests
9cbf01b [Davies Liu] Merge branch 'master' of github.com:apache/spark into readwrite
f0c5a04 [Davies Liu] use sqlContext.read.load
5f68bc8 [Davies Liu] update tests
6437e9a [Davies Liu] Merge branch 'master' of github.com:apache/spark into readwrite
bcc6668 [Davies Liu] add reader amd writer API in Python

(cherry picked from commit 4de74d2)
Signed-off-by: Reynold Xin <[email protected]>
@asfgit asfgit closed this in 4de74d2 May 19, 2015
jeanlyn pushed a commit to jeanlyn/spark that referenced this pull request May 28, 2015
cc rxin, please take a quick look, I'm working on tests.

Author: Davies Liu <[email protected]>

Closes apache#6238 from davies/readwrite and squashes the following commits:

c7200eb [Davies Liu] update tests
9cbf01b [Davies Liu] Merge branch 'master' of github.com:apache/spark into readwrite
f0c5a04 [Davies Liu] use sqlContext.read.load
5f68bc8 [Davies Liu] update tests
6437e9a [Davies Liu] Merge branch 'master' of github.com:apache/spark into readwrite
bcc6668 [Davies Liu] add reader amd writer API in Python
jeanlyn pushed a commit to jeanlyn/spark that referenced this pull request Jun 12, 2015
cc rxin, please take a quick look, I'm working on tests.

Author: Davies Liu <[email protected]>

Closes apache#6238 from davies/readwrite and squashes the following commits:

c7200eb [Davies Liu] update tests
9cbf01b [Davies Liu] Merge branch 'master' of github.com:apache/spark into readwrite
f0c5a04 [Davies Liu] use sqlContext.read.load
5f68bc8 [Davies Liu] update tests
6437e9a [Davies Liu] Merge branch 'master' of github.com:apache/spark into readwrite
bcc6668 [Davies Liu] add reader amd writer API in Python
nemccarthy pushed a commit to nemccarthy/spark that referenced this pull request Jun 19, 2015
cc rxin, please take a quick look, I'm working on tests.

Author: Davies Liu <[email protected]>

Closes apache#6238 from davies/readwrite and squashes the following commits:

c7200eb [Davies Liu] update tests
9cbf01b [Davies Liu] Merge branch 'master' of github.com:apache/spark into readwrite
f0c5a04 [Davies Liu] use sqlContext.read.load
5f68bc8 [Davies Liu] update tests
6437e9a [Davies Liu] Merge branch 'master' of github.com:apache/spark into readwrite
bcc6668 [Davies Liu] add reader amd writer API in Python
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants