feature request: pandas connector #155

**Is your feature request related to a problem? Please describe.**

I'd like to be able to run a query against a Spanner database and download (possibly large-ish -- MBs to GBs) results to a pandas DataFrame. Specifically, I'd like to eventually use this as a component in an [ibis](https://ibis-project.org/) connector, but it'd also be useful for general data processing pipelines.

**Describe the solution you'd like**

It seems that [StreamedResultSet](https://googleapis.dev/python/spanner/latest/streamed-api.html#google.cloud.spanner_v1.streamed.StreamedResultSet) is the most natural place to put a `to_dataframe` method, similar to the [RowIterator.to_dataframe method in the BigQuery client library](https://googleapis.dev/python/bigquery/latest/generated/google.cloud.bigquery.table.RowIterator.html#google.cloud.bigquery.table.RowIterator.to_dataframe).
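To make the idea concrete, here is a rough sketch of what such a method might do, written as standalone helpers. It assumes the result set is iterable and exposes a `fields` attribute whose items have a `name`; `rows_and_columns`, `to_dataframe`, and `FakeResultSet` are hypothetical names used only for illustration, not part of the client library.

```python
from collections import namedtuple


def rows_and_columns(results):
    """Drain a result set and return (rows, column_names)."""
    # Consume the rows first: on a streamed result set the field metadata
    # is assumed to be available only once the stream has started.
    rows = list(results)
    columns = [field.name for field in results.fields]
    return rows, columns


def to_dataframe(results):
    """Build a pandas DataFrame from a result set (hypothetical helper)."""
    import pandas  # imported lazily so pandas stays optional

    rows, columns = rows_and_columns(results)
    return pandas.DataFrame(rows, columns=columns)


# Stand-in for a streamed result set, for illustration only.
Field = namedtuple("Field", ["name"])


class FakeResultSet:
    def __init__(self, names, rows):
        self.fields = [Field(n) for n in names]
        self._rows = rows

    def __iter__(self):
        return iter(self._rows)
```

This version holds every row in memory before building the DataFrame, so results in the GB range would probably need a chunked or columnar approach instead.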

Since `pandas` needn't be required to use this client library, the import should be conditional, as in the BigQuery client:

https://github.com/googleapis/python-bigquery/blob/fb401bd94477323bba68cf252dd88166495daf54/google/cloud/bigquery/table.py#L29-L32

The dependency should also be listed in "extras":

https://github.com/googleapis/python-bigquery/blob/fb401bd94477323bba68cf252dd88166495daf54/setup.py#L50
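A minimal sketch of the optional-dependency pattern, under the assumption that a missing `pandas` should only fail when a DataFrame is actually requested. The `require_pandas` helper and the unpinned `extras` entry are hypothetical; the linked BigQuery code may differ in detail.

```python
try:
    import pandas
except ImportError:  # pandas is an optional dependency
    pandas = None

# Hypothetical "extras" entry for setup.py; a real one would likely pin versions.
extras = {"pandas": ["pandas"]}


def require_pandas():
    """Return the pandas module, or raise if it is not installed."""
    if pandas is None:
        raise ValueError(
            "The pandas library is not installed; "
            "install the 'pandas' extra to use to_dataframe."
        )
    return pandas
```

With this in place, importing the client library never fails because of `pandas`; only a call that needs a DataFrame would raise.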

**Describe alternatives you've considered**

It's possible this is simpler than I realize, in which case a code sample might be enough.

If there were a SQLAlchemy connector (a much bigger project than a read-only pandas DataFrame export), then pandas support would be basically free via `pandas.read_sql`.

**Additional context**

Related StackOverflow questions:

* https://stackoverflow.com/questions/63041922/how-do-i-run-a-query-in-cloud-spanner-and-download-the-results-to-a-pandas-dataf
* https://stackoverflow.com/questions/57421665/how-to-query-spanner-and-get-metadata-especially-columns-names
