[FEA] Support Heterogeneous Sampling in cuGraph-PyG #82

alexbarghi-nv · 2024-12-02T17:11:06Z

Allows sampling of heterogeneous graphs.

Removes unbuffered sampling from the PyG examples and completely disables it in DGL. A future PR will completely drop PyG support for unbuffered sampling, and a future cugraph PR will drop support for unbuffered sampling in the distributed sampler.

Merge after rapidsai/cugraph#4795

Closes rapidsai/cugraph#4402

copy-pr-bot · 2024-12-23T16:55:14Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

tingyu66 · 2025-01-14T23:12:35Z

python/cugraph-pyg/cugraph_pyg/tests/loader/test_neighbor_loader.py

+
+    loader = NeighborLoader(
+        (feature_store, graph_store),
+        num_neighbors=[0, 1, 0, 1],


Is the 0 fanouts here used to test edge cases?

It's used to test whether we can properly exclude an edge type.
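To illustrate the point, here is a minimal pure-Python sketch of per-edge-type fanouts. It is not cuGraph-PyG's implementation, and it does not model how `num_neighbors=[0, 1, 0, 1]` is mapped onto hops and edge types. The function name `sample_neighbors`, the toy graph, and the edge type names are all hypothetical. It only shows that a fanout of 0 leaves an edge type with no sampled edges.

```python
import random


def sample_neighbors(adjacency, seeds, fanout, rng):
    """Sample up to fanout[edge_type] neighbors per seed, for each edge type.

    adjacency: {edge_type: {src_node: [dst_node, ...]}}
    fanout:    {edge_type: max neighbors to sample per seed}
    """
    sampled = {}
    for edge_type, neighbors_of in adjacency.items():
        k = fanout.get(edge_type, 0)
        edges = []
        # A fanout of 0 skips the edge type entirely, so it contributes no edges.
        if k > 0:
            for seed in seeds:
                candidates = neighbors_of.get(seed, [])
                for dst in rng.sample(candidates, min(k, len(candidates))):
                    edges.append((seed, dst))
        sampled[edge_type] = edges
    return sampled


# Hypothetical toy graph with two edge types.
adjacency = {
    "follows": {0: [1, 2], 1: [2]},
    "likes": {0: [3], 1: [3, 4]},
}

result = sample_neighbors(
    adjacency,
    seeds=[0, 1],
    fanout={"follows": 0, "likes": 1},
    rng=random.Random(0),
)
```

With these inputs, `result["follows"]` is empty while `result["likes"]` holds one sampled edge per seed, which is the kind of exclusion the test is checking for.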

jameslamb

Put up some suggestions on the notebook testing.

jameslamb · 2025-01-15T15:39:59Z

ci/test_notebooks.sh

+  --matrix "cuda=${RAPIDS_CUDA_VERSION%.*};arch=$(arch);py=${RAPIDS_PY_VERSION}"  \
+  --prepend-channel "${CPP_CHANNEL}" \
+  --prepend-channel "${PYTHON_CHANNEL}" \
+| tee env.yaml


At this point in this script, CPP_CHANNEL and PYTHON_CHANNEL haven't yet been set. If you want the downloaded CI artifacts to be considered in the conda solve, you'll have to move this block from lower down up above this:

    rapids-logger "Downloading artifacts from previous jobs"
    CPP_CHANNEL=$(rapids-download-conda-from-s3 cpp)
    PYTHON_CHANNEL=$(rapids-download-conda-from-s3 python)

If you do that, it'd also be good to remove the rapids-mamba-retry install cugraph-dgl call and let cugraph-dgl come in through this solve instead, so there will be a single call to create the environment.

Here's an example: rapidsai/ucx-py#1101

jameslamb · 2025-01-15T15:43:41Z

dependencies.yaml

+      - depends_on_cudf
+      - depends_on_cugraph


Instead of this, I think here you want a depends_on_cugraph_dgl, so that a requirement like this gets added to the env.yaml:

- cugraph-dgl==25.2.*,>=0.0.0a0

Then cudf and cugraph will come through automatically as part of cugraph-dgl's required dependencies.

cugraph-gnn/conda/recipes/cugraph-dgl/meta.yaml

Line 27 in 87455cf

- cugraph ={{ minor_version }}

That's a better pattern for CI, because it allows us to catch packaging problems of the form "cugraph-dgl depends on cudf but doesn't explicitly declare it" or something like that.

cuspatial uses this "consolidated solves" approach; you could follow that project's example:

https://github.com/rapidsai/cuspatial/blob/dbdc75ddea8422c18441a63e9f1fc42230db301c/dependencies.yaml#L43-L52

https://github.com/rapidsai/cuspatial/blob/dbdc75ddea8422c18441a63e9f1fc42230db301c/ci/test_notebooks.sh#L8-L19

jameslamb · 2025-01-15T19:43:37Z

dependencies.yaml

    includes:
      - cuda_version
      - depends_on_pytorch
+      - depends_on_cugraph_dgl


This won't work as-is because this project's dependencies.yaml doesn't yet have a depends_on_cugraph_dgl.

Add this:

    depends_on_cugraph_dgl:
      common:
        - output_types: conda
          packages:
            - cugraph-dgl==25.2.*,>=0.0.0a0

Maybe here, after depends_on_cugraph:

https://github.com/alexbarghi-nv/cugraph-gnn/blob/9b19ee4b5407706dffc86ce971673133b33c63a4/dependencies.yaml#L543

It doesn't have to have as much stuff as depends_on_cugraph (for example), because we're only using it to reference conda packages.

Alternatively, you could avoid this depends_on_cugraph_dgl stuff (since this is the only reference) and instead add an item to the test_notebook: group (https://github.com/alexbarghi-nv/cugraph-gnn/blob/9b19ee4b5407706dffc86ce971673133b33c63a4/dependencies.yaml#L365C1-L372C16), so it'd look like this:

    test_notebook:
      common:
        - output_types: [conda, requirements]
          packages:
            - ipython
            - nbconvert
            - notebook>=0.5.0
            - ogb
        - output_types: [conda]
          packages:
            - cugraph-dgl==25.2.*,>=0.0.0a0

Should be fixed now. Thanks for the great explanation @jameslamb !

no prob, happy to help 😊

ci/test_notebooks.sh

Co-authored-by: James Lamb <[email protected]>

jameslamb

Seeing the notebook jobs pass! (build link)

The only failing job is cugraph-dgl wheels testing, but that looks like a network error that'd be resolved by just re-running the job.

pip._vendor.urllib3.exceptions.ReadTimeoutError: HTTPSConnectionPool(host='download.pytorch.org', port=443): Read timed out.

(build link)

I just restarted it. This should be good to merge once that passes.

Thanks for getting this fixed so quickly @alexbarghi-nv !

alexbarghi-nv · 2025-01-16T18:04:18Z

/merge

jakirkham · 2025-01-16T23:32:08Z

Thanks all! 🙏

alexbarghi-nv added 3 commits November 18, 2024 09:00

heterogeneous sampling

35af4b4

c

8da5c95

reformat

4587bd9

alexbarghi-nv changed the base branch from branch-24.12 to branch-25.02 December 2, 2024 17:11

alexbarghi-nv self-assigned this Dec 2, 2024

alexbarghi-nv added the breaking (Introduces a breaking change) and feature request (New feature or request) labels Dec 2, 2024

alexbarghi-nv added this to the 25.02 milestone Dec 2, 2024

alexbarghi-nv added 4 commits December 5, 2024 13:02

fix various bugs

0786495

add num sampled nodes

9eb3319

get hetero input ids working

ef57559

fix src/dst confusion

c7f0000

alexbarghi-nv added 2 commits December 31, 2024 11:07

Merge branch 'branch-25.02' into hetero-pyg

c2db63f

Merge branch 'branch-25.02' into hetero-pyg

36b2338

alexbarghi-nv marked this pull request as ready for review January 13, 2025 15:52

alexbarghi-nv requested a review from a team as a code owner January 13, 2025 15:52

alexbarghi-nv added 2 commits January 13, 2025 10:20

fix copyright

fae8b7e

remove unbuffered sampling from pyg examples and disable it in dgl

333ccb8

tingyu66 approved these changes Jan 14, 2025

alexbarghi-nv mentioned this pull request Jan 15, 2025

Switch to pynvml_utils.smi for PyNVML 12 rapidsai/cugraph#4863

Merged

update depedencies.yaml, notebook test script

52296a3

alexbarghi-nv requested review from a team as code owners January 15, 2025 15:35

alexbarghi-nv requested a review from AyodeAwe January 15, 2025 15:35

jameslamb reviewed Jan 15, 2025

fix dependencies, script based on feedback from James

9b19ee4

jameslamb requested review from jameslamb and removed request for AyodeAwe January 15, 2025 19:36

jameslamb reviewed Jan 15, 2025

alexbarghi-nv added 2 commits January 15, 2025 20:45

add cugraph_dgl dependencies

99e5e9f

move conda activation

4efa696

jameslamb reviewed Jan 16, 2025

ci/test_notebooks.sh

add DGL channel

fd19e13

Co-authored-by: James Lamb <[email protected]>

jakirkham mentioned this pull request Jan 16, 2025

Use GCC 13 in CUDA 12 conda builds. #108

Merged

jameslamb approved these changes Jan 16, 2025

rapids-bot bot merged commit a9ab8b4 into rapidsai:branch-25.02 Jan 16, 2025
82 checks passed

alexbarghi-nv deleted the hetero-pyg branch January 16, 2025 18:04

jameslamb mentioned this pull request Jan 16, 2025

remove dependency on cugraph-ops #99

Merged
