Huggingface Integration by vwxyzjn · Pull Request #292 · vwxyzjn/cleanrl

vwxyzjn · 2022-10-13T21:53:36Z

Description

This PR closes #110. https://huggingface.co/cleanrl/CartPole-v1-dqn-seed1 is an example model page.

Types of changes

Bug fix
New feature
New algorithm
Documentation

Checklist:

I've read the CONTRIBUTION guide (required).
I have ensured pre-commit run --all-files passes (required).
I have updated the documentation and previewed the changes via mkdocs serve.
I have updated the tests accordingly (if applicable).

If you are adding new algorithms or your change could result in performance difference, you may need to (re-)run tracked experiments. See #137 as an example PR.

vercel · 2022-10-13T21:53:39Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Updated
cleanrl	✅ Ready (Inspect)	Visit Preview	Jan 4, 2023 at 8:20PM (UTC)

vwxyzjn · 2022-10-13T22:02:13Z

The integration also makes it easier to just run models, such as

cleanrl/cleanrl_utils/evals/dqn_eval.py

Lines 43 to 57 in 4074eee

    
           from huggingface_hub import hf_hub_download 
        
           from cleanrl.dqn import QNetwork, make_env 
        
           model_path = hf_hub_download(repo_id="cleanrl/CartPole-v1-dqn-seed1", filename="q_network.pth") 
        
           evaluate( 
        
               model_path, 
        
               make_env, 
        
               "CartPole-v1", 
        
               eval_episodes=10, 
        
               run_name=f"eval", 
        
               QNetwork=QNetwork, 
        
               device="cpu", 
        
               epsilon=0.05, 
        
               capture_video=False, 
        
           )

vwxyzjn · 2022-10-14T15:54:10Z

CC @ThomasSimonini for review :) Thanks!

kinalmehta

LGTM.

cleanrl_utils/evals/dqn_eval.py

simoninithomas

First, I want to thank you for this integration and all the work behind 🤗. The result is fantastic, especially the model card with visible hyperparameters.

I gave some insights based on Omar and Lucain and my review on how we can improve the push_to_hub part. I'm happy to help with this.**

In addition to this, we can:

Having a downstream part ( load_from_hub ), I can help with that.
Generating a json/yaml file containing the hyperparameters for reproducibility?
Adding the library to our Hub list so that it creates a tag for people searching for cleanrl models.

cleanrl_utils/huggingface.py

vwxyzjn · 2022-10-18T15:10:34Z

@kinalmehta @simoninithomas and @Wauplin, thanks for the review. The CommitOperation suggestion is really helpful. Regarding some further comments:

Having a downstream part ( load_from_hub ), I can help with that.
Appreciate the help! That said, we are only downloading a single file from the hub, so having a customized load_from_hub might be unnecessary, right?

cleanrl/cleanrl_utils/evals/dqn_eval.py

Line 48 in b430540

    
           model_path = hf_hub_download(repo_id="cleanrl/CartPole-v1-dqn-seed1", filename="q_network.pth")

Generating a json/yaml file containing the hyperparameters for reproducibility?

Are you thinking of loading from the yaml file somehow to run the script like python dqn.py --load-yaml hyper.yaml?

Adding the library to our Hub list so that it creates a tag for people searching for cleanrl models.

That would be great! Thank you!

simoninithomas · 2022-10-20T08:09:07Z

Hi @vwxyzjn , yes for yaml I was thinking what you mentioned.

That said, we are only downloading a single file from the hub, so having a customized load_from_hub might be unnecessary, right?

Yes and no, because it has two advantages:

We are able to count how many download of the model each month.
We can cache the model without using hf_hub_download directly.

For instance with SB3 integration here's the code for load_from_hub:

def load_from_hub(repo_id: str, filename: str) -> str:
    """
    Download a model from Hugging Face Hub.
    :param repo_id: id of the model repository from the Hugging Face Hub
    :param filename: name of the model zip file from the repository
    """
    try:
        from huggingface_hub import hf_hub_download
    except ImportError:
        raise ImportError(
            "You need to install huggingface_hub to use `load_from_hub`. "
            "See https://pypi.org/project/huggingface-hub/ for installation."
        )

    # Get the model from the Hub, download and cache the model on your local disk
    downloaded_model_file = hf_hub_download(
        repo_id=repo_id,
        filename=filename,
        library_name="huggingface-sb3",
        library_version="2.1",
    )

    return downloaded_model_file

simoninithomas · 2022-10-24T10:31:49Z

FIY From our side, we started to work on the frontend integration 🤗
huggingface/hub-docs#447

vwxyzjn · 2022-10-24T21:42:05Z

Thank you @simoninithomas

We are able to count how many download of the model each month.

Does this mean hf_hub_download(repo_id="cleanrl/CartPole-v1-dqn-seed1", filename="q_network.pth") would not trigger the download stats?

We can cache the model without using hf_hub_download directly.

Does hf_hub_download not cache models? I ran the dqn_eval.py and noticed the download progress bar only presents once and it did not appear again during the second run, so I assumed hf_hub_download caches automatically.

FIY From our side, we started to work on the frontend integration 🤗
huggingface/hub-docs#447

Awesome thanks! :)

Wauplin · 2022-10-25T09:25:27Z

Hi @vwxyzjn

Does this mean hf_hub_download(repo_id="cleanrl/CartPole-v1-dqn-seed1", filename="q_network.pth") would not trigger the download stats?

I'll let @simoninithomas answer on that as I am 100% sure what is counted in # downloads / month.
Worth noticing that the example from @simoninithomas uses 2 kwargs library_name and library_version to make the Hub know which lib is downloading the model (e.g. a cleanrl user and not a random user).

Does hf_hub_download not cache models?

Yes it does ! No matter if you use hf_hub_download or snapshot_download , your files will be downloaded only once.

vwxyzjn · 2023-01-04T20:23:08Z

@simoninithomas @kinalmehta @Wauplin thanks so much for helping with this PR. I think everything looks good at this point. We also have a good notebook ready to go https://colab.research.google.com/github/vwxyzjn/cleanrl/blob/hf-integration/docs/get-started/CleanRL_Huggingface_Integration_Demo.ipynb. Documentation can be previewed at https://cleanrl-git-hf-integration-vwxyzjn.vercel.app/get-started/zoo/ (the embed link is broken in it because it's pointing to the master branch).

vwxyzjn · 2023-01-12T16:03:13Z

Merging this as is, subjecting to future PRs. We'd also probably use huggingface/blog#616 to make the announcement. Thanks for the great work, folks!

Wauplin · 2023-01-12T17:29:15Z

Congrats ! That was a big piece of work 🎉🎉

simoninithomas · 2023-01-16T16:13:32Z

Congratulations 👏 I was off at the end of last week. I'm preparing the blogpost for next week and we're going to have a unit using CleanRL on PPO with Edward and me using GodotRL we will have the PR this week I'll mention you to put you in the loop.

* initial commit * pre-commit * Add hub integration * pre-commit * use CommitOperation * Fix pre-commit * refactor * push changes * refactor * fix pre-commit * pre-commit * close the env and writer after eval * support dqn jax * pre-commit * Update cleanrl_utils/huggingface.py Co-authored-by: Lucain <lucainp@gmail.com> * address comments * update docs * support dqn_atari_jax * bug fix and docs * Add cleanrl to the hf's `metadata` * include huggingface integration * test for enjoy.py * bump version, pip install extra hack python-poetry/poetry#4842 (comment) * Update cleanrl_utils/huggingface.py Co-authored-by: Lucain <lucainp@gmail.com> * Update cleanrl_utils/huggingface.py Co-authored-by: Lucain <lucainp@gmail.com> * Update cleanrl_utils/huggingface.py Co-authored-by: Lucain <lucainp@gmail.com> * Update cleanrl_utils/huggingface.py Co-authored-by: Lucain <lucainp@gmail.com> * Update cleanrl_utils/huggingface.py Co-authored-by: Lucain <lucainp@gmail.com> * Update cleanrl_utils/huggingface.py Co-authored-by: Lucain <lucainp@gmail.com> * update docs * update pre-commit * quick fix * bug fix * lazy load modules to avoid dependency issues * Add huggingface shields * Add emoji * Update docs * pre-commit * Update docs * Update docs * fix: use `algorithm_variant_filename` in model card reproduction script * typo fix * feat: add hf support for c51 * formatting fix * support pulling variant depdencies directly * support model saving for `ppo_atari_envpool_xla_jax_scan` * support `ppo_atari_envpool_xla_jax_scan` * quick change * support 'c51_jax' * formatting fix * support capture video * Add notebook * update docs * support `c51_atari` and `c51_atari_jax` * typo fix * add c51 to zoo docs * add colab badge * fix broken colab svg * pypi release * typo fix * update pre-commit * remove hf-integration reference Co-authored-by: Lucain <lucainp@gmail.com> Co-authored-by: Kinal <kinal.mehta11@gmail.com> Co-authored-by: Kinal Mehta <kgm1995@gmail.com>

vwxyzjn added 2 commits October 13, 2022 17:51

initial commit

1b585d6

pre-commit

fa82356

vwxyzjn requested review from dosssman and kinalmehta October 13, 2022 21:53

Add hub integration

4074eee

vercel bot deployed to Preview October 13, 2022 22:00 View deployment

pre-commit

4436ce4

vercel bot deployed to Preview October 14, 2022 14:53 View deployment

kinalmehta approved these changes Oct 15, 2022

View reviewed changes

cleanrl_utils/evals/dqn_eval.py Show resolved Hide resolved

simoninithomas reviewed Oct 17, 2022

View reviewed changes

cleanrl_utils/huggingface.py Outdated Show resolved Hide resolved

cleanrl_utils/huggingface.py Outdated Show resolved Hide resolved

use CommitOperation

df41e3d

vercel bot deployed to Preview October 18, 2022 14:45 View deployment

Fix pre-commit

a98383d

vercel bot deployed to Preview October 18, 2022 14:48 View deployment

refactor

b430540

vercel bot deployed to Preview October 18, 2022 14:51 View deployment

Merge branch 'master' into hf-integration

dd8ee86

vercel bot deployed to Preview October 18, 2022 18:34 View deployment

simoninithomas mentioned this pull request Oct 23, 2022

(Do not merge yet) Add CleanRL to Hub Documentation huggingface/hub-docs#447

Closed

push changes

8144562

vercel bot deployed to Preview October 27, 2022 00:21 View deployment

refactor

2f20e17

Add notebook

7f22c25

vercel bot deployed to Preview January 3, 2023 15:57 View deployment

update docs

5331287

vercel bot deployed to Preview January 3, 2023 16:02 View deployment

kinalmehta added 2 commits January 4, 2023 08:27

support c51_atari and c51_atari_jax

9aec97e

Merge remote-tracking branch 'origin/hf-integration' into hf-integration

bc8c014

vercel bot deployed to Preview January 4, 2023 02:57 View deployment

kinalmehta added 2 commits January 4, 2023 16:58

typo fix

b202985

add c51 to zoo docs

54fd64a

vercel bot deployed to Preview January 4, 2023 11:29 View deployment

add colab badge

9e5841b

vercel bot deployed to Preview January 4, 2023 20:10 View deployment

fix broken colab svg

9178763

vercel bot deployed to Preview January 4, 2023 20:12 View deployment

pypi release

07961f4

vercel bot deployed to Preview January 4, 2023 20:17 View deployment

vwxyzjn added 2 commits January 4, 2023 15:18

typo fix

c09a80d

update pre-commit

a18ffdb

vercel bot deployed to Preview January 4, 2023 20:18 View deployment

remove hf-integration reference

ba7053a

vercel bot deployed to Preview January 4, 2023 20:20 View deployment

vwxyzjn requested a review from simoninithomas January 4, 2023 20:21

vwxyzjn merged commit 30381ee into master Jan 12, 2023

vwxyzjn mentioned this pull request Jan 12, 2023

Qdagger: Reincarnate RL #344

Merged

20 tasks

Conversation

vwxyzjn commented Oct 13, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Types of changes

Checklist:

Uh oh!

vercel bot commented Oct 13, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vwxyzjn commented Oct 13, 2022

Uh oh!

vwxyzjn commented Oct 14, 2022

Uh oh!

kinalmehta left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

simoninithomas left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

vwxyzjn commented Oct 18, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

simoninithomas commented Oct 20, 2022

Uh oh!

simoninithomas commented Oct 24, 2022

Uh oh!

vwxyzjn commented Oct 24, 2022

Uh oh!

Wauplin commented Oct 25, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vwxyzjn commented Jan 4, 2023

Uh oh!

vwxyzjn commented Jan 12, 2023

Uh oh!

Wauplin commented Jan 12, 2023

Uh oh!

simoninithomas commented Jan 16, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

vwxyzjn commented Oct 13, 2022 •

edited

Loading

vercel bot commented Oct 13, 2022 •

edited

Loading

vwxyzjn commented Oct 18, 2022 •

edited

Loading

Wauplin commented Oct 25, 2022 •

edited

Loading

simoninithomas commented Jan 16, 2023 •

edited

Loading