Commit 03e8eec

chore: Wprazuch/nano v3 repro refinements (#564)
Signed-off-by: Wojciech Prazuch <[email protected]>
1 parent acfd5e9 commit 03e8eec

2 files changed, with 25 additions and 51 deletions
packages/nemo-evaluator-launcher/examples/nemotron/local_nvidia_nemotron_3_nano_30b_a3b.yaml

Lines changed: 9 additions & 39 deletions
@@ -28,7 +28,7 @@ execution:
 
 target:
   api_endpoint:
-    model_id: nvidia/nemotron-nano-3-30b-a3b
+    model_id: nvidia/nemotron-3-nano-30b-a3b
     url: https://integrate.api.nvidia.com/v1/chat/completions
     api_key_name: NGC_API_KEY # API Key with access to build.nvidia.com
 
@@ -43,7 +43,7 @@ evaluation:
         max_new_tokens: 131072
         temperature: 0.99999
         top_p: 0.99999
-        parallelism: 512
+        parallelism: 1
         request_timeout: 3600
         max_retries: 10
         extra:
@@ -60,15 +60,18 @@ evaluation:
           use_response_logging: true
           max_logged_responses: 10
   tasks:
+    - name: ns_ifbench
+      nemo_evaluator_config:
+        config:
+          params:
+            extra:
+              num_repeats: 8
     - name: ns_bfcl_v3
-      env_vars:
-        HF_TOKEN: HF_TOKEN
       nemo_evaluator_config:
         config:
           params:
             temperature: 0.6
             top_p: 0.95
-            parallelism: 32
             extra:
               num_repeats: 1
               args: ++use_client_parsing=False
@@ -77,76 +80,43 @@ evaluation:
             adapter_config:
               use_caching: false
     - name: ns_bfcl_v4
-      env_vars:
-        HF_TOKEN: HF_TOKEN
       nemo_evaluator_config:
         config:
           params:
             max_new_tokens: 8192
-            parallelism: 128
             temperature: 0.6
             top_p: 0.95
             extra:
               num_repeats: 1
               args: ++use_client_parsing=False
     - name: ns_livecodebench
-      env_vars:
-        HF_TOKEN: HF_TOKEN
       nemo_evaluator_config:
         config:
           params:
             extra:
               num_repeats: 8
               dataset_split: test_v5_2407_2412
     - name: ns_mmlu_pro
-      env_vars:
-        HF_TOKEN: HF_TOKEN
       nemo_evaluator_config:
         config:
           params:
             extra:
               num_repeats: 1
               args: "++prompt_config=eval/aai/mcq-10choices-boxed"
     - name: ns_gpqa
-      env_vars:
-        HF_TOKEN: HF_TOKEN
       nemo_evaluator_config:
         config:
           params:
             extra:
               num_repeats: 8
               args: "++prompt_config=eval/aai/mcq-4choices"
-    - name: ns_aime2025
-      env_vars:
-        HF_TOKEN: HF_TOKEN
-        JUDGE_API_KEY: JUDGE_API_KEY
-      nemo_evaluator_config:
-        config:
-          params:
-            extra:
-              num_repeats: 64
-              args: ++prompt_config=/prompt_templates/math-oai.yaml
     - name: ns_scicode
-      env_vars:
-        HF_TOKEN: HF_TOKEN
-        JUDGE_API_KEY: JUDGE_API_KEY
-      nemo_evaluator_config:
-        config:
-          params:
-            extra:
-              num_repeats: 8
-    - name: ns_ifbench
-      env_vars:
-        HF_TOKEN: HF_TOKEN
       nemo_evaluator_config:
         config:
           params:
             extra:
               num_repeats: 8
     - name: ns_hle
-      env_vars:
-        HF_TOKEN: HF_TOKEN
-        JUDGE_API_KEY: JUDGE_API_KEY
       nemo_evaluator_config:
         config:
           params:
@@ -156,5 +126,5 @@ evaluation:
               judge:
                 parallelism: 16
                 model_id: openai/gpt-4o
-                url: <OPENAI_API_URL_FOR_JUDGE>
+                url: ??? # Set your OpenAI-compatible API URL for the judge model
                 api_key: JUDGE_API_KEY

packages/nemo-evaluator-launcher/examples/nemotron/nano-v3-reproducibility.md

Lines changed: 16 additions & 12 deletions
@@ -1,6 +1,6 @@
# NVIDIA Nemotron 3 Nano 30B A3B — Reproducing Model Card Evaluation Results
 
-This tutorial demonstrates how to reproduce the evaluation results for the **NVIDIA Nemotron 3 Nano 30B A3B** model using the NeMo Evaluator Launcher.
+This tutorial demonstrates how to reproduce the evaluation results for the [**NVIDIA Nemotron 3 Nano 30B A3B**](https://build.nvidia.com/nvidia/nemotron-3-nano-30b-a3b) model using the NeMo Evaluator Launcher.
 
 ## Overview
 
@@ -33,7 +33,8 @@ pip install nemo-evaluator-launcher
 Or install from source:
 
 ```bash
-cd packages/nemo-evaluator-launcher
+git clone https://github.com/NVIDIA-NeMo/Evaluator.git
+cd Evaluator/packages/nemo-evaluator-launcher
 pip install -e .
 ```
 
@@ -66,28 +67,31 @@ export HF_HOME="/path/to/your/huggingface/cache"
 
 ## Running the Full Evaluation Suite
 
-### Option 1: Using the Config File Directly
+### 1. Get the Configuration
 
-Navigate to the examples directory and run:
+Clone the repository or download the [example config file](https://github.com/NVIDIA-NeMo/Evaluator/blob/main/packages/nemo-evaluator-launcher/examples/nemotron/local_nvidia_nemotron_3_nano_30b_a3b.yaml):
 
 ```bash
-cd packages/nemo-evaluator-launcher/examples/nemotron
-
-nemo-evaluator-launcher run --config local_nvidia_nemotron_3_nano_30b_a3b.yaml
+git clone https://github.com/NVIDIA-NeMo/Evaluator.git
+cd Evaluator # or navigate to where you placed the config file
 ```
 
-### Option 2: Using Config Path from Anywhere
+### 2. Run the Evaluation
 
 ```bash
-nemo-evaluator-launcher run --config /path/to/examples/nemotron/local_nvidia_nemotron_3_nano_30b_a3b.yaml
+nemo-evaluator-launcher run \
+--config packages/nemo-evaluator-launcher/examples/nemotron/local_nvidia_nemotron_3_nano_30b_a3b.yaml
+# Or point to your path if you placed the file under a different location
 ```
 
-### Option 3: Dry Run (Preview Configuration)
+### 3. Dry Run (Preview Configuration)
 
 To preview the configuration without running the evaluation:
 
 ```bash
-nemo-evaluator-launcher run --config nemotron/local_nvidia_nemotron_3_nano_30b_a3b.yaml --dry-run
+nemo-evaluator-launcher run \
+--config packages/nemo-evaluator-launcher/examples/nemotron/local_nvidia_nemotron_3_nano_30b_a3b.yaml \
+--dry-run
 ```
 
 ---
@@ -132,7 +136,7 @@ The evaluation uses the NVIDIA API endpoint:
 ```yaml
 target:
   api_endpoint:
-    model_id: nvidia/nemotron-nano-3-30b-a3b
+    model_id: nvidia/nemotron-3-nano-30b-a3b
     url: https://integrate.api.nvidia.com/v1/chat/completions
     api_key_name: NGC_API_KEY
 ```
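
Note on the judge `url: ???` placeholder introduced in the YAML change: it has to be replaced with a real endpoint before the judge model can be reached. Below is a minimal Python sketch for spotting values still left as `???` before launching a run. It assumes `???` marks a required-but-unset value (the OmegaConf convention); whether the launcher enforces it the same way is not confirmed by this commit. The `find_unset_keys` helper and the inline `sample` text are hypothetical and not part of the launcher.

```python
import re

# Matches a YAML "key: ???" line, optionally followed by a trailing comment.
# Assumption: `???` marks a required-but-unset value (the OmegaConf convention).
PLACEHOLDER = re.compile(r"^\s*(?:-\s*)?([\w.-]+):\s*\?\?\?\s*(?:#.*)?$")


def find_unset_keys(yaml_text: str) -> list[tuple[int, str]]:
    """Return (line_number, key) for every line whose value is still `???`."""
    hits = []
    for lineno, line in enumerate(yaml_text.splitlines(), start=1):
        match = PLACEHOLDER.match(line)
        if match:
            hits.append((lineno, match.group(1)))
    return hits


# Hypothetical excerpt modelled on the judge block touched by this commit.
sample = """\
judge:
  parallelism: 16
  model_id: openai/gpt-4o
  url: ???  # Set your OpenAI-compatible API URL for the judge model
  api_key: JUDGE_API_KEY
"""

print(find_unset_keys(sample))  # → [(4, 'url')]
```

The check is purely textual, so it works without loading the config, but it will not see placeholders that are quoted or supplied through command-line overrides.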
