You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/guides/grpo.md
+25Lines changed: 25 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -68,6 +68,31 @@ data:
68
68
env_name: "math"
69
69
```
70
70
71
+
We support using multiple datasets for train and validation. You can refer to `examples/configs/grpo_multiple_datasets.yaml` for a full configuration example. Here's an example configuration:
72
+
```yaml
73
+
data:
74
+
_override_: true # override the data config instead of merging with it
75
+
# other data settings, see `examples/configs/sft.yaml` for more details
76
+
...
77
+
# dataset settings
78
+
train:
79
+
# train dataset 1
80
+
- dataset_name: OpenMathInstruct-2
81
+
split_validation_size: 0.05# use 5% of the training data as validation data
82
+
seed: 42# seed for train/validation split when split_validation_size > 0
83
+
# train dataset 2
84
+
- dataset_name: DeepScaler
85
+
validation:
86
+
# validation dataset 1
87
+
- dataset_name: AIME2024
88
+
repeat: 16
89
+
# validation dataset 2
90
+
- dataset_name: DAPOMathAIME2024
91
+
# default settings for all datasets
92
+
default:
93
+
...
94
+
```
95
+
71
96
We support using a single dataset for both train and validation by using `split_validation_size` to set the validation ratio.
72
97
[OpenAssistant](../../nemo_rl/data/datasets/response_datasets/oasst.py), [OpenMathInstruct-2](../../nemo_rl/data/datasets/response_datasets/openmathinstruct2.py), [ResponseDataset](../../nemo_rl/data/datasets/response_datasets/response_dataset.py), [Tulu3SftMixtureDataset](../../nemo_rl/data/datasets/response_datasets/tulu3.py) are supported for this feature.
73
98
If you want to support this feature for your custom datasets or other built-in datasets, you can simply add the code to the dataset like [ResponseDataset](../../nemo_rl/data/datasets/response_datasets/response_dataset.py).
Copy file name to clipboardExpand all lines: docs/guides/sft.md
+25Lines changed: 25 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -100,6 +100,31 @@ data:
100
100
processor: "sft_processor"
101
101
```
102
102
103
+
We support using multiple datasets for train and validation. You can refer to `examples/configs/grpo_multiple_datasets.yaml` for a full configuration example. Here's an example configuration:
104
+
```yaml
105
+
data:
106
+
_override_: true # override the data config instead of merging with it
107
+
# other data settings, see `examples/configs/sft.yaml` for more details
108
+
...
109
+
# dataset settings
110
+
train:
111
+
# train dataset 1
112
+
- dataset_name: OpenMathInstruct-2
113
+
split_validation_size: 0.05# use 5% of the training data as validation data
114
+
seed: 42# seed for train/validation split when split_validation_size > 0
115
+
# train dataset 2
116
+
- dataset_name: DeepScaler
117
+
validation:
118
+
# validation dataset 1
119
+
- dataset_name: AIME2024
120
+
repeat: 16
121
+
# validation dataset 2
122
+
- dataset_name: DAPOMathAIME2024
123
+
# default settings for all datasets
124
+
default:
125
+
...
126
+
```
127
+
103
128
We support using a single dataset for both train and validation by using `split_validation_size` to set the ratio of validation.
104
129
[OpenAssistant](../../nemo_rl/data/datasets/response_datasets/oasst.py), [OpenMathInstruct-2](../../nemo_rl/data/datasets/response_datasets/openmathinstruct2.py), [ResponseDataset](../../nemo_rl/data/datasets/response_datasets/response_dataset.py), [Tulu3SftMixtureDataset](../../nemo_rl/data/datasets/response_datasets/tulu3.py) are supported for this feature.
105
130
If you want to support this feature for your custom datasets or other built-in datasets, you can simply add the code to the dataset like [ResponseDataset](../../nemo_rl/data/datasets/response_datasets/response_dataset.py).
0 commit comments