Commit ffce56a

added note on seed dataset
1 parent dbe99dd commit ffce56a

5 files changed: +15 -5 lines changed

nemo/NeMo-Data-Designer/advanced/healthcare-datasets/clinical-trials.ipynb

Lines changed: 3 additions & 1 deletion
@@ -568,7 +568,9 @@
 "\n",
 "1. Medical observations and notes\n",
 "2. Adverse event descriptions \n",
-"3. Protocol deviation explanations"
+"3. Protocol deviation explanations\n",
+"\n",
+"**Note**: At this time, we only support using a single file as the seed. If you have multiple files you would like to use as seeds, it is recommended you consolidate these into a single file. "
 ]
 },
 {

nemo/NeMo-Data-Designer/advanced/healthcare-datasets/physician-notes-with-realistic-personal-details.ipynb

Lines changed: 3 additions & 1 deletion
@@ -126,7 +126,9 @@
 "source": [
 "## 📊 Loading Seed Data\n",
 "\n",
-"We'll use the symptom-to-diagnosis dataset as our seed data. This dataset contains patient symptoms and corresponding diagnoses which will help generate realistic medical scenarios."
+"We'll use the symptom-to-diagnosis dataset as our seed data. This dataset contains patient symptoms and corresponding diagnoses which will help generate realistic medical scenarios.\n",
+"\n",
+"**Note**: At this time, we only support using a single file as the seed. If you have multiple files you would like to use as seeds, it is recommended you consolidate these into a single file. "
 ]
 },
 {

nemo/NeMo-Data-Designer/advanced/multimodal/visual-question-answering-using-vlm.ipynb

Lines changed: 3 additions & 1 deletion
@@ -157,7 +157,9 @@
 "- **Metadata Extraction**: Preserve relevant document information\n",
 "- **Sampling Strategy**: Configure how the seed data is utilized during generation\n",
 "\n",
-"The seed dataset can be referenced in generation prompts using Jinja templating."
+"The seed dataset can be referenced in generation prompts using Jinja templating.\n",
+"\n",
+"**Note**: At this time, we only support using a single file as the seed. If you have multiple files you would like to use as seeds, it is recommended you consolidate these into a single file. "
 ]
 },
 {

nemo/NeMo-Data-Designer/advanced/rag-examples/generate-rag-evaluation-dataset.ipynb

Lines changed: 3 additions & 1 deletion
@@ -262,7 +262,9 @@
 "source": [
 "### Processing Documents and Setting Up Data Designer\n",
 "\n",
-"Now we'll process our document chunks and set up the Data Designer with our seed dataset."
+"Now we'll process our document chunks and set up the Data Designer with our seed dataset.\n",
+"\n",
+"**Note**: At this time, we only support using a single file as the seed. If you have multiple files you would like to use as seeds, it is recommended you consolidate these into a single file. "
 ]
 },
 {

nemo/NeMo-Data-Designer/intro-tutorials/3-seeding-with-a-dataset.ipynb

Lines changed: 3 additions & 1 deletion
@@ -148,7 +148,9 @@
 "\n",
 "- To steer the generation process, we will use an open-source [symptom-to-diagnosis dataset](https://huggingface.co/datasets/gretelai/symptom_to_diagnosis).\n",
 "\n",
-"- In this dataset, the `input_text` represents the `patient_summary` and the `output_text` represents the `diagnosis`\n"
+"- In this dataset, the `input_text` represents the `patient_summary` and the `output_text` represents the `diagnosis`\n",
+"\n",
+"**Note**: At this time, we only support using a single file as the seed. If you have multiple files you would like to use as seeds, it is recommended you consolidate these into a single file. \n"
 ]
 },
 {
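
The note added in this commit asks users to consolidate multiple seed files into one. A minimal sketch of one way to do that for CSV seeds, using only the Python standard library, is shown here. It assumes every input file shares the same header row (for example `input_text,output_text`, as in the symptom-to-diagnosis dataset). The function name `consolidate_seed_files` and any file paths passed to it are hypothetical and not part of Data Designer.

```python
import csv


def consolidate_seed_files(input_paths, output_path):
    """Merge several CSV files that share one header into a single CSV file.

    Hypothetical helper for illustration only; it is not a Data Designer API.
    Returns the number of data rows written (the header is not counted).
    """
    header = None
    rows_written = 0
    with open(output_path, "w", newline="", encoding="utf-8") as out_file:
        writer = csv.writer(out_file)
        for path in input_paths:
            with open(path, newline="", encoding="utf-8") as in_file:
                reader = csv.reader(in_file)
                file_header = next(reader, None)
                if file_header is None:
                    continue  # skip completely empty files
                if header is None:
                    # The first non-empty file defines the expected columns.
                    header = file_header
                    writer.writerow(header)
                elif file_header != header:
                    raise ValueError(
                        f"{path} has columns {file_header}, expected {header}"
                    )
                for row in reader:
                    writer.writerow(row)
                    rows_written += 1
    return rows_written
```

The resulting single file can then be supplied as the seed in the same way the notebooks load their one seed dataset. Rejecting mismatched headers, rather than aligning columns silently, is a deliberate choice here so that schema differences between seed files surface early.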
