Skip to content

Commit ec042dc

Browse files
Fabio SalernoFabio Salerno
authored andcommitted
update README.md
1 parent b2c533c commit ec042dc

File tree

1 file changed

+7
-1
lines changed

1 file changed

+7
-1
lines changed

README.md

Lines changed: 7 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -38,18 +38,24 @@ Note: Data extraction experiments can run on a single GPU.
3838

3939
## Directories
4040

41+
For detailed documentation of each directory, please refer to their respective README files.
42+
4143
### Data
4244
Contains scripts and tools for dataset filtering and sample creation, organized into two main directories.
4345

46+
- The fine-tuning dataset can be retrieved at this link: [AISE-TUDelft/memtune-tuning_data](https://huggingface.co/datasets/AISE-TUDelft/memtune-tuning_data)
47+
4448
### Training
4549
Contains:
4650
- Fine-tuning scripts for StarCoder2
4751
- Training statistics and metrics
4852

53+
- The fine-tuned models can be retrieved at this link: [AISE-TUDelft/LLM4Code-memtune](https://huggingface.co/collections/AISE-TUDelft/llm4code-memtune-678a2838766dd16037a8bfe0)
54+
4955
### Evaluation
5056
Contains code, data, and results for data extraction experiments.
5157

52-
For detailed documentation of each directory, please refer to their respective README files.
58+
- The generated data extraction benchmarks are available at [AISE-TUDelft/memtune-data_attack](https://huggingface.co/datasets/AISE-TUDelft/memtune-data_attack)
5359

5460
## Ethical use
5561
Please use the code and concepts shared here responsibly and ethically. The authors have provided this code to enhance the security and safety of large language models (LLMs). Avoid using this code for any malicious purposes. When disclosing data leakage, take care not to compromise individuals' privacy unnecessarily.

0 commit comments

Comments
 (0)