You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+7-1Lines changed: 7 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -38,18 +38,24 @@ Note: Data extraction experiments can run on a single GPU.
38
38
39
39
## Directories
40
40
41
+
For detailed documentation of each directory, please refer to their respective README files.
42
+
41
43
### Data
42
44
Contains scripts and tools for dataset filtering and sample creation, organized into two main directories.
43
45
46
+
- The fine-tuning dataset can be retrieved at this link: [AISE-TUDelft/memtune-tuning_data](https://huggingface.co/datasets/AISE-TUDelft/memtune-tuning_data)
47
+
44
48
### Training
45
49
Contains:
46
50
- Fine-tuning scripts for StarCoder2
47
51
- Training statistics and metrics
48
52
53
+
- The fine-tuned models can be retrieved at this link: [AISE-TUDelft/LLM4Code-memtune](https://huggingface.co/collections/AISE-TUDelft/llm4code-memtune-678a2838766dd16037a8bfe0)
54
+
49
55
### Evaluation
50
56
Contains code, data, and results for data extraction experiments.
51
57
52
-
For detailed documentation of each directory, please refer to their respective README files.
58
+
- The generated data extraction benchmarks are available at [AISE-TUDelft/memtune-data_attack](https://huggingface.co/datasets/AISE-TUDelft/memtune-data_attack)
53
59
54
60
## Ethical use
55
61
Please use the code and concepts shared here responsibly and ethically. The authors have provided this code to enhance the security and safety of large language models (LLMs). Avoid using this code for any malicious purposes. When disclosing data leakage, take care not to compromise individuals' privacy unnecessarily.
0 commit comments