- [ ] check requirements / license
- [ ] copy into /data
- [ ] document basic statistics (number of files, tokens, etc.) in README.md
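
For the statistics item, one possible way to gather the numbers is sketched below. It assumes the dataset is a directory of UTF-8 text files and that a "token" means a whitespace-separated word; swap in the project's real tokenizer if it has one. The names `dataset_stats` and `format_stats` are hypothetical helpers, not part of any existing tooling.

```python
from pathlib import Path


def dataset_stats(root):
    """Count regular files and whitespace-separated tokens under root.

    Assumption: files are text; undecodable bytes are ignored rather than
    raising, so binary files will contribute noisy token counts.
    """
    n_files = 0
    n_tokens = 0
    for path in sorted(Path(root).rglob("*")):
        if not path.is_file():
            continue  # skip directories
        n_files += 1
        text = path.read_text(encoding="utf-8", errors="ignore")
        n_tokens += len(text.split())  # whitespace tokenization (assumption)
    return {"files": n_files, "tokens": n_tokens}


def format_stats(stats):
    """Render the counts as a small markdown table for README.md."""
    return (
        "| files | tokens |\n"
        "| --- | --- |\n"
        f"| {stats['files']} | {stats['tokens']} |"
    )
```

Calling `print(format_stats(dataset_stats("data")))` (with `data` standing in for wherever the dataset was copied) prints a table that can be pasted into README.md.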