You can download the nearest neighbor datasets used in the examples from Hugging Face:
Make sure that the downloaded datasets are stored in the example/data directory.
You can use huggingface-cli to download the datasets.
huggingface-cli download habedi/nearest-neighbors-datasets --repo-type dataset \
--local-dir nearest-neighbors-datasetshuggingface-cli download habedi/nearest-neighbors-datasets-large --repo-type dataset \
--local-dir nearest-neighbors-datasets-largeThe command must be run inside this directory (example/data).
For convenience, you can use the pyproject.toml file to set up a Python environment with the
required dependencies using Poetry.