diff --git a/README.md b/README.md
index be0d1ce8db9609372c6e1153d60c934e8d121b5b..65f61d20a1a5eaca45a6c692ac6fd61a7d3e62d0 100644
--- a/README.md
+++ b/README.md
@@ -14,7 +14,7 @@ where `<wiktlang>` specifies the language of the desired Wiktionary edition, `<w
 
 ### 1. Download dump files
 
-Download the most recent Wiktionary dump files for each supported Wiktionary edition (See `supported_wiktlangs` in `src/config.py`) from `https://dumps.wikimedia.org/backup-index.html` and place them in the `dumps/` directory. The dump files should follow the pattern `<wiktlang>wiktionary-<date>-pages-articles-multistream.xml.bz2`.
+Download the most recent Wiktionary dump files for each supported Wiktionary edition (See `supported_wiktlangs` in `src/config.py`) from [`https://dumps.wikimedia.org/backup-index.html`](https://dumps.wikimedia.org/backup-index.html) and place them in the `dumps/` directory. The dump files should follow the pattern `<wiktlang>wiktionary-<date>-pages-articles-multistream.xml.bz2`.
 
 If multiple timestamped dumpf files per edition are present in the `dumps/` directory, the most recent one will be selected automatically.
 
@@ -37,7 +37,7 @@ _Since `wiktextract` and its dependency `wikitextprocessor` are not regularly pu
 
 ### 4. Load templates from dump files
 
-Run the script `src/load_dumps.py` the dumpfile into an sqlite database that will be used by `wiktextract`.
+Run the script `src/load_dumps.py` to load the most recent dumpfile (for each [supported language](https://gitlab.liris.cnrs.fr/lex-game/live-query-wiktextract/-/blob/main/src/config.py#L5)) into an sqlite database that will be used by `wiktextract`.
 
 ```
 python src/load_dumps.py