From c796940240c5887e161219fb1805fd2eb11a4b45 Mon Sep 17 00:00:00 2001 From: Mathieu Loiseau <mathieu.loiseau@liris.cnrs.fr> Date: Fri, 22 Dec 2023 12:27:49 +0000 Subject: [PATCH] readme links --- README.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index be0d1ce..65f61d2 100644 --- a/README.md +++ b/README.md @@ -14,7 +14,7 @@ where `<wiktlang>` specifies the language of the desired Wiktionary edition, `<w ### 1. Download dump files -Download the most recent Wiktionary dump files for each supported Wiktionary edition (See `supported_wiktlangs` in `src/config.py`) from `https://dumps.wikimedia.org/backup-index.html` and place them in the `dumps/` directory. The dump files should follow the pattern `<wiktlang>wiktionary-<date>-pages-articles-multistream.xml.bz2`. +Download the most recent Wiktionary dump files for each supported Wiktionary edition (See `supported_wiktlangs` in `src/config.py`) from [`https://dumps.wikimedia.org/backup-index.html`](https://dumps.wikimedia.org/backup-index.html) and place them in the `dumps/` directory. The dump files should follow the pattern `<wiktlang>wiktionary-<date>-pages-articles-multistream.xml.bz2`. If multiple timestamped dumpf files per edition are present in the `dumps/` directory, the most recent one will be selected automatically. @@ -37,7 +37,7 @@ _Since `wiktextract` and its dependency `wikitextprocessor` are not regularly pu ### 4. Load templates from dump files -Run the script `src/load_dumps.py` the dumpfile into an sqlite database that will be used by `wiktextract`. +Run the script `src/load_dumps.py` to load the most recent dumpfile (for each [supported language](https://gitlab.liris.cnrs.fr/lex-game/live-query-wiktextract/-/blob/main/src/config.py#L5)) into an sqlite database that will be used by `wiktextract`. ``` python src/load_dumps.py -- GitLab