Commit ebb50517 authored by Jacques Fize

Update train_geocoder.py

parent d83be71f
@@ -112,8 +112,7 @@ if args.tokenization_method == "bert":
# Identify all ngram available
pairs_of_toponym.toponym.apply(lambda x : index.split_and_add(x))
pairs_of_toponym.toponym_context.apply(lambda x : index.split_and_add(x))
print(len(index.ngram_index))
index.filter_top_ngram(10000)
num_words = len(index.index_ngram) # necessary for the embedding matrix
......
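For readers without the rest of the repository at hand, the hunk above can be read as "collect every character n-gram, keep the most frequent ones, and size the embedding matrix from what remains". The sketch below illustrates that idea only. The class `NgramIndexSketch`, its `freq` counter, the `$` padding character, and the id-assignment scheme are all assumptions made for illustration; the real `index` object in `train_geocoder.py` is not shown in this diff and may behave differently.

```python
from collections import Counter


class NgramIndexSketch:
    """Hypothetical stand-in for the `index` object used in the hunk."""

    def __init__(self, n=4):
        self.n = n
        self.ngram_index = {}  # n-gram -> integer id
        self.index_ngram = {}  # integer id -> n-gram
        self.freq = Counter()  # n-gram -> number of occurrences

    def split_and_add(self, word):
        # Pad with "$" so word boundaries produce their own n-grams (assumed convention).
        padded = "$" + word.lower() + "$"
        # Strings shorter than n yield no n-grams at all.
        grams = [padded[i:i + self.n] for i in range(len(padded) - self.n + 1)]
        for gram in grams:
            self.freq[gram] += 1
            if gram not in self.ngram_index:
                new_id = len(self.ngram_index)
                self.ngram_index[gram] = new_id
                self.index_ngram[new_id] = gram
        return grams

    def filter_top_ngram(self, k):
        # Keep only the k most frequent n-grams and reassign contiguous ids.
        kept = [gram for gram, _ in self.freq.most_common(k)]
        self.ngram_index = {gram: i for i, gram in enumerate(kept)}
        self.index_ngram = {i: gram for gram, i in self.ngram_index.items()}


# Usage with a plain list instead of the pandas columns from the hunk.
index = NgramIndexSketch(n=2)
for toponym in ["aa", "ab"]:
    index.split_and_add(toponym)

size_before = len(index.ngram_index)
index.filter_top_ngram(2)
num_words = len(index.index_ngram)  # vocabulary size after filtering
```

Under these assumptions, `num_words` is bounded by the argument passed to `filter_top_ngram`, which is why it is computed only after filtering in the hunk.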