- Jan 06, 2024
-
-
Alice Brenon authored
-
- Jan 05, 2024
-
-
Alice Brenon authored
-
- Jan 02, 2024
-
-
Alice Brenon authored
Improve scripts for profile computation: searching from (serialized) indexed trees, and using ranges to compact occurrences found
-
- Jan 01, 2024
-
-
Alice Brenon authored
Add types to represent indexed sentences and documents, effectively absorbing part of the work that was done during the search
-
- Dec 30, 2023
-
-
Alice Brenon authored
Add a script to serialize syntax trees computed from the .conllu files, copying part of the Conllu.Type types in the process to get a Generic instance for Serialization and to slightly improve structure by indexing features by name with a Map
-
- Dec 29, 2023
-
-
Alice Brenon authored
-
Alice Brenon authored
Lift some constraint on the monad in and use that to pass the config around as a Reader in the new profile extractor script
-
Alice Brenon authored
-
- Dec 27, 2023
-
-
Alice Brenon authored
-
- Dec 22, 2023
-
-
Alice Brenon authored
-
Alice Brenon authored
-
- Dec 13, 2023
-
-
Alice Brenon authored
-
- Dec 05, 2023
-
-
Alice Brenon authored
-
- Nov 22, 2023
-
-
Alice Brenon authored
-
Alice Brenon authored
-
Alice Brenon authored
-
Alice Brenon authored
A tiny script used to add the manually annotated label to a bunch of texts being prepared in a JSONL (to be used as input for prodigy)
-
Alice Brenon authored
-
Alice Brenon authored
-
- Nov 20, 2023
-
-
Alice Brenon authored
Add predictor for MultiBERT model, improve the trainer for the same model and expose the predicted score in the model's output
-
- Nov 17, 2023
-
-
Alice Brenon authored
-
Alice Brenon authored
Separate train script into one to train directly one classifier and one to train one binary classifier (accept/reject) for each discursive function
-
Alice Brenon authored
-
Alice Brenon authored
-
- Oct 30, 2023
-
-
Alice Brenon authored
-
- Sep 29, 2023
-
-
Alice Brenon authored
Add support for labeled data in input tsv and make the paralallel between conversions to and from prodigy's jsonl more obvious
-
- Sep 28, 2023
-
-
Alice Brenon authored
-
Alice Brenon authored
Temporary hack to be able to extract paragraph subcorpora as well article ones (TODO: expose something similar to readNamedTsv returning Either (Vector ParagraphRecord) (Vector ArticleRecord))
-
- Sep 26, 2023
-
-
Alice Brenon authored
-
- Sep 22, 2023
-
-
Alice Brenon authored
-
- Sep 19, 2023
-
-
Alice Brenon authored
-
- Sep 14, 2023
-
-
Alice Brenon authored
-
Alice Brenon authored
-
- Sep 13, 2023
-
-
Alice Brenon authored
-
Alice Brenon authored
-
- Sep 12, 2023
-
-
Alice Brenon authored
-
- Sep 07, 2023
-
-
Alice Brenon authored
-
Alice Brenon authored
Add a script to explode (Text) articles into paragraphs, outputing a files.tsv with a primary key for each of them in the process
-
Alice Brenon authored
-
Alice Brenon authored
-