Skip to content
Snippets Groups Projects
Commit 68acc9d8 authored by Alice Brenon's avatar Alice Brenon
Browse files

Finished editing the intro apparently

parent 3020cab8
No related branches found
No related tags found
No related merge requests found
...@@ -69,63 +69,56 @@ semantics and philosophical considerations: ...@@ -69,63 +69,56 @@ semantics and philosophical considerations:
(*a language dictionary, which appears to be only a word dictionary, must often (*a language dictionary, which appears to be only a word dictionary, must often
be a thing dictionary when it is made properly*). A similar criticism is made by be a thing dictionary when it is made properly*). A similar criticism is made by
@haiman_dictionaries_1980 who attacks no less than six criteria on which @haiman_dictionaries_1980 [p. 331] who attacks no less than six criteria on
dictionaries and encyclopedias are generally opposed to reach the conclusion which dictionaries and encyclopedias are generally opposed to reach the
that there is no distinction between them because "dictionaries *are* conclusion that there is no distinction between them because "dictionaries *are*
encyclopedias". Regardless of the validity of his reasoning, it only proves one encyclopedias". Regardless of the validity of his reasoning, it only proves one
inclusion: that perhaps, dictionaries would be a special case of encyclopedias. inclusion: that perhaps, dictionaries would be a special case of encyclopedias.
This, as will be evidenced, does by no means imply that encyclopedias are This, as will be shown, does by no means imply that conversely encyclopedias are
dictionaries. dictionaries.
XML-TEI is a set of guidelines collectively developped by the XML-TEI is a set of guidelines, tools and tranining resources collectively
@tei_consortium_tei_2023 under the form of XML schemas, along with a range of developped by the @tei_consortium_tei_2023 to represent text in a highly
tools to handle them and training resources in order to represent text in a structured and machine-readable format. Its toolbox has a modular structure
highly structured and machine-readable format. Its toolbox has a modular consisting of optional parts each covering specific needs such as the physical
structure consisting of optional parts each covering specific needs such as the features of a source document, the transcription of oral corpora or particular
physical features of a source document, the transcription of oral corpora or requirements for textual domains like poetry, or, in the case at hand,
particular requirements for textual domains like poetry, or, in the case at dictionaries. The intrinsic complexity of dictionaries has been well identified
hand, dictionaries. since the inception of the project [@tei_vault] and @ide_encoding_1995
underlines the amount of work which went into the third version of the
After describing why the dedicated guidelines (P3) to provide a toolbox both general and expressive enough to
module was a natural candidate to consider, I formalise tools from graph account for the variety of conventions found in dictionaries.
theory to browse the specifications of this guideline in a rational way and @romary_formal_2007 This module has been successfully used to encode both
explore this module in detail. historical [@williams2017], [@bohbot2018] and digitally native dictionaries
[@bowers_bridging_2018]. In addition, a specific guidelines tailored at encoding
@romary_formal_2007 dictionaries named TEI-Lex0 has also been published [@banski_tei_lex0_2017].
(@ide_encoding_1995 *dictionaries* only for western dictionaries) have been The TEI effort is described as "first steps" by @ide_background_1998 to reach a
applied for both historical (@bohbot2018) and digitally native standard to encode corpora and lay a common basis for corpora comparisons and
(@bowers_bridging_2018). In addition, a specific guidelines tailored at encoding reuse. They point some light inconsistencies in the design, remark that there is
dictionaries, TEI-Lex0, has been published [@banski_tei_lex0_2017]. generally more than one way to encode a given text in XML-TEI and identify nine
criteria to design a sound standard. Their claims are backed by concrete
Systematic study of the guidelines @ide_background_1998 but here's a new method. examples of encoding situations but without giving any idea of the prevalence of
the issues found. In fact, the sheer complexity of the guidelines can make it
Less than ten years after the beginings of the TEI, @ide_background_1998 gives a hard to ascertain whether a particular element structure is impossible to
thorough account of the criteria represent (not finding a suitable encoding is not a proof that there is none).
This chapter will use results from graph theory to give a systematic study of
the possibilities and shortcomings of the TEI *dictionaries* module.
# Dictionaries and encyclopedias
# Context of the study
After emerging over the course of the 18^th^ century, encyclopedias became a
fertile subgenre in themselves and a rich subject of study to digital humanities ## CollEx-Persée Project DISCO-LGE
for their particular relation to knowledge and its evolution. This section
describes the goal of the project, then looks at the origin of the term The project
"encyclopedia" itself before comparing the approaches of encyclopedias and
dictionaries.
## Context of the project
CollEx-Persée project DISCO-LGE
([https://www.collexpersee.eu/projet/disco-lge/](https://www.collexpersee.eu/projet/disco-lge/)) ([https://www.collexpersee.eu/projet/disco-lge/](https://www.collexpersee.eu/projet/disco-lge/))
set out to study *La Grande Encyclopédie, Inventaire raisonné des Sciences, des set out to study *La Grande Encyclopédie, Inventaire raisonné des Sciences, des
Lettres et des Arts par une Société de savants et de gens de lettres*, an Lettres et des Arts par une Société de savants et de gens de lettres* (hence
encyclopedia published in France between 1885 and 1902 by an organised team of *LGE*), an encyclopedia published in France between 1885 and 1902 by an
over two hundred specialists divided into eleven sections. This text comprises organised team of over two hundred specialists divided into eleven sections.
31 tomes of about 1200 pages each and according to @jacquet-pfau2015 [, pp. 88 et This text comprises 31 tomes of about 1200 pages each and according to
seq.] was the last major french encyclopedic endeavour directly inheriting from @jacquet-pfau2015 [, pp. 88 et seq.] was the last major french encyclopedic
the prestigious ancestor that was the *Encyclopédie ou Dictionnaire raisonné des endeavour directly inheriting from the prestigious ancestor that was the *EDdA*
sciences des arts et des métiers* published by Diderot and d'Alembert 130 years published by Diderot and d'Alembert 130 years earlier, between 1751 and 1772.
earlier, between 1751 and 1772.
The aim of the project was to digitise and make *La Grande Encyclopédie* The aim of the project was to digitise and make *La Grande Encyclopédie*
available to the scientific community as well as the general public. A previous available to the scientific community as well as the general public. A previous
...@@ -136,8 +129,8 @@ pictures with an Optical Characters Recognition (OCR) system. This prevented an ...@@ -136,8 +129,8 @@ pictures with an Optical Characters Recognition (OCR) system. This prevented an
exhaustive study of the text with textometry tools such as TXM [@heiden2010]. As exhaustive study of the text with textometry tools such as TXM [@heiden2010]. As
a prelude to project GEODE a prelude to project GEODE
([https://geode-project.github.io/](https://geode-project.github.io/)), the goal ([https://geode-project.github.io/](https://geode-project.github.io/)), the goal
of CollEx-Persée was to produce a digital version of *La Grande Encyclopédie* of CollEx-Persée was to produce a digital version of *LGE* with a quality
with a quality comparable to the one of l'*Encyclopédie* provided by the ARTFL comparable to the one of l'*Encyclopédie* provided by the ARTFL
([http://artfl-project.uchicago.edu/](http://artfl-project.uchicago.edu/)) ([http://artfl-project.uchicago.edu/](http://artfl-project.uchicago.edu/))
project in order to conduct a diachronic study of both encyclopedias. project in order to conduct a diachronic study of both encyclopedias.
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment