Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
I
ICHLL11 Article
Manage
Activity
Members
Labels
Plan
Issues
0
Issue boards
Milestones
Wiki
Code
Merge requests
0
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Package Registry
Model registry
Operate
Environments
Terraform modules
Monitor
Incidents
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
Alice Brenon
ICHLL11 Article
Commits
68acc9d8
Commit
68acc9d8
authored
2 years ago
by
Alice Brenon
Browse files
Options
Downloads
Patches
Plain Diff
Finished editing the intro apparently
parent
3020cab8
No related branches found
No related tags found
No related merge requests found
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
ICHLL_Brenon.md
+46
-53
46 additions, 53 deletions
ICHLL_Brenon.md
with
46 additions
and
53 deletions
ICHLL_Brenon.md
+
46
−
53
View file @
68acc9d8
...
@@ -69,63 +69,56 @@ semantics and philosophical considerations:
...
@@ -69,63 +69,56 @@ semantics and philosophical considerations:
(
*
a language dictionary, which appears to be only a word dictionary, must often
(
*
a language dictionary, which appears to be only a word dictionary, must often
be a thing dictionary when it is made properly
*
). A similar criticism is made by
be a thing dictionary when it is made properly
*
). A similar criticism is made by
@haiman_dictionaries_1980 who attacks no less than six criteria on
which
@haiman_dictionaries_1980
[p. 331]
who attacks no less than six criteria on
dictionaries and encyclopedias are generally opposed to reach the
conclusion
which
dictionaries and encyclopedias are generally opposed to reach the
that there is no distinction between them because "dictionaries
*are*
conclusion
that there is no distinction between them because "dictionaries
*are*
encyclopedias". Regardless of the validity of his reasoning, it only proves one
encyclopedias". Regardless of the validity of his reasoning, it only proves one
inclusion: that perhaps, dictionaries would be a special case of encyclopedias.
inclusion: that perhaps, dictionaries would be a special case of encyclopedias.
This, as will be
evidenced
, does by no means imply that encyclopedias are
This, as will be
shown
, does by no means imply that
conversely
encyclopedias are
dictionaries.
dictionaries.
XML-TEI is a set of guidelines collectively developped by the
XML-TEI is a set of guidelines, tools and tranining resources collectively
@tei_consortium_tei_2023 under the form of XML schemas, along with a range of
developped by the @tei_consortium_tei_2023 to represent text in a highly
tools to handle them and training resources in order to represent text in a
structured and machine-readable format. Its toolbox has a modular structure
highly structured and machine-readable format. Its toolbox has a modular
consisting of optional parts each covering specific needs such as the physical
structure consisting of optional parts each covering specific needs such as the
features of a source document, the transcription of oral corpora or particular
physical features of a source document, the transcription of oral corpora or
requirements for textual domains like poetry, or, in the case at hand,
particular requirements for textual domains like poetry, or, in the case at
dictionaries. The intrinsic complexity of dictionaries has been well identified
hand, dictionaries.
since the inception of the project [@tei_vault] and @ide_encoding_1995
underlines the amount of work which went into the third version of the
After describing why the dedicated
guidelines (P3) to provide a toolbox both general and expressive enough to
module was a natural candidate to consider, I formalise tools from graph
account for the variety of conventions found in dictionaries.
theory to browse the specifications of this guideline in a rational way and
@romary_formal_2007 This module has been successfully used to encode both
explore this module in detail.
historical [@williams2017], [@bohbot2018] and digitally native dictionaries
[@bowers_bridging_2018]. In addition, a specific guidelines tailored at encoding
@romary_formal_2007
dictionaries named TEI-Lex0 has also been published [@banski_tei_lex0_2017].
(@ide_encoding_1995
*dictionaries*
only for western dictionaries) have been
The TEI effort is described as "first steps" by @ide_background_1998 to reach a
applied for both historical (@bohbot2018) and digitally native
standard to encode corpora and lay a common basis for corpora comparisons and
(@bowers_bridging_2018). In addition, a specific guidelines tailored at encoding
reuse. They point some light inconsistencies in the design, remark that there is
dictionaries, TEI-Lex0, has been published [@banski_tei_lex0_2017].
generally more than one way to encode a given text in XML-TEI and identify nine
criteria to design a sound standard. Their claims are backed by concrete
Systematic study of the guidelines @ide_background_1998 but here's a new method.
examples of encoding situations but without giving any idea of the prevalence of
the issues found. In fact, the sheer complexity of the guidelines can make it
Less than ten years after the beginings of the TEI, @ide_background_1998 gives a
hard to ascertain whether a particular element structure is impossible to
thorough account of the criteria
represent (not finding a suitable encoding is not a proof that there is none).
This chapter will use results from graph theory to give a systematic study of
the possibilities and shortcomings of the TEI
*dictionaries*
module.
# Dictionaries and encyclopedias
# Context of the study
After emerging over the course of the 18^th^ century, encyclopedias became a
fertile subgenre in themselves and a rich subject of study to digital humanities
## CollEx-Persée Project DISCO-LGE
for their particular relation to knowledge and its evolution. This section
describes the goal of the project, then looks at the origin of the term
The project
"encyclopedia" itself before comparing the approaches of encyclopedias and
dictionaries.
## Context of the project
CollEx-Persée project DISCO-LGE
(
[
https://www.collexpersee.eu/projet/disco-lge/
](
https://www.collexpersee.eu/projet/disco-lge/
)
)
(
[
https://www.collexpersee.eu/projet/disco-lge/
](
https://www.collexpersee.eu/projet/disco-lge/
)
)
set out to study
*
La Grande Encyclopédie, Inventaire raisonné des Sciences, des
set out to study
*
La Grande Encyclopédie, Inventaire raisonné des Sciences, des
Lettres et des Arts par une Société de savants et de gens de lettres
*
, an
Lettres et des Arts par une Société de savants et de gens de lettres
*
(hence
encyclopedia published in France between 1885 and 1902 by an organised team of
*LGE*
), an encyclopedia published in France between 1885 and 1902 by an
over two hundred specialists divided into eleven sections. This text comprises
organised team of over two hundred specialists divided into eleven sections.
31 tomes of about 1200 pages each and according to @jacquet-pfau2015 [, pp. 88 et
This text comprises 31 tomes of about 1200 pages each and according to
seq.] was the last major french encyclopedic endeavour directly inheriting from
@jacquet-pfau2015 [, pp. 88 et seq.] was the last major french encyclopedic
the prestigious ancestor that was the
*
Encyclopédie ou Dictionnaire raisonné des
endeavour directly inheriting from the prestigious ancestor that was the
*EDdA*
sciences des arts et des métiers
*
published by Diderot and d'Alembert 130 years
published by Diderot and d'Alembert 130 years earlier, between 1751 and 1772.
earlier, between 1751 and 1772.
The aim of the project was to digitise and make
*La Grande Encyclopédie*
The aim of the project was to digitise and make
*La Grande Encyclopédie*
available to the scientific community as well as the general public. A previous
available to the scientific community as well as the general public. A previous
...
@@ -136,8 +129,8 @@ pictures with an Optical Characters Recognition (OCR) system. This prevented an
...
@@ -136,8 +129,8 @@ pictures with an Optical Characters Recognition (OCR) system. This prevented an
exhaustive study of the text with textometry tools such as TXM [@heiden2010]. As
exhaustive study of the text with textometry tools such as TXM [@heiden2010]. As
a prelude to project GEODE
a prelude to project GEODE
(
[
https://geode-project.github.io/
](
https://geode-project.github.io/
)
), the goal
(
[
https://geode-project.github.io/
](
https://geode-project.github.io/
)
), the goal
of CollEx-Persée was to produce a digital version of
*L
a Grande Encyclopédie*
of CollEx-Persée was to produce a digital version of
*L
GE*
with a quality
with a quality
comparable to the one of l'
*Encyclopédie*
provided by the ARTFL
comparable to the one of l'
*Encyclopédie*
provided by the ARTFL
(
[
http://artfl-project.uchicago.edu/
](
http://artfl-project.uchicago.edu/
)
)
(
[
http://artfl-project.uchicago.edu/
](
http://artfl-project.uchicago.edu/
)
)
project in order to conduct a diachronic study of both encyclopedias.
project in order to conduct a diachronic study of both encyclopedias.
...
...
This diff is collapsed.
Click to expand it.
Preview
0%
Loading
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment