[Step 1: Mutual Introduction & Research Interests & Missing projects&researchers]

Marco BÜCHLER

unread,

Dec 7, 2011, 4:04:16 AM12/7/11

to historical-...@googlegroups.com

Dear subscribers,

as already mentioned previously, this mailing list is about historical text re-use and all related topics to that. Recently, we do already have 24 persons on this list, 8 invitations are still pending. Mails are BCC'ed.

1. Mutual Introduction
-----------------------------
First of all, I want to mutual introduce subscibers that are still on the list. Order is by choice or as Google Groups present the users:

eTRACES/GCDH: Gerhard Lauer, Annette Geßner, Juan Garces all Göttingen/Germany. eTRACES is a project funded by the German Ministery of Education and Research. GCDH/Göttingen is one of three partners.
eTRACES/GESIS: Brigitte Matthiak Cologne/Germany. Gesis is a Leibnitz Institute for Social Science.
Sharing Ancient Wisdoms: (SAWS) - Charlotte Roueche, Charlotte Tupman both King's College London. This project is a HERA project in collaboration with Dennis Searby in Upsalla, Sweden, and University of Vienna, Austria.
Tesserae: Neil Coffee, Christopher Forstall both Buffalo University, USA. They work on allusion detection recently on Latin. Furhter request for money is pending. Adaption to Greek is planned.
Relevant Researcher: Helma Dik, Richard Whaling both Chicago, USA. I met several times both person. Especially Helma has excellent (and widely covered) skills and can make "always" a relevant contribution.
eTRACES/ASV: Marco Büchler, (Gerhard Heyer - recently not on list), Maria Moritz, Christian Kötteritzsch, Frederik Baumgardt (master student). The natural language processing group is the third partner of eTRACES.
ARTFL: Namely here: Mark Olsen, USA. Within ARTFL several work/papers are done/written about sequence alignment.
"Canonical References": This is actually not really a funded project but Matteo Romanello's (Berlin, Germany) work on extracting canonical references. There exist an open source project.
"Fragmentary authors": Similar to Matteo's work, Monica Berti (Turino, Italy) does not work a funded project but relevant work on incomplete text re-use - means if source is not available.
Corpus der Arabischen und Syrischen Gnomologien (CASG): Ute Pietruschka, Norman Wetzig, Marco Reinhold - all Halle University, Germany. Project aims are detection of archetypus, translation techniques between Greek, Arabic, and Syriac, lines of transmissions of proverbs.
MorphAdorner: Philip Burns - Northwestern U., Evanston, IL, USA. A nice tool for playing with variants and morphology.
"Relevant Researcher": Stefan Jaenicke, Image and Signal Processing Group, Leipzig University. He is PhD student at Leipz. U. and recently thinks about a PhD about "Visualizing historical text re-use".
"Relevant Researcher": Tom Cheesman, Swansea U, UK. Tom and I had several mail exchanges some weeks ago. As I stored his work in my mind, he works as well on visualizing text re-use. Further funding is pending.

At the moment there is still pending invitations to the following project:
HyperHamlet, Switzerland,
BibleIndex, France,
QuotationFinder, Heidelberg, Germany. They make text re-use on historical Chines texts.

2. Research Interests
-----------------------------
WIthin the last two years I met and came into contact with all of you. Actually we do already cover a wide range of aspects of historical text re-use. For this reason, it seems to me relevant that we inform us about research interests, corpora, and things like this.

May I ask you - above mentioned projects and researchers - to provide some information about your work/project to all other list subscribers.

At the moment it seems to be relevant to write some words about:
2.A) Time span: Which time span is of your interest?
2.B) Spatial limitations: Is your project limited to specific geographical "areas/locations" like texts from the Mediterranean Sea?
2.C) Language(s): Which languages your project is working on?
2.D) Project aims/research question: What are your aims and research questions? Everything is welcome here: Text re-use is actually one of the best research fields for collaborations of humanists, computer scientists, digital humanists and so on. All questions are important and welcome.
2.E) Methodology: Do you collect traces of of text re-use manually (by hand) or what are your techniques?
2.F) Online references: What is your project website?

3. Missing projects&researchers
-------------------------------------------
IF you miss any project/researcher feel free to ask them to join the list: https://groups.google.com/group/historical-text-re-use/.

4. Ressources
-------------------
Do you think that something like a Wiki makes sense? Please me know what you think about it? It is not a thing to establish one.

Any other ressources neccessary?

Best,
Yours Marco

Mit freundlichen Grüssen
Marco Büchler


-- 
Marco Büchler
Natural Language Processing Group
Department of Computer Science
University of Leipzig
Johannisgasse 26
04109 Leipzig, Germany

Room  : 0-17
Phone : 0341 / 97-32257
eMail : mbue...@e-humanities.net
Web   : http://www.asv.informatik.uni-leipzig.de

lt

Monica Berti

unread,

Dec 8, 2011, 9:04:17 AM12/8/11

to historical-...@googlegroups.com

Hi All,

here is some information about my project called "Fragmentary Texts" (http://www.fragmentarytexts.org/)

1. Mutual Introduction
-----------------------------

"Fragmentary Texts" (http://www.fragmentarytexts.org/)

Project Director: Monica Berti (Università di Roma Tor Vergata & Tufts University)

2. Research Interests
-----------------------------

2.A) Time span: Which time span is of your interest?

Greek and Latin sources (8th century BC - 6th century AD)

2.B) Spatial limitations: Is your project limited to specific geographical "areas/locations" like texts from the Mediterranean Sea?

Texts from the Mediterranean Sea

2.C) Language(s): Which languages your project is working on?

Greek and Latin

2.D) Project aims/research question: What are your aims and research questions?

The aim of "Fragmentary Texts" is to work on models and methodologies for collecting and representing Greek and Latin texts of classical antiquity that are lost and that have been preserved as “fragments”, i.e. as "quotations" by other surviving authors. Classical sources offer many examples of quotations and textual reuses (citation, paraphrases, plagiarism, etc.) that can be very helpful to explore also modern textual reuses.

2.E) Methodology: Do you collect traces of of text re-use manually (by hand) or what are your techniques?

Both manually and with the help of digital libraries of classical sources (e.g. Perseus, TLG).

2.F) Online references: What is your project website?

http://www.fragmentarytexts.org/

http://www.fragmentarytexts.org/demo/

3. Missing projects&researchers
-------------------------------------------
IF you miss any project/researcher feel free to ask them to join the list: https://groups.google.com/group/historical-text-re-use/.

4. Ressources
-------------------
Do you think that something like a Wiki makes sense? Please me know what you think about it? It is not a thing to establish one.

I think that a Wiki makes perfect sense in order to build a community of different projects with similar questions, methodologies and problems.

All the best,

Monica Berti

_______________________________________________________________________________________________________

Monica Berti

Università degli Studi di Roma Tor Vergata - Dipartimento di Antichità e Tradizione Classica (http://antichita.uniroma2.it/berti.htm)

Tufts University - Department of Classics (http://ase.tufts.edu/classics/facultyguide/visiting/)

monica...@uniroma2.it

monica...@tufts.edu

Lauer, Gerhard

unread,

Dec 12, 2011, 6:51:13 AM12/12/11

to historical-...@googlegroups.com

… to follow up:

eTraces. Search and analysis of text reuse and transfer of knowledge in texts of the social sciences and in German literature

Sub-project: Text reuse in German literature

(A website is on its way)

1. Mutual Introduction
-----------------------------

Project Director: Gerhard Heyer, Marco Büchler (University of Leipzig)

Sub-project: Gerhard Lauer, Annette Geßner, Christian Kötteritzsch (University of Göttingen & Leipzig)

2. Research Interests
-----------------------------
2.A) Time span: Which time span is of your interest?

German sources (1500-1900)

2.B) Spatial limitations: Is your project limited to specific geographical "areas/locations" like texts from the Mediterranean Sea?

Texts from the German speaking countries

2.C) Language(s): Which languages your project is working on?

German

2.D) Project aims/research question: What are your aims and research questions?

The objectives of this sub project is to develop computer based methods for the analysis of literary history. We take established questions in the study of literary history (like secularization, the rise of aesthetic autonomy) and try to answer them by algorithm.

2.E) Methodology: Do you collect traces of of text re-use manually (by hand) or what are your techniques?

We use the digital library (the zeno.org corpus) of TextGrid (http://www.textgrid.de/en/digitale-bibliothek.html).

2.F) Online references: What is your project website?

Still on its way

3. Missing projects&researchers
-------------------------------------------
IF you miss any project/researcher feel free to ask them to join the list: https://groups.google.com/group/historical-text-re-use/.

4. Ressources
-------------------
Do you think that something like a Wiki makes sense? Please me know what you think about it? It is not a thing to establish one.

I think we are all looking for best practice examples, tools etc. and would like to share our findings via wiki.

Best,

gerhard

Gerhard Lauer

www.gcdh.de

--
You received this message because you are subscribed to the Google Groups "Historical Text Re-use" group.
To post to this group, send email to historical-...@googlegroups.com.
To unsubscribe from this group, send email to historical-text-r...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/historical-text-re-use?hl=en.

Cheesman T.

unread,

Dec 12, 2011, 10:57:17 AM12/12/11

to historical-...@googlegroups.com

Dear Marco et al.

>> "Relevant Researcher": Tom Cheesman, Swansea U, UK. Tom and I had several mail exchanges some weeks ago. As I stored his work in my mind, he works as well on visualizing text re-use. Further funding is pending. <<

Project title: “Translation Arrays: Version Variation Visualization”.

Researchers involved: Tom Cheesman (Dept of Languages, Translation and Media, Swansea U), Robert S Laramee (Dept of Computer Science, Swansea U), Jonathan Hope (Dept of English Studies, Strathclyde U), Stephan Thiel (freelance software designer, Berlin), David M Berry (Dept of Political and Cultural Studies, Swansea U).

A pilot ‘proof of concept’ project (Cheesman, Laramee, Berry) was funded Feb-June 2011 by Swansea University. On 30 Nov 11 an application was submitted to AHRC.ac.uk for further funding (Cheesman, Laramee, Hope; Thiel as nominated consultant). If this succeeds, we aim to build a prototype ‘translation array’ with basic functionality by summer 2012.

2.A) Time span: Which time span is of your interest?

[ The next few weeks! No, seriously: ]

We’re currently working with c.50 German translations of Othello by Shakespeare, from the earliest we can find to the present: they go back to 1766 and the most recent we have is from 2006.

We aim to design an interface which can be adapted for content from any time/s: our work is not conceived as a Shakespeare-specific project in the medium term, and we would very much like to talk to anyone who has a similar multiple translation (single-target-language) corpus and is interested in this type of work.

A ‘translation array’ will visualise cultural space-time, just as a ‘telescope array’ visualizes cosmological space-time. (OK, it’s ‘only’ a metaphor…)

2.B) Spatial limitations: Is your project limited to specific geographical "areas/locations" like texts from the Mediterranean Sea?

In principle we are interested in the whole world. Our core corpus is of texts mainly produced in German-speaking Europe, but some outside that zone (translators in exile 1933-45 in the UK, Australia/USA).

The questions of ‘time’, ‘space’, and ‘language’ are often tricky when we look at translations.

Translators’ interlingual cross-cultural spaces of operation vary a lot: some have little or no English: they rework others’ work; this still happens “even” in German, with theatre translations. (There’s a strong cultural norm of linguistic expertise among German translators, stronger than in some other target cultures.) In many languages (e.g. Spanish, Polish, Arabic…), the first translators worked from French or German versions. Translators can be influenced by versions in other languages, even when they do not know those languages either (e.g. the influence of Verdi’s opera in the case of Othello). Translations are interpretations, and the motivations of interpretations are, naturally, complex, and often (maybe: normally) cross-cultural.

2.C) Language(s): Which languages your project is working on?

Again, the core corpus is in German, but if we succeed with German we want to extend the work to any other languages – any languages where there is a multiplicity of translations, providing material for comparative corpus analyses.

2.D) Project aims/research question: What are your aims and research questions? Everything is welcome here: Text re-use is actually one of the best research fields for collaborations of humanists, computer scientists, digital humanists and so on. All questions are important and welcome.

The aim is to create digital tools to facilitate exploration and analysis of corpora of multiple translations of the ‘same’ source, so as to understand better how (and eventually why) translations vary between times and between physical and cultural spaces. We think that investigations of multiple translations, represented in visual forms, can create new ways of reading the source texts, even for readers who don’t know the translating languages. This last point is key to current work, because if visual analytics can be used to ‘bypass’ language barriers, then translation arrays may be of very wide interest in education and research.

2.E) Methodology: Do you collect traces of of text re-use manually (by hand) or what are your techniques?

We have been acquiring multiple translations of Othello ‘by hand’ for some time. Few are accessible online. New ones are normally available in digital formats from theatre publishers. Most of our German corpus has had to be scanned and OCR’d, from printed and typescript sources. Preparation is ongoing. Licensing is a (funding) issue.

2.F) Online references: What is your project website?

www.delightedbeauty.org – the site has two functions: (1) it provides some info about our project, including outputs (including a new draft paper, “Eddy and Viv”, on algorithmic analysis and visualization), and (2) it crowd-sources translations of 2 lines from Othello in various languages, with a view to creating future multilingual resources and developing a worldwide network of collaborators.

Best regards

Tom

Dr Tom Cheesman

Reader in German

College of Arts and Humanities

Swansea University

SA2 8PP

T: +44 (0)1792 205678 x 4030

M: (+)(0)7736408064

Institution: www.swansea.ac.uk/staff/academic/ArtsHumanities/cheesmant/

Shakespeare/Translation: www.delightedbeauty.org

Asylum: www.swanseabassgroup.org

Publishing: www.lulu.com/hafan

Christopher Forstall

unread,

Dec 13, 2011, 1:41:49 PM12/13/11

to historical-...@googlegroups.com

Dear Marco and colleagues,

Here is an introduction to Tesserae, following Marco's questionnaire.

1. Introduction

    Project name: Tesserae

    Principal investigators:

    Neil Coffee and J.-P. Koenig, University at Buffalo

2a. Time Span

    Currently we have texts from ca. 800 BC - 800 AD; one day we aim to
    include the middle ages and possibly even later texts.

2b. Spatial limitations

    In practice, our texts are all from Europe and North Africa, but we
    have no theoretical geographic limits.

2c. Languages

    We're principally interested in Latin and Greek for now; most work
    so far has been on Latin texts. One day we would like to add other
    languages, presumably those which make significant allusions to Latin
    and Greek literature.

2d. Project aims and research questions

    We study literary text reuse. Our aim is to devise computational
    methods for automatically detecting allusion, and in the process to
    learn more about how human writers create and readers understand
    allusions.

    Some specific research questions we're working on are:
        - Can allusion be defined quantitatively based on text
            similarity alone?
        - What characteristics of the text best predict readers' sense
            of the "strength" of the allusion?
        - What patterns emerge when large numbers of allusions are
            aggregated?

2e. Methodology

    We collect allusions and references to allusions both digitally
    and manually. Principal electronic sources currently are the Perseus
    Digital Library (http://www.perseus.tufts.edu) and The Latin Library
    (http://www.thelatinlibrary.com).

2f. Website

    http://tesserae.caset.buffalo.edu

4. Resources

    A Wiki sounds like a great idea. It's possible that a message board
    could fill the same function. At this stage we're most interested in
    sharing experiences, references, and general ideas.

    For example,
        - "This is how we deal with a certain issue in Perseus markup, how are
            others doing it?"
        - "I just saw an amazing talk, everyone should know about this person!"
        - "What is topic modelling and is anyone using it for text reuse?"


Thanks very much, Marco, for setting up this group; it's great to meet
everyone else and get a sense of the size of this field!

    Chris Forstall

Laurence Mellerin

unread,

Dec 22, 2011, 11:06:27 AM12/22/11

to Marco BÜCHLER, historical-...@googlegroups.com

Hello everybody,
To answer Marco's message, a few information about Biblindex project I'm
responsible for.
Biblindex is led by the French “Institut des Sources Chrétiennes” with
the support of two computer-science labs in France, the LIRIS and the
LIG, and two biblical institutes, the Peshitta Institute of Leiden
University and the “Institut für neutestamentliche Forschung” in
Münster. It aims at the constitution of an online index of biblical
references to be found in Christian and eventually Jewish literature, as
well as in Western and Eastern texts of Late Antiquity and the early
middle Ages.
> 2.A) *Time span*: We study texts from the first centuries to the
> Middle Ages (1st-14th century)
> 2.B) *Spatial limitations*: We have theorically no spatial limitation,
> but our texts come of course mainly from the Mediterranean area.
> 2.C) *Language(s)*: We're working on Latin and Greek texts at the
> moment, but will soon expand the project to Syriac texts, and later to
> Ethiopian, Arabic, Georgian ones. Modern languages in the interfaces
> are French and English.
> 2.D)*Project aims/research question*:Our first aim is to provide an
> index of references. At the time being, our index only gives
> references like his: 'this biblical book, chapter X, verse Y is quoted
> by Author, Title, p§, page X, line Y'. We already have 400.000
> references online and aim to provide one million references in 2 or 3
> years. But in a more experimental part of our project, we want to work
> directly on the texts themselves and tag the quotations and allusions
> in them. Our methodological issues are very close to those described
> in the Tesserae project. We are specially interested in the definition
> of the allusion as well.
> 2.E) *Methodology*: Do you collect traces of of text re-use manually

> (by hand) or what are your techniques?

We still collect these traces by hand but would like to develop a tool
to identify in a semi-automatical way the biblical quotations. We will
use XML-TEI encoding.
> 2.F) *Online references*: What is your project website?
Website : http://www.biblindex.org
Scientific blog : http://biblindex-en.hypotheses.org/?lang=en_GB

Thank you very much for creating this Google Groups. I also think Wiki
would be a great idea.
Best regards,
Laurence Mellerin

--
************************************************
Laurence MELLERIN
laurence...@mom.fr
CNRS "Histoire et sources des mondes antiques"(UMR 5189)
Institut des Sources Chrétiennes
22 rue Sala, F-69002 LYON
tél. + 33 (0)4 72 77 73 56 / + 33 (0)6 86 44 74 12
fax. + 33 (0)4 78 92 90 11
http://www.sources-chretiennes.mom.fr

BIBLINDEX, index scripturaire en ligne des Pères de l'Eglise
site : http://www.biblindex.org
carnet de recherche : http://biblindex.hypotheses.org/

Ute Pietruschka

unread,

Dec 27, 2011, 11:52:50 AM12/27/11

to historical-...@googlegroups.com

Dear colleagues,
here the description of our project,
best regards, and a happy New Year,
Ute Pietruschka

Dr. Ute Pietruschka
Corpus der arabischen und syrischen Gnomologien
http://casg.orientphil.uni-halle.de

> 1. Mutual Introduction
> -----------------------------
„Corpus of Arabic and Syriac Gnomologies“ (CASG)
Project Director: Ute Pietruschka (Oriental Institute, Martin-Luther-Universität Halle-Wittenberg, Germany)

Time span: 5th-13th cent.
Project aims are detection of archetypus, translation techniques between Greek, Arabic, and Syriac, lines of transmissions of Greek philosophical sayings

Spatial limitations: texts from the Mediterranean
Languages: Arabic, Syriac, Greek

Project aims:
Project aims at the constitution of an online database comprising philosophical and wisdom sayings
in Syriac and Arabic, their translation into a modern language (German, English or French) and providing parallels in Greek
These sayings are not only interesting for orientalists, but also for classicists, because sometimes variants are preserved in Arabic which are otherwise lost in Greek. The Arabic sayings can therefore be used for reconstruction of older material. The Greek sayings, translated into Syriac and Arabic, were re-used in Christian monastic collections or in Arabic (so-called) adab-works, i.e. literature of an entertaining character. During the process of re-use, form and content of the sayings were modified according to the religious and cultural environments.

Methodology: still manual collection of the sayings, with help of digital libraries, e.g. Menalib (University of Halle)

Project website: http://casg.orientphil.uni-halle.de

This Google group is very helpful for exchange and sharing of ideas, also Wiki would be o.k.

ute.pietruschka.vcf

Reply all

Reply to author

Forward