use cases for a suite of Web apps using eltk

5 views

Skip to first unread message

scott farrar

unread,

Aug 26, 2009, 1:14:58 PM8/26/09

to el...@googlegroups.com

Nothing like a little vapor-ware planning, reactions welcome. Think Django+extjs+ELTK. Here are some topics listed below:

1. Design issues
2. Common themes
3. Use cases

1. Design issues
=========

Data sets live in the cloud, both as RDF files (linked data) hosted remotely (perhaps on the user's site) but also by the server(s) in DBs for access efficiency.

Include a strong Web services component, though this is perhaps orthogonal to the actual Web apps.

2. Common themes
=========

A common theme across all use cases is the ability to hover, click-on and drag/drop GOLD and COPE cataegories.

A common theme is to diplay citation and data provenance (only for legacy data) e.g., like in ODIN where the origainl PDF is always retrievalbe.

Pull down menus should be used for categories w. many members.

3. Use Cases
========

Use Case (Creation):
-------------
Various sorts of linguistics data can be created using an editor. Something like Google Docs where multiple users can edit the same data. This should have an upload function that uses ELTK's readers, e.g., for uploading XML LIFT, csv files for sign/word lists. Users should be able to create simple text files off-line and then upload when they have a net connection.

Fundamental data types should be enumerated and editied in a common way: lexicon, paradigm, wordlist, phonetic inventory, IGT, termset, COPE, etc.

Use Case (Visualization):
---------------

Data can be viewed along a number dimensions. E.g., a lexicon can be viewed according to various sorting algorithms (by stem, by root, by feature); IGT can be viewed as a lexicon.

Data sets can be viewed for basic stats: termsets used, theory used, number of indiv. datums contain in set,

A list of what docs the user owns should be displayed (like Google Doc's main menu)

Use Case (Search/query):
---------------

Search should not only be according to string, but according to (1) concept, (2) term used for a particular concept, (3) example (show me more stuff like X .

Use Case (Manipulation):
---------------

Users are able to process data, like sorting, changing annotation terminology (e.g., global change 'PST' for 'past'), and validation (see next use case).

Use Case (Validation):
--------------

Users can check to see if uploaded files are well-formed syntactically, but also if data are semantically valid (according to ontology). So, for instance, highlight as potential error if ACC and NOM are marked on the same morpheme?

Validation includes spell checking of glosses.

W.P. McNeill

unread,

Aug 26, 2009, 3:57:56 PM8/26/09

to el...@googlegroups.com

One way to come up with usage scenarios is to simply go over papers we have all written or are writing and ask how we'd go about creating GOLD-enhanced supporting online materials (SOM).

For example, for a paper I'm currently working on I'd want to publish the following online:

A list of IPA symbols and their corresponding phonetic features. Perhaps multiple different feature sets for different languages.
A list of words, both in English and in Turkish.
For each word, a list of corresponding semantic/grammatical meanings: e.g. LEMMA = jump, ASPECT = imperfect, etc.
A list of morphemes derived from the words with similar associated meanings.

I am representing these meanings as feature/value matrixes. I am also, a la HPSG's analysis on English, collapsing the person and number agreement features into a single PERNUM feature that takes on the values 3sg and non-3sg.

Some of this is easy to see how to map to GOLD. I think Steve has already been working on Part 1. Word lists is something I'm currently working on. Here are some tricky parts:

Does the notion of a feature/value matrix get embodied in GOLD? (My gut feeling: no.) If not, how do I represent it in my SOM?
How do I handle the (relatively) idiosyncratic analysis of PERNUM. Do I need to build a Community of Practice Extension (COPE) that defines 3sg and non-3sg. If so, how do I do that?

--
W.P. McNeill
http://staff.washington.edu/billmcn/index.shtml
Sent from Seattle, WA, United States

Reply all

Reply to author

Forward

0 new messages