Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

Artificial Intelligence FAQ: AI Web Directories & Online Papers 5/6 [Monthly posting]

0 views
Skip to first unread message

Ric Crabbe and Amit Dubey

unread,
Nov 2, 2001, 2:08:03 PM11/2/01
to
Archive-name: ai-faq/general/part5
Posting-Frequency: monthly
Version: 2.1
Maintainer: Ric Crabbe <cra...@usna.edu> and Amit Dubey <adu...@netscape.net>
URL: http://www.faqs.org/faqs/ai-faq/general/part5/preamble.html
Size: 46277 bytes, 1114 lines

;;; ****************************************************************
;;; Answers to Questions about Artificial Intelligence *************
;;; ****************************************************************
;;; Written by Amit Dubey, Ric Crabbe, and Mark Kantrowitz
;;; ai_5.faq

If you think of questions that are appropriate for this FAQ, or would
like to improve an answer, please send email to the maintainers.

Parts 5 and 6 of the FAQ are now under heavy construction. The FTP & WWW
resources have been combined, since both are browser accessible these
days. We're also pruning the entries to sites that include
information other than whatever project is being done at that
University, etc.

Part 5 (WWW & FTP Resources):
[5-0] Research Index (nee Citeceer)
[5-1] Weblogs, repositories, web directories and communities not
aimed primarily at researchers
[5-2] Repositories and web directories aimed primarily at researchers
[5-3] AI Bibliographies available by FTP and WWW
[5-4] Technical Reports available by FTP and WWW
[5-5] Technical Reports for/by undergraduate students
[5-6] Where can I get a machine readable dictionary, thesaurus, and
other text corpora?
[5-7] Where can I get training sets for machine learning algorithms?
[5-8] What on-line journals are there?

Search for [#] to get to question number # quickly.

----------------------------------------------------------------
Subject: [5-0] Research Index (nee Citeceer)

http://citeseer.nj.nec.com/cs

"ResearchIndex is a scientific literature digital library that aims to
improve the dissemination and feedback of scientific literature, and
to provide improvements in functionality, usability, availability,
cost, comprehensiveness, efficiency, and timeliness.

Rather than creating just another digital library, ResearchIndex
provides algorithms, techniques, and software that can be used in
other digital libraries. ResearchIndex indexes Postscript and PDF
research articles on the Web..."

http://citeseer.nj.nec.com/cs

Research index (though most people still call it citeseer) is the
currently best way to find published papers AND TRACK CITATIONS. It
is so cool, it's in the FAQ twice.

----------------------------------------------------------------
Subject: [5-1] Repositories and directories not aimed primarily at researchers

AI Topics:
http://www.aaai.org/Pathfinder/html/welcome.html
Presented by AAAI, AI Topics is a "...web site provided ... for
students, teachers, journalists, and everyone who would like to
learn more about what artificial intelligence is, and what AI
scientists do.

[Their] goal is to offer a limited number of authoritative,
non-technical resources that [they] have organized and annotated
to provide you with meaningful access to basic information about
the AI universe. Each of the AI Topics (see the navigation buttons
to the left) will lead you to online and in-print sources of
information.

There has been an explosion in the number of Websites that catalog
locations of AI information in a Yahoo-style directory. Although they
often duplicate functionality, in the interest of fairness, I will list all
the ones I know about here.

About.com:
http://ai.about.com

Kurzweilai.Net:
http://www.kurzweilai.net
An entertaining mix of flashy demos and interesting essays on AI,
many by important poeple in the field. Much philosophical
discussion, less hard-core technical discussion.

Neuron AI Directory
http://www.neuron.co.uk

Neural Network Information in Polish:
http://www.neuron.of.pl

Generation 5:
http://www.generation5.org

"Generation5 is aimed at presenting a website that will educate the
viewer on Artificial Intelligence -- whatever the level of
expertise. We have essays on the applications and history of AI for
those unfamiliar, to essays on programming and philosophy, all the way
to full blown mathematically-orientated essays on genetic algorithms
and neural networks. Generation5 prides itself also in its interviews
sections with exclusive interviews from top AI scientists like Marvin
Minsky, Craig Reynolds, Roger Schank, Andre LaMothe and many
others. Generation5 also has a comprehensive collection of original
programs, all with source included. Demonstration programs like image
recognizors, number recognizors, cellular automata creators, NLP
demonstrator and more. All programs have an accompanying essay
describing the workings of the programs.

The aim of Generation5 is not only to educate the viewer, but to allow
the viewer to contribute to further other people's knowledge. They can
do this through the discussion boards, voting systems, and soon
through AI Solutions (a scheme to submit code - with a accompanying
monthly competition)."


Yahoo Clubs:
Yahoo maintains a number of AI clubs. There is the general
AI Group, an online community that discusses AI (a resource
for beginners). Their website is:

http://clubs.yahoo.com/clubs/artificialintelligencegroup

There is also a resource for amateur robot enthusiasts at:

http://clubs.yahoo.com/clubs/theroboticsclub

Hubat:
http://www.hubat.com/servlets/search?cmd=b&db=hubat&cat=3.1&st=0
Hubat automatically generates subject hierarchies of links.
Interesting because it had to take some AI to build the engine.

Pentomino Site:
http://home.planetinternet.be/~odettedm
Student run site at T.I.D. Ronse Belgium on searching Pentomino
spaces.

----------------------------------------------------------------
Subject: [5-2] Repositories aimed primarily for researchers


CMU AI Repository:

The CMU Artificial Intelligence Repository was established by
Carnegie Mellon University to contain public domain and freely
distributable software, publications, and other materials of
interest to AI researchers, educators, students, and practitioners.
The AI Repository currently contains more than a gigabyte of
material and is growing steadily.

The AI Repository is accessible from:

http://www.cs.cmu.edu/Web/Groups/AI/html/repository.html


Artificial Life Online and the Artificial Life BBS:

Sponsored by MIT Press and the Santa Fe Institute, Artificial Life
Online and the Artificial Life BBS is intended to be a central
information collection and distribution site on the Internet for any
and all aspects of the Artificial Life endeavor.

A special feature of the BBS is a collection of 40 or so local
newsgroups dedicated to a wide variety of topics in Artificial Life.

Artificial Life Online is accessible by World-Wide Web from

http://alife.santafe.edu/

Case based reasoning:
http://www.ai-cbr.org/
ai-cbr aims to provide a comprehensive information base to
Case-Based Reasoning academics and commercial developers. Through
the dissemination of information it is hoped a stronger world-wide
community of people interested in Case-Based Reasoning will be
fostered and the commercial use of Case-Based Reasoning will
increase. (added to the FAQ 2/2/00)

http://www.aic.nrl.navy.mil/~aha/research/case-based-reasoning.html
A very complete list of resources including tutorials (added to the
FAQ 2/2/00)

Consortium for Lexical Research:

clr.nmsu.edu:/CLR/ [128.123.1.12]

Archive containing a variety of programs and data files related to
natural language processing research, with a particular focus on
lexical research. The file 00README.clr.site is a good place to start.
See the file catalog or catalog.ps for a listing of the contents of
the archive. Long descriptions are in the info/ subdirectory.
Materials for paid-up members of the Consortium are in the
members-only/ subdirectory. Public materials include the Alvey Natural
Language Tools, Sowa's Conceptual Graph parser implemented in YACC by
Maurice Pagnucco, a morphological parsing lexicon of English, a
phonological rule compiler for PC-KIMMO, C source code for the NIST
SGML parser, PC-KIMMO sources, the 1911 Roget Thesaurus, and a variety
of word lists (including English, Dutch, and male/female/last names).
Comments and questions may be directed to lex...@nmsu.edu.

There are also some materials in clr.nmsu.edu:/pub/ unrelated to
the archive.

Fuzzy Logic Repositories:

ntia.its.bldrdoc.gov:/pub/fuzzy/ [132.163.64.201] contains information
concerning fuzzy logic, including bibliographies (bib/), product
descriptions and demo versions (com/), machine readable published
papers (lit/), miscellaneous information, documents and reports (txt/),
and programs, code and compilers (prog/). You may download new items
into the new/ subdirectory. If you deposit anything in new/, please
inform fu...@its.bldrdoc.gov. The repository is maintained by
Timothy Butler, t...@its.bldrdoc.gov.

Genetic Algorithms:

The Genetic Algorithms Repository is accessible is also a WWW version at

http://www.aic.nrl.navy.mil/galist/

The information files includes Nici Schraudolph's survey of free and
commercial GA software (send email to <schr...@cs.ucsd.edu> to add to
the list).

The software includes GAC (a simple GA written in C), GAL (a simple GA
written in Common Lisp), GAucsd, GECO (a Common Lisp toolbox for
constructing genetic algorithms), GENESIS, GENOCOP, Paragenesis (a
parallel version of GENESIS that runs on the CM-200), SGA-C (a C
implementation/extension of Goldberg's SGA system).

Intelliwise:

Sergio Navega maintains a large collection of AI links:

http://www.intelliwise.com/links.htm

Funic Neural Nets Archive Site:

The Finnish University maintains an archive site containing a large
collection of neural network papers and public domain software.
The files are available through the web interface at
http://www.nic.funet.fi/pub/sci/neural or through FTP from
ftp://funic.funet.fi:/pub/sci/neural. FTP users: see the file 01README
for details. There's also a directory for non-neural net AI stuff
in the directory /pub/sci/ai. (Web service is still experimental as
of 05/29/99).

There is a list of mirrored ftp sites is in 04Neural_FTP_Sites. For
further information, contact neura...@funic.funet.fi or Marko
Gronroos <ma...@funic.funet.fi> (or <ma...@utu.fi>).

OSU Neuroprose:

archive.cis.ohio-state.edu:/pub/neuroprose/ [128.146.8.52]

This directory contains technical reports, mostly from the early 90's, as a
public service to the connectionist and neural network scientific community
which has an organized mailing list (for info:
connectioni...@cs.cmu.edu)

NL Software Registry:
[maintainer's note: links upto this point haven't been checked]

The Natural Language Software Registry is a catalogue of software
implementing core natural language processing techniques, whether
available on a commercial or noncommercial basis. Some of the topics
listed include speech signal processing, morphological analysis,
parsers, natural language generation systems, and knowledge
representation systems. The second edition of the catalog contains
more than 100 descriptions of natural language processing software.
The catalogue is available from the German Research Institute for
Artificial Intelligence (DFKI) in Saarbruecken (Germany) at the URL

http://www.dfki.de/lt/registry

The email contact for the site is lt-...@dfki.de

Essex ROBOTS Archive:
Contains robotics related information, hasn't been updated since 1995 or so:

ftp.essex.ac.uk:/pub/robots/

AI IN DESIGN WEBLIOGRAPHY
http://www.cs.wpi.edu/Research/aidg/AIinD-hotlist.html
These web pages contain links to pretty much everything
concerned with the application of AI to Design.

Miscellaneous AI:

Some miscellaneous AI programs may be found on ftp.uu.net:/pub/ai/
Most are mirrors of programs available at other sites.

AI_ATTIC is an anonymous ftp collection of classic AI programs and
other information maintained by the University of Texas at Austin. It
includes Parry, Adventure, Shrdlu, Doctor, Eliza, Animals, Trek, Zork,
Babbler, Jive, and some AI-related programming languages. This
archive is available by anonymous ftp from ftp.cc.utexas.edu
in the directory /pub/AI_ATTIC. For more information, contact
attic...@bongo.cc.utexas.edu.

The QWERTZ toolbox, a library of Standard ML modules with an emphasis
on symbolic Artificial Intelligence programming, (including
implementations of heuristic search and an ATMS reason maintenance
system) may be obtained by anonymous ftp from

ftp.gmd.de:/gmd/ai-research/Software/qwertz.tar.gz

For more information, write to Tom Gordon <thomas...@gmd.de>.

----------------------------------------------------------------
Subject: [5-3] AI Bibliographies available by FTP

General:

There are many recent papers at:

http://www.cora.whizbang.com

You can both browse and search; the searching ranks papers based on
how often they have been referenced.


Fuzzy Logic:

A BibTeX database of references addressing neuro-fuzzy issues can be
obtained by anonymous ftp from

ftp.tu-bs.de:/local/papers/ [134.169.34.15]

as the (ascii) file fuzzy-nn.bib.

Genetic Algorithms:

http://www.cogs.susx.ac.uk/users/ezequiel/alife-page/alife.html

Logic Programming, Constraints:

A BibTeX bibliography for Constraint Logic Programming is available
by anonymous ftp from

archive.cis.ohio-state.edu:/pub/clp/

in the bib/ and papers/ subdirectories.

NLP/CL:

For information on a fairly complete bibliography of computational
linguistics and natural language processing work from the 1980s, send
mail to cl...@csli.stanford.edu with the subject HELP.

The CSLI linguistics bibliography contains 3,300 entries in
bib/tib/refer format. The bibliography is heavily slanted towards
phonetics and phonology but also includes a fair amount of
computational morphology, syntax, semantics, and psycholinguistics.
The bibliography can be used with James Alexander's tib
bibliography system, which is available from minos.inria.fr
[128.93.39.5] among other places. The bibliography itself is available
by anonymous ftp from

csli.stanford.edu:/pub/bibliography/

Contributions are welcome, but should be in tib format.
For more information, contact Andras Kornai <kor...@csli.stanford.edu>

NLG:

Robert Dale's Natural Language Generation (NLG) bibliography is
available by anonymous ftp from

scott.cogsci.ed.ac.uk:/pub/nlg/ [129.215.144.3]

Note that it is formatted for A4 paper. Stick in a line
.94 .94 scale
after the %! line to print on 8.5 x 11 paper. For further information,
write to Robert Dale, University of Edinburgh, Centre for Cognitive
Science, 2 Buccleuch Place, Edinburgh EH8 9LW Scotland, or
<R.D...@edinburgh.ac.uk> or <rd...@microsoft.com>.

Mark Kantrowitz's Natural Language Generation (NLG) bibliography is
available by anonymous ftp from

ftp.cs.cmu.edu:/user/ai/areas/nlp/nlg/bib/mk/ [128.2.206.173]

In addition to the tech report, the BibTeX file containing the
bibliography is also available. The bibliography contains more than
1,200 entries. A searchable index to the bibliography is
available via the URL

http://liinwww.ira.uka.de/bibliography/Ai/nlg.html

Additions and corrections should be sent to mk...@cs.cmu.edu.

Neural Nets, Learning:

A bibliography of over 1000 entries about Self-Organizing Map
(SOM) and Learning vector Quantization (LVQ) studies is
available by anonymous ftp from

cochlea.hut.fi:/pub/ref/

as the files references.bib.Z (BibTeX file) and references.ps.Z
(PostScript file). Please send additions and corrections to
bib...@cochlea.hut.fi.

An extensive collection of references on Principal Component Analysis
(PCA) neural networks and learning algorithms is available by
anonymous ftp from dendrite.hut.fi:/pub/ref/ in LaTeX and PostScript
formats. The list was compiled by Liu-Yue Wang, a graduate student of
Erkki Oja, and updated by Juha Karhunen, all from Helsinki University
of Technology, Finland. For more information, contact Erkki Oja
<o...@dendrite.hut.fi>.

A bibliography of PCA algorithms is available by anonymous ftp from
ftp.ai.mit.edu:/pub/sanger-papers/ as pca.bib. For more information,
contact Terry Sanger <t...@ai.mit.edu>.

A 36-page bibliography of connectionist models with symbolic
processing is available by anonymous ftp from Neuroprose

archive.cis.ohio-state.edu:/pub/neuroprose/ [128.146.8.52]

as the file sun.nn-sp-bib.ps.Z. For more information, contact
Ron Sun <rs...@athos.cs.ua.edu>.

Nonmonotonic Logic, Belief Revision:

A bibliography on belief revision and nonmonotonic logics with
about 2,000 items is available by anonymous ftp from

tarski.phil.indiana.edu:/pub/morado/ [129.79.134.34]

as nonmono.bib or nonmono.bib.Z. The file is also available by WAIS as

wais://tarski.phil.indiana.edu/nonmono.bib?

and by gopher/WWW. Please send additions and corrections to Raymundo
Morado <mor...@phil.indiana.edu>.

Speech:

A bibliography of papers on Silicon Auditory Models (VLSI
implementations of auditory representations) is available by anonymous
ftp from

hobiecat.pcmp.caltech.edu:/pub/anaprose/lazzaro/sa-biblio.ps.Z

For more information, write to John Lazzaro <laz...@boom.cs.berkeley.edu>

Multi-agent Systems

http://dis.cs.umass.edu/research/agents-learn.html

----------------------------------------------------------------
Subject: [5-4] Technical Reports available by WWW/FTP

This section lists the anonymous ftp sites for technical reports from
several universities and other organizations. Some of the sites
provide only an online catalog of technical reports, while the rest
make the actual reports available online. The email address listed is
that of the appropriate person to contact with questions about
ordering technical reports.

The main source of tech reports is now from Networked Computer Science
Technical Reference Library or NCSTRL (pronounced "ancestral").
It's home page is: http://www.ncstrl.org/
If that is a problem, you can go directly to:
http://cs-tr.cs.cornell.edu/


Other general locations for technical reports from several
universities include:

wuarchive.wustl.edu:/doc/techreports/ [128.252.135.4]
cs-archive.uwaterloo.ca:/cs-archive/ (see Index for an index)
AKA watdragon.uwaterloo.ca [129.97.140.24]

The uwaterloo archive includes tech reports from the Logic Programming
and Artificial Intelligence Group (LPAIG) of the University of Waterloo.

There is also a WAIS server containing tech report abstracts that can be
searched. To use, create the file ~/wais-sources/cs-techreport-abstracts.src
containing
(:source
:version 3
:ip-address "130.194.74.201"
:ip-name "daneel.rdt.monash.edu.au"
:tcp-port 210
:database-name "cs-techreport-abstracts"
:cost 0.00
:cost-unit :free
:maintainer "wa...@daneel.rdt.monash.edu.au")
and invoke your local wais client. To add to it, email abstracts of
your papers to wa...@rdt.monash.edu.au in the following format:
%TI Title
%AU Author (use multiple %AU lines for multiple authors)
%PU Published In (citation information)
%AV Availability (e.g., ftp reports.adm.cs.cmu.edu:/1992/CMU-CS-92-101.ps)
%OR Organization (see cs-techreport-archives.src for institution codes)
%LT Local title (e.g., tech report number)
%DA Date (and, if you want, %MN Month, %YR Year)
%AB Abstract
If your papers are not available by FTP, you can use a %AV line such as:
%AV mail harry...@cs.cmu.edu
Further instructions are available from
daneel.rdt.monash.edu.au:/pub/techreports/reports/README
[Based on a post by Ashwin Ram.]

Also see the Unified Computer Science Technical Report Index
http://cs.indiana.edu/cstr/search
[this archive appears to be out of date -ed]

A list of FTP sites for technical reports and papers can be found in
http://www.rdt.monash.edu.au/tr/siteslist.html

A list of more than 230 sites publishing CS tech reports may be
obtained by anonymous ftp from

ftp.rdt.monash.edu.au:/pub/techreports/sites/sites-list-data

To receive notification of new tech report sites, send mail to
compdoc-techr...@ftp.cse.ucsc.edu to join the mailing list.

An archive of linguistics papers and preprints is available from
linguistics.archive.umich.edu:/linguistics/papers/. Contact John Lawler
(jla...@umich.edu) or linguistic...@umich.edu for more
information.

The Concurrent Engineering Research Center (CERC) at West Virginia
University has placed ASCII versions of the concurrent
engineering-related abstracts (over 500) that were on CERCnet, ASCII
back issues of the Concurrent Engineering Research in Review journal
(now discontinued), and Postscript copies of CERC technical reports in
the gopher server gopher.cerc.wvu.edu. In addition, many of the CERC
technical reports, including journal articles, symposium papers,
theses, dissertations, and issues of the Concurrent Engineering
Research in Review journal, are available as Postscript versions via
anonymous ftp from

babcock.cerc.wvu.edu:/pub/techReports/ [157.182.44.36]

An index to all the reports, including some that are
available only in hardcopy, is contained in the file "CERC-TR-INDEX".
If you need additional information, contact Mary Carriger, CERC Office
of Information Services, at carr...@cerc.wvu.edu.

The newsgroup comp.doc.techreports is devoted to distributing lists of
tech reports and their abstracts.

MIT Artificial Intelligence Laboratory:
ftp -- publications.ai.mit.edu:/ai-publications/
email -- public...@ai.mit.edu
www -- http://www.ai.mit.edu/research/publications/publications.shtml

A full catalog of MIT AI Lab technical reports (and a listing of recent
updates) may be obtained from the above location, by writing to
Publications, Room NE43-818, M.I.T. Artificial Intelligence Laboratory,
545 Technology Square, Cambridge, MA 02139, USA, or by calling
1-617-253-6773. The catalog lists the technical reports ("AI Memos")
with a short abstract and their current prices. There is also a charge
for shipping. Some recent tech reports (since 1991) are available in the
ai-publications/ subdirectory; older technical reports are NOT
available by ftp. A bibliography is in the bibliography/ directory.

CMU School of Computer Science:
ftp -- reports.adm.cs.cmu.edu
email -- Technica...@cs.cmu.edu
www -- reports-archive.adm.cs.cmu.edu/cs.html

CMU Software Engineering Institute:
ftp -- ftp.sei.cmu.edu:/pub/documents
email -- b...@sei.cmu.edu
www -- www.sei.cmu.edu/publications/publication.html

Yale:
ftp -- dept.cs.yale.edu:/pub/TR/

University of Washington CSE Tech Reports:
ftp -- june.cs.washington.edu:/tr
email -- tr-re...@cs.washington.edu

================

AT&T Bell Laboratories:
ftp -- netlib.att.com:/netlib/research/cstr/
bib.Z contains short bibliography, including all the technical
reports contained in this directory.

ftp -- research.att.com:/dist/ai

[Maintainer's note: I assume these have been moved over to Lucent's
domain?]

Argonne National Laboratory:
ftp -- anagram.mcs.anl.gov:/pub/tech_reports
email -- wri...@mcs.anl.gov

Contains MCS Division preprints and technical memoranda,
available as either .dvi or .ps files. For descriptions of the
contents, see the subdirectory pub/tech_reports/abstracts; for
the files themselves see the subdirectory pub/tech_reports/reports.

Boston University:
ftp -- cs.bu.edu:/techreports/
email -- techr...@cs.bu.edu

Brown University:
ftp -- wilma.cs.brown.edu:/techreports/
email -- techr...@cs.brown.edu

Cambridge University: Speech, Vision & Robotics Group
ftp -- svr-ftp.eng.cam.ac.uk:/reports/

Columbia University:
ftp -- cs.columbia.edu:/pub/reports
email -- tech-r...@cs.columbia.edu

DEC Cambridge Research Lab:
ftp -- crl.dec.com:/pub/DEC/CRL/abstracts/
crl.dec.com:/pub/DEC/CRL/tech-reports/

DEC Paris Research Lab:
email -- doc-s...@prl.dec.com
Put commands in Subject: line of the message.
To get a list of articles, use
send index articles
To get a list of tech reports, use
send index reports

DEC WRL:
email -- wrl-tec...@decwrl.dec.com
To get a helpfile, send a message with
help
in the subject line.

DFKI:
ftp -- duck.dfki.uni-sb.de:/pub/papers
email -- Martin Henz (he...@dfki.uni-sb.de)

Duke University:
ftp -- cs.duke.edu:/dist/papers/
cs.duke.edu:/dist/theses/
email -- techr...@cs.duke.edu [unknown user, 7/7/93]

Edinburgh:
A list of available reports can be sent via email. Send requests
for information about reports from the Center for Cognitive Science
to cogsci%ed.a...@nsfnet-relay.ac.uk, and from the Human Communication
Research Center to HCRC%ed.a...@nsfnet-relay.ac.uk.

Electrotechnical Laboratory, Japan:
Reports from the Cooperative Architecture project (half AI, half
software engineering).
ftp -- etlport.etl.go.jp:/pub/kyocho/Papers [192.31.197.99]
See file Index.English.
email -- Hideyuki Nakashima <naka...@etl.go.jp>.

Georgia Tech College of Computing, AI Group:
ftp -- ftp.cc.gatech.edu:/pub/ai (130.207.3.245)
email -- Professor Ashwin Ram <ash...@cc.gatech.edu>

HCRC (Human Communication Research Centre):
ftp -- scott.cogsci.ed.ac.uk:/pub/HCRC-papers/
mail -- Fiona-Anne Malcolm
Human Communication Research Centre
2 Buccleuch Place, Edinburgh, UK

Illinois:
email -- Erna Amerman <er...@uiuc.edu>

Illinois Genetic Algorithms Laboratory (IlliGAL):
email -- Eric Thompson <lib...@gal1.ge.uiuc.edu>
phone -- 217-333-2346 (9AM to 5PM CT, M-F)
mail -- Illinois Genetic Algorithms Laboratory
Department of General Engineering
117 Transportation Building
104 South Mathews Avenue
Urbana, IL 61801-2996
ftp -- gal4.ge.uiuc.edu:/pub/papers/IlliGALs/
Includes the GA bibliography and the Messy GA code in C
(in /pub/src/) and preprints (in /pub/papers/Publications)
www -- http://gal4.ge.uiuc.edu/illigal.home.html

Indiana:
ftp -- cogsci.indiana.edu:/pub [129.79.238.12]
ftp -- ftp.cs.indiana.edu:/pub/techreports [129.79.254.191]

INRIA, France:
ftp -- ftp.inria.fr:/INRIA/publication/

Institute for Learning Sciences at Northwestern University:
ftp -- aristotle.ils.nwu.edu:/pub/papers/
phone -- 708-491-3500

Mechanized Reasoning Group (MRG):
ftp -- ftp.mrg.dist.unige.it:/pub/mrg-ftp
email -- Fausto Giunchiglia <fau...@irst.it>
Mechanized Reasoning Group, IRST
38050 Povo Trento, Italy
Tel: +39 461-314444 (secr.)
+39 461-314436 (office)
Fax: +39 461-302040 / 314591

National University of Singapore:
ftp -- ftp.nus.sg:/pub/NUS/ISCS/techreports

New York University (NYU):
ftp -- cs.nyu.edu:/pub/tech-reports

OGI:
ftp -- cse.ogi.edu:/pub/tech-reports
email -- cse...@cse.ogi.edu

Ohio State University, Laboratory for AI Research
ftp -- nervous.cis.ohio-state.edu:/pub/papers
email -- lair-li...@cis.ohio-state.edu

OSU Neuroprose:
ftp -- archive.cis.ohio-state.edu:/pub/neuroprose (128.146.8.52)

This directory contains technical reports as a public service to the
connectionist and neural network scientific community which has an
organized mailing list (for info: connectioni...@cs.cmu.edu)
Includes several bibliographies.

Stanford:
ftp -- elib.stanford.edu:/cs

Very spotty collection.

SRI:
email -- Donna O'Neal, do...@ai.sri.com

SUNY Buffalo:
ftp -- ftp.cs.buffalo.edu:/pub/tech-reports/

SUNY at Stony Brook:
ftp -- sbcs.sunysb.edu:/pub/TechReports
email -- ri...@cs.sunysb.edu or st...@cs.sunysb.edu

The /pub/sunysb directory contains the SB-Prolog implementation
of the Prolog language. Contact war...@sbcs.sunysb.edu for more
information.

TCGA (The Clearinghouse for Genetic Algorithms):
email -- Robert Elliott Smith <r...@comec4.mh.ua.edu>
Department of Engineering of Mechanics
Room 210 Hardaway Hall
The University of Alabama
PO Box 870278
Tuscaloosa, AL 35487
205-348-1618, fax 205-348-6419

Thinking Machines:
ftp -- ftp.think.com:/think/techreport.list

This file contains a list of Thinking Machines technical reports.
Orders may be placed by email (limit 5) to t-...@think.com, or by US
Mail to Thinking Machines Corporation, Attn: Technical reports, 245
First Street, Cambridge, MA 01241. In addition, the directories
cm/starlisp and cm/starlogo contain code for the *Lisp and *Logo
simulators.

Tulane University:
ftp -- rex.cs.tulane.edu:/pub/tech/ [129.81.132.1]

University of Alabama:
ftp -- aramis.cs.ua.edu:/pub/tech-reports/

University of Arizona:
ftp -- cs.arizona.edu:/reports/
email -- tr_...@cs.arizona.edu

The directory /japan/kahaner.reports contains reports on AI in
Japan, among other things, written by Dr. David Kahaner, a
numerical analyst on sabbatical to the Office of Naval
Research-Asia (ONR Asia) in Tokyo from NIST. The reports are not
written in any sort of official capacity, but are quite interesting.

University of California/Los Angeles:
ftp -- ftp.cs.ucla.edu:/tech-report/

University of California/Santa Cruz:
ftp -- ftp.cse.ucsc.edu:/pub/bib/
ftp.cse.ucsc.edu:/pub/tr/
email -- je...@cs.ucsc.edu

University of Cambridge Computer Lab:
email -- tech-r...@cl.cam.ac.uk

University of Colorado:
ftp -- ftp.cs.colorado.edu:/pub/cs/techreports

University of Florida:
ftp -- bikini.cis.ufl.edu:/cis/tech-reports

University of Genoa, Mechanized Reasoning Group:
ftp -- ftp.mrg.dist.unige.it:/pub/mrg-ftp/
email -- Fausto Giunchiglia <fau...@irst.it>

University of Georgia:
ftp -- ai.uga.edu:/pub/ai.reports/

University of Illinois at Urbana:
ftp -- a.cs.uiuc.edu:/pub/dcs
email -- e-am...@a.cs.uiuc.edu

University of Indiana, Center for Research on Concepts and Cognition:
ftp -- cogsci.indiana.edu:/pub/
email -- he...@cogsci.indiana.edu

University of Kaiserslautern, Germany:
ftp -- ftp.uni-kl.de:/reports_uni-kl/computer_science/

University of Kentucky:
ftp -- ftp.ms.uky.edu:/pub/tech-reports/UK/cs/

University of Massachusetts at Amherst:
email -- tech...@cs.umass.edu

University of Melbourne, Australia,
Computer Vision and Pattern Recognition Laboratory (CVPRL):
ftp -- krang.vis.mu.oz.au:/pub/articles

University of Michigan:
ftp -- ftp.eecs.umich.edu:/techreports

University of North Carolina:
ftp -- ftp.cs.unc.edu:/pub/technical-reports/

University of Pennsylvania:
ftp -- ftp.cis.upenn.edu:/pub/papers/
email -- public...@upenn.edu [email bounced 7/7/93]

USC/Information Sciences Institute:
email -- Sheila Coyazo <sco...@isi.edu> is the contact. [email
bounced 7/7/93]

University of Toronto:
ftp -- ftp.cs.toronto.edu:/pub/cogrob/ (Cognitive Robotics)
ftp.cs.toronto.edu:/pub/reports/
email -- tech-r...@cs.toronto.edu

University of Virginia:
ftp -- uvacs.cs.virginia.edu:/pub/techreports/cs

University of Western Australia:
ftp -- ciips.ee.uwa.edu.au
Centre for Intelligent Information Processing Systems (CIIPS)
EE Engineering Department

University of Wisconsin:
ftp -- ftp.cs.wisc.edu:/tech-reports
ftp.cs.wisc.edu:/machine-learning
ftp.cs.wisc.edu:/computer-vision
email -- tech-repor...@cs.wisc.edu


Some AI authors have set up repositories of their own papers:

Matthew Ginsberg: t.stanford.edu:/u/ftp/papers

----------------------------------------------------------------
Subject: [5-5] Technical resources for/by undergraduate students

Brainsciences http://www.brainsciences.com

A group of students at Brown University have created a web site to
"provide a forum for undergraduates to publish their work. We feature
reports of original research, book reviews, term papers, and other work
in a similar vein."


----------------------------------------------------------------
Subject: [5-6] Where can I get a machine readable dictionary, thesaurus, and
other text corpora?

Linguistic Data Consortium:

The Linguistic Data Consortium was established to broaden the
collection and distribution of speech and natural language data
bases for the purposes of research and technology development in
automatic speech recognition, natural language processing, and
other areas where large amounts of linguistic data are needed. LDC
corpora are the most commonly used in published research.
Information about the LDC is at http://www.ldc.upenn.edu/

Free:

On unix systems, /usr/dict/words is a fine word list.

Roget's 1911 Thesaurus is available by anonymous FTP from the
Consortium for Lexical Research

clr.nmsu.edu:/CLR/lexica/roget-1911 [128.123.1.12]

It is also available from

src.doc.ic.ac.uk:/literary/collections/project_gutenberg/roget11.txt.Z

An old Webster's dictionary is in /text/dict/{DICT.Z,DICT.INDEX.Z}.
Project Gutenberg also has Roget's 1911 Thesaurus. The Project
Gutenberg archive is at mrcnext.cso.uiuc.edu:/pub/etext/. The
Project Gutenberg archive collects public domain electronic books. For more
information, write to Michael S. Hart, Professor of Electronic Text,
Executive Director of Project Gutenberg Etext, Illinois Benedictine
College, 5700 College Road, Lisle, IL 60532 or send email to
ha...@vmd.cso.uiuc.edu.

For people without FTP, Austin Code Works sells floppy disks
containing Roget's 1911 Thesaurus for $40.00. This money helps support
the production of other useful texts, such as the 1913 Webster's dictionary.

The Online Book Initiative maintains a text repository on
ftp.std.com (a public access UNIX system, 617-739-WRLD). See the
README file on obi.std.com:/obi/. For more information, send email to
o...@world.std.com, write to Software Tool & Die, 1330 Beacon Street,
Brookline, MA 02146, or call 617-739-0202.

The CHILDES project at Carnegie Mellon University has a lot of data of
children speaking to adults, as well as the adult written and adult
spoken corpora from the CORNELL project. Contact Brian MacWhinney
<br...@andrew.cmu.edu> for more information.

The Association for Computational Linguistics (ACL) has a Data
Collection Initiative. For more information, contact Donald Walker at
Bellcore, wal...@flash.bellcore.com.

Two lists of common female first names (4967 names) and male first
names (2924 names) are available for anonymous ftp from

ftp.cs.cmu.edu:/user/ai/areas/nlp/corpora/names/

Read the file README first. Send mail to mk...@cs.cmu.edu for more
information.

A list of 110,000 English words (one per line, in ASCII) is
available in the PD1:<MSDOS.LINGUISTICS> directory on SIMTEL20 as the
files WORDS1.ZIP, WORDS2.ZIP, WORDS3.ZIP, and WORDS4.ZIP. Although the
list is in MS-DOS files, it can easily be used on other machines (but
first you'll have to unzip the files on a DOS machine). The list
includes inflected forms of the words, such as plural nouns and the
-s, -ed, and -ing forms of verbs; thus the number of lexical stems in
the list is considerably smaller than the total number of word forms.
These files are available via FTP from WSMR-SIMTEL20.ARMY.MIL
[192.88.110.20]. SIMTEL20 files are mirrored on wuarchive.wustl.edu.

The Collins English Dictionary encoded as a Prolog fact base is
available from the Oxford Text Archive by anonymous ftp from

ota.ox.ac.uk:/pub/ota/dicts/1192/ [129.67.1.165]

The Oxford Text Archive includes many other texts, dictionaries,
thesauri, word lists, and so on, most of which are available for
scholarly use and research only. See the files

ota.ox.ac.uk:/pub/ota/textarchive.form
ota.ox.ac.uk:/pub/ota/textarchive.info
ota.ox.ac.uk:/pub/ota/textarchive.list
ota.ox.ac.uk:/pub/ota/textarchive.sgml

for more information, or write to arc...@ox.ac.uk, Oxford Text Archive,
Oxford University Computing Services, 13 Banbury Road, Oxford OX2
6NN, UK, call 44-865-273238 or fax 44-865-273275.

Chuck Wooters <woo...@icsi.berkeley.edu> has extracted the most
likely pronunciation for each of about 6100 words in the hand-labeled
TIMIT database, and made them available by anonymous ftp from
ftp.icsi.berkeley.edu:/pub/speech/TIMIT.mostlikely.Z.

A list of homophones from general American English is available by
anonymous ftp from svr-ftp.eng.cam.ac.uk:/comp.speech/data/ as the file
homophones-1.01.txt. To receive the list by email, send mail to
Evan.A...@sil.org. The list was compiled by Tony Robinson.

Sigurd P. Crossland <s...@seuss.vantage.gte.com> has been compiling
a dictionary of English words, including most common American words,
abbreviations, hyphenations, and even incorrect spellings. The most
recent version is available by anonymous ftp from

wocket.vantage.gte.com:/pub/standard_dictionary/dic-0394.tar.gz

The tar file includes 31 text files, one for each word-length from 2
to 32. The compressed tar file takes up just over 4mb of space, and
includes approximately 870,000 words.

WordNet is an English lexical reference system based on current
psycholinguistic theories of human lexical memory. It organizes nouns,
verbs and adjectives into synonym sets corresponding to lexical
concepts. The sets are linked by a variety of relations. Besides being
of scientific interest,
it makes a handy thesaurus. WordNet is available by anonymous ftp from

clarity.princeton.edu:/pub/

If you retrieve a copy of wordnet by ftp, please send mail to
wor...@princeton.edu.

Commercial:

Illumind publishes the Moby Thesaurus (25,000 roots/1.2 million
synonyms), Moby Words (560,000 entries), Moby Hyphenator (155,000
entries), and the Moby Part-of-Speech (214,000 entries), Moby
Pronunciator (167,000 entries with IPA encoding, syllabification, and
primary, secondary, and tertiary stress marks) and Moby Language
(100,000 word word lists in five major world languages) lexical
databases. All databases are supplied in pure ASCII, royalty-free, in
both Macintosh and MS-DOS disk formats (also in .Z file formats). Both
commercial (to resell derived structures as part of commercial
applications) and educational/research licenses are available. Samples
of each of the lexical databases are available by anonymous ftp from
netcom.com:/pub/grady/Moby_Sampler.tar.Z [192.100.81.100]. For more
information, write to Illumind, Attn: Grady Ward, 3449 Martha Court,
Arcata, CA 95521, call/fax 707-826-7715, or send email to
gr...@netcom.com.
[Maintainer's note: This contact information is no longer valid.
We're working on finding a current address.]

The Oxford Text Archive has hundreds of online texts in a wide variety
of languages, including a few dictionaries (the OED, Collins, etc.).
The Lancaster-Oslo-Bergen (LOB), Brown, and London-Lund corpii are also
available from them. For more information, write to Oxford Electronic
Publishing, Oxford University Press, 200 Madison Avenue, New York, NY
10016, call 212-889-0206, or send mail to arc...@vax.oxford.ac.uk.
(Their contact information in England is Oxford Text Archive, Oxford
University Computing Service, 13 Banbury Road, Oxford OX2 6NN, UK, +44
(865) 273238.)

Mailing Lists:

CORPORA is a mailing list for Text Corpora. It welcomes information
and questions about text corpora such as availability, aspects of
compiling and using corpora, software, tagging, parsing, and
bibliography. To be added to the list, send a message to
corpora...@x400.hd.uib.no. Contributions should be sent to
cor...@x400.hd.uib.no.


----------------------------------------------------------------
Subject: [5-7] Where can I get training sets for machine learning algorithms?

UC/Irvine (UCI) AI/Machine Learning Repository:

ftp.ics.uci.edu has a variety of AI-related materials, with a special
focus on machine learning. For example,

ftp.ics.uci.edu:/pub/machine-learning-databases/

contains over 80 benchmark data sets for classifier systems (30mb).

MLnet Machine Learning Archive
MLnet Online Information Service

In 1988 the Special Interest Group on Machine Learning of the German
Society for Computer Science (GI e.V.) decided to establish a library
of PROLOG implementations of Machine Learning algorithms. By 1994 the
library had a sizable collection of GLPed PROLOG software. The site
has grown, and now, according to the webpage it "offers a growing
collection of ML information, datasets, software and pointers to other
ML resources." The homepage is at:

http://www.mlnet.org

Send your contributions to Mathias Kirsten (in...@mlnet.org) at the GMD -
German National Research Center, or use the contribution facilities within
the MLnet OiS.

----------------------------------------------------------------
Subject: [5-7] What on-line Journals are there?

[this question is still in progress]

Journal of Artificial Intelligence Research. See [3-2a].
Journal of Machine learning Reasearch. See [3-2n].

---
[ comp.ai is moderated. To submit, just post and be patient, or if ]
[ that fails mail your article to <com...@moderators.isc.org>, and ]
[ ask youa news administrator to fix the problems with your system. ]

0 new messages