Hello,
I am an astronomer in the Nautical Almanac Office at the US Naval
Observatory (though I actually telework from Colorado). I use Python daily
for my work, and--apart from a general interest in AI--I am specifically
interested in algorithms that make working with our large datasets more
efficient, algorithms for finding groups within datasets, and algorithms
that help with optimizing page layouts.
Eric
On 9 Apr 2009, at 22:03, Noah Gift wrote:
> So we might need to spread the word a bit.
Given that a rather interesting group is apparently beginning to form,
we might have to think a bit harder about exactly what kind of word we
wish to spread.
I'm an old-school AI-er. My core discipline is psychology, my
specialism is cognitive psychology, my career has been in cognitive
science (AI, model-based programming, knowledge representation & HCI);
initially for a couple of years at Marconi Research and then for a
decade or so at HP Labs Bristol. Since '96 I've been hanging out on
the net as an independent.
On the psych side, I'm mainly interested in mental models; perception,
memory and reasoning - and the cognitive factors in human error. On
the comp side I'm mainly interested in knowledge representation,
reasoning techniques and HCI - and cognitive factors in the design/use
of programming languages.
I'm currently working informally on bringing AI-enhanced decision
support technology within the grasp of the ordinary Joe, the one who
is currently being forced to make woefully under-supported choices
about increasingly more complex matters, the outcome of which will
profoundly affect his future well-being.
>> 2. Noah, when you write """Some of the things I am interested in are
>> solving "low hanging fruit" problems that can be done this year, or
>> sooner.""" do you have any examples to share?
>
>
> Yes. The first one I want to solve, a lot for personal reasons, is
> figuring a way out to process more information in less time. The
> practical implication is RSS/Atom feeds. There is already enough
> information published to get started. The book collective
> intelligence has some examples of classifying RSS feeds. Tarek wrote
> Atomiser:
> http://tarekziade.wordpress.com/2008/12/14/looking-for-beta-testers-for-atomisator/
I agree that it's not too difficult to get started but getting usable
results demands significant effort. Unless you're happy to rely on 3rd
party solutions such as OpenCalais, you're probably going to need to
create your own adequately tagged training corpus.
It is a tempting piece of fruit, though. I decided to limit my goal to
automated tagging and took a very simplistic approach, I'm still
trying to decide whether the result is worth pursuing further [1] (the
output certainly needs tidying). I must admit, RSS items are currently
a lot cleaner and easier to handle than web pages.
Phorm [2, 3 (sections 48-50)] seem to be taking a similar approach (a
frequency count of non stop words) but I can't see that they're going
to be dramatically more successful.
> I like this initial problem because it is practical, it could save me
> and others 10-60 minutes a day, for example, and it isn't that hard.
I think you'll find that the devil is in the detail and in the
training of the classifier.
>> 3. Is there any interest in creating a Python-oriented code resource
> That sounds like a great idea. I like the idea of creating a bunch of
> open source AI solutions in Python in one spot. There is this google
> code project from Norvig's book, which might have some good ideas to
> look at as well. Maybe if we get traction, we could have a GSOC
> project, get Universities involved etc.
>
> http://code.google.com/p/aima-python/
The NLTK [4] is a great resource and I really must make some time to
play with MIT's conceptnet [5].
Noah suggests bitbucket, I'll follow that with knowledgeforge:
http://www.knowledgeforge.net/
>> 4. Anyone for SemWeb?
>
> I don't know a lot about the Semantic web, but a friend from Atlanta
> is really an expert on it. I have CC'd him, in case he is interested.
It might be able to make a contribution to the success of your RSS
summariser. There's some quite interesting stuff knocking around such
as python-dlp [6] which offers DLP-based reasoning over RDF/N3 content
and includes Chimezie Ogbuji's FuXi, a forward-chaining production
system for N3, based on Forgy's Rete algorithm.
I'm pursuing some SemWeb work from a heavily pragmatic perspective - I
want to find out whether it actually works and whether I can use it on
a commercial basis. To that effect, I've chosen a real-world domain,
the UK Parliament, and am attempting to model it using RDF [7]. I'm
looking forward to firing up the inference engines once the domain
model is complete.
> Finally, I am very realistic that this might be a boring list, and
> some problems might take a few years to solve or more. I don't mind,
> I am patient :)
The patience will definitely come in handy but I don't see why this
should necessarily be a boring list, its subscribers thus far seem to
be involved in some interesting work.
Cheers,
Graham
http://www.linkedin.com/in/ghiggins
[1] http://bel-epa.com/resources/ratonit/
[2] http://en.wikipedia.org/wiki/Phorm
[3] http://www.cl.cam.ac.uk/~rnc1/080518-phorm.pdf
[4] http://www.nltk.org/
[5] http://conceptnet.media.mit.edu/
[6] http://code.google.com/p/python-dlp/
[7] http://knowledgeforge.net/semwebparlparse/home.html
-----BEGIN PGP SIGNATURE-----
iEYEARECAAYFAknfMicACgkQOsmLt1NhivxEiwCg33SkoHO64A0FEtjGRNFrx3dl
ugYAoLvMmsxSiPtEXYpPGz45O8zJsFseiQCVAgUBSd8yJ1nrWVZ7aXD1AQISQQP9
E727SAn+KWrEDyKJOsPC18qdIBUViOGUOYrBQ49wK47QJo645eIAOWy7gA5XDKgK
wyiX1Q4n/nTh7jwhsYiEtRkO6M44rMExisFU+YDFzNKHtmS2ZNbkGh7YbI3qv1YI
ZmycWyp68264IL52A+ascao6hZu8iosBOpFy5Rv3cLg=
=GeEZ
-----END PGP SIGNATURE-----
On 10 Apr 2009, at 15:34, Rick Thomas wrote:
> the average semantic web application is much more prosaic.
Just in passing ... do you consider that there is such a thing as an
"average semantic web app" at this early stage of adoption?
> More philosophically, I recommend Andy Clark's book Being There. He
> criticizes expert system type AI and related models of human
> cognition. Roughly the idea is that human intelligence is not a matter
> of logical models in the brain but rather the situated relation of
> brain to environment and artifacts - such relations are the
> computation so, literally, our minds are not contained in our brains.
> By this reasoning our language and our computers are further
> extensions of our intelligence, not independent artificial
> intelligences. Where Noah says he would like a bot that thinks, I want
> a bot to think *with*.
>
> I think it's important to distinguish a least two senses of the term
> "artificial intelligence". One is historical describing the research
> that lead to expert systems and the type of algorithms the Raymond
> Hettinger explains. http://us.pycon.org/2009/conference/schedule/event/71/
> By now these are just tools to use when needed.
> The other usage is directed to questions of future possibilities,
> elusive because they are questions of the nature of the mind.
Ach. Given that this group has assembled under an "AI" banner, we
probably ought to ensure that we're in some sort of broad agreement
of the interpretation of the term in this particular Pythonic context.
Taking a pragmatic perspective: I find it hard to believe that
anyone's planning to re-implement Soar or ACT* in Python (but I
suppose it's not beyond the bounds of possibility). Perhaps I'm
mistaken in my perception - does anyone envisage a role for Python in
implementing a model of the kind of situated cognition that Clark is
describing?
I rather saw the primary focus as being on the kind of techniques of
statistical analysis and algorithms referenced by Hettinger and
implemented in Python. It would be these that underpinned Noah's bot.
I suspect that there are quite narrow limits on what we can collate -
the Norvig examples are snugly ensconced on googlecode, similarly Pyke
[1] on sourceforge. My original perception was that there might be a
role for a gradual accretion of more extensive examples and projects
(e.g. an RSS classifier bot) or perhaps, as has been suggested, a
unifying "glue" layer.
<digression>I share many of Clark's views but not his (admittedly
waning) endorsement of the connectionist approach. I recall attending
a talk on the subject given by Geoff Hinton - the neats were
completely sold, the scruffs, along with the psychologists, were
appalled (by and large). At Labs, we took a very close look at the
possibilities but were ultimately dissuaded from adopting it as a
means of classification because of its discontinuity with the
substantial body of empirical evidence from cognitive psychology.
We did hear of a case in the US insurance world where they adopted a
connectionist approach to classifying risk that ran into serious
trouble when it efficiently learned that people with certain zip codes
in their addresses were a greater risk than others and it classified
them accordingly, an illegal form of social exclusion known as "red-
lining". Being a distributed representation, there was little that
could be done either to remedy the problem or prevent the system from
re-learning the relationship.</digression>
Cheers,
Graham
http://www.linkedin.com/in/ghiggins
[1] http://pyke.sourceforge.net
-----BEGIN PGP SIGNATURE-----
iEYEARECAAYFAknfh+sACgkQOsmLt1NhivzpzwCeNG7v6eI6He+tzMQFMO9S1co+
PIUAoPdwKdXNpQZU0mKSLpNqi5wfzIMgiQCVAgUBSd+H61nrWVZ7aXD1AQLWgwP/
YoXiv1LqK2bOmnR7iA1ntvVCZDpazR3nGYJe7W65gJyn1+IvhD75H8k8g6r5Bynp
wNyGdyqURPSWjtw2wh7cLKiJN6T5dF13xakpwdUxTB6TgdSE0jrt9KAg5Jcqv1QU
lTudUG5gfP1noLLg1PRKQoYihv8wbNOk7n01JsAqKxw=
=d0AV
-----END PGP SIGNATURE-----
the average semantic web application is much more prosaic.
Just in passing ... do you consider that there is such a thing as an
"average semantic web app" at this early stage of adoption?
Ach. Given that this group has assembled under an "AI" banner, we
probably ought to ensure that we're in some sort of broad agreement of the interpretation of the term in this particular Pythonic context.