Standard graph API?

Magnus Lie Hetland

unread,

Aug 23, 2004, 1:04:54 PM8/23/04

to

Is there any interest in a (hypothetical) standard graph API (with
'graph' meaning a network, consisting of nodes and edges)? Yes, we
have the standard ways of implementing graphs through (e.g.) dicts
mapping nodes to neighbor-sets, but if one wants a graph that's
implemented in some other way, this may not be the most convenient (or
abstract) interface to emulate. It might be nice to have the kind of
polymorphic freedom that one has with, e.g, with the DB-API. One could
always develop factories or adaptors (such as for PyProtocols) to/from
the dict-of-sets version...

So, any interest? Or am I just a lone nut in wanting this?

--
Magnus Lie Hetland "Canned Bread: The greatest thing since sliced
http://hetland.org bread!" [from a can in Spongebob Squarepants]

Robert Brewer

unread,

Aug 23, 2004, 1:34:41 PM8/23/04

to Magnus Lie Hetland, pytho...@python.org

> Is there any interest in a (hypothetical) standard graph API (with
> 'graph' meaning a network, consisting of nodes and edges)?

1) Yes!
2) Only if it's in C. I don't mind (re)writing pure Python graph
containers, it's the speed of a pure Python graph that's the bigger
issue to me (mostly object inspection and/or
decorate-insert-retrieve-undecorate cycles). If it started in a
pure-Python package, then went into the stdlib, and then got a C
implementation (like sets.py did), that would be fine.
3) It would have to accept arbitrary objects. No "make your class a
subclass of GraphNode" garbage. If someone wanted to make a subclass of
Graph called StringGraph, which was optimized for strings, or IntGraph
optimized for ints, that would be fine. But the base class (I assume a
class implementation?) should handle instances of Object.

I'm sure there are many more details, but those are the big ones IMO.

Robert Brewer
MIS
Amor Ministries
fuma...@amor.org

Steven Bethard

unread,

Aug 23, 2004, 2:13:07 PM8/23/04

to pytho...@python.org

Magnus Lie Hetland <mlh <at> furu.idi.ntnu.no> writes:
> Is there any interest in a (hypothetical) standard graph API (with
> 'graph' meaning a network, consisting of nodes and edges)?

I don't need one right now, but I know I have a few times in the past.
Certainly seems like a good idea to me. We've got sets as builtins now, no
reason we shouldn't have a simple graph API, at least in the library.

Steve

wes weston

unread,

Aug 23, 2004, 2:43:35 PM8/23/04

to

Magnus Lie Hetland wrote:
> Is there any interest in a (hypothetical) standard graph API (with
> 'graph' meaning a network, consisting of nodes and edges)? Yes, we
> have the standard ways of implementing graphs through (e.g.) dicts
> mapping nodes to neighbor-sets, but if one wants a graph that's
> implemented in some other way, this may not be the most convenient (or
> abstract) interface to emulate. It might be nice to have the kind of
> polymorphic freedom that one has with, e.g, with the DB-API. One could
> always develop factories or adaptors (such as for PyProtocols) to/from
> the dict-of-sets version...
>
> So, any interest? Or am I just a lone nut in wanting this?
>

Magnus,
A know I'd appreciate it. It could be used to configure
neural nets and logic networks; where this api would make
it easy to build an abstraction then "compile" it into a
faster representation for execution - or just run the
tree/graph in "interpreted" mode.
I don't think it would get a lot of use, but the use
would be high end.
wes

David Eppstein

unread,

Aug 23, 2004, 2:58:15 PM8/23/04

to

In article <slrncik8...@furu.idi.ntnu.no>,

m...@furu.idi.ntnu.no (Magnus Lie Hetland) wrote:

> Yes, we have the standard ways of implementing graphs through (e.g.)
> dicts mapping nodes to neighbor-sets, but if one wants a graph that's
> implemented in some other way, this may not be the most convenient
> (or abstract) interface to emulate.

Actually, my interpretation of this standard way is as a fairly abstract
interface, rather than a specific instantiation such as dict-of-sets:
Most of the time, I merely require that iter(G) produces a sequence of
the vertices of graph G, and iter(G[v]) produces a sequence of neighbors
of vertex v. I also sometimes use "v in G" and "w in G[v]" to test
existence of vertices or edges.

Pros and cons of this approach:

- You can use a list instead of a set in the adjacency list part of the
representation. This may be faster and more space efficient when the
vertex degrees are small.

- It's easy to create test graphs as code literals

G1 = {
0: [1,2,5],
1: [0,5],
2: [0,3,4],
3: [2,4,5,6],
4: [2,3,5,6],
5: [0,1,3,4],
6: [3,4],
}
G2 = {
0: [2,5],
1: [3,8],
2: [0,3,5],
3: [1,2,6,8],
4: [7],
5: [0,2],
6: [3,8],
7: [4],
8: [1,3,6],
}

- Any indexable object can be a vertex. The vertex identities can be
something meaningful to your program. On the other hand, that means
(unless you know where your graph came from) you can't rely on the
vertices being special vertex objects with nice properties and you can't
use objects like None as flag values unless you're sure they won't be
vertices.

- It doesn't provide an abstract way of changing the graph (although
that's relatively easy if G is e.g. a dict of sets)

- It doesn't directly represent multigraphs

- It doesn't directly represent undirected graphs (instead you have to
replace an undirected edge by two directed edges and hope your callers
don't give you a directed graph by mistake).

- There isn't an explicit object representing an edge, although you can
create one by using a tuple (v,w) or (for undirected edges) a set. This
can be an advantage in terms of memory usage but a disadvantage in terms
of number of object creations. Also it means that if you want to store
information on the edges you have to use a dict indexed by the edge
instead of attributes on an edge object (probably better style anyway
since it prevents different algorithms on the same graph from colliding
with each other's attributes).

--
David Eppstein
Computer Science Dept., Univ. of California, Irvine
http://www.ics.uci.edu/~eppstein/