Here's our official group-written (read: laboriously edited)
announcement, which is also at www.opengovdata.org (the www. is crucial
for the moment):
---
8 december 2007 - This weekend, 30 open government advocates gathered to
develop a set of principles of open government data. The meeting, held
in Sebastopol, California, was designed to develop a more robust
understanding of why open government data is essential to democracy.
The Internet is the public space of the modern world, and through it
governments now have the opportunity to better understand the needs of
their citizens and citizens may participate more fully in their
government. Information becomes more valuable as it is shared, less
valuable as it is hoarded. Open data promotes increased civil discourse,
improved public welfare, and a more efficient use of public resources.
The group is offering a set of fundamental principles for open
government data. By embracing the eight principles, governments of the
world can become more effective, transparent, and relevant to our lives.
---
The principles can be used to determine whether government data can be
considered "open", and it was suggested that we develop some sort of
branding that we all can make use of to support and point to the
principles. The principles are at:
http://wiki.opengovdata.org/index.php/OpenDataPrinciples
The discussion pages linked from some of the terms in the principles are
editable wiki pages and do need to be fleshed out with suggestions from
anyone.
Also, Dan Newman started some discussion about how to mobilize citizens
at large over transparency issues. I am eager to see how that discussion
continues--- I expect some organizing will happen on the (open) mail
list created at the conference (and linked from www.opengovdata.org;
yes, yet another mail list...).
Ok, back to waiting for my plane.
Josh Tauberer
Timeliness: Promptly available information is necessary for meaningful involvement.
Accessibility: The House (and related agencies) should strive to publish information in accordance with W3C recommendations for accessibility.
Format: The House should embrace structured information (XML, etc), standardize formats across different agencies (this effort has been underway for some time), and utilize non-proprietary filetypes.
Preservation: House data is historically significant, so issues regarding archiving and permanence should be considered.
Availability: Balancing the requirements for informed participation in a democracy with the privacy rights of individuals and the pragmatics of a funcitoning legislature lead to proper levels of disclosure.
Accuracy: Standards for accuracy and completeness preserve meaningful access.
Usability: Information should be presented in a fashion that encourages use.
Interactivity: Appropriate levels of user input should be considered.
--
John Wonderlich
Program Director
The Sunlight Foundation
(202) 742-1520 ext. 234
This came up, though not quite in that way. I think the consensus at the
time was that if information disappeared, it was no longer "open"
because, well, it was no longer being provided in the first place.
But, adding some notes about preservation into either, say, the
'complete' or 'accessible' principles is definitely warranted.
--
- Josh Tauberer
- GovTrack.us
"Yields falsehood when preceded by its quotation! Yields
falsehood when preceded by its quotation!" Achilles to
Tortoise (in "Gödel, Escher, Bach" by Douglas Hofstadter)
Josh and Perla and John and Greg have already addressed some of the
issues intertwined with preservation. Here is my attempt to bring
these threads together, add an explicit preservation-perspective,
enumerate the problems, and provide a starting point to solve the
problems:
1. Preservation will not just happen. Digital preservation in
particular takes planning and resources. I worry that, if
preservation is not addressed explicitly it will not be addressed
adequately.
2. Requiring "access" alone -- even open, complete, free access -- is
not enough. Without planning and funding for long-term access and
preservation, access today can turn into inadvertent loss tomorrow.
3. Relying on the government to provide the only means of long-term
preservation and access will work sometimes but fail when it is most
critical. If the government is the sole-provider and it (intentionally
or unintentionally) amends, alters, loses, abridges, or deletes
content, the content is lost for everyone for ever. Information that
is most embarrassing, most valuable, most useful to citizens in making
government accountable will be most vulnerable to intentional control,
alteration, and loss.
4. Relying on the government to provide sole means of access endangers
privacy as it allows governments to record and track use of government
information by individuals.
5. We can predict, based on what governments have done in the past,
what will happen if we allow or encourage the government to recover
costs for access (even marginal cost of distribution). There will be
two-tier access for users: some will be able to afford access and
others will not. There will be two-tier access for content: 'popular'
content will be free or less expensive, but there will be charges for
less-popular or less-used. There will be two-tier access to
functionality: one user may be able to get one page of a hearing for
free, but it will cost for citizens' groups, libraries, and others who
wish to get mass content (e.g. all hearings for a congress). The
government will rely on private sector vendors to provide access
through outsourcing and will claim that availability of content
through private vendors meets the requirements of 'access.' In an
attempt to recover costs, governments will license access to data and
in doing so, impose licensing restrictions on redistribution and use
and will apply technological locks (i.e., DRM) to enforce license
restrictions.
One possible component of a solution to the above problems is to
require that the government make available en masse and distribute
(without charge or licensing restrictions or DRM) all government
information to libraries, archives, and other memory organizations.
The existing Federal Depository Library Program (FDLP), which is
defined by U.S. Code Title 44 and administered by the Government
Printing Office (GPO), provides a starting place for such a
distribution system. The GPO has, however, been arrogating to itself
the role once given to distributed depository libraries and most
depository libraries have been reluctant to ask for the responsibility
of accepting deposit of digital government files. It will probably be
necessary to write into plans for preservation the explicit role of
government deposit and the role of depository libraries to accept and
preserve that information. With lots of copies in lots of
institutions, free of of locks and restrictions on use, it will be
harder to lose, destroy, or control access to government information.
With multiple partners preserving and providing access to the
information, there will be multiple budgets, multiple constituencies,
and multiple technical preservation solutions.
Jim Jacobs
Data Services Librarian Emeritus
University of California San Diego
I believe, it is tomorrow (Tues) at 10am.
http://hsgac.senate.gov/index.cfm?Fuseaction=Hearings.Detail&HearingID=513
Sean Moulton
Director,
Federal Information Policy
OMB Watch
1742 Connecticut Ave. NW
Washington,
DC 20009
Phone: (202) 234-8494
Fax: (202) 234-8584
wish to get mass content ( e.g. all hearings for a congress). The