Nik, good to hear from you again. I’m happy to see this discussion rise to the surface particularly in the face of dwindling resources and increasing demand for open data.
Your last point is interesting. I’m keen to talk more about how govt and the open data community can co-design such applications so that the end product isn’t one where the user response isn’t ‘that’s nice but it would have been better if XYZ had been considered’.
Anyone want to chime in one this?
David
The tool needs to be able to connect to multiple data sources, perform data mashups, provide advanced visualizations, accommodate geospatial analysis, and provide an easy to use interface for analysts, developers and end consumers.
Technical Specifications:
Ease of Use: the tool’s overall ease of use for analysts, developers and consumers is paramount. Analysts and developers should not need to know SQL or any other proprietary languages to leverage the tool. Consumers should be able to interact easily with the data in a highly visual, easy to use interface.
Pull and Assemble Data: the tool will allow Analysts to access and join data from multiple data stores and combine external data in the form of flat files, spreadsheets or other formats. Analysts should be able to access a central repository containing frequently used joins, designs and objects and to access metadata captured from earlier explorations.
Analyze and Visualize Data: the tool will provide advanced analysis capabilities and support advanced visualization techniques. The tool should be able to present geo-coded data on a visual map. Formatting options for layout and design must be flexible and easy to use.
Package and Distribute Data: the final output of the data exploration and visualization should be easy to package and distribute. Consumers should be able to interact with the output while connected to their local area network or when working disconnected. The output should be exportable to Excel, PowerPoint and other formats.
Mobile Capabilities: the tool should allow consumers who work in the field or the office to receive and interact with data on mobile devices including tablets and smart phones.
Training and Support: the training required to leverage the full capabilities of the tool should be manageable for Analysts and minimal for Consumers. Training should be clearly accessible through e-learning, online help and through local training partners if necessary.
Architecture and Integration: the tool should be able to integrate with the databases, cubes and semantic layers that are currently in production at WorkSafeBC and provide integration with SharePoint. The tool and its capabilities need to be embedded within our existing applications and web pages. The tool should be easy to deploy and scalable for enterprise use. The initial installation is estimated at approximately 500 users.
Although I would make the point that usually bulk downloads ARE APIs.
Your browser does a HTTP GET of a usually fixed URL to fetch the
resource. It's the simplest possible API. The gov't updates that
file, then you GET it.
So if a API is required, a single bulk download API request meets that
requirement IMO.
I can also talk from personal experience about building Recollect.net.
I would not base my application on any API provided by the gov't for
technical and business reasons. I wouldn't trust that the Gov't APIs
could meet the performance, efficiency and reliability requirements I
need. So I seek raw data, then write importers into my DB schema.
From here, I expose my data model into a RESTful API that we build our
app UI and connectors on.
I would use an API to import data into the app, or poll the API for
new data, but I would always seek to maintain a local copy of the data
with access designed for my specific use.
I want to bounce this back to Jury - how does this blog post and group
consensus inform the projects that you're working on? Are you seeing
similar problems with APIs?
Luke
Luke to your question, I believe that there's huge value in the open data community collaborating with and assisting municipalities (all govs) with designing their open data portals - after all, the data catalogues are primarily meant for your use and those of the research community. A significant decision is being in the ICT community around inhouse data centers versus the cloud or a hybrid and I thought the article had an element that could contribute to the decision making process.
I'm hearing from the group that Gov should stay out of the API biz. That's great if that's the optimum scenario for you - so, back to you, is that the optimum .... or maybe "it depends" :-)
Cheers Jury
Sent from my iPhone
Gov'ts should provide the "simplest possible API" - the single bulk
download file. No coding fancy things, just put the data at a URL and
keep it up to date.
Only when it's extremely necessary - as noted elsewhere in this thread
- should a more complex API be considered.
But "Bulk download" vs "API" is totally orthogonal to "In-house" vs
"Cloud". A URL is a URL is a URL. I don't care if the bulk download
comes from a municipal server or from amazon s3 cloud hosting. Focus
on the simplest possible API - the bulk download file.
Sent from my iPad