Good Morning Everyone,
I wanted to start out by thanking everyone who attended last night concerning the city-wide open access database which we have termed 'DataLFT'.
In summary:
The group decided that the end goal of this group should be to provide a standardized means of retrieving/storing data related to the Lafayette Area.
Many questions were raised on how to achieve that technologically, with many "I don't know's" and "WTH???"... j/k...not really but that's the gist.
We all determined that first and foremost, we have to learn what data is available in what formats and where the largest needs lie in terms of type of data sets.
Basically, we agreed to mimic the
datasf.org site for now to gather these resources with some analysis to determine which data sets to focus on first.
The outline of our stripped down
datasf.org clone is:
- Home page
- Listing of datasets and descriptions
- Simple search
- DataSet submission
- Name of data set
- URL to data set
- Description of data set
- Type of data set
- (Optional) Instructions on how to use it as a feed
- Dataset suggestion
- Listing of dataset suggestions
- Ability for other members to vote on whether they would like to have that data as well
- Entry of new dataset suggestion
- Suggested Name
- Description of data the user would like to see
- User Management
- Sign up form
- email
- password
- job title??? - I'm thinking to use for logistics of what type of person wants what type of data? Open for suggestions.
- bot prevention
- Admin Section
- Ban/Delete abusive users
- Approval of DataSet submission
- 3 Administrators should review and agree the data set is valid
Please feel free to update anything I may have missed or forgotten.
Today I'll be setting up a virtual machine with CentOS installed and Mercurial. I'll be setting up the repository on
http://bitbucket.org. Register now if you want or I'll send out the final information when I'm done. If you want to follow me on there I'm bryanfuselier. You will receive notification that I've created a repo when I do.
Which leads me to a small challenge I want to offer:
Last night when raised the question "What is the platform our website will be in?", naturally everyone brought up their specialties which were:
Coldfusion (with 4 votes)
PHP/Python (with 2 votes)
Everyone seemed to agree on MySQL as the database backend since everyone had worked with it. I started thinking after I went home last night that we're the only developer group that doesn't focus on a specific platform internally. So the challenge becomes:
Let's try something none of us have done. Why not? What's wrong with learning something different when we have each other to lean on? With such a small project, we won't be looking at large amounts of time to figure out what we want to do. If you all are up to this challenge, send back your feedback on what you would like to work with and why. Here's mine:
Tornado -
www.tornadoweb.org - We are discussing building a user-generated content site. While we don't expect to have 10,000 users, why not go ahead and build it in a service that solves the
C10K Problem? One thing that was mentioned was preparing this for usage in other areas. What if Los Angeles adopts it and they end up with over 10,000 users?
SQLite3 -
www.sqlite.org - Small footprint because it runs only in script when necessary.. Actually I could re-list this:
http://www.sqlite.org/features.html but it pretty much speaks for itself. While it definitely doesn't work for our "end product" of storing massive amounts of data, it would serve well for this small site and perform extremely well. Plus being file based, it can easily be handed over to people who want a copy of it ( to protect people's privacy we can create two db files: one for user management and the other for site data).
Let's keep the conversation going. I want suggestions and ideas. Thanks everyone!!!
Bryan F.