This is a maintenance release which incorporates fixes and enhancements from the both the community and the core team.
- HTML parsing functions (based on JSoup)
- Metaphone3 (American English) & Cologne Phonetic (German) coders & clustering
- Google Fusion Table import support
- Facet for exact duplicates
- Ability to star favorite expressions for reuse later
- Latest Apache POI library including a number of Excel bug fixes
Google Refine is designed to be extensible; a number of extensions have been written by users over the past eight months, e.g. the
RDF extension from
DERI in Galway and the
CKAN extension. Extensions are distributed separately by their publishers and significantly enhance the functionality of the base product. The wiki has documentation on writing your own
extension or
reconciliation service.
Other than
backing up your data, there is no special upgrade procedure required. If you are an existing user, you will get prompted with the option to upgrade the next time you run Google Refine.
The kits for Linux, Mac, and Windows are available for download from:
The project is completely open sourced, liberally licensed, and community driven. If you'd like to be involved, we'd love to have you. Check the
wiki for
ways to participate.
The Google Refine Team