As part of our DANCE efforts, we are creating a common analysis structure for web discussions called DiscourseDB.
DiscourseDB is supposed to represent online discussions from different sources (e.g. forums, chats, instant messaging, etc.) in a unified format that allows researchers to perform discourse analyses across sources without having to take the specific properties of each particular source into account.
We are still at an early design stage and welcome any feedback.
The links below lead to (1) a brief overview document summarizing the basic ideas that govern the current design of DiscourseDB and (2) to an Entity-Relationship diagram that lays out the current structure of DiscourseDB.
DiscourseDB overview document