HTMLMediaElement.playbackRate
My name is Reuben Wetherbee and I work for the University of Pennsylvania School of Arts and Science. I was recently given a project to help members of the Writing Program come up with a way to align a whole bunch of readings by poets with the corresponding text. I ended up using jPlayer and Hyperaudio concepts to get the job done. The final product has both a player and creator/editor module where the Writing Program can do some manual alignment for themselves starting from plain text. Manual alignment works since most of the poems are rather short. You can choose to align by word or by line and edit already tagged text.
Anyway, if anybody from the group wants to check it out and give me feedback that would be great.
The code is on github at www.github.com/jrweth/MediaAlignedText
You can see some documentation and some demos at:
http://jrweth.github.com/MediaAlignedText
Some Poems that have already been aligned and rolled out:
http://www.writing.upenn.edu/pennsound/x/Williams-WC/the_red_wheelbarrow.php
http://www.writing.upenn.edu/pennsound/x/Ashbery/crossroads_in_the_past.php
http://www.writing.upenn.edu/pennsound/x/Ashbery/the_skaters.php
Disclaimer – this is my first foray into a jQuery plugin and really heavy duty javascript so I’m sure there are some bugs / kinks / idiotic ways of doing things that could use some work. I also haven’t yet tested on a really long selection yet, so there might be performance problems.
Hi Mark,
Presently my primary concern is how best to obtain and implement into both HTML and javascript the hypertranscripting, as my prospective project involves potentially a thousand pages, and word-alignment is required.
I've been checking out the various hyperaudio demos, and I have questions ;)
1. My understanding was that legit HTML5 data attributes must be prefixed with 'data-'; where did 'oval' and 'm' come from and are they legit HTML5 data-attributes? I'm also wondering about the pros/cons of each word being duped as an attribute of itself (e..g., oval=), versus IDs for each word (as in my hack prototype); I suppose the former is more semantically meaningful and easier to read in an array.
2. In paneltext.js I see the .subtitle arrays; how did you go about getting the time values on these projects? This certainly seems to be the most tedious aspect of the project I'm looking at (approximately 1000 word-aligned pages, btw); even with my Logic chops however, it's not a quick task and I have no way to export into anything useful, e.g., xml. I had a go with Soundbooth, using the 'Analyze Speech’ + 'Export ... Speech Analysis'. Only with a reference text were the results useful, though the exported xml file has so much extraneous taggery that to extract/convert the time values to a data-attribute will require a load of editing in BBEdit. As to CMU Sphinx, I've installed it and ran a couple of the demos but many hours later I have no idea how exactly to proceed with it for my objectives. Some granny-proof documentation/tutorials would be most welcome, but that's another board ;)
3. In the hyperpadaudio and Hyperdisken demos (http://happyworm.com/clientarea/hyperaudio/hap/v22/pad.htm, http://happyworm.com/clientarea/hyperaudio/htdemo/), where is the paneltext.js or its equivalent? Are you achieving the highlighting differently than in the other demos? Specifically, are you using an alternate method which doesn't require an array with timings for each word?
As to this: http://hyper-audio.org/r/, well that's just a thing of beauty, hella nice work man.
I thank you in advance for indulging my noob questions.
p.s. Mark, only after hours of investigation did I notice your presentation, which, IMO, is essential reading. Perhaps you can add a link to this google discussion page? For me, the key discovery therein are the available transcription services, should my results with Soundbooth prove too time-consuming. On a paid project I could easily justify hiring one of those services, though in my case, as I'll have the texts, I require only marked up timestamping.
Cheers, sr