e.g. And that's why I never get any respect. [laughter] Now think of this more...

+

# Try not to break sentences up, even if it ends up not exactly lining up with the minute headings.

+

#* That is, if a sentence goes over the minute boundary by 5 or 10 seconds, keep the text for it in the minute where the sentence started. Only break the sentence if it goes longer than that (use your best judgement).

+

+

See below for an example, or better yet follow the example in: http://elinux.org/index.php?title=Session:The_Internet_of_Things

Overview

The purpopse of this project is to create transcripts of embedded Linux talks, as pages on the elinux wiki,
via crowdsourcing. The value of transcripts is that the material in the talks is much more
accessible to people. The transcript is searchable, and the talk can be read much more quickly than it can
be watched. This essentially makes the talk accessible via random access instead of sequential access.
There is a lot of very good content that is preserved in the videos that have been made over the years, and
the goal of this project is to make that content more accessible and usable to embedded Linux developers.

Instructions for Volunteers

There are three main tasks involved in this project, and as a volunteer you can participate in any of these tasks:

Creating session pages for embedded Linux talks

Creating transcriptions for the talks (on the session pages)

Verifying the transcriptions for the talks

Please find instructions for each of these activities below...

Note: There is a single master page holding all the talks we would like to transcribe,
located at:

Create an entry for the talk in the table of presentation on the list page. Follow the style of linking that is used for other sessions.

Please select a unique name for the talk (possibly adding a date if needed to distinguish the talk from others with similar names)

The session page name should always start with the prefix: "Session:"

Save the List page.

Click on the link, to create the session page for the talk

Create the base format for the page by copying and pasting from the Session:Template

Fill in the page with as much information as you can

Make sure that you include a link to the slides (if available separately) and the video

Sometimes, it is possible to get the original description of the talk, or the speaker biography, from the original or archived web site of the event

Save the session page

That's it. Thanks!

Adding to a transcription

By our estimate, it takes about 10 hours to transcribe a talk, if done by a human. This is based
on an estimate of about 10 minutes of transcription work for each minute of video. In order to make
this easy an scalable, we have broken down the transcript into 1-minute intervals.

e.g. And that's why I never get any respect. [laughter] Now think of this more...

Try not to break sentences up, even if it ends up not exactly lining up with the minute headings.

That is, if a sentence goes over the minute boundary by 5 or 10 seconds, keep the text for it in the minute where the sentence started. Only break the sentence if it goes longer than that (use your best judgement).

The first session I tried was the Samsung fragmentation page from ELC - unfortunately, there was no video for this.
The second session I tried was for Mike Anderson's keynote from ELC 2012 - setting up the page was a bit of a pain.
Very clear instructions are needed for this. It should be as mechanical as possible, with little room for error.

They use square brackets for comments like [laughter], [music], or [pause], and angle brackets
to indicate a speaker change.

Starting and stopping the video is painstaking. Backing up the video is difficult to be precise at.

For Mike's talk, it took me about 6 minutes of transcribing for each minute of video. I was trying
to go as fast as possible and timing myself. I think a reasonable estimate for each minute of
video is about 10 minutes of time (set up, actual transcription, corrections, saving). This is
when done out of context (a single minute session).

At this rate, 6 minutes of video would take an hour to transcribe,
so a 60 minute video would therefore take about 10 hours to transcribe.
Give another 2 hours for a second pass, and we're talking about 12 hours per video.
That's a lot, and clearly outside the scope of what one person would be expected to do.

Note that Mike's talk had very good audio, and Mike speaks slowly and clearly with no accent, so
this is probably a best-case scenario.