Forum Moderators: open
YouTube's parent company Google has announced on its blog that automatic captions are to begin to roll out across the site. The machine-generated captions will initially be generated in English. At first they will only be found on 13 channels. These include National Geographic, Columbia, as well as most Google and YouTube channels.The software engineer behind the technology, Ken Harrenstien, is deaf. Currently YouTube offers a manual captioning service but video makers tend not to use it.
"The majority of user-generated video content online is still inaccessible to people like me," Mr Harrenstien wrote in the Google blog.
At best, I can often grasp general context from a phone message. It tends to translate names into other words. So "This is Tommy" was transcribed as "This is call me". It does fairly well on stop words and numbers, but tends overwhelmingly to miss the words of significance.
It's fun to guess though. I actually guessed based on area code and the few correct words of significance that "call me" was "Tommy".
Still, there's a long way to go for this be a significant help to the deaf. Professional audio from a professional announcer who doesn't say "um" a lot (a killer on Google Voice transcriptions) might help though.
Assuming they roll it out more broadly, it raises an interesting question. Four actually
1. Is Google using captions in the index currently?
2. Is this also the first step to enabling search of audio and video content beyond title, tags and surrounding text?
3. Will they be able to implement duplicate content filters eventually? I know there are people who use the same video many times, with different tags, titles and surrounding text. My understanding is that currently, if you change the lenght by a second or two, it will not be seen as the same video.
4. Does this make it more important to add voiceover to your videos instead of just my dramatic video of birds diving underwater?