Live TV Captioning: Human or Machine Transcription?

  • Thread starter: Gear300
  • Tags: Machines
AI Thread Summary
The discussion revolves around the capabilities of captioning technologies used in live broadcasts, such as NFL and FIFA events. There is a debate on whether these systems are advanced machines or skilled human stenographers. Observations indicate that captions often lag behind spoken words in news programs, while in studio settings, the opposite occurs. The use of captions is prevalent, especially for accessibility, but certain formats like Jeopardy can be problematic due to spoilers. Technological advancements, such as those in the Google Pixel 6A, allow for real-time captioning during phone calls and podcasts, suggesting that local machine processing is involved. While machines are increasingly utilized for transcription, especially in live settings, human involvement remains crucial for accuracy, particularly in high-stakes environments like news broadcasts or legal proceedings. Stenotype keyboards are commonly used, and the need for precise transcription is emphasized, as errors can lead to significant consequences in various contexts, including business negotiations and courtroom settings.
Gear300
Are they really caption machines, or really well-trained stenographers? Like in an NFL or FIFA or other live broadcast. I took it that they might have been caption machines with highly advanced grammar compilers, but looking around, I guess they can just as well be very fast typists/keyboard-players.

When somebody asked two years ago which language would dominate the future (because English seems to hold most of the Internet), I thought that ironically NLP and fast-translating machines would preserve pluralism in language, mostly because in a hundred or so years, we would have fast-translating or quick-witted grammar compilers.
 
Gear300 said:
Are they really caption machines, or really well-trained stenographers? Like in an NFL or FIFA or other live broadcast.
I think machines.

One thing I have noticed:
  1. For news programs, the captions lag behind the spoken words.
  2. For studio programs, the spoken words lag behind the captions.
I use captions on TV almost 100% of the time. But I learned to never use captions when watching Jeopardy because the captions reveal the answer before either I or the contestant have time to answer.

I just got a new Google Pixel 6a phone. It has a feature that shows captions for live phone calls or podcasts. It works for downloaded podcasts even in airplane mode, so the captions must be generated locally on the phone. I suspect that I may be able to select the language of the phone's caption machine independently of the language of the speech.
 
Zoom does this today in real time. Not very well for scientific talks ("Is Jay-Sigh a rapper?"), unfortunately.
 
It depends. Machines are used in some cases, but TV programs with a decent budget still use humans to transcribe. Computers still make too many mistakes, and that could cause real problems if what is being transcribed is, say, a news program or a political speech.
As far as I understand, the keyboards used are versions of stenotype keyboards.

Note also that this is not as "exotic" as it might seem. There are lots of cases where transcription has to happen live, the most obvious being a courtroom, where what is said obviously has to be recorded accurately. There are also services that do live transcription of e.g. lectures for people with hearing impairments. A colleague of mine uses one of these services when listening to talks at conferences, because the built-in transcription feature in e.g. Teams doesn't work quite well enough.

I don't see why the stenographer would be too "highly trained" just because it is live TV. Sure, mistakes might have more impact on TV, but even if it is just a "local" transcription, it still needs to be correct.
A friend of mine used to work for a company that transcribes (live) conference calls between businesses that are e.g. negotiating contracts. Needless to say, mistakes can be costly.
 