Real Time Sub-Titles For Videos : Thoughts, Ideas & Algorithms?

Kaustubh Katdare

Kaustubh Katdare

@thebigk Oct 21, 2024
This novel thought just struck my mind and I'm sure it's going to be a 'HUGE' thing if implemented. The idea is like this :-

When a video streams, the algorithm will quickly fetch the streamed data; extract the audio signals out of it and depending upon the voice, add sub-titles in the language of user's choice. So as soon as you start streaming your video; the sub-titles will start appearing on user's screen in the language decided by the user.

What do you think? Quite awesome? Already done? How to do it? Algorithm tryouts? Bring them on! 😁

Replies

Welcome, guest

Join CrazyEngineers to reply, ask questions, and participate in conversations.

CrazyEngineers powered by Jatra Community Platform

  • Anoop Kumar

    Anoop Kumar

    @anoop-kumar-GDGRCn Jul 8, 2012

    Aim: ^^^
    Apparatus: Super fast speech recognition system which needs self leaning computing system + Google translate.
    Major user: frequent visitor of YouTube video learner and PirateBay. ☕


    Edit: here is one of this kind project developed but I din't find how actually it's working
    <a href="https://blog.mathieu-leplatre.info/announcing-subtivals-realtime-subtitles-for-film-festivals.html" target="_blank" rel="nofollow noopener noreferrer">Announcing Subtivals, realtime subtitles for film festivals - Mathieu Leplatre</a>
  • silverscorpion

    silverscorpion

    @silverscorpion-iJKtdQ Jul 9, 2012

    ^^ That is still not completely real-time. The subtitles should still be prepared beforehand. It's just projected from a different projector, so that even if the original video is not subtitled, we can still have our own subtitles superimposed from another projector. And then there are other features like sncing, speeding up the subtitles etc...

    Truly real-time subtitle generation would be quite a task, I think. Speech recognition, Natural Language processing and translation. All the tasks are formidable in themselves. Should be combined in real time. But the idea is really good.