Re: How to extract HD dvb subtitles (and possibly convert to srt)?

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Al 18/12/10 12:57, En/na Luca Olivetti ha escrit:
Al 15/12/10 18:28, En/na Luca Olivetti ha escrit:
I've a media player that only understand srt subtitles, so I'm looking
for a way to extract the dvb subtitles from a vdr ts recording, and then
use some kind of ocr to convert it to srt.
I asked google and I came empty handed (many people asking the same
question with no working answer).

[...]


I found how to do it but it's way too cumbersom:

the cvs version of ProjectX supports HD subtitles, *but* the sup
exported subtitles aren't readable neither by suprip (not subrip!) nor
by BDSup2Sub, so I additionally export to sub format and use BDSup2Sub
to convert it to a valid sup, that can then be processed by suprip.

Both ProjectX and BDSup2Sup, being java programs, work under Linux, but
suprip is windows only.


Projectx:
http://sourceforge.net/projects/project-x/

BDSup2Sub
http://forum.doom9.org/showthread.php?t=145277

SupRip
http://exar.ch/suprip/

I found a simpler (i.e scriptable) way: I wrote a simple python script that uses ProjectX (cvs version) to extract the subtitles from the ts file, BDSup2Sub to convert the subtitles to png images, ImageMagick's convert to improve the resulting images and finally gocr to convert the images to text.
It seems to work acceptably with recordings from the bbchd/bbcone-hd.
It's available here:

http://ventoso.org/luca/vdr/vdrsubrip/

Both ProjectX and BDSup2Sub will spit a million warnings, but the final result is ok. As always YMMV.

Bye
--
Luca

_______________________________________________
vdr mailing list
vdr@xxxxxxxxxxx
http://www.linuxtv.org/cgi-bin/mailman/listinfo/vdr


[Index of Archives]     [Linux Media]     [Asterisk]     [DCCP]     [Netdev]     [Xorg]     [Util Linux NG]     [Xfree86]     [Big List of Linux Books]     [Fedora Users]     [Fedora Women]     [ALSA Devel]     [Linux USB]

  Powered by Linux