AutoTC : Automatic Time-Code recognition for the purpose of synchronisation of subtitles in the broadcasting of motion pictures using the SMPTE standard : a thesis presented in partial fulfilment of the requirements for the degree of Master of Science in Computer Science, School of Engineering and Advanced Technology (S.E.A.T.) at Massey University, (Albany), New Zealand

Kashyap, Vineet

AutoTC : Automatic Time-Code recognition for the purpose of synchronisation of subtitles in the broadcasting of motion pictures using the SMPTE standard : a thesis presented in partial fulfilment of the requirements for the degree of Master of Science in Computer Science, School of Engineering and Advanced Technology (S.E.A.T.) at Massey University, (Albany), New Zealand

Files

01_front.pdf(108.56 KB)

02_whole.pdf(2.35 MB)

Date

2011

Authors

Kashyap, Vineet

Publisher

Massey University

Rights

The Author

Abstract

Time-coding requires manual marking of the in/out points of every dialogue spoken in the lm. This process is very manual, cumbersome and time-consuming which takes about 8-10 hours for an average lm duration of 2 and a half hours. AutoTC, a multi-threaded client-server application has been built for the automatic recognition of time-codes for the purpose of automatic synchronisation of subtitles in the broadcasting of Motion Pictures. It involves generating time-codes programmatically based on the input video's frame rate to be subtitled and using the audio to recognise in/out points automatically using the principles of Voice Activity Detection. Results show that the time taken to recognise time-codes automatically is approximately 1/6th compared to the the time taken by a professional time-coder using `Poliscript'[18], a commercial tool used in the production of subtitles. `IN-SYNC', a new performance metric, has been proposed to evaluate the accuracy of the developed system which will foster further research and development in the eld of automatic subtitling in an attempt to make it the de-facto standard. The application has been tested on the NOIZEUS[30] corpus giving an IN-SYNC accuracy of 65% on clean data with 6 mis-detections and an average of 51.56% on noisy data with 13 mis-detections which is very encouraging. The application can also send data to the MOSES[32] server developed for producing draft translations from Hindi to English which will make the subtitling process much faster, e cient and quality-centric.

Keywords

Time-coding, Motion picture subtitles, Voice activity detection, Speech processing systems

URI

http://hdl.handle.net/10179/3779

Collections

Theses and Dissertations

Full item page

Files

Date

DOI

Open Access Location

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Rights

Abstract

Description

Keywords

Citation

URI

Collections