
Research Projects

Intelligent Speech-to-Speech Translation with Lip-syncing for Educational Domain

Implementing Organization

Principal Investigator
Dr. Partha Pakray
National Institute of Technology (NIT) Silchar, Assam
Co-Principal Investigator
Prof. Sivaji Bandyopadhyay
Jadavpur University

Project Overview

English is the preferred language of education globally, but in multilingual countries like India, English-language lectures and tutorials need to be translated into local Indian languages so that students can follow the material. As digital communication becomes more visual, there is a need for systems that can automatically translate a video of an educational expert speaking in English into a target local language with realistic lip synchronization. This need is driven by the growing volume of audiovisual educational content, such as YouTube videos and the NPTEL lectures produced by government institutes. Existing systems translate audiovisual content only at the speech-to-speech level, which leaves lip movements out of sync with the translated audio and results in a poor user experience. This project aims to build on face-to-face translation systems for the educational domain by proposing a pipeline that takes a video of a person speaking in a source language and outputs a video of the same speaker speaking in a target language, with voice style and lip movements that match the target language.
Funding Organization
Science and Engineering Research Board (SERB), New Delhi
Anusandhan National Research Foundation (ANRF)
Quick Information
Area of Research
Computer Sciences and Information Technology
Start Year
2024
End Year
2027
Sanction Amount
₹ 23.91 lakh
Status
Ongoing
Output
No. of Research Papers
00
Technologies (If Any)
00
No. of PhD Produced
N/A
Startup (If Any)
00
No. of Patents
Filed: 00
Granted: 00