×

img Acces sibility Controls

Research Projects Banner

Research Projects

Information Access from Document Images of Indian languages

Implementing Organization

Dept. of E & ECE, Indian Institute of Technology (IIT), Kharagpur
Central Electronics Engineering Research Institute (CEERI)
Indian Statistical Institute (ISI)
International Institute of Information Technology Hyderabad
Principal Investigator
Prof. Prabir Kumar Biswas
Professor and Head
|
Department of Geology and Geophysics, Indian Institute of Technology (IIT), Kharagpur, West Bengal
Department of Electronics and Electrical Communication Engineering
CO-Principal Investigator
Dr. C. V. Jawahar
IIIT Hyderabad
CO-Principal Investigator
Prof. Santanu Chaudhury
CEERI Pilani
CO-Principal Investigator
Prof. Jayanta Mukhopadhyay
Professor
|
Department of Geology and Geophysics, Indian Institute of Technology (IIT), Kharagpur, West Bengal
Department of Computer Science and Engineering
CO-Principal Investigator
Prof. Bhabotosh Chanda
Professor
|
Indian Statistical Institute (ISI)
Electronics and Communication Sciences Unit
CO-Principal Investigator
Prof. Shamik Sural
IISC Bangalore, Karnataka, Karnataka, Karnataka & IIT Kharagpur, West Bengal
Dept. of CSE

Project Overview

Development content aware image processing algorithms for robust and efficient recognition and retrieval from Indian language document images is proposed. Our image processing algorithms aim at improving the quality of document images by removing the noise and low resolution artifacts by adopting content aware shape-based morphological filters. A set of recognizers will be built using state of the art machine learning techniques such as deep learning for handwritten, typewritten and low resolution document images where the existing technologies are insufficient. For hard and noisy handwritten documents, we propose holistic keyword spotting techniques to reduce search space and complement the recognition based approaches. We will also build and demonstrate information access and retrieval schemes over a joint space of image features and noisy text, so as to enable a set of immediate practical applications. The methods will be validated on two different focussed collections during the project.
Funding Organization
Funding Organization
Ministry of Electronics and Information Technology (MeitY)
Quick Information
Area of Research
Computer Sciences and Information Technology
Focus Area
Multimodal, Multilingual and Cross-lingual Interfaces
Sanction Amount
₹ 4.00 Cr
Status
Ongoing
Output
No. of Research Paper
00
Technologies (If Any)
00
No. of PhD Produced
N/A
Startup (If Any)
00
No. of Patents
Filed :00
Grant :00
arrowtop