1st Workshop on

Improving Document Analysis

For Indian Languages
In conjunction with ICFHR 2022, 5th to 7th December, 2022.
Explore

Overview

The workshop aims to provide a space for reflecting on the steps needed to adapt and evolve current state-of-the-art methods & software tools available for document analysis for Indian regional languages. The main goal of the workshop is to accelerate the research and development of document analysis tools so that the performance of such tools is at par with other Latin-based languages, which we are currently lagging. We will discuss SOTA work in the broad spectrum of document analysis and its import to Indian languages.
Topics of discussion include
➼ Document Layout Analysis
➼ Optical Character Recognition(OCR) of Printed and Handwritten Documents
➼ OCR post-editing
➼ Name-Entity Recognition
➼ Form Field Detection
➼ Web-based Application for Document Analysis Tools

We will also discuss available datasets or corpora for major Indian languages, Deep Neural-Network based Computer Vision and Language Models that can be used for various downstream tasks of document analysis; They form the foundational basis for performing Document Analysis for the regional languages.

The driving idea behind the workshop comes from the fact that there is a large scope for improvement in adapting and creating new techniques and methods in the many fields of document analysis from the context of Indian regional languages. With its unique structure and clear diversity from Latin-based languages, there is a need to focus on clearly understanding the space and filling that gap that exists.

Workshop organizers are also offering a 6-hour introductory course on A Beginners Guide to OCR Research and Application Development for Indian Languages.

Important Dates

Date and Time Actionable Item
23:59, 20th Oct 2022 Paper Submission Deadline
23:59, 1st Nov 2022 Author Notification
23:59, 5th Nov 2022 Camera Ready Paper Due
23:59, 10th Nov 2022 Registration Deadline
5th to 7th Dec 2022 Workshop Dates

Submissions

Instructions for Submission

The corresponding paper to be submitted must follow the template here
Keep visiting for further clarity and instructions with respect to submissions and subsequent registrations

Program Chairs

...

Ganesh Ramakrishnan

Institute Chair Professor, Department of Computer Science, IIT Bombay

...

Parag Chaudhuri

Associate Professor, Department of Computer Science, IIT Bombay

...

Dr Venkatapathy Subramanian

Senior Project Research Scientist

Program Committee

...

C V Jawahar

Professor, IIIT Hyderabad

...

Ajoy Mondal

Post doctoral Fellow, IIIT Hyderabad

...

Chetan Arora

Associate Professor, Department of Computer Science, IIT Delhi

...

Ravi Kiran S

Assistant Professor, IIIT Hyderabad

...

G. S. Lehal

Professor, Computer Science Department, Punjabi University, Patiala

...

Tushar Patnaik

Joint Director, CDAC, Noida

Organizers

...

Badri Vishal Kasuba

Masters Student, IIT Bombay

...

Dhruv Kudale

Masters Student, IIT Bombay

Contact Us

...

IIT Bombay

Main Gate Rd, IIT Area, Powai, Mumbai, Maharashtra 400076

...

Dr Venkatapathy Subramanian

venkatapathy@cse.iitb.ac.in