End-to-end scene text recognition is usually divided in two different sub tasks: word detection and word recognition. Currently OpenCV text detection does not use state of the art deep network(s), this proposal is to implement a deep network for text detection. The current text recognition uses a big network, another proposal is to modify the existing network to calculate PHOC representation, which will make the network lexicon independent and also reduce the network size considerably.

Organization

Student

Suman Kumar Ghosh

Mentors

  • Prasanna
close

2017