Zero-Shot Learning Based Approach For Medieval Word Recognition Using Deep-Learned Features

View Researcher's Other Codes

Disclaimer: The provided code links for this paper are external links. Science Nest has no responsibility for the accuracy, legality or content of these links. Also, by downloading this code(s), you agree to comply with the terms of use as set out by the author(s) of the code(s).

Please contact us in case of a broken link from here

Authors Sukalpa Chanda, Jochem Baas, Sebastien Hamely, Dominique Stutzmanny, Lambert Schomaker, Daniël Haitink
Journal/Conference Name 16th International Conference on Frontiers in Handwriting Recognition (ICFHR 2018) 2018 10
Paper Category
Paper Abstract Historical manuscripts reflect our past. Recently digitization of large quantities of historical handwritten docu- ments is taking place in every corner of the world, and are being archived. From those digital repositories, automatic text indexing and retrieval system fetch only those documents to an end user that they are interested in. A regular OCR technology is not capable of rendering this service to an end user in a reliable manner. Instead, a word recognition/spotting algorithm performs the task. Word recognition based systems require enough labelled data per class to train the system. Moreover, all word classes need to be taught beforehand. Though word spotting could evade this drawback of prior training, these systems often need to have additional overheads like a language model to deal with “out of lexicon” words. Zero-shot learning could be a possible alternative to counter such situation. A Zero-shot learning algorithm is capable of handling unseen classes, provided the algorithm has been fortified with rich discriminating features and reliable “attribute description” per class during training. Since deeply learned features have enough discriminating power, a deep learning framework has been used here for feature extraction purpose. To the best of our knowledge, this is probably the first work on “out of lexicon” medieval word recognition using a Zero-Shot Learning framework. We obtained very encouraging results(accuracy ≈57% for “out of lexicon” classes) while dealing with 166 training classes and 50 unseen test classes.
Date of publication 2018
Code Programming Language Python
Comment

Copyright Researcher 2022