keyboard_arrow_up
Character and Image Recognition for Data Cataloging in Ecological Research

Authors

Shannon Heh, Lynbrook High School, USA

Abstract

Data collection is an essential, but manpower intensive procedure in ecological research. An algorithm was developed by the author which incorporated two important computer vision techniques to automate data cataloging for butterfly measurements. Optical Character Recognition is used for character recognition and Contour Detection is used for image-processing. Proper pre-processing is first done on the images to improve accuracy. Although there are limitations to Tesseract’s detection of certain fonts, overall, it can successfully identifywords of basic fonts. Contour detection is an advanced technique that can be utilized to measure an image. Shapes and mathematical calculations are crucial in determining the precise location of the points on which to draw the body and forewing lines of the butterfly. Overall, 92% accuracy were achieved by the program for the set of butterflies measured.

Keywords

Computer Vision, Image Recognition, Character Recognition, Ecology, Butterfly Cataloging

Full Text  Volume 8, Number 6