In a stroke of necessity fueled genius I thought, “Why not try a Modern Greek OCR engine?” Since all the letters are the same and there is more impetus to design a high-functionality OCR engine for Modern Greek, it would seem that it would at least accurately give the words (for the most part) and maybe would even do it better than the results shown above. So, after my initial results, I entered a dark-horse candidate into the running. ![]() To put it bluntly, and with all due respect to those who have done the work in getting these OCR engines together, the results were underwhelming. Let’s take a look at the results turned out by these two. I did nothing to clean up the quality of the pictures, like is generally suggested for better OCR results. The first text is obviously cleaner and easier to read. John Moschus “Spiritual Meadows” chapter 7 To test the OCR engines I fed in one picture of clean, neat, very legible Greek text (taken with my phone) and one example of a more difficult example copied from a pdf from the Patrologia Graeca. I’ll cover the third OCR tool later.įor an updated take on using a newer version of Tesseract and its seriously good results (my new default OCR tool), check out the Power of Tesseract. These two recommend themselves as two OCR engines which have been trained specifically for OCR-ing Ancient Greek text.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |