Breaking down Tesseract OCR Tesseract, an open source OCR project was originally developed by HP between 1984 and 1994 as a part of PhD research project at HP Labs, Bristol. vision ocr machine-learning papers