You are here

Workshop: Converting scientific-paper/PDF documents into accessible formats

Presented by Masakazu Suzuki, Katsuhito Yamaguchi and Toshihiro Kanahori

The workshop will use the new Infty Reader 3, a math OCR system, to convert PDF documents or scanned paper documents into electronic form. The InftyReader can recognize those documents including technical notations such as complicated math formulae. The output can be opened in Infty Editor or ChattyInfty so that OCR corrections can be made, or it can be exported directly to various other accessible formats. The resulting exported document can be read by sighted people in a variety of word processors or as a web document in any web browser. An exported document can be read by various print-disabled people in ChattyInfty3, MS Word, in Internet Explorer+MathPlayer, or it can be embossed in braille. The new Infty Reader version is more accurate in recognizing complex layout and can be coupled with the FineReader OCR engine for improved recognition. Several points to get much better recognition results in InftyReader are also discussed.