djvu2hocr(1) debian man page | unix.com

Man Page: djvu2hocr

Operating Environment: debian

Section: 1

DJVU2HOCR(1)							 djvu2hocr manual						      DJVU2HOCR(1)

NAME
djvu2hocr - DjVu to hOCR converter
SYNOPSIS
djvu2hocr [option...] djvu-file djvu2hocr {--version | --help | -h}
DESCRIPTION
djvu2hocr converts hidden text from a DjVu file to the hOCR[1] format.
OPTIONS
Text segmentation options --word-segmentation=simple Use the same word segmentation as found in the DjVu file. This is the default. --word-segmentation=uax29 Use the Unicode Text Segmentation[2] algorithm to break lines into words, possibly fixing word segmentation found in the DjVu file. Other options --version Output version information and exit. -h, --help Display help and exit.
PORTABILITY
djvu2hocr uses a custom extension to hOCR to retain characters which cannot be directly represented in an HTML/XML document. For example, control character BEL (^G, U+0007), is converted into the following HTML chunk: <span class="djvu_char" title="#x07"> </span>
SEE ALSO
djvu(1)
AUTHOR
Jakub Wilk <jwilk@jwilk.net> Author.
NOTES
1. hOCR http://docs.google.com/View?docid=dfxcv4vc_67g844kf 2. Unicode Text Segmentation http://unicode.org/reports/tr29/ djvu2hocr 0.7.9 03/10/2012 DJVU2HOCR(1)
Related Man Pages
djvm(1) - debian
djvu2hocr(1) - debian
djvuserve(1) - debian
djvuserve(1) - suse
djvutoxml(1) - suse
Similar Topics in the Unix Linux Community
Segmentation fault on basic linux commands
Why segmentation(coredump) in the following code in C?
Segmentation fault in Unix shell (linux OS)
C. To segmentation fault or not to segmentation fault, that is the question.
Why does this example C code run and yet SHOULD either not compile or give a segmentation fault?