dawg2wordlist.1.asc 971 B

123456789101112131415161718192021222324252627282930313233343536373839404142434445
  1. DAWG2WORDLIST(1)
  2. ================
  3. :doctype: manpage
  4. NAME
  5. ----
  6. dawg2wordlist - convert a Tesseract DAWG to a wordlist
  7. SYNOPSIS
  8. --------
  9. *dawg2wordlist* 'UNICHARSET' 'DAWG' 'WORDLIST'
  10. DESCRIPTION
  11. -----------
  12. dawg2wordlist(1) converts a Tesseract Directed Acyclic Word
  13. Graph (DAWG) to a list of words using a unicharset as key.
  14. OPTIONS
  15. -------
  16. 'UNICHARSET'
  17. The unicharset of the language. This is the unicharset
  18. generated by mftraining(1).
  19. 'DAWG'
  20. The input DAWG, created by wordlist2dawg(1)
  21. 'WORDLIST'
  22. Plain text (output) file in UTF-8, one word per line
  23. SEE ALSO
  24. --------
  25. tesseract(1), mftraining(1), wordlist2dawg(1), unicharset(5),
  26. combine_tessdata(1)
  27. <https://tesseract-ocr.github.io/tessdoc/Training-Tesseract.html>
  28. COPYING
  29. -------
  30. Copyright \(C) 2012 Google, Inc.
  31. Licensed under the Apache License, Version 2.0
  32. AUTHOR
  33. ------
  34. The Tesseract OCR engine was written by Ray Smith and his research groups
  35. at Hewlett Packard (1985-1995) and Google (2006-2018).