Package hunalign: Information
Source package: hunalign
Version: 1.2-alt2
Build time: Jan 16, 2021, 09:41 PM
Category: Text tools
Report package bugHome page: http://mokk.bme.hu/resources/hunalign/
License: LGPL
Summary: The hunalign sentence aligner
Description:
hunalign aligns bilingual text on the sentence level. Its input is tokenized and sentence-segmented text in two languages. In the simplest case, its output is a sequence of bilingual sentence pairs (bisentences). In the presence of a dictionary, hunalign uses it, combining this information with Gale-Church sentence-length information. In the absence of a dictionary, it first falls back to sentence-length information, and then builds an automatic dictionary based on this alignment. Then it realigns the text in a second pass, using the automatic dictionary. Like most sentence aligners, hunalign does not deal with changes of sentence order: it is unable to come up with crossing alignments, i.e., segments A and B in one language corresponding to segments B' A' in the other language. There is nothing Hungarian-specific in hunalign, the name simply reflects the fact that it is part of the hun* NLP toolchain.
List of rpms provided by this srpm:
hunalign (e2kv5, e2kv4, e2k)
hunalign-data (noarch)
hunalign-debuginfo (e2kv5, e2kv4, e2k)
hunalign (e2kv5, e2kv4, e2k)
hunalign-data (noarch)
hunalign-debuginfo (e2kv5, e2kv4, e2k)
Maintainer: Vitaly Lipatov
Last changed
Oct. 4, 2019 Michael Shigorin 1.2-alt2
- E2K: avoid superflous optimization level
Oct. 27, 2013 Vitaly Lipatov 1.2-alt1
- new version 1.2 (with rpmrb script)
April 16, 2012 Vitaly Lipatov 1.1-alt1
- initial build for ALT Linux Sisyphus