Package guessencoding: Information
Source package: guessencoding
Version: 1.4-alt2_16jpp8
Build time: Feb 17, 2019, 06:37 AM in the task #221714
Category: Development/Other
Home page: http://docs.codehaus.org/display/GUESSENC/
License: ASL 2.0
Summary: Guess encoding of files and return configured reader
Description:
The purpose of this library is to "guess" the encoding of files and return a reader that is configured to use the guessed encoding.

The library recognizes the following Unicode encoding variants:
* UTF-8
* UTF-16LE (little-endian)
* UTF-16BE (big-endian)
* UTF-32

If no Unicode encoding is recognized, the file is assumed to use an 8-bit encoding. If that 8-bit encoding is not US-ASCII, the platform's default 8-bit encoding is assumed, whatever it is. The library cannot distinguish between different 8-bit encodings: doing so requires statistical analysis, n-grams, or similar techniques specific to the language of the files, and the library does not support these.
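The fallback chain described above can be illustrated with a minimal Java sketch. It is not the library's API or its actual detection method: the class `BomSniffer` and its methods are hypothetical names, and the sketch assumes that Unicode variants are identified only by a leading byte order mark (BOM), which may differ from what guessencoding does internally.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.Reader;
import java.nio.charset.Charset;

// Hypothetical helper, not part of guessencoding: BOM-based guess only.
final class BomSniffer {

    // True if data begins with the given unsigned byte values.
    private static boolean startsWith(byte[] data, int... prefix) {
        if (data.length < prefix.length) {
            return false;
        }
        for (int i = 0; i < prefix.length; i++) {
            if ((data[i] & 0xFF) != prefix[i]) {
                return false;
            }
        }
        return true;
    }

    // Length in bytes of the BOM at the start of data, or 0 if none.
    static int bomLength(byte[] data) {
        if (startsWith(data, 0x00, 0x00, 0xFE, 0xFF)
                || startsWith(data, 0xFF, 0xFE, 0x00, 0x00)) {
            return 4; // UTF-32
        }
        if (startsWith(data, 0xEF, 0xBB, 0xBF)) {
            return 3; // UTF-8
        }
        if (startsWith(data, 0xFE, 0xFF) || startsWith(data, 0xFF, 0xFE)) {
            return 2; // UTF-16
        }
        return 0;
    }

    // Returns a charset name following the fallback order in the description.
    static String detect(byte[] data) {
        // UTF-32 is tested before UTF-16 because the UTF-32LE BOM starts
        // with the UTF-16LE BOM (FF FE). A UTF-16LE file whose first
        // character is NUL would be misread here; the sketch accepts that.
        if (startsWith(data, 0x00, 0x00, 0xFE, 0xFF)) return "UTF-32BE";
        if (startsWith(data, 0xFF, 0xFE, 0x00, 0x00)) return "UTF-32LE";
        if (startsWith(data, 0xEF, 0xBB, 0xBF)) return "UTF-8";
        if (startsWith(data, 0xFE, 0xFF)) return "UTF-16BE";
        if (startsWith(data, 0xFF, 0xFE)) return "UTF-16LE";
        // No BOM: treat as 8-bit. Pure 7-bit content is US-ASCII.
        for (byte b : data) {
            if (b < 0) { // high bit set, so not US-ASCII
                return Charset.defaultCharset().name();
            }
        }
        return "US-ASCII";
    }

    // Returns a reader over data that skips the BOM and decodes with the guess.
    static Reader reader(byte[] data) {
        int skip = bomLength(data);
        return new InputStreamReader(
                new ByteArrayInputStream(data, skip, data.length - skip),
                Charset.forName(detect(data)));
    }

    public static void main(String[] args) throws IOException {
        byte[] sample = {(byte) 0xFF, (byte) 0xFE, 'h', 0, 'i', 0};
        System.out.println(detect(sample));
        try (Reader r = reader(sample)) {
            char[] buf = new char[2];
            int n = r.read(buf);
            System.out.println(new String(buf, 0, Math.max(n, 0)));
        }
    }
}
```

As in the description, the last branch cannot tell one 8-bit encoding from another; it simply falls back to the platform default.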
Maintainer: Igor Vlasenko
Last changed
Feb. 5, 2019 Igor Vlasenko 1.4-alt2_16jpp8
- fc29 update
April 19, 2018 Igor Vlasenko 1.4-alt2_15jpp8
- java update
Nov. 9, 2017 Igor Vlasenko 1.4-alt2_14jpp8
- fc27 update