Aspell将字典文件解码为latin1，即使环境和Aspell配置都将编码指定为UTF-8_Utf 8_Aspell

Aspell将字典文件解码为latin1，即使环境和Aspell配置都将编码指定为UTF-8

utf-8

Aspell将字典文件解码为latin1，即使环境和Aspell配置都将编码指定为UTF-8,utf-8,aspell,Utf 8,Aspell,更新显然，解决方案是使用另一个配置参数在命令行上设置encoding:--encodig=UTF-8 例如： zby@tvm1:/home/xpapers$ aspell --lang=en create master ./dictionary.local < w Warning: The word "PÃ©rez" is invalid. The character '©' (U+A9) may not appear in the middle of a word. Skipping

更新显然，解决方案是使用另一个配置参数在命令行上设置encoding:--encodig=UTF-8

例如：

zby@tvm1:/home/xpapers$ aspell --lang=en create master ./dictionary.local < w 
Warning: The word "PÃ©rez" is invalid. The character '©' (U+A9) may not appear in the middle of a word. Skipping word.

这是第二个带重音的字母e。hextump：

zby@tvm1:/home/xpapers$ hexdump w
0000000 c350 72a9 7a65 000a                    
0000007

这是littleendian-所以你需要翻转字节-但它似乎是正确的UTF-8（50-p，然后是c3 72-带重音的e），并且它在我的控制台中显示OK

在env中，我有：

zby@tvm1:/home/xpapers$ set | grep LANG
LANG=en_US.UTF-8

aspell配置（由aspell dump config转储）附在下面，我认为唯一相关的信息是：

# encoding (string)
#   encoding to expect data to be in
# default: !encoding = UTF-8

所以看起来UTF-8的一切都准备好了——但阿斯佩尔似乎仍然在尝试拉丁语-1

这是在Ubuntu Karmic Coala上：

zby@tvm1:~$ cat /etc/lsb-release
DISTRIB_ID=Ubuntu
DISTRIB_RELEASE=9.10
DISTRIB_CODENAME=karmic
DISTRIB_DESCRIPTION="Ubuntu 9.10"

阿斯佩尔是：

zby@tvm1:~$ aspell -v
@(#) International Ispell Version 3.1.20 (but really Aspell 0.60.6)

=============================================

zby@tvm1:/home/xpapers$ aspell dump config
# conf (string)
#   main configuration file
# default: aspell.conf

# conf-dir (string)
#   location of main configuration file
# default: /etc

# data-dir (string)
#   location of language data files
# default: <prefix:lib/aspell> = /usr/lib/aspell

# dict-alias (list)
#   create dictionary aliases

# dict-dir (string)
#   location of the main word list
# default: <data-dir> = /usr/lib/aspell

# encoding (string)
#   encoding to expect data to be in
# default: !encoding = UTF-8

# filter (list)
#   add or removes a filter

# filter-path (list)
#   path(s) aspell looks for filters

# mode (string)
#   filter mode
# default: url

# extra-dicts (list)
#   extra dictionaries to use

# home-dir (string)
#   location for personal files
# default: <$HOME|./> = /home/zby

# ignore (integer)
#   ignore words <= n chars
# default: 1

# ignore-case (boolean)
#   ignore case when checking words
# default: false

# ignore-repl (boolean)
#   ignore commands to store replacement pairs
# default: false

# keyboard (string)
#   keyboard definition to use for typo analysis
# default: standard

# lang (string)
#   language code
# default: <language-tag> = en_US

# local-data-dir (string)
#   location of local language data files
# default: <actual-dict-dir> = /usr/lib/aspell/

# master (string)
#   base name of the main dictionary to use
# default: <lang> = en_US

# normalize (boolean)
#   enable Unicode normalization
# default: true

# norm-required (boolean)
#   Unicode normalization required for current lang
# default: false

# norm-form (string)
#   Unicode normalization form: none, nfd, nfc, comp
# default: nfc

# norm-strict (boolean)
#   avoid lossy conversions when normalization
# default: false

# per-conf (string)
#   personal configuration file
# default: .aspell.conf

# personal (string)
#   personal dictionary file name
# default: .aspell.<lang>.pws = .aspell.en_US.pws

# prefix (string)
#   prefix directory
# default: /usr

# repl (string)
#   replacements list file name
# default: .aspell.<lang>.prepl = .aspell.en_US.prepl

# run-together (boolean)
#   consider run-together words legal
# default: false

# run-together-limit (integer)
#   maximum number that can be strung together
# default: 2

# run-together-min (integer)
#   minimal length of interior words
# default: 3

# save-repl (boolean)
#   save replacement pairs on save all
# default: true

# set-prefix (boolean)
#   set the prefix based on executable location
# default: true

# size (string)
#   size of the word list
# default: +60

# sug-mode (string)
#   suggestion mode
# default: normal

# sug-edit-dist (integer)
#   edit distance to use, override sug-mode default
# default: 1

# sug-typo-analysis (boolean)
#   use typo analysis, override sug-mode default
# default: true

# sug-repl-table (boolean)
#   use replacement tables, override sug-mode default
# default: true

# sug-split-char (list)
#   characters to insert when a word is split

# use-other-dicts (boolean)
#   use personal, replacement & session dictionaries
# default: true

# variety (list)
#   extra information for the word list

# warn (boolean)
#   enable warnings
# default: true

# affix-compress (boolean)
#   use affix compression when creating dictionaries
# default: false

# clean-affixes (boolean)
#   remove invalid affix flags
# default: true

# clean-words (boolean)
#   attempts to clean words so that they are valid
# default: false

# invisible-soundslike (boolean)
#   compute soundslike on demand rather than storing
# default: false

# partially-expand (boolean)
#   partially expand affixes for better suggestions
# default: false

# skip-invalid-words (boolean)
#   skip invalid words
# default: true

# validate-affixes (boolean)
#   check if affix flags are valid
# default: true

# validate-words (boolean)
#   check if words are valid
# default: true

# backup (boolean)
#   create a backup file by appending ".bak"
# default: true

# byte-offsets (boolean)
#   use byte offsets instead of character offsets
# default: false

# guess (boolean)
#   create missing root/affix combinations
# default: false

# keymapping (string)
#   keymapping for check mode: "aspell" or "ispell"
# default: aspell

# reverse (boolean)
#   reverse the order of the suggest list
# default: false

# suggest (boolean)
#   suggest possible replacements
# default: true

# time (boolean)
#   time load time and suggest time in pipe mode
# default: false


#######################################################################
#
# Filter: email
#   filter for skipping quoted text in email messages
#
# configured as follows:

# f-email-quote (list)
#   email quote characters

# f-email-margin (integer)
#   num chars that can appear before the quote char
# default: 10


#######################################################################
#
# Filter: html
#   filter for dealing with HTML documents
#
# configured as follows:

# f-html-check (list)
#   HTML attributes to always check

# f-html-skip (list)
#   HTML tags to always skip the contents of


#######################################################################
#
# Filter: tex
#   filter for dealing with TeX/LaTeX documents
#
# configured as follows:

# f-tex-check-comments (boolean)
#   check TeX comments
# default: false

# f-tex-command (list)
#   TeX commands


#######################################################################
#
# Filter: sgml
#   filter for dealing with generic SGML/XML documents
#
# configured as follows:

# f-sgml-check (list)
#   SGML attributes to always check

# f-sgml-skip (list)
#   SGML tags to always skip the contents of


#######################################################################
#
# Filter: texinfo
#   filter for dealing with Texinfo documents
#
# configured as follows:

# f-texinfo-ignore (list)
#   Texinfo commands to ignore the parameters of

# f-texinfo-ignore-env (list)
#   Texinfo environments to ignore


#######################################################################
#
# Filter: context
#   experimental filter for hiding delimited contexts
#
# configured as follows:

# f-context-delimiters (list)
#   context delimiters (separated by spaces)

# f-context-visible-first (boolean)
#   swaps visible and invisible text
# default: false

zby@tvm1：/home/xpapers$aspell dump config
#形态（字符串）
#主配置文件
#默认值：aspell.conf
#配置目录（字符串）
#主配置文件的位置
#默认值：/etc
#数据目录（字符串）
#语言数据文件的位置
#默认值：=/usr/lib/aspell
#dict别名（列表）
#创建字典别名
#dict dir（字符串）
#主单词列表的位置
#默认值：=/usr/lib/aspell
#编码（字符串）
#编码以期望数据处于
#默认值：！编码=UTF-8
#过滤器（列表）
#添加或删除筛选器
#过滤路径（列表）
#路径aspell查找筛选器
#模式（字符串）
#过滤模式
#默认值：url
#额外命令（列表）
#要使用的额外词典
#主目录（字符串）
#个人文件的位置
#默认值：=/home/zby
#忽略（整数）
#使用-lang=en创建字典时忽略单词，Aspell将查找en语言文件。在我的Ubuntu系统上，它看起来像：
name en
charset iso8859-1
special ' -*-
soundslike en
affix en

所以Aspell使用这个字符集。要覆盖该设置，请使用--encoding=utf-8选项
然后为输入（和建议的单词）设置编码选项。谢谢-这就解释了。（作为参考，我系统上的文件是：/usr/lib/aspell/en.dat）。@BillMoseley也许你可以帮我做这个：
name en
charset iso8859-1
special ' -*-
soundslike en
affix en