PHP: problem with Cyrillic characters


The following code does not detect the encoding correctly:

$data = 'ABCDEG АБВГДЕ';
$charset = mb_detect_encoding($data);
$data = iconv($charset, "UTF-8", $data);
$data = mb_strtolower($data, 'UTF-8');
$datasort = str_replace(array("\r", "\n", " "), '', $data);
$counter = mb_strlen($datasort,'UTF-8');
foreach (count_chars($datasort, 1) as $i => $val) 
{
echo '
<tr>
    <th scope="row">'.mb_detect_encoding(chr($i)).'</th>
// ON LATIN SYMBOLS IT DETECTED ASCII AND ON CYRILLIC IT DETECTED **NOTHING**
</tr>
';
}
Where is the problem?
// The PHP file is saved in UTF-8 encoding.

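Try this; every character is reported as either ASCII or UTF-8. The problem with your code is that count_chars() works on bytes, not characters: it splits each two-byte Cyrillic character apart, so chr($i) hands mb_detect_encoding() a lone byte instead of a complete character.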

A:ASCII B:UTF-8 C:UTF-8 D:UTF-8 E:UTF-8 G:UTF-8 А:UTF-8 Б:UTF-8 b:UTF-8 Γ:ASCII Б:ASCII Б:UTF-8
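
The byte-versus-character point can be checked directly. The following is a minimal sketch rather than part of the original answer; it writes the character as explicit byte escapes so the result does not depend on how the file is saved, and it uses mb_check_encoding() instead of mb_detect_encoding() because the latter's guesses can vary between PHP versions.

```php
<?php
// Sketch: why count_chars() plus chr() cannot identify Cyrillic characters.
$char = "\xD0\x91"; // 'Б' (U+0411), two bytes in UTF-8

// strlen() counts bytes, mb_strlen() counts characters.
$byteLength = strlen($char);
$charLength = mb_strlen($char, 'UTF-8');

// count_chars() in mode 1 returns byte value => occurrences, so a single
// Cyrillic character appears as two separate entries.
$byteCounts = count_chars($char, 1);

// A lone byte taken from that character is not valid UTF-8 on its own,
// while the complete character is.
$loneByteIsUtf8  = mb_check_encoding(chr(0xD0), 'UTF-8');
$wholeCharIsUtf8 = mb_check_encoding($char, 'UTF-8');

printf("bytes=%d chars=%d hex=%s\n", $byteLength, $charLength, bin2hex($char));
var_dump(array_keys($byteCounts), $loneByteIsUtf8, $wholeCharIsUtf8);
```

Because the loop in the question passes chr($i) for each byte key, the Cyrillic half of the string is only ever seen as fragments like the lone byte above.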


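ASCII is a subset of UTF-8, so if a document is ASCII, it is already UTF-8.

In that case every letter returns ASCII, but if you take the whole word before the loop and try to detect its encoding, it returns UTF-8.

@headmax That's right. But the main question is: why do I get nothing when I try to detect the encoding of the Cyrillic characters?

Why do you want to detect and guess the character encoding? You just read text using the encoding it was written with. Have you lost that essential metadata? In this case, the encoding is the one you told your editor to use.
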
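function mbStringToArray ($string) {
    // Split a UTF-8 string into an array of characters (not bytes).
    $array = array(); // initialise so an empty string returns an empty array
    $strlen = mb_strlen($string, "UTF-8");
    while ($strlen) {
        $array[] = mb_substr($string, 0, 1, "UTF-8");
        $string = mb_substr($string, 1, $strlen, "UTF-8");
        $strlen = mb_strlen($string, "UTF-8");
    }
    return $array;
}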

$data = 'ABCDEG АБВГДЕ';
$data = str_replace(array("\r", "\n", " "), '', mb_strtolower($data, "UTF-8"));
// iconv() returns the converted string; it does not modify $data in place.
$data = iconv(mb_detect_encoding($data, mb_detect_order(), true), "UTF-8", $data);

$data = mbStringToArray($data);

echo '<table>';
foreach ($data as $val) 
{
// Detect on the character itself; chr($i) would only test the array index as a byte.
echo '
<tr>
    <th scope="row">'. $val . ' : ' .mb_detect_encoding($val).'</th>
</tr>
';
}
echo '</table>';
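
Since the question's loop was built on count_chars(), the underlying goal seems to be a per-character count. Below is a rough sketch of one way to do that on whole characters. The helper name countUtf8Chars() is hypothetical and not from the thread, and it assumes the input is already valid UTF-8 (as the asker states for the source file), so any single-byte character is labelled ASCII and anything longer UTF-8.

```php
<?php
// Hypothetical helper (not from the original answer): count each character
// of a UTF-8 string and label it as ASCII or multibyte UTF-8.
function countUtf8Chars(string $text): array
{
    $text = mb_strtolower($text, 'UTF-8');
    $text = str_replace(array("\r", "\n", " "), '', $text);

    // The /u modifier makes PCRE treat the subject as UTF-8, so the empty
    // pattern splits between characters rather than between bytes.
    $chars = preg_split('//u', $text, -1, PREG_SPLIT_NO_EMPTY);
    if ($chars === false) {
        return array(); // input was not valid UTF-8
    }

    $result = array();
    foreach (array_count_values($chars) as $char => $count) {
        $char = (string) $char; // digit characters become integer keys
        $result[$char] = array(
            'count'    => $count,
            // In valid UTF-8, only ASCII characters occupy a single byte.
            'encoding' => strlen($char) === 1 ? 'ASCII' : 'UTF-8',
        );
    }
    return $result;
}

// Assumes this file is saved as UTF-8, as in the question.
foreach (countUtf8Chars('ABCDEG АБВГДЕ') as $char => $info) {
    echo $char . ' : ' . $info['encoding'] . ' x' . $info['count'] . "\n";
}
```

On PHP 7.4 and later, mb_str_split() could replace the preg_split() call; the regular expression is used here only because it also works on older versions.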