Problem with a PHP function that removes accents and other characters

php string unicode utf-8

I found a simple function that removes some unwanted characters from a string:

function strClean($input){

$input = strtolower($input);
$b = array("á","é","í","ó","ú", "ñ", " "); //etc...
$c = array("a","e","i","o","u","n", "-"); //etc...

$input = str_replace($b, $c, $input);

return $input;
}

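When I use it on accented or other special characters, such as the word "áéñí", it prints question marks or other odd characters instead.

Note: both strclean.php (which contains this function) and index.php are saved as UTF-8. index.php looks like this: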

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
    <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
    <title></title>
</head>
<body>
    <?php
    include('strclean.php');

    echo 'óóóáà';
    echo strClean('óóóáà');


    ?>
</body>
</html>

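What am I doing wrong?

Does the replacement happen at all? That is, do you get the same odd characters when you print $input before the replacement? If so, the character set of the PHP source file does not match that of the input, and you may need to run iconv() on the input before replacing.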
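A rough sketch of that check, assuming the mbstring and iconv extensions are loaded: mb_check_encoding() reports whether a string is valid UTF-8, and iconv() converts it when it is not. The helper name ensureUtf8 and the ISO-8859-1 fallback are illustrative assumptions, not something taken from the question.

```php
<?php
// Hypothetical helper: return $input as UTF-8.
// Assumption: anything that is not valid UTF-8 is encoded as ISO-8859-1.
function ensureUtf8($input, $fallback = 'ISO-8859-1') {
    if (mb_check_encoding($input, 'UTF-8')) {
        return $input; // already valid UTF-8, leave untouched
    }
    return iconv($fallback, 'UTF-8', $input);
}

$latin1 = "\xE1\xE9";              // "áé" encoded as ISO-8859-1
echo bin2hex(ensureUtf8($latin1)); // c3a1c3a9, the UTF-8 bytes of "áé"
```

Calling ensureUtf8() on the input before strClean() would at least make sure the search strings and the input use the same encoding, provided the source file itself is saved as UTF-8.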

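Edit: I uploaded both of your files to my web server, and both printing and cleaning work correctly. This was with PHP 4.4.9 and Firefox 3.0.6. A few more potential problems come to mind:

Does it work in Firefox? I vaguely remember that IE6 (and possibly later versions) expects the charset in the HTML head section to be written in lowercase ("utf-8").
Does your editor insert a byte order mark (BOM) into the code files? Mine doesn't, and PHP may choke on it.
Can you check the HTTP headers for anything unusual, such as a wrong MIME type? The Tamper Data add-on for Firefox can help with that.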
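The BOM question can be checked from PHP itself. The following is a minimal sketch: the helper name hasUtf8Bom is hypothetical, and the two file names are the ones from the question, assumed to sit in the current directory.

```php
<?php
// Hypothetical helper: true if $bytes starts with the UTF-8 byte order mark.
function hasUtf8Bom($bytes) {
    return strncmp($bytes, "\xEF\xBB\xBF", 3) === 0;
}

// Inspect the first three bytes of each file, skipping files that are missing.
foreach (array('strclean.php', 'index.php') as $file) {
    if (is_readable($file)) {
        $head = file_get_contents($file, false, null, 0, 3);
        echo $file, ': ', hasUtf8Bom($head) ? 'BOM found' : 'no BOM', "\n";
    }
}
```

If a BOM is reported, re-saving the file as "UTF-8 without BOM" in the editor is the usual remedy.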

iconv('UTF-8', 'ASCII//TRANSLIT', $input);
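A hedged sketch of how that call might be wrapped. What //TRANSLIT produces depends on the iconv implementation and on the LC_CTYPE locale, so the result is best effort; the function name toAscii and the en_US.UTF-8 locale are assumptions, and the file is assumed to be saved as UTF-8.

```php
<?php
// Assumed locale; setlocale() returns false if it is not installed.
setlocale(LC_CTYPE, 'en_US.UTF-8');

// Hypothetical wrapper around the iconv() call shown above.
function toAscii($input) {
    $ascii = iconv('UTF-8', 'ASCII//TRANSLIT', $input);
    // iconv() returns false when it cannot convert the input,
    // in which case the original string is returned unchanged.
    return $ascii === false ? $input : $ascii;
}

echo toAscii('plain text'), "\n"; // ASCII input passes through unchanged
echo toAscii('áéñí'), "\n";       // transliteration result varies by platform
```

Because the output is platform dependent, it is worth testing on the target server before relying on it for slugs or file names.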

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
    <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
    <title></title>
</head>
<body>

<?php
    function strClean($input) {
        // mb_strtolower() lowercases multibyte characters such as "Á" correctly
        $input = mb_strtolower($input, 'UTF-8');
        $b = array("á","é","í","ó","ú", "ñ", " ");
        $c = array("a","e","i","o","u","n", "-");
        return str_replace($b, $c, $input);
    }

    $string = 'á é í ó ú ñ abcdef ghij';
    echo $string ."<br />". strClean($string);
?>

</body>
</html>

    function quit_accenture($str){
      // The patterns only match if this file is saved as ISO-8859-1,
      // the same charset the input is converted to below.
      // No "|" inside the classes: there it is a literal pipe, not alternation.
      $pattern = array();
      $pattern[0] = '/[ÁÂÀÅÄ]/';
      $pattern[1] = '/[ÉÊÈ]/';
      $pattern[2] = '/[ÍÎÌÏ]/';
      $pattern[3] = '/[ÓÔÒÖ]/';
      $pattern[4] = '/[ÚÛÙÜ]/';
      $pattern[5] = '/[áâàåä]/';
      $pattern[6] = '/[ðéêèë]/';
      $pattern[7] = '/[íîìï]/';
      $pattern[8] = '/[óôòøõö]/';
      $pattern[9] = '/[úûùü]/';
      $replacement = array();
      $replacement[0] = 'A';
      $replacement[1] = 'E';
      $replacement[2] = 'I';
      $replacement[3] = 'O';
      $replacement[4] = 'U';
      $replacement[5] = 'a';
      $replacement[6] = 'e';
      $replacement[7] = 'i';
      $replacement[8] = 'o';
      $replacement[9] = 'u';
      return preg_replace($pattern, $replacement, $str);
    }
    $txt = $_POST['your_htmled_text'];
    //Convert to your system's charset. I checked this on the php.ini
    $txt = iconv('UTF-8', 'ISO-8859-1//TRANSLIT', $txt);
    //Apply your function
    $txt = quit_accenture($txt);
    //output
    print_r($txt);
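If the input should stay in UTF-8 rather than be converted to ISO-8859-1, a variant with the PCRE /u modifier may be an option. This is only a sketch: the name stripAccentsUtf8 is hypothetical, the file is assumed to be saved as UTF-8, and the input is assumed to use precomposed characters (combining accents are not handled).

```php
<?php
// Hypothetical UTF-8 variant: the /u modifier makes PCRE treat pattern and
// subject as UTF-8, so each accented letter is matched as a single character.
function stripAccentsUtf8($str) {
    $map = array(
        '/[ÁÂÀÅÄ]/u'  => 'A',
        '/[ÉÊÈ]/u'    => 'E',
        '/[ÍÎÌÏ]/u'   => 'I',
        '/[ÓÔÒÖ]/u'   => 'O',
        '/[ÚÛÙÜ]/u'   => 'U',
        '/[áâàåä]/u'  => 'a',
        '/[éêèë]/u'   => 'e',
        '/[íîìï]/u'   => 'i',
        '/[óôòøõö]/u' => 'o',
        '/[úûùü]/u'   => 'u',
    );
    // preg_replace() returns null if $str is not valid UTF-8
    return preg_replace(array_keys($map), array_values($map), $str);
}

echo stripAccentsUtf8('Óóóáà'); // Oooaa
```

With this approach no iconv() round trip is needed, but invalid UTF-8 input has to be handled by the caller because preg_replace() then returns null.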
