Warning: file_get_contents(/data/phpspider/zhask/data//catemap/4/string/5.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python 是否有从字符串(\u0410)中删除特殊字符的方法?_Python_String_Re - Fatal编程技术网

Python 是否有从字符串(\u0410)中删除特殊字符的方法?

Python 是否有从字符串(\u0410)中删除特殊字符的方法?,python,string,re,Python,String,Re,我正在尝试清理字符串,但删除特殊字符时遇到问题。已注意小写字母和删除\n my_string = "\u0410\u041d\u041e\u0422\u0410\u0426\u0418\u042f: \n* Lectures \u2013 20 hours \n* Workshops \u2013 8 hours (these workshops are so designed as to provoke active student participation;" 预期结

我正在尝试清理字符串,但删除特殊字符时遇到问题。已注意小写字母和删除\n

my_string = "\u0410\u041d\u041e\u0422\u0410\u0426\u0418\u042f: \n* Lectures \u2013 20 hours \n* Workshops \u2013 8 hours (these workshops are so designed as to provoke active student participation;"
预期结果:

"lectures 20 hours workshops 8 hours these workshops are so designed as to provoke active student participation"
是否有删除所有特殊字符(\u0410等)的方法?

提取单词(使用正则表达式),并:

输出

lectures 20 hours workshops 8 hours these workshops are so designed as to provoke active student participation
或者更好,正如@tobias_k所建议的,首先使用lower:

result = " ".join(re.findall(r"[a-z0-9]+", my_string.lower()))

或先降低,然后仅使用a-z
result = " ".join(re.findall(r"[a-z0-9]+", my_string.lower()))