在python tensorflow中将字节字符串张量转换为普通字符串张量_Python_Keras_Tensorflow2.0

在python tensorflow中将字节字符串张量转换为普通字符串张量

python keras

在python tensorflow中将字节字符串张量转换为普通字符串张量,python,keras,tensorflow2.0,Python,Keras,Tensorflow2.0,我正在尝试将字节字符串张量转换为字符串张量，这样我就可以对字符串执行一些操作，并将更改后的字符串存储回tensorflow 2.3中的相同字节张量 tf.Tensor([b"This is my first string as byte string, b"This is my second string as byte string"'], shape=(2,), dtype=string) 我用了tf.strings，这里的as_string（a）是输入张

我正在尝试将字节字符串张量转换为字符串张量，这样我就可以对字符串执行一些操作，并将更改后的字符串存储回tensorflow 2.3中的相同字节张量

tf.Tensor([b"This is my first string as byte string,  b"This is my second string as byte string"'], shape=(2,), dtype=string)

我用了tf.strings，这里的as_string（a）是输入张量，但它给了我错误

InvalidArgumentError:字符串属性“T”的值不在允许值列表中：int8、int16、int32、int64

这是因为as_字符串只接受少数输入类型，而我正在传递字节字符串

我该怎么做，请帮助我，因为我被卡在上面了

b“这是我作为字节字符串的第一个字符串”是字节类型，但是您使用的数据类型是tf.string

您可以使用Unicode字符串：

tf.Tensor([u"This is my first string as byte string,  u"This is my second string as byte string"], shape=(2,), dtype=string)

谢谢Timur。让我再解释一下。基本上，我用下面的代码将一些文本文件（有纯文本）读入变量谢谢Timur。基本上，我用下面的代码

raw\u train\ds=tf.keras.preprocessing.text\u dataset\u从\u目录将一些文本文件（有纯文本）读入变量（'aclImdb/train'，batch\u size=batch\u size，validation\u split=0.99，subset='training'）

来自本文[。因此，在原始\u train\ds中，我们得到了字节文本张量形式的纯文本。现在，我想迭代这个数据集，并从字符串（字节字符串）中删除停止字，我如何在这个数据集中创建和执行stopwords替换？带字符串的方法可以阅读：-ISBN 978-1617294433-第六章。这是一本非常积极的书。