Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/python/355.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python 注释自己的数据集以转换为forge';s格式_Python_Machine Learning_Nlp - Fatal编程技术网

Python 注释自己的数据集以转换为forge';s格式

Python 注释自己的数据集以转换为forge';s格式,python,machine-learning,nlp,Python,Machine Learning,Nlp,我正在使用从邮件中提取签名。但它似乎无法正确处理我的数据。我想将talon的分类器训练到我的数据集,但我需要将数据集更改为。如何将“我的数据”更改为以下forge格式: #sig#-- #sig#Mike Smith #sig#555-243-0623 我有来自 编辑-1: 我的数据集采用以下格式: Hi there!\n\nOur new project description is as follows:\n\nABC is a .. company committed to produc

我正在使用从邮件中提取签名。但它似乎无法正确处理我的数据。我想将talon的分类器训练到我的数据集,但我需要将数据集更改为。如何将“我的数据”更改为以下forge格式:

#sig#--
#sig#Mike Smith
#sig#555-243-0623
我有来自

编辑-1:

我的数据集采用以下格式:

Hi there!\n\nOur new project description is as follows:\n\nABC
is a .. company committed to producing works both new 
and\nrediscovered . We b 
performances\nare exactly alike; with every ads, new 
thoughtsfeelings......
**\n\n \n\nBest, \n\XYZ\n\n     \n\XYZ, xxx 
xxx, xxx, xxx, xxx\n\nUnsubscribe (https://media.XYZ.org/hs/manage- 
preferences/unsubscribe-all? 
d=Vnk5T44_PyVnW1b1YkV4mCt9cW3Z_sBZ3zhsYkW3Fbt723zd- 
J2W3P3Q7w3ZsjYmW3_fWSh6pjXx2VmWcP5724cW0N6P6s3r25DrQW7ZSHL  
sFPu8fNOrW7ByvX2ISZHyRP5Kjm4vKXy60COlsUqmvd1t9W- 
) Manage preferences (https://media.xyz.org/hs/manage/unsubscribe? 
d=Vnk5T44_PyVnW1b1YkV4mCt9cW3Z- 
J2W3P3Q7w3ZsjYmW3_fWSh6pjXx2VmWcP5724cW0N6P6s3r25DrQW7ZSHL- 
=hs_email&utm_medium=email&utm_content=756&_hsenc=JegN- 
sFPu8fNOrW7ByvX2ISZHyRP5KjmW
)\n\n\n\n--\n\nTheABC Company\nxx-xxx-xxx | 
ABC@gmail.com | ABC.org 
我想:

#sig#Best
#sig#....
#sig#The ABC Company
#sig#ABC@gmail.com | ABC.org

共享一些示例数据集…共享一些示例数据集。。。