How to extract words based on a suffix in Python - Fatal编程技术网


I have the following Python code:

import re;
import nltk;
from nltk.util import ngrams;
file="C:/Python26/test.txt";
f=open("Suffix.txt",'w');
with open(file,'r') as rf:
    lines = rf.readlines();
    c=0;
    for word in lines:
        if word.endswith(beta):
            f.write(word.strip("\n")+"\t"'1'"\n");
            c=c+1;
        else:
            f.write(word.strip("\n")+"\t"'0'"\n");
            c=c+1;
    print c;
    f.close()

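This code does not tag the words that end with "beta" with "1". When I replace endswith() with startswith(), the code works fine and tags the words that start with "beta" with "1", but the same does not work for endswith().

I don't understand this behavior. Why does this happen?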

My file looks like this:
IL-2
gene
expression
and
NF-κ
B
activation
through
CD28
requires
reactive
oxygen
production
by
5-lipoxygenase
.

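This is because each word ends with '\n'. Make sure you strip it before checking, or check whether the word ends with 'beta\n' instead.

Try again with if word.strip().endswith(beta):

Did you try word.rstrip().endswith(beta)? You also don't need to read all the lines into memory; you can iterate over the file object directly. And this is Python, not C, so remove the semicolons.

thx, works well and gives the correct results

No worries. You also don't need to pass "\n" to strip; with no argument it removes newlines by default. Finally, use rstrip to strip only from the end of the string.

Can you give a snippet of suffix.txt?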
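
The suggestions above can be put together in a minimal sketch. It assumes the suffix is the literal string "beta" (the question's code uses an undefined name beta). The helper name tag_by_suffix and the sample words are made up for illustration, and an in-memory io.StringIO stands in for the real input file.

```python
import io


def tag_by_suffix(lines, suffix):
    """Yield 'word<TAB>1' if the word ends with suffix, else 'word<TAB>0'.

    tag_by_suffix is a hypothetical helper name, not part of the question.
    """
    for line in lines:
        # Lines read from a file keep their trailing newline; remove it
        # (and any other trailing whitespace) before testing the suffix.
        word = line.rstrip()
        flag = "1" if word.endswith(suffix) else "0"
        yield word + "\t" + flag


# Why the original check failed: the newline is part of the string.
print("interferon-beta\n".endswith("beta"))           # False
print("interferon-beta\n".rstrip().endswith("beta"))  # True

# Made-up sample input; iterating a file object yields lines the same way.
sample = io.StringIO("IL-2\ninterferon-beta\nbeta-catenin\n")
tagged = list(tag_by_suffix(sample, "beta"))
for row in tagged:
    print(row)
```

With real files, the same generator could be fed the open input file directly and each yielded row written to the output file followed by "\n", which avoids calling readlines() and holding every line in memory.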
