删除不包含特定文本的行-Python_Python_Pandas_Dataframe - Fatal编程技术网

Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/python/337.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
删除不包含特定文本的行-Python_Python_Pandas_Dataframe - Fatal编程技术网

删除不包含特定文本的行-Python

python pandas dataframe

删除不包含特定文本的行-Python,python,pandas,dataframe,Python,Pandas,Dataframe,我有一个表格文件，如下所示： query_name KEGG_KOs PROKKA_00013 NaN PROKKA_00015 bactNOG[38] PROKKA_00017 NA|NA|NA PROKKA_00019 K00240 PROKKA_00020 K00246 PROKKA_00022 K02887 如果第2列（“KEGG_KOs”）不是以“K0”开头，我将尝试创建一个脚本来遍历并删除整行。我正在尝试创建以下输出： query_na

我有一个表格文件，如下所示：

query_name      KEGG_KOs
PROKKA_00013    NaN
PROKKA_00015    bactNOG[38]
PROKKA_00017    NA|NA|NA
PROKKA_00019    K00240
PROKKA_00020    K00246
PROKKA_00022    K02887

如果第2列（“KEGG_KOs”）不是以“K0”开头，我将尝试创建一个脚本来遍历并删除整行。我正在尝试创建以下输出：

query_name     KEGG_KOs
PROKKA_00019    K00240
PROKKA_00020    K00246
PROKKA_00022    K02887

以前的回复提到人们使用熊猫数据框，但我没有运气使用这些回复来帮助他们。任何人都将不胜感激，干杯

我试过了（但这只隔离了一个特定的K0线）

df = pd.read_csv("eggnog.txt", delimiter="\t", names=["#query_name", "KEGG_KOs"])
print(df.loc[df['KEGG_KOs'] == 'K00240'])

与

regex一起使用或与regex一起使用，用于字符串^
和参数na=False
，因为缺少值：
df1 = df[df['KEGG_KOs'].str.startswith('K0', na=False)]
print (df1)
     query_name KEGG_KOs
3  PROKKA_00019   K00240
4  PROKKA_00020   K00246
5  PROKKA_00022   K02887

或：
您可以使用先读后写的方式打开。假设原始文件保存为old.txt，更新后的文件保存为new.txt
text = ''
with open("old.txt", 'r') as org:
    next(org)
    for line in org:
        data = line.strip().split()
        if data[1].startswith("K0"):
            text = text + data[0] + " "+ data[1] + '\n'

w = open('new.txt', 'w')
w.write("query_name"+" "+ "KEGG_KOs\n")
w.write(text)
w.close()

你能给我们看一下你的代码和你的尝试吗？
text = ''
with open("old.txt", 'r') as org:
    next(org)
    for line in org:
        data = line.strip().split()
        if data[1].startswith("K0"):
            text = text + data[0] + " "+ data[1] + '\n'

w = open('new.txt', 'w')
w.write("query_name"+" "+ "KEGG_KOs\n")
w.write(text)
w.close()




[pandas]相关文章推荐



                                                        
Pandas 熊猫的胡须到底是什么'；箱线图说明了什么？
pandas 
Pandas 用于时间序列打印的水平居中XLabel
pandas 
Pandas 从任意行获取熊猫中的csv标头
pandas 
pandas dataframe.to_gqb如何上传到远程表？
pandasgoogle-bigquery 
Pandas 熊猫数据阅读器不'；我不为谷歌金融工作
pandas 
为什么pandas.Series.values会删除原始序列中存在的一些标量数据？
pandasdataframe 
Pandas 将数据从一个数据帧移动到另一个具有相同列的数据帧中
pandas 
将Pandas代码循环两次并使Bot的运行时间加倍
pandasloopsfor-loopdataframe 
如何在没有公共索引的情况下使用Pandas dataframe作为映射
pandas 
Pandas:1数据帧比较行以创建新列
pandas 
Pandas 熊猫：can'；t使用mod_wsgi导入numpy
pandas 
Pandas 将文本列转换为二进制，其中一组值=1，所有其他值=0
pandaspython-2.7 
Pandas 熊猫把柱子放在适当的位置，然后把它放回原处
pandas 
Pandas 如果列表中的子字符串以字符串形式出现，则为新列赋值
pandaslist 
将PySpark和DBSCAN与pandas_udf相结合
pandasapache-sparkpysparkscikit-learn 
Pandas 分组和绘图
pandasmatplotlib 
Pandas 在多索引中创建新列
pandas 
Pandas 执行groupby.median（）时如何保存分类列？
pandas 
Pandas 反向累积计数
pandas 
Pandas 使用pyspark基于多列值的删除记录
pandaspyspark 
                                       





随机文章推荐



                                                        
Ssrs 2008 动态更改ssrs报告中的连接字符串
ssrs-2008 
Ssrs 2008 SSRS-使用不同数据集字段的表达式
ssrs-2008reporting-services 
Ssrs 2008 SSRS报告parameter.label显示parameter.value
ssrs-2008axapta 
Ssrs 2008 如何使用自动NTLM凭证传递从外部验证SSR？
ssrs-2008 
Ssrs 2008 Reporting Services列组单元格跨组
ssrs-2008 
Ssrs 2008 报告服务中的交互式排序失败，SSRS 2008报告服务中出现rsReportNotReady异常
ssrs-2008 
Ssrs 2008 物料清单的SSRS递归层次结构
ssrs-2008 
Ssrs 2008 2008年SSRS负货币价值
ssrs-2008 
Ssrs 2008 从两行中选择一个值
ssrs-2008


                                        

                                        
                                        


                                                
                                                        [python]相关推荐
                                                        
                                                        
                                                

                                                
                                                        Tags
                                                        
Seo
Build
Css
Curl
Google Cloud Dataflow
Numpy
Npm
Liferay
Webgl
Enums
Discord.py
Service
Vmware
Cocoa
Ipad
Ignite
Assembly
Computer Science
Next.js
Tfs
Hash
Arrays
Scala
Angular Material
C#
Pagination
Maps
Codeigniter
Nginx
Mapbox
Webview
Parallel Processing
Spring Mvc
Apache Nifi
Inno Setup
Breeze
X86
Orm
Xcode
Continuous Integration
Razor
Command Line
Joomla
Less
Mediawiki
Generics
Google Drive Api
Triggers
Google Analytics
Axapta
Maven 2
Clearcase
Sqlalchemy
Rally
Csv
Nest
Functional Programming
Windows 8
Javafx 2
Mdx
Nunit
Linux
Lotus Notes
Notifications
Phpmyadmin
Compiler Construction
Concurrency
D3.js
Yocto
Hazelcast
Chart.js
Struct
Typescript
Opengl Es
Sap
Vim
Autohotkey
Memory Leaks
Asp.net Web Api
Macros
Web Applications
Windows
Orchardcms
Log4j
Mapreduce
Jdbc
Grid
Autodesk Forge
Https
3d
C
Activerecord
Isabelle
Flutter
Fiware
Vhdl
Selenium
Karate
Quickbooks
Loops
Couchbase
Streaming
Sapui5
Xamarin
Xslt
Glsl
Cygwin
Directx
Vba
Blackberry
Prestashop
Blazor
Visual Studio 2013
Python Sphinx
Struts2
Meteor
Alfresco
Dataframe
Junit
Uitableview
Utf 8
Amazon Cloudformation
.net 4.0
Internet Explorer
Ethereum
Gatsby
Xaml
Zsh
Stripe Payments
Web Services
Gremlin
Syntax
Io
Cassandra
Sails.js
Tabs
Tinymce
Timer
Compression
Docker Compose
Pdf
Sockets
Embedded
Angular
File Io
Url Rewriting
Maven
Express
Scikit Learn
Blockchain
Sql Server
Sed
Codenameone
Apache Storm
Visual Studio 2017
Jenkins
Ruby On Rails 3.1
Deep Learning
Ibm Mobilefirst
Prometheus
Antlr
Antlr4
Flash
Llvm
Winforms
Java 8
Applescript
Math
Google Chrome Devtools
Jquery
Oracle Apex
Jupyter Notebook
Jms
Debugging
Sip
Proxy
Knockout.js
Internet Explorer 8
Jar
Spring
Magento
Synchronization
Google Bigquery
Visual Studio Code
Geolocation
Composer Php
Moodle
Transactions
Protractor
Debian
Function
Corda
Artificial Intelligence
Kotlin
Data Structures
Jira
Mobile
Stanford Nlp
Twilio
Firefox
Xsd


                

                        
						
                        
                                
                                        
                                                
                                                        
                                                                Copyright © 2024. All Rights Reserved by  - Fatal编程技术网