Python 如何从制表符分隔的文本中删除字符串外部的字符
我有一个文件,比如“Mrinq\u Parts\u Available.txt”,看起来像这样Python 如何从制表符分隔的文本中删除字符串外部的字符,python,string,split,Python,String,Split,我有一个文件,比如“Mrinq\u Parts\u Available.txt”,看起来像这样 Source Date Category SubCategory Present Description Value Units Vendor Part No Package Box Name Location Quantity Ordered Used MOQ=1 MOQ=100 MOQ=1000 Comments Link Digik
Source Date Category SubCategory Present Description Value Units Vendor Part No Package Box Name Location Quantity Ordered Used MOQ=1 MOQ=100 MOQ=1000 Comments Link
Digikey 29-May-15 RF Amplifier No 0.5 W RFMD RFPA3807 SOIC8 10 0 3.4 5V http://www.digikey.com/product-detail/en/RFPA3807TR13/689-1073-1-ND/2567207
我有一个python代码,它可以拆分这些行
def removeEmptyLines(inputFile):
with open(inputFile, 'rb') as f:
d = f.readlines()
k = []
for i in d:
k.append(i.split())
print (k)
if __name__=="__main__":
parts_database_file = "Mrinq_Parts_Available.txt"
removeEmptyLines(parts_database_file)
但输出如下所示:
[b'Source', b'Date', b'Category', b'SubCategory', b'Present', b'Description', b'Value', b'Units', b'Vendor', b'Part', b'No', b'Package', b'Box', b'Name', b'Location', b'Quantity', b'Ordered', b'Used', b'MOQ=1', b'MOQ=100', b'MOQ=1000', b'Comments', b'Link']
[b'Digikey', b'29-May-15', b'RF', b'Amplifier', b'No', b'0.5', b'W', b'RFMD', b'RFPA3807', b'SOIC8', b'10', b'0', b'3.4', b'5V', b'http://www.digikey.com/product-detail/en/RFPA3807TR13/689-1073-1-ND/2567207']
如何删除每个解析数据前面的“b”?您的文件显然是一个ASCII文件,因此您应该将其作为ASCII文件打开:
with open(inputFile, 'r') as f:
另一个类似的问题:使用
open()
函数中的'b'
以二进制文件的形式打开文件。只需删除b
:,将open(inputFile,'r')作为f: