Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/python/307.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python将列表的CSV部分转换为数据帧_Python_Pandas - Fatal编程技术网

Python将列表的CSV部分转换为数据帧

Python将列表的CSV部分转换为数据帧,python,pandas,Python,Pandas,我正在拉入一个文本文件,其中包含许多不同的数据:序列号、类型和csv数据日志: A123> A123>读取序列号 sn=12143 A123>读取cms序列号 cms序列号=12143-00000000-0000 A123>读取fw版本 fw rev=1.3,1.3 A123>读取日志 日志= 1855228,1,0,41,-5726183,25,22,21,22,0,0,892048500,0,0 1855240,1,0,33,026319,25,22,22,23,0,0,0,89204850

我正在拉入一个文本文件,其中包含许多不同的数据:序列号、类型和csv数据日志:

A123>

A123>读取序列号

sn=12143

A123>读取cms序列号

cms序列号=12143-00000000-0000

A123>读取fw版本

fw rev=1.3,1.3

A123>读取日志

日志=

1855228,1,0,41,-5726183,25,22,21,22,0,0,892048500,0,0

1855240,1,0,33,026319,25,22,22,23,0,0,0,892048500,0,0,0

261201,1,0,41,-8226122,20,21,21,0,0,0,872048500,0,0

2612142,1,0,4930127607,21,22,21,21,0,0,812048500,0,0

以下是我目前掌握的代码:

import pandas as pd

lines = []                  # Declare an empty list named "lines"
with open ('03-22-2019.txt', 'rt') as in_file:  # Open file 
    for line in in_file:  # For each line of text in in_file, where the data is named "line",
        lines.append(line.rstrip('\n'))   # add that line to our list of lines, stripping newlines.
    
while('' in lines):
        lines.remove("")

lines = [x for x in lines if 'A123' not in x]  #delete all lines with 'A123'


for element in lines:            # For each element in our list,
        print(element)              # print it.
        

split_line = lines[0].split()  # create list with serial number line
Serial_Num = split_line[-1]
print(Serial_Num)

split_line = lines[1].split()  # go to line with CMS SN
CMS_SN = split_line[-1]
print(CMS_SN)

split_line = lines[2].split()
Firm_Rev_1 = split_line[-1]
Firm_Rev_2 = split_line[-2]
print(Firm_Rev_1)
print(Firm_Rev_2)
                                  #  Problem section starts here!
start_data = lines.index("log =") + 1                   #<<<<<<<<<<
data = [x for x in lines[start_data:].split(",")]       #<<<<<<<<<<
#dfObj = pd.DataFrame(lines[start_data:-1].split(","))  #<<<<<<<<<<
将熊猫作为pd导入
lines=[]#声明一个名为“lines”的空列表
打开('03-22-2019.txt','rt')如_文件中所示:#打开文件
对于in_文件中的行:#对于in_文件中的每行文本,其中数据名为“行”,
line.append(line.rstrip('\n'))#将该行添加到我们的行列表中,去掉换行符。
而(“”在行中):
行。删除(“”)
lines=[x代表x,如果'A123'不在x中]#删除所有带有'A123'的行
对于行中的元素:#对于列表中的每个元素,
打印(元素)#打印它。
split_line=行[0]。split()#创建带有序列号行的列表
序列号=分割线[-1]
打印(序列号)
split_line=行[1]。split()#转到带有CMS序列号的行
CMS\u序列号=分割线[-1]
打印(CMS\U SN)
split_line=行[2]。split()
固定版本1=分割线[-1]
固定版本2=分割线[-2]
印刷品(公司版次1)
印刷品(公司版次2)
#问题部分从这里开始!

start_data=lines.index(“log=)+1#看起来你已经非常接近了

# this will get you a list of lists for each line.
data = [line.split(',') for line in lines[start_data:]]
# This should construct your data frame
dfObj = pd.DataFrame(data=data, columns=[list, of, column, names])