Python: converting the CSV portion of a list into a DataFrame


python pandas


I am reading in a text file that contains several different kinds of data: a serial number, a type, and a CSV data log:

A123>

A123>read sn

sn = 12143

A123>read cms sn

cms sn = 12143-00000000-0000

A123>read fw rev

fw rev = 1.3, 1.3

A123>read log

log =

1855228,1,0,41,-5726183,25,22,21,22,0,0,892048500,0,0

1855240,1,0,33,026319,25,22,22,23,0,0,0,892048500,0,0,0

261201,1,0,41,-8226122,20,21,21,0,0,0,872048500,0,0

2612142,1,0,4930127607,21,22,21,21,0,0,812048500,0,0

Here is the code I have so far:

import pandas as pd

lines = []                  # Declare an empty list named "lines"
with open ('03-22-2019.txt', 'rt') as in_file:  # Open file 
    for line in in_file:  # For each line of text in in_file, where the data is named "line",
        lines.append(line.rstrip('\n'))   # add that line to our list of lines, stripping newlines.
    
while('' in lines):
        lines.remove("")

lines = [x for x in lines if 'A123' not in x]  #delete all lines with 'A123'


for element in lines:            # For each element in our list,
        print(element)              # print it.
        

split_line = lines[0].split()  # create list with serial number line
Serial_Num = split_line[-1]
print(Serial_Num)

split_line = lines[1].split()  # go to line with CMS SN
CMS_SN = split_line[-1]
print(CMS_SN)

split_line = lines[2].split()
Firm_Rev_1 = split_line[-1]
Firm_Rev_2 = split_line[-2]
print(Firm_Rev_1)
print(Firm_Rev_2)
                                  #  Problem section starts here!
start_data = lines.index("log =") + 1                   #<<<<<<<<<<
data = [x for x in lines[start_data:].split(",")]       #<<<<<<<<<<
#dfObj = pd.DataFrame(lines[start_data:-1].split(","))  #<<<<<<<<<<

It looks like you are very close. lines[start_data:] is a list, and a list has no .split() method, so split each line individually instead:
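# This gives you a list of lists, one inner list per log line.
data = [line.split(',') for line in lines[start_data:]]
# This should construct your DataFrame. The column list is a placeholder:
# replace it with your real column names (one string per field).
dfObj = pd.DataFrame(data=data, columns=[list, of, column, names])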
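
For reference, the whole approach can be sketched end to end as below. This is a minimal sketch, not a definitive implementation: the sample text, the helper name log_to_dataframe, and the column names are all made up for illustration, and it assumes every log row has the same number of purely numeric fields.

```python
import pandas as pd

# Hypothetical sample standing in for the contents of the text file.
raw_text = """A123>
A123>read sn
sn = 12143

A123>read log
log =
1855228,1,0,41,25,22
1855240,1,0,33,25,23
"""


def log_to_dataframe(text, marker="log =", columns=None):
    """Build a DataFrame from the comma-separated rows that follow `marker`."""
    # Drop blank lines and prompt lines, as the question's code does.
    lines = [ln.strip() for ln in text.splitlines()]
    lines = [ln for ln in lines if ln and "A123" not in ln]
    # The first data row is the line right after the marker.
    start = lines.index(marker) + 1
    # Split each line individually; a list itself has no .split() method.
    rows = [ln.split(",") for ln in lines[start:]]
    df = pd.DataFrame(rows, columns=columns)
    # str.split() yields strings, so convert every column to numbers.
    return df.apply(pd.to_numeric)


# The column names here are assumptions; use whatever your log fields mean.
df = log_to_dataframe(raw_text, columns=["time", "a", "b", "c", "d", "e"])
print(df)
```

Note that pd.to_numeric raises an error on values it cannot parse, so rows with stray text would have to be filtered out before the conversion.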