Python table with whitespace delimiter
I have a sample txt file that looks like:
ACW00011604 17.1167 -61.7833 10.1 ST JOHNS COOLIDGE FLD
ACW00011647 17.1333 -61.7833 19.2 ST JOHNS
E000041196 25.3330 55.5170 34.0 SHARJAH INTER. AIRP
and so on.
What I need is to load this file like this:
X X.1 X.3 X.4 X.5
ACW00011604 17.1167 -61.7833 10.1 ST JOHNS COOLIDGE FLD
ACW00011647 17.1333 -61.7833 19.2 ST JOHNS
I tried using pandas like this:
ds=pd.read_table("st.txt", delim_whitespace=True, header=None)
But the result looks like this, with the station name split across several columns:
X X.1 X.3 X.4 X.5 X.6 X.7 X.8
ACW00011604 17.1167 -61.7833 10.1 ST JOHNS COOLIDGE FLD
ACW00011647 17.1333 -61.7833 19.2 ST JOHNS
E000041196 25.3330 55.5170 34.0 SHARJAH INTER. AIRP
How can I handle this?

Use read_fwf, which reads fixed-width file formats, and pass header=None together with the desired column names:
In [18]:
import io
import pandas as pd
t="""ACW00011604  17.1167  -61.7833  10.1  ST JOHNS COOLIDGE FLD
ACW00011647  17.1333  -61.7833  19.2  ST JOHNS
E000041196   25.3330   55.5170  34.0  SHARJAH INTER. AIRP"""
df = pd.read_fwf(io.StringIO(t), header=None, names=['X','X.1','X.3','X.4', 'X.5'])
df
Out[18]:
X X.1 X.3 X.4 X.5
0 ACW00011604 17.1167 -61.7833 10.1 ST JOHNS COOLIDGE FLD
1 ACW00011647 17.1333 -61.7833 19.2 ST JOHNS
2 E000041196 25.3330 55.5170 34.0 SHARJAH INTER. AIRP
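The call above lets read_fwf infer the column boundaries from the data. Where that inference is unreliable, the boundaries can be passed explicitly through the colspecs parameter. A minimal sketch, assuming the rows are padded so each field starts at a fixed offset; the offsets and the 30-character width of the last field are assumptions made for this sample, not values taken from the real file:

```python
import io
import pandas as pd

# Sample rows, assumed to be space-padded so every field starts at a fixed offset.
sample = (
    "ACW00011604  17.1167  -61.7833  10.1  ST JOHNS COOLIDGE FLD\n"
    "ACW00011647  17.1333  -61.7833  19.2  ST JOHNS\n"
    "E000041196   25.3330   55.5170  34.0  SHARJAH INTER. AIRP\n"
)

# Half-open (start, end) character ranges, one per column.
# These offsets are assumptions that match the padded sample above.
colspecs = [(0, 11), (13, 20), (22, 30), (32, 36), (38, 68)]

df_explicit = pd.read_fwf(
    io.StringIO(sample),
    colspecs=colspecs,
    header=None,
    names=["X", "X.1", "X.3", "X.4", "X.5"],
)
print(df_explicit)
```

With explicit ranges, spaces inside the station name cannot create extra columns, because each field is cut by position rather than by delimiter.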
So in your case, the following should work:
ds=pd.read_fwf("st.txt", header=None, names=['X','X.1','X.3','X.4', 'X.5'])
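If the file turns out not to be aligned in fixed-width columns, and the fields are separated by single spaces as in the sample shown in the question, a different technique may help: split each line a limited number of times so that everything after the fourth field stays together. This is a rough sketch rather than part of the answer above. The helper read_stations is hypothetical, and it assumes that the first four fields never contain spaces.

```python
import io
import pandas as pd

# Sample rows as shown in the question, separated by single spaces.
sample = """ACW00011604 17.1167 -61.7833 10.1 ST JOHNS COOLIDGE FLD
ACW00011647 17.1333 -61.7833 19.2 ST JOHNS
E000041196 25.3330 55.5170 34.0 SHARJAH INTER. AIRP
"""

NAMES = ["X", "X.1", "X.3", "X.4", "X.5"]


def read_stations(handle, names=NAMES):
    """Hypothetical helper: split each line on whitespace at most
    len(names) - 1 times, so the rest of the line becomes the last column."""
    n_leading = len(names) - 1
    rows = []
    for line in handle:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        parts = line.split(None, n_leading)
        # Pad rows that have no trailing text so every row has the same length.
        parts += [None] * (len(names) - len(parts))
        rows.append(parts)
    df = pd.DataFrame(rows, columns=names)
    # Assumption: every column between the first and the last is numeric.
    numeric = names[1:-1]
    df[numeric] = df[numeric].apply(pd.to_numeric)
    return df


ds_split = read_stations(io.StringIO(sample))
print(ds_split)
```

For the real file, an open handle for st.txt can be passed in place of the StringIO object.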