Python 从数据帧中的所有行提取子字符串
我有一个pd.DataFrame,如下所示:Python 从数据帧中的所有行提取子字符串,python,pandas,Python,Pandas,我有一个pd.DataFrame,如下所示: pd.DataFrame(["SSDILFJKSIDHFKJSHDKUFH", "SLIDFSOIUDHFIUSDHF", "K<NFSKJGHSDUFSDK"], ["SKDJF", "FDKSJFSSDF", "SIDFDS"]) pd.DataFrame([“SSDILFJKSIDHFKJSHDKUFH”、“slidefsoiudhfiusdhf”、“K您可以尝试使用apply df Out[115]: index
pd.DataFrame(["SSDILFJKSIDHFKJSHDKUFH", "SLIDFSOIUDHFIUSDHF", "K<NFSKJGHSDUFSDK"], ["SKDJF", "FDKSJFSSDF", "SIDFDS"])
pd.DataFrame([“SSDILFJKSIDHFKJSHDKUFH”、“slidefsoiudhfiusdhf”、“K您可以尝试使用apply
df
Out[115]:
index 0
0 SKDJF SSDILFJKSIDHFKJSHDKUFH
1 FDKSJFSSDF SLIDFSOIUDHFIUSDHF
2 SIDFDS K<NFSKJGHSDUFSDK
df.apply(lambda x : x[0][len(x['index'])],axis=1)
Out[116]:
0 F
1 H
2 J
dtype: object
这是使用列表理解的一种方法:
df = pd.DataFrame({'A': ["SSDILFJKSIDHFKJSHDKUFH", "SLIDFSOIUDHFIUSDHF",
"K<NFSKJGHSDUFSDK"]},
index=["SKDJF", "FDKSJFSSDF", "SIDFDS"])
df['B'] = [j[1:i+1] for i, j in zip(s.index.map(len), s.values)]
print(df)
A B
SKDJF SSDILFJKSIDHFKJSHDKUFH SDILF
FDKSJFSSDF SLIDFSOIUDHFIUSDHF LIDFSOIUDH
SIDFDS K<NFSKJGHSDUFSDK <NFSKJ
df=pd.DataFrame({'A':[“SSDILFJKSIDHFKJSHDKUFH”,“slidefsoiudhfiusdhf”,
“你能为你要做的事情添加一些伪代码吗?
df = pd.DataFrame({'A': ["SSDILFJKSIDHFKJSHDKUFH", "SLIDFSOIUDHFIUSDHF",
"K<NFSKJGHSDUFSDK"]},
index=["SKDJF", "FDKSJFSSDF", "SIDFDS"])
df['B'] = [j[1:i+1] for i, j in zip(s.index.map(len), s.values)]
print(df)
A B
SKDJF SSDILFJKSIDHFKJSHDKUFH SDILF
FDKSJFSSDF SLIDFSOIUDHFIUSDHF LIDFSOIUDH
SIDFDS K<NFSKJGHSDUFSDK <NFSKJ