Python 在列中查找关键字并将这些关键字添加到同一行的新列中
我是python新手,这是我关于堆栈溢出的第一篇文章。我有一个关键字列表和一个包含多列的数据框 我想在一个特定的列中搜索这些关键字,并写出与之相对应的关键字 这就是我正在做的 这就是我得到的错误 这就是我想要的 请帮助找出哪里出了问题,或者提出更好的解决方法。谢谢 如果有助于简化工作,请编写下面的代码Python 在列中查找关键字并将这些关键字添加到同一行的新列中,python,dataframe,append,contains,Python,Dataframe,Append,Contains,我是python新手,这是我关于堆栈溢出的第一篇文章。我有一个关键字列表和一个包含多列的数据框 我想在一个特定的列中搜索这些关键字,并写出与之相对应的关键字 这就是我正在做的 这就是我得到的错误 这就是我想要的 请帮助找出哪里出了问题,或者提出更好的解决方法。谢谢 如果有助于简化工作,请编写下面的代码 import pandas as pd keywords = ["hello","hi","greetings","wassup"] data = ["hello, my name is Ha
import pandas as pd
keywords = ["hello","hi","greetings","wassup"]
data = ["hello, my name is Harry", "Hi I am John", "Yo! Wassup", "Greetings fellow traveller","Hey im
Henry", "Hello there General Kenobi"]
df = pd.DataFrame(data,columns = ['strings'])
df['Keywords'] = ""
df2 = pd.DataFrame(data = None, columns = df.columns)
for word in keywords:
temp = df[df['strings'].str.contains(word,na = False)]
temp.reset_index(drop = True)
temp['Keywords'] = word
df2.append(temp)
错误:
C:\Users\harka\Anaconda3\lib\site packages\ipykernel\u launcher.py:5:SettingWithCopyWarning:
试图在数据帧切片的副本上设置值。
尝试改用.loc[row\u indexer,col\u indexer]=value
请参阅文档中的注意事项:
“”我添加了“Yo”以显示它可以返回多个字符串
import pandas as pd
def keyword(row):
strings = row['strings']
keywords = ["hello","hi","greetings","wassup",'yo']
keyword = [key for key in keywords if key.upper() in strings.upper()]
return keyword
data = ["hello, my name is Harry", "Hi I am John", "Yo! Wassup", "Greetings fellow traveller","Hey im Henry", "Hello there General Kenobi"]
df = pd.DataFrame(data,columns = ['strings'])
df['keyword'] = df.apply(keyword, axis=1)
如果您不喜欢返回的字符串列表,那么可能是逗号分隔的字符串
import pandas as pd
def keyword(row):
strings = row['strings']
keywords = ["hello","hi","greetings","wassup",'yo']
keyword = [key for key in keywords if key.upper() in strings.upper()]
return ','.join(keyword)
data = ["hello, my name is Harry", "Hi I am John", "Yo! Wassup", "Greetings fellow traveller","Hey im Henry", "Hello there General Kenobi"]
df = pd.DataFrame(data,columns = ['strings'])
df['keyword'] = df.apply(keyword, axis=1)
我添加了“Yo”来表示它可以返回多个字符串
import pandas as pd
def keyword(row):
strings = row['strings']
keywords = ["hello","hi","greetings","wassup",'yo']
keyword = [key for key in keywords if key.upper() in strings.upper()]
return keyword
data = ["hello, my name is Harry", "Hi I am John", "Yo! Wassup", "Greetings fellow traveller","Hey im Henry", "Hello there General Kenobi"]
df = pd.DataFrame(data,columns = ['strings'])
df['keyword'] = df.apply(keyword, axis=1)
如果您不喜欢返回的字符串列表,那么可能是逗号分隔的字符串
import pandas as pd
def keyword(row):
strings = row['strings']
keywords = ["hello","hi","greetings","wassup",'yo']
keyword = [key for key in keywords if key.upper() in strings.upper()]
return ','.join(keyword)
data = ["hello, my name is Harry", "Hi I am John", "Yo! Wassup", "Greetings fellow traveller","Hey im Henry", "Hello there General Kenobi"]
df = pd.DataFrame(data,columns = ['strings'])
df['keyword'] = df.apply(keyword, axis=1)