Python 如何在现有Excel工作表下写入dataframe,而不丢失透视表中Excel的切片器?
我正在使用openpyxl模块将pandas数据框附加到现有excel工作表下面。但问题是,在执行此任务时,透视表中excel工作表的切片器被破坏。我已经尝试了很多方法来找到它的解决方案,但是我认为openpyxl无法避免这种情况 我使用以下方法用openpyxl实现它-Python 如何在现有Excel工作表下写入dataframe,而不丢失透视表中Excel的切片器?,python,excel,pandas,win32com,xlwings,Python,Excel,Pandas,Win32com,Xlwings,我正在使用openpyxl模块将pandas数据框附加到现有excel工作表下面。但问题是,在执行此任务时,透视表中excel工作表的切片器被破坏。我已经尝试了很多方法来找到它的解决方案,但是我认为openpyxl无法避免这种情况 我使用以下方法用openpyxl实现它- #HELPER FUNCTION TO APPEND DATAFRAME BELOW EXCEL FILE def append_df_to_excel(filename, df, sheet_name='Sheet1', s
#HELPER FUNCTION TO APPEND DATAFRAME BELOW EXCEL FILE
def append_df_to_excel(filename, df, sheet_name='Sheet1', startrow=None,
truncate_sheet=False,
**to_excel_kwargs):
"""
Append a DataFrame [df] to existing Excel file [filename]
into [sheet_name] Sheet.
If [filename] doesn't exist, then this function will create it.
Parameters:
filename : File path or existing ExcelWriter
(Example: '/path/to/file.xlsx')
df : dataframe to save to workbook
sheet_name : Name of sheet which will contain DataFrame.
(default: 'Sheet1')
startrow : upper left cell row to dump data frame.
Per default (startrow=None) calculate the last row
in the existing DF and write to the next row...
truncate_sheet : truncate (remove and recreate) [sheet_name]
before writing DataFrame to Excel file
to_excel_kwargs : arguments which will be passed to `DataFrame.to_excel()`
[can be dictionary]
Returns: None
"""
from openpyxl import load_workbook
import pandas as pd
# ignore [engine] parameter if it was passed
if 'engine' in to_excel_kwargs:
to_excel_kwargs.pop('engine')
writer = pd.ExcelWriter(filename, engine='openpyxl', index=False)
# Python 2.x: define [FileNotFoundError] exception if it doesn't exist
try:
FileNotFoundError
except NameError:
FileNotFoundError = IOError
try:
# try to open an existing workbook
writer.book = load_workbook(filename,data_only=True)
# get the last row in the existing Excel sheet
# if it was not specified explicitly
if startrow is None and sheet_name in writer.book.sheetnames:
startrow = writer.book[sheet_name].max_row
# truncate sheet
if truncate_sheet and sheet_name in writer.book.sheetnames:
# index of [sheet_name] sheet
idx = writer.book.sheetnames.index(sheet_name)
# remove [sheet_name]
writer.book.remove(writer.book.worksheets[idx])
# create an empty sheet [sheet_name] using old index
writer.book.create_sheet(sheet_name, idx)
# copy existing sheets
writer.sheets = {ws.title:ws for ws in writer.book.worksheets}
except FileNotFoundError:
# file does not exist yet, we will create it
pass
if startrow is None:
startrow = 1
# write out the new sheet
df.to_excel(writer, sheet_name, startrow=startrow, **to_excel_kwargs)
# save the workbook
writer.save()
我已经看到xlwings和win32com可以实现这一点,但不确定如何使用这些库实现。我想问一下,如何在现有excel文件的下方追加dataframe,而不会丢失excel透视表中的切片器
我们不用openpyxl就可以做到这一点,因为我认为openpyxl没有可用的方法。
我在openpyxl中收到以下警告
C:\Users\Desktop\PycharmProjects\MyProject\venv\lib\site-packages\openpyxl\worksheet\_reader.py:292: UserWarning: Slicer List extension is not supported and will be removed
warn(msg)
我已经尝试了所有可能的解决方案,最后我可以说,使用openpyxl是不可能的。作为xlwings的替代,我们可以使用openpyxl库。xlwings库处理速度非常快。我们可以在下面代码的帮助下将数据框附加到excel工作表下面。通过这种方式,切片器值不会松动或删除
import xlwings as xw
import pandas as pd
df = pd.read_excel("File_to_append.xlsx")
wb = xw.Book(r"Existing_Excel.xlsx")
ws = wb.sheets('Sheet1') #Name of sheet where to append df
ws.cells(cell_row_number,cell_column_number).options(index=False, header=False).value = df
#Here cell_row_number and cell_column_number are integer valuesof row number and column number to append data
希望我清楚。这是我找到的最好的解决方案。有人能帮我解决吗?有同样的问题,并得出结论,xlwing是唯一有效的方法。是的,xlwing比openpyxl快得多。我们在几秒钟内完成工作,而不丢失切片器和透视表信息