Python Dask read_parquet adds an extra column dir0


I have multiple Parquet files in different directories:

paths = ['adl://entrofi/shift/20190725_060500_20190928_060500/*.parquet',
         'adl://entrofi/shift/20190726_060500_20190928_060500/*.parquet',
         'adl://entrofi/shift/20190727_060500_20190928_060500/*.parquet',
         'adl://entrofi/shift/20190728_060500_20190928_060500/*.parquet',
         'adl://entrofi/shift/20190820_060500_20190920_060500/*.parquet',
         'adl://entrofi/shift/20190828_060500_20190928_060500/*.parquet']

Each file contains the columns A, B, and C.

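I want to read all of these files, so I ran:

import dask.dataframe as dd
ddf = dd.read_parquet(paths).drop_duplicates()

However, ddf contains the columns A, B, C, and dir0. The dir0 column holds the name of the folder that each path in paths was read from. None of the files in paths contains a dir0 column.

How can I prevent dir0 from being added to my ddf automatically?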
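This is expected behaviour for the fastparquet backend. Your files look as if they are partitioned by folder name, in this case using the "drill" scheme (as opposed to field=value directory names), so the folder name is exposed as a column.

To avoid this, you can use the pyarrow engine, or simply specify the columns you want to keep: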
# Option 1: keep only the columns you need
ddf = dd.read_parquet(paths, columns=['A', 'B', 'C'])
# Option 2: switch to the pyarrow engine
ddf = dd.read_parquet(paths, engine='pyarrow')
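To illustrate the difference between the two directory-naming schemes mentioned above, here is a simplified sketch of how folder names can turn into columns. It is not fastparquet's actual implementation; the function name `partition_columns` and the file name `part.0.parquet` are made up for the example.

```python
from posixpath import dirname


def partition_columns(file_path, root):
    """Simplified illustration of how directory names become columns.

    Not fastparquet's code: it only mimics the naming convention.
    """
    # Directory part of the path, relative to the dataset root.
    rel_dir = dirname(file_path[len(root):].lstrip("/"))
    parts = [p for p in rel_dir.split("/") if p]
    if parts and all("=" in p for p in parts):
        # "hive" scheme: every directory is named field=value.
        return dict(p.split("=", 1) for p in parts)
    # "drill" scheme: directories carry only values, so the columns
    # get generic names dir0, dir1, ... by nesting level.
    return {f"dir{i}": p for i, p in enumerate(parts)}


root = "adl://entrofi/shift"  # common parent of the folders in the question

print(partition_columns(
    root + "/20190725_060500_20190928_060500/part.0.parquet", root))
# {'dir0': '20190725_060500_20190928_060500'}

print(partition_columns(root + "/year=2019/month=07/part.0.parquet", root))
# {'year': '2019', 'month': '07'}
```

With the folder layout from the question there is exactly one directory level below the common parent and it has no field=value name, which is why a single dir0 column appears.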

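I can specify the columns, but I have a lot of them, which makes the code clumsy. When I specify pyarrow, a different error comes up: NotImplementedError: Using pyarrow with a 'DaskAdlFileSystem' filesystem object

For the latter, I think you will need to wait for an upcoming release.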
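If listing every column is impractical and the pyarrow engine is not usable, one possible workaround is to drop the generated column after reading. The helper below is a minimal sketch, not something from the original answer: `drop_drill_columns` is a hypothetical name, and it assumes that none of your real data columns is named dir0, dir1, and so on.

```python
import re


def drop_drill_columns(df):
    """Return df without columns named dir0, dir1, ...

    Works on any object exposing `.columns` and `.drop(columns=...)`,
    which both pandas and Dask DataFrames do.
    Assumption: no genuine data column uses such a name.
    """
    drill_cols = [c for c in df.columns if re.fullmatch(r"dir\d+", str(c))]
    return df.drop(columns=drill_cols)


# Intended use with the `paths` list from the question (not executed here):
# import dask.dataframe as dd
# ddf = drop_drill_columns(dd.read_parquet(paths)).drop_duplicates()
```

The order matters: if dir0 is still present when drop_duplicates() runs, rows that agree on A, B, and C but come from different folders are not treated as duplicates, so the column should be dropped first.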



