org.apache.spark.sql.AnalysisException:尝试从spark访问azure时路径不存在
下面是我的代码:org.apache.spark.sql.AnalysisException:尝试从spark访问azure时路径不存在,azure,apache-spark,azure-storage-blobs,Azure,Apache Spark,Azure Storage Blobs,下面是我的代码: val spark = SparkSession.builder().master("local[*]").appName("demoApp").getOrCreate() spark.sparkContext.hadoopConfiguration.set("fs.azure", "org.apache.hadoop.fs.azure.NativeAzureFileSystem") spark.sparkContext.hadoopConfiguration.set(
val spark = SparkSession.builder().master("local[*]").appName("demoApp").getOrCreate()
spark.sparkContext.hadoopConfiguration.set("fs.azure", "org.apache.hadoop.fs.azure.NativeAzureFileSystem")
spark.sparkContext.hadoopConfiguration.set("fs.azure.account.key.<storage-account>.blob.core.windows.net", <account_key>)
val baseDir = "wasb://<container-name>@<storage_account>.blob.core.windows.net/"
val df = spark.read.orc(baseDir+"path")
错误:
org.apache.spark.sql.AnalysisException: Path does not exist wasb://<container-name>@<storage_account>.blob.core.windows.net/path
我建议检查以下各项 他们提供了如何使用标准spark API和databricks API读取Blob存储帐户的示例,代码如下所示:
val df = spark.read.parquet("wasbs://<your-container-name>@<your-storage-account-name>.blob.core.windows.net/<your-directory-name>")
dbutils.fs.ls("wasbs://<your-container-name>@<your-storage-account-name>.blob.core.windows.net/<your-directory-name>")
下面的答案对你有帮助吗?