Unable to install JUPYTER in a restricted Dataproc setup


I want to set up a Dataproc cluster with the Jupyter optional component:

gcloud beta dataproc clusters create cluster-1ea3 --enable-component-gateway \
--region europe-west1 --subnet data-network --no-address --zone europe-west1-b \
--single-node --master-machine-type n1-standard-4 --master-boot-disk-size 500 \
--image-version 1.5-debian10 --optional-components ANACONDA,JUPYTER \
--scopes 'https://www.googleapis.com/auth/cloud-platform' --project clouddemoenvironment
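"--no-address" ensures the cluster gets private IPs only, and the network "data-network" has Private Google Access enabled. Everything works if I do not install the Jupyter optional component, but with the optional component the cluster fails with the following error: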

<13>Nov  5 09:01:44 google-dataproc-startup[1466]: <13>Nov  5 09:01:44 activate-component-jupyter[2710]: Looking in links: /opt/dataproc/jupyter/gcp
<13>Nov  5 09:01:44 google-dataproc-startup[1466]: <13>Nov  5 09:01:44 activate-component-jupyter[2710]: Collecting https://github.com/GoogleCloudPlatform/jupyter-extensions/archive/2cb9d24fe01cd329a8c4352a07b0eb8f9771fb07.zip#subdirectory=jupyter-gcs-contents-manager (from -r /opt/dataproc/jupyter/jupyter_extra_packages.requirements (line 1))
<13>Nov  5 09:01:59 google-dataproc-startup[1466]: <13>Nov  5 09:01:59 activate-component-jupyter[2710]:   WARNING: Retrying (Retry(total=4, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ConnectTimeoutError(<pip._vendor.urllib3.connection.VerifiedHTTPSConnection object at 0x7f6b1afbac10>, 'Connection to github.com timed out. (connect timeout=15)')': /GoogleCloudPlatform/jupyter-extensions/archive/2cb9d24fe01cd329a8c4352a07b0eb8f9771fb07.zip
I understand that the cluster cannot reach github.com, so the failure makes sense. Checking the documentation, it states:

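If you create a Dataproc cluster with internal IP addresses only, attempts to access github.com over the Internet in an initialization action will fail unless you have configured routes to direct the traffic through Cloud NAT or a Cloud VPN. Without access to the Internet, you can enable Private Google Access and place job dependencies in Cloud Storage; cluster nodes can download the dependencies from Cloud Storage from internal IPs.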

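I do not want to use Cloud NAT or Cloud VPN. Is there a way to tell the system to resolve this dependency differently? Unfortunately, an initialization script probably will not work either, because initialization actions run after the optional components are activated.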

Any suggestions on how to use optional components in an environment without internet access?

Regards,
吉尔

This startup-time dependency is a bug in the latest Dataproc image.

It should be fixed in the next Dataproc subminor image version.

To work around this issue, you can use the previous subminor image version:
--image-version=1.5.18-debian10
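
Applied to the full command from the question, the workaround might look like the sketch below. Every flag is copied from the question except --image-version, which is pinned to the version suggested here. The run helper and the DRY_RUN variable are illustrative additions, not gcloud features; by default the script only prints the command so it can be reviewed first.

```shell
#!/bin/sh
# Sketch: the cluster-creation command from the question, pinned to the
# previous subminor image. "run" and DRY_RUN are hypothetical helpers added
# for illustration; they are not part of gcloud.
set -eu

IMAGE_VERSION="1.5.18-debian10"

# run: print the command when DRY_RUN=1 (the default), otherwise execute it.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

create_cluster() {
  run gcloud beta dataproc clusters create cluster-1ea3 \
    --enable-component-gateway \
    --region europe-west1 --subnet data-network --no-address --zone europe-west1-b \
    --single-node --master-machine-type n1-standard-4 --master-boot-disk-size 500 \
    --image-version "${IMAGE_VERSION}" \
    --optional-components ANACONDA,JUPYTER \
    --scopes 'https://www.googleapis.com/auth/cloud-platform' \
    --project clouddemoenvironment
}

create_cluster
```

Running the script with DRY_RUN=0 set in the environment executes the command instead of printing it.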


Update: this issue has been fixed, so you can use the latest image version.

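Thanks for the tip. There currently appear to be two related issues in the support portal. I tried 1.4-debian10, but that failed. I will try the version above.

I tested with the version above and everything works fine.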