OpenStack Xenial/Pike和Xenial/Queens之间是否存在阻止JuJu创建openvswitch桥的变化?

OpenStack Xenial/Pike和Xenial/Queens之间是否存在阻止JuJu创建openvswitch桥的变化?,openstack,openstack-nova,openstack-neutron,juju,maas,Openstack,Openstack Nova,Openstack Neutron,Juju,Maas,在过去两年半的时间里,我们一直在运行OpenStack环境,但在运行过程中遇到了一些问题,但大多数情况下几乎没有停机时间。最近,我们一直在尝试向堆栈中添加一块新的硬件作为nova计算节点,以便为我们的vm提供更多的CPU核心和RAM。不幸的是,由于某些原因,安装过程并不顺利 我们正在使用JuJu和MaaS运行Xenial/Queens进行部署/资源调配。我们一直在运行Xenial/Pike,直到12月升级。我们开始怀疑Queens的升级是造成问题的原因,因为我们能够在升级之前添加新的硬件。我们甚

在过去两年半的时间里,我们一直在运行OpenStack环境,但在运行过程中遇到了一些问题,但大多数情况下几乎没有停机时间。最近,我们一直在尝试向堆栈中添加一块新的硬件作为nova计算节点,以便为我们的vm提供更多的CPU核心和RAM。不幸的是,由于某些原因,安装过程并不顺利

我们正在使用JuJu和MaaS运行Xenial/Queens进行部署/资源调配。我们一直在运行Xenial/Pike,直到12月升级。我们开始怀疑Queens的升级是造成问题的原因,因为我们能够在升级之前添加新的硬件。我们甚至删除了一台作为nova计算节点的现有机器,并尝试将其添加回堆栈,现在它也出现了与新硬件相同的问题

问题的根本原因似乎与中子开关应用有关。当我们通过JuJu安装nova compute charm时,一切似乎都进展顺利,直到次级中子openvswitch charm的(自动)安装/配置。在OpenStack管理网络(eno1上的10.10.30.0/24)的安装连接过程中,在某个特定点查看日志时丢失。我们可以通过在eno2(一个不同的外部网络)上添加第二个连接来进一步强制安装,但eno1上的连接丢失仍然存在,计算服务无法与堆栈的其余部分通信

看看堆栈中其他功能正常的计算节点,管理网桥(br-eno1)似乎不是由中子openvswitch创建的。过程的某些部分看起来像是在准备创建网桥时关闭了eno1,但随后失败,导致机器无法在该接口上与堆栈的其余部分通信

自从升级到Queens后,我们的配置都没有改变,但也许我们不知道Pike->Queens升级时附带的默认配置有一些不推荐或更改?我们已经阅读了发行说明,但似乎找不到任何可以解释这种行为的东西

任何帮助都将不胜感激。我在下面列出了一些我认为相关的日志文件,但可以提供可能需要的任何其他内容。提前谢谢

损坏的服务器ifconfig

eno1      Link encap:Ethernet  HWaddr FF:FF:FF:FF:FF:FF (redacted)
          inet addr:10.10.30.101  Bcast:10.10.30.255  Mask:255.255.255.0
          inet6 addr: fe80::4ed9:8fff:fec5:2e3/64 Scope:Link
          UP BROADCAST RUNNING PROMISC MULTICAST  MTU:1500  Metric:1
          RX packets:487314 errors:0 dropped:0 overruns:0 frame:0
          TX packets:91955 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:255807482 (255.8 MB)  TX bytes:6693026 (6.6 MB)
          Interrupt:17

eno2      Link encap:Ethernet  HWaddr FF:FF:FF:FF:FF:FF (redacted)
          inet addr:10.189.134.103  Bcast:10.189.134.255  Mask:255.255.255.0
          inet6 addr: fe80::4ed9:8fff:fec5:2e4/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:195386 errors:0 dropped:0 overruns:0 frame:0
          TX packets:89021 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:29175518 (29.1 MB)  TX bytes:37673375 (37.6 MB)
          Interrupt:18

lo        Link encap:Local Loopback
          inet addr:127.0.0.1  Mask:255.0.0.0
          inet6 addr: ::1/128 Scope:Host
          UP LOOPBACK RUNNING  MTU:65536  Metric:1
          RX packets:181496 errors:0 dropped:0 overruns:0 frame:0
          TX packets:181496 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1
          RX bytes:22574807 (22.5 MB)  TX bytes:22574807 (22.5 MB)

lxdbr0    Link encap:Ethernet  HWaddr FF:FF:FF:FF:FF:FF (redacted)
          inet6 addr: fe80::1/64 Scope:Link
          inet6 addr: fe80::b8c2:36ff:fe60:de08/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:7 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:0 (0.0 B)  TX bytes:650 (650.0 B)
断开的服务器ovs vsctl显示

fc878983-8ae5-479f-999f-d809f5a2ba8f
    Manager "ptcp:6640:127.0.0.1"
        is_connected: true
    Bridge br-data
        Port "eno1"
            Interface "eno1"
        Port br-data
            Interface br-data
                type: internal
    Bridge br-ex
        Port br-ex
            Interface br-ex
                type: internal
    Bridge br-int
        Controller "tcp:127.0.0.1:6633"
            is_connected: true
        fail_mode: secure
        Port br-int
            Interface br-int
                type: internal
    ovs_version: "2.9.5"
be5c20fd-46ef-4991-8dc3-3860944308e5
    Manager "ptcp:6640:127.0.0.1"
        is_connected: true
    Bridge br-data
        Controller "tcp:127.0.0.1:6633"
            is_connected: true
        fail_mode: secure
        Port "eno1"
            Interface "eno1"
                error: "could not add network device eno1 to ofproto (Device or resource busy)"
        Port "eno2"
            Interface "eno2"
        Port br-data
            Interface br-data
                type: internal
        Port phy-br-data
            Interface phy-br-data
                type: patch
                options: {peer=int-br-data}
    Bridge br-tun
        Controller "tcp:127.0.0.1:6633"
            is_connected: true
        fail_mode: secure
        Port patch-int
            Interface patch-int
                type: patch
                options: {peer=patch-tun}
        Port "gre-0a0a1e7f"
            Interface "gre-0a0a1e7f"
                type: gre
                options: {df_default="true", in_key=flow, local_ip="10.10.30.117", out_key=flow, remote_ip="10.10.30.127"}
        Port "gre-0a0a1e74"
            Interface "gre-0a0a1e74"
                type: gre
                options: {df_default="true", in_key=flow, local_ip="10.10.30.117", out_key=flow, remote_ip="10.10.30.116"}
        Port "gre-0a0a1e76"
            Interface "gre-0a0a1e76"
                type: gre
                options: {df_default="true", in_key=flow, local_ip="10.10.30.117", out_key=flow, remote_ip="10.10.30.118"}
        Port br-tun
            Interface br-tun
                type: internal
    Bridge br-int
        Controller "tcp:127.0.0.1:6633"
            is_connected: true
        fail_mode: secure
        Port "qvo5560dd35-7e"
            tag: 2
            Interface "qvo5560dd35-7e"
        Port patch-tun
            Interface patch-tun
                type: patch
                options: {peer=patch-int}
        Port "qvo97c660e7-e3"
            tag: 1
            Interface "qvo97c660e7-e3"
        Port "qvo44aeabe3-de"
            tag: 1
            Interface "qvo44aeabe3-de"
        Port "qvo267cccc8-45"
            tag: 1
            Interface "qvo267cccc8-45"
        Port "qvofdf0ce36-50"
            tag: 2
            Interface "qvofdf0ce36-50"
        Port "qvof193baf6-c0"
            tag: 1
            Interface "qvof193baf6-c0"
        Port "qvod9facd45-41"
            tag: 1
            Interface "qvod9facd45-41"
        Port "qvoeeab657c-df"
            tag: 1
            Interface "qvoeeab657c-df"
        Port "qvodd4b9252-e5"
            tag: 1
            Interface "qvodd4b9252-e5"
        Port br-int
            Interface br-int
                type: internal
        Port "qvoc841a7f1-25"
            tag: 2
            Interface "qvoc841a7f1-25"
        Port "qvod6b38e4c-a1"
            tag: 2
            Interface "qvod6b38e4c-a1"
        Port int-br-data
            Interface int-br-data
                type: patch
                options: {peer=phy-br-data}
    Bridge br-ex
        Port br-ex
            Interface br-ex
                type: internal
    ovs_version: "2.9.2"
工作服务器ifconfig:

br-eno1   Link encap:Ethernet  HWaddr FF:FF:FF:FF:FF:FF (redacted)
          inet addr:10.10.30.117  Bcast:10.10.30.255  Mask:255.255.255.0
          inet6 addr: fe80::1a66:daff:fe55:6bdc/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:9552045918 errors:0 dropped:4 overruns:0 frame:0
          TX packets:8731602524 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:25169343655058 (25.1 TB)  TX bytes:20302362419370 (20.3 TB)

eno1      Link encap:Ethernet  HWaddr FF:FF:FF:FF:FF:FF (redacted)
          inet6 addr: fe80::1a66:daff:fe55:6bdc/64 Scope:Link
          UP BROADCAST RUNNING PROMISC MULTICAST  MTU:1500  Metric:1
          RX packets:27433132917 errors:0 dropped:821138 overruns:0 frame:0
          TX packets:25763792601 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:31217303277897 (31.2 TB)  TX bytes:26547305328673 (26.5 TB)
          Interrupt:18

eno2      Link encap:Ethernet  HWaddr FF:FF:FF:FF:FF:FF (redacted)
          inet addr:10.189.134.118  Bcast:10.189.134.255  Mask:255.255.255.0
          inet6 addr: fe80::1a66:daff:fe55:6bdd/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:23432963 errors:0 dropped:0 overruns:0 frame:0
          TX packets:34 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:2858920977 (2.8 GB)  TX bytes:2404 (2.4 KB)
          Interrupt:19

eno3      Link encap:Ethernet  HWaddr FF:FF:FF:FF:FF:FF (redacted)
          UP BROADCAST MULTICAST  MTU:1500  Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:0 (0.0 B)  TX bytes:0 (0.0 B)
          Interrupt:19

eno4      Link encap:Ethernet  HWaddr FF:FF:FF:FF:FF:FF (redacted)
          UP BROADCAST MULTICAST  MTU:1500  Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:0 (0.0 B)  TX bytes:0 (0.0 B)
          Interrupt:16

gre_sys   Link encap:Ethernet  HWaddr FF:FF:FF:FF:FF:FF (redacted)
          inet6 addr: fe80::d061:36ff:fecd:3bdf/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:65000  Metric:1
          RX packets:1247735590 errors:0 dropped:0 overruns:0 frame:0
          TX packets:1053172217 errors:0 dropped:8 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:934609315304 (934.6 GB)  TX bytes:1138575443474 (1.1 TB)

lo        Link encap:Local Loopback
          inet addr:127.0.0.1  Mask:255.0.0.0
          inet6 addr: ::1/128 Scope:Host
          UP LOOPBACK RUNNING  MTU:65536  Metric:1
          RX packets:874404497 errors:0 dropped:0 overruns:0 frame:0
          TX packets:874404497 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1
          RX bytes:1422560696594 (1.4 TB)  TX bytes:1422560696594 (1.4 TB)

lxdbr0    Link encap:Ethernet  HWaddr FF:FF:FF:FF:FF:FF (redacted)
          inet addr:10.0.216.1  Bcast:0.0.0.0  Mask:255.255.255.0
          inet6 addr: fe80::d83b:4eff:fedb:7be0/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:9 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:0 (0.0 B)  TX bytes:750 (750.0 B)

qbr267cccc8-45 Link encap:Ethernet  HWaddr FF:FF:FF:FF:FF:FF (redacted)
          UP BROADCAST RUNNING MULTICAST  MTU:1458  Metric:1
          RX packets:257167 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:8981790 (8.9 MB)  TX bytes:0 (0.0 B)
.
.
.
.
tap267cccc8-45 Link encap:Ethernet  HWaddr FF:FF:FF:FF:FF:FF (redacted)
          inet6 addr: fe80::fc16:3eff:fede:d180/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1458  Metric:1
          RX packets:4801309 errors:0 dropped:0 overruns:0 frame:0
          TX packets:6300403 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:12100707022 (12.1 GB)  TX bytes:3222243030 (3.2 GB)
.
.
.
.
vethWY9OQC Link encap:Ethernet  HWaddr FF:FF:FF:FF:FF:FF (redacted)
          inet6 addr: fe80::fc50:b6ff:fe7a:2584/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:533168318 errors:0 dropped:0 overruns:0 frame:0
          TX packets:468982413 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:191221371188 (191.2 GB)  TX bytes:227602758832 (227.6 GB)
工作服务器ovs vsctl显示

fc878983-8ae5-479f-999f-d809f5a2ba8f
    Manager "ptcp:6640:127.0.0.1"
        is_connected: true
    Bridge br-data
        Port "eno1"
            Interface "eno1"
        Port br-data
            Interface br-data
                type: internal
    Bridge br-ex
        Port br-ex
            Interface br-ex
                type: internal
    Bridge br-int
        Controller "tcp:127.0.0.1:6633"
            is_connected: true
        fail_mode: secure
        Port br-int
            Interface br-int
                type: internal
    ovs_version: "2.9.5"
be5c20fd-46ef-4991-8dc3-3860944308e5
    Manager "ptcp:6640:127.0.0.1"
        is_connected: true
    Bridge br-data
        Controller "tcp:127.0.0.1:6633"
            is_connected: true
        fail_mode: secure
        Port "eno1"
            Interface "eno1"
                error: "could not add network device eno1 to ofproto (Device or resource busy)"
        Port "eno2"
            Interface "eno2"
        Port br-data
            Interface br-data
                type: internal
        Port phy-br-data
            Interface phy-br-data
                type: patch
                options: {peer=int-br-data}
    Bridge br-tun
        Controller "tcp:127.0.0.1:6633"
            is_connected: true
        fail_mode: secure
        Port patch-int
            Interface patch-int
                type: patch
                options: {peer=patch-tun}
        Port "gre-0a0a1e7f"
            Interface "gre-0a0a1e7f"
                type: gre
                options: {df_default="true", in_key=flow, local_ip="10.10.30.117", out_key=flow, remote_ip="10.10.30.127"}
        Port "gre-0a0a1e74"
            Interface "gre-0a0a1e74"
                type: gre
                options: {df_default="true", in_key=flow, local_ip="10.10.30.117", out_key=flow, remote_ip="10.10.30.116"}
        Port "gre-0a0a1e76"
            Interface "gre-0a0a1e76"
                type: gre
                options: {df_default="true", in_key=flow, local_ip="10.10.30.117", out_key=flow, remote_ip="10.10.30.118"}
        Port br-tun
            Interface br-tun
                type: internal
    Bridge br-int
        Controller "tcp:127.0.0.1:6633"
            is_connected: true
        fail_mode: secure
        Port "qvo5560dd35-7e"
            tag: 2
            Interface "qvo5560dd35-7e"
        Port patch-tun
            Interface patch-tun
                type: patch
                options: {peer=patch-int}
        Port "qvo97c660e7-e3"
            tag: 1
            Interface "qvo97c660e7-e3"
        Port "qvo44aeabe3-de"
            tag: 1
            Interface "qvo44aeabe3-de"
        Port "qvo267cccc8-45"
            tag: 1
            Interface "qvo267cccc8-45"
        Port "qvofdf0ce36-50"
            tag: 2
            Interface "qvofdf0ce36-50"
        Port "qvof193baf6-c0"
            tag: 1
            Interface "qvof193baf6-c0"
        Port "qvod9facd45-41"
            tag: 1
            Interface "qvod9facd45-41"
        Port "qvoeeab657c-df"
            tag: 1
            Interface "qvoeeab657c-df"
        Port "qvodd4b9252-e5"
            tag: 1
            Interface "qvodd4b9252-e5"
        Port br-int
            Interface br-int
                type: internal
        Port "qvoc841a7f1-25"
            tag: 2
            Interface "qvoc841a7f1-25"
        Port "qvod6b38e4c-a1"
            tag: 2
            Interface "qvod6b38e4c-a1"
        Port int-br-data
            Interface int-br-data
                type: patch
                options: {peer=phy-br-data}
    Bridge br-ex
        Port br-ex
            Interface br-ex
                type: internal
    ovs_version: "2.9.2"
服务器损坏/var/log/jujuju/unit-中子-openvswitch.log 这些是机器在管理网络(eno1)上失去连接之前的最后一行

然后,我们看到以下内容(只能在现场或通过eno2连接进入):

2020-05-26 18:08:53错误juju.api监视器。go:59 health ping在30秒后超时
2020-05-26 18:08:53错误jujuju.worker.dependency engine.go:551“api调用者”歧管工作器返回意外错误:api连接意外断开
2020-05-26 18:08:53信息juju从templates/queens/openvswitch_agent.ini下载的日志模板
2020-05-26 18:08:53信息juju日志从模板呈现:/etc/neutron/plugins/ml2/openvswitch_agent.ini
2020-05-26 18:08:53信息juju日志编写了模板/etc/中子/plugins/ml2/openvswitch_agent.ini。
2020-05-26 18:08:54调试用于amqp的juju日志生成模板上下文
2020-05-26 18:08:54调试配置更改回溯(最近一次调用上次):
2020-05-26 18:08:54调试配置已更改文件“/var/lib/jujuju/agents/unit-中子-openvswitch-43/charm/hooks/config已更改”,第266行,在
2020-05-26 18:08:54调试配置已更改main()
2020-05-26 18:08:54调试配置已更改文件“/var/lib/jujuju/agents/unit-中子-openvswitch-43/charm/hooks/config已更改”,第259行,主视图
2020-05-26 18:08:54调试配置已更改挂钩。执行(sys.argv)
2020-05-26 18:08:54调试配置更改文件“/var/lib/jujuju/agents/unit-中子-openvswitch-43/charm/hooks/charmhelpers/core/hookenv.py”,第914行,执行
2020-05-26 18:08:54调试配置已更改自身。挂钩[挂钩名称]()
2020-05-26 18:08:54调试配置更改文件“/var/lib/jujuju/agents/unit-中子-openvswitch-43/charm/hooks/charmhelpers/contrib/openstack/utils.py”,第1568行
2020-05-26 18:08:54调试配置已更改(停止启动、重新启动功能)
2020-05-26 18:08:54调试配置更改文件“/var/lib/jujuju/agents/unit-中子-openvswitch-43/charm/hooks/charmhelpers/core/host.py”,第741行,在“更改”助手上重新启动
2020-05-26 18:08:54调试配置已更改r=lambda_f()
2020-05-26 18:08:54调试配置更改文件“/var/lib/jujuju/agents/unit-中子-openvswitch-43/charm/hooks/charmhelpers/contrib/openstack/utils.py”,第1567行,在
2020-05-26 18:08:54调试配置已更改(lambda:f(*args,**kwargs)),\u重新启动\u映射\u缓存['cache'],
2020-05-26 18:08:54调试配置更改文件“/var/lib/jujuju/agents/unit-中子-openvswitch-43/charm/hooks/config-changed”,第150行,配置更改
2020-05-26 18:08:54调试配置已更改配置写入
2020-05-26 18:08:54调试配置更改文件“/var/lib/jujuju/agents/unit-中子-openvswitch-43/charm/hooks/charmhelpers/contrib/openstack/templating.py”,第334行,全部写入
2020-05-26 18:08:54调试配置已更改[在六个.iterkeys(self.templates)中为k编写self.write(k)]
2020-05-26 18:08:54调试配置更改文件“/var/lib/jujuju/agents/unit-中子-openvswitch-43/charm/hooks/charmhelpers/contrib/openstack/templating.py”,第334行,in
2020-05-26 18:08:54调试配置已更改[在六个.iterkeys(self.templates)中为k编写self.write(k)]
2020-05-26 18:08:54调试配置更改文件“/var/lib/jujuju/agents/unit-中子-openvswitch-43/charm/hooks/charmhelpers/contrib/openstack/templating.py”,第321行,写入
2020-05-26 18:08:54调试配置已更改
auto lo
iface lo inet loopback

auto lo
iface lo inet loopback
    dns-nameservers 10.10.30.99 10.244.0.66 10.244.0.67
    dns-search maas

auto eno1
iface eno1 inet manual
    mtu 1500

auto eno2
iface eno2 inet static
    address 10.189.134.103/24
    dns-nameservers 10.189.134.99 10.244.0.66 10.244.0.67
    mtu 1500

auto br-eno1
iface br-eno1 inet static
    address 10.10.30.101/24
    dns-nameservers 10.10.30.99 10.244.0.66 10.244.0.67
    gateway 10.10.30.254
    bridge_ports eno1