Spring integration spring xd在处理大量数据时丢失消息
我正在使用SpringXD,我的流如下图所示,并在3节点容器上运行测试,其中1个管理节点使用rabbit作为传输Spring integration spring xd在处理大量数据时丢失消息,spring-integration,spring-xd,Spring Integration,Spring Xd,我正在使用SpringXD,我的流如下图所示,并在3节点容器上运行测试,其中1个管理节点使用rabbit作为传输 aws-s3|processor1|http-client|processor2>queue:readyQueue 我已经创建了以下点击 tap1 aws-s3>s3Queue tap2 processor1>processorQueue1 tap3 http-client>httpQueue 我在测试中运行以下场景: Scenario1:5个
aws-s3|processor1|http-client|processor2>queue:readyQueue
我已经创建了以下点击
tap1 aws-s3>s3Queue
tap2 processor1>processorQueue1
tap3 http-client>httpQueue
我在测试中运行以下场景:
Scenario1
:5个200k=100万条记录的文件
http客户端=70和处理器2=30的并发性
我看到900k消息队列
我看到889k消息处理器队列1
我看到886k消息httpQueue
我看到883k消息处理器队列2
信息到处都是,而且是随机的
Scenario2:
5个文件,200k=100万条记录,所有模块并发性=1
我看到998800消息队列
我看到998760消息处理器队列1
我看到997540消息httpQueue
我看到997530消息处理器队列2
即使这个数字是随机的,也不一致
Scenario3
我如下所示更改了流,并发性=1,5个200k=100万条记录的文件
aws-s3 >testQueue
aws-s3 |processor1 >testQueue2
我收到我所有的信息,我运行了3次,没有问题。我收到了我所有的100万条信息
scenario4
我将流更改如下,并发性=15个文件,200k=100万条记录
aws-s3 >testQueue
aws-s3 |processor1 >testQueue2
我收到我所有的信息,我运行了3次,没有问题。我收到了我所有的100万条信息
scenario4
在scenario4和scenarion 3中,数据摄取速度更快,处理500万数据需要5分钟,兔子传输队列中的数据摄取速度更快,如每秒5k msg
在场景1中,即使s3模块以每秒300到1000 msg的速度拉取数据,数据摄取也会变慢
在场景2中,s3拉取数据的速度更快,但http客户端的速度较慢,如每秒100 msg,而aws-s3拉取数据的速度较快,如每秒3-4k msg
我认为xd线程正在导致问题,我正在丢失消息。请您帮助我如何解决此问题
更新
Scenario 5
我在http客户机中将回复超时更改为-1
,然后
我只输了37毫克
现在我再次运行第二次迭代,我丢失了25000 msg,我看到了发生这种情况时的咆哮容器日志
2016-03-04T03:42:04-0500 1.2.1.RELEASE ERROR task-scheduler-7 handler.LoggingHandler - org.springframework.messaging.MessageHandlingException: error occurred in message handler [org.springframework.integration.amqp.outbound.AmqpOutboundEndpoint@b6700b1]; nested exception is org.springframework.amqp.AmqpIOException: java.io.IOException
at org.springframework.integration.handler.AbstractMessageHandler.handleMessage(AbstractMessageHandler.java:84)
at org.springframework.xd.dirt.integration.rabbit.RabbitMessageBus$SendingHandler.handleMessageInternal(RabbitMessageBus.java:891)
at org.springframework.integration.handler.AbstractMessageHandler.handleMessage(AbstractMessageHandler.java:78)
at org.springframework.integration.dispatcher.AbstractDispatcher.tryOptimizedDispatch(AbstractDispatcher.java:116)
at org.springframework.integration.dispatcher.UnicastingDispatcher.doDispatch(UnicastingDispatcher.java:101)
at org.springframework.integration.dispatcher.UnicastingDispatcher.dispatch(UnicastingDispatcher.java:97)
at org.springframework.integration.channel.AbstractSubscribableChannel.doSend(AbstractSubscribableChannel.java:77)
at org.springframework.integration.channel.AbstractMessageChannel.send(AbstractMessageChannel.java:287)
at org.springframework.integration.channel.interceptor.WireTap.preSend(WireTap.java:129)
at org.springframework.integration.channel.AbstractMessageChannel$ChannelInterceptorList.preSend(AbstractMessageChannel.java:392)
at org.springframework.integration.channel.AbstractMessageChannel.send(AbstractMessageChannel.java:282)
at org.springframework.integration.channel.AbstractMessageChannel.send(AbstractMessageChannel.java:245)
at sun.reflect.GeneratedMethodAccessor204.invoke(Unknown Source)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:497)
at org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:317)
at org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:190)
at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:157)
at org.springframework.integration.monitor.DirectChannelMetrics.monitorSend(DirectChannelMetrics.java:114)
at org.springframework.integration.monitor.DirectChannelMetrics.doInvoke(DirectChannelMetrics.java:98)
at org.springframework.integration.monitor.DirectChannelMetrics.invoke(DirectChannelMetrics.java:92)
at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:179)
at org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:207)
at com.sun.proxy.$Proxy1537.send(Unknown Source)
at org.springframework.messaging.core.GenericMessagingTemplate.doSend(GenericMessagingTemplate.java:115)
at org.springframework.messaging.core.GenericMessagingTemplate.doSend(GenericMessagingTemplate.java:45)
at org.springframework.messaging.core.AbstractMessageSendingTemplate.send(AbstractMessageSendingTemplate.java:95)
at org.springframework.integration.handler.AbstractMessageProducingHandler.sendOutput(AbstractMessageProducingHandler.java:231)
at org.springframework.integration.handler.AbstractMessageProducingHandler.produceOutput(AbstractMessageProducingHandler.java:154)
at org.springframework.integration.splitter.AbstractMessageSplitter.produceOutput(AbstractMessageSplitter.java:157)
at org.springframework.integration.handler.AbstractMessageProducingHandler.sendOutputs(AbstractMessageProducingHandler.java:102)
at org.springframework.integration.handler.AbstractReplyProducingMessageHandler.handleMessageInternal(AbstractReplyProducingMessageHandler.java:105)
Caused by: org.springframework.amqp.AmqpIOException: java.io.IOException
at org.springframework.amqp.rabbit.support.RabbitExceptionTranslator.convertRabbitAccessException(RabbitExceptionTranslator.java:63)
at org.springframework.amqp.rabbit.connection.SimpleConnection.createChannel(SimpleConnection.java:51)
at org.springframework.amqp.rabbit.connection.CachingConnectionFactory$ChannelCachingConnectionProxy.createBareChannel(CachingConnectionFactory.java:758)
at org.springframework.amqp.rabbit.connection.CachingConnectionFactory$ChannelCachingConnectionProxy.access$300(CachingConnectionFactory.java:747)
at org.springframework.amqp.rabbit.connection.CachingConnectionFactory.doCreateBareChannel(CachingConnectionFactory.java:419)
at org.springframework.amqp.rabbit.connection.CachingConnectionFactory.createBareChannel(CachingConnectionFactory.java:395)
at org.springframework.amqp.rabbit.connection.CachingConnectionFactory.getCachedChannelProxy(CachingConnectionFactory.java:364)
at org.springframework.amqp.rabbit.connection.CachingConnectionFactory.getChannel(CachingConnectionFactory.java:357)
at org.springframework.amqp.rabbit.connection.CachingConnectionFactory.access$1100(CachingConnectionFactory.java:75)
at org.springframework.amqp.rabbit.connection.CachingConnectionFactory$ChannelCachingConnectionProxy.createChannel(CachingConnectionFactory.java:763)
at org.springframework.amqp.rabbit.connection.ConnectionFactoryUtils$1.createChannel(ConnectionFactoryUtils.java:85)
at org.springframework.amqp.rabbit.connection.ConnectionFactoryUtils.doGetTransactionalResourceHolder(ConnectionFactoryUtils.java:134)
at org.springframework.amqp.rabbit.connection.ConnectionFactoryUtils.getTransactionalResourceHolder(ConnectionFactoryUtils.java:67)
at org.springframework.amqp.rabbit.core.RabbitTemplate.doExecute(RabbitTemplate.java:1035)
at org.springframework.amqp.rabbit.core.RabbitTemplate.execute(RabbitTemplate.java:1028)
at org.springframework.amqp.rabbit.core.RabbitTemplate.send(RabbitTemplate.java:540)
at org.springframework.amqp.rabbit.core.RabbitTemplate.convertAndSend(RabbitTemplate.java:635)
at org.springframework.integration.amqp.outbound.AmqpOutboundEndpoint.send(AmqpOutboundEndpoint.java:331)
at org.springframework.integration.amqp.outbound.AmqpOutboundEndpoint.handleRequestMessage(AmqpOutboundEndpoint.java:323)
at org.springframework.integration.handler.AbstractReplyProducingMessageHandler.handleMessageInternal(AbstractReplyProducingMessageHandler.java:99)
at org.springframework.integration.handler.AbstractMessageHandler.handleMessage(AbstractMessageHandler.java:78)
... 93 more
Caused by: java.io.IOException
at com.rabbitmq.client.impl.AMQChannel.wrap(AMQChannel.java:106)
at com.rabbitmq.client.impl.AMQChannel.wrap(AMQChannel.java:102)
at com.rabbitmq.client.impl.AMQChannel.exnWrappingRpc(AMQChannel.java:124)
at com.rabbitmq.client.impl.ChannelN.open(ChannelN.java:125)
at com.rabbitmq.client.impl.ChannelManager.createChannel(ChannelManager.java:134)
at com.rabbitmq.client.impl.AMQConnection.createChannel(AMQConnection.java:499)
at org.springframework.amqp.rabbit.connection.SimpleConnection.createChannel(SimpleConnection.java:44)
... 112 more
Caused by: com.rabbitmq.client.ShutdownSignalException: connection error
at com.rabbitmq.utility.ValueOrException.getValue(ValueOrException.java:67)
at com.rabbitmq.utility.BlockingValueOrException.uninterruptibleGetValue(BlockingValueOrException.java:33)
at com.rabbitmq.client.impl.AMQChannel$BlockingRpcContinuation.getReply(AMQChannel.java:348)
at com.rabbitmq.client.impl.AMQChannel.privateRpc(AMQChannel.java:221)
at com.rabbitmq.client.impl.AMQChannel.exnWrappingRpc(AMQChannel.java:118)
... 116 more
Caused by: com.rabbitmq.client.impl.UnknownChannelException: Unknown channel number 23364
at com.rabbitmq.client.impl.ChannelManager.getChannel(ChannelManager.java:80)
at com.rabbitmq.client.impl.AMQConnection$MainLoop.run(AMQConnection.java:552)
... 1 more
2016-03-04T03:42:05-0500 1.2.1.RELEASE ERROR AMQP Connection xxx:5672 connection.CachingConnectionFactory - Channel shutdown: channel error; protocol method: #method<channel.close>(reply-code=404, reply-text=NOT_FOUND - no queue 'xdbus.tap-s3.tap:stream:stream.batch-aws-s3-source.0' in vhost '/', class-id=50, method-id=20)
2016-03-04T03:53:13-0500 1.2.1.RELEASE ERROR AMQP Connection xxx:5672 connection.CachingConnectionFactory - Channel shutdown: connection error
2016-03-04T03:53:13-0500 1.2.1.RELEASE ERROR AMQP Connection xxx:5672 connection.CachingConnectionFactory - Channel shutdown: channel error; protocol method: #method<channel.close>(reply-code=404, reply-text=NOT_FOUND - no queue 'xdbus.tap-s3.tap:stream:stream.batch-aws-s3-source.0' in vhost '/', class-id=50, method-id=20)
~
2016-03-04T02:57:54-0500 1.2.1.RELEASE ERROR AMQP Connection xxx:8080 connection.CachingConnectionFactory - Channel shutdown: connection error
2016-03-04T02:57:55-0500 1.2.1.RELEASE ERROR AMQP Connection xxx:8080 connection.CachingConnectionFactory - Channel shutdown: connection error
2016-03-04T03:42:04-0500 1.2.1.RELEASE ERROR AMQP Connection yyy:5672 connection.CachingConnectionFactory - Channel shutdown: connection error
兔形
spring:
rabbitmq:
addresses: host1:5672,host2:5672,host3:5672
adminAddresses: http://host1:15672,http://host2:15672,http://host3:15672
nodes: rabbit@host1.test.com,rabbit@host2.test.com,rabbit@host2.test.com
username: test
password: test
virtual_host: /
useSSL: false
sslProperties:
随着缓存大小增加到200而更新
我添加了您提供的xml,并将缓存大小增加到200。这是处理100万和80 k消息时的情况。只有我的http客户端并发度为100,其他所有并发度为1。缓慢处理停止的消息仍在http客户端队列之前,并且计数相同。但我命名通道中的消息计数缓慢增加,大约为10个消息/个一分钟,但是很慢
s3轮询器|处理器| http客户端>队列:batchCacheQueue
在http 186174之前,Msg未在队列中递减。但慢慢地,Msg将进入batchCacheQueue
要模拟的测试用例:
1) 我使用的是spring integration aws-s3源代码,在复合模块|类似处理器的xml解析| http客户机(具有并发100>命名通道)中使用了拆分器
2) 我认为文件源也可以工作。创建一个包含百万条记录的文件,并尝试从该文件中提取该文件
3) 大约运行4到5次后,我们看到发生了此异常
原因:com.rabbitmq.client.impl.UnknownChannelException:未知通道号23364
当频道被大量搅动时,我们发现了一个问题;您需要在兔子缓存连接工厂中增加通道缓存大小
看
我打开了一个,以便下一版本的Spring XD将在servers.yml中公开此设置,这样您就不必覆盖总线配置文件。谢谢Gary。我发现了消息丢失的问题,请查看我的更新。让我知道这是我们应该解决的问题。是的;增加缓存大小可以解决这个问题;我目前不知道未知通道异常的根本原因(我无法在测试中再现它),但增加缓存大小可以避免这种情况。我确实更新了它,MSG被挂起。请查看我的更新。如果您没有从指定通道消费,可能您的rabbit服务器上的资源不足。我看到队列在流中。也许看看rabbit日志?我再次运行了类似的测试,每10分钟从s3中提取200k条记录,而不是每5秒提取200k条记录。现在我看不出这个问题。添加摄入延迟有帮助。但我也将http客户端的并发性从100改为10。但这里我也不使用namedChannel。rabbit中也有这样的说明从属设备未同步,无法删除或清除该命名通道