2016-02-25T15:08:18.994+0800 E QUERY [thread1] Error: count failed: {
"code" : 16340,
"ok" : 0,
"errmsg" : "No replica set monitor active and no cached seed found for set: shard3"
} :
I checked the logs of the two shard instances and found no errors:
2016-02-25T14:16:26.039+0800 I SHARDING [LockPinger] cluster 139.122.10.145:27018,139.122.10.146:27018,139.122.10.23:27018 pinged successfully at 2016-02-25T14:16:24.849+0800 by distributed lock pinger '139.122.10.145:27018,139.122.10.146:27018,139.122.10.23:27018/APGW02:27021:1456298014:1883953971', sleeping for 30000ms
2016-02-25T14:16:56.455+0800 I SHARDING [LockPinger] cluster 139.122.10.145:27018,139.122.10.146:27018,139.122.10.23:27018 pinged successfully at 2016-02-25T14:16:56.291+0800 by distributed lock pinger '139.122.10.145:27018,139.122.10.146:27018,139.122.10.23:27018/APGW02:27021:1456298014:1883953971', sleeping for 30000ms
2016-02-25T14:17:05.690+0800 I NETWORK [conn988] end connection 139.122.10.145:27348 (58 connections now open)
2016-02-25T14:17:21.060+0800 I NETWORK [conn989] end connection 139.122.10.145:27353 (57 connections now open)
2016-02-25T14:17:26.635+0800 I SHARDING [LockPinger] cluster 139.122.10.145:27018,139.122.10.146:27018,139.122.10.23:27018 pinged successfully at 2016-02-25T14:17:26.485+0800 by distributed lock pinger '139.122.10.145:27018,139.122.10.146:27018,139.122.10.23:27018/APGW02:27021:1456298014:1883953971', sleeping for 30000ms
I then restarted them and ran the collection count again from the mongos client, and this time mongos itself crashed as well:
2016-02-25T15:15:59.901+0800 I SHARDING [LockPinger] cluster 139.122.10.145:27018,139.122.10.146:27018,139.122.10.23:27018 pinged successfully at 2016-02-25T15:15:59.775+0800 by distributed lock pinger '139.122.10.145:27018,139.122.10.146:27018,139.122.10.23:27018/APGW01:27017:1456284050:1804289383', sleeping for 30000ms
2016-02-25T15:16:03.636+0800 F ASIO [NetworkInterfaceASIO-TaskExecutorPool-9-0] Uncaught exception in NetworkInterfaceASIO IO worker thread of type: UnknownError Caught std::exception of type std::system_error: thread: Resource temporarily unavailable
2016-02-25T15:16:03.636+0800 I - [NetworkInterfaceASIO-TaskExecutorPool-9-0] Fatal Assertion 28820
2016-02-25T15:16:03.636+0800 I - [NetworkInterfaceASIO-TaskExecutorPool-9-0]
***aborting after fassert() failure
2016-02-25T15:16:04.199+0800 F - [NetworkInterfaceASIO-TaskExecutorPool-9-0] Got signal: 6 (Aborted).
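
The fatal line above, `std::system_error: thread: Resource temporarily unavailable`, means the process failed to create a new thread. One common cause on Linux is hitting the per-user process/thread limit (`ulimit -u`). Nothing in these logs confirms that this is what happened here, so treat it as an assumption to check. A minimal Python sketch for reading that limit, to be run as the same OS user that starts mongos (the helper names `describe` and `nproc_limits` are made up for illustration):

```python
import resource


def describe(limit):
    """Render an rlimit value the way `ulimit` prints it."""
    return "unlimited" if limit == resource.RLIM_INFINITY else str(limit)


def nproc_limits():
    """Return the (soft, hard) cap on processes/threads for the current user."""
    return resource.getrlimit(resource.RLIMIT_NPROC)


if __name__ == "__main__":
    soft, hard = nproc_limits()
    # A low soft limit is only a hint; it does not prove the cause of the crash.
    print("max user processes: soft=%s hard=%s" % (describe(soft), describe(hard)))
```

If the soft limit turns out to be low compared with the number of connections and worker threads mongos opens, raising it for that user and restarting mongos is one thing to try.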