Commit Graph

116 Commits

Author SHA1 Message Date
Hiram Chirino b0e91d47f5 Have the leveldb store thorw SuppressReplyExceptions instead of IOExceptions so that the clients retry try the operations instead of giving up. Also retry the problemantic getMessage() call which seems to fail at times. 2013-11-25 13:17:58 -05:00
Claus Ibsen 00cb9a5668 AMQ-4858: Moved scala source file from java to scala dir. Thanks to Jaromir Hamala for patch. 2013-11-22 11:25:08 +01:00
Claus Ibsen c0090f6f0c AMQ-4885: Renamed some API methods in MessageStore due typos. 2013-11-22 11:21:42 +01:00
Timothy Bish 38ab4b10a4 https://issues.apache.org/jira/browse/AMQ-4855
Deprecate the misspelled subcription named getters and use the correctly
spelled versions where the older ones are in use.
2013-11-11 11:40:07 -05:00
Hiram Chirino 42e1c463d4 Replicated leveldb slaves index snapshots were being labeled with higher journal positions than what they really contained. 2013-11-08 10:34:21 -05:00
Hiram Chirino 4367ec1b82 If a replicated leveldb slave's connection gets slow, lets merge together journal write events to avoid them queuing up on the master side. 2013-11-05 11:53:37 -05:00
Hiram Chirino a907fc9e94 leveldb store: Add more log traces for when we need to get more details on what's going on. 2013-11-04 09:26:01 -05:00
Hiram Chirino b1d8cbe4cd Fixes for leveldb replication: make sure we only apply index updates when we encounter a UOW_END_RECORD so that we don't end up with an inconsistent index if a partial UOW is replicated. 2013-11-04 09:26:01 -05:00
Hiram Chirino dea38e62f8 leveldb replication: Lets always download the current append log just to be safe 2013-11-04 09:26:01 -05:00
Hiram Chirino 2824a94af4 Continue to append to the last leveldb log file on a store restart. 2013-11-04 09:26:01 -05:00
Hiram Chirino 190da29691 Fix for: https://issues.apache.org/jira/browse/AMQ-4837 : LevelDB corrupted in AMQ cluster.
- The tracked flushed offset was not getting initialized to match the initial size of the log file.  This invalid flushed offset was being used to do the initial sync /w the slave and then things went downhill.
2013-11-01 12:13:33 -04:00
Hiram Chirino 24d5490e57 Fix for: https://issues.apache.org/jira/browse/AMQ-4837 : LevelDB corrupted in AMQ cluster.
- Log rotation was causing a pre-mature index snapshot to be taken on the slave (snapshot while the slave was still synchronizing).
- Also fix issue with the append position displayed in JMX for the master not being correct.
2013-10-31 12:52:50 -04:00
Hiram Chirino 3ed52ef8a2 Working on a test case for https://issues.apache.org/jira/browse/AMQ-4837 : LevelDB corrupted in AMQ cluster. 2013-10-30 15:44:06 -04:00
Hiram Chirino 17415ceecd Only register the leveldb store MBeans in JMX if jmx is enabled on the broker. 2013-10-30 15:34:29 -04:00
Hiram Chirino aaa1a74dfb Add a toString() method to the replicated leveldb store so that it displays a nicer message when the broker starts it up. 2013-10-30 12:22:28 -04:00
Hiram Chirino 1896d27409 Update to the latest leveldbjni release. 2013-10-17 09:30:07 -04:00
Hiram Chirino ef45a5f8ff Avoid NullPointerExceptions that can occur during leveldb replication M/S state transitions. 2013-10-15 12:43:00 -04:00
gtully 864aa6798f [maven-release-plugin] prepare for next development iteration 2013-10-14 23:38:35 +01:00
gtully 87c9bbebac [maven-release-plugin] prepare release activemq-5.9.0 2013-10-14 23:38:22 +01:00
Hiram Chirino d44c8968c4 Expose a JMX attribute to show when the last leveldb replication log entry was recorded.
This can be used to get a time estimate of far behind a master a slave is.
2013-10-14 08:45:54 -04:00
gtully ee0933c71e [maven-release-plugin] prepare for next development iteration 2013-10-11 14:51:36 +01:00
gtully b276006911 [maven-release-plugin] prepare release activemq-5.9.0 2013-10-11 14:51:24 +01:00
Claus Ibsen 59af7479b2 Polished the logging 2013-10-11 11:20:22 +02:00
Hiram Chirino 0214bfbfcd Support weighting the leveldb store replication nodes so that some have a better chance of becoming the master than others. 2013-10-10 15:42:02 -04:00
Hiram Chirino e90ce1aabb Don't display leveldb replication recovery progress on slave nodes. 2013-10-10 14:52:07 -04:00
gtully 4f108cead5 fix up missing license headers - keep mr. rat happy :-) 2013-10-10 11:41:58 +01:00
Hiram Chirino d609d2ae38 Make sure the leveldb append log file is flushed to disk before it's closed to avoid data loss. 2013-10-09 11:49:11 -04:00
Hiram Chirino 5e63ddd337 Fixing bug which caused replicated leveldb nodes to not recover from a ZooKeeper failure. 2013-10-09 10:56:34 -04:00
Hiram Chirino 119fdab1d0 Fixes leveldb replication bug that can cause read errors: make sure that log is flushed for records that are being read. 2013-10-09 09:15:48 -04:00
Hiram Chirino a5ac1a3630 Avoid logging ZooKeeper related exceptions during a replicated leveldb store shutdown. 2013-10-03 10:42:45 -04:00
Hiram Chirino b720b264e3 The the hostname on the replicated leveldb store test so that the test works better on machines with weird DNS settings. 2013-10-02 12:15:36 -04:00
Hiram Chirino 0b66abcbfb Fixing leveldb replication bug where loosing quorum did not properly restart the master in a clean way. 2013-10-02 12:15:36 -04:00
Hiram Chirino 8804341c19 Adding more debug info to help trouble shoot election issues with leveldb replication elections. 2013-10-02 10:45:58 -04:00
Hiram Chirino f75520fc8b Implementing AMQ-4744: Support using LevelDB as a nested store in mKahaDB 2013-09-27 09:19:48 -04:00
gtully efaa351db7 https://issues.apache.org/jira/browse/AMQ-4365 - allow lease locker to be used by kahadb - remove deps on jdbc pa. LockableService now passes a reference to a locker so it can pull the brokerService, extracted some of the jdbc lock common stuff, additional test kahadb with jdbc lease 2013-09-20 15:33:24 +01:00
Hiram Chirino d771ebb97e Fixes bug in replicated leveldb where log files on slaves were not getting GCed. 2013-09-16 11:58:03 -04:00
Hiram Chirino 1eca031356 leveldb replication Master was failing to give up being master after it's process is suspended by using ctrl-z. 2013-09-03 10:41:33 -04:00
Hiram R. Chirino f4d51e092b Adding a LevelDB version of the RedeliveryRestartTest. Implemented redelivery tracking in the leveldb store.
git-svn-id: https://svn.apache.org/repos/asf/activemq/trunk@1518289 13f79535-47bb-0310-9956-ffa450edef68
2013-08-28 17:20:25 +00:00
Hiram R. Chirino 45e1462609 Persist the latest producer position across restarts in the leveldb store.
git-svn-id: https://svn.apache.org/repos/asf/activemq/trunk@1515799 13f79535-47bb-0310-9956-ffa450edef68
2013-08-20 12:38:32 +00:00
Gary Tully 7c50c1c736 fix up failure - still leveldb variant problem that needs work - testQueueTransactionalOrderWithRestart - org.apache.activemq.bugs.AMQ2149LevelDBTest
git-svn-id: https://svn.apache.org/repos/asf/activemq/trunk@1512332 13f79535-47bb-0310-9956-ffa450edef68
2013-08-09 15:11:11 +00:00
Hiram R. Chirino 563cc1bc6a Warn when there are too many replication nodes attached.
git-svn-id: https://svn.apache.org/repos/asf/activemq/trunk@1506965 13f79535-47bb-0310-9956-ffa450edef68
2013-07-25 13:40:59 +00:00
Timothy A. Bish 194c6535cd [LevelDB]
https://issues.apache.org/jira/browse/AMQ-4296

Fixes remainder of failing unit tests.  The LevelDB wasn't incrementing or decrementing reference counts on messages added to the store which causes the expectations of certain memory limit based tests to fail as the memory usage was being updates after the store add instead of during so a message could get placed into the batch list of a cursor when we did not expect that it would.  This could also cause a browse to return fewer message than we want as the in memory messages would top out the usage limit so we'd never page in one batch of messages. 

git-svn-id: https://svn.apache.org/repos/asf/activemq/trunk@1505805 13f79535-47bb-0310-9956-ffa450edef68
2013-07-22 20:53:27 +00:00
Timothy A. Bish 11ed6b0e18 {LevelDB] fix for StoreQueueCursorLevelDBNoDuplicateTest failure. When the cursor calls setBatch and then recovering the next batch its expecting the batch to start at lastMsgId + 1 so we were seeing a single duplicate message and since audit was disabled it was delivered.
git-svn-id: https://svn.apache.org/repos/asf/activemq/trunk@1504668 13f79535-47bb-0310-9956-ffa450edef68
2013-07-18 21:21:38 +00:00
Hiram R. Chirino ae5abf143b Adding a container property to the zk node data used by the leveldb replication.
git-svn-id: https://svn.apache.org/repos/asf/activemq/trunk@1504626 13f79535-47bb-0310-9956-ffa450edef68
2013-07-18 20:04:45 +00:00
Hiram R. Chirino 86e2426d1b Improve the replicated leveldb bits: Avoid dependencies on fabric-group stuff. Makes it easier to embed in different versions of a fabric osgi env.
git-svn-id: https://svn.apache.org/repos/asf/activemq/trunk@1501877 13f79535-47bb-0310-9956-ffa450edef68
2013-07-10 17:49:03 +00:00
Timothy A. Bish 5f0dd8ab53 Fix for failing LveelDB unit tests where only non-persistent messages are sent in a TX. The preCommit wasn't being run so the Queue's orderIndexUpdates structure wasn't getting updated with the TX to process in the postCommit phase.
git-svn-id: https://svn.apache.org/repos/asf/activemq/trunk@1501420 13f79535-47bb-0310-9956-ffa450edef68
2013-07-09 18:15:26 +00:00
Hiram R. Chirino 2019a21d96 Simplify and improve the leveldb replication MBean
git-svn-id: https://svn.apache.org/repos/asf/activemq/trunk@1499754 13f79535-47bb-0310-9956-ffa450edef68
2013-07-04 13:47:27 +00:00
Hiram R. Chirino 34f3329370 When the leveldb replicated master was shutting down the client would get notified of a failure and it would not be hidden from the client app. We now suppress sending failure messages to clients when a broker is shutting down so that the client failover logic can kick in an reconnect the client to another server gracefully.
git-svn-id: https://svn.apache.org/repos/asf/activemq/trunk@1498978 13f79535-47bb-0310-9956-ffa450edef68
2013-07-02 15:28:42 +00:00
Hiram R. Chirino b98067732b Improve the replicated leveldb behavior when the number of nodes in the cluster falls below the required minimum. The master node will switch to electing mode. The master store startup will now also block until it syncs up with slaves so that we don't accept connections the master is fully online.
git-svn-id: https://svn.apache.org/repos/asf/activemq/trunk@1498601 13f79535-47bb-0310-9956-ffa450edef68
2013-07-01 17:38:13 +00:00
Hiram R. Chirino 16cd8c3954 Expose the replicated store status via JMX.
git-svn-id: https://svn.apache.org/repos/asf/activemq/trunk@1497843 13f79535-47bb-0310-9956-ffa450edef68
2013-06-28 16:57:29 +00:00