summaryrefslogtreecommitdiff
Commit message (Collapse)AuthorAgeFilesLines
* merge bug26119 into stableMatthias Radestock2014-04-231-8/+9
|\
| * don't erase aliases quite so eagerly on DOWNMatthias Radestock2014-04-231-8/+9
|/ | | | | since that can result in completely forgetting about a member when there are still acks making their way around the right.
* maintain gm ?GROUP_TABLE abstractionMatthias Radestock2014-04-221-3/+3
| | | | in yet another place
* refactor: extract gm view change handlingMatthias Radestock2014-04-211-29/+18
|
* cosmetic: consistencyMatthias Radestock2014-04-211-11/+12
|
* make gm record_new_member_in_group tx side effect freeMatthias Radestock2014-04-211-22/+14
|
* refactor: tweak and push through gm read_group abstractionMatthias Radestock2014-04-211-10/+13
|
* simplify: get rid of tmp varMatthias Radestock2014-04-211-57/+50
|
* enforce gm GROUP_TABLE abstractionMatthias Radestock2014-04-211-21/+17
| | | | | | ...by using mnesia:write/3 instead of /1 and generally abstract mnesia reading/writing and make it dirty reads obvious.
* merge bug26084 into stableMatthias Radestock2014-04-210-0/+0
|\
* | blank members_state after maybe_erase_aliasesMatthias Radestock2014-04-211-7/+9
| | | | | | | | since the latter operates on the former
* | merge stable into bug26084Matthias Radestock2014-04-210-0/+0
|\ \ | |/ |/|
* | refactor: simplify gm:internal_broadcastMatthias Radestock2014-04-211-19/+12
| | | | | | | | eliminate branching
* | Merge bug26117Simon MacMullen2014-04-178-61/+73
|\ \
| * | explainMatthias Radestock2014-04-171-12/+17
| | |
| * | go back to from members_changed/4 to /3Matthias Radestock2014-04-177-21/+18
| | |
| * | always return DeadPids, so that all deaths are loggedMatthias Radestock2014-04-161-6/+5
| | |
| * | gm_pids is the source of truthMatthias Radestock2014-04-161-20/+22
| | | | | | | | | | | | | | | | | | We base promotion decisions on the sub-set of [QPid | SPids] that is referenced in gm_pids. In particular, QPid may be missing from that, if another slave previously noticed its death.
| * | drive remove_from_queue with DeadGMPidsMatthias Radestock2014-04-163-28/+37
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | ...instead of LiveGMPids The latter may be out of date s.t. it contains fewer pids than are actually alive, due to new GMs having sprung into live recently. We'd then, incorrectly, remove the corresponding entries from the queue's mnesia record, causing havoc. DeadGMPids can be out of date too; it may contain fewer pids than have actually died, due to GMs having died more recently. That is harmless though since it never leads us to remove an entry that we shouldn't, and we will learn about any new deaths eventually.
* | | Merge bug 26123Simon MacMullen2014-04-173-88/+72
|\ \ \
| * | | monitor workersMatthias Radestock2014-04-162-2/+14
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | In a way this is a damage limitation exercise... When a worker dies while working it is not in the pool anyway, so noticing its death is a no-op. When a worker dies while idle, it will not return to the pool when we next submit work to it. But obviously the submitted work won't be carried out, and the submitter will get an error (unless they submitted the work asynchronously). All the monitoring does is reduce the likelihood of the latter happening. It cannot eliminate it though since the worker may die just as work was being submitted to it.
| * | | track workers by Pid instead of nameMatthias Radestock2014-04-163-59/+37
| | | |
| * | | record workers in a setMatthias Radestock2014-04-161-35/+29
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | ...instead of a queue. That way when an idle worker is restarted (and sends an 'idle' message to the pool), it won't end up in the pool twice. Note that we always hand out work to the first worker in the ordset. That is actually more efficient than the round-robin strategy we had with the queue since it keeps a smaller number of workers busy while others can hibernate.
* | | | Merge bug26127Simon MacMullen2014-04-170-0/+0
|\ \ \ \ | |/ / /
* | | | Remove rabbit_amqqueue:force_event_refresh/1 synchronySimon MacMullen2014-04-172-40/+22
|/ / /
* | | Merge bug26118Simon MacMullen2014-04-163-6/+11
|\ \ \ | |/ /
| * | cosmetic-ish: add a missing ?TABLE abstractionMatthias Radestock2014-04-161-4/+4
| | |
| * | cosmetic: prevent trailing whitespace in log messageMatthias Radestock2014-04-151-2/+2
| | |
| * | helper for starting a background rabbitMatthias Radestock2014-04-151-0/+5
| | |
* | | Handle hibernation in our pre-go state.Simon MacMullen2014-04-151-0/+3
|/ /
* | Merge bug26115Simon MacMullen2014-04-151-4/+4
|\ \
| * | eliminate accidental, crash-inducing assertionMatthias Radestock2014-04-141-4/+4
| | |
* | | Merge bug26113Simon MacMullen2014-04-151-1/+2
|\ \ \ | |/ / |/| |
| * | Add a capability for it.Simon MacMullen2014-04-151-1/+2
|/ /
* | Merge bug26084Simon MacMullen2014-04-142-41/+26
|\ \ | |/
| * eliminate comment duplicationMatthias Radestock2014-04-141-4/+1
| |
| * ensure propagation of master deathMatthias Radestock2014-04-131-8/+8
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Previously slaves were monitoring the master and sending a process_death message through gm in an attempt to get the updated gm state propagated (and thus members_changed callbacks getting invoked on slaves, which in turn trigger promotion). However, the DOWN notification for the master may well arrive and be processed, and the process_death message sent around the ring, before GM has noticed the master death, thus rendering the process_death broadcast ineffectual for its intended purpose. With the recent GM changes we can guarantee that at least one slave will be notified of the master death via GM. So we emit the process_death message then, and not on master DOWN (and ditch the master monitoring as a result). Since the new emission is triggered by a gm notification we know GM is aware of the master death, thus avoiding the aforementioned problem.
| * less lazy death notificationsMatthias Radestock2014-04-131-21/+9
| | | | | | | | | | | | | | | | | | | | | | When we receive a DOWN, it's possible that there are no messages from the dead member floating around anyway. So rather than waiting for some other activity to occur first, it is perfectly safe enough to always call maybe_erase_aliases. That way we get slightly less lazy death notification in this case (i.e. we can then be certain that either the dead member is fully removed immediately or there must still be messages from the dead member floating around. No other possibility exists).
| * ensure callback_view_changed is called whenever the view may have changedMatthias Radestock2014-04-131-8/+8
| | | | | | | | | | | | We had missed a case here. Ditto for check_neighbours.
* | Merge bug26102Simon MacMullen2014-04-110-0/+0
|\ \ | |/
* | Actually let's move the unlink/1 back, not convinced it is any clearer.Simon MacMullen2014-04-111-5/+4
| |
* | Monitor and wait for 'DOWN' from our own GM as well as the slaves, to make ↵Simon MacMullen2014-04-101-5/+6
|/ | | | sure our GM does not try to update group info after we have called forget_group/1. Also ensure we unlink from the coordinator when shutting down as well as when stopping mirroring, not strictly necessary but clearer.
* update copyright notice yearMatthias Radestock2014-04-101-1/+1
|
* Merge bug25855Simon MacMullen2014-04-094-17/+34
|\
| * More unique atomSimon MacMullen2014-04-091-2/+2
| |
| * Delayed restart of the disk monitor.Simon MacMullen2014-04-093-12/+25
| |
| * Make sure we have complete command output if we actually can't parse it.Simon MacMullen2014-04-091-5/+9
| |
* | Merge bug26100Simon MacMullen2014-04-092-8/+12
|\ \ | |/
| * Merge bug26103Simon MacMullen2014-04-091-1/+1
| |\
| | * Merge bug26104Simon MacMullen2014-04-091-1/+1
| | |\