delta/rabbitmq-server-git.git - github.com: rabbitmq/rabbitmq-server.git

	Commit message (Collapse)	Author	Age	Files	Lines
*	Integrate looking_glass	Gerhard Lazu	2017-10-04	2	-0/+51
\| \| \| \| \| \| \|	An Erlang/Elixir/BEAM profiler tool: https://github.com/rabbitmq/looking_glass Signed-off-by: Loïc Hoguin <loic@rabbitmq.com>
*	Merge pull request #1382 from rabbitmq/rabbitmq-server-1379rabbitmq_v3_6_13_milestone1	Michael Klishin	2017-10-03	1	-2/+2
\|\ \| \| \| \|	Do not limit paging to disk if hibernated or resumed after credit
\| *	Merge branch 'stable' into rabbitmq-server-1379	Michael Klishin	2017-10-03	2	-18/+12
\| \|\ \| \|/ \|/\|
* \|	Merge pull request #1378 from rabbitmq/always-prioritise-consumers	Michael Klishin	2017-10-02	1	-13/+5
\|\ \ \| \| \| \| \| \|	Remove consumer bias & allow queues under max load to drain quickly
\| * \	Merge branch 'stable' into always-prioritise-consumers	Michael Klishin	2017-09-30	1	-5/+7
\| \|\ \ \| \|/ / \|/\| \|
* \| \|	Revert "Revert "Ensure that exit code 0 is used for "stop" command""	Michael Klishin	2017-09-28	1	-5/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This reverts commit 547be437426163fcd2655857e34f4d24b4994c3a. The change in 1518b601036351e3f8686bab9c8ae87a8d5af0a3 is in line with the expected stop behavior in 3.6.x. For 3.7.0 we are still contemplating what it should be.
\| * \|	Remove consumer bias & allow queues under max load to drain quickly	Gerhard Lazu	2017-09-28	1	-13/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Given a queue process under max load, with both publishers & consumers, if consumers are not always prioritised over publishers, a queue can take 1 day (or more) to fully drain. Even without consumer bias, queues can drain fast (i.e. 10 minutes in our case), or slow (i.e. 1 hour or more). To put it differently, this is what a slow drain looks like: ``` ___ <- 2,000,000 messages / \__ / \___ _ _ / \___/ \_____/ \___ / \ \|-------------- 1h --------------\| ``` And this is what a fast drain looks like: ``` _ <- 1,500,000 messages / \_ / \___ / \ \|- 10 min -\| ``` We are still trying to understand the reason behind different drain rates, but without removing consumer bias, this would always happen: ``` ______________ <- 2,000,000 messages / \_______________ / \______________ ________ / \__/ \______ / \ \|----------------------------- 1 day ---------------------------------\| ``` Other observations worth capturing: ``` \| PUBLISHERS \| CONSUMERS \| READY MESSAGES \| PUBLISH MSG/S \| CONSUME ACK MSG/S \| \| ---------- \| --------- \| -------------- \| --------------- \| ----------------- \| \| 3 \| 3 \| 0 \| 22,000 - 23,000 \| 22,000 - 23,000 \| \| 3 \| 3 \| 1 - 2,000,000 \| 5,000 - 8,000 \| 7,000 - 11,000 \| \| 3 \| 0 \| 1 - 2,000,000 \| 21,000 - 25,000 \| 0 \| \| 3 \| 0 \| 2,000,000 \| 5,000 - 15,000 \| 0 \| ``` * Empty queues are the fastest since messages are delivered straight to consuming channels * With 3 publishing channels, a single queue process gets saturated at 22,000 msg/s. The client that we used for this benchmark would max at 10,000 msg/s, meaning that we needed 3 clients, each with 1 connection & 1 channel to max the queue process. It is possible that a single fast client using 1 connection & 1 channel would achieve a slightly higher throughput, but we didn't measure on this occasion. It's highly unrealistic for a production, high-throughput RabbitMQ deployment to use 1 publishers running 1 connection & 1 channel. If anything, there would be many more publishers with many connections & channels. * When a queue process gets saturated, publishing channels & their connections will enter flow state, meaning that the publishing rates will be throttled. This allows the consuming channels to keep up with the publishing ones. This is a good thing! A message backlog slows both publishers & consumers, as the above table captures. * Adding more publishers or consumers slow down publishinig & consuming. The queue process, and ultimately the Erlang VMs (typically 1 per CPU), have more work to do, so it's expected for message throughput to decrease. Most relevant properties that we used for this benchmark: ``` \| ERLANG \| 19.3.6.2 \| \| RABBITMQ \| 3.6.12 \| \| GCP INSTANCE TYPE \| n1-standard-4 \| \| -------------------- \| ------------ \| \| QUEUE \| non-durable \| \| MAX-LENGTH \| 2,000,000 \| \| -------------------- \| ------------ \| \| PUBLISHERS \| 3 \| \| PUBLISHER RATE MSG/S \| 10,000 \| \| MSG SIZE \| 1KB \| \| -------------------- \| ------------ \| \| CONSUMERS \| 3 \| \| PREFETCH \| 100 \| \| MULTI-ACK \| every 10 msg \| ``` Worth mentioning, `vm_memory_high_watermark_paging_ratio` was set to a really high value so that messages would not be paged to disc. When messages are paged out, all other queue operations are blocked, including all publishes and consumes. Artefacts attached to rabbitmq/rabbitmq-server#1378 : - [ ] RabbitMQ management screenshots - [ ] Observer Load Chars - [ ] OS metrics - [ ] RabbitMQ definitions - [ ] BOSH manifest with all RabbitMQ deployment properties - [ ] benchmark app CloudFoundry manifests.yml [#151499632]
\| \| *	Do not limit paging to disk if hibernated or resumed after credit	Daniil Fedotov	2017-09-28	1	-2/+2
\| \|/ \|/\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Paging of messages to disk is limited by lazy_queue_explicit_gc_run_operation_threshold This is required to avoid GC on every publish. But for big messages the process can get hibernated or blocked by the message store credit-flow, leaving the process with a lot of memory taken. When hibernating or unblocking we should try to page messages to disk regardless of the threshold, because we know that memory reduction is required at this moment. Fixes #1379 [#150916864]
* \|	Revert "Ensure that exit code 0 is used for "stop" command"	Michael Klishin	2017-09-28	1	-7/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This reverts commit 1518b601036351e3f8686bab9c8ae87a8d5af0a3. There is no consensus on how it should work. In general it is impossible to tell a stopped node from a node we could not contact (with some specific exceptions reported by modern net_kernel version such as the Erlang cookie mismatch), so some believe stop should be idempotent and others believe it should not.
* \|	Merge pull request #1376 from rabbitmq/rabbitmq-server-1362	Michael Klishin	2017-09-28	1	-5/+7
\|\ \ \| \|/ \|/\|	Ensure that exit code 0 is used for "stop" command
\| *	Ensure that exit code 0 is used for "stop" command	Luke Bakken	2017-09-26	1	-5/+7
\|/ \| \| \|	Part of the fix to #1362
*	Merge pull request #1372 from rabbitmq/rabbitmq-server-1371	Michael Klishin	2017-09-23	6	-33/+149
\|\ \| \| \| \|	Take ha-mode into account in choosing queue master
\| *	Be smarter about when to use all cluster nodes vs just those specified by ↵	Luke Bakken	2017-09-22	3	-8/+22
\| \| \| \| \| \| \| \|	ha-mode:nodes when picking initial queue master node
\| *	Fix test for ha-mode exactly. Requires sync-ing the queues to ensure slave ↵	Luke Bakken	2017-09-22	2	-15/+22
\| \| \| \| \| \| \| \|	pids are started
\| *	Squash a warning => compiles from scratch	Michael Klishin	2017-09-22	3	-26/+57
\| \|
\| *	Add three tests for different ha-mode scenarios with min-masters	Luke Bakken	2017-09-22	1	-3/+60
\| \|
\| *	Handle suggested_queue_nodes output when slave nodes are empty	Luke Bakken	2017-09-22	3	-7/+12
\| \|
\| *	Use EUnit assertions here	Michael Klishin	2017-09-22	1	-6/+6
\| \| \| \| \| \| \| \|	They produce more informative match failure reports.
\| *	Cosmetics	Michael Klishin	2017-09-22	3	-4/+4
\| \|
\| *	Take ha-mode into account in choosing queue master	Luke Bakken	2017-09-22	3	-6/+8
\|/ \| \| \|	Fixes #1371
*	Merge pull request #1366 from ↵	Michael Klishin	2017-09-22	2	-36/+160
\|\ \| \| \| \| \| \| \| \|	rabbitmq/sync-rabbitmq-config-example-with-configure-page Sync rabbitmq.config.example with PROJECT_ENV
\| *	Edits, more links	Michael Klishin	2017-09-22	1	-34/+48
\| \|
\| *	Additions & changes to example config as per rabbitmq-server#1366	Gerhard Lazu	2017-09-22	1	-8/+16
\| \|
\| *	Replace commented line with empty line	Gerhard Lazu	2017-09-20	1	-1/+1
\| \|
\| *	Remove redundant for	Gerhard Lazu	2017-09-20	1	-1/+1
\| \|
\| *	Fix typo: SSH -> SSFix typo: SSH -> SSL	Gerhard Lazu	2017-09-20	1	-1/+1
\| \|
\| *	Add last missing config properties to example config	Gerhard Lazu	2017-09-20	1	-0/+14
\| \| \| \| \| \| \| \|	This was the last pass, there are no more left in PROJECT_ENV, promise.
\| *	Add a few extra missing properties & correct defaults in example config	Gerhard Lazu	2017-09-20	1	-6/+19
\| \|
\| *	Add other missing config properties from PROJECT_ENV to example config	Gerhard Lazu	2017-09-20	1	-4/+41
\| \|
\| *	Remove extra details from new config properties added to example config	Gerhard Lazu	2017-09-20	1	-20/+8
\| \| \| \| \| \| \| \|	Add links instead, as per discussion with @michaelklishin & @hairyhum
\| *	Add msg_store_io_batch_size to example config	Gerhard Lazu	2017-09-20	1	-0/+7
\| \|
\| *	Remove extra details from msg_store_credit_disc_bound from example config	Gerhard Lazu	2017-09-20	1	-14/+0
\| \| \| \| \| \| \| \| \| \|	Having discussed with @michaelklishin & @hairyhum, it was agreed that these details belong in http://www.rabbitmq.com/persistence-conf.html
\| *	Explain the IoBatchSize role in push_betas_to_deltas	Gerhard Lazu	2017-09-20	1	-0/+3
\| \|
\| *	Clarify IO_BATCH_SIZE in betas -> deltas (not gammas!) comment	Gerhard Lazu	2017-09-20	1	-6/+6
\| \|
\| *	Add msg_store_credit_disc_bound to example config	Gerhard Lazu	2017-09-20	1	-0/+20
\| \|
\| *	Add mirroring_sync_batch_size to example config	Gerhard Lazu	2017-09-20	1	-0/+18
\| \|
\| *	Add delegate_count to example config	Gerhard Lazu	2017-09-20	1	-1/+6
\| \|
\| *	Add queue_master_locator to example config	Gerhard Lazu	2017-09-20	1	-0/+7
\| \|
\| *	Add all log categories to example config mentioned on the configure page	Gerhard Lazu	2017-09-19	1	-5/+9
\| \|
* \|	Merge pull request #1369 from rabbitmq/lrb-add-exec-stop	Michael Klishin	2017-09-19	1	-0/+1
\|\ \ \| \| \| \| \| \|	Add additional ExecStop line from rabbitmq-server-release
\| * \|	Add additional ExecStop line from rabbitmq-server-release	Luke Bakken	2017-09-19	1	-0/+1
\|/ / \| \| \| \| \| \| \| \| \| \|	This line is part of the release but not included here https://github.com/rabbitmq/rabbitmq-server-release/blob/stable/packaging/debs/Debian/debian/rabbitmq-server.service#L16
* \|	Merge pull request #1368 from rabbitmq/rabbitmq-server-1359	Michael Klishin	2017-09-19	1	-0/+8
\|\ \ \| \|/ \|/\|	Add optional Restart and RestartSec configuration
\| *	wording	Luke Bakken	2017-09-19	1	-2/+2
\| \|
\| *	Add optional Restart and RestartSec configuration	Luke Bakken	2017-09-19	1	-0/+8
\|/ \| \| \| \| \|	Fixes #1359 This gives guidances for those users who wish to automatically restart RabbitMQ in the event of a failure. Tested by using the `Restart=on-failure` setting, then running `rabbitmqctl eval "erlang:halt(abort)."`
*	Merge pull request #1365 from rabbitmq/rabbitmq-server-1364	Michael Klishin	2017-09-15	2	-2/+15
\|\ \| \| \| \|	Do not log anything when list of apps to stop is empty
\| *	Further improve logging around enable plugin set changes	Michael Klishin	2017-09-15	1	-2/+13
\| \|
\| *	Do not log anything when list of apps to stop is empty	Luke Bakken	2017-09-15	1	-0/+2
\| \|
* \|	Update rabbitmq-components.mk	Michael Klishin	2017-09-13	1	-2/+2
\| \|
* \|	Update rabbitmq-components.mk	Michael Klishin	2017-09-12	1	-1/+3
\| \|
* \|	Wording	Michael Klishin	2017-09-12	4	-4/+4
\| \|