| Commit message | Author | Age | Files | Lines |
Add more sections to the memory breakdown.
If a node's memory is measured using the `allocator` or `rss`
strategies, we can calculate unused memory and report it
separately in the memory breakdown.
Performance enhancements
[#152240490]
[#152240490]
Handle errors returned via RPC during shutdown
Added an exit() statement to trigger an error and then some case statements to handle it.
QA of commit 51aca8851eba66445ad5b164fc92b2ffa0692fc2
Flush messages to disk in batches.
If messages should be embedded in a queue index, there is
no credit-flow limit, so message batches can grow too large
and block the queue process.
Limiting the batch size allows consumers to make progress while
publishers are blocked by the paging-out process.
[#151614048]
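The batching-with-a-cap pattern described above can be sketched like this (illustrative only; `max_batch` stands in for the real limit and `write_batch` for the actual disk write):

```python
def flush_in_batches(messages, write_batch, max_batch=64):
    """Flush messages to disk in bounded batches so that a single
    flush cannot monopolize the queue process for too long."""
    batch = []
    for msg in messages:
        batch.append(msg)
        if len(batch) >= max_batch:
            write_batch(batch)  # hand a full batch to the writer
            batch = []
    if batch:
        write_batch(batch)      # flush the final partial batch

flushed = []
flush_in_batches(range(150), flushed.append, max_batch=64)
```

Between batches the process can handle other work (e.g. consumer deliveries), which is exactly why the cap keeps consumers unblocked while paging out.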
We already have `rabbit.channel_max` and `rabbit.frame_max`.
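A `max_connections`-style cap analogous to those existing limits could be enforced along these lines (a hypothetical sketch, not RabbitMQ's actual connection-tracking code):

```python
class ConnectionLimiter:
    """Refuse new connections once a configured cap is reached,
    analogous to the existing channel_max / frame_max limits."""
    def __init__(self, max_connections):
        self.max_connections = max_connections
        self.active = 0

    def try_accept(self):
        if self.active >= self.max_connections:
            return False  # limit reached: refuse the connection
        self.active += 1
        return True

    def on_close(self):
        self.active -= 1  # free a slot when a connection closes

limiter = ConnectionLimiter(max_connections=2)
results = [limiter.try_accept() for _ in range(3)]
```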
https://github.com/cloudamqp/rabbitmq-server into cloudamqp-max_connections
'rabbitmqctl-shutdown-custom-exit-code-to-indicate-server-reported-failure' into stable
This way, tools that manage RabbitMQ nodes can detect this condition
among a bunch of fairly generic exit codes.
Note that when a timeout occurs, we still use a "temporary failure" code.
See rabbitmq/rabbitmq-server#1396 for context.
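The mapping can be sketched as follows. The specific numbers here are assumptions loosely following `sysexits.h` conventions, not necessarily the codes rabbitmqctl actually uses:

```python
# Illustrative exit codes (assumed values, in the spirit of sysexits.h):
EX_OK = 0
EX_TEMPFAIL = 75          # timeouts still map to "temporary failure"
EX_SERVER_REPORTED = 69   # distinct code: the server itself reported the failure

def shutdown_exit_code(outcome):
    """Map a shutdown outcome to an exit code so that tooling can tell
    a server-reported failure apart from generic local errors."""
    if outcome == "ok":
        return EX_OK
    if outcome == "timeout":
        return EX_TEMPFAIL
    if outcome == "server_reported_error":
        return EX_SERVER_REPORTED
    return 1  # generic failure

codes = [shutdown_exit_code(o) for o in ("ok", "timeout", "server_reported_error")]
```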
Default mem calc strategy should be "rss"
This is due to a scenario in which the Erlang VM allocator stats report a huge increase in memory consumption that is reflected only in a VSS increase, not in RSS.
PT #152081051
Shutdown cannot fail
The goal of maybe_keep_required_deps() is still the same: we don't want
to list applications from $RABBITMQ_PLUGINS_DIR that the rabbit application
depends on. Now the function also handles indirect dependencies.
Signed-off-by: Jean-Sébastien Pedron <jean-sebastien@rabbitmq.com>
[#136346167]
This commit is cherry-picked from `master` because rabbit_common now
depends on recon: having rabbit_common depend on 3rd-party applications
had not happened in stable before and wasn't supported. This broke the
creation of the standalone package.
(cherry picked from commit cbbcc3e32f97b6047b4f82f0776717dd3e85430c)
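Collecting indirect dependencies amounts to taking the transitive closure of the dependency relation, which can be sketched as (names and the sample dependency map are illustrative):

```python
def required_deps(app, deps_of):
    """Collect direct *and* indirect dependencies of `app` -- the set
    a function like maybe_keep_required_deps() would exclude from the
    plugins listing."""
    seen = set()
    stack = [app]
    while stack:
        current = stack.pop()
        for dep in deps_of.get(current, []):
            if dep not in seen:
                seen.add(dep)
                stack.append(dep)  # follow this dependency's own deps
    return seen

deps = required_deps("rabbit", {
    "rabbit": ["rabbit_common"],
    "rabbit_common": ["recon"],  # 3rd-party, reachable only indirectly
})
```

With only direct dependencies considered, `recon` would be missed and wrongly listed as a plugin, which is the breakage described above.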
Update default node memory calculation strategy to match rabbitmq/rabbitmq-common#224
Integrate `looking_glass`
Signed-off-by: Loïc Hoguin <loic@rabbitmq.com>
Since looking_glass requires Erlang 19, `maps:from_list` will not be
called on earlier Erlang versions where it is not defined.
Signed-off-by: Loïc Hoguin <loic@rabbitmq.com>
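The guard here is the classic "check before calling an API that may not exist" pattern (in Erlang, typically `erlang:function_exported/3`). A rough Python analogue, purely for illustration:

```python
import math

def call_if_exported(module, name, args, fallback):
    """Call module.name only if it exists, mirroring the Erlang
    pattern of guarding calls to functions that only newer releases
    provide; otherwise use a fallback implementation."""
    fn = getattr(module, name, None)
    if callable(fn):
        return fn(*args)
    return fallback(*args)

# math.isqrt exists on Python >= 3.8; older interpreters take the fallback.
result = call_if_exported(math, "isqrt", (17,), lambda x: int(x ** 0.5))
```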
An Erlang/Elixir/BEAM profiler tool:
https://github.com/rabbitmq/looking_glass
Signed-off-by: Loïc Hoguin <loic@rabbitmq.com>
Do not limit paging to disk if hibernated or resumed after credit
Remove consumer bias & allow queues under max load to drain quickly
This reverts commit 547be437426163fcd2655857e34f4d24b4994c3a.
The change in 1518b601036351e3f8686bab9c8ae87a8d5af0a3 is in line with the expected
stop behavior in 3.6.x.
For 3.7.0 we are still contemplating what it should be.
Given a queue process under max load, with both publishers & consumers,
if consumers are not **always** prioritised over publishers, a queue
can take 1 day (or more) to fully drain.
Even without consumer bias, queues can drain fast (e.g. 10
minutes in our case) or slowly (e.g. 1 hour or more). To put it
differently, this is what a slow drain looks like:
```
___ <- 2,000,000 messages
/ \__
/ \___ _ _
/ \___/ \_____/ \___
/ \
|-------------- 1h --------------|
```
And this is what a fast drain looks like:
```
_ <- 1,500,000 messages
/ \_
/ \___
/ \
|- 10 min -|
```
We are still trying to understand the reason behind different drain
rates, but without removing consumer bias, this would **always** happen:
```
______________ <- 2,000,000 messages
/ \_______________
/ \______________ ________
/ \__/ \______
/ \
|----------------------------- 1 day ---------------------------------|
```
Other observations worth capturing:
```
| PUBLISHERS | CONSUMERS | READY MESSAGES | PUBLISH MSG/S | CONSUME ACK MSG/S |
| ---------- | --------- | -------------- | --------------- | ----------------- |
| 3 | 3 | 0 | 22,000 - 23,000 | 22,000 - 23,000 |
| 3 | 3 | 1 - 2,000,000 | 5,000 - 8,000 | 7,000 - 11,000 |
| 3 | 0 | 1 - 2,000,000 | 21,000 - 25,000 | 0 |
| 3 | 0 | 2,000,000 | 5,000 - 15,000 | 0 |
```
* Empty queues are the fastest since messages are delivered straight to
consuming channels
* With 3 publishing channels, a single queue process gets saturated at
22,000 msg/s. The client that we used for this benchmark would max at
10,000 msg/s, meaning that we needed 3 clients, each with 1 connection
& 1 channel to max the queue process. It is possible that a single
fast client using 1 connection & 1 channel would achieve a slightly
higher throughput, but we didn't measure on this occasion. It's
highly unrealistic for a production, high-throughput RabbitMQ
deployment to use 1 publisher running 1 connection & 1 channel. If
anything, there would be many more publishers with many connections &
channels.
* When a queue process gets saturated, publishing channels & their
connections will enter flow state, meaning that the publishing rates
will be throttled. This allows the consuming channels to keep up with
the publishing ones. This is a good thing! A message backlog slows
both publishers & consumers, as the above table captures.
* Adding more publishers or consumers slows down publishing & consuming.
The queue process, and ultimately the Erlang VMs (typically 1 per
CPU), have more work to do, so it's expected for message throughput to
decrease.
Most relevant properties that we used for this benchmark:
```
| ERLANG | 19.3.6.2 |
| RABBITMQ | 3.6.12 |
| GCP INSTANCE TYPE | n1-standard-4 |
| -------------------- | ------------ |
| QUEUE | non-durable |
| MAX-LENGTH | 2,000,000 |
| -------------------- | ------------ |
| PUBLISHERS | 3 |
| PUBLISHER RATE MSG/S | 10,000 |
| MSG SIZE | 1KB |
| -------------------- | ------------ |
| CONSUMERS | 3 |
| PREFETCH | 100 |
| MULTI-ACK | every 10 msg |
```
Worth mentioning, `vm_memory_high_watermark_paging_ratio` was set to a
really high value so that messages would not be paged to disk. When
messages are paged out, all other queue operations are blocked,
including all publishes and consumes.
Artefacts attached to rabbitmq/rabbitmq-server#1378:
- [ ] RabbitMQ management screenshots
- [ ] Observer load charts
- [ ] OS metrics
- [ ] RabbitMQ definitions
- [ ] BOSH manifest with all RabbitMQ deployment properties
- [ ] benchmark app CloudFoundry manifests.yml
[#151499632]
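The scheduling change described above — consumers *always* served before publishers — can be sketched as a single step of a queue-process loop (an illustrative toy, not the actual rabbit_amqqueue_process logic; names and the per-step limits are assumptions):

```python
from collections import deque

def drain_step(queue, publishes, deliveries_per_step=10):
    """One scheduling step that strictly prioritises consumers:
    pending deliveries are handed out before any new publish is
    accepted, so a backlogged queue keeps draining."""
    delivered = 0
    while queue and delivered < deliveries_per_step:
        queue.popleft()     # deliver a ready message to a consumer
        delivered += 1
    if publishes:
        queue.append(publishes.pop(0))  # then accept at most one publish
    return delivered

q = deque(range(100))       # 100 ready messages
pubs = ["p1", "p2"]         # pending publishes
delivered = drain_step(q, pubs)
```

Because each step delivers more than it accepts, the backlog shrinks monotonically under load, which is the fast-drain behaviour the charts above show.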
Paging of messages to disk is limited by lazy_queue_explicit_gc_run_operation_threshold.
This is required to avoid GC on every publish. But for big messages the
process can get hibernated or blocked by the message store credit flow,
leaving the process holding a lot of memory.
When hibernating or unblocking, we should try to page messages to disk
regardless of the threshold, because we know that memory reduction is
required at that moment.
Fixes #1379
[#150916864]
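The decision rule amounts to bypassing the threshold on those two events (a sketch with illustrative names, not the real rabbit_variable_queue API):

```python
def should_page_out(pending_operations, threshold, event=None):
    """Decide whether to page messages to disk now. Normally we wait
    until `threshold` operations have accumulated (to avoid GC on
    every publish), but on hibernate/unblock we page regardless,
    because memory reduction is known to be needed then."""
    if event in ("hibernate", "unblocked"):
        return True  # bypass the threshold: reclaim memory now
    return pending_operations >= threshold

checks = [
    should_page_out(10, 1000),                    # below threshold: wait
    should_page_out(10, 1000, event="hibernate"), # hibernating: page now
    should_page_out(2000, 1000),                  # threshold reached: page
]
```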
This reverts commit 1518b601036351e3f8686bab9c8ae87a8d5af0a3.
There is no consensus on how it should work. In general it is impossible
to tell a stopped node from a node we could not contact (with some
specific exceptions reported by modern net_kernel versions, such as
an Erlang cookie mismatch), so some believe stop should be
idempotent and others believe it should not.