Commit message

…in memory logging. Address code review concerns
Bump version number to 0.9.1
Update readme to show supported Kafka/Python versions
Validate arguments in consumer.py, add initial consumer unit test
Make service kill() child processes when startup fails
Add tests for util.py, fix a Python 2.6-specific bug.

…integration tests, make skipped integration tests also skip setupClass, implement rudimentary offset support in consumer.py

Fetch requests can be repeated if we get a ConsumerFetchSizeTooSmall or if _fetch() is called multiple times for some reason. We don't want to re-fetch messages that are already in our queue, so store the offsets of the last enqueued messages from each partition.
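
A minimal sketch of that de-duplication; fetched_offsets and queue are illustrative names, not the project's actual fields:

    # Drop anything at or below the last offset already enqueued for
    # this partition; a repeated fetch then contributes nothing twice.
    last = self.fetched_offsets.get(partition, -1)
    for message in messages:
        if message.offset <= last:
            continue  # already enqueued by an earlier _fetch()
        self.queue.put((partition, message))
        last = message.offset
    self.fetched_offsets[partition] = last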

* Increment the offset before returning a message rather than when putting it in the internal queue. This prevents committing the wrong offsets.
* In MultiProcessConsumer, store the offset of the next message
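
A sketch of the first point, assuming an internal queue of (partition, message) pairs (attribute names illustrative): the stored offset advances only when the caller actually receives the message, so a commit can never cover messages still sitting in the queue.

    def __iter__(self):
        while True:
            partition, message = self.queue.get()
            # Advance only at hand-off; a commit fired between fetch
            # and delivery therefore cannot skip this message.
            self.offsets[partition] = message.offset + 1
            yield message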

…mahendra-repr
Conflicts:
	kafka/client.py
	kafka/consumer.py

Various changes/fixes, including:
* Allow customizing socket timeouts
* Read the correct number of bytes from Kafka
* Guarantee reading the expected number of bytes from the socket every time
* Remove bufsize from client and conn
* SimpleConsumer flow changes
* Fix some error handling
* Add optional upper limit to consumer fetch buffer size
* Add and fix unit and integration tests

…size is too small
Note: This can cause fetching a message to exceed a given timeout, but timeouts are not guaranteed anyway, and in this case it is the client's fault for not sending a big enough buffer size, rather than the Kafka server's. This can be bad if max_fetch_size is None (no limit) and some message in Kafka is enormous, but that is exactly why we should have a max_fetch_size.

We always store the offset of the next available message, so we shouldn't decrement the offset deltas by an extra 1 when seeking.
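
Stated as code (a bare illustration, not the project's actual seek() signature): with self.offset always naming the next message, a relative seek is plain addition.

    def seek(self, delta):
        # self.offset is the *next* message to fetch, so no extra
        # +/- 1 correction is needed when moving relative to it.
        self.offset += delta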

…iterator to exit when reached.
Also move constant timeout values into pre-defined constants.

…to block forever if it's reached.

* Combine partition fetch requests into a single request
* Put the messages received in a queue and update offsets
* Grab as many messages from the queue as requested
* When the queue is empty, request more
* timeout param for get_messages() is the actual timeout for getting those messages
* Based on https://github.com/mumrah/kafka-python/pull/74 -
  don't increase min_bytes if the consumer fetch buffer size is too small.
Notes:
Change MultiProcessConsumer and _mp_consume() accordingly.
Previously, when querying each partition separately, it was possible to block waiting for messages on partition 0 even if there were new ones in partition 1. These changes allow us to block while waiting for messages on all partitions, and reduce the total number of Kafka requests.
Use Queue.Queue for the single-process queue instead of the already-imported multiprocessing.Queue, because the latter doesn't seem to guarantee immediate availability of items after a put:
>>> from multiprocessing import Queue
>>> q = Queue()
>>> q.put(1); q.get_nowait()
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/multiprocessing/queues.py", line 152, in get_nowait
    return self.get(False)
  File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/multiprocessing/queues.py", line 134, in get
    raise Empty
Queue.Empty
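
For contrast, the same sequence with the stdlib Queue.Queue (the class this commit switches to; Python 2 spelling) hands the item back immediately:
>>> from Queue import Queue
>>> q = Queue()
>>> q.put(1); q.get_nowait()
1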

* Remove bufsize from client and conn, since they're not actually enforced
Notes:
This commit changes behavior a bit by raising a BufferUnderflowError, rather than a ConnectionError, when no data is received for the message size. Since bufsize is not actually enforced at the socket level but is used by the consumer when creating requests, it moves there until a better solution is implemented.
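
A minimal sketch of the guaranteed-length socket read described here; the helper name is illustrative, and BufferUnderflowError stands in for the project's exception:

    def read_exactly(sock, num_bytes):
        # A bare recv() may legally return fewer bytes than asked for,
        # so loop until exactly num_bytes have arrived.
        chunks = []
        remaining = num_bytes
        while remaining > 0:
            chunk = sock.recv(remaining)
            if not chunk:  # peer closed the socket mid-message
                raise BufferUnderflowError("%d bytes short of expected size"
                                           % remaining)
            chunks.append(chunk)
            remaining -= len(chunk)
        return b''.join(chunks)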

Enable absolute imports for modules using Queue.
When running on Linux with code on a case-insensitive file system, imports of the `Queue` module fail because Python resolves the wrong file (it tries a relative import of `queue.py` in the kafka directory). This change forces absolute imports via PEP 328.
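
The PEP 328 mechanism in question: with this future import at the top of a module, `Queue` can only resolve to the standard library module, never to a sibling queue.py.

    from __future__ import absolute_import

    # Now guaranteed to be the stdlib module (Python 2 spelling),
    # even on a case-insensitive file system with kafka/queue.py present.
    from Queue import Empty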

Set FetchRequest MaxBytes value to bufsize instead of fetchsize (=MinBytes)

…Also move the exceptions to common instead of util

Related to #42
Adds a new ConsumerFetchSizeTooSmall exception that is thrown when `_decode_message_set_iter` gets a BufferUnderflowError but has not yet yielded a message.
In this event, SimpleConsumer will increase the fetch size by 1.5x and continue the fetching loop while _not_ increasing the offset (basically just retrying the request with a larger fetch size).
Once the consumer fetch size has been increased, it will remain increased while SimpleConsumer fetches from that partition.
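
A sketch of that retry loop with illustrative names (buffer_size and max_buffer_size follow the optional upper limit mentioned earlier; _fetch_from is an assumed helper, and the code lives inside a generator method):

    fetch_size = self.buffer_size
    while True:
        try:
            for message in self._fetch_from(partition, offset, fetch_size):
                yield message
            break
        except ConsumerFetchSizeTooSmall:
            if self.max_buffer_size and fetch_size >= self.max_buffer_size:
                raise  # already at the cap; refuse to grow without bound
            # Retry the same offset with a buffer 1.5x larger; the
            # increased size then sticks for this partition.
            fetch_size = int(fetch_size * 1.5)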

…Was hard-coded to 1024 bytes, which meant that larger messages were unconsumable, since they would always get split, causing the consumer to stop.
It would probably be best to automatically retry truncated messages with a larger request size so you don't have to know your max message size ahead of time.

In the current patch, get_messages(count=1) would return zero messages the first time it is invoked after a consumer is initialized.

This was hidden because of another bug in offset management.

Conflicts:
	kafka/consumer.py

The previous commit optimized the commit thread so that the timer started only when there were messages to be consumed. This commit goes a step further and ensures the following:
* Only one timer thread is created
* The main app does not block on exit (waiting for the timer thread to finish)
This is ensured by having a single thread that blocks on an event and keeps calling a function. We use events instead of time.sleep() to prevent the Python interpreter from waking every 50ms to check whether the timer has expired (logic copied from threading.Timer).
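
A minimal sketch of that pattern; the class and attribute names are illustrative, not the project's:

    import threading

    class RepeatingTimer(threading.Thread):
        def __init__(self, interval, fn):
            threading.Thread.__init__(self)
            self.daemon = True           # main app never blocks on exit
            self.interval = interval     # seconds between calls
            self.fn = fn
            self._stopped = threading.Event()

        def run(self):
            while True:
                # Block on the event: set() wakes us immediately,
                # while a timeout simply means it is time to fire.
                self._stopped.wait(self.interval)
                if self._stopped.is_set():
                    return
                self.fn()

        def stop(self):
            self._stopped.set()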

If there are no messages being consumed, the timer keeps creating new threads at the specified intervals. This may not be necessary. We can control this behaviour so that the timer thread is started only when a message is consumed.

Other changes:
* Put a message size restriction on the shared queue, to prevent message overload
* Wait for a while after each process is started (in the constructor)
* Wait for a while in each child if the consumer does not return any messages, just to be nice to the CPU
* Control the start event more granularly - this prevents infinite loops if control does not return to the generator (see the sketch after this list), e.g.:
      for msg in consumer:
          assert False
* Update message status before yield
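
A sketch of the child loop those points describe; the queue bound, sleep interval, and the get_messages() call shape are all assumptions:

    import time
    from multiprocessing import Event, Queue

    queue = Queue(maxsize=1024)  # size restriction: caps pending messages
    start = Event()              # granular start control

    def _child_loop(consumer, queue, start):
        while True:
            start.wait()         # consume only while the parent says go
            messages = consumer.get_messages(block=False)
            if not messages:
                time.sleep(0.1)  # idle child: be nice to the CPU
                continue
            for message in messages:
                queue.put(message)  # blocks when the bounded queue is full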