What is Kafka’s batch size?

Kafka producers will buffer unsent records for each partition. These buffers are of a size specified by the batch.size config. You can achieve higher throughput by increasing the batch size, but there is a trade-off between more batching and increased end-to-end latency. The larger your batch size, the more cumulative time messages will spend waiting in the send buffer. So, you want to batch to increase throughput, but you don’t want to batch too much lest you cause unwanted latency.

The effects of batching on throughput and latency are really well illustrated by this blog: http://blog.l1x.me/post/2015/03/02/high-performance-kafka-for-analytics.html, but that blog only looks at message sizes up to 1KB. What happens after for message larger than 1KB? Let me show you…

What’s wrong with using small batch sizes?

Batching increases latency because the producer will delay sending a message until it fills its send buffer (or the linger.ms timer expires). However, larger messages seem to be disproportionately delayed by small batch sizes. In the following graph, I measured end-to-end latency for a wide range of message sizes using a batch size of 16KB. The step up in latency is due to the batch size being too small.

latency-batch-16kb

When I increased batch size to 32KB, end-to-end latency was much improved, as shown below:

latency-batch-32kb

Conclusion:

If you’re sending large messages in Kafka, you might be surprised to find how much you can improve performance simply by increasing you producer batch size.