Dissecting the Disruptor: Writing to the ring buffer

July 04, 2011

This is the missing piece in the end-to-end view of the Disruptor. Brace yourselves, it's quite long. But I decided to keep it in a single post so you could have the context in one place.

The important areas are: not wrapping the ring; informing the consumers; batching for producers; and how multiple producers work.

ProducerBarriers
The Disruptor code has interfaces and helper classes for the Consumers, but there's no interface for your producer, the thing that writes to the ring buffer. That's because nothing else needs to access your producer, only you need to know about it. However, like the consuming side, a ProducerBarrier is created by the ring buffer and your producer will use this to write to it.

Writing to the ring buffer involves a two-phase commit. First, your producer has to claim the next slot on the buffer. Then, when the producer has finished writing to the slot, it will call commit on the ProducerBarrier.

So let's look at the first bit. It sounds easy - "get me the next slot on the ring buffer". Well, from your producer's point of view it is easy. You simply call nextEntry() on the ProducerBarrier. This will return you an Entry object which is basically the next slot in the ring buffer.
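To make the two phases concrete, here is a minimal toy sketch of the claim / write / commit sequence from the producer's side. ToyEntry, ToyProducerBarrier and TwoPhaseDemo are made-up names for illustration, not the real Disruptor classes, and the sketch assumes a single producer and ignores wrapping entirely.

```java
// Toy model only: these classes are illustrative and are NOT the real Disruptor API.
final class ToyEntry {
    long sequence = -1;   // stamped by the barrier when the slot is claimed
    String value;         // whatever the producer wants to put in the slot
}

final class ToyProducerBarrier {
    private final ToyEntry[] slots;
    private long cursor = -1;    // highest committed sequence
    private long claimed = -1;   // highest claimed sequence (single producer assumed)

    ToyProducerBarrier(int size) {
        slots = new ToyEntry[size];
        for (int i = 0; i < size; i++) {
            slots[i] = new ToyEntry();   // entries are created once and reused
        }
    }

    // Phase 1: claim the next slot (no wrap check in this sketch).
    ToyEntry nextEntry() {
        long sequence = ++claimed;
        ToyEntry entry = slots[(int) (sequence % slots.length)];
        entry.sequence = sequence;
        return entry;
    }

    // Phase 2: publish the entry by moving the cursor on to its sequence.
    void commit(ToyEntry entry) {
        cursor = entry.sequence;
    }

    long cursor() {
        return cursor;
    }
}

final class TwoPhaseDemo {
    public static void main(String[] args) {
        ToyProducerBarrier barrier = new ToyProducerBarrier(8);
        ToyEntry entry = barrier.nextEntry();   // claim
        entry.value = "hello";                  // write
        barrier.commit(entry);                  // commit
        System.out.println("cursor = " + barrier.cursor());
    }
}
```

Nothing is visible to anyone reading the cursor until commit is called, which is the whole point of splitting the write into two phases.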

The ProducerBarrier makes sure the ring buffer doesn't wrap
Under the covers, the ProducerBarrier is doing all the negotiation to figure out what the next slot is, and if you're allowed to write to it yet.

(I'm not convinced the shiny new graphics tablet is helping the clarity of my pictures, but it's fun to use).

For this illustration, we're going to assume there's only one producer writing to the ring buffer. We will deal with the intricacies of multiple producers later.

The ConsumerTrackingProducerBarrier has a list of all the Consumers that are accessing the ring buffer. Now to me this seemed a bit odd - I wouldn't expect the ProducerBarrier to know anything about the consuming side. But wait, there is a reason. Because we don't want the "conflation of concerns" a queue has (it has to track the head and tail which are sometimes the same point), our consumers are responsible for knowing which sequence number they're up to, not the ring buffer. So, if we want to make sure we don't wrap the buffer, we need to check where the consumers have got to.

In the diagram above, one Consumer is happily at the same point as the highest sequence number (12, highlighted in red/pink). The second Consumer is a bit behind - maybe it's doing I/O operations or something - and it's at sequence number 3. Therefore consumer 2 has the whole length of the buffer to go before it catches up with consumer 1.

The producer wants to write to the slot on the ring buffer currently occupied by sequence 3, because this slot is the one after the current ring buffer cursor. But the ProducerBarrier knows it can't write here because a Consumer is using it. So the ProducerBarrier sits and spins, waiting, until the consumers move on.
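The check itself is only a comparison. Here is a rough sketch of it, assuming a ring of 10 slots (which is what the example implies, since sequence 13 lands in the slot holding sequence 3) and assuming, as in this example, that a consumer "at" a sequence is still using that slot. WrapCheck and its method names are hypothetical, and the real code may well count consumer positions differently.

```java
final class WrapCheck {
    // Lowest sequence that any consumer is still working on.
    static long slowest(long[] consumerSequences) {
        long min = Long.MAX_VALUE;
        for (long sequence : consumerSequences) {
            min = Math.min(min, sequence);
        }
        return min;
    }

    // Sequence `next` reuses the slot that held (next - bufferSize).
    // That is only safe once the slowest consumer has moved past the old occupant.
    static boolean canClaim(long next, int bufferSize, long[] consumerSequences) {
        return next - bufferSize < slowest(consumerSequences);
    }

    public static void main(String[] args) {
        // Consumer 1 is at 12, consumer 2 is stuck on 3: the producer has to wait.
        System.out.println(canClaim(13, 10, new long[] {12, 3})); // prints false
        // Once consumer 2 has moved on to 9, the slot is free.
        System.out.println(canClaim(13, 10, new long[] {12, 9})); // prints true
    }
}
```

The spinning described above is then just a loop that keeps calling something like canClaim until it returns true.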

Claiming the next slot
Now imagine consumer 2 has finished that batch of entries, and moves its sequence number on. Maybe it got as far as sequence 9 (in real life I expect it will make it as far as 12 because of the way consumer batching works, but that doesn't make the example as interesting).

The diagram above shows what happens when consumer 2 updates to sequence number 9. I've slimmed down the ConsumerBarrier in this picture because it takes no active part in this scene.

The ProducerBarrier sees that the next slot, the one that had sequence number 3, is now available. It grabs the Entry that sits in this slot (I've not talked specifically about the Entry class, but it's basically a bucket for stuff you want to put into the ring buffer slot which has a sequence number), sets the sequence number on the Entry to the next sequence number (13) and returns this entry to your producer. The producer can then write whatever value it wants into this Entry.
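As a small sketch of that step: the slot is found from the sequence number, and the Entry already sitting there is restamped rather than replaced. SlotEntry and SlotClaim are invented names, and the plain modulo assumes the 10-slot ring from the example. As far as I know the real ring buffer insists on a power-of-two size so it can use a bit mask instead, but treat that as an aside rather than something this sketch shows.

```java
final class SlotEntry {
    long sequence;    // sequence number currently stored in this slot
    Object payload;   // the "stuff" the producer puts in the bucket

    SlotEntry(long sequence) {
        this.sequence = sequence;
    }
}

final class SlotClaim {
    // Which slot a sequence number lives in.
    static int slotFor(long sequence, int bufferSize) {
        return (int) (sequence % bufferSize);
    }

    // Reuse the Entry object that is already in the slot and stamp the new sequence on it.
    static SlotEntry claim(SlotEntry[] ring, long nextSequence) {
        SlotEntry entry = ring[slotFor(nextSequence, ring.length)];
        entry.sequence = nextSequence;
        return entry;
    }
}
```

With a 10-slot ring, claim(ring, 13) hands back the very same object that used to carry sequence 3, now carrying sequence 13.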

Committing the new value
The second phase of the two-phase commit is, well, the commit.

The green represents our newly updated Entry with sequence 13 - yeah, I'm sorry, I'm red-green colour-blind too. But other colours were even more rubbish.

When the producer has finished writing stuff into the entry it tells the ProducerBarrier to commit it.

The ProducerBarrier waits for the ring buffer cursor to catch up to where we are (for a single producer this will always be a bit pointless - e.g. we know the cursor is already at 12, nothing else is writing to the ring buffer). Then the ProducerBarrier updates the ring buffer cursor to the sequence number on the updated Entry - 13 in our case. Next, the ProducerBarrier lets the consumers know there's something new in the buffer. It does this by poking the WaitStrategy on the ConsumerBarrier - "Oi, wake up! Something happened!" (note - different WaitStrategy implementations deal with this in different ways, depending upon whether it's blocking or not).
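Those three steps (wait for the cursor, move the cursor, poke the consumers) can be sketched as below. ToyWaitStrategy and ToyCommitter are hypothetical stand-ins: the interface has just one made-up method, signalAll, and the spin uses Thread.yield() purely to keep the example short.

```java
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical stand-in for the WaitStrategy: all this sketch needs is a way to wake consumers.
interface ToyWaitStrategy {
    void signalAll();
}

final class ToyCommitter {
    private final AtomicLong cursor;
    private final ToyWaitStrategy waitStrategy;

    ToyCommitter(long initialCursor, ToyWaitStrategy waitStrategy) {
        this.cursor = new AtomicLong(initialCursor);
        this.waitStrategy = waitStrategy;
    }

    void commit(long sequence) {
        // 1. Wait until everything before this sequence has been committed.
        while (cursor.get() != sequence - 1) {
            Thread.yield();
        }
        // 2. Move the cursor on to the newly written entry.
        cursor.set(sequence);
        // 3. Let the consumers know there is something new.
        waitStrategy.signalAll();
    }

    long cursor() {
        return cursor.get();
    }

    public static void main(String[] args) {
        ToyCommitter committer = new ToyCommitter(12, () -> System.out.println("Oi, wake up!"));
        committer.commit(13);
        System.out.println("cursor = " + committer.cursor());
    }
}
```

With a single producer the loop in step 1 never actually spins, because the cursor is always exactly one behind the sequence being committed.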

Now consumer 1 can get entry 13, consumer 2 can get everything up to and including 13, and they all live happily ever after.

ProducerBarrier batching
Interestingly, the Disruptor can batch on the producer side as well as on the Consumer side. Remember when consumer 2 finally got with the programme and found itself at sequence 9? There is a very cunning thing the ProducerBarrier can do here - it knows the size of the buffer, and it knows where the slowest Consumer is. So it can figure out which slots are now available.

If the ProducerBarrier knows the ring buffer cursor is at 12, and the slowest Consumer is at 9, it can let producers write to slots 3, 4, 5, 6, 7 and 8 before it needs to check where the consumers are.
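The arithmetic behind that can be sketched in a few lines, again assuming the 10-slot ring implied by the example and a consumer that is still using the slot it is "at". ProducerBatchSketch and its methods are names I've made up for the illustration.

```java
import java.util.ArrayList;
import java.util.List;

final class ProducerBatchSketch {
    // Highest sequence that can be claimed before the consumer positions need re-reading.
    static long highestClaimable(long slowestConsumer, int bufferSize) {
        return slowestConsumer + bufferSize - 1;
    }

    // Slots the producer may write to, given the cursor and the slowest consumer.
    static List<Integer> freeSlots(long cursor, long slowestConsumer, int bufferSize) {
        List<Integer> slots = new ArrayList<>();
        long highest = highestClaimable(slowestConsumer, bufferSize);
        for (long sequence = cursor + 1; sequence <= highest; sequence++) {
            slots.add((int) (sequence % bufferSize));
        }
        return slots;
    }

    public static void main(String[] args) {
        System.out.println(freeSlots(12, 9, 10)); // prints [3, 4, 5, 6, 7, 8]
    }
}
```

So sequences 13 to 18 can be handed out on the strength of a single look at the consumers, and the look only needs repeating for sequence 19.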

Multiple producers
You thought I was done, but there's more.

I slightly lied in some of the above drawings. I implied that the sequence number the ProducerBarrier deals with comes directly from the ring buffer's cursor. However, if you look at the code you'll see that it uses the ClaimStrategy to get this. I skipped this to simplify the diagrams; it's not so important in the single-producer case.

With multiple producers, you need yet another thing tracking a sequence number. This is the sequence that is available for writing to. Note that this is not the same as ring-buffer-cursor-plus-one - if you have more than one producer writing to the buffer, it's possible there are entries in the process of being written that haven't been committed yet.

Let's revisit claiming a slot. Each producer asks the ClaimStrategy for the next available slot. Producer 1 gets sequence 13, like in the single producer case above. Producer 2 gets sequence 14, even though the ring buffer cursor is still only pointing to 12, because the ClaimStrategy is dishing out the numbers and has been keeping track of what's been allocated.
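A counter that is separate from the cursor is all this needs. The sketch below uses an AtomicLong so that two producers can never be handed the same number. ToyClaimStrategy and claimNext are hypothetical names, and this is only one plausible way to do it, not necessarily what the real ClaimStrategy implementations do.

```java
import java.util.concurrent.atomic.AtomicLong;

final class ToyClaimStrategy {
    // Highest sequence handed out so far. This is NOT the ring buffer cursor.
    private final AtomicLong lastClaimed;

    ToyClaimStrategy(long lastClaimed) {
        this.lastClaimed = new AtomicLong(lastClaimed);
    }

    // Atomically hand out the next sequence number.
    long claimNext() {
        return lastClaimed.incrementAndGet();
    }

    public static void main(String[] args) {
        ToyClaimStrategy claims = new ToyClaimStrategy(12);
        System.out.println("producer 1 gets " + claims.claimNext());
        System.out.println("producer 2 gets " + claims.claimNext());
    }
}
```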

So each producer has its own slot with a shiny new sequence number.

I'm going to colour producer 1 and its slot in green, and producer 2 and its slot in a suspiciously pink-looking purple.

Now imagine producer 1 is away with the fairies, and hasn't got around to committing for whatever reason. Producer 2 is ready to commit, and asks the ProducerBarrier to do so.

As we saw in the earlier commit diagram, the ProducerBarrier is only going to commit when the ring buffer cursor reaches the slot behind the one it wants to commit into. In this case, the cursor needs to reach 13 so that we can commit 14. But we can't, because producer 1 is staring at something shiny and hasn't committed yet. So the ClaimStrategy sits there spinning until the ring buffer cursor gets to where it should be.

Now producer 1 wakes up from its coma and asks to commit entry 13 (green arrows are sparked by the request from producer 1). The ProducerBarrier tells the ClaimStrategy to wait for the ring buffer cursor to get to 12, which it already had of course. So the ring buffer cursor is incremented to 13, and the ProducerBarrier pokes the WaitStrategy to let everything know the ring buffer was updated. Now the ProducerBarrier can finish the request from producer 2, increment the ring buffer cursor to 14, and let everyone know that we're done.

You'll see that the ring buffer retains the ordering implied by the order of the initial nextEntry() calls, even if the producers finish writing at different times. It also means that if a producer is causing a pause in writing to the ring buffer, when it unblocks any other pending commits can happen immediately.
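Here is a small runnable sketch of that scenario, with producer 2 trying to commit 14 before producer 1 has committed 13. OrderedCommitDemo is an invented class, the 50 ms sleep just stands in for producer 1 being distracted, and the spin with Thread.yield() is a simplification.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.concurrent.atomic.AtomicLong;

final class OrderedCommitDemo {
    private final AtomicLong cursor;
    private final List<Long> published = new CopyOnWriteArrayList<>();

    OrderedCommitDemo(long initialCursor) {
        this.cursor = new AtomicLong(initialCursor);
    }

    void commit(long sequence) {
        // Spin until the cursor is directly behind the sequence being committed.
        while (cursor.get() != sequence - 1) {
            Thread.yield();
        }
        published.add(sequence);   // record the order in which commits complete
        cursor.set(sequence);
    }

    static List<Long> run() {
        OrderedCommitDemo ring = new OrderedCommitDemo(12);
        Thread producer2 = new Thread(() -> ring.commit(14));
        Thread producer1 = new Thread(() -> ring.commit(13));
        try {
            producer2.start();     // producer 2 asks to commit first...
            Thread.sleep(50);      // ...while producer 1 stares at something shiny
            producer1.start();
            producer1.join();
            producer2.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return new ArrayList<>(ring.published);
    }

    public static void main(String[] args) {
        System.out.println(run()); // prints [13, 14]
    }
}
```

Even though producer 2 asked first, 13 is always published before 14, and 14 follows as soon as 13 is in.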

Phew. And I managed to describe all that without mentioning a memory barrier once.

EDIT: The most recent version of the RingBuffer hides away the ProducerBarrier. If you can't see a ProducerBarrier in the code you're looking at, then assume where I say "producer barrier" I mean "ring buffer".

EDIT 2: Note that version 2.0 of the Disruptor uses different names to the ones in this article. Please see my summary of the changes if you are confused about class names.

Comments

Anonymous, 27 July 2011 at 01:50
What happens if, in the two producer scenario, producer 1 fails to commit to the buffer? Is producer 2 blocked from ever completing the commit? Does the ring buffer suffer an effective deadlock at that point?

Trisha, 27 July 2011 at 10:44
The producers need to manage this scenario carefully, as they do need to be aware that they are blocking other producers.

If one of the producers fails to commit because the producer is broken, your whole system has bigger problems than deadlock. If, however, it fails to commit because the transaction it was trying to complete failed, two options spring to mind: 1) retry until the commit is successful (the other producer will block until success) or 2) commit a "dead message" to the ring buffer so that the blocked producers can continue, and the consumers will ignore the dead message while still allowing the sequence numbers to increment in the expected fashion.
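The "dead message" option might look something like the sketch below. Everything in it is hypothetical (the Message class, the dead flag, the fill and consume helpers). The idea is only that a failed write still produces an entry that can be committed, and that consumers skip it.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Supplier;

final class DeadMessageSketch {
    static final class Message {
        long sequence;
        String payload;
        boolean dead;   // hypothetical flag: consumers ignore entries marked dead
    }

    // Fill a claimed entry. If the work fails, mark the entry dead so it can still be
    // committed and the producers queued up behind it are not blocked.
    static void fill(Message entry, Supplier<String> source) {
        try {
            entry.payload = source.get();
            entry.dead = false;
        } catch (RuntimeException e) {
            entry.payload = null;
            entry.dead = true;
        }
    }

    // Consumer side: every sequence is still seen in order, dead entries are skipped.
    static List<String> consume(List<Message> committed) {
        List<String> handled = new ArrayList<>();
        for (Message message : committed) {
            if (!message.dead) {
                handled.add(message.payload);
            }
        }
        return handled;
    }
}
```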

Anonymous, 27 July 2011 at 16:59
Thank you for the quick response. One other question on the disruptor: do you have strategies for handling either slow consumers or slow producers (in the multi-producer scenario)?

On a side note, it would be great to hear about other architectural aspects of LMAX such as your approach to high availability, data persistence, work distribution among multiple processing units, etc.

Trisha, 27 July 2011 at 17:32
Martin Fowler's article gives a bigger context of the LMAX architecture, but we're not going to give away all our secrets yet ;)

http://martinfowler.com/articles/lmax.html

If your consumers are consistently slower than your producers, your system is always going to back up regardless of your architecture. One of the ways you might handle this is to have two consumers doing the same thing, but one processes even sequence numbers and one processes odd ones. That way you could potentially process more in parallel (of course you can expand beyond two).

If your producers are slow, then it might point to your design doing too many things in one place. Our producers are usually simple things - they take data from one place and stick it in the ring buffer. If your producers are doing a lot of processing, you could think about moving that logic into an early consumer. Your producer could write the raw data into the ring buffer, the new consumer could read the raw data off the ring buffer, process it, write the processed data into the Entry, and have all downstream consumers dependent upon this consumer. That might suggest ways you can parallelise this work.
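As a rough sketch of the odd/even split mentioned above: each consumer sees every sequence number but only acts on its own share. ShardedConsumer and onEntry are made-up names for the illustration, not part of the Disruptor.

```java
import java.util.ArrayList;
import java.util.List;

final class ShardedConsumer {
    private final int index;   // which share this consumer owns (0 = evens when count is 2)
    private final int count;   // how many consumers share the work
    final List<Long> handled = new ArrayList<>();

    ShardedConsumer(int index, int count) {
        this.index = index;
        this.count = count;
    }

    // Called for every entry. Only sequences belonging to this shard are processed.
    void onEntry(long sequence) {
        if (sequence % count == index) {
            handled.add(sequence);
        }
    }

    public static void main(String[] args) {
        ShardedConsumer evens = new ShardedConsumer(0, 2);
        ShardedConsumer odds = new ShardedConsumer(1, 2);
        for (long sequence = 0; sequence < 6; sequence++) {
            evens.onEntry(sequence);
            odds.onEntry(sequence);
        }
        System.out.println(evens.handled); // prints [0, 2, 4]
        System.out.println(odds.handled);  // prints [1, 3, 5]
    }
}
```

Expanding beyond two is just a matter of changing count.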

Trisha, 5 August 2011 at 09:23
Since the producer is the one that knows whether the buffer is full, and since the producer is managing writing to it, it should be easy - implementing the producer is totally up to you, so you can decide exactly how to deal with the case when the ring buffer is full.

Trisha, 10 August 2011 at 10:20
If I understand you correctly, what you're saying is that if you had a consumer which was doing asynchronous logging, but it was logging much slower than the producer was producing, the producer would end up blocking, waiting for the logger to finish before it can publish to the ring buffer?

Yes, that's true. If you have any consumer which runs slower than the producer, eventually everything will be hanging around waiting for that consumer. If you don't want to block the ring buffer, I guess there's a few things you could try - have your consumer representing the async logger punt the data off to somewhere else to deal with it (maybe a second disruptor with a *massive* ring buffer, or some sort of logging service that won't block this consumer), or parallelise the logging, so one consumer logs the odds, the other the evens (or mod 4, or whatever). Then you just need to write a simple mechanism to weave the logs back together when/if you need to read them (obviously you want your different loggers logging to different files to avoid contention).

Shane Tolmie, 10 August 2011 at 13:54
Brilliant article, you've explained the concepts brilliantly. We're thinking of including Disruptor in our next project.

Trisha, 10 August 2011 at 16:30
@Shane Awesome, let me know how it goes!

siky, 4 February 2012 at 16:24
Hi,
I am new to the Disruptor framework. I have a scenario where I need 100 producers and 200 consumers. Can the Disruptor framework be used in this case?

Trisha, 4 February 2012 at 16:26
Hi siky,

Sure, there's nothing to stop you doing that. But you're going to want to check your "wait" settings to make sure you've got the best configuration. To get the most speed out of the Disruptor, though, you want a CPU core per producer/consumer. If that's what you're after, you're going to have to shard your problem more carefully so you can split it over multiple machines.

itsMyTime, 14 March 2012 at 13:43
Hi Trisha,
Thank you for the great article.
Assume that I transfer messages between two different modules.
Currently I use JMS: I have a producer that sends messages to a queue and consumers that pull them off the queue.

Is the Disruptor meant to be a replacement for that configuration? For what type of architecture would you find it suitable?

thanks.

Unni Vemanchery Mana, 29 March 2012 at 07:33
Hi Trisha,
This is really a great way of explaining this pattern. I appreciate this. I got a feel for what the Disruptor pattern is and what it tries to do. I have not gone through the source code nor used it in any projects, so I have a couple of questions before using it.

Is the entire Disruptor pattern implemented using an array?

In the case of sequencing, why always keep incrementing the number? And what happens to the array index?

Unlike traditional circular rings, here you are updating the slots, not inserting a new one with the message. In that case, how will the buffer overflow happen? You are never going to increase the initial capacity of the RingBuffer, if I understand it correctly.

Can we have our own implementation of ProducerBarrier and plug it in to this framework?

Xeus, 2 May 2012 at 00:31
I don't understand for what you need the cursor? As you anyway have a sequence number in each Entry, you could set the initial value of the sequence number to 0x8000000000000000L (if the ring buffer is created) and then each producer must update the sequence number of an allocated entry after it is done with producing, so that the consumers just need to check for a valid sequence number instead to wait for the cursor. This allows the consumer to skip a not yet ready entry and to get back to it later, so that the consumer can do some useful work while a producer is creating an entry. Especially helpful if you have multiple producers and consumers.

So an entry is valid if its sequence number is greater than or equal to next minus the ring buffer size (isValid = entry.seqNumber >= (ringbuffer.next - ringbuffer.length)).

Additionally you don't need the CAS operation to the cursor anymore.

Prasenjit, 22 May 2012 at 07:06
Hi Trisha,

I am new to Disruptor. This article is really helpful for beginners. I have the following two queries:

Q1. I got a code example of one producer to one consumer(http://www.kotancode.com/2012/01/06/hello-disruptor/) and

one producer to multiple dependent consumers(http://mechanitis.blogspot.in/2011/07/dissecting-disruptor-wiring-up.html),

I like to get a code sample for multiple producers to multiple consumers or a Sequencer: 3P – 1C. Can you please refer to me any blogs or code sample?

Q2. This is a generic question regarding how multiple producer works.

Can the Disruptor be used in an environment where a single file/variable is updated by multiple producers? For example, there are two producers (P1, P2) which update a single shared variable (named "count").

Initially the “count” value is 0.

Producer P1 will add 1 with the "count" current value. So after producer P1 processed, the value of count will be (0+1) = 1.

Producer P2 will add 2 with the "count" current value, So after producer P2 processed, the value of count will be (1+2) = 3.

Basically, P2 needs to read the updated "count" value (done by P1) and add the incremented value(2).

How can we maintain the order of execution of the producers? (P2 will always execute after P1.)

At the consumer side, consumers (C1,C2) will read the "count" value as sequentially(1,3,.,.,.). This is ok, as in ring buffer, each consumer will read the ring buffer value in sequential order only.

Thanks,
Prasenjit.

Arjuns Gab, 2 June 2012 at 13:29
Just managed to wrap my head around these lil rings...

From what I understand, the ring buffers are perhaps meant to serve multi-threaded consumer scenarios for IPC (inter-process communication)... but is there any chance the rings can be very large, to serve and replace JMS implementations in the future?

Conceptually, can these ring buffers proxy traditional JMS queues in behaviour, with the added advantage of being able to fetch messages in batches?

Also, I'm told that the ideal ring size depends on the amount of CPU power available. I can't locate the blog for that... I would like to be able to compute that.

Curious.
tx

TeckTalk, 17 June 2012 at 20:02
Hi Trisha,
Is there any place where all the WaitStrategy and ClaimStrategy have been defined in detail and how one can identify the right strategy?
Also if my system is processing 10000 ticks per second then what should be the ideal size of my ring buffer?

MusiKeO, 30 January 2014 at 02:44
Hi Trisha,

I was wondering...for a web server dealing with multiple http-threads should we use multiple producers ? And if yes should we use 1 producer per http-thread ?

By the way... Thank you for your article it is an awesome work :) !

Unknown, 20 June 2014 at 12:09
Hi, Do we have benchmarks in which we have very high number of consumers (1P - 200C) ?

Unknown, 4 February 2015 at 06:54
Hi,

If the consumer's event processing speed is always slower than the publisher's, how can it keep up?
For example: ring size is 2048, publishing speed is 2048/s, event processing is 1024/s.

Dolazy, 19 June 2015 at 01:19
Can a large ring buffer cause problems like dirtying many cache lines, thrashing, etc.? I.e. should the ring buffer be as small as possible?

Unknown, 1 July 2015 at 11:48
Hi Trisha, I have read your article above and I want to ask:
I have a listener (with a thread pool) which listens to a queue, and for each message I put it into the ring buffer, which has a handler with multiple consumers.
Therefore, I conclude that I have to scale the listener's thread pool, the handler's thread pool and the ring buffer's size.
Am I right?

Now, if the producer is faster than the consumer, you said there will be blocking operations. Does that mean the producer thread will just wait around until there is a free slot? If yes, that's bad for the listener thread pool (in my case), right?
But I looked at the code and the tryNext() method can throw an insufficient exception (or something like that :D), so how exactly should I deal with this case? If I should catch the exception, what should I do with the event it holds?

Thank you very much

Trisha's Ramblings