[Update 10/16]

What is the fundamental performance-limiting factor that has dominated the last 30 years of computer architecture? The obvious answer is the disparity between processor and memory/storage speed. We connect processors to cache, to more cache, to even more cache, to some bus, to memory, to disk/network, suffering costs at each step as we try to keep the damn processor busy. My guess is that memory locality is totally different now than it was back when virtual memory systems were first designed – much less pronounced. My short review of the literature shows little research since the 1980s on, for example, the memory locality of databases or operating systems. (Is that wrong?)

But we do know enough to understand why the entire concept of distributed shared memory is wrong, although there ought to be a catchy name for this category of error. The error is to notice that some feature of computer hardware makes some kind of software engineering difficult, and then to provide a software mechanism that makes that kind of engineering impossible. The complexity of the memory hierarchy, and the difficulty of making locality work, pose enormous challenges for software designers. The effect of distributed shared memory is to make it impossible for software designers to control the cache performance of their programs, because it hides caching behavior under a non-deterministic, opaque layer.

(see comment by John Regehr)
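To make the locality problem concrete, here is a minimal C sketch (my illustration, not from the post): summing the same row-major matrix in row order and in column order does identical arithmetic, but the strided walk defeats the cache and typically runs several times slower.

    #include <stdio.h>
    #include <stdlib.h>
    #include <time.h>

    #define N 2048

    /* Same arithmetic, different locality: the row-order walk touches
       memory sequentially and rides the cache; the column-order walk
       strides by N doubles and misses constantly. */

    static double sum_rows(const double *a) {
        double s = 0.0;
        for (int i = 0; i < N; i++)
            for (int j = 0; j < N; j++)
                s += a[i * N + j];   /* sequential, cache-friendly */
        return s;
    }

    static double sum_cols(const double *a) {
        double s = 0.0;
        for (int j = 0; j < N; j++)
            for (int i = 0; i < N; i++)
                s += a[i * N + j];   /* strided, cache-hostile */
        return s;
    }

    int main(void) {
        double *a = malloc((size_t)N * N * sizeof *a);
        if (!a) return 1;
        for (size_t k = 0; k < (size_t)N * N; k++) a[k] = 1.0;

        clock_t t0 = clock();
        double s1 = sum_rows(a);
        clock_t t1 = clock();
        double s2 = sum_cols(a);
        clock_t t2 = clock();

        printf("row order:    %.3f s (sum %.0f)\n",
               (double)(t1 - t0) / CLOCKS_PER_SEC, s1);
        printf("column order: %.3f s (sum %.0f)\n",
               (double)(t2 - t1) / CLOCKS_PER_SEC, s2);
        free(a);
        return 0;
    }

A DSM layer faces the same problem, except the miss penalty is a network round trip rather than a DRAM fetch, and the programmer has even less visibility into when it is paid.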


One thought on “Distributed shared memory from first principles”

  • John Regehr, October 15, 2007 at 11:42 am

    I’d say it just a little differently…

    If you can raise the level of abstraction for developers by shoving one or two orders of magnitude of performance difference under the rug, maybe this is OK for most programmers. For example, explicit cache memory management is not a good idea for general purpose computing.

    On the other hand if you attempt to raise the level of abstraction even more by shoving six or more orders of magnitude of performance difference under the rug, as DSM does, then it becomes impossible in practice to create code with acceptable performance.

    So there is a continuum between control and abstraction containing both good and bad design points. It is not hard to see that when performance is important, DSM is a poor design point.
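To put rough numbers on the commenter's continuum (the latencies below are my assumptions for illustration: roughly 1 ns for a cache hit, 100 ns for DRAM, and 1 ms for a remote DSM fetch), the standard effective-access-time formula shows why two hidden orders of magnitude are tolerable and six are not:

    #include <stdio.h>

    /* Effective access time: T = h * t_hit + (1 - h) * t_miss.
       Assumed latencies, for illustration only:
         cache hit ~ 1 ns, DRAM ~ 100 ns (2 orders of magnitude),
         remote DSM fetch ~ 1 ms = 1,000,000 ns (6 orders). */

    static double effective_ns(double h, double t_hit, double t_miss) {
        return h * t_hit + (1.0 - h) * t_miss;
    }

    int main(void) {
        /* 99% hits over DRAM: ~2 ns average; the leak is tolerable. */
        printf("cache + DRAM, 99%% hits: %8.1f ns\n",
               effective_ns(0.99, 1.0, 100.0));

        /* 99% hits over DSM: ~10,000 ns average; the hidden six
           orders of magnitude dominate every access. */
        printf("cache + DSM,  99%% hits: %8.1f ns\n",
               effective_ns(0.99, 1.0, 1e6));
        return 0;
    }

Even a 99.99% hit rate over DSM still averages about 100 ns per access, which is no better than simply missing to local DRAM, and the hit rate is exactly what the opaque layer prevents the programmer from engineering.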
