Chapter 14: Concurrency

Where you will harness the power of modern computing.

Concurrency is one of the most important subjects in modern programming because of the pressing need to improve the efficiency of programs; coupled with the troublesome limitation that increasing the speed of individual CPUs is becoming harder and harder.

The solution, at the moment, is to increase the number of CPUs in the machine to allow programmers to run their code in parallel.

The problem is that new models of computation which take concurrency into account have had to be developed to address this need, but nobody knows yet which of the many alternatives is the right one, or if there is even a right one at all.

Until now, most programmers just thought about their programs in a purely sequential manner; and, as we explore some of these concurrency models, you'll notice that they try to restore some of this feeling, while at the same time scheduling work to happen concurrently, without (almost) any programmer involvement.

Lux takes the approach of providing the means to use multiple concurrency models, since I couldn't decide on which was the right one.

Some languages (like Erlang and Go) choose to commit themselves to one model or another, but I'm not confident that the software industry (as a whole) is experienced enough with concurrency as to declare any one model the winner.

The result: more variety.

And watch out, because the amount of concurrency models may increase with future Lux releases.

Anyhow, let's quit the chit-chat and dive in!

Promises

This is my favorite one, because it can be used for almost anything, whereas I see the other modules as more specialized tools for certain use cases.

This model is based on concepts you may be familiar with: futures and promises.

Futures are basically concurrent and asynchronous computations which run and yield a value, which you can later access.

Promises are more like storage locations (which may be set only once), which you can use for communication by setting their values in one process, and reading it from another.

Some languages offer both (often used in conjunction), while others only offer one (while also kind of giving it properties of the other).

I pretty much came to the conclusion that, for all intents and purposes, their similarities were much greater than their differences.

So, I just fused them.

And so, Lux implements promises in the lux/concurrency/promise module, by means of the Promise type.

You can run IO computations concurrently using the future function (which returns a Promise that will contain the result of the computation).

You can also bind functions to Promise values in order to be notified when they have been resolved (the term for when the value of the Promise is set).

By means of this ability to watch Promise values, it's possible to implement Functor, Applicative and Monad structures for Promise, which is precisely what is done in the standard library.

The result is that, through the do macro, you can implement complex concurrent and asynchronous computations that look and feel just like synchronous ones.

If you're curious about how that looks, take a peek:

(def: #export (seq left right)
  {#;doc "Sequencing combinator."}
  (All [a b] (-> (Promise a) (Promise b) (Promise [a b])))
  (do Monad<Promise>
    [a left
     b right]
    (wrap [a b])))

Oh, and did I mention there are combinators in that module?

If you didn't know there was some magic going on in the Promise type, you wouldn't have suspected this was concurrent code. It looks just like any other old synchronous code you might have use with any other monad.

Pretty neat, huh?

Functional Reactive Programming

FRP is based on the idea of values that change over time, and structuring your programs to dynamically respond to those changes in a reactive way.

The way it's implemented in Lux is through the Chan type in lux/concurrency/frp (itself implemented on top of Promise). Chan instances are (potentially infinite) sequences of values that you can process in various ways as new values come into being. Chan instances may be closed, but they may also go on forever if you'd like them to.

By the way, "chan" is just short for channel.

The lux/concurrency/frp module offers various functions for processing channels in various them (some of them generating new channels), and the Chan type also happens to be a monad, so you can write fairly complex and powerful code with it.

Software Transactional Memory

Implemented in the lux/concurrency/stm module.

STM is quite a different beast from the other 2 approaches, in that they address the problem of how do I propagate information within the system, while STM deals with how to keep data in one place, where it can be accessed and modified concurrently by multiple processes.

It works by having variables which may be read and written to, but only within transactions, which could be seen as descriptions of changes to be made to one (or more) variables in an atomic, consistent and isolated way.

Let's break down those last 3 terms:

Atomic: This just means that if more than 1 change needs to be made in a transaction, either all gets done, or none. There is no room for partial results.
Consistent: This just means that transactional computations will take the set of variables they operate from one valid state to another. This is largely a consecuence of transactions being atomic.
Isolated: This means that transactions run in isolation (or, without interference) from one another, thereby ensuring no transaction may see or modify any in-trasaction value being computed somewhere else, and they all get the impression that they are the only transaction running at any given time.

For those of you familiar with relational databases, this might remind you of their ACID properties (with the caveat that Lux STM is non-durable, as it's done entirely in memory).

The way it works is by running multiple transactions concurrently, and then committing their results to the affected variables. If 2 transactions modify any common variables, the first one to commit wins, and the second one would be re-calculated to take into account the changes to those variables. This implies that transactions are sensitive to some "version" of the variables they involve and that is correct. That is the mechanism use to avoid collisions and ensure no inconsistencies ever arise.

The relevant types are Var, which corresponds to the variables, and STM which are computations which transform transactions in some way and yield results.

Like IO and unlike Promise, just writing STM computations doesn't actually run them, and you must call the commit function to actually schedule the system to execute them (receiving a Promise value for the result of the transaction).

You may also follow variables to get channels of their values if you're interesting in tracking them.

The Actor Model

Buyer Beware: Of all the concurrency modules, this is the one most likely to change by the next release, so be careful when you use it.

The actor model is also very different from the other models in that, while they deal with computations which produce values concurrently (even if they also do other things), the actor model is all about processes running concurrently and communicating with one another.

You can't run an actor and just wait for it to finish to get the result. For all you know, it may never end and just run forever.

Also, interaction with actors is based on message passing, and an actor may consume an indefinite number of such messages (and send messages to other actors).

The relevant module is the lux/concurrency/actor module, and the relevant type is:

(type: (Actor s m)
  {#mailbox (lux/concurrency/stm;Var m)
   #kill-signal (lux/concurrency/promise;Promise lux;Unit)
   #obituary (lux/concurrency/promise;Promise [(lux;Maybe lux;Text) s (lux;List m)])})

Actors have mailboxes in which they receive their messages.

By following the mailbox vars, the actors can react to all the incoming messages.

It's also possible to kill an actor (although it can also die "naturally" if it encounters a failure condition during normal execution).

And if it dies, you'll receive it's state at the time of death, a list of unconsumed messages from its mailbox and (possibly) a message detailing the cause of death.

Just from this definition, it's easy to see that actors are stateful (a necessity for modeling a variety of complex behaviors).

To create an actor, you must first create a procedure of type Proc:

(type: (Behavior s m)
  {#step (-> (Actor s m) m s (lux/concurrency/promise;Promise (lux/data/error;Error s)))
   #end (-> (lux;Maybe lux;Text) s (lux/concurrency/promise;Promise lux;Unit))})

They are pairs of functions to be run on each iteration of the actor, and when it dies (at its end).

You can then call the spawn function with an initial state and a compatible procedure.

But writing complex actors with multiple options for its messages can be messy with these tools, so a macro was made to simplify that.

"Allows defining an actor, with a set of methods that can be called on it.
 The methods can return asynchronous outputs.
 The methods can access the actor's state through the *state* variable.
 The methods can also access the actor itself through the *self* variable."
(actor: Adder
  Int

  (method: (add! [offset Int])
    [Int Int]
    (let [new-state (i.+ offset *state*)]
      (wrap (#;Success [new-state [*state* new-state]]))))

  (stop:
    (exec (log! (format "Cause of death: " (default "???" *cause*)))
      (log! (format "Current state: " (%i *state*)))
      (wrap []))))

You can have as many methods as you want, and refer to the state of the actor through the *state* variable (*self* being the variable you use to refer to the actor itself).

For every method you define, a function will be defined in your module with the same name, and taking the same arguments. That function will always take the actor itself as its last argument, and will return an Async of the return type.

You can either die with an #;Error value, or continue on to the next message with a #;Success containing an updated actor state, and a return value for the method. The type of the return value must match the type following the method signature.

stop: creates the end method and gives you access to the (possible) cause of death with the *cause* variable. It expects an (Promise Unit) return value, and the body of stop: (as well as the other methods) runs implicitly inside an Monad<Promise> do expression.

In this chapter, you have learned how to use the many tools Lux offers to tap into the multi-processor power of modern computing systems.

But if you think about it, being able to hold onto values or pass them around concurrently is rather useless unless you have some important and complex data to move around in the first place; and so far we have only dealt with fairly simple data-structures.

Well, read the next chapter if you want to learn how to take your data to the next level with the help of persistent data structures.

See you in the next chapter!

Chapter 14: Concurrency

Chapter 14: Concurrency

Promises

Functional Reactive Programming

Software Transactional Memory

The Actor Model

results matching ""

No results matching ""