Our experience of reality is based on the passage of time. Things begin and end, events happen one after the other. The world changes. And yet we model software as static object graphs frozen in time.

This sort of makes sense. While I’m used to going through my day as events in time (wake up, have breakfast, shower, really wake up, etc.), the act of understanding feels like building a graph. When I plan a new project, my mind jumps straight to forming a picture of the current structure of the system. I try to understand how the different components interact with each other, and how they relate to the business domain. I think in hierarchies.

[Figure: A relational data model for a coffee shop]

Then I use frameworks and programming paradigms that lean on this intuition. I model conceptual hierarchies as associations, and I think about domain entities as being parents or children of each other. I use statements like “a branch has many orders”, “a line item belongs to an order”.

These mental images exist in a kind of platonic universe, outside of time.

But this “timeless” view of the world is short-lived. It really only kicks in during the process of understanding a system from scratch. Past that stage, I have to deal with the fact that the world does change. The domain is not static, and neither is my understanding of it. From then on, I have to think about how the system evolves over time. We deal with refactoring, migrating, and accommodating changing requirements.

For most of a project’s life cycle, time does play a big role.

And with time as a dimension, and change as a driver, graphs can go from asset to liability.

Depth and coupling

Graphs grow deeper as the domain grows more complex. Perhaps product variants now have allowed “complements” associated with them (to validate which other products can be added to them, for example toppings and add-ons). Perhaps they have a price. Perhaps customers can have discounts that can be applied to their orders.
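A rough sketch of what those additions might look like as associations, in the same ActiveRecord style used below (all names here are illustrative, not prescriptive):

class Variant
  has_many :complements   # which other variants may be added alongside this one
  has_one  :price
end

class Customer
  has_many :discounts
  has_many :orders
end

class Order
  belongs_to :customer
  has_many :line_items
  has_many :applied_discounts
end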

[Figure: The graph grows deeper]

Different parts of the model become progressively more tangled together. As the domain becomes more complex, so does the mental model. Graph depth, and therefore coupling, is unbounded.

Where’s the door?

Graph-based models are effective at describing structure, but less so at describing behaviour. What capabilities does the model expose? What workflows does it enable and, more importantly, what are the entry points into those workflows?

This is especially true of ORM-based models, where data and capabilities are melded into the same relational graph. ORMs offer amazing flexibility, but they can make it hard to describe the model’s intended behaviour.

For example, you can create an order line item via the order record:

line_item = order.line_items.create(variant: variant, ...)

… Or via the line item record factory:

line_item = LineItem.create(order: order, variant: variant, ...)

Because of this malleability, domain invariants become hard to enforce and reason about. For example, if a business rule says that an order can only have up to 5 line items, that validation probably lives in the context of an order. But since the object graph does not prescribe its “public” entry points, we’re forced to resort to callbacks and other indirect methods to enforce invariants across whole sections of the graph.

class Order
  has_many :line_items
  validate :max_five_line_items  # a custom rule needs `validate` plus a method

  def max_five_line_items
    errors.add(:base, "can't have more than 5 line items") if line_items.size > 5
  end
end

class LineItem
  belongs_to :order
  before_save :validate_order  # enforced indirectly, from the "child" side

  def validate_order
    throw(:abort) unless order.line_items.count < 5
  end
end

Moreover, business rules are often contextual. The order can only have 5 line items if it is in a certain state, or if the customer is a certain type. Rules are not necessarily properties of the graph structure itself; they are relative to behaviour, data and time.
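A hedged sketch of what that can look like, reusing the validation above (the draft? and wholesale? predicates are made up for illustration):

class Order
  validate :max_five_line_items

  def max_five_line_items
    return unless draft?            # the rule only applies to draft orders
    return if customer&.wholesale?  # ...and not to wholesale customers
    errors.add(:base, "can't have more than 5 line items") if line_items.size > 5
  end
end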

Implicit command layer

One way of identifying behaviour and capabilities in the model is by looking at the points where users interact with it. In CRUD web systems these are usually HTTP handlers or controllers.

# POST /orders/:id/line_items
def create
  order = Order.find(params[:id])
  order.line_items.create(line_item_params)
end

These handlers define the entry points into the system. They constitute the system’s command layer, albeit somewhat informally, and tied to a specific execution context: handling HTTP requests.

If we then need to run some capabilities in the background, or as CLI tasks or scheduled jobs, we’re required to model and implement those entry points in different ways, tied to their distinct execution contexts.

# A background job
class OrderArchivalJob
  def perform(order_id)
    order = Order.find(order_id)
    order.archive
  end
end

# A CLI task
bin/rake orders:archive

These are all commands by another name. We just refer to them by their execution contexts instead of their roles in the system.

Dee-dee-dee

It’s this kind of ambiguity that Domain-Driven Design set out to solve. And its focus on language can indeed provide a lot of clarity, keep coupling in check, and illuminate the entry points into the system.

In particular, DDD’s Aggregate acts as a gatekeeper into the model. Aggregates enforce business rules and guarantee data consistency for entire chunks of the model.

class Order
  def add_line_item(variant_id:, ...)
    # validate business rules
    raise "order can't have more than 5 line items" if line_items.count >= 5
    line_items.create(variant_id: variant_id, ...)
  end

  def update_quantity(line_item_id, quantity)
    # etc
  end

  def remove_line_item(line_item_id)
    # etc
  end
end
# The Order is the Aggregate Root
# for all order-related capabilities
# All interactions with an order or its sub-components 
# are defined as methods in the order
line_item = order.add_line_item(variant_id: 10, ...)

DDD’s Aggregates and bounded contexts can also help keep different parts of the model decoupled from each other by defining strict boundaries around them. In our example above, we can decide that variant prices belong to a pricing context, and that orders belong to a sales context. With this constraint in mind, we then define the contract between the two contexts.
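For instance, the sales context might only ask the pricing context for a price through a narrow, explicit interface, instead of reaching through the graph to the variant’s price record. A minimal sketch, with illustrative names and an assumed unit_price_cents argument on the aggregate:

# Pricing context: owns prices, exposes a small query interface
module Pricing
  Quote = Struct.new(:variant_id, :amount_cents, keyword_init: true)

  def self.quote_for(variant_id:)
    # looked up internally; the sales context never touches pricing tables
    Quote.new(variant_id: variant_id, amount_cents: 350)
  end
end

# Sales context: consumes the contract
quote = Pricing.quote_for(variant_id: 10)
order.add_line_item(variant_id: quote.variant_id, unit_price_cents: quote.amount_cents)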

Standalone command objects

Yet another way to bring a system’s behaviour to the fore, and abstract it away from execution context, is to have explicit command objects. These are sometimes referred to as “service objects” (a misidentification that usually belies a misunderstanding of the role they play).

AddLineItem.run(order_id: 123, variant_id: 456, ...)

This pattern can definitely be misused, but it can also give a clearer indication of a system’s capabilities, as a distinct and uniform abstraction.

  • commands/
    • orders/
      • create.rb
      • add_line_item.rb
      • remove_line_item.rb
      • archive.rb
      • cancel.rb
      • update_quantity.rb
      • place.rb
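A minimal sketch of what one of these command objects might look like (the class-level run entry point is just a convention assumed here, not any particular library’s API):

class AddLineItem
  def self.run(order_id:, **attrs)
    new(order_id: order_id, attrs: attrs).run
  end

  def initialize(order_id:, attrs:)
    @order_id = order_id
    @attrs = attrs
  end

  def run
    order = Order.find(@order_id)
    # delegate to the aggregate so business rules stay in one place
    order.add_line_item(**@attrs)
  end
end

The directory above then reads as a uniform catalogue of the system’s capabilities, while the aggregate still owns the invariants.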

The arrow of time

But in trying to surface a system’s behaviour and entry points, in all cases we hit on the central concept of a command. Commands initiate action, and may lead to state changes and side effects. Commands are the causes of your domain’s effects. At a high level, we can describe entire sets of behaviours in those terms.

[Figure: Cause and effect]

There’s an explicit sense of direction here. There’s a before and an after. There’s time, which is the axis on which behaviour plays out.

Behaviour can be modeled and tracked as a sequence of effects, and effects are events that happen over time.

The deep state

We still need state. But state is a function of behaviour, not the other way around. We can derive state from the events left behind by the system’s behaviour.
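A minimal, self-contained sketch of that derivation, with made-up event shapes: the current state of an order’s line items is just a fold over its event history.

# Events left behind by past behaviour
events = [
  { type: :line_item_added,   variant_id: 10, quantity: 2 },
  { type: :line_item_added,   variant_id: 12, quantity: 1 },
  { type: :line_item_removed, variant_id: 10 }
]

# Current state is derived by replaying them in order
state = events.each_with_object({ line_items: {} }) do |event, acc|
  case event[:type]
  when :line_item_added   then acc[:line_items][event[:variant_id]] = event[:quantity]
  when :line_item_removed then acc[:line_items].delete(event[:variant_id])
  end
end

state # => { line_items: { 12 => 1 } }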

[Figure: State is derived from events]

Time composes

The single directionality of time allows us to compose behaviours together. We can model workflows by stitching together sequences of commands and events. Derived state can be used to drive the next command in the workflow.
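In code terms, the stitching can be as simple as reading a derived view and issuing the next command when it says so (PlaceOrder and the view’s shape are made up for illustration):

# Derived state for an order, projected from its events
view = { status: :draft, line_items: { 12 => 1 } }

# The derived state decides which command comes next in the workflow
PlaceOrder.run(order_id: 123) if view[:status] == :draft && view[:line_items].any?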

[Figure: Composing a workflow]

These are the basic conventions of Event Modeling, which I’m using (liberally) in this post.

Note that there’s no canonical “data model” here. There are ad-hoc data models derived from the system’s behaviour, and they’re essentially throw-away projections or views, in service of the next transition in the state machine, or whatever user-facing screen or report is needed.

[Figure: A linear "single-threaded" workflow]

Concurrency is part of the model

Thinking in timelines and workflows means that we can model concurrency as a first-class citizen of the system.

[Figure: Concurrent timelines in a choreographed workflow]

In this example, a main Order timeline “spawns” payment and build as concurrent timelines, which notify back to the parent timeline when they finish, at their own pace.

Whether the timelines are concurrent or sequential is a business decision. In this case, the coffee shop may judge that starting the build before payment is complete is worth the risk, as it speeds up overall delivery time. Business decisions are, by definition, part of the domain model, and therefore concurrency here is not just a technical concern.
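One hedged sketch of the “join” at the end of those timelines: a small projection watches the order’s events and marks it ready for release once both branches have reported back (the event names and class are illustrative).

class ReadyOrdersProjection
  REQUIRED = [:payment_completed, :build_completed].freeze

  def initialize
    @seen  = Hash.new { |hash, key| hash[key] = [] }
    @ready = []
  end

  attr_reader :ready

  # called for every event appended to an order's timeline, in any order
  def handle(event)
    order_id = event[:order_id]
    @seen[order_id] << event[:type]
    @ready << order_id if (REQUIRED - @seen[order_id]).empty? && !@ready.include?(order_id)
  end
end

Whoever acts on the ready list, a person or an automation, is the next section’s concern.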

Automate this

Composing workflows based on this reduced set of building blocks lets us automate where it makes sense. We can start with a human-operated workflow, where a staff member must confirm that an order is ready to be released to the customer once both payment and build are complete.

[Figure: A human operator moves the workflow forward]

Later, we decide that this step is time-consuming and easily automatable. We now have a script periodically checking a specialised state view and scheduling the next command.
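A hedged sketch of that automation, assuming the readiness projection sketched earlier and an illustrative ReleaseOrder command:

# Runs periodically (cron, a scheduled job, etc.): polls the derived view
# and schedules the next command for every order that is ready
class ReleaseReadyOrders
  def self.run(projection:)
    projection.ready.each do |order_id|
      ReleaseOrder.run(order_id: order_id)
    end
  end
end

ReleaseReadyOrders.run(projection: ready_orders_projection)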

[Figure: An automation tracks orders ready for release]

Note that both human operators and automations act at the same level, as actors in the workflow. They sit on the margins of the domain model, moving the workflow forward.

When, not where

These workflows can be understood independently of their execution context. At this level it matters little whether a command is run from a web controller, a background job, or a CLI task. We care about what the system can do, and when it can do it relative to the workflows it exposes. The model should not concern itself with execution context; in fact, we should be free to change it piecemeal as the system evolves.
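For instance, the same AddLineItem command from earlier could be invoked from any of the execution contexts we saw, without the model knowing or caring (the job class is illustrative):

# From a web controller
def create
  AddLineItem.run(order_id: params[:id], variant_id: params[:variant_id])
end

# From a background job
class AddLineItemJob
  def perform(order_id, variant_id)
    AddLineItem.run(order_id: order_id, variant_id: variant_id)
  end
end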

TL;DR

  • Graphs are deep, timelines are shallow.
  • Shallow is simpler to reason about, and more resistant to coupling.
  • Your domain can be modeled, and thought of, as timelines instead of graphs.

Notes

Workflows are central to a lot of domains. If you think about it, it’s peculiar that we design software around object graphs and treat workflows as, at best, a secondary part of the design process and, at worst, an afterthought.

I suspect this is mainly because of the legacy of relational databases as our primary way of interacting with and storing data for many decades. It’s all about the data schemas, and any behaviour is implicit in the interactions made possible by those schemas. This is reinforced by CRUD toolkits built on top of these assumptions.

Event Modeling is the main source for the ideas and diagrams in this post (though I’ve skipped some of its conventions to get certain points across). While Event Sourcing is the obvious architectural pattern for implementing this approach, modeling in these terms is high-level and should work well with state-stored systems.

Other valuable reading: