Azimuth

Agent-Based Models (Part 9)

2024-05-13T18:42:35Z

Since May 1st, Kris Brown, Nathaniel Osgood, Xiaoyan Li, William Waites and I have been meeting daily in James Clerk Maxwell’s childhood home in Edinburgh.

We’re hard at work on our project called New Mathematics and Software for Agent-Based models. It’s impossible to explain everything we’re doing while it’s happening. But I want to record some of our work. So please pardon how I skim through a bunch of already known ideas in my desperate attempt to quickly reach the main point. I’ll try to make up for this by giving lots of references.

Today I’ll talk about an interesting class of models we have developed together with Sean Wu. We call them ‘stochastic C-set rewriting systems’. They’re just part of our overall framework, but they’re an important part.

In this sort of model time is continuous, the state of the world is described by discrete data, and the state changes stochastically at discrete moments in time. All those features are already present in the class of models I described in Part 7. But today’s models are far more general, because the state of the world is described in a more general way! Now the state of the world at any moment of time is a C-set: a functor

for some fixed finitely presented category C to the category of sets.

C-sets are a flexible generalization of directed graphs. For example, a thing like this is a C-set for an appropriate choice of C:

There are also C-sets that look even less like graphs.

C-sets have been implemented in AlgebraicJulia, a software framework for doing scientific computation with categories. To learn more, start here:

• Evan Patterson, Graphs and C-sets I: What is a graph?, AlgebraicJulia Blog, 1 September 2020.

There’s a lot more on this blog explaining things you can do with C-sets, and how they’re implemented in AlgebraicJulia. We plan to take advantage of all this stuff!

In particular, we’ll use ‘double pushout rewriting’ to specify rules for how a C-set can change with time. If you’re not familiar with this concept, start here:

• nLab, Double pushout rewriting.

This concept is well-understood (by those who understand it well), so I’ll just roughly sketch it. In double pushout rewriting for C-sets, a rewrite rule is a diagram of C-sets

To apply this rewrite rule to a C-set S, we find inside that C-set an instance of the pattern L, called a ‘match’, and replace it with the pattern R. These ‘patterns’ are themselves C-sets. The C-set I can be thought of as the common part of L and I. The maps and tell us how this common part fits into L and R.

Note that in this incredibly sketchy explanation I am already starting to use maps between C-sets! Indeed, for each category C there’s a category called with:

• functors as objects: we call these C-sets;

• natural transformations between such functors as morphisms: we call these C-set maps.

This sort of category has been intensively studied for many decades, and there’s a huge amount we can do with them:

• nLab, Category of presheaves.

I used C-set maps in a couple of places above. First, the arrows here

are C-set maps. For slightly technical reasons we demand that be monic: that’s why I drew it with a hooked arrow. Second, I introduced the term ‘match’ without defining it. But we can define it: a match of L to a C-set S is simply a C-set map

And now for some good news: Kris Brown has already implemented double pushout rewriting for C-sets in AlgebraicJulia:

• Github, AlgebraicRewriting.jl.

Stochastic C-set rewriting systems

Now comes the main idea I want to explain.

A stochastic C-set rewriting system consists of:

1) a category C

2) a finite collection of rewrite rules

3) for each rewrite rule in our collection, a timer This is a stochastic map

That’s all.

What does this do for us? First, it means that for each choice of rewrite rule in our collection, and for each so-called start time we get a probability measure on

Let’s write to mean a randomly chosen element of distributed according to the probability measure We call the wait time, because it says how long after time we should wait until we apply the rewrite rule The time

is called the rewrite time.

In what follows, I’ll always assume these randomly chosen numbers are stochastically independent—even if we reuse the same timer repeatedly for different tasks.

Running a stochastic C-set rewriting system

Okay, so how do we actually use this for modeling? How do we ‘run’ a context-independent stochastic C-set rewriting system? I’ll sketch it out.

The idea is that at any time the state of the world is some C-set, say If you give me the initial state of the world the stochastic C-set rewriting system will tell you how to compute the state of the world at all later times. But this computation involves randomness.

Here’s how it works:

We start at We look for all matches to patterns in the initial state For each match we compute a wait time and then the rewrite time but right now We make a table of all the matches and their rewrite times.

The smallest of the rewrite times in our table, say is the first time the state of the world can change. We change it by applying the rewrite rule to the state of the world When we do this, we cross off the rewrite time and its corresponding match from our table.

More generally, suppose is any time when the state of the world changes. It will have changed by applying some rewrite rule to the previous state of the world, giving some new C-set

When this happens, new matches can appear, and existing matches can disappear. So we do this:

1) For each existing match that disappears, we cross off that match and its rewrite time from our table.

2) For each new match that appears, say one involving the rewrite rule we add that match and its rewrite time to our table.

We then wait until the smallest rewrite time in our table, say At that time, we apply the corresponding rewrite rule to the state , getting some new C-set We also cross off the rewrite time and its corresponding match from our table.

Then just keep doing the loop.

Subtleties

A lot of the subtleties in this formalism involve our use of timers.

For example, I computed wait times using a timer which is a stochastic map

The dependence on here means the wait time can depend on when we start the timer. And the fact that this stochastic map takes values in means the wait time can be infinite. This is a way of letting rewrite rules have a probability < 1 of ever being applied. If you don't like these features you can easily limit the formalism to avoid them.

The more serious subtleties involve whether and how to change wait times as the state of the world changes. For example, we can imagine more general timers that explicitly depend on the current state of the world as well as the time However, in this case I am confused about how we should update our table of wait times as the state of the world changes. So I decided to postpone discussing this generalization!

Hexagonal Tiling Honeycomb

2024-05-10T18:12:45Z

This picture by Roice Nelson shows a remarkable structure: the hexagonal tiling honeycomb.

What is it? Roughly speaking, a honeycomb is a way of filling 3d space with polyhedra. The most symmetrical honeycombs are the ‘regular’ ones. For any honeycomb, we define a flag to be a chosen vertex lying on a chosen edge lying on a chosen face lying on a chosen polyhedron. A honeycomb is regular if its geometrical symmetries act transitively on flags.

The most familiar regular honeycomb is the usual way of filling Euclidean space with cubes. This cubic honeycomb is denoted by the symbol {4,3,4}, because a square has 4 edges, 3 squares meet at each corner of a cube, and 4 cubes meet along each edge of this honeycomb. We can also define regular honeycombs in hyperbolic space. For example, the order-5 cubic honeycomb is a hyperbolic honeycomb denoted {4,3,5}, since 5 cubes meet along each edge:

Coxeter showed there are 15 regular hyperbolic honeycombs. The hexagonal tiling honeycomb is one of these. But it does not contain polyhedra of the usual sort! Instead, it contains flat Euclidean planes embedded in hyperbolic space, each plane containing the vertices of infinitely many regular hexagons. You can think of such a sheet of hexagons as a generalized polyhedron with infinitely many faces. You can see a bunch of such sheets in the picture:

The symbol for the hexagonal tiling honeycomb is {6,3,3}, because a hexagon has 6 edges, 3 hexagons meet at each corner in a plane tiled by regular hexagons, and 3 such planes meet along each edge of this honeycomb. You can see that too if you look carefully.

A flat Euclidean plane in hyperbolic space is called a horosphere. Here’s a picture of a horosphere tiled with regular hexagons, yet again drawn by Roice:

Unlike the previous pictures, which are views from inside hyperbolic space, this uses the Poincaré ball model of hyperbolic space. As you can see here, a horosphere is a limiting case of a sphere in hyperbolic space, where one point of the sphere has become a ‘point at infinity’.

Be careful. A horosphere is intrinsically flat, so if you draw regular hexagons on it their internal angles are

as usual in Euclidean geometry. But a horosphere is not ‘totally geodesic’: straight lines in the horosphere are not geodesics in hyperbolic space! Thus, a hexagon in hyperbolic space with the same vertices as one of the hexagons in the horosphere actually bulges out from the horosphere a bit — and its internal angles are less than : they are

This angle may be familar if you’ve studied tetrahedra. That’s because each vertex lies at the center of a regular tetrahedron, with its four nearest neighbors forming the tetrahedron’s corners.

It’s really these hexagons in hyperbolic space that are faces of the hexagonal tiling honeycomb, not those tiling the horospheres, though perhaps you can barely see the difference. This can be quite confusing until you think about a simpler example, like the difference between a cube in Euclidean 3-space and a cube drawn on a sphere in Euclidean space.

Connection to special relativity

There’s an interesting connection between hyperbolic space, special relativity, and 2×2 matrices. You see, in special relativity, Minkowski spacetime is equipped with the nondegenerate bilinear form

usually called the Minkowski metric. Hyperbolic space sits inside Minowski spacetime as the hyperboloid of points with and 0." class="latex" /> But we can also think of Minkowski spacetime as the space of 2×2 hermitian matrices, using the fact that every such matrix is of the form

and

In these terms, the future cone in Minkowski spacetime is the cone of positive definite hermitian matrices:

0, \, \mathrm{tr}(A) > 0 \right\} " class="latex" />

Sitting inside this we have the hyperboloid

0 \right\} " class="latex" />

which is none other than hyperbolic space!

Connection to the Eisenstein integers

Since the hexagonal tiling honeycomb lives inside hyperbolic space, which in turn lives inside Minkowski spacetime, we should be able to describe the hexagonal tiling honeycomb as sitting inside Minkowski spacetime. But how?

Back in 2022, James Dolan and I conjectured such a description, which takes advantage of the picture of Minkowski spacetime in terms of 2×2 matrices. And this April, working on Mathstodon, Greg Egan and I proved this conjecture!

I’ll just describe the basic idea here, and refer you elsewhere for details.

The Eisenstein integers are the complex numbers of the form

where and are integers and is a cube root of 1. The Eisenstein integers are closed under addition, subtraction and multiplication, and they form a lattice in the complex numbers:

Similarly, the set of 2×2 hermitian matrices with Eisenstein integer entries gives a lattice in Minkowski spacetime, since we can describe Minkowski spacetime as

Here’s the conjecture:

Conjecture. The points in the lattice that lie on the hyperboloid are the centers of hexagons in a hexagonal tiling honeycomb.

Using known results, it’s relatively easy to show that there’s a hexagonal tiling honeycomb whose hexagon centers are all points in The hard part is showing that every point in is a hexagon center. Points in are the same as 4-tuples of integers obeying an inequality (the 0" class="latex" /> condition) and a quadratic equation (the condition). So, we’re trying to show that all 4-tuples obeying those constraints follow a very regular pattern.

Here are two proofs of the conjecture:

• John Baez, Line bundles on complex tori (part 5), The n-Category Café, April 30, 2024.

Greg Egan and I came up with the first proof. The basic idea was to assume there’s a point in that’s not a hexagon center, choose one as close as possible to the identity matrix, and then construct an even closer one, getting a contradiction. Shortly thereafter, someone on Mastodon by the name of Mist came up with a second proof, similar in strategy but different in detail. This increased my confidence in the result.

What’s next?

Something very similar should be true for another regular hyperbolic honeycomb, the square tiling honeycomb:

Here instead of the Eisenstein integers we should use the Gaussian integers, , consisting of all complex numbers

where and are integers.

Conjecture. The points in the lattice that lie on the hyperboloid are the centers of squares in a square tiling honeycomb.

I’m also very interested in how these results connect to algebraic geometry! I explained this in some detail here:

• Line bundles on complex tori (part 4), The n-Category Café, April 26, 2024.

Briefly, the hexagon centers in the hexagonal tiling honeycomb correspond to principal polarizations of the abelian variety . These are concepts that algebraic geometers know and love. Similarly, if the conjecture above is true, the square centers in the square tiling honeycomb will correspond to principal polarizations of the abelian variety . But I’m especially interested in interpreting the other features of these honeycombs — not just the hexagon and square centers — using ideas from algebraic geometry.

Agent-Based Models (Part 8)

2024-04-17T08:13:49Z

Last time I presented a class of agent-based models where agents hop around a graph in a stochastic way. Each vertex of the graph is some ‘state’ agents can be in, and each edge is called a ‘transition’. In these models, the probability per time of an agent making a transition and leaving some state can depend on when it arrived at that state. It can also depend on which agents are in other states that are ‘linked’ to that edge—and when those agents arrived.

I’ve been trying to generalize this framework to handle processes where agents are born or die—or perhaps more generally, processes where some number of agents turn into some other number of agents. There’s already a framework that does something sort of like this. It’s called ‘stochastic Petri nets’, and we explained this framework here:

• John Baez and Jacob Biamonte, Quantum Techniques for Stochastic Mechanics, World Scientific Press, Singapore, 2018. (See also blog articles here.)

However, in their simplest form, stochastic Petri nets are designed for agents whose only distinguishing information is which state they’re in. They don’t have ‘names’—that is, individual identities. Thus, even calling them ‘agents’ is a bit of a stretch: usually they’re called ‘tokens’, since they’re drawn as black dots.

We could try to enhance the Petri net framework to give tokens names and other identifying features. There are various imaginable ways to do this, such as ‘colored Petri nets’. But so far this approach seems rather ill-adapted for processes where agents have identities—perhaps because I’m not thinking about the problem the right way.

So, at some point I decided to try something less ambitious. It turns out that in applications to epidemiology, general processes where n agents come in and m go out are not often required. So I’ve been trying to minimally enhance the framework from last time to include processes ‘birth’ and ‘death’ processes as well as transitions from state to state.

As I thought about this, some questions kept plaguing me:

When an agent gets created, or ‘born’, which one actually gets born? In other words, what is its name? Its precise name may not matter, but if we want to keep track of it after it’s born, we need to give it a name. And this name had better be ‘fresh’: not already the name of some other agent.

There’s also the question of what happens when an agent gets destroyed, or ‘dies’. This feels less difficult: there just stops being an agent with the given name. But probably we want to prevent a new agent from having the same name as that dead agent.

Both these questions seem fairly simple, but so far they’re making it hard for me to invent a truly elegant framework. At first I tried to separately describe transitions between states, births, and deaths. But this seemed to triplicate the amount of work I needed to do.

Then I tried models that have

• a finite set of states,

• a finite set of transitions,

• maps mapping each transition to its upstream and downstream states.

Here is the disjoint union of and a singleton whose one element is called undefined. Maps from to are a standard way to talk about partially defined maps from to We get four cases:

1) If the downstream of a transition is defined (i.e. in ) but its upstream is undefined we call this transition a birth transition.

2) If the upstream of a transition is defined but its downstream is undefined we call this transition a death transition.

3) If the upstream and downstream of a transition are both defined we call this transition a transformation. In practice most of transitions will be of this sort.

4) We never need transitions whose upstream and downstream are undefined: these would describe agents that pop into existence and instantly disappear.

This is sort of nice, except for the fourth case. Unfortunately when I go ahead and try to actually describe a model based on this paradigm, I seem still to wind up needing to handle births, deaths and transformations quite differently.

For example, last time my models had a fixed set of agents. To handle births and deaths, I wanted to make this set time-dependent. But I need to separately say how this works for transformations, birth transitions and death transitions. For transformations we don’t change For birth transitions we add a new element to And for death transitions we remove an element from and maybe record its name on a ledger or drive a stake through its heart to make sure it can never be born again!

So far this is tolerable, but things get worse. Our model also needs ‘links’ from states to transitions, to say how agents present in those states affect the timing of those transition. These are used in the ‘jump function’, a stochastic function that answers this question:

If at time agent arrives at the state upstream to some transition and the agents at states linked to the transition form some set when will agent make the transition given that it doesn’t do anything else first?

This works fine for transformations, meaning transitions that have both an upstream and downstream state. It works just a tiny bit differently for death transitions. But birth transitions are quite different: since newly born agents don’t have a previous upstream state , they don’t have a time at which they arrived at that state.

Perhaps this is just how modeling works: perhaps the search for a staggeringly beautiful framework is a distraction. But another approach just occurred to me. Today I just want to briefly state it. I don’t want to write a full blog article on it yet, since I’ve already spent a lot of time writing two articles that I deleted when I became disgusted with them—and I might become disgusted with this approach too!

Briefly, this approach is exactly the approach I described last time. There are fundamentally no births and no deaths: all transitions have an upstream and a downstream state. There is a fixed set of agents that does not change with time. We handle births and deaths using a dirty trick.

Namely, births are transitions out of a ‘unborn’ state. Agents hang around in this state until they are born.

Similarly, deaths are transitions to a ‘dead’ state.

There can be multiple ‘unborn’ states and ‘dead’ states. Having multiple unborn states makes it easy to have agents with different characteristics enter the model. Having multiple dead states makes it easy for us to keep tallies of different causes of death. We should make the unborn states distinct from the dead states to prevent ‘reincarnation’—that is, the birth of a new agent that happens to equal an agent that previously died.

I’m hoping that when we proceed this way, we can shoehorn birth and death processes into the framework described last time, without really needing to modify it at all! All we’re doing is exploiting it in a new way.

Here’s one possible problem: if we start with a finite number of agents in the ‘unborn’ states, the population of agents can’t grow indefinitely! But this doesn’t seem very dire. For most agent-based models we don’t feel a need to let the number of agents grow arbitrarily large. Or we can relax the requirement that the set of agents is finite, and put an infinite number of agents in an unborn state. This can be done without using an infinite amount of memory: it’s a ‘potential infinity’ rather than an ‘actual infinity’.

There could be other problems. So I’ll post this now before I think of them.

Protonium

2024-04-14T22:39:43Z

It looks like they’ve found protonium in the decay of a heavy particle!

Protonium is made of a proton and an antiproton orbiting each other. It lasts a very short time before they annihilate each other.

It’s a bit like a hydrogen atom where the electron has been replaced with an antiproton! But it’s much smaller than a hydrogen atom. And unlike a hydrogen atom, which is held together by the electric force, protonium is mainly held together by the strong nuclear force.

There are various ways to make protonium. One is to make a bunch of antiprotons and mix them with protons. This was done accidentally in 2002. They only realized this upon carefully analyzing the data 4 years later.

This time, people were studying the decay of the J/psi particle. The J/psi is made of a heavy quark and its antiparticle. It’s 3.3 times as heavy as a proton, so it’s theoretically able to decay into protonium. And careful study showed that yes, it does this sometimes!

The new paper on this has a rather dry title—not “We found protonium!” But it has over 550 authors, which hints that it’s a big deal. I won’t list them.

• BESIII Collaboration, Observation of the anomalous shape of X(1840) in J/ψ→γ3(π+π−), Phys. Rev. Lett. 132 (2024), 151901.

The idea here is that sometimes the J/ψ particle decays into a gamma ray and 3 pion-antipion pairs. When they examined this decay, they found evidence that an intermediate step involved a particle of mass 1880 MeV/c², a bit more than an already known intermediate of mass 1840 MeV/c².

This new particle is a bit lighter than twice the mass of a proton, 938 MeV/c². So, there’s a good chance that it’s protonium!

But how did physicists made protonium by accident in 2002? They were trying to make antihydrogen, which is a positron orbiting an antiproton. To do this, they used the Antiproton Decelerator at CERN. This is just one of the many cool gadgets they keep near the Swiss-French border.

You see, to create antiprotons you need to smash particles at each other at almost the speed of light—so the antiprotons usually shoot out really fast. It takes serious cleverness to slow them down and catch them without letting them bump into matter and annihilate.

That’s what the Antiproton Decelerator does. So they created a bunch of antiprotons and slowed them down. Once they managed to do this, they caught the antiprotons in a Penning trap. This holds charged particles using magnetic and electric fields. Then they cooled the antiprotons—slowed them even more—by letting them interact with a cold gas of electrons. Then they mixed in some positrons. And they got antihydrogen!

But apparently some protons got in there too, so they also made some protonium, by accident. They only realized this when they carefully analyzed the data 4 years later, in a paper with only a few authors:

• N. Zurlo, M. Amoretti, C. Amsler, G. Bonomi, C. Carraro, C. L. Cesar, M. Charlton, M. Doser, A. Fontana, R. Funakoshi, P. Genova, R. S. Hayano, L. V. Jorgensen, A. Kellerbauer, V. Lagomarsino, R. Landua, E. Lodi Rizzini, M. Macri, N. Madsen, G. Manuzio, D. Mitchard, P. Montagna, L. G. Posada, H. Pruys, C. Regenfus, A. Rotondi, G. Testera, D. P. Van der Werf, A. Variola, L. Venturelli and Y. Yamazaki, Production of slow protonium in vacuum, Hyperfine Interactions 172 (2006), 97–105.

Protonium is sometimes called an ‘exotic atom’—though personally I’d consider it an exotic nucleus. The child in me thinks it’s really cool that there’s an abbreviation for protonium, Pn, just like a normal element.

T Corona Borealis

2024-03-27T18:35:32Z

Sometime this year, the star T Corona Borealis will go nova and become much brighter! At least that’s what a lot of astronomers think. So examine the sky between Arcturus and Vega now—and look again if you hear this event has happened. Normally this star is magnitude 10, too dim to see. When it goes nova is should reach magnitude 2 for a week—as bright as the North Star. So you will see a new star, which is the original meaning of ‘nova’.

But why do they think T Corona Borealis will go nova this year? How could they possibly know that?

It’s done this before. It’s a binary star with a white dwarf orbiting a red giant. The red giant is spewing out gas. The much denser white dwarf collects some of this gas on its surface until there’s enough fuel to cause a runaway thermonuclear reaction—a nova!

We’ve seen it happen twice. T Corona Borealis went nova on May 12, 1866 and again on February 9, 1946. What’s happening now is a lot like what happened in 1946.

In February 2015, there was a sustained brightening of T Corona Borealis: it went from magnitude 10.5 to about 9.2. The same thing happened eight years before it went nova the last time.

In June 2018, the star dimmed slightly but still remained at an unusually high level of activity. Then in April 2023 it dimmed to magnitude 12.3. The same thing happened one year before it went nova the last time.

If this pattern continues, T Corona Borealis should erupt sometime between now and September 2024. I’m not completely confident that it will follow the same pattern! But we can just wait and see.

This is one of only 5 known repeating novas in the Milky Way, so we’re lucky to have this chance.

Here’s how it might work:

The description at NASA’s blog:

A red giant star and white dwarf orbit each other in this animation of a nova. The red giant is a large sphere in shades of red, orange, and white, with the side facing the white dwarf the lightest shades. The white dwarf is hidden in a bright glow of white and yellows, which represent an accretion disk around the star. A stream of material, shown as a diffuse cloud of red, flows from the red giant to the white dwarf. The animation opens with the red giant on the right side of the screen, co-orbiting the white dwarf. When the red giant moves behind the white dwarf, a nova explosion on the white dwarf ignites, filling the screen with white light. After the light fades, a ball of ejected nova material is shown in pale orange. A small white spot remains after the fog of material clears, indicating that the white dwarf has survived the explosion.

For more details, try this:

• B. E. Schaefer, B. Kloppenborg, E. O. Waagen and the AAVSO observers, Announcing T CrB pre-eruption dip, AAVSO News and Announcements.

The Probability of Undecidability

2024-03-13T23:59:10Z

There’s a lot we don’t know. There’s a lot we can’t know. But can we at least know how much we can’t know?

What fraction of mathematical statements are undecidable—that is, can be neither proved nor disproved? There are many ways to make this question precise… but it remains a bit mysterious. The best results I know appear, not in a published paper, but on MathOverflow!

In 1998, the Fields-medal winning topologist Michael Freedman conjectured that the fraction of statements that are provable in Peano Arithmetic approaches zero quite rapidly as you go to longer and longer statements:

He must also have been conjecturing that Peano Arithmetic is consistent, since if it’s inconsistent then all its statements are provable. From now on let’s assume that PA is consistent.

In 2005, Cristian Calude and Konrad Jürgensen published a paper arguing that Freedman was on the right track. More precisely, they showed that the fraction of statements in PA that are provable goes to zero as we go to longer and longer statements. The fraction of disprovable statements also goes to zero. So, the fraction of undecidable statements approaches 1.

Unfortunately their paper had a mistake!

In 2009, David Speyer argued that the fraction of provable statements does not approach 0 and does not approach 1 as we consider longer and longer statements. Instead, it’s bounded by numbers between 0 and 1. Similarly for the fraction of undecidable statements! His argument is not air-tight, as he admits and explains—but I believe it. Someone should try to complete his proof.

Speyer’s idea is very simple: if P is any statement, the statement “P or 1 = 1” is provable. This can be used to get a lower bound on the number of provable statements of a given length. Similarly, suppose G is some undecidable statement. Then for any statement P, the statement “G and (P or 1 = 1)” is undecidable. This can be used to get a lower bound on the number of undecidable statements of a given length.

The Probability of the Law of Excluded Middle

2024-03-14T15:27:26Z

The Law of Excluded Middle says that for any statement P, “P or not P” is true.

Is this law true? In classical logic it is. But in intuitionistic logic it’s not.

So, in intuitionistic logic we can ask what’s the probability that a randomly chosen statement obeys the Law of Excluded Middle. And the answer is “at most 2/3—or else your logic is classical”.

This is a very nice new result by Benjamin Bumpus and Zoltan Kocsis:

• Benjamin Bumpus, Degree of classicality, Merlin’s Notebook, 27 February 2024.

Of course they had to make this more precise before proving it. Just as classical logic is described by Boolean algebras, intuitionistic logic is described by something a bit more general: Heyting algebras. They proved that in a finite Heyting algebra, if more than 2/3 of the statements obey the Law of Excluded Middle, then it must be a Boolean algebra!

Interestingly, nothing like this is true for “not not P implies P”. They showed this can hold for an arbitrarily high fraction of statements in a Heyting algebra that is still not Boolean.

Here’s a piece of the free Heyting algebra on one generator, which some call the Rieger–Nishimura lattice:

Taking the principle of excluded middle from the mathematician would be the same, say, as proscribing the telescope to the astronomer or to the boxer the use of his fists. — David Hilbert

I disagree with this statement, but boy, Hilbert sure could write!

Nicholas Ludford

2024-02-29T23:17:15Z

At first glance it’s amazing that one of the great British composers of the 1400s largely sank from view until his works were rediscovered in 1850.

But the reason is not hard to find. When the Puritans took over England, they burned not only witches and heretics, but also books — and music! They hated the complex polyphonic choral music of the Catholics.

So, in the history of British music, between the great polyphonists Robert Fayrfax (1465-1521) and John Taverner (1490-1545), there was a kind of gap — a silence — until the Peterhouse Partbooks were rediscovered.

These were an extensive collection of musical manuscripts, handwritten by a single scribe between 1539 and 1541. Most of them got lost somehow and found only in the 1850s. Others were found even later, in 1926! They were hidden behind a panel in a library — probably hidden from the Puritans.

The 1850 batch contains wonderful compositions by Nicholas Ludford
(~1485-1557). One music scholar has called him “one of the last unsung
geniuses of Tudor polyphony”. Another wrote:

it is more a matter of astonishment that such mastery should be displayed by a composer of whom virtually nothing was known until modern times.

Ludford’s work was first recorded only in 1993, and much of the Peterhouse Partbooks have been recorded only more recently. A Boston group called Blue Heron released a 5-CD set, starting in 2010 and ending in 2017. It’s magnificent!

Below you can hear the Sanctus from Nicholas Ludford’s Missa Regnum mundi. It has long, sleek lines of harmony; you can lose yourself trying to follow all the parts.

Agent-Based Models (Part 7)

2024-04-16T15:50:50Z

Last time I presented a simple, limited class of agent-based models where each agent independently hops around a graph. I wrote:

Today the probability for an agent to hop from one vertex of the graph to another by going along some edge will be determined the moment the agent arrives at that vertex. It will depend only on the agent and the various edges leaving that vertex. Later I’ll want this probability to depend on other things too—like whether other agents are at some vertex or other. When we do that, we’ll need to keep updating this probability as the other agents move around.

Let me try to figure out that generalization now.

Last time I discovered something surprising to me. To describe it, let’s bring in some jargon. The conditional probability per time of an agent making a transition from its current state to a chosen other state (given that it doesn’t make some other transition) is called the hazard function of that transition. In a Markov process, the hazard function is actually a constant, independent of how long the agent has been in its current state. In a semi-Markov process, the hazard function is a function only of how long the agent has been in its current state.

For example, people like to describe radioactive decay using a Markov process, since experimentally it doesn’t seem that ‘old’ radioactive atoms decay at a higher or lower rate than ‘young’ ones. (Quantum theory says this can’t be exactly true, but nobody has seen deviations yet.) On the other hand, the death rate of people is highly non-Markovian, but we might try to describe it using a semi-Markov process. Shortly after birth it’s high—that’s called ‘infant mortality’. Then it goes down, and then it gradually increases.

We definitely want to our agent-based processes to have the ability to describe semi-Markov processes. What surprised me last time is that I could do it without explicitly keeping track of how long the agent has been in its current state, or when it entered its current state!

The reason is that we can decide which state an agent will transition to next, and when, as soon as it enters its current state. This decision is random, of course. But using random number generators we can make this decision the moment the agent enters the given state—because there is nothing more to be learned by waiting! I described an algorithm for doing this.

I’m sure this is well-known, but I had fun rediscovering it.

But today I want to allow the hazard function for a given agent to make a given transition to depend on the states of other agents. In this case, if some other agent randomly changes state, we will need to recompute our agent’s hazard function. There is probably no computationally feasible way to avoid this, in general. In some analytically solvable models there might be—but we’re simulating systems precisely because we don’t know how to solve them analytically.

So now we’ll want to keep track of the residence time of each agent—that is, how long it’s been in its current state. But William Waites pointed out a clever way to do this: it’s cheaper to keep track of the agent’s arrival time, i.e. when it entered its current state. This way you don’t need to keep updating the residence time. Whenever you need to know the residence time, you can just subtract the arrival time from the current clock time.

Even more importantly, our model should now have ‘informational links’ from states to transitions. If we want the presence or absence of agents in some state to affect the hazard function of some transition, we should draw a ‘link’ from that state to that transition! Of course you could say that anything is allowed to affect anything else. But this would create an undisciplined mess where you can’t keep track of the chains of causation. So we want to see explicit ‘links’.

So, here’s my new modeling approach, which generalizes the one we saw last time. For starters, a model should have:

• a finite set of vertices or states,

• a finite set of edges or transitions,

• maps mapping each edge to its source and target, also called its upstream and downstream,

• finite set of agents,

• a finite set of links,

• maps and mapping each link to its source (a state) and its target (a transition).

All of this stuff, except for the set of agents, is exactly what we had in our earlier paper on stock-flow models, where we treated people en masse instead of as individual agents. You can see this in Section 2.1 here:

• John Baez, Xiaoyan Li, Sophie Libkind, Nathaniel D. Osgood, Evan Patterson, Compositional modeling with stock and flow models.

So, I’m trying to copy that paradigm, and eventually unify the two paradigms as much as possible.

But they’re different! In particular, our agent-based models will need a ‘jump function’. This says when each agent will undergo a transition if it arrives at the state upstream to that transition at a specific time This jump function will not be deterministic: it will be a stochastic function, just as it was in yesterday’s formalism. But today it will depend on more things! Yesterday it depended only on and But now the links will come into play.

For each transition , there is set of links whose target is that transition, namely

Each link in will have one state as its source. We say this state affects the transition via the link

We want the jump function for the transition to depend on the presence or absence of agents in each state that affects this transition.

Which agents are in a given state? Well, it depends! But those agents will always form some subset of and thus an element of So, we want the jump function for the transition to depend on an element of

I’ll call this element And as mentioned earlier, the jump function will also depend on a choice of agent and on the arrival time of the agent

So, we’ll say there’s a jump function for each transition which is a stochastic function

The idea, then, is that is the answer to this question:

If at time agent arrived at the vertex and the agents at states linked to the edge are described by the set when will agent move along the edge to the vertex given that it doesn’t do anything else first?

The answer to this question can keep changing as agents other than move around, since the set can keep changing. This is the big difference between today’s formalism and yesterday’s.

Here’s how we run our model. At every moment in time we keep track of some information about each agent namely:

• Which vertex is it at now? We call this vertex the agent’s state,

• When did it arrive at this vertex? We call this time the agent’s arrival time,

• For each edge whose upstream is when will agent move along this edge if it doesn’t do anything else first? Call this time

I need to explain how we keep updating these pieces of information (supposing we already have them). Let’s assume that at some moment in time an agent makes a transition. More specifically, suppose agent makes a transition from the state

to the state

At this moment we update the following information:

1) We set

(So, we update the arrival time of that agent.)

2) We set

(So, we update the state of that agent.)

3) We recompute the subset of agents in the state (by removing from this subset) and in the state (by adding to this subset).

4) For every transition that’s affected by the state or the state , and for every agent in the upstream state of that transition, we set

where is the element of saying which subset of agents is in each state affecting the transition (So, we update our table of times at which agent will make the transition given that it doesn’t do anything else first.)

Now we need to compute the next time at which something happens, namely And we need to compute what actually happens then!

To do this, we look through our table of times for each agent and all transitions out of the state that agent is in. and see which time is smallest. If there’s a tie, break it. Then we reset and to be the agent-edge pair that minimizes

5) We set

Then we loop back around to step 1), but with replacing

Whew! I hope you followed that. If not, please ask questions.

Well Temperaments (Part 6)

2024-02-27T21:36:59Z

Andreas Werckmeister (1645–1706) was a musician and expert on the organ. Compared to Kirnberger, his life seems outwardly dull. He got his musical training from his uncles, and from the age of 19 to his death he worked as an organist in three German towns. That’s about all I know.

His fame comes from the tremendous impact of his his theoretical writings. Most importantly, in his 1687 book Musikalische Temperatur he described the first ‘well tempered’ tuning systems for keyboards, where every key sounds acceptable but each has its own personality. Johann Sebastian Bach read and was influenced by Werckmeister’s work. The first book of Bach’s Well-Tempered Clavier came out in 1722—the first collection of keyboard pieces in all 24 keys.

But Bach was also influenced by Werckmeister’s writings on counterpoint. Werckmeister believed that well-written counterpoint reflected the orderly movements of the planets—especially invertible counterpoint, where as the music goes on, a melody that starts in the high voice switches to the low voice and vice versa. Bach’s Invention No. 13 in A minor is full of invertible counterpoint:

The connection to planets may sound bizarre now, but the ‘music of the spheres’ or ‘musica universalis’ was a long-lived and influential idea. Werckmeister was influenced by Kepler’s 1619 Harmonices Mundi, which has pictures like this:

But the connection between music and astronomy goes back much further: at least to Claudius Ptolemy, and probably even earlier. Ptolemy is most famous for his Almagest, which quite accurately described planetary motions using a geocentric system with epicycles. But his Harmonikon, written around 150 AD, is the first place where just intonation is clearly described, along with a number of related tuning systems. And it’s important to note that this book is not just about ‘harmony theory’. It’s about a subject he calls ‘harmonics’: the general study of vibrating or oscillating systems, including the planets. Thinking hard about this, it become clearer and clearer why the classical ‘quadrivium’ grouped together arithmetic, geometry, music and astronomy.

In Grove Music Online, George Buelow digs a bit deeper:

Werckmeister was essentially unaffected by the innovations of Italian Baroque music. His musical surroundings were nourished by traditions whose roots lay in medieval thought. The study of music was thus for him a speculative science related to theology and mathematics. In his treatises he subjected every aspect of music to two criteria: how it contributed to an expression of the spirit of God, and, as a corollary, how that expression was the result of an order of mathematical principles emanating from God.

“Music is a great gift and miracle from God, an art above all arts because it is prescribed by God himself for his service.” (Hypomnemata musica, 1697.)

“Music is a mathematical science, which shows us through number the correct differences and ratios of sounds from which we can compose a suitable and natural harmony.” (Musicae mathematicae Hodegus curiosus, 1686.)

Musical harmony, he believed, actually reflected the harmony of Creation, and, inspired by the writings of Johannes Kepler, he thought that the heavenly constellations emitted their own musical harmonies, created by God to influence humankind. He took up a middle-of-the-road position in the ancient argument as to whether Ratio (reason) or Sensus (the senses) should rule music and preferred to believe in a rational interplay of the two forces, but in many of his views he remained a mystic and decidedly medieval. No other writer of the period regarded music so unequivocally as the end result of God’s work, and his invaluable interpretations of the symbolic reality of God in number as expressed by musical notes supports the conclusions of scholars who have found number symbolism as theological abstractions in the music of Bach. For example, he not only saw the triad as a musical symbol and actual presence of the Trinity but described the three tones of the triad as symbolizing 1 = the Lord, 2 = Christ and 3 = the Holy Ghost.

The Trinity symbolism may seem wacky, but many people believe it pervades the works of Bach. I’m not convinced yet—it’s not hard to find the number 3 in music, after all. But if Bach read and was influenced by the works of Werckmeister, maybe there really is something to these theories.

Werckmeister’s tuning systems

As his name suggests, Werckmeister was a real workaholic. There are no less than five numbered tuning systems named after him—although the first two were not new. Of these systems, the star is Werckmeister III. I’ll talk more about that one next time. But let’s look briefly at all five.

Werckmeister I

This is another name for just intonation. Just intonation goes back at least to Ptolemy, and it had its heyday of popularity from about 1300 to 1550. I discussed it extensively starting here.

Werckmeister II

This is another name for quarter-comma meantone. Quarter-comma meantone was extremely popular from about 1550 until around 1690, when well temperaments started taking over. I discussed it extensively starting here, but remember:

All but one of the fifths are 1/4 comma flat, making the thirds built from those fifths ‘just’, with frequency ratios of exactly 5/4: these are the black arrows labelled 0. Unfortunately, the sum of the numbers on the circle of fifths needs to be -1. This forces the remaining fifth to be 7/4 commas sharp: it’s a painfully out-of-tune ‘wolf fifth’. And the thirds that cross this fifth are forced to be even worse: 8/4 commas sharp. Those are the problems that Werckmeister sought to solve with his next tuning system!

Werckmeister III

This was probably the world’s first well tempered tuning system! It’s definitely one of the most popular. Here it is:

4 of the fifths are 1/4 comma flat, so the total of the numbers around the circle is -1, as required by the laws of math, without needing any positive numbers. This means we don’t need any fifths to be sharp. That’s nice. But the subtlety of the system is the location of the flatted fifths: starting from C in the circle of fifths they are the 1st, 2nd, 3rd and… not the 4th, but the 6th!

I’ll talk about this more next time. For now, here’s a more elementary point. Comparing this system to quarter-comma meantone, you can see that it’s greatly smoothed down: instead of really great thirds in black and really terrible ones in garish fluorescent green, Werckmeister III has a gentle gradient of mellow hues. That’s ‘well temperament’ in a nutshell.

For more, see:

• Wikipedia, Werckmeister temperament III.

Werckmeister IV

This system is based not on 1/4 commas but on 1/3 commas!

As we go around the circle of fifths starting from B♭, every other fifth is 1/3 comma flat… for a while. But if we kept doing this around the whole circle, we’d get a total of -4. The total has to be -1. So we eventually need to compensate, and Werckmeister IV does so by making two fifths 1/3 comma sharp.

I will say more about Werckmeister IV in a post devoted to systems that use 1/3 and 1/6 commas. But you can already see that its color gradient is sharper than Werckmeister III. Probably as a consequence, it was never very popular.

For more, see:

• Wikipedia, Werckmeister temperament IV.

Werckmeister V

This is another system based on 1/4 commas:

Compared to Werckmeister III this has an extra fifth that’s a quarter comma flat—and thus, to compensate, a fifth that’s a quarter comma sharp. The location of the flat fifths seems a bit more random, but that’s probably just my ignorance.

For more, see:

• Wikipedia, Werckmeister temperament V.

Werckmeister VI

This system is based on a completely different principle. It also has another really cool-sounding name—the ‘septenarius tuning’—because it’s based on dividing a string into 196 = 7 × 7 × 4 equal parts. The resulting scale has only rational numbers as frequency ratios, unlike all the other well temperaments I’m discussing. Werckmeister described this system as “an additional temperament which has nothing at all to do with the divisions of the comma, nevertheless in practice so correct that one can be really satisfied with it”. For details, go here:

• Wikipedia, Werckmeister temperament VI.

Werckmeister on equal temperament

Werckmeister was way ahead of his time. He was not only the first, or one of the first, to systematically pursue well temperaments. He also was one of the first to embrace equal temperament! This system took over around 1790, and rules to this day. But Werckmeister advocated it much earlier—most notably in his final book, published in 1707, one year after his death.

There is an excellent article about this:

• Dietrich Bartel, Andreas Werckmeister’s final tuning: the path to equal temperament, Early Music 43 (2015), 503–512.

You can read it for free if you register for JSTOR. It’s so nice that I’ll quote the beginning:

Any discussion regarding Baroque keyboard tunings normally includes the assumption that Baroque musicians employed a variety of unequal temperaments, allowing them to play in all keys but with individual keys exhibiting unique characteristics, the more frequently used diatonic keys featuring purer 3rds than the less common chromatic ones. Figuring prominently in this discussion are Andreas Werckmeister’s various suggestions for tempered tuning, which he introduces in his Musicalische Temperatur. This is not Werckmeister’s last word on the subject. In fact, the Musicalische Temperatur is an early publication, and the following decade would see numerous further publications by him, a number of which speak on the subject of temperament.

Of particular interest in this regard are Hypomnemata Musica (in particular chapter 11), Die Nothwendigsten Anmerckungen (specifically the appendix in the undated second edition}, Erweiterte und verbesserte Orgel-Probe (in particular chapter 32), Harmonologia Musica (in particular paragraph 27) and Musicalische Paradoxal-Discourse (in particular chapters 13 and 23-5). Throughout these publications, Werckmeister increasingly championed equal temperament. Indeed, in his Paradoxal Discourse much of the discussion concerning other theoretical issues rests on the assumption of equal temperament. Also apparent is his increasing concern with theological speculation, resulting in a theological justification taking precedence over a musical one in his argument for equal temperament. This article traces Werckmeister’s path to equal temperament by examining his references to it in his publications and identifying the supporting arguments for his insistence on equal temperament.

In his Paradoxal Discourse, Werckmeister wrote:

Some may no doubt be astonished that I now wish to institute a temperament in which all 5ths are tempered by 1/12, major 3rds by 2/3 and minor 3rds by 3/4 of a comma, resulting in all consonances possessing equal temperament, a tuning which I did not explicitly introduce in my Monochord.

This is indeed equal temperament:

And in a pun on ‘wolf fifth’, he makes an excuse for not talking about equal temperament earlier:

Had I straightaway assigned the 3rds of the diatonic genus, that tempering which would be demanded by a subdivision of the comma into twelve parts, I would have been completely torn apart by the wolves of ignorance. Therefore it is difficult to eradicate an error straightaway and at once.

However, it seems more likely to me that his position evolved over the years.

What’s next?

You are probably getting overwhelmed by the diversity of tuning systems. Me too! To deal with this, I need to compare similar systems. So, next time I will compare systems that are based on making a bunch of fifths a quarter comma flat. The time after that, I’ll compare systems that are based on making a bunch of fifths a third or a sixth of a comma flat.

For more on Pythagorean tuning, read this series:

• Pythagorean tuning.

For more on just intonation, read this series:

• Just intonation.

For more on quarter-comma meantone tuning, read this series:

• Quarter-comma meantone.

For more on well-tempered scales, read this series:

• Part 1. An introduction to well temperaments.

• Part 2. How small intervals in music arise naturally from products of integral powers of primes that are close to 1. The Pythagorean comma, the syntonic comma and the lesser diesis.

• Part 3. Kirnberger’s rational equal temperament. The schisma, the grad and the atom of Kirnberger.

• Part 4. The music theorist Kirnberger: his life, his personality, and a brief introduction to his three well temperaments.

• Part 5. Kirnberger’s three well temperaments: Kirnberger I, Kirnberger II and Kirnberger III.

For more on equal temperament, read this series:

• Equal temperament.