<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: The Mathematical Origin of Irreversibility</title>
	<atom:link href="http://johncarlosbaez.wordpress.com/2012/10/08/the-mathematical-origin-of-irreversibility/feed/" rel="self" type="application/rss+xml" />
	<link>http://johncarlosbaez.wordpress.com/2012/10/08/the-mathematical-origin-of-irreversibility/</link>
	<description></description>
	<lastBuildDate>Wed, 22 May 2013 00:35:08 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: John Baez</title>
		<link>http://johncarlosbaez.wordpress.com/2012/10/08/the-mathematical-origin-of-irreversibility/#comment-22364</link>
		<dc:creator><![CDATA[John Baez]]></dc:creator>
		<pubDate>Sun, 25 Nov 2012 03:05:02 +0000</pubDate>
		<guid isPermaLink="false">http://johncarlosbaez.wordpress.com/?p=12363#comment-22364</guid>
		<description><![CDATA[If you have a measure $latex d \mu$ you can get another measure by multiplying it by a function $latex f$:

$latex d \nu = f  d\mu$

Conversely, under some conditions you can figure out $latex f$ knowing the measures $latex d \mu$ and $latex d \nu$.  This trick is called the Radon-Nikodym derivative because if you just follow your nose it looks like a derivative

$latex \displaystyle{ \frac{d \nu}{d \mu} = f } $

It&#039;s really more like division: the measure $latex d \nu$ divided by the measure $latex d \mu$ is the function $latex f$.

If this is too abstract for you, imagine $latex d x$ is the usual thing that shows up in integrals and define

$latex d \mu(x) = \alpha(x)  \, d x $

$latex d \nu(x) = \beta(x)  \, d x $

for functions $latex \alpha$ and $latex \beta$.  Then the Radon-Nikodym derivative at the point $latex x$ is

$latex  \displaystyle{ \frac{d \nu(x)}{d \mu(x)} = \frac{\alpha(x)}{\beta(x)} } $

whenever this exists, that is, whenever you&#039;re not dividing by zero.

I&#039;m stating everything in a way that leaves out the technical fine print that&#039;s needed for an actual theorem.  For that, try &lt;a href=&quot;http://en.wikipedia.org/wiki/Radon%E2%80%93Nikodym_theorem&quot; rel=&quot;nofollow&quot;&gt;Wikipedia&lt;/a&gt;.  But you said you wanted intuition, and the intuition is a lot simpler than the Wikipedia article makes it seem.]]></description>
		<content:encoded><![CDATA[<p>If you have a measure <img src='http://s0.wp.com/latex.php?latex=d+%5Cmu&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d &#92;mu' title='d &#92;mu' class='latex' /> you can get another measure by multiplying it by a function <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' />:</p>
<p><img src='http://s0.wp.com/latex.php?latex=d+%5Cnu+%3D+f++d%5Cmu&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d &#92;nu = f  d&#92;mu' title='d &#92;nu = f  d&#92;mu' class='latex' /></p>
<p>Conversely, under some conditions you can figure out <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' /> knowing the measures <img src='http://s0.wp.com/latex.php?latex=d+%5Cmu&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d &#92;mu' title='d &#92;mu' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=d+%5Cnu&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d &#92;nu' title='d &#92;nu' class='latex' />.  This trick is called the Radon-Nikodym derivative because if you just follow your nose it looks like a derivative</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle%7B+%5Cfrac%7Bd+%5Cnu%7D%7Bd+%5Cmu%7D+%3D+f+%7D+&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;displaystyle{ &#92;frac{d &#92;nu}{d &#92;mu} = f } ' title='&#92;displaystyle{ &#92;frac{d &#92;nu}{d &#92;mu} = f } ' class='latex' /></p>
<p>It&#8217;s really more like division: the measure <img src='http://s0.wp.com/latex.php?latex=d+%5Cnu&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d &#92;nu' title='d &#92;nu' class='latex' /> divided by the measure <img src='http://s0.wp.com/latex.php?latex=d+%5Cmu&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d &#92;mu' title='d &#92;mu' class='latex' /> is the function <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='f' title='f' class='latex' />.</p>
<p>If this is too abstract for you, imagine <img src='http://s0.wp.com/latex.php?latex=d+x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d x' title='d x' class='latex' /> is the usual thing that shows up in integrals and define</p>
<p><img src='http://s0.wp.com/latex.php?latex=d+%5Cmu%28x%29+%3D+%5Calpha%28x%29++%5C%2C+d+x+&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d &#92;mu(x) = &#92;alpha(x)  &#92;, d x ' title='d &#92;mu(x) = &#92;alpha(x)  &#92;, d x ' class='latex' /></p>
<p><img src='http://s0.wp.com/latex.php?latex=d+%5Cnu%28x%29+%3D+%5Cbeta%28x%29++%5C%2C+d+x+&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='d &#92;nu(x) = &#92;beta(x)  &#92;, d x ' title='d &#92;nu(x) = &#92;beta(x)  &#92;, d x ' class='latex' /></p>
<p>for functions <img src='http://s0.wp.com/latex.php?latex=%5Calpha&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;alpha' title='&#92;alpha' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5Cbeta&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;beta' title='&#92;beta' class='latex' />.  Then the Radon-Nikodym derivative at the point <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x' title='x' class='latex' /> is</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle%7B+%5Cfrac%7Bd+%5Cnu%28x%29%7D%7Bd+%5Cmu%28x%29%7D+%3D+%5Cfrac%7B%5Calpha%28x%29%7D%7B%5Cbeta%28x%29%7D+%7D+&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;displaystyle{ &#92;frac{d &#92;nu(x)}{d &#92;mu(x)} = &#92;frac{&#92;alpha(x)}{&#92;beta(x)} } ' title='&#92;displaystyle{ &#92;frac{d &#92;nu(x)}{d &#92;mu(x)} = &#92;frac{&#92;alpha(x)}{&#92;beta(x)} } ' class='latex' /></p>
<p>whenever this exists, that is, whenever you&#8217;re not dividing by zero.</p>
<p>I&#8217;m stating everything in a way that leaves out the technical fine print that&#8217;s needed for an actual theorem.  For that, try <a href="http://en.wikipedia.org/wiki/Radon%E2%80%93Nikodym_theorem" rel="nofollow">Wikipedia</a>.  But you said you wanted intuition, and the intuition is a lot simpler than the Wikipedia article makes it seem.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jon Rowlands</title>
		<link>http://johncarlosbaez.wordpress.com/2012/10/08/the-mathematical-origin-of-irreversibility/#comment-22362</link>
		<dc:creator><![CDATA[Jon Rowlands]]></dc:creator>
		<pubDate>Sun, 25 Nov 2012 02:54:20 +0000</pubDate>
		<guid isPermaLink="false">http://johncarlosbaez.wordpress.com/?p=12363#comment-22362</guid>
		<description><![CDATA[The properties of the Radon-Nikodym derivative are invoked all through the proof, and understanding the end result really seems to come down to understanding this object. Can you give any intuition about it?]]></description>
		<content:encoded><![CDATA[<p>The properties of the Radon-Nikodym derivative are invoked all through the proof, and understanding the end result really seems to come down to understanding this object. Can you give any intuition about it?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John Baez</title>
		<link>http://johncarlosbaez.wordpress.com/2012/10/08/the-mathematical-origin-of-irreversibility/#comment-20972</link>
		<dc:creator><![CDATA[John Baez]]></dc:creator>
		<pubDate>Sun, 21 Oct 2012 22:58:08 +0000</pubDate>
		<guid isPermaLink="false">http://johncarlosbaez.wordpress.com/?p=12363#comment-20972</guid>
		<description><![CDATA[I&#039;ve really got to read this... thanks for the tantalizing summary!]]></description>
		<content:encoded><![CDATA[<p>I&#8217;ve really got to read this&#8230; thanks for the tantalizing summary!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Don Foster</title>
		<link>http://johncarlosbaez.wordpress.com/2012/10/08/the-mathematical-origin-of-irreversibility/#comment-20964</link>
		<dc:creator><![CDATA[Don Foster]]></dc:creator>
		<pubDate>Sun, 21 Oct 2012 17:25:12 +0000</pubDate>
		<guid isPermaLink="false">http://johncarlosbaez.wordpress.com/?p=12363#comment-20964</guid>
		<description><![CDATA[With regard to the notions of fittingness and gradients of one sort and another, I am curious about the origins of concerted action. If this comment is way off track, would you kindly ignore it or simply delete it.

Consider that when you take a stroll in the woods, metric tons of raw matter around you are acting in concert, there is a high degree of fittingness on many meta-levels, everything is mutually finding its proper angle of repose and surprisingly, for great deal of matter, that angle is nearly vertical. There is consensus, a congruous entwining of energy paths, and a multiplicity of ‘formal’ agreements that are both dynamic and enduring. You belong in this scene to the very level of complex DNA chemistry that is resonant within the world around you.

Now, in an ideal gas at maximal entropy, I would expect each molecule to be traveling on its own unique trajectory, each with its own information theoretic distinction. There would be no concerted action, no consensus as to path. Does the trend toward increasing entropy and the final state thereof allow us to roughly characterize the nature of energy? That is, if there is utility in the notion ‘nature abhors a gradient’ is there also some utility in the notion that energy eschews pathways?

If we tentatively accept that idea, then how is it possible to view the world about us as resulting from the evolving ‘chemistry’ and entwinement of energy pathways?

&#039;Into the Cool: Energy Flow, Thermodynamics and Life&#039; sounds very similar in scope to Howard T Odum’s book, &#039;Environment, Power and Society&#039;, published in 1971, both in search of general organizing principles.

General organizing principles are useful and in that regard I am wondering if it is useful to identify some grand counterpoise to energy as being catalytic in the emergence of path. Pathways emerge on gradients, not at equilibrium. On a deep level, could path be viewed as emergent between countervailing gradients of distinction?]]></description>
		<content:encoded><![CDATA[<p>With regard to the notions of fittingness and gradients of one sort and another, I am curious about the origins of concerted action. If this comment is way off track, would you kindly ignore it or simply delete it.</p>
<p>Consider that when you take a stroll in the woods, metric tons of raw matter around you are acting in concert, there is a high degree of fittingness on many meta-levels, everything is mutually finding its proper angle of repose and surprisingly, for great deal of matter, that angle is nearly vertical. There is consensus, a congruous entwining of energy paths, and a multiplicity of ‘formal’ agreements that are both dynamic and enduring. You belong in this scene to the very level of complex DNA chemistry that is resonant within the world around you.</p>
<p>Now, in an ideal gas at maximal entropy, I would expect each molecule to be traveling on its own unique trajectory, each with its own information theoretic distinction. There would be no concerted action, no consensus as to path. Does the trend toward increasing entropy and the final state thereof allow us to roughly characterize the nature of energy? That is, if there is utility in the notion ‘nature abhors a gradient’ is there also some utility in the notion that energy eschews pathways?</p>
<p>If we tentatively accept that idea, then how is it possible to view the world about us as resulting from the evolving ‘chemistry’ and entwinement of energy pathways?</p>
<p>&#8216;Into the Cool: Energy Flow, Thermodynamics and Life&#8217; sounds very similar in scope to Howard T Odum’s book, &#8216;Environment, Power and Society&#8217;, published in 1971, both in search of general organizing principles.</p>
<p>General organizing principles are useful and in that regard I am wondering if it is useful to identify some grand counterpoise to energy as being catalytic in the emergence of path. Pathways emerge on gradients, not at equilibrium. On a deep level, could path be viewed as emergent between countervailing gradients of distinction?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Matteo Smerlak</title>
		<link>http://johncarlosbaez.wordpress.com/2012/10/08/the-mathematical-origin-of-irreversibility/#comment-20862</link>
		<dc:creator><![CDATA[Matteo Smerlak]]></dc:creator>
		<pubDate>Thu, 18 Oct 2012 19:03:55 +0000</pubDate>
		<guid isPermaLink="false">http://johncarlosbaez.wordpress.com/?p=12363#comment-20862</guid>
		<description><![CDATA[The fitness flux $latex \Phi$ is defined as a sum over all transitions along the process. For each transition $latex j$, it compares the forward and backward transition rates *at time $latex \tau_j$*. You&#039;re right that if the environmental conditions change along the process, what counts as fit also change; what matters is the fitness variation at each transition. In other words, in a changing environment $latex \Phi$ depends on the actual times at which the transitions took place: a transition that increases fitness at some time may not increase fitness at some other time. But did I get your question right, Graham?]]></description>
		<content:encoded><![CDATA[<p>The fitness flux <img src='http://s0.wp.com/latex.php?latex=%5CPhi&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;Phi' title='&#92;Phi' class='latex' /> is defined as a sum over all transitions along the process. For each transition <img src='http://s0.wp.com/latex.php?latex=j&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='j' title='j' class='latex' />, it compares the forward and backward transition rates *at time <img src='http://s0.wp.com/latex.php?latex=%5Ctau_j&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;tau_j' title='&#92;tau_j' class='latex' />*. You&#8217;re right that if the environmental conditions change along the process, what counts as fit also change; what matters is the fitness variation at each transition. In other words, in a changing environment <img src='http://s0.wp.com/latex.php?latex=%5CPhi&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;Phi' title='&#92;Phi' class='latex' /> depends on the actual times at which the transitions took place: a transition that increases fitness at some time may not increase fitness at some other time. But did I get your question right, Graham?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Markovian Excursions &#124; Eventually Almost Everywhere</title>
		<link>http://johncarlosbaez.wordpress.com/2012/10/08/the-mathematical-origin-of-irreversibility/#comment-20833</link>
		<dc:creator><![CDATA[Markovian Excursions &#124; Eventually Almost Everywhere]]></dc:creator>
		<pubDate>Wed, 17 Oct 2012 16:19:52 +0000</pubDate>
		<guid isPermaLink="false">http://johncarlosbaez.wordpress.com/?p=12363#comment-20833</guid>
		<description><![CDATA[[...] The Mathematical Origin of Irreversibility (johncarlosbaez.wordpress.com) [...]]]></description>
		<content:encoded><![CDATA[<p>[...] The Mathematical Origin of Irreversibility (johncarlosbaez.wordpress.com) [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Blake Pollard</title>
		<link>http://johncarlosbaez.wordpress.com/2012/10/08/the-mathematical-origin-of-irreversibility/#comment-20821</link>
		<dc:creator><![CDATA[Blake Pollard]]></dc:creator>
		<pubDate>Wed, 17 Oct 2012 07:18:55 +0000</pubDate>
		<guid isPermaLink="false">http://johncarlosbaez.wordpress.com/?p=12363#comment-20821</guid>
		<description><![CDATA[An interesting/related paper was recently published in PRL, &quot;Thermodynamics of Prediction&quot; by S. Still, with Crooks as a coauthor, here is the arXiv link:   http://arxiv.org/abs/1203.3271 

I&#039;ll try to explain it a bit to encourage you to read it.

Related to Jarzynski&#039;s work and the fluctuation theorem is the idea of measuring equilibrium quantities by forcing a system out of equilibrium and observing the response, see: http://www.physics.berkeley.edu/research/liphardt/pdfs/JarzynskiTest.pdf

In Still&#039;s paper they observe that much of the literature assumes that the driving signal is given/known explicitly, while in nature and biological systems this is most often not the case, hence they study stochastic driving signals. 

The idea is that implicit in the dynamics of such a forced system is a model of its (stochastic) environment. They then ask how the quality of this model is related to thermodynamic efficiency. 
How do you measure the quality of a model? 
For them a good model should have predictive power and not be overly complicated. This comes down to a balance of a systems memory, namely you break it into two parts, one useful, predictive part, and the other &quot;useless nostalgia&quot;. It is this latter part that is related to dissipation, hence having less nostalgia is having less dissipation is having better thermodynamic efficiency. 

Here is a whirlwind tour of how the paper makes this idea precise, taking many quotes straight from the paper:
&quot;the dynamics of system are modeled by discrete time Markovian conditional state-to-state transition probabilities&quot; 
For the driving signal they assume only that its changes are governed by some probability density.
Their system is in contact with a heat bath and starts in equilibrium. It then goes through a bunch of steps where the environment forces it out of equilibrium and then it relaxes. The environment does work on the system in each driving step and heat flows at each relaxation step. 
If you let it relax all the way to equilibrium any additional free energy gets dissipated as heat back to the environment. The additional free energy is given by the Kullback-Leibler divergence, or the relative entropy between the current state/distribution and the equilibrium one. The change in non-equilibrium free energy is the sum of the change in equilibrium free energy and this additional piece: $latex \Delta F_{neq} = \Delta F_{eq} + F_{add} $
Dissipation work is given by the difference between work done on the system and the non-equilibrium change in free energy: $latex W_{diss} = W - \Delta F_{neq} $
Excess work is given by the difference in work done on the system and the equilibrium free-energy (the work done for the quasistatic case): $latex W_{ex} = W - \Delta F_{eq} $
The excess work minus the dissipation work gives you, $latex W_{ex} - W_{diss} = \Delta F_{neq} - \Delta F_{eq} = F_{add} $ , which is precisely the additional free energy (the KL divergence of the current distribution relative to the equilibrium one). 

Now for the information/prediction half, which I understand even less!
They look at Shannon&#039;s (symmetric) mutual information of the system state and the external driving signal, both at a certain time t, this is called the system&#039;s &#039;instantaneous memory&#039;, $latex I[s_{t},x_{t}] $ where s is for system and x is for external signal. The &#039;instantaneous predictive power&#039; is, $latex I[s_{t}, x_{t+1}] $, or the mutual information of a state at time t and the driving signal at t+1.  The difference of the two is the &#039;instantaneous nonpredictive  information;&#039; &quot;it represents useless nostalgia and provides a measure for the ineffectiveness of the implicit model.&quot; (So memory-power=information, kidding)
The paper then shows that this instantaneous nostalgia is proportional to the average work dissipated as t goes to t+1.

They derive a lower bound on the total dissipation and use it to refine Landauer&#039;s principle. They then discuss this in relation to biological systems, where the systems have adapted to their environments forcing, asking if minimizing nostalgia is a factor driving things towards energetic efficiency. 

This paper is more about thermodynamics and prediction (as the title suggests) than about reversibility. I don&#039;t understand it yet, but there are hints of connections here, not only among biology, chemistry and nature, but also information and complexity (algorithmic information theory). I guess that&#039;s why there is all the talk of Kolmogorov above. I&#039;m new at this, and I&#039;m sure many of you reading this have a better big picture! It seems though that in Still&#039;s paper, a good model is about balancing complexity with predictability, and that this is done by not having any nostalgia, which doesn&#039;t sound practical for us nostalgic humans! I would like to learn more though about mutual information in biological systems, and how some systems we understand a little bit have some &#039;memory&#039; built in, either about their environment, or even their own dynamics.

So read the paper, it does a better job explaining itself!]]></description>
		<content:encoded><![CDATA[<p>An interesting/related paper was recently published in PRL, &#8220;Thermodynamics of Prediction&#8221; by S. Still, with Crooks as a coauthor, here is the arXiv link:   <a href="http://arxiv.org/abs/1203.3271" rel="nofollow">http://arxiv.org/abs/1203.3271</a> </p>
<p>I&#8217;ll try to explain it a bit to encourage you to read it.</p>
<p>Related to Jarzynski&#8217;s work and the fluctuation theorem is the idea of measuring equilibrium quantities by forcing a system out of equilibrium and observing the response, see: <a href="http://www.physics.berkeley.edu/research/liphardt/pdfs/JarzynskiTest.pdf" rel="nofollow">http://www.physics.berkeley.edu/research/liphardt/pdfs/JarzynskiTest.pdf</a></p>
<p>In Still&#8217;s paper they observe that much of the literature assumes that the driving signal is given/known explicitly, while in nature and biological systems this is most often not the case, hence they study stochastic driving signals. </p>
<p>The idea is that implicit in the dynamics of such a forced system is a model of its (stochastic) environment. They then ask how the quality of this model is related to thermodynamic efficiency.<br />
How do you measure the quality of a model?<br />
For them a good model should have predictive power and not be overly complicated. This comes down to a balance of a systems memory, namely you break it into two parts, one useful, predictive part, and the other &#8220;useless nostalgia&#8221;. It is this latter part that is related to dissipation, hence having less nostalgia is having less dissipation is having better thermodynamic efficiency. </p>
<p>Here is a whirlwind tour of how the paper makes this idea precise, taking many quotes straight from the paper:<br />
&#8220;the dynamics of system are modeled by discrete time Markovian conditional state-to-state transition probabilities&#8221;<br />
For the driving signal they assume only that its changes are governed by some probability density.<br />
Their system is in contact with a heat bath and starts in equilibrium. It then goes through a bunch of steps where the environment forces it out of equilibrium and then it relaxes. The environment does work on the system in each driving step and heat flows at each relaxation step.<br />
If you let it relax all the way to equilibrium any additional free energy gets dissipated as heat back to the environment. The additional free energy is given by the Kullback-Leibler divergence, or the relative entropy between the current state/distribution and the equilibrium one. The change in non-equilibrium free energy is the sum of the change in equilibrium free energy and this additional piece: <img src='http://s0.wp.com/latex.php?latex=%5CDelta+F_%7Bneq%7D+%3D+%5CDelta+F_%7Beq%7D+%2B+F_%7Badd%7D+&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;Delta F_{neq} = &#92;Delta F_{eq} + F_{add} ' title='&#92;Delta F_{neq} = &#92;Delta F_{eq} + F_{add} ' class='latex' /><br />
Dissipation work is given by the difference between work done on the system and the non-equilibrium change in free energy: <img src='http://s0.wp.com/latex.php?latex=W_%7Bdiss%7D+%3D+W+-+%5CDelta+F_%7Bneq%7D+&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='W_{diss} = W - &#92;Delta F_{neq} ' title='W_{diss} = W - &#92;Delta F_{neq} ' class='latex' /><br />
Excess work is given by the difference in work done on the system and the equilibrium free-energy (the work done for the quasistatic case): <img src='http://s0.wp.com/latex.php?latex=W_%7Bex%7D+%3D+W+-+%5CDelta+F_%7Beq%7D+&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='W_{ex} = W - &#92;Delta F_{eq} ' title='W_{ex} = W - &#92;Delta F_{eq} ' class='latex' /><br />
The excess work minus the dissipation work gives you, <img src='http://s0.wp.com/latex.php?latex=W_%7Bex%7D+-+W_%7Bdiss%7D+%3D+%5CDelta+F_%7Bneq%7D+-+%5CDelta+F_%7Beq%7D+%3D+F_%7Badd%7D+&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='W_{ex} - W_{diss} = &#92;Delta F_{neq} - &#92;Delta F_{eq} = F_{add} ' title='W_{ex} - W_{diss} = &#92;Delta F_{neq} - &#92;Delta F_{eq} = F_{add} ' class='latex' /> , which is precisely the additional free energy (the KL divergence of the current distribution relative to the equilibrium one). </p>
<p>Now for the information/prediction half, which I understand even less!<br />
They look at Shannon&#8217;s (symmetric) mutual information of the system state and the external driving signal, both at a certain time t, this is called the system&#8217;s &#8216;instantaneous memory&#8217;, <img src='http://s0.wp.com/latex.php?latex=I%5Bs_%7Bt%7D%2Cx_%7Bt%7D%5D+&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='I[s_{t},x_{t}] ' title='I[s_{t},x_{t}] ' class='latex' /> where s is for system and x is for external signal. The &#8216;instantaneous predictive power&#8217; is, <img src='http://s0.wp.com/latex.php?latex=I%5Bs_%7Bt%7D%2C+x_%7Bt%2B1%7D%5D+&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='I[s_{t}, x_{t+1}] ' title='I[s_{t}, x_{t+1}] ' class='latex' />, or the mutual information of a state at time t and the driving signal at t+1.  The difference of the two is the &#8216;instantaneous nonpredictive  information;&#8217; &#8220;it represents useless nostalgia and provides a measure for the ineffectiveness of the implicit model.&#8221; (So memory-power=information, kidding)<br />
The paper then shows that this instantaneous nostalgia is proportional to the average work dissipated as t goes to t+1.</p>
<p>They derive a lower bound on the total dissipation and use it to refine Landauer&#8217;s principle. They then discuss this in relation to biological systems, where the systems have adapted to their environments forcing, asking if minimizing nostalgia is a factor driving things towards energetic efficiency. </p>
<p>This paper is more about thermodynamics and prediction (as the title suggests) than about reversibility. I don&#8217;t understand it yet, but there are hints of connections here, not only among biology, chemistry and nature, but also information and complexity (algorithmic information theory). I guess that&#8217;s why there is all the talk of Kolmogorov above. I&#8217;m new at this, and I&#8217;m sure many of you reading this have a better big picture! It seems though that in Still&#8217;s paper, a good model is about balancing complexity with predictability, and that this is done by not having any nostalgia, which doesn&#8217;t sound practical for us nostalgic humans! I would like to learn more though about mutual information in biological systems, and how some systems we understand a little bit have some &#8216;memory&#8217; built in, either about their environment, or even their own dynamics.</p>
<p>So read the paper, it does a better job explaining itself!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Matteo Smerlak</title>
		<link>http://johncarlosbaez.wordpress.com/2012/10/08/the-mathematical-origin-of-irreversibility/#comment-20706</link>
		<dc:creator><![CDATA[Matteo Smerlak]]></dc:creator>
		<pubDate>Fri, 12 Oct 2012 22:13:13 +0000</pubDate>
		<guid isPermaLink="false">http://johncarlosbaez.wordpress.com/?p=12363#comment-20706</guid>
		<description><![CDATA[You&#039;re right, this is a typo. Thanks for pointing it out!]]></description>
		<content:encoded><![CDATA[<p>You&#8217;re right, this is a typo. Thanks for pointing it out!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Graham</title>
		<link>http://johncarlosbaez.wordpress.com/2012/10/08/the-mathematical-origin-of-irreversibility/#comment-20705</link>
		<dc:creator><![CDATA[Graham]]></dc:creator>
		<pubDate>Fri, 12 Oct 2012 21:21:24 +0000</pubDate>
		<guid isPermaLink="false">http://johncarlosbaez.wordpress.com/?p=12363#comment-20705</guid>
		<description><![CDATA[I just remembered this paper.

http://www.sekj.org/PDF/anzf40/anzf40-185.pdf

It has simulations of populations constantly adapting to a changing environment, and sometimes going extinct.]]></description>
		<content:encoded><![CDATA[<p>I just remembered this paper.</p>
<p><a href="http://www.sekj.org/PDF/anzf40/anzf40-185.pdf" rel="nofollow">http://www.sekj.org/PDF/anzf40/anzf40-185.pdf</a></p>
<p>It has simulations of populations constantly adapting to a changing environment, and sometimes going extinct.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Graham</title>
		<link>http://johncarlosbaez.wordpress.com/2012/10/08/the-mathematical-origin-of-irreversibility/#comment-20704</link>
		<dc:creator><![CDATA[Graham]]></dc:creator>
		<pubDate>Fri, 12 Oct 2012 21:15:13 +0000</pubDate>
		<guid isPermaLink="false">http://johncarlosbaez.wordpress.com/?p=12363#comment-20704</guid>
		<description><![CDATA[I don&#039;t follow that. It seems that if the environment changes, what counts as fit changes, so the definition of Phi changes. 

What happens in an oscillating environment, where the population keeps adapting, then adapting back?]]></description>
		<content:encoded><![CDATA[<p>I don&#8217;t follow that. It seems that if the environment changes, what counts as fit changes, so the definition of Phi changes. </p>
<p>What happens in an oscillating environment, where the population keeps adapting, then adapting back?</p>
]]></content:encoded>
	</item>
</channel>
</rss>
