December 31, 2006

Books to Read While the Algae Grow in Your Fur, December 2006

David Nasaw, Andrew Carnegie
Readable, sympathetic and highly detailed biography of Pittsburgh's most famous and important citizen. (Like most biographies, it seems to me at least twice as long as necessary.) The sympathy does not prevent Nasaw from being quite blunt about Carnegie's dishonesties (on, e.g., the great Homestead Strike of 1892, or union-busting generally, or avoiding financial speculation) and self-delusions. For all that, Carnegie still emerges as a truly remarkable man, and, if we must allow the concentration of great wealth into private hands, far better that it be used the way Carnegie did than the common alternatives. Interestingly, the idea that he somehow deserved his fortune, or earned it through special efforts, was not among his self-delusions. (In fact, some of what Nasaw quotes him as saying is not unlike this by Herbert Simon.) This strikes me as one of the most remarkable things about him.
On a local note, it was a bit disconcerting to learn that the Carnegie Mellon University at which I teach (an expensive research university, emphasizing the graduate education of students from around the world) is so thoroughly not what "our founder and benefactor" had in mind (a vocational-technical school for the children of the local working class). It is hard for me to wish this change undone, but...
Stanislaw Lem, Solaris
See: On First Looking Into Lem's Solaris.
David King, The Commissar Vanishes: The Falsification of Photographs and Art in Stalin's Russia
A sermon, in pictures, on the text "The state is the coldest of all cold monsters ... and whatever it says, it lies." With abundant thanks to John "reprieved" Burke for lending this book to my brother.
Max Décharné, Straight from the Fridge, Dad: A Dictionary of Hipster Slang
Of course, he could have made four fifths of this up, for all I can tell, but at least he gives sources for almost everything, and, well, se non è vero, è ben trovato.
C. J. Box, Savage Run
Enjoyable mystery novel in contemporary Wyoming, centering on conflicts between environmentalists and ranchers, with a likeable, and only-too-familiarly-imperfect, game warden caught in the middle. Second book in a series, but I read this without having read the first one.
Cecilia Dart-Thornton, The Ill-Made Mute
Brain-candy fantasy. Clear influences of Tolkien, Vance, Celtic folklore, and Australian natural history (also pretty clearly a first novel). I read it with enjoyment while doped up with cold medicine, but think it would have been fun regardless. First in a trilogy; I'll be getting the others out of the library.
P. C. Hodgell, To Ride a Rathorn
Latest (fourth) installment in Hodgell's long-running, and remarkable, fantasy series. Probably incomprehensible if you haven't read the previous volumes. (The first two, Godstalk and Dark of the Moon, are so long out of print I won't even bother to give them links; a two-in-one reprint, Dark of the Gods, is only recently out of print. Seeker's Mask, the third, is still available.) For instance, I for one am not about to try explaining why Our Heroine has earned the undying hatred of the titular creature, a sort of carnivorous unicorn, by rendering his mother a profound service. But, if you like fantasy, you owe it to yourself to read them: for Hodgell manages, by seamless turns, to convey wonder, humor, intrigue, profound creepiness, and the resolve to struggle against fate which comes close to the heart of epic.
Steven Hamilton, A Cold Day in Paradise
Reasonably good detective novel, somewhat hard-boiled (e.g., the Beautiful Woman Who Is Trouble For The Narrator), but not, you should forgive the expression, inedibly hard-boiled. Won multiple awards when it came out, which I don't get at all.
Andrea Camilleri, The Terra-cotta Dog
A modern campaign against the Mafia intersecting oddly with WWII-era secrets; very different from Forbes's Waking Raphael, despite these thematic similarities, and even the shared obsession with food. Very fun. However, the cover has nothing to do with the book. (Previous Camilleri plugs, June '05 and July '06; Forbes, January '06.)
E. J. G. Pitman, Some Basic Theory for Statistical Inference
Review: Intermediate Statistics from an Advanced Point of View.

Books to Read While the Algae Grow in Your Fur; Heard About Pittsburgh, PA; Scientifiction and Fantastica; Pleasures of Detection, Portraits of Crime; Enigmas of Chance

Posted by crshalizi at December 31, 2006 23:59 | permanent link

December 30, 2006

Absolutely Regular

Leo Kontorovich was last seen in these parts when I plugged his recent paper on concentration of measure in mixing processes. He has just started a weblog, Absolutely Regular. As he explains in his first post, his aim is to discuss topics related to his research, which means ideas in math, computer science and learning theory. The fare ranges from accessible musing on subjects like what makes math "deep", through notes for "mathematically mature" audiences (like this, contrasting measure concentration and large deviations), to technical yet fascinating questions about the learnability of formal languages. (As for the rumors that he picked up the habit of assigning unsolved questions from his own research as student problems from this class, well, "You might think that; you might very well think that; but I couldn't possibly comment.") There is far too little of any of this online, never mind all of it, in one place, with a keen mind behind it. Leo and I regard each other's politics as unsound, to put it mildly, but I am very happy to have him posting from the next building, and hope he will long continue.

Linkage; Enigmas of Chance

Posted by crshalizi at December 30, 2006 22:15 | permanent link

December 25, 2006

On First Looking into Lem's Solaris

Much to my loss (and, less importantly, embarrassment), I had never read this before this week. It really is as brilliant as everyone says, one of Lem's best, and bleakest, meditations on intelligence and alienness, cosmic strangeness and human pain. Most science fiction, like most fiction of any kind, is crap. Of the rest, most is mere brain-candy (which I devour eagerly, see side-bar at left). Of the rest, most is the literature of the great transformation, of humanity's passage out of pre-industrial darkness (perhaps into a different kind of darkness). This is science fiction as a literature that goes beyond the confines of our species.

I will not attempt a proper review, but I do want to draw out just one thread — I'm sure it's an old story to those who actually study Lem. The novel seems to owe something to two classic American stories of alien contact in the Antarctic, Lovecraft's At the Mountains of Madness and John W. Campbell's "Who Goes There?", though I have no idea if that's even historically possible, and Solaris is unquestionably at a far higher intellectual level. (There are a few places where the passage from Polish to English via French has reduced technical terms to gobbledygook, though I think I can guess what Lem meant.) In fact, I can't help but wonder if Solaris wasn't, in part, Lem's response to the challenge Campbell, as editor, set to his authors: "Write me a creature that thinks as well as a man, or better than a man, but not like a man". All those writers failed. (I think Lovecraft wanted to do this, but his best efforts ran a-ground in sentiments like this: "Radiates, vegetables, monstrosities, star spawn — whatever they had been, they were men!"). Lem actually succeeded here in making his readers imagine something which is so orthogonal to any sort of terrestrial mentality that even terms like "mind" or "intelligence" seem dubious, but inescapable. That he achieves this effect through, in part, an even more extreme version of the literal anthropomorphism indulged in by Campbell, that is artistry.

There is artistry, too, in the way Lem's protagonist realizes he has had a profound encounter with the utterly alien, but what matters to him is the all-too-human hope its side-effects offer of a tormented emotional redemption. "I knew nothing, and I persisted in the faith that the time of cruel miracles was not past."

Merry Christmas.

Scientifiction

Posted by crshalizi at December 25, 2006 10:00 | permanent link

December 10, 2006

Kronstadt

From the Abandoned Places photo-group I posted about this summer:

The photographer, one Reik, helpfully notes that "It's near St. Petersburg, Russia"; once upon a time, rather too near.

Writing for Antiquity

Posted by crshalizi at December 10, 2006 14:10 | permanent link

December 01, 2006

Statistical Communication

Adolphe Quetelet:

The constancy with which the same crimes repeat themselves every year with the same frequency and provoke the same punishment in the same ratios, is one of the most curious facts we learn from the statistics of the courts; I have stressed it in several papers; I have repeated every year: There is an account paid with a terrifying regularity; that of the prisons, the galleys, and the scaffolds. This one must be reduced. And every year the numbers have confirmed my prevision in a way that I can even say: there is a tribute man pays more regularly than those owed to nature or to the Treasury; the tribute paid to crime! Sad condition of human race! We can tell beforehand how many will stain their hands with the blood of their fellow creatures, how many will be forgers, how many poisoners, almost as one can foretell the number of births and deaths.

Thomas Pynchon (to my admittedly-philistine mind, the one worthwhile passage in Vineland):

If patterns of ones and zeroes were "like" patterns of human lives and deaths, if everything about an individual could be represented in a computer record by a long string of ones and zeroes, then what kind of creature would be represented by a long string of lives and deaths? It would have to be up one level at least—an angel, a minor god, something in a UFO. It would take eight human lives and deaths just to form one character in this being's name—its complete dossier might take up a considerable piece of the history of the world.

Randall "xkcd" Munroe:

Enigmas of Chance

Posted by crshalizi at December 01, 2006 08:00 | permanent link

November 30, 2006

Books to Read While the Algae Grow in Your Fur, November 2006

Theodore Judson, Fitzpatrick's War
A futuristic re-telling of the story of Alexander the Great, making him out to be the psychopathic catastrophe-in-human-form he really was.
Lois McMaster Bujold, The Sharing Knife: Beguilement
A fun fantasy, but it's the first volume in a new series, and the protagonists are getting off entirely too easy for a Bujold book. This makes me very suspicious about what she's setting up for volume two...
Liz Williams, Snake Agent
It's sometime in the middle of the twenty-first century. Singapore has franchised itself, and Singapore Three is somewhere on the coast of southern China. Detective Inspector Chen is a member of the police force. His job is to interface with the Celestial and Infernal bureaucracies, which are exactly as described in traditional Chinese beliefs about the afterlife. And, oh yes, his wife is a demon whom he helped get out of the arranged marriage from (forgive the expression) Hell. And then things get complicated.
Laura Lippman, No Good Deeds
Latest in the series (previous installment noticed in August 2005). Absorbing as always; for once I figured out who did it long before the ending (this doesn't seem that hard here!), and that had very little to do with the interest of the novel.
Christopher Priest, The Prestige
I see, from the inside front cover, that I bought my copy in October of 1999. It therefore took me seven years and one month to get around to reading this. It is really, really good, so you should read it much sooner. I will not say anything more, lest I spoil Priest's illusions.
Julia Spencer-Fleming, Out of the Deep I Cry and To Darkness and to Death
Continues the "Clare Fergusson" series begun with In the Bleak Midwinter and A Fountain Filled with Blood. A bit reminiscent, actually, of Phil Rickman's Merrily Watkins mysteries (noticed here and here): a female Anglican/Episcopalian minister investigating murders in an isolated rural town which has lost its economic reason to exist --- and where potential new ones are themselves insidious. The major differences are (1) Spencer-Fleming writes about upstate New York, while Rickman writes about the Welsh marches; (2) Spencer-Fleming is entirely in the mundane world, with none of Rickman's occult intrusions; (3) Spencer-Fleming devotes a lot of space to her heroine's complicated relationship with her town's married chief of police, whereas Watkins's love life is mercifully straightforward.
Toby Musgrave, Courtyard Gardens
Nice as a collection of ideas, with drool-worthy accompanying photographs. Alas, many of them seem ill-suited to the climate of the Allegheny Plateau...
Lindsay Allen, The Persian Empire
Readable, extremely well-illustrated brief summary of the history of the Achaemenid Empire, combining ancient sources with archaeological findings. Good on trying to see things from a less parochial perspective than that of the Greeks (Vidal's Creation [review by Danny Yee; review by Anoop Sarkar] gets a [humorous] plug in an end-note), and on integrating what we know with how we learned it. The last chapter gives an interesting description of later Persian and Iranian ideas about the empire, Alexander, the ruins at Persepolis, etc.
Claudia Koonz, The Nazi Conscience
Horrifies, because there was one.
R. R. Bahadur, Some Limit Theorems in Statistics
Concise yet remarkably lucid summary of pioneering (< 1971) work on statistical applications of large deviations theorems, much of it Bahadur's. Reads somewhat awkwardly now, because it came before the work by Donsker, Varadhan, etc. which led to the modern generalized, streamlined theory of large deviations, so Bahadur has to spend time proving special cases of general theorems, can't just talk about "rate functions", etc. But, strangely, I can't seem to find anyone re-presenting this work using modern large deviations theory...
Warren Ellis and Stuart Immonen, Nextwave: Agents of H.A.T.E.: This Is What They Want
"Healing America by beating people up."
Elaine Cunningham, Shadows in the Darkness
More mind candy.
Laurence Gough, Heartbreaker
Nicely-done police procedural set in Vancouver. The 8th novel in a long series, but I read it with enjoyment without having read any of the others, which I will now look for.
Patricia Briggs, Steal the Dragon and When Demons Walk
Mind-candy fantasy, but tasty. Briggs gets points for the fact that the heroics in the first book did not, in fact, prevent the successful Invasion by the Eastern Hordes, which is simply a settled matter of fact in the second book.

Books to Read While the Algae Grow in Your Fur

Posted by crshalizi at November 30, 2006 23:59 | permanent link

November 29, 2006

Notes on "A Likelihood Approach to Analysis of Network Data"

Attention Conservation Notice: Over 3000 words of technical commentary on a paper on the statistical analysis of networks. Does a poor job of explaining things to those without background knowledge of networks, statistical inference and Markov chains. Includes some geeky jokes and many equations.

Yesterday in the Statistical Learning for Networks seminar, we discussed the following paper; what follows is a mix of my notes from before and after the discussion.

Carsten Wiuf, Markus Brameier, Oskar Hagberg and Michael P. H. Stumpf, "A likelihood approach to analysis of network data", Proceedings of the National Academy of Sciences (USA) 103 (2006): 7566--7570 [Freely available from PNAS as over six months old]
Abstract: Biological, sociological, and technological network data are often analyzed by using simple summary statistics, such as the observed degree distribution, and nonparametric bootstrap procedures to provide an adequate null distribution for testing hypotheses about the network. In this article we present a full-likelihood approach that allows us to estimate parameters for general models of network growth that can be expressed in terms of recursion relations. To handle larger networks we have developed an importance sampling scheme that allows us to approximate the likelihood and draw inference about the network and how it has been generated, estimate the parameters in the model, and perform parametric bootstrap analysis of network data. We illustrate the power of this approach by estimating growth parameters for the Caenorhabditis elegans protein interaction network.

It's perhaps not completely clear from the abstract that their method works for a particular class of network growth models, which they call "duplication-attachment" (DA) models. These are pure growth models, where the network only expands, and never loses nodes or edges. (The network is assumed to be undirected, without self-loops or multiple edges between a given pair of nodes.) At each time step, we add one node. This is randomly chosen, with fixed probability, to be either a duplication or an attachment. If it's an attachment, the new node attaches to an old one, chosen uniformly over the graph (possibly with some fixed probability < 1). If it's a duplication event, we pick an existing node to duplicate, and the new one gets a certain probability of copying each of its model's links (independently), and a different probability of being linked to its model. It is entirely possible that a new node is added with no links to any existing node. Notice that nodes (and edges) have no intrinsic properties in this model; their names are arbitrary. Any two isomorphic graphs should therefore be assigned the same probability.
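A rough Python sketch of one forward step of such a model, following only the verbal description above and not the authors' code. The parameter names (`p_dup`, `p_copy`, `p_link`, `p_attach`) and the adjacency-dictionary representation are assumptions made for illustration, and the starting graph is assumed to have nodes labelled 0, 1, 2, ...

```python
import random

def da_step(adj, p_dup, p_copy, p_link, p_attach, rng):
    """Add one node to an undirected simple graph {node: set_of_neighbours}.

    All four probabilities are hypothetical names for the model's parameters.
    """
    new = len(adj)                      # nodes are labelled 0, 1, 2, ... by arrival
    model = rng.choice(sorted(adj))     # old node chosen uniformly
    links = set()
    if rng.random() < p_dup:
        # Duplication: copy each of the model's links independently ...
        for nb in sorted(adj[model]):
            if rng.random() < p_copy:
                links.add(nb)
        # ... and link to the model itself with a separate probability.
        if rng.random() < p_link:
            links.add(model)
    elif rng.random() < p_attach:
        # Attachment: link to the chosen old node (or, failing that, to nobody).
        links.add(model)
    adj[new] = links
    for nb in links:
        adj[nb].add(new)
    return new

def grow(adj, n_steps, rng, **params):
    """Run the chain forward n_steps from a copy of the starting graph adj."""
    adj = {v: set(nb) for v, nb in adj.items()}
    for _ in range(n_steps):
        da_step(adj, rng=rng, **params)
    return adj
```

For instance, `grow({0: {1}, 1: {0}}, 20, random.Random(0), p_dup=0.7, p_copy=0.5, p_link=0.3, p_attach=0.9)` returns a 22-node graph; a node can indeed arrive with no links at all.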

(The motivation for such models is that gene duplication is apparently fairly common, at least over evolutionary time, which would duplicate the interactions between genes, or their proteins. Attachment, here, is supposed to summarize all the other processes of network growth. There are several models of the evolution of protein interaction networks, e.g. those of Sole et al., 2002, and Vazquez et al., 2003, which are basically of the duplication-attachment type, and yield networks which at least qualitatively match some features of real-world examples, like degree distributions. These papers are not cited here.)

From any starting graph, it is easy to run the model forward and generate new, larger random graphs; the probabilities involved are all pretty simple uniform and binomial things. In fact, the current state of the graph completely determines the distribution of future graphs, so this is a Markov chain. The transition probabilities are fixed by the duplication and attachment parameters, collectively $\theta$ , and these, together with a distribution over starting graphs, give us a fully-specified stochastic model.

Normally, statistical inference for Markov chains is fairly straightforward, because most of the classical conveniences which make inference for IID data tractable are still available. (This is, after all, what led Markov to his chains!) So why then does the paper not end on the second page, with a citation "See Billingsley (1961)"? Because normally, when we observe a Markov chain, we observe a sequence of values from the chain, and that lets us estimate the transition parameters. Here, however, we have only the end-state, the final graph, as our data. The Markov chain will let us assign likelihoods to paths, but we don't know where we started from, and we don't know where we went from there, just how we ended up here.

Suppose we did know where we started from, some graph of size $t_0$ , $G_{t_0}$ . (Remember here that each step of the chain adds one node, so t really counts nodes, not clock-time. This is why it's natural to start at $t_0$ , as opposed to 0 or 1. The paper does not seem to explain this point.) If we knew our initial state, then in principle we could figure out the probability of reaching our final state from it, as a sum over all possible paths:

\[ 
L(G_t,\theta) \equiv \mathrm{Pr}(G_t|G_{t_0};\theta) = \sum_{G_{t_0 +1}, \ldots, G_{t-1}, G_{t} \in \mathcal{G}(t_0,t)}{\prod_{s={t_0+1}}^{t}{\mathrm{Pr}(G_s|G_{s-1};\theta)}} 
 \]

where $ \mathcal{G}(t_0,t) $ is the set of all growing sequences of graphs which start at $G_{t_0}$ and end at $G_t$ . This mathematical expression is a mouthful, admittedly, but it's probably clearer in a picture.

There are only so many paths from the initial graph $G_{t_0}$ to the final, observed graph $G_t$ . The chain tells us the probability of each such path. Since we had to take one, and only one, of these paths, the total probability of making the journey is the sum of the probabilities of all the individual paths.
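The sum over paths can be made concrete with a toy chain. The sketch below is not the DA model: a two-state Markov chain stands in for the graphs, and `path_sum`, `layers` and `trans` are hypothetical names. It enumerates every intermediate sequence between a fixed start and a fixed end and adds up the products of one-step probabilities.

```python
from itertools import product

def path_sum(start, end, layers, trans):
    """Sum, over all intermediate sequences, of the product of one-step probabilities.

    layers[k] lists the states allowed at intermediate step k;
    trans(a, b) is the one-step transition probability from a to b.
    """
    total = 0.0
    for mids in product(*layers):
        path = (start, *mids, end)
        prob = 1.0
        for a, b in zip(path, path[1:]):
            prob *= trans(a, b)
        total += prob
    return total

# Toy two-state chain (an assumption for illustration only).
P = [[0.9, 0.1],
     [0.2, 0.8]]
three_step = path_sum(0, 1, [(0, 1), (0, 1)], lambda a, b: P[a][b])
```

Here `three_step` agrees with the (0, 1) entry of the third power of `P`. The number of terms is the product of the layer sizes, which is what makes brute-force enumeration hopeless for long paths.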

At this point, any physicists in the audience should be nodding their heads; what I've just said is that the likelihood, from a given starting configuration, is a sum over histories, or a path integral. Along with the authors, I'll return to how to evaluate this path integral presently, but first we need to figure out how to get that starting configuration.

If we had a known distribution over starting graphs, we could (in principle) just evaluate the likelihood conditional on each starting graph, and then take a weighted sum over graphs. This, however, is not what the authors do. (I'm really not sure where one would find such a distribution, other than another model for graph development. Bayesian practice would suggest picking something which led to easy computations, but this makes a mockery of any pretense to either modeling nature, or to representing incomplete prior knowledge.) Instead, they try to use the known dynamics of the DA model to fix on an unambiguous starting point, and do everything conditional on that.

They observe that you can take any graph, and, for each node, identify the other nodes it could have been copied from. (If A could have been copied directly from B, then A's neighbors must be a subset of B's [ignoring the link between A and B, if any].) So, from any starting graph, you can recursively remove nodes that could have arisen through simple duplications. In general, at each stage in this recursion there will be multiple nodes which could be removed, and their choice is arbitrary. Remarkably enough, no matter which choices one makes, the recursion always terminates at the same graph. (More exactly, any two end-points are isomorphic to each other, and so identical for statistical purposes.) The proof is basically a fixed point theorem about a partial order defined on graphs through deletion-of-duplicates, but they confine it to the supplementary materials, so you can take it on trust (and they don't use such lattice-theoretic language even there). This graph --- the data, minus everything that could be pure duplication --- is what they take as their starting point. This is the $G_{t_0}$ to the data's $G_t$ . Everything is then done conditional on $G_{t_0}$ .
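A small Python sketch of this reduction, using only the subset criterion stated in the parenthesis above: A is treated as removable if some other node B has a neighbor set containing A's, ignoring any A-B link. The paper's exact definition of a removable node may differ, and the function names are hypothetical.

```python
def removable_nodes(adj):
    """Nodes a whose neighbours (ignoring a possible link to b) are a subset of some b's."""
    out = []
    for a, na in adj.items():
        for b, nb in adj.items():
            if a != b and (na - {b}) <= nb:
                out.append(a)
                break
    return out

def reduce_to_core(adj):
    """Repeatedly delete a removable node until none is left; returns a new graph."""
    adj = {v: set(nb) for v, nb in adj.items()}
    while True:
        candidates = removable_nodes(adj)
        if not candidates:
            return adj
        a = candidates[0]               # the choice among candidates is arbitrary
        for nb in adj.pop(a):
            adj[nb].discard(a)

cycle5 = {i: {(i - 1) % 5, (i + 1) % 5} for i in range(5)}
cycle4 = {i: {(i - 1) % 4, (i + 1) % 4} for i in range(4)}
```

Under this criterion the 5-cycle has no removable node and so is its own core, while the 4-cycle unwinds all the way down to a single node.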

OK, we have our initial condition and our final condition, and we have our Markov chain, so all we've got to do is evaluate the integral over paths linking the two.

Problem: There are too many paths. In the worst case, the number of paths is going to grow factorially with the number of nodes in the observed graph. Even though along each path we've just got to do some straight-forward multiplication, simply enumerating all the paths and summing over them will take us forever. (The authors discuss some algorithmic tricks for speeding up the exact calculation, but still get something super-exponential!) Thus, evaluating the path integral for the likelihood is intractable, even for a single parameter value.

Solution: Don't look at all the paths. Rather, sample some paths, say N of them, evaluate the likelihood along each, and average. Hope that this converges quickly (in N) to the exact value of the integral. This is, after all, how physicists approach many path integrals.

Problem: Even if N is fairly small, we need to examine many settings of the parameter $\theta$ . It could still kill us to have to sample N distinct paths for each parameter value.

Solution: Use importance sampling. Draw a single path, valid for all parameter values, and evaluate the likelihood in terms of the value of an "importance weight" along this path. The weight has to be a function of $\theta$ , but it should be the only thing which is. We do this here by writing the likelihood, $ L(G_t,\theta) $ , as an expectation with respect to a reference measure, which the authors write $\theta_0$ . This reference measure is given by another Markov chain, called the "driving chain"; despite its name, it is not a member of the DA family of chains. The trick here is that one sample of possible paths, generated according to this chain, can be used to (approximately) evaluate the likelihood at all parameter settings of the DA model.
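To see the reuse-one-sample trick in isolation, here is a toy Python sketch with coin flips rather than graphs. Paths are drawn once from a reference coin with bias `theta0` and then reweighted to estimate an expectation under any other bias `theta`. Everything here (the function names, the choice of expectation) is an illustrative assumption and not the paper's driving chain.

```python
import random

def sample_paths(n_paths, n_steps, theta0, rng):
    """Draw paths of 0/1 flips once, from the reference measure theta0."""
    return [[1 if rng.random() < theta0 else 0 for _ in range(n_steps)]
            for _ in range(n_paths)]

def weight(path, theta, theta0):
    """Importance weight: probability of the path under theta over that under theta0."""
    w = 1.0
    for x in path:
        w *= (theta / theta0) if x else ((1 - theta) / (1 - theta0))
    return w

def estimate_mean_heads(paths, theta, theta0):
    """Estimate E_theta[number of heads] from paths sampled under theta0."""
    return sum(weight(p, theta, theta0) * sum(p) for p in paths) / len(paths)

paths = sample_paths(100000, 5, 0.5, random.Random(1))
estimates = {theta: estimate_mean_heads(paths, theta, 0.5) for theta in (0.3, 0.7)}
```

The same `paths` serve every value of `theta`; only the weights are recomputed. The estimates should land near the exact values, 5 times `theta`, and they degrade as `theta` moves far from `theta0`, because the weights become more variable.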

The crucial equation is [3] on p. 7568

\[ 
L(\theta,G_t) = \mathbf{E}_{\theta_0}\left[ \prod_{s=t_0}^{t}S(\theta_0,\theta,G_s,\nu)\right] 
 \]

where
\[ 
S(\theta_0,\theta,G_t,\nu) = \frac{1}{t} \omega(\theta,G_t,\nu) \frac{\omega(\theta_0,G_t)}{\omega(\theta_0,G_t,\nu)} 
 \]

(N.B., the paper writes the second factor as $ \omega(\nu,G_t,\nu) $ , but this is wrong.) Let's unpack this a bit.

$ \omega(\theta,G_t,\nu) $ is the probability of producing the graph $G_t$ through the addition of the node $\nu$ , with parameter setting $\theta$ . (N.B., $\nu$ must be a "removable" node, one which could have been added by duplication.) $ \omega(\theta,G_t) $ is this transition probability, summed over all possible $\nu$ . The first two factors in S are what we want, the probability we'd get moving forward along the path according to the parameter $\theta$ . The third term is the reciprocal of the transition probabilities according to the driving chain. Its only job is to cancel those probabilities out.

The algorithm for generating the ith sample path is then as follows. Start with the observed graph $G_t$ . Count backwards, $ s = t, t-1, \ldots t_0+1 $ . At each step, pick a node $\nu^{(i)}_s$ to delete, with probability proportional to $ \omega(\theta_0, G^{(i)}_s, \nu^{(i)}_s) $ . (Once again, this limits us to the "removable" nodes, the ones which could have been produced by duplication.) Set $ G^{(i)}_{s-1} $ to be the result of deleting that node. Keep going back until we hit the irreducible core, $G_{t_0}$ . (We will always hit this core, by the fixed point theorem proved in the supplementary results.) Then


\begin{eqnarray*} 
l^{(i)}(\theta) & =  & \prod_{s=t_0}^{t}{S(\theta_0,\theta,G^{(i)}_{s},\nu^{(i)}_s)}\\ 
\hat{L}(\theta,G_t) & = & \frac{1}{N}\sum_{i=1}^{N}{l^{(i)}(\theta)} 
\end{eqnarray*}

To ease the calculation of this for multiple parameter settings, it may be worth noting (though the authors do not) that
\[ 
l^{(i)}(\theta) = \left(\frac{(t_0-1)!}{t!}\right)\left(\prod_{s=t_0}^{t}{\omega(\theta,G^{(i)}_s,\nu^{(i)}_s)} \right)\left(\prod_{s=t_0}^{t}{\frac{\omega(\theta_0,G^{(i)}_s)}{\omega(\theta_0,G^{(i)}_s,\nu^{(i)}_s)}}\right) 
 \]

and the middle factor is the only one which depends on $\theta$ .
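A sketch of how this factorization might be exploited in code, working on the log scale to avoid underflow. The transition weight `omega` is passed in as a callable because its actual form is not reproduced in these notes; `path_constant` and `log_path_weight` are hypothetical names, and the indexing follows the product from $t_0$ to $t$ as written above.

```python
import math

def path_constant(t0, t, driving_ratios):
    """Log of the theta-free part of l(theta) for one sampled path.

    driving_ratios[k] stands for omega(theta0, G_s) / omega(theta0, G_s, nu_s)
    at the k-th step; (t0-1)!/t! is computed via log-gamma.
    """
    log_factorials = math.lgamma(t0) - math.lgamma(t + 1)
    return log_factorials + sum(math.log(r) for r in driving_ratios)

def log_path_weight(const, steps, omega, theta):
    """log l(theta): the stored constant plus the only theta-dependent factor.

    steps is a list of (graph, node) pairs along the path; omega(theta, graph, node)
    is a user-supplied stand-in for the model's transition weight.
    """
    return const + sum(math.log(omega(theta, g, nu)) for g, nu in steps)
```

The constant is computed once per sampled path; scanning a grid of parameter values then costs only one pass over `omega` per value.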

So, to summarize: We can generate a sample of paths connecting the observed final graph to the unobserved initial graph, according to the driving chain, and then approximate the likelihood for any parameter value by multiplying the importance weights along those paths and averaging over paths. (The importance weights themselves even factor nicely.) We have thus solved the problem of evaluating a path integral, when we've got only one end of too many possible paths.

The trick used here to pull this off depended on having a uniquely-defined starting point for all parameter settings, namely the $G_{t_0}$ defined through undoing duplications. (According to the authors, they took this from papers on the coalescent process in population genetics, but I have not been able to track down their references.) Strictly speaking, everything is conditional on that starting point. Of this, more below.

Left unaddressed by the above is the question of how many paths we need to sample. Remember, the whole point is to not have to look at every possible path! If it turns out that accurate approximations to the likelihood require us to sample some substantial fraction, then this is all for nothing. However, the authors' figures reveal something quite remarkable. Whether N is 10 or 1000, the approximate likelihood changes very little (at least on a log scale), even with real data. This suggests that we don't, actually, need a lot of paths, but why?

For each i, $ l^{(i)}(\theta) $ is an independent realization of a random variable, whose distribution depends only on $\theta$ (holding fixed the driving chain, and the initial and final graphs). Since they are IID, we can apply the central limit theorem, which tells us that their mean should converge at rate $ 1/\sqrt{N} $ . Since that's noticeably smaller for N=1000 than for N=10, it must be the case that the variance of the likelihood along the individual sample paths is already pretty small. Why?
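A tiny Python sketch of the bookkeeping behind that argument: the Monte Carlo mean with its estimated standard error, and the factor by which the standard error shrinks between two sample sizes at fixed variance. The function names are hypothetical.

```python
import math
import statistics

def mc_mean_and_se(values):
    """Sample mean and its estimated standard error, sd / sqrt(N)."""
    n = len(values)
    return statistics.fmean(values), statistics.stdev(values) / math.sqrt(n)

def se_shrink_factor(n_small, n_large):
    """How much smaller the standard error is at n_large than at n_small."""
    return math.sqrt(n_large / n_small)
```

Going from N=10 to N=1000 shrinks the standard error tenfold, so if the estimate barely moves across that range, the per-path spread must have been small to begin with.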

The lame physics answer is, "the principle of least action". There will be optimal, most-probable paths, and they will dominate the sum, the others tending to make negligible contributions. With high probability, a random sample will pick out the most probable paths. Q.E.D. This argument could, perhaps, be made less lame through an application of large deviations results, specifically conditional-limit-theorem- (or "Gibbs's conditioning principle"-) type results for Markov chains, which roughly say that if something improbable (a passage from $G_{t_0}$ to $G_t$ ) happens, it does so in the least-improbable possible way, and deviations from that trajectory are exponentially suppressed. <televangelist>In the name of Cramér, in the name of Varadhan, in the name of Freidlin and Wentzell, I call on you, Brother Argument, to be healed! Arise and prove! And, cf. Eyink (2000).</televangelist>

An information-theoretic answer is to invoke the asymptotic equipartition principle, a.k.a. the Shannon-McMillan-Breiman theorem. This says that if $ X_1, X_2, \ldots X_t $ are generated according to a (well-behaved) stochastic process, whose distribution is $\mu$ , and $\theta$ is a sufficiently well-behaved model, then

\[ 
-\frac{1}{t}\log{L(\theta,X_1^t)} \rightarrow h(\mu) + d(\mu,\theta), ~\mu\text{-a.s.} 
 \]

where $ h(\mu) $ is the entropy rate of the data-generating process $\mu$ , and $ d(\mu,\theta) $ is the relative entropy rate between the data source and the model, i.e., the asymptotic growth rate of the Kullback-Leibler divergence. (For details, see Algoet and Cover, 1988, or Gray, 1990). So
\[ 
\log{L(\theta)} = - (h+d(\theta))t + o(t), ~\mu\text{-a.s.} 
 \]

In words, there are only two kinds of long sample paths: those which aren't generated at all, and those where the normalized log-likelihoods are equal, at least to first order. It's not clear to me here whether t is big enough in the examples for this effect to kick in.
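The simplest case of this convergence can be checked numerically. In the Python sketch below, the source is an IID coin with bias `p` and the model is an IID coin with bias `q`, so the entropy rate and relative entropy rate reduce to the usual single-letter formulas. This is an illustration of the theorem in the IID case only, with hypothetical function names, and says nothing about whether t is large enough in the network examples.

```python
import math
import random

def normalized_neg_log_lik(xs, q):
    """-(1/t) log L(q; x_1..x_t) for an IID Bernoulli(q) model."""
    return -sum(math.log(q if x else 1 - q) for x in xs) / len(xs)

def entropy_rate(p):
    """Entropy (in nats) of a Bernoulli(p) source."""
    return -p * math.log(p) - (1 - p) * math.log(1 - p)

def divergence_rate(p, q):
    """Kullback-Leibler divergence (in nats) of Bernoulli(q) from Bernoulli(p)."""
    return p * math.log(p / q) + (1 - p) * math.log((1 - p) / (1 - q))

rng = random.Random(0)
p, q, t = 0.3, 0.4, 20000
xs = [rng.random() < p for _ in range(t)]
lhs = normalized_neg_log_lik(xs, q)
rhs = entropy_rate(p) + divergence_rate(p, q)
```

With these settings `lhs` and `rhs` should agree to within a percent or two, and the gap shrinks as `t` grows.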

The biggest unclarity of all, probably, is the role of $G_{t_0}$ . Recall that we reached this by removing nodes which could have been added by pure duplication. There is, however, no particular reason to think that the actual growth of the graph ever passed through this state. It has the advantage of giving us a unique starting point for the chain, but there are, potentially, others. One, of course, is the trivial network consisting of a single node! Another possibility (which came up in the discussion, I think mostly due to Anna Goldenberg) is to first remove potential duplicates, as the authors do, and then remove nodes which have only one link to them, as clear attachments. This process of unwinding the attachments could potentially be iterated, until no "danglers" are left. This, too, is a uniquely-defined point. We can then go back to removing nodes which are, now, potential duplicates, and so on. Someone (I forget who) suggested that this might always terminate at the one-node network; it would be nice to either show this or give a counter-example. But if there is some principled reason, other than tractability, to use their $G_{t_0}$ , I can't figure out what it is from this paper.

Only using a growing network, and in particular only focusing on growth through duplication, certainly simplifies the computational problem, by reducing the number of possible paths which could terminate in the observed graph. Deletion of nodes and edges is however going to be very important in more biologically-plausible models, to say nothing of models of social or technological networks. Presumably the trick of using a backward-looking chain which stops when it hits a unique starting configuration could still be used with deletions --- I think the authors are hinting as much in their conclusion --- but it's not clear to me that a unique starting point is appropriate. With biological interaction networks, for example, one might argue that, e.g., metazoans have been around for a long time, so the distribution of networks ought to be close to stationary, and so starting configurations should be drawn from an invariant distribution of the appropriate chain...

This raises two further points, which are not un-related: the asymptotics of the DA model, and the biological utility of such a model. Run for a long time, the DA model will produce graphs of unbounded size, but it's not immediately obvious what these graphs will look like. In particular, what will be their degree distribution? The Barabási-Albert model (Albert and Barabási, 2002) produces scale-free distributions, <boosterism>because it uses the same mechanism as Herbert Simon's classic paper</boosterism> (Bornholdt and Ebel, 2001). This relies on a rich-get-richer dynamic, where nodes with high degree are more likely to attract new edges. My initial thought was that this wasn't present in the DA model, because targets for attachment and targets for duplication are both chosen uniformly. However, someone in the discussion --- I think, though I may be mis-remembering, that it was Tanzy Love --- pointed out that while high-degree nodes are no more likely to be copied than low-degree nodes, edges into high-degree nodes are more likely to be copied than edges into low-degree nodes. This is because if a node has degree k, there are k other nodes whose duplicates could end up linking to it. It may even be the case that this falls under the theorems in Simon... Presumably the asymptotics would only become harder to handle if we added events deleting nodes or edges.
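
The degree argument can be checked by brute force. The sketch below assumes a particular form of the DA step, which is a guess at the model rather than a transcription of the paper: with probability `p_duplicate` a uniformly chosen node is copied along with all of its edges, and otherwise a new node is attached by a single edge to a uniformly chosen node. Under that assumption, `gain_probabilities` enumerates every possible duplication and tallies how often each existing node picks up a new edge.

```python
import random
from fractions import Fraction

def gain_probabilities(g):
    """Chance that each node gains an edge in one uniform pure-duplication step."""
    n = len(g)
    gain = {v: Fraction(0) for v in g}
    for c in g:              # c is the node chosen (uniformly) to be copied
        for v in g[c]:       # the copy links to every neighbour of c
            gain[v] += Fraction(1, n)
    return gain

def grow(steps, p_duplicate, rng):
    """Grow a graph from a single edge under the assumed DA step."""
    g = {0: {1}, 1: {0}}
    for _ in range(steps):
        new = len(g)
        target = rng.randrange(new)
        if rng.random() < p_duplicate:
            nbrs = set(g[target])    # duplication: copy all of target's edges
        else:
            nbrs = {target}          # attachment: one edge to target
        g[new] = nbrs
        for v in nbrs:
            g[v].add(new)
    return g

star = {0: {1, 2, 3}, 1: {0}, 2: {0}, 3: {0}}
print(gain_probabilities(star))

g = grow(500, 0.5, random.Random(1))
degrees = sorted((len(nbrs) for nbrs in g.values()), reverse=True)
print(degrees[:10])
```

In the star, the hub gains an edge with probability 3/4 and each leaf with probability 1/4, i.e., degree over number of nodes, which is the rich-get-richer effect in its simplest form. The growth routine is there only to make it easy to look at degree sequences; it says nothing by itself about the asymptotic distribution.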

As for the biological utility, I'll repeat that none of the nodes have any identity of their own; only their role in the network of relations represented by the edges has any bearing on the model. "If this be structuralism, make the most of it": by turning it into a neutral model for the evolution of biological networks. After all, there is no reason here to duplicate certain nodes or edges, it's all just uniform chance. One key use of neutral models is to provide a background against which to detect adaptation; how could we do that here?

References:
Albert, Réka and Albert-László Barabási (2002), "Statistical Mechanics of Complex Networks", Reviews of Modern Physics 74: 47--97 = cond-mat/0106096
Algoet, Paul H., and Thomas M. Cover (1988), "A Sandwich Proof of the Shannon-McMillan-Breiman Theorem", The Annals of Probability 16: 899--909
Billingsley, Patrick (1961). Statistical Inference for Markov Processes. Statistical Research Monographs, vol. 2. Chicago: University of Chicago Press.
Bornholdt, Stefan and Holger Ebel (2001), "World-Wide Web scaling exponent from Simon's 1955 model", Physical Review E 64: 035104 = cond-mat/0008465
Eyink, Gregory L. (2000), "A Variational Formulation of Optimal Nonlinear Estimation", Methodology and Computing in Applied Probability (submitted) = physics/0011049
Gray, Robert M. (1990), Entropy and Information Theory. Berlin: Springer-Verlag. Full text free online.
Simon, Herbert A. (1955), "On a Class of Skew Distribution Functions", Biometrika 42: 425--440
Solé, Ricard V., Romualdo Pastor-Satorras, Eric Smith and Thomas B. Kepler (2002), "A model of large-scale proteome evolution", Advances in Complex Systems 5: 43--54 = cond-mat/0207311
Vázquez, A., A. Flammini, A. Maritan and A. Vespignani (2003), "Modeling of protein interaction networks", Complexus 1: 38--44 = cond-mat/0108043

Networks; Enigmas of Chance

Posted by crshalizi at November 29, 2006 02:30 | permanent link

November 27, 2006

Again with the Statistics 754, Stochastic Processes

Speaking, as I was, of this course, I've just spruced up the course homepage in anticipation of the coming semester. Comments, particularly on last year's lecture notes, are most welcome. (Reader M.H. has already won a free copy of the book, if it ever materializes, for his very helpful suggestions.)

Also, I have made a promise to try to post at least once a week, so that certain people know I'm alive (or at least that one of my scripts is still running).

Enigmas of Chance; Corrupting the Young

Posted by crshalizi at November 27, 2006 22:57 | permanent link

October 31, 2006

Books to Read While the Algae Grow in Your Fur, October 2006

Ray Bradbury, Something Wicked This Way Comes
Why is it that every time I re-read this, the boys seem a bit more remote, and the old man a bit closer?
Stephen King, Cycle of the Werewolf
Karen Rose Cercone, Steel Ashes
First in the series of which I reviewed one book (Blood Tracks) at length. There was a third, Coal Bones, but apparently the series never went any further. This is a shame: they're good, fun, books, and after a bit more than a year of living in Pittsburgh I enjoy them even more. (For instance, Amberson Avenue, where one of the villains in Steel Ashes lives, is two blocks from my house.)
Karin Slaughter, Triptych
Not part of her regular series. To say anything beyond what you can read on the cover blurb would spoil an intricately-constructed series of surprises, so I won't. Like all Slaughter's books, good but disturbing.
Stephen Biddle, Military Power: Explaining Victory and Defeat in Modern Battle
Or: the "revolution in military affairs" happened in 1918. More exactly, even early 20th century weaponry, used at capacity, sufficed to create an utterly lethal "storm of steel". What Biddle calls the "modern system" consists of the tactics and operational skills which let armies nonetheless survive and maneuver on the battlefield --- extensive exploitation of all available cover while moving, detached units, combined arms, defense in depth, etc., etc. Biddle's argument is that success in mastering the modern system goes a lot further towards explaining who wins battles than does superiority in resources and materiel, or even superior technology per se. He makes his case through nicely-selected case studies, statistical studies on what systematic data exists, and some not-crazy simulations. (The statistical studies are definitely good by the standards of applied social science, but they use regression, rather than some method more appropriate to causal inference.) Biddle sounds extremely plausible to me, i.e., someone who admittedly knows nothing about military science. One of the big limitations, though, is that he's explicitly confining himself to land warfare among regular armies --- nothing about urban warfare, guerrillas, etc...
Terry Pratchett, The Amazing Maurice and His Educated Rodents; The Wee Free Men; A Hat Full of Sky
Strongly recommended for weekends when you are feeling glum and mildly ill. Many thanks to "Uncle Jan" for copies!

Books to Read While the Algae Grow in Your Fur

Posted by crshalizi at October 31, 2006 23:59 | permanent link

October 16, 2006

I Taught Him Everything He Knows

"Concentration of measure" is a phenomenon in probability theory where, roughly speaking, any set which contains a substantial fraction of the probability can be expanded just a little to yield a set containing most of the probability. Another way to say this is that, given any reasonably continuous function, the probability that it deviates from its mean is exponentially small, and the exponential rate does not depend on the precise function. This makes concentration of measure results extremely useful for questions involving the estimation of complicated and ugly functions. The classical work in this area proves concentration-of-measure for various kinds of sequences of independent variables, but for real applications in statistics, machine learning or physics you'd want to be able to handle dependence. The natural way to do this would be to look at mixing processes, which are at least asymptotically independent.

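For the classical independent case, the phenomenon is easy to see numerically. The sketch below is an illustration under stated assumptions, not anything from the papers discussed here: the function is the sample mean of n independent Uniform(0,1) variables, and the bound is Hoeffding's inequality, $P(|\bar{X}_n - 1/2| \geq \epsilon) \leq 2e^{-2n\epsilon^2}$. It says nothing about mixing sequences, which are the hard part.

```python
import math
import random

def hoeffding_bound(n, eps):
    """Hoeffding's bound on P(|mean - 1/2| >= eps) for n independent [0,1] variables."""
    return 2.0 * math.exp(-2.0 * n * eps * eps)

def deviation_frequency(n, eps, trials, rng):
    """Fraction of simulated sample means that land at least eps away from 1/2."""
    hits = 0
    for _ in range(trials):
        mean = sum(rng.random() for _ in range(n)) / n
        if abs(mean - 0.5) >= eps:
            hits += 1
    return hits / trials

rng = random.Random(0)
for n in (50, 200):
    print(n, deviation_frequency(n, 0.1, 2000, rng), hoeffding_bound(n, 0.1))
```

The observed frequency of large deviations should sit below the bound, and the bound itself shrinks exponentially in n at a rate that depends only on the range of the variables, not on any finer detail of their distribution.
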
Leo Kontorovich, who was one of the students in my stochastic processes class this past spring, now has a paper summarizing his work on, precisely, concentration of measure for mixing sequences:

Leonid Kontorovich, "Metric and Mixing Sufficient Conditions for Concentration of Measure", math.PR/0610427
Abstract: We derive sufficient conditions for a family $(X^n,\rho_n,P_n)$ of metric probability spaces to have the measure concentration property. Specifically, if the sequence $\{P_n\}$ of probability measures satisfies a strong mixing condition (which we call $\eta$-mixing) and the sequence of metrics $\{\rho_n\}$ is what we call $\Psi$-dominated, we show that $(X^n,\rho_n,P_n)$ is a normal Levy family. We establish these properties for some metric probability spaces, including the possibly novel $X=[0,1]$, $\rho_n=\ell_1$ case.
This paper serves as a good entry-point to Leo's earlier papers on measure concentration for hidden Markov processes (math.PR/0608064) and Markov tree processes (math.PR/0608511), as well as his paper with Kavita Ramanan on the general martingale-difference method of which these other results are special cases (math.PR/0609835).

Since I'll be teaching stochastic processes again in the spring, I would very much like to claim that Leo wrote these papers as a direct result of having taken my class. But the truth is that Leo knew so much about this already that, so far from teaching him everything he knows, I learned almost all I know about concentration from him. But this is one of the real pleasures of teaching...

Enigmas of Chance; Corrupting the Young; Incestuous Amplification

Posted by crshalizi at October 16, 2006 14:00 | permanent link

October 15, 2006

Ninth European Conference on Artificial Life (ECAL2007) — Call for Papers

The Ninth European Conference on Artificial Life will be held September 10--14 in Lisbon. The "European" in ECAL just refers to the location. No matter where you are in the world, if you work on any of the following subjects (among others),

Origin and synthesis of Life: artificial chemistries, autocatalytic systems, pre-biotic evolution, self-replication, self-organization, molecular self-assembly; Information and Measures of Complexity; Networks and Complex Systems; Evolutionary Robotics and Autonomous Agents: exploitation and evolution of morphologies, 3D rapid prototyping printers, non-holonomic robot control, bio-morphic engineering, self-assembly, evolvable hardware, collective robotics; Models of Brain-Body-Environment interaction: the question of boundaries, the dynamical systems approach, agency, homeostasis and autopoiesis, sensorimotor coordination, action-perception loops; Morphogenesis and Development; Learning, Adaptive Behaviour and Evolution; Social Behavior: language, social interaction, communication, swarm intelligence, ant systems; A-Life Art; Epistemological Issues, Tools and Methodologies; Philosophical Issues; Ethical and Social Issues

then, as a member of the program committee, I strongly urge you to submit a paper before the deadline of 9 April 2007 (which gives you plenty of time!). The official call for papers has details on format, procedure, publication of the proceedings, etc. See you in Lisbon!

Complexity; Biology

Posted by crshalizi at October 15, 2006 19:45 | permanent link

October 12, 2006

Data Mining (36-350) Lecture Notes, Weeks 4--7

These handouts are shamelessly ripped off derivative work, amplifying and expanding those created by Tom Minka when he invented this course. (See his originals here.) Posted here in response to a number (> 1) of requests. See here for the first three weeks' handouts.

Note to students in 36-350: This page will not keep up to date with the handouts, or with other course documents; use Blackboard!

  1. September 20 and 25 (Lecture 6): Partitioning Data into Clusters. Supervised and unsupervised learning. Social and organizational aspects of categorization. Finding categories in data via clustering. Characteristics of good clusters. The k-means algorithm for clustering. Search algorithms, search landscapes, hill climbing, local minima. Algorithms for hierarchical clustering. Avoiding spherical clusters. See also: slides to accompany the second half, showing clustering of images.
  2. September 27 (Lecture 7): Making Better Features. Transforming features to enhance invariance. Transforming features to improve their distribution. Projecting high-dimensional data into lower dimensions. Principal component analysis: informal description and example.
  3. October 2 (Lecture 8): More on Principal Component Analysis. Mathematical basis: maximizing the variance of the projected points. Mathematical basis: minimizing reconstruction error. Interpretation of PCA results.
  4. October 4: Review of course to date. (No handout.)
  5. October 9 (Lecture 9): Evaluating Predictive Models. Classification and linear regression as examples of predictive modeling. Error measures a.k.a. loss functions; examples. In-sample error. Out-of-sample or generalization error; why it matters, relation to in-sample error. Model selection. An example of over-fitting. Approaches to limiting over-fitting and its ill effects.
  6. October 11 (Lecture 10): Regression Trees. Difficulties of fitting global models in complex systems. Recursive partitioning and simple local models as a solution. Prediction trees in general. Regression trees in particular. An example. Tree growing. Tree pruning via cross-validation.

Corrupting the Young; Enigmas of Chance

Posted by crshalizi at October 12, 2006 11:40 | permanent link

October 08, 2006

"The Invisible Academy: Non-Linear Effects of Linear Learning"

I am sure that I speak for all of us in the Statistics Department at CMU (especially my erstwhile fellow bloggers) when I say we're very pleased and excited to have Prof. Mark Liberman as our seminar speaker on 16 October, a week from tomorrow. Mark is coming to us not as the impresario of LanguageLog (though, in that role, his scourging of sloppy data analysis, whether socio-political or neurosexual, is a joy to behold), nor as the director of the Linguistic Data Consortium, nor even to speak about bibliomics. Rather, he'll be talking about some work which combines the stochastic linear learning models (of the sort pioneered by, among others, Frederick Mosteller) with agent-based modeling of cultural evolution. Which is to say, he's talking about aggregate behaviors of interacting stochastic processes which are more interesting than just the central limit theorem.

The invisible academy: non-linear effects of linear learning
Abstract: When linguists, psychologists or engineers try to understand, explain or imitate human speech and language, they usually do so by modeling individual speakers, hearers or learners. Nevertheless, language is an emergent property of groups (of humans), and elementary arguments suggest that non-trivial characteristics of speech and language emerge from interactions within groups of individuals over time. This talk will argue that some (old and pathetically) simple ideas about learning and perception have obvious but non-trivial consequences for the emergence of cultural norms, including linguistic ones.

Monday, October 16, at 4:30 pm in Baker Hall A51; free and open to the public.

The Collective Use and Evolution of Concepts; Enigmas of Chance

Posted by crshalizi at October 08, 2006 13:45 | permanent link

October 02, 2006

It's Not Who You Know, It's What You Do

Blog near-silence will continue until bootstrap testing results improve. In the meanwhile, it occurs to me that I haven't mentioned that I have a new paper.

CRS, Marcelo F. Camperi and Kristina Lisa Klinkner, "Discovering Functional Communities in Dynamical Networks", q-bio.NC/0609008, forthcoming in the proceedings of the ICML 2006 workshop on "Statistical Network Analysis: Models, Issues and New Directions" (Springer Lecture Notes in Computer Science)
Abstract: Many networks are important because they are substrates for dynamical systems, and their pattern of functional connectivity can itself be dynamic — they can functionally reorganize, even if their underlying anatomical structure remains fixed. However, the recent rapid progress in discovering the community structure of networks has overwhelmingly focused on that constant anatomical connectivity. In this paper, we lay out the problem of discovering functional communities, and describe an approach to doing so. This method combines recent work on measuring information sharing across stochastic networks with an existing and successful community-discovery algorithm for weighted networks. We illustrate it with an application to a large biophysical model of the transition from beta to gamma rhythms in the hippocampus.

Thanks (over and above the acknowledgments in the paper) are due to Jörg Reichardt for help implementing his algorithm, and to Anna Goldenberg for very thorough (not to mention patient!) editing, resulting in a much better paper.

There are about a zillion possible extensions and applications, which is a good note on which to get back to work.

Networks; Self-Centered; Minds, Brains, and Neurons

Posted by crshalizi at October 02, 2006 17:50 | permanent link

September 30, 2006

Books to Read While the Algae Grow in Your Fur, September 2006

E. R. Chamberlin, The Bad Popes
Not, sadly, all of the bad popes, but rather a selection of seven of (to Chamberlin's mind) the worst.
Max Beerbohm, Seven Men
Many thanks to John Burke for recommending this wonderfully funny, yet surprisingly creepy, little book. (The story of one of the six men, "'Savonarola' Brown", is online.)
Sheri Berman, The Primacy of Politics: Social Democracy and the Making of Europe's Twentieth Century
A highly interesting book which nonetheless does not deliver what it promises. I will defer detailed comments to the forthcoming Crooked Timber symposium.
Steven Runciman, Byzantine Civilization
Short, introductory history of an era of which I am largely ignorant; so, I'm not actually sure I have any grounds to recommend it, except that I feel less ignorant now, and it's well-written. Definitely shows its age, however; nobody, surely, would now worry over "Hellenistic" versus "Aramaean" influences in art?
Anthony Shadid, Night Draws Near: Iraq's People in the Shadow of America's War
For America, the war is merely a politico-military disaster and thousands of casualties; for the Iraqis, it is infinitely worse. An intelligent presentation of how the war looked to a wide spread of Arab Iraqis, by a very good Arabic-speaking American journalist.

Books to Read While the Algae Grow in Your Fur

Posted by crshalizi at September 30, 2006 23:59 | permanent link

September 21, 2006

We Can Have a Better Academic Publishing System

Paul "arXiv" Ginsparg, evangelizing in the pages of the latest issue of The Journal of Neuroscience, under the rubric "As We May Read":

The e-print arXiv (http://arXiv.org/), initiated in August 1991, has effectively transformed the research communication infrastructure of multiple fields of physics and could play a prominent role in a unified set of global resources for physics, mathematics, and computer science. It has grown to contain >375,000 articles (as of July 2006), with >50,000 new submissions expected in calendar year 2006 and >40,000,000 full-text downloads per year. It is an international project, with dedicated mirror sites in 17 countries and collaborations with United States and foreign professional societies and other international organizations, and it has also provided a crucial lifeline for isolated researchers in developing countries...

The arXiv is entirely scientist driven: articles are deposited by researchers when they choose (either before, simultaneous with, or after peer review), and the articles are immediately available to researchers throughout the world. As a pure dissemination system, it operates at a factor of 100-1000 times lower in cost than a conventionally peer-reviewed system.... This is the real lesson of the move to electronic formats and distribution: not that everything should somehow be free, but that with many of the production tasks automatable or off-loadable to the authors, the editorial costs will then dominate the costs of an unreviewed distribution system by many orders of magnitude. ...

The site has never been a random Usenet newsgroup- or blogspace-like free-for-all. From the outset, arXiv.org relied on a variety of heuristic screening mechanisms ... to ensure insofar as possible that submissions are at least "of refereeable quality." This means that they satisfy the minimal criterion, that they would not be peremptorily rejected by any competent journal editor as nutty, offensive, or otherwise manifestly inappropriate, and they would instead at least in principle be suitable for review. These mechanisms are an important, if not essential, component of why readers find the arXiv site so useful. ...

The arXiv repository functions are flexible enough either to coexist with the preexisting publication system or to help it evolve into something better optimized for researcher needs. Although there are no comprehensive editorial operations administered by the site, the vast majority of the 50,000 new articles per year are nonetheless subject to some form of review, whether by journals, conference organizers, or thesis committees. Physics and astronomy journals have learned to take active advantage of the availability of the materials before journal publication ...

On the one-decade time scale, it is likely that more research communities will join some form of global unified archive system without the current partitioning and access restrictions familiar from the paper medium, for the simple reason that it is the best way to communicate knowledge and hence to create new knowledge. Ironically, it is also possible that the technology of the 21st century will allow the traditional players from a century ago, namely the professional societies and institutional libraries, to return to their dominant role in support of the research enterprise.

Ginsparg's title is a riff on Vannevar Bush's "As We May Think"; hopefully it will not take 48 years for these suggestions to be widely implemented.

Learned Folly

Posted by crshalizi at September 21, 2006 08:42 | permanent link

September 20, 2006

Seeking Advice on Introducing Ideological Bias

Well, it looks like I have some VIGRE-funded undergraduate research assistants for the semester; specifically, I have takers both for my idea about aggregated network data, and neutral cultural diffusion on assortative networks. The latter, however, leaves me with a puzzle.

The project makes sense (to the extent that it does) against a background assumption that culture is not neutral, but an adaptation to the physical or social environment. This is a pretty pervasive assumption on the part of social scientists, historians, ethnographers, cultural critics, etc., but I'm not sure that my students will have been exposed to this idea as such. So, I need background reading which will rapidly persuade a wholesome, technically-inclined Carnegie Mellon undergrad that a person's beliefs and values ought to be correlated with their status in society. "Rapidly" is important, because there is some technical research to get to, and recapitulating the whole history of the sociology of belief is not an option. Similarly, assigning The German Ideology and/or The Protestant Ethic and the Spirit of Capitalism seems like asking for trouble. (And, with Weber, I'd feel like I'd have to spend a lot of time unteaching the errors.) So, I invoke the collective wisdom of the Web for reading suggestions: please write me at cshalizi [at] cmu [dot] edu [dot] oryx, deleting the name of a genus of antelope.

Corrupting the Young

Posted by crshalizi at September 20, 2006 16:20 | permanent link

Arrrrgh! Owwwwwww! Noooooooooo! (A Remark on Power Laws)

Via Language Log, there comes this fun post on the distribution of the number of repetitions of "R" in strings of the form "AR+", as in, "Arrr, mateys!" These findings, like Mark Liberman's on "AW+", are in line with the results of the seminal paper in this area, Dennis Chao and Patrik D'haeseleer's "The Distribution of Variable-length Phatic Interjectives on the World Wide Web" (University of New Mexico Computer Science Department Tech Report TR-CS-2001-23). I eagerly await further results in this exciting pico-field.

Being what I am, however, I can't resist pointing out that looking for a straight line on a log-log plot, and even finding one with high r-squared, is simply not a reliable way of checking whether a distribution is a power-law. Please do not do this. (And yes, I should be finishing that paper on the right approach, rather than blogging.)
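
A quick way to see the problem is to run the log-log recipe on data that is known not to be a power law. The sketch below is an illustration with made-up parameters: it draws a log-normal sample, plots (in effect) the upper half of the empirical complementary CDF on log-log axes, and computes the $R^2$ of the least-squares line through it. The function name and the choice to use the upper half of the sample are assumptions of this sketch, not anyone's recommended procedure.

```python
import math
import random

def loglog_r_squared(sample):
    """R^2 of a straight-line fit to the upper half of the empirical CCDF, on log-log axes."""
    xs = sorted(sample)
    n = len(xs)
    # Point i is (log x_(i), log of the fraction of the sample at or above x_(i)).
    pts = [(math.log(xs[i]), math.log((n - i) / n)) for i in range(n // 2, n)]
    m = len(pts)
    mx = sum(x for x, _ in pts) / m
    my = sum(y for _, y in pts) / m
    sxx = sum((x - mx) ** 2 for x, _ in pts)
    syy = sum((y - my) ** 2 for _, y in pts)
    sxy = sum((x - mx) * (y - my) for x, y in pts)
    return sxy * sxy / (sxx * syy)

rng = random.Random(0)
lognormal = [rng.lognormvariate(0.0, 1.5) for _ in range(5000)]
r2 = loglog_r_squared(lognormal)
print(round(r2, 3))
```

The fit should come out with an $R^2$ above 0.9, even though the log-normal's tail curves steadily downward on these axes and no power law is anywhere in sight. A high $R^2$ on a log-log plot is therefore no evidence for a power law.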

Power Laws

Posted by crshalizi at September 20, 2006 13:23 | permanent link

September 19, 2006

A Triumph of Socialist Realism

My copy of Michael Bérubé's What's Liberal About the Liberal Arts? arrived in the mail yesterday (thanks to the unionized workers at Powell's), only to coincide with the release of the graphic novel (4Mb PDF). As a left-liberal professor, I can proudly testify to the complete and utter accuracy of this document's depiction of our revolutionary praxis. (At least, when we are not teaching fluffy courses about enchiladas.) My students should take notes, because this will be on the exam, particularly the bits about camels.

However, What's Liberal About the Liberal Arts? The Graphic Novel shows me that I must also engage in self-criticism. I have been complacent about enforcing political correctness, behaving as though statistics and machine learning could be taught in a neutral manner, when in reality every classroom is a site of conflict. In particular, I have been insufficiently vigorous in struggling against my students' tendencies to embrace bourgeois individualism, in the form of the theory of merely personal, subjective and idealist probability associated with a clerical ideologue of British imperialism. I see that I must double and re-double my efforts to indoctrinate them in the principles of frequentist probability, based as it is on collectives and the work of social democrats. Might I be permitted to hope that comrade Gonick (also cited by comrade-professor Burke) will produce a proletarian, graphic version of the great red book epitomizing Peircism-Popperism-Neyman Pearson Thought?

(Via Pharyngula, Crooked Timber and John Burke [in e-mail] more or less at once)

Learned Folly; Corrupting the Young; The Progressive Forces; Enigmas of Chance; Afghanistan and Central Asia

Posted by crshalizi at September 19, 2006 13:49 | permanent link

Shorter Rajiv Chandrasekaran

"Better red than expert".

(Some people managed to report this three years ago.)

The Continuing Crisis; The Running-Dogs of Reaction

Posted by crshalizi at September 19, 2006 12:43 | permanent link

Piracy and Peer Production

In honor of International Talk Like a Pirate Day, I bring you:

Coincidentally, the campus movie club just screened the latest Disney pirate movie; at least I think that was what they were showing, since their poster showed a strong influence, in its iconography, of a similarly-titled but not quite so work-safe production.

Linkage

Posted by crshalizi at September 19, 2006 12:27 | permanent link

September 16, 2006

Data Mining (36-350) Lecture Notes, Weeks 1--3

These handouts are shamelessly ripped off derivative work, amplifying and expanding those created by Tom Minka when he invented this course. (See his originals here.) Posted here in response to a number (> 1) of requests.

Lecture 5 is also a shameless rip-off explication of Aleks Jakulin's "Quantifying and Visualizing Attribute Interactions" (cs.AI/0308002).

Note to students in 36-350: This page will not keep up to date with the handouts, or with other course documents; use Blackboard!

  1. Searching Documents by Similarity (28 August 2006). Why similarity search? Defining similarity and distance. The bag-of-words representation. Normalizations. Some results.
  2. More on Similarity Search (30 August 2006). Stemming, linguistic issues. Picking out good features, or at least ignoring non-discriminative ones. Inverse document frequency. Using feedback from the searcher.
  3. Searching Images by Similarity (6 September 2006). Representation and abstraction. How to search images without looking at images; a failure-mode. The bag-of-colors representation. More examples. Invariance and representation. See also: slides illustrating this lecture.
  4. Finding Informative Features (11--13 September 2006). More on finding good features. Entropy and uncertainty. Information and entropy. Ranking features by informativeness. Examples.
  5. Interactions Among Features (18 September 2006). Redundancy and enhancement of information. Information-sharing graphs. Examples.

Corrupting the Young; Enigmas of Chance

Posted by crshalizi at September 16, 2006 12:56 | permanent link

September 09, 2006

Glory and $500 (VIGRE-funded Undergraduate Research in Statistics at Carnegie Mellon)

Hey, kid! Interested in winning eternal intellectual glory and entering the glamorous world of scientific research? Interested in $500 for the semester? Are you an undergrad at Carnegie Mellon University? If so, the statistics department has no less than nine possible projects for you. (Some of them are mine; one began as a blog-post, another as a notebook entry.) Apply now!

Update, 29 March 2007: See here for summer 2007 offers — all the glory, and more money!

Update, 26 January 2008: and see here for spring 2008 semester.

Enigmas of Chance; Corrupting the Young

Posted by crshalizi at September 09, 2006 22:30 | permanent link

September 08, 2006

Friday Cat Blogging (Latest Research in Chaos Theory and Complex Systems Issue of Science Geek Edition)

In his writings on the theory of automata, John von Neumann defines a complex system as one where the best model is, in fact, the system itself, and specifically mentions cats as an example of such complexity (thereby demonstrating not merely sagacity but taste). Since then, I am sorry to say, there has been a sad lack of interest in feline behavior within complex systems theory; but no longer. Going to the pet store the other day, I blundered into an aisle of motorized cat toys. Mindful of Abbas Raza's post on 3QD, I actually looked at them; and of course when I came to one that boasted an Amazing Chaos Wand, I had to look at it; and of course I had to buy it when I read the following ad copy on the side of the box (my links):

Professor Nozawa
CHAOTIC TOYS FACTORY, LTD.

The Cat Attack uses the latest research in chaos theory and complex systems to emulate the movements and personality of a cat's favorite prey. This "virtual mouse" technology utilizes algorithms based on a six-dimensional coupled nzmap system modeled on the neural network of a real mouse. What that all means is that the Cat Attack's "virtual mouse" will become your cat's new best friend!

Professor Shimada
NIHON UNIVERSITY

Dr. Nozawa has developed an amazing and practical use for "deterministic chaos". It was Norbert Wiener who pointed out the importance of random noise in the brain and in automatic control systems. However, Dr. Nozawa showed that is was possible to solve the difficult "Traveling Salesman Problem" of optimization faster by using nonlinear dynamics, rather than random noise. The "Cat Attack" toy developed by Dr. Nozawa's company operates using powerful nonlinear dynamics algorithms, and this leading edge of human knowledge is appreciated by, among others, the Nozawa family cats, Tal, Fu and Phi, who seem to think the "Cat Attack" is truly alive.

I have turned this apparatus over to a collaborator who specializes in these issues, and she reports good results while the batteries last.

Friday Cat Blogging; Complexity

Posted by crshalizi at September 08, 2006 11:24 | permanent link

September 07, 2006

Completing the Circuit

Since everyone is going on about Leszek Kolakowski (starting with Tony Judt, and thence Jonathan Goodwin, 3QD, DeLong) or, separately, Althusser (Bérubé, McLemee), I bring you: Kolakowski demolishing Althusser. Read it and weep, or laugh, as hailed. And do read Main Currents of Marxism, now that it's out in one volume ("so handy for pocket or purse", in the words of a certain rootless cosmopolite).

This completes your moment of Marxist harmonic convergence.

The Progressive Forces; Philosophy

Posted by crshalizi at September 07, 2006 09:09 | permanent link

August 31, 2006

Books to Read While the Algae Grow in Your Fur, August 2006

Thomas G. Pavel, The Spell of Language: Poststructuralism and Speculation
Translated by Linda S. Jordan, with the author, from Le Mirage linguistique; also published in translation as The Feud of Language: A History of Structuralist Thought. Basically, an attempt to answer the question of why mid-century French thought became so taken not just with the idea of language, but with certain thematic aspects of one particular approach to language which was already being surpassed in linguistics itself. His answer: it was an attempted "modernization of the human sciences", which were felt, for reasons Pavel goes into, to be comparatively backwards in France. (This would be stronger, I think, if he could point to other efforts in this direction, contemporaneous with the beginnings of structuralism, even if he couldn't say why those failed to thrive.) Erudite about a wide range of scholarly fields, often very shrewd (see especially chapter 6, "On Discretionary Intellectual Behavior"), but in places over-written in much the same style as the authors critiqued, and sometimes rhetorically over-stated. Recommended if you care about structuralism and its spawn.
Thomas E. Ricks, Fiasco: The American Adventure in Iraq
How we got into this goddamn mess, and just how hard it will be to even start to act in ways which won't make things worse. (He only hints, though, at why the Army was so resistant to learning about counter-insurgency.) Depressing as hell, but strongly recommended.
Some bloggers have been upset that Ricks wrote much more "on message" stories for the Washington Post (e.g., his 2003 profile of Wolfowitz, not at all like the portrait in Fiasco, has been singled out for criticism) at the time all this was going on. (If you want to follow this polemic/flame-war, start from here and work backward through the links.) I'm prepared to offer at least a partial and cynical defense. It's clear, from his excellent earlier book on the Marines, that he wants to like the US military, but isn't naive about it. He wrote a remarkable joint story with Anthony Shadid from 2 June 2003, in which Ricks went on a patrol through Baghdad with a US Army unit, and Shadid followed behind talking to the Baghdadis; this doesn't leave you with the illusion that it's all going to go swimmingly and the boys will be home by Christmas. So my guess is that he felt there was just no way he could continue to work on the story, without filing the kind of pieces he did; they were the price of access. Even the Wolfowitz profile ran in parallel with a profile of General Zinni, in fine anti-this-war-now form, making it reasonably clear to any reader which man knew what they were talking about, without actually saying that one of them had his head in the clouds, if not up his ass. It's depressing, and disturbing, that an excellent journalist at a major American newspaper felt he had to employ such Aesopian (not to say Straussian) devices, yes, but am I in a position to say he was wrong?
Frank J. Sorauf, Inside Campaign Finance: Myths and Realities
A well-written look at how the campaign finance system actually worked between the post-Watergate reforms and 1992, when the book was printed. A lot of this was quite eye-opening: it's surprisingly hard to find evidence that PAC donations affected roll-call votes, for instance. Incumbent members of Congress, according to Sorauf, were already much better electoral bets than challengers, and had been for decades, which is why the former found it much easier to raise money than the latter, though money had a much higher marginal impact on challengers' ability to get votes than it did on that of incumbents. So (to gloss over a lot of Sorauf's nuances) campaign donations weren't so much bribes, to get politicians to do things they wouldn't've otherwise, as tribute, to keep the contributors from being shut out of influence. (Who was it who said that "If you can't take their money, drink their whiskey, sleep with their women, and vote against them anyway, you don't belong in Congress"?) I used the past perfect tense because the book ends with the 1992 election, and that as an epilogue; it was frustrating to repeatedly find Sorauf making a good point, and then wonder what had happened in the last decade and a half.
Update: John Burke, in e-mail, remembers the "If you can't take their money" line as originating with Jesse Unruh, and Wikipedia agrees, so that must be right. (Wikipedia quotes the saying as "If you can't take their money, drink their booze, eat their food, screw their women, and still look them in the eye and vote against them, you don't belong here", i.e. in the legislature.)
Rashid Khalidi, Resurrecting Empire: Western Footprints and America's Perilous Path in the Middle East
Capsule introduction to the unhappy experiences of the Middle East with interventions by western powers over the last two centuries, and some reasons why the latest self-proclaimed liberators of Baghdad could expect to be received with some skepticism, whatever their actual intentions. Written in mid-2003, and pretty much borne out by events.

Books to Read While the Algae Grow in Your Fur

Posted by crshalizi at August 31, 2006 23:59 | permanent link

August 24, 2006

Two Links on Scientific Programming

This looks useful (via Fresh Tracks).

This is only too accurate.

Linkage

Posted by crshalizi at August 24, 2006 09:09 | permanent link

August 23, 2006

The Evolution of Complexity (Encore Performance)

Constant readers may recall the advert for a workshop on the evolution of complexity that ran here in January. Because of the way that workshop went, the call for papers will be continued until morale improves has been renewed, with all papers to appear in a special issue of Artificial Life. I reprint the CFP below. Notice that this time, the length limits are the ordinary ones of the journal (2000 words for letters, 12000 words for articles), not the harder limits of the workshop. This should be good.

Special Issue on the Evolution of Complexity

Artificial Life journal

Call for Papers

Guest Editors:

Carlos Gershenson
Centrum Leo Apostel, Vrije Universiteit Brussel
Krijgskundestraat 33. B-1160, Brussels, Belgium

Tom Lenaerts
SWITCH, Flanders Interuniversity Institute for Biotechnology
Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium

Motivation

As a result of the quality of the Evolution of Complexity workshop at ALife X last June in Bloomington and the interest of the attendees, we announce a call for papers for a special issue on this theme for the Artificial Life journal.

The evolution of complexity is a central theme in Biology. Yet it is not without ambiguity. Complexity has been used to refer to different things. For instance, complexification has been interpreted as a process of diversification between evolving units or as a scaling process that is related to the idea of transitions between different levels of complexity. Other meanings of complexity have been introduced, both inside and outside of Biology. In most cases, though, the central concern is to understand what produces complexity.

The focus of this special issue will be on biological interpretations of complexity and on evolutionary and related dynamics as driving mechanisms for producing complexity. Questions to be addressed in the special issue include:

  • How could complexity growth be measured or operationalised in natural and artificial living systems?
  • How can existing data from nature be brought to bear on the study of this issue?
  • What are the main hypotheses about complexity growth that can actually be tested today?
  • Are the principles of natural selection as they are currently understood sufficient to explain the evolution of complexity in living systems?
  • What are the environmental and other constraints of the evolution of complexity in living systems?
  • What is the role of developmental mechanisms in the evolution of complexity in living systems?
  • What conditions could reduce evolved complexity in living systems?
  • What factors allow the evolution of complexity in living systems to be manipulated and controlled?
  • What models are most appropriate for understanding the evolution of complexity in living systems?

Paper Submission:

Submitted articles and letters should follow the submission guidelines of the Artificial Life Journal, available at http://mitpress.mit.edu/ALIFE. Authors should also include a cover letter describing briefly the relevance of their article to the specific topic of this call.

These articles and letters should NOT be submitted to the journal editor, but should be uploaded through the special issue website (single PDF files only, include cover letter as the first page of the paper).

Papers will be judged by members of the Program Committee on their relevance to the call for papers, originality, clarity of the presentation, and overall quality.

Important Dates:

Submission deadline: December 15th, 2006
Notification of acceptance: February 1st, 2007
Camera-ready papers due: March 1st, 2007

Programme Committee:

Chris Adami
Lee Altenberg
Mark Bedau
Hugues Bersini
John Bonner
Dominique Chu
Jim Crutchfield
Bruce Edmonds
Carlos Gershenson
Mario Giacobini
Francis Heylighen
Tom Lenaerts
Juan Julián Merelo
Barry McMullin
Chrystopher Nehaniv
Charles Ofria
Jorge Pacheco
Tom Ray
Jon Rowe
Stanley Salthe
Cosma Shalizi
Richard Watson
Larry Yaeger

Biology; Complexity

Posted by crshalizi at August 23, 2006 18:00 | permanent link

August 22, 2006

Go Outside and Play, Why Don't You?

Posting will be light as I gear up for teaching (36-350, data mining for undergraduates, based on this excellent course template by Tom Minka, and Principles of Data Mining by Hand, Mannila and Smyth), and try to finish some papers. I'd advise you to go outside and enjoy what's left of the summer, but since you're reading this, you're not the type of person to do that. Go read Three Quarks Daily instead: they've pre-empted my idle notions of writing posts on deep states and Jack Chick, and they have a better sense of humor and much better taste in art than I do.

The regularly-scheduled cursing of R, LaTeX, time, and an inadequate supply of ibuprofen will now recommence.

Linkage

Posted by crshalizi at August 22, 2006 15:10 | permanent link

August 19, 2006

Bush as Intellectual

This is a joke that never grows old — just more bitter. (The latter via everyone in the known universe; provoked into posting by John Burke in e-mail.)

The Commonwealth of Letters; The Running-Dogs of Reaction

Posted by crshalizi at August 19, 2006 18:55 | permanent link

August 18, 2006

On the Superiority of Sociology to String Theory

One of my friends in graduate school had an adviser with a gift for memorable expressions, at least for a theoretical physicist, especially when dismayed by some stupidity. Two of his put-downs which stuck with me were "I could go crazy tomorrow and find an appointment in the sociology department", and "I don't want to criticize you, but this is the way superstring people think". I was never sure which was supposed to be worse, but now I know. Sociologists have many faults, but they do know better than to try explaining a variable with a constant, while string theorists evidently do not. (Via CapitalistImperialistPig, who has better things to write about.)

The fact that Prof. Motl reasons so badly here that he'd fail my freshman stats class is, of course, infinitely less offensive than the fact that he's a bigot (of the "we must squarely face the harsh light of my pseudo-scientific prejudices" variety). But I can't help feeling (hoping, even) that the two sorts of idiocy are linked.

Update, next day: Greetings, readers from Reference Frame and Brad DeLong! Just to correct some mis-apprehensions: my Ph.D. is in statistical physics, not sociology; I'm an assistant professor of statistics (not sociology) at Carnegie Mellon (not Michigan, where I was a post-doc); the closest I have ever come to "committing a social science" was drawing a map with some other physicists. If you're actually interested in my qualifications, you can look at my CV, or my research and teaching.

Second Update, 20 August: I'm afraid I was too elliptical above. The occasion for Motl's outpourings was this story in the New York Times on how the fraction of black and Hispanic students at New York's specialized high schools, like the famous Bronx High School of Science, has fallen over the last ten years. Fallen, as in, decreased, as in, changed over time, as in, been variable. To account for this, Motl pointed to the black-white IQ gap, which he proclaims one of the great invariant facts of human life, as in, nothing changes it, as in, constant. Even if one grants him his premise (which I would not), the IQ gap might, with a lot of other assumptions, explain why the number of blacks in these schools is low, but cannot explain why it has fallen. This is why I said he would fail my freshman stats class. A Marxist who tried to use the conflicting interests of capital and labor to explain the wage stagnation of recent decades would be guilty of exactly the same fallacy, and I'd fail them for the same reason.
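For the programming-minded, here is a minimal sketch of the fallacy in Python. The numbers are invented purely for illustration (they are not the enrollment figures from the Times story), and the names `outcome` and `constant` are my own. The point is only that something which does not change has zero covariance with something which does, so it cannot account for any of the change.

```python
from statistics import mean


def covariance(xs, ys):
    """Population covariance of two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / len(xs)


# Made-up illustrative data: ten consecutive years.
years = list(range(1996, 2006))

# An outcome that falls steadily over the period (hypothetical values).
outcome = [0.20 - 0.01 * i for i in range(10)]

# A proposed "explanation" that is assumed not to change at all.
constant = [1.0] * 10

# A constant never deviates from its own mean, so every term in the
# covariance sum is zero: it co-varies with nothing.
cov_constant = covariance(constant, outcome)

# Time itself, by contrast, does co-vary with the falling outcome.
cov_time = covariance(years, outcome)

print("covariance with the constant:", cov_constant)
print("covariance with the year:    ", cov_time)
```

Whatever one thinks of the premise, an explanation of the decline has to invoke something which itself changed over those years.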

I hope I have made myself clear.

Manual trackback: Nanopolitan

The Running-Dogs of Reaction; Learned Folly; IQ

Posted by crshalizi at August 18, 2006 23:45 | permanent link

August 15, 2006

Analogous Partitions or Alphabetical Provocation?

A cheerful comparison problem, in honor of the day:

India:Pakistan::Israel:Palestine
or
India:Pakistan::Palestine:Israel

Which analogy is more valid? Which one is more inflammatory?

The Continuing Crisis

Posted by crshalizi at August 15, 2006 13:15 | permanent link

August 14, 2006

Rootless Cosmopolites

Reading Billmon and Adele Stan reminds me of a question which has been bugging me for a while, but not so much as to actually investigate it. Namely: how much did traditional anti-Semitic stereotypes contribute to the stereotypes associated nowadays with conservative attacks on the "liberal cultural elite", "transnational professional class", etc.? (I'm thinking of ideas like: Jews run the media etc. behind the scenes; Jews are clever but shallow; the bankers are all Jews; the Communists are all Jews; the Jews want to do away with our wholesome institutions and religion.) It would seem like a natural translation for someone to have made, but I don't know of any evidence that it did happen that way. Maybe, after all, there are only so many ways of disliking other groups that any pair of negative stereotypes is going to have a lot in common, if you look for it. (Of course, people who harbor or play to such stereotypes about liberals are not necessarily anti-Semites, even if those stereotypes historically developed out of anti-Semitic ones.)

Has anyone with some actual knowledge looked into this?

Update 17 August 2006: Edward Burns writes to point out that before the current fabricated outrage over the "war on Christmas", it was being pushed as a UN plot by the John Birch Society in the 1950s, and before that Henry Ford was warning that the Jews were trying to get rid of Christmas and Easter. (See this good story by Michelle Goldberg in Salon.) It's not clear, however, that there was any actual transmission of ideas from one episode to the next; if I were a historian I'd think it would be worth looking into, though.

The Running-Dogs of Reaction

Posted by crshalizi at August 14, 2006 20:43 | permanent link

August 12, 2006

Parasites and Reintermediaries of Death

Starting about a year ago, I have refused to referee papers for journals owned by Elsevier, since it sticks in my craw to provide free labor for people who turn around and gouge the academic community mercilessly. This reasoning applies, to some degree, to all commercial journal publishers, though Elsevier is unusually exploitative in its pricing. There is however a more substantial reason to dislike them: their — forgive the phrase — mercenary involvement in the international arms trade. Tom Stafford, who blogs at Idiolect, is organizing a petition of academics to try to get Elsevier to stop organizing arms fairs; it's worth signing.

Learned Folly

Posted by crshalizi at August 12, 2006 17:24 | permanent link

Zadig, or, The Book of Fate

Almost two years ago, I complained that there was no English e-text of Zadig, Voltaire's oriental Enlightenment detective fable. At the time, I thought this was my usual pointless whining into the void; but no. Through the good efforts of Barbara Tozier, and the rest of the people associated with Project Gutenberg and Distributed Proofreading, one of the most charming of M. Voltaire's literary productions is now available for free to the English-reading web, in both plain-text and handsome HTML versions.

You realize this means I'm only going to whine here more.

The Commonwealth of Letters; Linkage

Posted by crshalizi at August 12, 2006 16:50 | permanent link

August 04, 2006

Maladministration of Organs

Cross-posted to Crooked Timber.

Kieran Healy's post about his book on organ donation gives me a hook to write something about the other end of the system, about organ recipients and the institutions which are supposed to match them up with donated organs. More specifically, how one such institution, the Kaiser HMO of Northern California, quite spectacularly failed several thousand people who were depending on them, by not matching them up. The story has been around since early May, when it was broken by Charles Ornstein and Tracy Weber in the Los Angeles Times (cached here), since confirmed by an investigation by Medicare/Medicaid. It doesn't seem to have gotten all that much attention among the blogs, but it's outrageous, and deserves, for that reason alone, to be better known.

Kaiser is a very large and old HMO, with a huge presence in the Bay Area and northern California. (In fact it was one of the very first HMOs, started by Kaiser Steel during WWII as a way of attracting workers to its foundries and shipyards when wages were frozen.) It actually consists of non-profit health plans and hospitals, and for-profit physicians' partnerships, the Permanente Medical Groups; normally, patients in the health plans have to go to physicians in the groups to have their medical expenses covered. One notable exception, up until 2004, was the 1500--2000 Kaiser patients in northern California who needed kidney transplants because of renal failure: those procedures were sub-contracted to the hospitals at UCSF and UC Davis, which also managed the patients' places on the waiting list.

That last bit requires some explanation. Once organs are donated, they need to be matched up to recipients. This is done by an organization called the United Network for Organ Sharing, which tries to trade off urgency, seniority (i.e., time spent waiting for an organ), proximity and compatibility (since the closer the match between the donor's immune system and the recipient's, the fewer problems from rejection). There is a heavy weight put on seniority, though especially close immunological matches can over-ride it. Each transplant center is responsible for keeping the network up-to-date about their patients who need organs, their immunological profiles, and their time spent on the waiting list.

What seems to have happened is that in 2002, a transplant surgeon named Arturo Martinez proposed to Kaiser that it could save money, and increase the utilization of its hospitals' surgical capacity, by bringing the kidney transplant program in-house, and Kaiser agreed, with Martinez becoming head transplant surgeon. (It would be unfair, at this point, to say that Kaiser did this because it meant more business for the for-profit Permanente Medical Group, but it's hard to imagine that counted against the proposal.) As of mid-2004, Kaiser patients on the waiting list were informed that they would no longer be covered for transplants at UCSF or UC Davis, though they were free to go ahead and have them if they could come up with the money (roughly $100,000).

So far, all this is maybe a little self-serving on Kaiser's part, but not, in itself, appalling. (It's certainly more than legitimate for health-care organizations to try to save money.) What happened, though, was that Kaiser completely screwed up the program. Remember that organs are allocated (basically) through the UNOS system. The patients were being removed from the listings under the university hospitals, and being added to the listings under the new Kaiser transplant program. Unless this was done correctly, this would mean that they'd look like new names on the list, and so all of their accumulated waiting time, one of the main determinants of priority, would vanish. This happened to a huge number of people on the list, basically reducing the chance that they'd get a kidney to next to nothing. This becomes less surprising when one learns that Kaiser never consulted UNOS about the massive transfer of patients it was planning, and "placed responsibility for submitting patient data ... in the hands of a single clerk who had one hour of telephone training on UNOS's database", though not any more excusable. Needless to say, patients were not told that by staying with Kaiser, they were losing their place on the lists, and thereby reducing their odds of survival. Some of them, at least, seem to have been assured that they were keeping their places, when that wasn't true, though this is less clear to me.
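To see why losing accumulated waiting time is so damaging, here is a toy sketch in Python. I must stress that the scoring rule is hypothetical: it is not the actual UNOS allocation formula, it ignores urgency and proximity entirely, and the names (`Candidate`, `priority`, `match_bonus`) and all the numbers are invented for illustration. It only captures the two features described above: seniority carries heavy weight, and a zero-mismatch organ can over-ride it.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    name: str
    days_waiting: int
    zero_mismatch: bool = False  # especially close immunological match


def priority(c, seniority_weight=1.0, match_bonus=10_000.0):
    """Hypothetical score: waiting time plus a large bonus for a perfect match.

    This is an assumption made for illustration, NOT the real UNOS rule.
    """
    bonus = match_bonus if c.zero_mismatch else 0.0
    return seniority_weight * c.days_waiting + bonus


def rank(candidates):
    """Names ordered from highest to lowest priority."""
    return [c.name for c in sorted(candidates, key=priority, reverse=True)]


# Before the transfer: patient A has waited longest and heads the list.
before = [Candidate("A", 1200), Candidate("B", 400), Candidate("C", 90)]

# After a botched transfer: A is re-registered as a brand-new name,
# so the accumulated waiting time is gone.
after = [Candidate("A", 0), Candidate("B", 400), Candidate("C", 90)]

# A zero-mismatch organ can still put a recent arrival at the top.
with_match = before + [Candidate("D", 10, zero_mismatch=True)]

rank_before = rank(before)
rank_after = rank(after)
rank_with_match = rank(with_match)

print(rank_before)
print(rank_after)
print(rank_with_match)
```

In this toy version, the patient who had waited longest drops from first to last simply by being re-entered as a new name, with nothing about their medical condition having changed.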

Losing seniority on the transplant lists wasn't the only problem. Kaiser did very few transplants, compared to the number of organs which were available. This happened in part because they just didn't have the capacity to keep up with their many patients (at one point they were down to a single nephrologist for the whole program, who was also supposed to be its medical overseer), and in part because of what seems to have been mis-placed perfectionism or caution. These combined to the point of repeatedly turning down "zero mismatch" kidneys, ones where the likely compatibility over-rode considerations of seniority. This happened several dozen times at least (twice for one patient alone). Again, needless to say, patients weren't told about this. In an "it's not a bug, it's a feature" moment, Kaiser initially attempted to defend its program by pointing out how few patients had died after transplants, since they'd done so few.

What strikes me as especially outrageous about all this is that the people being screwed over were people who needed new kidneys. To state the obvious, anyone who needs an organ transplant is very ill. It's maybe less obvious that being that ill is a full-time job. One of the vital parts of the body is no longer working; to substitute for it requires extraordinarily complicated, time-consuming and generally unpleasant procedures. People who need new kidneys are people who are kept alive by dialysis, which is, indeed, complicated, time-consuming, often painful, almost always exhausting, and carries a non-trivial risk of infections, possibly fatal. People who need new kidneys are also often people who are very ill in other ways, since it's not that common for both your kidneys to just stop working if nothing else is going on. (Kidney problems are, for instance, a not-uncommon complication of diabetes, and of high blood pressure. Dialysis, naturally, messes with blood pressure, adding yet another variable to monitor and regulate.) Simply staying alive, when you are a multiply-sick person with organ failure, can pretty much demand all the time and attention you have to give, and a fair chunk of your loved ones' as well. (There are good reasons why the families of people in situations like this tend to fall ill themselves.) You are certainly not in a position to check up on whether your medical organization has, through incompetence, messed up your position on the transplant lists. (Some Kaiser patients actually tried to keep up with their place on the lists, but were given the run-around.) And as for switching to another medical organization, do please show me the company which will extend coverage to someone who needs a new kidney, at a price which can be afforded by someone who needs a new kidney.

Since the Times broke the story, there's been some improvement. The doctor who was medical head of the program, and apparently at least partly responsible for the snafu of not transferring patients' time, Sharon Inokuchi, has been relieved of her administrative duties. (In fact, if memory serves, she left Kaiser, but now I can't find where I think I read that.) The program has been investigated by the Centers for Medicare and Medicaid Services, which basically confirmed the newspaper reports, and forced it to promise major changes; it could still lose its eligibility for funding under those programs. The California state agency which regulates managed care is still, I believe, investigating. There's talk of large fines, and there will certainly be lawsuits. All of this is to the good; it's certainly better than nothing. But still, thinking about this makes me angry: Kaiser had a duty towards many very sick people, who were in a very poor position to look after themselves. It failed in that duty quite dramatically. In any organ transplant program, patients will die while waiting for a match. In most kidney transplant programs, though, about twice as many patients receive transplants as die while waiting; Kaiser managed to reverse that ratio. While it's hard, in the nature of things, to identify any one patient who's died and say "They would have lived, if only Kaiser hadn't done this", it's almost certain that more of these people have died than would have otherwise. I don't have a better remedy to propose than fines or lawsuits or institutional tinkering, but they all seem horribly inadequate.

Posted by crshalizi at August 04, 2006 14:49 | permanent link

August 02, 2006

The Starry Heavens Above

Cross-posted to Crooked Timber.

Now this is what I call "filling the mind with ever new and increasing admiration and awe, the more often and steadily we reflect upon them".

(Via David R. in e-mail.)

The Eternal Silence of These Infinite Spaces

Posted by crshalizi at August 02, 2006 11:46 | permanent link

July 31, 2006

Books to Read While the Algae Grow in Your Fur, July 2006

Joel Best, Flavor of the Month: Why Smart People Fall for Fads
An attempt at a sociological description and explanation of institutional fads, meaning things more like phonics or total quality management than, say, Rubik's Cube, or even blogging. Short version: our institutions always have problems, some of them real and remediable, and some of them simply because we can imagine them doing much better than they are, and believe in the perfectibility of social arrangements. (E.g., he argues that pretty much any way of teaching children to read will always lead to some who aren't "up to level", and so create an audience for some new way of teaching them to read.) On top of this, there are various structural biases in media and social networks which will tend to propagate news of new ideas and their successes more efficiently than failures or grounds for skepticism. (At some points he seems to be saying that some of these things are unique to America, or at least more pronounced here than in other industrialized countries, but of course presents no comparative data that we are more prone to institutional fads than, say, the French or the Japanese, or even than the Canadians.) Worth reading, which can be done in a weekend.
Naomi Novik, Black Powder War
Our heroes (see previous installments) cross Asia by the old silk road, only to help lose the Battle of Jena. (I refuse to regard this as a spoiler.) Continues to provide candyish satisfaction. Many hints are dropped as to plot developments in sequels; I hope Novik has the strength of auctorial character to resist following all of them up.
Andrea Camilleri, The Shape of Water and The Snack Thief
See earlier remarks on Camilleri. The series grows on me the more I read; fortunately for my productivity not that many books have been translated.
Clark Glymour, The Mind's Arrows: Bayes Nets and Graphical Causal Models in Psychology
What graphical causal models are, why they are such a good way of representing causal structure, why they might be good ways of representing causal knowledge, and ways various parts of psychology could benefit from using them. Much of Part III, on inferring mental architecture from lesions in neuropsychology, could almost equally well apply to functional imaging studies (except that lesion work is better grounded). Glymour spares few opportunities to point out just what a horrible idea it is to use linear regression and factor analysis for causal inference, concluding with a chapter on The Bell Curve. On the one hand, Glymour shows that, using the kind of evidence Herrnstein and Murray do, it is not possible to draw any reliable conclusion at all about the causal relations between IQ and social outcomes. On the other hand, an awful lot of mainstream social science uses the same kind of data and the same kind of method... That chapter in particular will make little sense to readers who haven't, at the least, forgotten how to calculate factor loadings; but I am not sure the argument could be honestly made in any more popular way, and I am impressed by how far Glymour has gone to simplify his rather technical work in this area.
On the down side, there are some really awful puns.
Kenneth Pomeranz, The Great Divergence: China, Europe, and the Making of the Modern World Economy
Gets its own review: King Cotton and King Coal Raise the West.
H. R. Trevor-Roper, The Hermit of Peking: The Hidden Life of Sir Edmund Backhouse
Droll and engrossing historical detection on a massively eccentric and bizarre literary (and commercial and espionage) fraud artist. I quote an early passage describing Backhouse's memoirs, to which John Burke drew my attention, to give some flavor of the whole: "I had not read far before I realized why the Swiss custodians of these volumes had preferred not to entrust them to the post. How, I asked myself, would a right-minded and conscientious customs officer react if he were to open and read these works? The text would surely be confiscated, and perhaps the law would inconveniently take note of the sender and the addressee. For the volumes were of no ordinary obscenity." (As John says, the word "ordinary" is a wonderful touch here.) — But this was in 1973; standards have, to say the least, changed somewhat, and I wonder if anyone has thought to publish Backhouse's memoirs, perhaps in conjunction with some queer/post-colonial effort at rehabilitation?
Alexandre J. Chorin and Ole H. Hald, Stochastic Tools in Mathematics and Science
A short (two-hundred-odd pages), highly non-rigorous, quick-and-dirty introduction to stochastic methods for applied mathematics, starting with basic probability and expectation, and ending up with some nice non-equilibrium statistical mechanics. (They do however assume quite substantial familiarity with linear algebra, operators, etc.) I am enough of a probability geek, at this point, to wish that they had done things a bit more rigorously and a bit more abstractly in places, because I think it would help the reader see, e.g., the Ornstein-Uhlenbeck process as part of a general pattern of stochastic ordinary differential equations, rather than a tricksy special case. Also, there are more places than I'd like where they don't lay out the motivation clearly, and a self-studying reader is going to say "wait, where is this going exactly?"; this is not so much of a problem in a classroom setting, which is where these notes began. I'd be very happy to teach from this text.
Kirk Mitchell, Sky Woman Falling and Dance of the Thunder Dogs
Latest (and possibly last) in a series of police procedurals set in Indian Country, broadly construed. Reasonably addictive; the earlier installments (Cry Dance, Spirit Sickness, Ancient Ones) are in some ways even grimmer, because they focus more on the complicated relationship of the detectives.

Books to Read While the Algae Grow in Your Fur

Posted by crshalizi at July 31, 2006 23:59 | permanent link

One Big Mutual Fund, or, The Ownership Society (Modest Proposals for the Reform of Corporate Governance, Part 2)

Attention Conservation Notice: Over 1500 words on a wacky quasi-socialist economic scheme, from someone utterly lacking in credentials in economics. The scheme does not respect the sanctity of private enterprise, but at the same time would not reduce the alienation of labor one iota. Includes a lengthy quotation of a game-theoretic impossibility result. Also, you already saw it when it was cross-posted at Crooked Timber.

In the previous installment in this series of modest proposals, I looked at ways of making the incentives of the managers of large, publicly-held corporations align more closely with those of their long-term shareholders. This left alone the question of the beneficiaries of corporate value; assuming that the managers are busily working to maximize their revenue streams, who benefits from their industry and diligence? Having just read Mark Greif's great essay on redistribution in n+1 (issue 4; sadly, not online), I would like to make a suggestion.

The text for today is Gary Miller's Managerial Dilemmas: The Political Economy of Hierarchy, an excellent book which I learned about from Henry Farrell. Ambitiously, Miller tries to explain why hierarchical corporations exist at all, why they take some of the forms they do, and how, in part, their form relates to their performance. Much of the book, especially the first part, is a partially-successful attempt to find good economic reasons for their features, i.e., efficiency-enhancing ones. (He does not seriously consider the option that enterprises are hierarchical for non-economic reasons, say that some people like bossing others around, which hierarchies let them do, and those people are able to select hierarchies over other, more efficient, forms. After all, it's hardly historically unprecedented for powerful people to prefer institutions which lower aggregate output but give them a bigger share of the product. See, e.g., here.) He also tries to explain why theories of corporate organization that rely solely on economic "mechanism design", i.e., structuring information and material incentives, will actually lead to sub-optimal results, for pretty basic game-theoretic reasons; getting beyond these impasses is fundamentally a political problem. This is potentially quite subversive in its own way, but it's really the first part of the work, about the economic justification of the hierarchical enterprise, that I'm going to twist and abuse.

One of the features of the modern corporation that Miller attempts to rationalize is the existence of shareholders who are passive and, in the overwhelming majority, utterly disconnected from the day-to-day or even year-to-year operations of the company. He does so by means of the following impossibility theorem, attributed to Bengt Holmstrom. Having tried to summarize Holmstrom's theorem better than Miller, and failed, I'll just quote Miller.

Holmstrom assumes that there are n agents whose actions determine a level of revenue x. The actions taken are unobservable and are costly to each of the agents. In particular, we assume the production function is a team production in which the productivity of each individual's action is determined by other individuals' levels of effort.

Holmstrom points out the desirability of three characteristics of an incentive system — and then shows that they are logically inconsistent. First, Holmstrom examines the Nash equilibrium outcome of an incentive system. At such an equilibrium, each individual will find that he or she could not do better by choosing a different effort level, as long as all others do not change their effort levels. Simple marginal analysis tells us that, in such an equilibrium, each person will find that his or her marginal cost of effort is exactly equal to the marginal gain; otherwise, the individual could be better off by working harder or not as hard. Second, Holmstrom stipulates that the outcome be budget balancing — that is, the incentive system should exactly distribute the revenues generated by the actors among the actors. Third, Holmstrom examines Pareto efficiency. This means that the outcome should be such that the individuals in the organization could not find a different outcome that would make them all better off.

Holmstrom shows that no budget-balancing system can create a Nash equilibrium that is also Pareto efficient. In other words, every budget-balancing incentive system will induce a social dilemma among its participants. The reason is that individuals will bring their own marginal costs of effort into equality with their own marginal gain. This means that each individual will not undertake an additional unit of effort that will produce less individual gain than individual cost — even if that extra unit of effort produces more gain for other individuals on the team.

As an example, suppose there is some individual who has a marginal revenue productivity of $12: Each unit of her own effort generates an extra $12 for the team. According to Pareto optimality, she should exert additional effort as long as the cost to her of that effort is less than or equal to $12; each such unit of effort generates more revenue for the team than it costs her as an individual. The only way to motivate her is to make sure that she gets all of the marginal revenue of her last unit of effort. In a team, it is impossible for this to be the case for every individual, as long as the incentive system is budget balancing. If everyone gets all of the last dollar produced, the team will have to pay out more in incentives than it generates. But if the individual gets only one-third of the marginal revenue from her actions, she will work only as long as her effort costs her less than $4 per unit. [pp. 129--130]

This suggests a rather unusual role for shareholders: they provide a money-sink, someplace money can go other than those actually involved in production. This means that the economic mechanism no longer has to be budget-balancing, which actually makes efficiency possible. Miller suggests that this is one reason why the modern public corporation, with its separation between legal ownership (by stockholders) and day-to-day control (by managers) can work, to the extent that it does. It is precisely because the shareholders are passive, with very limited influence over the actual running of the corporation!
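The arithmetic in Miller's example can be sketched in a few lines of Python. This is only an illustration under assumptions that are not in the quoted text: three identical workers, revenue that is linear in total effort at $12 per unit, pay that depends only on total revenue, and a quadratic cost of effort (so the marginal cost of effort e is simply e). The function names are hypothetical.

```python
def chosen_effort(marginal_share, marginal_revenue):
    """Effort a self-interested worker picks when she receives marginal_share
    of each extra dollar of team revenue. Assumes the cost of effort e is
    e**2 / 2, so marginal cost is e; she stops where it equals her marginal gain."""
    return marginal_share * marginal_revenue


def team_surplus(efforts, marginal_revenue):
    """Total revenue minus total effort cost, under the same assumed cost function."""
    revenue = marginal_revenue * sum(efforts)
    cost = sum(e ** 2 / 2 for e in efforts)
    return revenue - cost


N_WORKERS = 3
MARGINAL_REVENUE = 12.0  # dollars of team revenue per unit of effort, as in Miller's example

# Budget-balanced: the workers' marginal shares must sum to 1; split them equally.
balanced_efforts = [chosen_effort(1 / N_WORKERS, MARGINAL_REVENUE)] * N_WORKERS

# With a money-sink outside the team, each worker can be given the whole
# marginal dollar, so the marginal shares sum to 3 rather than 1.
sink_efforts = [chosen_effort(1.0, MARGINAL_REVENUE)] * N_WORKERS

print("effort per worker, budget-balanced:", balanced_efforts[0])
print("effort per worker, with a sink:    ", sink_efforts[0])
print("team surplus, budget-balanced:", team_surplus(balanced_efforts, MARGINAL_REVENUE))
print("team surplus, with a sink:    ", team_surplus(sink_efforts, MARGINAL_REVENUE))
```

Under these assumptions each worker stops at about 4 units of effort in the budget-balanced case, against 12 when she keeps the full marginal dollar, and the team's surplus is about 120 rather than 216. The gap between the two is the social dilemma Miller describes; closing it requires marginal payments that add up to more than the marginal revenue, which is where an outside party has to come in.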

Today's modest proposal — and I should make it very clear that Miller suggests nothing of the kind — is to take this separation of functions even further. Shareholders can use their legal ownership to intervene in the running of the company, though it is hard (and managers try to make it harder). By doing so, however, they become players in the team-production game, and lose their useful role as a money-sink. To limit this danger, while retaining the advantages of competitive markets for capital allocation and corporate control, I suggest the following. A substantial fraction — say three-quarters — of all profits of publicly-held corporations are to be paid to a new institution, which we might call the National Mutual Fund. (Closed corporations and partnerships are exempt.) Once a year, the Fund would pay out its accumulated profits as dividend checks, giving an equal amount to every adult citizen. And that's it.

Substantially reducing the flow of dividends associated with stock ownership should cause a large one-time shock to the level of the stock market. (Roughly speaking, shares should drop by about 3/4.) However, because the Fund collects uniformly, it should not distort relative prices, which are what matter for purposes of capital allocation. The net worth of stock-holders, likewise, will suffer a one-time drop, but this will be partially compensated for by their receiving payments from the Fund in the future. Anyway, lots of things affect the value of stock holdings; it's not like someone purchased their labor with a promise of future benefits, and then tried to back out of a freely-entered contract when it came time to pay up.
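A back-of-the-envelope sketch of that claim, in Python. It assumes a deliberately crude valuation model that is not part of the proposal itself: every firm pays out all its profits as a constant perpetual dividend, investors discount at a single common rate, and nobody changes their behavior in response to the Fund. The firms and numbers are made up.

```python
def share_price(profit_per_share, discount_rate, fund_fraction=0.0):
    """Present value of a constant perpetual dividend stream (price = dividend / rate),
    after the Fund has collected fund_fraction of profits."""
    dividend = (1.0 - fund_fraction) * profit_per_share
    return dividend / discount_rate


# Two hypothetical firms; annual profit per share in dollars.
firms = {"A": 8.0, "B": 2.0}
RATE = 0.05  # assumed common discount rate

before = {name: share_price(p, RATE) for name, p in firms.items()}
after = {name: share_price(p, RATE, fund_fraction=0.75) for name, p in firms.items()}

for name in firms:
    print(name, "before:", before[name], "after:", after[name],
          "ratio:", after[name] / before[name])
print("relative price A/B before:", before["A"] / before["B"])
print("relative price A/B after: ", after["A"] / after["B"])
```

In this toy model every share falls to a quarter of its former price, while the ratio of any two prices is unchanged, because the same factor multiplies every dividend stream. Anything that makes the Fund's take non-uniform across firms would break the second property.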

A further wrinkle would be to curb the practice of retained earnings. These account for a huge fraction of corporate capital formation, but they are also one of the ways in which managements escape market discipline. (For some figures on this, see Henwood's Wall Street, pp. 72--76.) I suppose one could make a Hayekian argument in favor of the practice, but, really, if management can make a good case that a pet project will earn at least a normal rate of return, it shouldn't be hard for them to raise funds on the open capital market, and if they can't make such a case, it's hard to see how they'd be discharging their fiduciary duties to shareholders by pursuing it anyway. This reform, I should add, is logically separate from that of instituting the National Mutual Fund. However, since corporations would pay more out in dividends, it would tend to increase the value of shares, reducing the shock to the level of the stock market.

It is hard to see why the actions of the National Mutual Fund could not be at least as rule-bound and de-politicized as those of a central bank run by skilled technocrats. Indeed, it would seem easier to reduce the discretion of the Fund's officials to the vanishing point, and to strictly keep it from meddling with the affairs of any corporation, which would be deeply counter-productive. For their part, the citizens receiving the dividends would get the benefits of "portfolio diversification in their income", but their incentives to meddle politically with individual firms, even quite large firms, would be quite muted. Moreover, they would have a direct and tangible incentive in the health of the corporate sector as a whole, making them less likely to support market-distorting measures to benefit particular firms, geographical regions or industrial sectors. We would move, in short, towards a true ownership society.

Modest Proposals; The Dismal Science

Posted by crshalizi at July 31, 2006 15:38 | permanent link

July 30, 2006

Brunch in the Ruins

Cross-posted to Crooked Timber.

It's a hot, lazy Sunday, which seems like a good time for browsing through livejournal communities dedicated to photos of peacefully rusting machines, quietly crumbling buildings, and similar modern ruins:

Abandoned Places [via David Chess]
Decayed Machinery [via I forget who, years ago]

The photographers are all amateurs, so the quality (to the slight extent I can judge) is quite variable, but many manage to capture the suggestion of sunset and sadness, of unhappy stories brought to a close, which fascinates me about such scenes. Some of these photos, in fact, seem as good as, say, those in Terry Evans's book on the former Joliet Arsenal, Disarming the Prairie, bringing to mind the words of the poet:

These are the halls of the dead, where the spiders spin and the great circuits fall quiet, one by one.
— But I see I'm getting melodramatic, and it's just too hot and sticky and still to sustain that.

Update: John Burke (who needs to revive his blog) writes to point to Jef Poskanzer's great industrial archaeology page, with many fine pictures and links.

Linkage

Posted by crshalizi at July 30, 2006 14:59 | permanent link

July 29, 2006

Inducing Disorientation in Larval Economists

Cross-posted to Crooked Timber
As a good neoclassical, neoliberal economist, Brad DeLong is acutely aware that the market system is not natural at all, but a delicate historical anomaly. He is worried that it is so familiar to his students that they will find alternate modes of social organization almost incredible; accordingly he wants to mess with their heads:
Would making Berkeley's first-year economics Ph.D. graduate students this fall read short biographies of William Gates and William Marshall as a way of getting at the idea that there are non-market societies that work very differently from our own today--would that be a teaching idea of extraordinary brilliance or of total insane lunacy?
The rest of the post is an extended excerpt from the New York Review of Books review of a biography of William Marshal (which goes on to my to-read list). The question I have is, what should DeLong make his students read, to give them a vivid sense of just how differently production and distribution could be and have been organized? Argonauts of the Western Pacific, perhaps? Gilgamesh?

And: those of us who teach things other than economics, what books do or should we hand out as ice-axes for our students' frozen seas? (This one is mine.)

The Dismal Science; The Great Transformation

Posted by crshalizi at July 29, 2006 17:43 | permanent link

July 28, 2006

Frederick Mosteller Is Dead

Cross-posted to Crooked Timber

Via everyone in the profession: the statistician Frederick Mosteller has died. Mosteller was one of the great leaders of the generation of statisticians in which our field went from being an annex of mathematics (as it was when he attended Carnegie Tech) to an autonomous, institutionalized discipline. He had an astonishing range as a researcher, but is perhaps best known for his work on stochastic theories of learning and the authorship of the Federalist Papers. He was also a notable teacher, as his essay "Classroom and Platform Performance" suggests, and in the later part of his career tried to bring elementary inferential hygiene to educational research. More anecdotes are available from Tales of the Statisticians, or this brief sketch by his student Stephen Fienberg.

Enigmas of Chance

Posted by crshalizi at July 28, 2006 09:42 | permanent link

July 26, 2006

How to Make Our Ideas Clear — to Others

Cross-posted to Crooked Timber
In the comments to my post on Onsager, Maynard Handley explains why he finds himself somewhat unsympathetic, as Onsager apparently did not expend the effort necessary to make himself understood by others.
You, the author of the paper, have a responsibility to make your ideas comprehensible. If the first method you choose to explain them fails, then you listen to what people say about where they lost all track of understanding and write a new paper with NEW explanations, not the same explanations that failed last time only renumbered. ... [This is] not something that is drilled into young scientists; that it is YOUR responsibility to make your ideas clear to others, not their responsibility to try to figure out what you are talking about. As science grows ever larger and ever more complex, I think it is time for all scientists to be much more explicit and much more ruthless on this point.
Whether this is really a fair criticism of Onsager, I couldn't say, but the general point is true, important, and a perfect hook for the next thing I wanted to post about.

Science is a social, collaborative process, so part of being a good scientist is effective communication. Scientific communication is overwhelmingly written communication (scientific disciplines are, in a sense, literary communities), so part of being a good scientist is being a good writer. Unfortunately, scientists get little training in writing, and much of that consists of being advised to follow the rules found in horrid little compendia. Fortunately, there is some actual research on effective written communication, that is, on how to arrange your words so that their readers tend to acquire clear notions of your ideas. The best practical guide here, I've found, is Joseph Williams's Style: Toward Clarity and Grace. However, I have just discovered (via Paradise Blogged) a fine essay by George Gopen and Judith Swan, "The Science of Scientific Writing", which gives a clear yet concise presentation of the work. (Gopen and Williams are collaborators.) Here is their own summary of how to be clear:

  1. Follow a grammatical subject as soon as possible with its verb.
  2. Place in the stress position the "new information" you want the reader to emphasize.
  3. Place the person or thing whose "story" a sentence is telling at the beginning of the sentence, in the topic position.
  4. Place appropriate "old information" (material already stated in the discourse) in the topic position for linkage backward and contextualization forward.
  5. Articulate the action of every clause or sentence in its verb.
  6. In general, provide context for your reader before asking that reader to consider anything new.
  7. In general, try to ensure that the relative emphases of the substance coincide with the relative expectations for emphasis raised by the structure.
If these rules, and the notions behind them, are valid, they should apply to more than just scientific writing; in particular, they should hold for other kinds of academic prose. Looking at their examples of revising scientific writing, expressing the same complicated and precise ideas in more easily grasped ways, I couldn't help but be reminded of the humanistic "bad writing" controversy, where the case for the defense often seems to rest on complexity, and the example of scientific jargon. The examples of Gopen and Swan, Williams, etc., show that those defenses do not hold. Scholars of the humanities may have reasons for being unclear which don't apply to scientists, but I can't think of any good ones.

The Commonwealth of Letters

Posted by crshalizi at July 26, 2006 00:53 | permanent link

July 25, 2006

The Nobel Prize Winner as Neglected Genius

Cross-posted to Crooked Timber.

A staple of bad movies and trashy novels about scientists (i.e., the kind I read) is the neglected genius whose ideas are rejected with incomprehension by the scientific establishment during his life, because they are simply Too Far Ahead Of His Time to be grasped by lesser mortals, only for the scientific community to rediscover these insights decades later. This sort of thing can make for entertaining fiction (if dreary self-mythologization), but it simply doesn't happen all that often in real life, especially not when the hero is a part of the establishment, and indeed a much-honored one. It certainly doesn't show up, with documentary evidence, in the staid, reliable pages of Reviews of Modern Physics. Nonetheless:

Gregory L. Eyink and Katepalli R. Sreenivasan, "Onsager and the theory of hydrodynamic turbulence", Reviews of Modern Physics 78 (2006): 87--135; no free copy
Abstract: Lars Onsager, a giant of twentieth-century science and the 1968 Nobel Laureate in Chemistry, made deep contributions to several areas of physics and chemistry. Perhaps less well known is his ground-breaking work and lifelong interest in the subject of hydrodynamic turbulence. He wrote two papers on the subject in the 1940s, one of them just a short abstract. Unbeknownst to Onsager, one of his major results was derived a few years earlier by A. N. Kolmogorov, but Onsager's work contains many gems and shows characteristic originality and deep understanding. His only full-length article on the subject in 1949 introduced two novel ideas — negative-temperature equilibria for two-dimensional ideal fluids and an energy-dissipation anomaly for singular Euler solutions — that stimulated much later work. However, a study of Onsager's letters to his peers around that time, as well as his private papers of that period and the early 1970s, shows that he had much more to say about the problem than he published. Remarkably, his private notes of the 1940s contain the essential elements of at least four major results that appeared decades later in the literature: (1) a mean-field Poisson-Boltzmann equation and other thermodynamic relations for point vortices; (2) a relation similar to Kolmogorov's 4/5 law connecting singularities and dissipation; (3) the modern physical picture of spatial intermittency of velocity increments, explaining anomalous scaling of the spectrum; and (4) a spectral turbulence closure quite similar to the modern eddy-damped quasinormal Markovian equations. This paper is a summary of Onsager's published and unpublished contributions to hydrodynamic turbulence and an account of their place in the field as the subject has evolved through the years. A discussion is also given of the historical context of the work, especially of Onsager's interactions with his contemporaries who were acknowledged experts in the subject at the time. 
Finally, a brief speculation is offered as to why Onsager may have chosen not to publish several of his significant results. [My links.]

Nobody outside of statistical physics (and maybe physical chemistry) has heard of Onsager, but he was indeed a great physicist, albeit in a very technical, non-flashy way. By the time he did this work on turbulence, he was already well-known in statistical mechanics for the analytical solution of the Ising model, his theory of phase transitions in liquid crystals, and, perhaps most importantly, a pair of classic papers from 1931 which basically founded modern irreversible thermodynamics, for which he would eventually win the Nobel Prize. (Eyink and Sreenivasan give a fuller discussion of his accomplishments, including the Onsager-Machlup theory of non-equilibrium processes, on which Eyink himself has done important work.) We're definitely not talking about some marginal figure cut off from the scientific community.

Nonetheless, his attempts to get people to pay attention to these ideas on turbulence were singularly unsuccessful. The reaction of Theodore von Kármán, a deservedly great name in fluid mechanics, was to describe it (in a letter to his student C. C. Lin) as "somewhat 'screwy' "; Onsager also corresponded with Lin, who replied in the classic manner of someone wanting to put an end to a conversation (quoted on p. 117): "I am sorry to say that I have not made much progress, except that I desire still more to see something done in this line to bring your ideas down to my level of understanding." As for the statistical physicists, Eyink and Sreenivasan describe their reaction as one of "polite incomprehension" (except on the part of von Neumann — in an unpublished report). The fact that one of Onsager's letters describing his ideas (reproduced as Appendix A in this paper) is headed "The little vortices who wanted to play", and begins "Once upon a time there were n vortices of strengths K1, ... , Kn in a two-dimensional frictionless incompressible fluid" probably didn't help, either. Most of all, a combination of discouragement over this reception, a tendency to be a slow and perfectionist author, and having scads of major research projects going simultaneously kept Onsager from even trying to publish any of this material.

The moral, I hope, is clear: statistical physicists who wander into other areas of science, and find their ideas dismissed by the best experts on those subjects, should nonetheless publish in Physical Review, in a "Fools! I'll show them all!" spirit, provided they are Lars Onsager.

(It's interesting that this paper was written by two physicists active in this area, rather than by a historian of science. It seems doubtful to me that a historian, reading the relevant materials in the Onsager archives, would have realized that there was a story here, unless they were familiar with modern work on turbulence at a deeply technical level — unless they had "contributory" as well as "interactional" expertise. And if anyone had gone over those archives around 1990, before these ideas were re-discovered, what would they have made of it?)

Manual trackback: In the Pipeline

Physics

Posted by crshalizi at July 25, 2006 15:20 | permanent link

Two Menaces to the Keystone State

Cross-posted to Crooked Timber.

Two of my more public-spirited fellow citizens have recently identified looming threats to our own Commonwealth of Pennsylvania.

  1. Our beloved junior senator, Rick Santorum (via Pharyngula):
    Most scientists unfortunately, those that certainly are advocating for this [embryonic stem cell research], and many others feel very little moral compulsion. It's a utilitarian, materialistic view of doing whatever they can do to pursue their desired goals.

    I, for one, will be happier voting on Mr. Santorum's re-election in November, knowing that my ballot will play a part in the age-old struggle between utilitarian materialism and deontological idealism, as well as the sagas of human-canine relations and Old Corruption.

  2. Our beloved linguistics professor, Mark Liberman:
    More than a third of all Pennsylvanians are native speakers of a language other than English — and many of them have not even tried to learn English since immigrating, or at least prefer to carry out their daily lives in another language, living together in neighborhoods where their native language dominates. Some people worry that the majority status of English is critically endangered. 25 years ago, a major political figure warned that these "aliens ... will never adopt our language or customs, any more than they can acquire our complexion", and so far, his prediction seems to be right on the money.

    And let's not forget what they've done to our cooking!

The Running-Dogs of Reaction

Posted by crshalizi at July 25, 2006 12:12 | permanent link

July 24, 2006

Chronicle on Cole

Cross-posted to Crooked Timber, hence the parting shot about comments.

Under the rubric "Can Blogging Derail Your Career?", the Chronicle of Higher Education has seven bloggers discussing Yale's decision to not hire Juan Cole as a professor of history, and the role, if any, played by his blog in that decision: Siva Vaidhyanathan, Dan Drezner, Brad DeLong, Michael Bérubé (all: yay!), Glenn Reynolds and Ann Althouse (both: hiss), and Erin O'Connor (null result), with a "response" by Cole, which doesn't actually address the others' posts specifically, and reads like a separate essay on the same subject as the others. (Via DeLong.)

(Some of the things which were written about Cole as part of the controversy (e.g.,) give the impression of a professor who attains incomprehensibility not through obscurity but through foaming at the mouth. As it happens, though, I sat in on his seminar on millenarian movements when I was a post-doc at Michigan, and nothing could be further from the truth. I suppose I could have missed all the sessions which degenerated into hours-long rants about Zionist Entities... Of course, I don't know why Yale didn't give him the job, but if it was because they thought he was too spittle-flecked to be presentable to parents and alumni, they were misinformed.)

The fact that this post is not filed under "Middle East Politics" isn't going to stop anyone in the comments, is it?

Learned Folly

Posted by crshalizi at July 24, 2006 22:01 | permanent link

Critical Sensation

Attention conservation notice: 1500 words on psychophysics and the statistical mechanics of disordered excitable media. Also, it was cross-posted to Crooked Timber, where I am guest-blogging this week, so you've seen it already.

First off, I should thank Henry and the rest of the Timberites for the kind invitation to guest-post, and that very warm introduction. In exchange, I'm going to blog more or less as I usually would, only here. This means some big bricks of posts about "complex systems", so called, which is or was my scientific field, more or less; and also any miscellaneous outrages which catch my eye this week. Mounting my usual hobby-horses on this stage is a poor exchange for their generosity, but mounting hobby-horses is why I started blogging in the first place, and anyway I'm big on conscienceless exploitation of cooperators.

Today I want to talk (below the line) about some recent work in the statistical mechanics of disordered systems, which might help explain how our sense organs work, and actually involves some good uses of self-organized criticality and power laws; tomorrow or the day after I'll get to the smoldering question of "Why Oh Why Can't We Have Better Econophysics?"


Folklore says that the dark-adapted human eye can detect a single photon; this isn't quite true, but we can consciously detect a few tens of photons, and some species are that sensitive. Of course, we can see not only in the dark but also during broad daylight, but then the number of photons falling on every part of the retina is huge; the eye isn't overwhelmed and saturated, though now one or ten photons more or less makes no discernible difference. In the jargon, the eye, and the other sensory organs, have both a large "dynamic range" (we can see in the dark and in the daylight), and "nonlinear response" (changes which are noticeable in the dark aren't against a high-intensity background). Some version of these facts, including the basic (power-law) form of the relationship between physical stimulus intensity and perceived sensory magnitude, has been known since the nineteenth century. This makes it all the more puzzling that sensory neurons show a linear response over a narrow dynamic range, beyond which they saturate.

You could evade this difficulty by having lots of neurons with different operating ranges, so that raising stimulus intensity saturated some but activated others. The problem is that there doesn't seem to be that wide a spectrum of operating ranges for individual neurons. In a recent paper, Osame Kinouchi and Mauro Copelli (who blog together at Semciência) offer another way, which has to do with the way sensory neurons interact with each other in a network.

Osame Kinouchi and Mauro Copelli, "Optimal dynamical range of excitable networks at criticality", Nature Physics 2 (2006): 348--351; free preprint version at q-bio.NC/0601037 *
Abstract: A recurrent idea in the study of complex systems is that optimal information processing is to be found near phase transitions. However, this heuristic hypothesis has few (if any) concrete realizations where a standard and biologically relevant quantity is optimized at criticality. Here we give a clear example of such a phenomenon: a network of excitable elements has its sensitivity and dynamic range maximized at the critical point of a non-equilibrium phase transition. Our results are compatible with the essential role of gap junctions in olfactory glomeruli and retinal ganglionar cell output. Synchronization and global oscillations also emerge from the network dynamics. We propose that the main functional role of electrical coupling is to provide an enhancement of dynamic range, therefore allowing the coding of information spanning several orders of magnitude. The mechanism could provide a microscopic neural basis for psychophysical laws.

Neurons, like muscle cells, are "excitable", in that the right stimulus will get them to suddenly expend a lot of energy in a characteristic way — muscle cells twitch, and neurons produce an electrical current called an action potential or spike. Kinouchi and Copelli use a standard sort of model of an excitable medium of such cells, which distinguishes between the excited state, a sequence of "refractory" states where the neuron can't spike again after it's been excited, and a resting or quiescent state when the right input could get it to fire. (These models have a long history in neurodynamics, the study of heart failure, cellular slime molds, etc.) Normally, in these models the cells are arrayed in some regular grid, and the probability that a resting cell becomes excited goes up as it has more excited neighbors. This is still true in Kinouchi and Copelli's model, only the arrangement of cells is now a simple random graph. Resting cells also get excited at a steady random rate, representing the physical stimulus.
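For readers who think better in code, here is a minimal sketch of this kind of excitable network in Python. It is a toy written from the verbal description above, not the authors' code: the uniform transmission probability, the synchronous update, the graph construction, and every name and parameter value are assumptions made for illustration.

```python
import random


def make_random_graph(n_cells, mean_degree, rng):
    """Simple random graph: each pair of cells is linked independently,
    with probability chosen so cells have mean_degree neighbors on average."""
    p_link = mean_degree / (n_cells - 1)
    neighbors = [[] for _ in range(n_cells)]
    for i in range(n_cells):
        for j in range(i + 1, n_cells):
            if rng.random() < p_link:
                neighbors[i].append(j)
                neighbors[j].append(i)
    return neighbors


def step(states, neighbors, p_transmit, p_stimulus, n_states, rng):
    """One synchronous update of every cell.
    State 0 is resting, 1 is excited, 2 .. n_states-1 are refractory."""
    new_states = []
    for i, s in enumerate(states):
        if s == 0:
            # A resting cell can be excited by the outside stimulus...
            fired = rng.random() < p_stimulus
            # ...or by any currently excited neighbor, each trying independently,
            # so more excited neighbors means a higher chance of firing.
            for j in neighbors[i]:
                if states[j] == 1 and rng.random() < p_transmit:
                    fired = True
            new_states.append(1 if fired else 0)
        else:
            # Excited and refractory cells just march through the cycle back to rest.
            new_states.append((s + 1) % n_states)
    return new_states


rng = random.Random(0)
N_CELLS = 200
neighbors = make_random_graph(N_CELLS, mean_degree=10, rng=rng)
states = [0] * N_CELLS
activity = []
for _ in range(100):
    states = step(states, neighbors, p_transmit=0.1, p_stimulus=0.01,
                  n_states=3, rng=rng)
    activity.append(states.count(1) / N_CELLS)
print("mean fraction of excited cells:", sum(activity) / len(activity))
```

The product of `p_transmit` and the mean degree is the average number of neighbors that one excited cell tries to excite; turning `p_stimulus` up and down while holding that product fixed gives a crude response curve for the network as a whole.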

Kinouchi and Copelli argue that the key quantity in their model is how many cells are stimulated into firing, on average, by a single excited cell. If this "branching ratio" is less than one, an external stimulus will tend to produce a small, short-lived burst of excitation, and there will be no spontaneous activity; the system is sub-critical. If the branching ratio is greater than one, outside stimuli produce very large, saturating waves of excitation, and there's a lot of self-sustained activity, making it hard to use a super-critical network as a detector. At the critical point, however, where each excited cell produces, on average, exactly one more excited cell, waves of excitation eventually die out, but they tend to be very long-lived, and in fact their distribution follows a power law.
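A minimal sketch of the branching-ratio argument, and emphatically not the authors' network model: each excited cell tries to excite each of `k` hypothetical neighbors independently with probability `sigma / k`, so `sigma` plays the role of the branching ratio. The function names, the value of `k` and the size cap are all assumptions made for illustration.

```python
import random

def avalanche_size(sigma, k=10, cap=10_000, rng=random):
    """Total number of cells excited by one initial excitation.

    Each excited cell independently excites each of k (hypothetical)
    neighbors with probability sigma / k, so sigma is the branching ratio.
    Avalanches are cut off once they reach `cap` cells, so that
    super-critical runs still terminate.
    """
    p = sigma / k
    active, total = 1, 1
    while active > 0 and total < cap:
        # Count the newly excited cells in the next generation.
        new = sum(1 for _ in range(active * k) if rng.random() < p)
        total += new
        active = new
    return total

def mean_avalanche_size(sigma, runs=20_000, seed=0):
    """Average avalanche size over many independent single-cell stimuli."""
    rng = random.Random(seed)
    return sum(avalanche_size(sigma, rng=rng) for _ in range(runs)) / runs
```

For a sub-critical ratio the mean size should come out close to 1/(1 - sigma), the standard branching-process result; as sigma approaches one the sizes become heavy-tailed and the mean is increasingly set by the cap.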

(People who teach courses on random processes are very fond of branching processes, because the basic model can be solved exactly with hundred-year-old math, but there are endless ramifications, and some of the applications are very technically sweet. Like most mathematical scientists, Kinouchi has certain tools he tends to return to, and critical branching processes are one of them.)

As Kinouchi and Copelli say in their abstract, the idea that the critical point, where things are just about to go unstable, is a useful place for processing or transmitting information is a persistent theme of complex systems. (You could, arguably, even trace a version of the idea back to William James's Principles of Psychology.) It has also, before this, been one of the weakest of our ideas. The original work from the 1980s on "evolving to the edge of chaos" has proved impossible to replicate, I would even say experimentally refuted. (Why the phrase and idea continue to propagate is another question for another time.) Stu Kauffman's studies of models of gene regulatory networks certainly suggest that information moves through these most easily near their critical point, but I don't think anyone has done a careful information-theoretic analysis of that. In any case, E. coli doesn't care about the bandwidth of its regulatory network: it cares about reliably making lactase when it only has lactose to eat, i.e., specific adaptive functions. Prior to this, I can only think of one situation where the idea has been made precise and has strong evidence to back it up (namely, this paper), but that's a purely mathematical exercise of no biological relevance.

What Kinouchi and Copelli have done is very different: they've actually identified something biologically important which is maximized at the critical branching ratio, namely the dynamic range. The network as a whole responds to the stimulus, and its dynamic range can be many orders of magnitude wider than that of its component cells. It is this enhancement which is maximized at the critical branching ratio, and falls off sharply for networks which are even a little sub- or super- critical. As a bonus, the shape of the response function is of the correct power-law form, though, in their model, the exact exponent isn't right. Modifying the network structure, or some model details, changes the exponent, but the dynamic range is still sharply peaked at the critical branching ratio.
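To make "dynamic range" concrete, here is a sketch under stated assumptions. As I recall the usual convention in this literature, the range is measured in decibels as 10 log10(r_0.9 / r_0.1), where r_x is the stimulus level that produces a fraction x of the way from minimum to maximum response; treat that definition as an assumption, not something stated in the post. The saturating power-law response curve below is likewise a hypothetical stand-in, not the response function of their model.

```python
import math

def stimulus_for_response(response, target, lo=1e-12, hi=1e12):
    """Find r with response(r) == target by bisection on a log scale.

    Assumes `response` is increasing on [lo, hi].
    """
    for _ in range(200):
        mid = math.sqrt(lo * hi)  # geometric midpoint
        if response(mid) < target:
            lo = mid
        else:
            hi = mid
    return math.sqrt(lo * hi)

def dynamic_range_db(response, f_min=0.0, f_max=1.0):
    """Assumed definition: 10 * log10(r_0.9 / r_0.1), in decibels."""
    r10 = stimulus_for_response(response, f_min + 0.1 * (f_max - f_min))
    r90 = stimulus_for_response(response, f_min + 0.9 * (f_max - f_min))
    return 10 * math.log10(r90 / r10)

def power_law_response(m):
    """Hypothetical saturating response that grows like r**m for weak stimuli."""
    return lambda r: r**m / (1 + r**m)

linear_range = dynamic_range_db(power_law_response(1.0))
sublinear_range = dynamic_range_db(power_law_response(0.5))
```

For this family of curves the range works out to 10 log10(81) / m decibels, so halving the exponent doubles the range. That is one way to see why the low-stimulus exponent of the response function matters for coding stimuli over many orders of magnitude.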

There are a lot of other nice things about this paper, which I won't get into lest I repeat it all, but I will point out one thing: while their central qualitative results are pretty robust to small tweaks, there are some details of their model which make it a fair caricature of some excitable media, but not all. These are quite deliberately matched to properties of the olfactory system and the retina, but wouldn't work in, say, the cortex, where the dynamics of excitation are different. So this isn't an "over-universal" model, but one of particular phenomena produced by particular mechanisms. In fact, looking at olfaction, they are able to make a prediction about the effects of knocking out specific genes which are involved in the fast, symmetrical electric couplings they assume. Nobody seems to have done those experiments yet, but, at least to this non-biologist, they seem feasible, and, now, very interesting.

*: Here's an anecdote illustrating how broken academic publishing is. Kinouchi and Copelli work at the University of São Paulo, which doesn't, for reasons of economy, subscribe to Nature Physics. To get an electronic copy of their own published paper, they were forced to write correspondents at other universities. I couldn't help them, because my school doesn't feel like it can afford to subscribe to Nature Physics either.

Complexity; Minds, Brains, and Neurons

Posted by crshalizi at July 24, 2006 16:20 | permanent link

July 20, 2006

George Hersey's The Monumental Impulse: A Declaration of Defeat

Attention conservation notice: 1600 words of pedantic whining about how a book on architecture didn't meet my parochial expectations. Also, it was mostly written and then abandoned to the gnawing criticism of the mice over a year ago.

I give up; it's got me beat.

I asked the good people at the MIT Press for a review copy of this, this thing, years and years ago, back when I was still regularly writing book reviews, because it sounded cool.

We humans owe an immense architectural debt to many other species. Indeed, the first hexagons humans saw may have been in honeycombs, the first skyscrapers termitaries (termite high-rises), and the first tents those of African weaver ants. In The Monumental Impulse, art historian George Hersey investigates many ties between the biological sciences and the building arts. Natural building materials such as wood and limestone, for example, originate in biological processes. Much architectural ornament borrows from botany and zoology. Hersey draws striking analogies between building types and animal species. He examines the relationship between physical structures and living organisms, from bridges to mosques, from molecules to mammals.

Insects, mollusks, and birds are given separate chapters, and three final chapters focus on architectural form and biological reproduction. Hersey also discusses architecture in connection with the body's interior processes and shows how buildings may be said to reproduce, adapt, and evolve, like other inanimate or "nonbiotic" entities such as computer programs and robots. The book is both learned and entertaining, and is abundantly illustrated with fascinating visual comparisons.

It would have been cool, too, if Hersey weren't an ignorant idiot. Oh, I'm pretty sure he's OK when it comes to purely architectural matters — though how would I know if he wasn't? What drives me up the wall every time I try to write about this book is that he gets everything else wrong.

Let's take history and languages first. Hersey is a professor of art history, and so I don't think it's unfair to expect him to get these straight. But we are talking about a man who can instance a spiral design from a Byzantine church as an example of an "Islamic spiral", whatever that may be (p. 47). He doesn't seem to realize (pp. 28–29) that the early Greek architects who first defined the classical orders wouldn't have used a Latin word (capitulum) to refer to the top of a column. I'm not even sure he realizes (pp. 7–8) that Hermes Trismegistus was a Hellenistic myth. For that matter, after presenting a fanciful analogy (pp. 17–18) between the ground-plan of Lemba, a Chalcolithic village in Cyprus, c. 3900 B.C., and cross-sectional diagrams of biological cells, he writes:

Certainly the builders of these dwellings would have known similar forms in their immediate surroundings: things that, unlike true body cells, were visible to the naked eye. One prototype would be the egg, which begins as a container for a single-cell embryo embedded in the nourishing matter it will need in order to reproduce and grow. ... And then there are cell-like beehives, birds' nests, and plants. To the Greeks, moreover, and therefore maybe even to the Chalcolithic residents of Lemba, the word for cell (kutos) also meant uterus, and even the whole human body. So we must not relinquish the thought that the Lemba cells are the extended phenotype of builders whose own bodies, though they did not consciously know this, were put together similarly.
Let me try to extract everything that's broken in these sentences, and see what's left.
  1. Whatever they were speaking in Cyprus in 3900 B.C., it wasn't Greek, or even in the Indo-European family. Since kutos derives from the proto-Indo-European root *(s)keu- meaning "to cover" or "to conceal", it's irrelevant to the Lemban vocabulary and mentality. (See e.g. Calvert Watkins's Dictionary of Indo-European Roots, pp. 78–79.)
  2. He's right that kutos also meant uterus in Greek, but so what? Nobody thought mammals had eggs until the Renaissance, much less eggs somehow related to things called "cells". It's just as relevant that kutos also referred to "the fourth stomach of the ox".
  3. Suppose that the Lemban word for their houses was nothing at all like their word for "cell", and furthermore that their designs for their houses owed nothing at all to observations of eggs, birds' nests, beehives or anything like. Would that mean their buildings were not part of the builders' extended phenotypes? Obviously, the answer is "no", or at least that should be obvious to anyone who understands what "extended phenotype" is supposed to mean.
  4. He moves from "maybe" to "must not relinquish" in the space of a sentence.
However, it does seem plausible that the Lembans would have noticed that eggs, birds' nests, and their houses were all roundish.

Bad as he is on history and languages, Hersey is worse at science and mathematics, and the history of science. He has no idea what "a topology" is (p. 51; at best this is a garbling of a curved manifold). He is capable of writing about distances "increasing at a fixed angle" (p. 45; he seems to mean rate), and of "steel molecules" (p. 13). He thinks sharks are not vertebrates (p. xvii). He literally does not know what a virus is (p. 15). (He also doesn't know when they were first discovered, nor when their shapes were first determined [pp. 15–16], and so his suggestion that illustrations of the shapes of viruses influenced some of Gaudí's designs on the Sagrada Família [p. 16] is just wrong.) He doesn't realize that DNA molecules don't actually look like the simple diagrams people draw of double helices, but are bent, folded and twisted, and so resemble spiral staircases not at all (pp. 6–8). And so on, and so forth.

Let me give a last example of the kind of thing which irritates the hell out of me; it comes from very early in the book (pp. xviii-xix), but it's central to whatever attempt at an argument Hersey makes.

Homo sapiens shares something that I don't yet dare call a gene sequence for building — shares it, perhaps homologously, perhaps convergently, with other constructing creatures such as birds, crustaceans, ants, termites, and bees. I will also be claiming, as a corollary, that the shapes of our monumental shelters, whether bicycle sheds or cathedrals, reflect and often derive from the shapes first created by these other species — species that, like us, are subject to the monumental impulse. ...
But now comes a paradox: certain ants, termites, honeybees, and birds build elaborate structures. So do humans. But, as humans, we are anomalous in doing this. Only a few other mammals build — most obviously beavers and badgers. Worse still, our own closest cousins, the other primates, hardly build at all. An African termitary might remind us of Wright, of a Gaudí spire, or of a skyscraper by Hermann Obrist. But no such thoughts come to mind when we look at the rudimentary retreats of chimps and gorillas. Thus any genetic homology that brackets us with the other builder-species will have to be very ancient and, also, will have to have bypassed our immediate ancestors and cousins.
This isn't a paradox for his idea — it's a refutation. The last common ancestor of humans and termites lived before the Cambrian explosion, presumably in the oceans; whatever bizarre wormy thing it may have been, it assuredly didn't build. Even if the genes "for" building in humans and termites are both descended from the same set of genes in that remote common ancestor, they are no more homologous than flight is homologous in birds and pterodactyls, because they both independently modified vertebrate forelimbs into wings. A little reading on comparative methods, and how homologies are actually established, would have kept Hersey from wasting his and his reader's time. (Similarly for the chapter on the reproduction and evolution of architectural designs, which is completely innocent of all actual work on, say, the evolution of technology, or even on the cognitive processes of architectural representation.)

The whole book is like that: a series of conceits which a little thought or research would've shown don't work, presented as real scholarship. To be fair, Hersey sometimes allows himself a certain levity of presentation: in chapter eight he claims (basically) that people like domes because they remind them of breasts, which he illustrates by juxtaposing a picture of the Taj Mahal with one he attributes to "D-Cup Superstars, February 1992". But even then, he concludes, on no basis whatsoever, that the Taj is "an architectural threnody to the queen's breasts" (p. 155), and means us to take that seriously.

I won't say that Hersey's book is bullshit, because Hersey buys it. But I will say it's crap. It's crap in the same way that much too much of what I read by scholars in the humanities is crap: Hersey doesn't think carefully and critically, he tries to use ideas he doesn't understand, he's sloppy about facts, and he thinks he's establishing reliable conclusions when he can't argue his way out of a wet paper bag. Lest by saying this I call up the wrathful wraith of Chun the Unavoidable, I hasten to add that (1) many scholars in the humanities are, indeed, excellent and careful scholars, who do not suffer from these debilities, and (2) I think this cannot in the least be blamed on any value of post-*ism, or "theory" or anything of that sort. Certainly Hersey is not a post-*ist, and I have no reason to believe this sort of crappiness has become more common among humanists in recent years. (If anything, I'd guess that the causal arrows point from crappiness to post-*ism, rather than the other way. For this, too, however, I have no evidence.)

— But, you see, this is what always happens when I try to write a proper review of The Monumental Impulse: I end up wandering hopelessly off topic, in order to avoid having to think of all the ways the book vexed me.

Do not read this book.

Learned Folly; The Natural Science of the Human Species

Posted by crshalizi at July 20, 2006 00:45 | permanent link

July 19, 2006

"Where Alph, the sacred river, ran"

The Xanadu region of Titan has rivers:

No images, yet, of "caverns measureless to man", or a "sunless sea" of liquid methane.

(Via Beyond the Beyond.)

Update, 20 July: I should have read the press release first (via Uncertain Principles). More importantly: Venus! (via The Daily Llama).

Update, 21 July: The Quantum Pontiff wonders whether Einstein's work on the physics of rivers will hold on Titan.

The Eternal Silence of These Infinite Spaces

Posted by crshalizi at July 19, 2006 22:00 | permanent link

July 16, 2006

Sow the Wind, Reap the Drag Coefficient (Dept. of "We are as gods, and might as well get good at it")

I wrote this about a year ago; I'm cleaning out my drafts folder.

As a paid-up member of the Viridian movement, I think it would be a Very Good Thing for industrial civilization to find a power source which will let it run and grow without choking on its own waste. (I'm no good at ranting about this, so if you're curious I refer you to the master.) But it would be foolish to pretend that non-fossil-fuel power sources will not carry their own costs, including the potential to alter the climate, and something worse than foolish not to consider those costs. After all, one of the Viridian design principles advises us to "Look at the Underside First":

Legions of people are paid large sums to promote the positive aspects of commercially available products. Very few people earn their daily bread by pointing out malfunctions, bugs, screw-ups, design failures, side-effects and the whole sad galaxy of trade-offs and failings that are inherent in any technological artifact. To counteract this gross social imbalance, a wise designer and a wise critic will make it a matter of principle to look at the underside first.

In which spirit, I bring you the following.

David W. Keith, Joseph F. DeCarolis, David C. Denkenberger, Donald H. Lenschow, Sergey L. Malyshev, Stephen Pacala and Philip J. Rasch, "The influence of large-scale wind power on global climate", Proceedings of the National Academy of Sciences (USA) 101 (2004): 16115--16120 [free via open access]
Abstract: Large-scale use of wind power can alter local and global climate by extracting kinetic energy and altering turbulent transport in the atmospheric boundary layer. We report climate-model simulations that address the possible climatic impacts of wind power at regional to global scales by using two general circulation models and several parameterizations of the interaction of wind turbines with the boundary layer. We find that very large amounts of wind power can produce nonnegligible climatic change at continental scales. Although large-scale effects are observed, wind power has a negligible effect on global-mean surface temperature, and it would deliver enormous global benefits by reducing emissions of CO2 and air pollutants. Our results may enable a comparison between the climate impacts due to wind power and the reduction in climatic impacts achieved by the substitution of wind for fossil fuels.

There are basically three channels through which wind power would influence climate. The first is the direct increase of friction in the vicinity of the turbine, which dissipates kinetic energy in the wind into heat. The second is that the increased drag produced by the turbines will change the speed and direction of prevailing winds, and, since these winds carry heat and moisture, this can potentially alter the climate over large areas. The third, and most indirect, channel is that increasing use of wind power may reduce the amount of fossil fuel consumption, and interact with climate changes produced by the existing load of greenhouse gases.

The present paper is a first cut at considering all three channels, using two standard global climate models and a variety of different assumptions about where wind-turbines will be situated, how efficient they will be, and the manner and magnitude of the resulting increase in surface-layer drag. There are plenty of crude approximations made here (for instance, that the increase in drag is uniform over the affected area), but they're recognized as such. The results thus need to be treated with caution, but they are interesting. The second channel (changing atmospheric transport) has a much bigger impact than the first (direct friction); together, under plausible assumptions, they give a near-zero impact on the global mean, root-mean-square change in seasonal means of about one twentieth of a degree centigrade at one point, and peak changes of half a degree. Under further plausible, but of course more arguable, assumptions, the third channel (reduced carbon emissions) swamps these effects, too, by about a factor of five. Interestingly, the configurations for wind power they consider tend to cool the poles and warm the lower latitudes, while carbon dioxide warming has the opposite effect. This suggests (though the authors don't go there) that we might consider massive wind farms as a pure climate-change moderator, even if they weren't a practical power source from an engineering point of view.
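A toy calculation may help show how a near-zero global mean can coexist with a non-zero root-mean-square change and a sizable peak. The regional numbers below are made up purely for illustration and are not taken from the paper; a real calculation would also weight each region by its area, which this sketch ignores.

```python
import math

# Hypothetical regional temperature changes in degrees C (illustrative only):
# warming and cooling regions cancel in the mean but not in the RMS.
changes = [0.5, -0.5, 0.05, -0.05, 0.02, -0.02, 0.0, 0.0]

mean_change = sum(changes) / len(changes)
rms_change = math.sqrt(sum(c * c for c in changes) / len(changes))
peak_change = max(abs(c) for c in changes)

print(f"mean {mean_change:+.3f}, rms {rms_change:.3f}, peak {peak_change:.3f}")
```

The mean tells you about net warming or cooling, while the RMS and the peak tell you how much the pattern has been rearranged, which is the effect the second channel produces.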

It should probably be noted that the first two sponsors listed in the acknowledgments are British Petroleum and the Electric Power Research Institute.

The Great Transformation

Posted by crshalizi at July 16, 2006 05:07 | permanent link

The Awful Turkish Language

I wrote this about a year and a half ago; I'm cleaning out my drafts folder.

There are reasons why one might think that Turkey should not be admitted to the European Union, but surely the silliest must be that Turkish is not an Indo-European language. Following Phersu, I can just imagine the consequences of taking this seriously. First, the Basque-speaking provinces of France and Spain leave the EU, along with Hungary, Finland, Estonia and Malta. But then, of course, India and Pakistan will submit rival applications to join, closely followed, no doubt, by the Iraqi Kurds. The whole idea is so stupid that I can't believe it was meant seriously, or even guess what Giscard d'Estaing thought "Indo-European" meant.

That said, Turkish does have features which are absent or attenuated in (most) Indo-European languages. (Disclaimer: I do not speak Turkish.) For instance, it's highly agglutinative, forming new words by adding suffixes to roots, and doing so recursively. (German does this too, but to nowhere near the same degree.) This leads to words like yapabilecekdiyseniz, "if you were going to be able to do". (Readers may amuse themselves by analyzing this example using the Turkish Suffix Dictionary.) Moreover, these words are not oddities, like "antidisestablishmentarianistic", but in everyday use. I once heard a talk by a computational linguist specializing in Turkish — Gerjan van Schaaik, who oddly seems to have no web presence — where he mentioned that if one studied the corpus of Turkish daily newspapers, one could easily build a lexicon of 500,000 entries, and still cover only 95% of the words in the corpus. (I can't tell, from my notes, whether van Schaaik was talking about something that had actually been done, or just making a rough estimate.) This property of Turkish becomes very important for a number of technologies, including one without which the modern world would simply grind to a halt: spam filtering.

Levent Özgür, Tunga Güngör and Fikret Gürgen, "Adaptive anti-spam filtering for agglutinative languages: a special case for Turkish", Pattern Recognition Letters 25 (2004): 1819--1831 [PDF reprint via Prof. Güngör]
Abstract: We propose anti-spam filtering methods for agglutinative languages in general and for Turkish in particular. The methods are dynamic and are based on Artificial Neural Networks (ANN) and Bayesian Networks. The developed algorithms are user-specific and adapt themselves with the characteristics of the incoming e-mails. The algorithms have two main components. The first one deals with the morphology of the words and the second one classifies the e-mails by using the roots of the words extracted by the morphological analysis. Two ANN structures, single layer perceptron and multi-layer perceptron, are considered and the inputs to the networks are determined using binary model and probabilistic model. Similarly, for Bayesian classification, three different approaches are employed: binary model, probabilistic model, and advanced probabilistic model. In the experiments, a total of 750 e-mails (410 spam and 340 normal) were used and a success rate of about 90% was achieved.
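To illustrate the two-stage idea in the abstract (reduce words to roots, then classify on the roots), here is a minimal sketch. It is a generic naive Bayes classifier with binary presence/absence features and Laplace smoothing, not the authors' algorithms, and the "morphological analysis" is a toy suffix stripper using a handful of genuine Turkish suffixes. The suffix list, the function names and the four training messages are all hypothetical.

```python
import math
from collections import Counter

# Toy suffix list: nowhere near a real morphological analyzer.
SUFFIXES = ("ler", "lar", "de", "da", "im")

def root(word, min_len=2):
    """Greedily strip known suffixes while at least min_len letters remain."""
    stripped = True
    while stripped:
        stripped = False
        for suf in SUFFIXES:
            if word.endswith(suf) and len(word) - len(suf) >= min_len:
                word = word[: -len(suf)]
                stripped = True
                break
    return word

def features(message):
    """Binary model: the set of roots present in the message."""
    return {root(w) for w in message.lower().split()}

def train(messages, labels):
    """Count, per class, how many messages contain each root."""
    doc_counts = Counter(labels)
    root_counts = {label: Counter() for label in doc_counts}
    for msg, label in zip(messages, labels):
        root_counts[label].update(features(msg))
    vocab = set().union(*root_counts.values())
    return doc_counts, root_counts, vocab

def classify(message, model):
    """Return the class with the highest naive Bayes log-score."""
    doc_counts, root_counts, vocab = model
    present = features(message)
    total = sum(doc_counts.values())
    scores = {}
    for label, n_docs in doc_counts.items():
        score = math.log(n_docs / total)
        for r in vocab:
            # Laplace-smoothed chance that a message of this class contains r.
            p = (root_counts[label][r] + 1) / (n_docs + 2)
            score += math.log(p if r in present else 1 - p)
        scores[label] = score
    return max(scores, key=scores.get)

# Made-up training set: "para" (money) marks spam, "ev" (house) marks normal mail.
model = train(
    ["paralar", "parada para", "evlerde", "evlerim evde"],
    ["spam", "spam", "ham", "ham"],
)
```

The point of working on roots is that `classify("paralarda", model)` and `classify("evlerimde", model)` can be labelled sensibly even though neither surface form appears in the training messages, which is the difficulty an agglutinative language poses for a word-list filter.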

Özgür et al. do not report on the ability of their classifiers to discriminate between spam, and weirdly pseudo-learned pronouncements from former presidents of France.

Enigmas of Chance

Posted by crshalizi at July 16, 2006 04:59 | permanent link

July 07, 2006

Friday Cat Blogging (Leaving on a Jet Plane Issue of Non-Science-Geek Edition)

Speaking of air travel, as we were, Kara dreams of traveling to the Beijing Complex Systems Summer School.

Alas! She will not be gracing the Fragrant Villa with her presence. Twenty hours in planes and airports would not agree with her temperament, nor would jet-lag be pleasant for one so distressed by any interruption of her accustomed nap cycle.

As for me, I have cleverly worked on my three lectures in reverse order, so the last two are finished...

Friday Cat Blogging

Posted by crshalizi at July 07, 2006 10:00 | permanent link

July 06, 2006

Statistical Arbitrage in the Sky

Reading this piece in the New York Times about Farecast, Oren Etzioni & co.'s attempt to forecast airline ticket price movements (through the power of machine learning!) leaves me with three reactions.

First, it's a relief to read something about data-mining and airlines which isn't Yet Another Unconstitutional Step Towards a National Surveillance State. In fact, this would be cool if it works.

Second, I'm professionally curious about how well Farecast's predictions, with over 100 independent variables, would compare to simple alternatives, like low-memory hidden Markov models, or throwing out all the variables except the time remaining until departure. (Since I'm teaching data mining in the fall, I'm also professorially curious.) I suspect that, even if there is some real improvement, it is small. Come to that, the trivial predictor which always forecasts a price increase is going to set a pretty high baseline accuracy...
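The trivial baseline is easy to compute, which is part of why it is worth checking first. A small sketch, with a made-up fare history; the function name and the numbers are hypothetical, and nothing here reflects how Farecast itself is evaluated.

```python
def always_up_accuracy(prices):
    """Accuracy of a predictor that always forecasts a price increase.

    This is just the fraction of consecutive price changes that are
    increases; flat steps count as misses.
    """
    moves = list(zip(prices, prices[1:]))
    if not moves:
        raise ValueError("need at least two prices")
    return sum(1 for before, after in moves if after > before) / len(moves)

# Hypothetical fare history (dollars) for one flight as departure approaches.
fares = [300, 310, 305, 320, 340, 335, 360, 400]
baseline = always_up_accuracy(fares)
print(f"always-up baseline accuracy: {baseline:.2f}")
```

Any fancier predictor has to beat this number on the same data before its 100-odd variables can be said to be earning their keep.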

Third, I'll be surprised if it does work, not on technical grounds, but because I don't see how it makes sense for airlines to cooperate. The ideal airline pricing scheme is one which gouges you just enough that you're indifferent between taking the flight and not going at all (or taking some alternate mode of transport, etc.). This is why flights at ungodly hours are cheaper than those on the same route at decent times: by showing up at five in the morning to have your luggage prodded, etc., you signal to the airline that they really can't get any more money out of you. [1] Now suppose that you want to take a certain flight, and there's a maximum amount you'd pay for it. As things currently work, you look up the price and see what the airline is currently charging. If that price is less than its value to you, you buy the ticket, and the difference is your "consumer surplus". Now Farecast comes along and says, in effect, "sure, you could do that, but the price is going to drop --- hold on and you'll do even better." So you buy when the price hits its trough, and are better off thereby. (Yes, some people act like this now; I'd guess not many, and not very successfully.) From the airlines' point of view, however, every dollar by which your consumer surplus grows is a dollar they could have had. ("And I would have gotten away with it, too, if it hadn't been for you meddling KDDs!") Consumers and airlines are engaged in a zero-sum competition over the potential surplus, and this doesn't help the airlines.

Which, again, makes me very puzzled about why they would cooperate with it. The smart move on their side, I think, would be to systematically undermine the reliability of Farecast. This could be done very simply, without even attempting to reverse-engineer the predictor: monitor its forecasts of your own flights, and, all else being equal, do the opposite. It's true that a reliable forecast of a price increase isn't so bad, for the airlines, as a forecast of a price decrease, but systematically jamming and confusing Farecast should be easier than selectively doing so. But I defer to real economists about the importance of this wrinkle.
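The zero-sum arithmetic behind this argument can be spelled out with a toy example. The valuation and the two prices are invented for illustration, and `surplus_split` is a hypothetical helper, not anything from the Times piece.

```python
def surplus_split(valuation, price_paid):
    """Split the buyer's valuation into (consumer surplus, airline revenue)."""
    if price_paid > valuation:
        return 0, 0  # the buyer walks away, so there is no sale
    return valuation - price_paid, price_paid

valuation = 400     # hypothetical maximum the traveler would pay
price_today = 350   # hypothetical current fare
price_trough = 300  # hypothetical forecast low

buy_now = surplus_split(valuation, price_today)
wait = surplus_split(valuation, price_trough)
print("buy now:", buy_now, "wait for the trough:", wait)
```

Either way the two shares add up to the same 400 dollars; waiting for the trough simply moves 50 of them from the airline to the traveler, which is why an accurate forecast of a price drop is exactly what the airline would like to make unreliable.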

The larger moral ought to be a familiar one: in strategic interactions, you have to assume that the other side will adapt to you. This doesn't mean that statistical methods have no place in studying strategic interaction (see, e.g., the second paper here), but it does mean we should be very dubious about the ability of simple data mining to give us an advantage over an opponent as smart and determined as a commercial airline.

(Thanks to K. for sending me the article, and discussing it.)

1: If you want to understand the logic of airline pricing, among much else, a great read is Carl Shapiro and Hal Varian's Information Rules: A Strategic Guide to the Network Economy. [I have been sitting on a draft review for seven years now, and am not about to stop.] Despite the very late-1990s title, this is really about the general economic principles involved in any industry where high first-unit costs and low marginal costs give you positive economies of scale, or where there are strong positive network externalities. Airlines are in the first category, because the cost of getting a jet from New York to LA with 100 passengers is almost the same as getting it there with 101 passengers, and is mostly the cost of getting it there empty. Alas, appreciating the rational essence of the process does not help make the lived experience any more endurable.

Enigmas of Chance; The Dismal Science

Posted by crshalizi at July 06, 2006 22:00 | permanent link

July 05, 2006

Topping from Below

Nature, in its wisdom, has compiled a list of the fifty most popular weblogs by scientists. I am more than a bit astonished to see this blog on the list, but my faith in the reading public is bolstered by seeing that it's only in the fiftieth place, and everything ahead of it that I recognize definitely deserves to be. (Via Aetiology [#7] and Pharyngula [#1].)

Linkage; Self-Centered

Posted by crshalizi at July 05, 2006 16:05 | permanent link

The Sharks of DARPA

I wrote this about a year ago; I'm cleaning out my drafts folder.

As every schoolchild knows, what became the Internet was at first ARPANET, one of the many projects by which the Pentagon's Advanced Research Projects Agency nurtured the development of computer science and technology in the US. ARPA eventually became DARPA, but for a long time retained its character of a patron of basic and curiosity-driven research. Lately, however, it seems to have lost its touch, what with the whole terrorism futures markets debacle, "total information awareness", and so on. (I myself was for several years sustained, as a graduate student and post-doc, by my adviser's DARPA grant, familiarly known as cooperative agreement F30602-00-2-0583, part of the TASK program. Of course, it's not for me to say where our work fell in the spectrum from foundational to flaky.) The sad but predictable response has been to demand that DARPA de-emphasize basic academic research in favor of working with private-sector contractors for short-term military payoff.

All of which is by way of lead-in for this post to the connectionists mailing list:

Postdoctoral position in neurobiology / engineering in Woods Hole

A 4-year DARPA research project, funded annually, to steer the behavior of sharks in the natural environment through stimulation of selected sensory brain areas. Expertise in brain stimulation, multi-electrode recording and neural data analysis most desirable. Interfacing with wireless data transmission and stereotactic electrode positioning.

A year earlier, a Boston Globe story had more information, but lacked the admirably matter-of-fact tone of the job-ad.

I don't see much by way of follow-up, but I haven't looked very hard. For all I know, this work is already well on its way to giving us a beloved children's classic.

Learned Folly; Minds, Brains, and Neurons

Posted by crshalizi at July 05, 2006 09:55 | permanent link

Signs I Will Not Recommend Your Manuscript Be Published As Is (No. 734)

You make me want to quote the Medium Lobster's cost-benefit analysis of pre-emptively blowing up the moon in my report.

I hope you appreciate what it costs me to resist this temptation.

Learned Folly

Posted by crshalizi at July 05, 2006 09:30 | permanent link

July 04, 2006

This, This Is My People

From John Hall and Charles Lindholm's Is America Breaking Apart? (pp. 83--90, omitting footnotes):

One thing that has often been held to characterize Americans is the ambiguity, confusion, and "contagious vagueness" of their understanding of political theory. Americans may know, for example, that citizens have rights, but are extremely unclear about what those rights might be; they know Americans are supposed to be free, but not how freedom is limited, or what freedoms are permitted; they know that "all men are created equal," but cannot reconcile that precept with the protection of property. In other words, most Americans are very proud indeed of the principles that their country is built upon, but for them those principles consist primarily of abstract notions, such as liberty, justice, and equality, rather than a systematic set of specific precepts or practices.

The major sources of "contagious vagueness" are to be discovered in the priority Americans have always given to pragmatic experience over philosophical system-building. Under the fluid circumstances of American life, ambivalence and woolly abstraction have certain advantages. ... While remaining faithful to nebulous abstractions of "freedom," "individualism," and "equality," Americans can still be pliable in action without betraying their integrity or finding themselves immobilized by contradiction. Ideological vagueness thus allows Americans to feel a sense of unity without the trouble of actually considering exactly what that unity is based upon.

The abstraction and ambiguity of taken-for-granted foundational principles also allows Americans easily to "hold contradictory ideas simultaneously without bothering to resolve the potential conflict between them." This, too, is not necessarily a bad thing in a pluralistic society, where central authority is relatively weak. The blurry quality of American assumptions about their shared creed allows them to accept innovations easily, so long as the innovations are metaphorically bathed in the aura of tradition, and to react according to circumstances without too much concern about agreement with prior positions. Most important, an ability to ignore contradictions permits Americans to overlook disputes that might tear a more ideologically consistent society apart.

Corresponding to American abstraction and vagueness in the realm of political philosophy is a positive can-do approach to ordinary problems. ... It is no surprise that the predominant American philosophy is pragmatism.... However, what pragmatism takes for granted as "common sense" is actually a culturally constructed perspective, based in large measure on what has recently been called modular thinking. This is a strategy for instrumental action which assumes that complex wholes can be broken into elementary parts; these parts can then be efficiently recombined according to need. Modular thinking is American to the core: it is an atomistic, flexible, anti-organic, and anti-authoritarian view of the world — one which dispenses with tradition in favor of efficiency, and places all alternatives on an equal footing, subject to personal evaluation by the active innovator, who decides which combination is best.

Modular thinking has had a successful history in America. It is responsible for the development of the assembly line and Taylorist innovations in scientific management, and it provides the foundational principles behind everything from the construction of shopping malls to the planning of school curricula. ... [T]he pervasive pragmatic modular approach to life permits Americans to avoid divisive ideological issues by visualizing the world around them as a machine that can be retooled, or taken apart and rebuilt, in order to achieve maximum efficiency. ... Disagreements are not over principles, but over design. Though this mechanistic instrumental worldview may remove much of the magic from the cosmos, and though it certainly does not grasp complex social realities, it is not likely to arouse great passions either — and so is conducive to social peace.

It is especially striking that for Americans even the self is considered to be a kind of modular entity, capable of being reconfigured to fit into preferred life styles. This malleability is often decried as indicating American shallowness, or else praised as the postmodern triumph of the signifier. But the American emphasis on perpetual self-transformation also serves the cause of unity, though perhaps not in the way Protestant moralists would prefer. This is because the search for identity is a notoriously solipsistic pursuit: such quests do not lead to revolution, but to harmless participation in the therapeutic, self-help, and twelve-step groups that have so mushroomed in America. At the very worst, the search for an authentic self draws the most perplexed seekers towards immersion in the multitude of sects and cults that have always sprouted on American soil. Occasionally, it is true, these groups spiral into psychosis ... but generally these new religions are akin to the "healthy-minded," "once-born" faiths that William James found so characteristic of America. They typically affirm the goodness of all creation and preach accommodation with the world as it is, stressing mental discipline, while applying the optimistic American "can-do" attitude to spiritual uplift and practical self-betterment. Membership in them is no more harmful than membership in any local PTA.

American faith in the power of individuals to change themselves is quite understandable as a product of the immigrant experience in combination with the Protestant ethos. Protestant sects believe that individuals can be spiritually transformed through disciplined, virtuous action in this world. For most of the original settlers immigration to America was just such a transformative action, a voluntary pilgrimage in search of the City on a Hill. In secular garb, this model continues to hold: becoming an American is a kind of conversion experience. The newcomer "is not required to learn a philosophy," Daniel Boorstin notes, "so much as to rid his lungs of the air of Europe." This point is not invalidated by the fact that more recent immigrants, male and female, rid their lungs of the air of China, India, and Africa. For all these newcomers, past and present, America has been the "Mother of Exiles. From her beacon-hand glows world-wide welcome." The content of the glowing welcome offered by the Statue of Liberty is not a dogma, but an opportunity. America presents itself as a place where newcomers can achieve their dreams, free at least from the chains of tradition, class, and history...

For the zealous believer of colonial times, the end sought through migration to America was a passage into heaven.... For the modern entrepreneur, the goal is likely to be far more mundane: owning one's own business and acquiring a house in an exclusive suburb. Whether what they sought was spiritual or material, immigrants to America have worried little about conceptual consistency or a systematic organization of principles. Central instead is a belief that individuals have the capacity, through personal effort, dogged discipline, and creative innovation, to leave the past behind, to pursue happiness, and to become whatever their potential allows. Only in America would the Army call on its recruits to "be all you can be."

We can feel the heady appeal of this transformative aspect of American life in a letter sent by a French migrant to California during the Gold Rush:

In the midst of this world of adventurers, who change their occupations as often as they do their shirt, egad, I did as the others. As mining did not turn out remunerative enough, I left it for the town, where in succession I became a typographer, a slater, plumber, etc. In consequence of thus finding out that I am fit for any sort of work, I feel less of a mollusk and more of a man.
For such adventurous souls, America indeed offered — and continues to offer — an opportunity for taking on a new and better identity, for making a mollusk into a human being.
They go on to talk about why we are so friendly and nice, and why we believe that love is the only legitimate bond between people, but I'm skirting the bounds of fair use as it is, even in the interest of patriotic display.

The Beloved Republic

Posted by crshalizi at July 04, 2006 17:00 | permanent link

June 30, 2006

Books to Read While the Algae Grow in Your Fur, June 2006

Chris Eliasmith and Charles H. Anderson, Neural Engineering: Computation, Representation, and Dynamics in Neurobiological Systems
This is one of the few ventures into describing how the brain works that is solid and general enough that I would call it a theory. It is however more a theory of how one would construct the optimal computational device (adaptive control system) to get an organism through the world, given that certain types of neurons are available, and as such all the limitations on optimality analysis in biology apply. They apply with extra force here, because of course the brain has to learn connections, and there are real obstacles in the way of local learning processes producing globally optimal outcomes. (It is, to steal Ashby's old title, a design for a brain, not necessarily for ours.) Still, this is really good stuff. Anyone who cares about these subjects ought to read it. (In fact, I should probably write a full-scale review...)
Alan Furst, Night Soldiers
Just a simple country boy, sailing down the river for the NKVD... I think this was his first novel, but can't quite tell, which is to its credit. (See also: earlier remarks on Furst.)
Dean Baker, The Conservative Nanny State: How the Wealthy Use the Government to Stay Rich and Get Richer
Convincing portrait of modern conservative economic policy as a series of quite blatant attempts to selectively interfere with the workings of the free market, so that certain groups enjoy economic rents at the expense of the rest of the population. (Or: class struggle, a phrase he never uses.) My biggest complaint: the best econometric studies I know (e.g., L. G. Kletzer's) say that the growth of trade accounts for about a quarter of the loss of manufacturing jobs in the US in recent decades, the rest being (pretty much) due to increased productivity, so it's not clear how much impact trade in manufactured goods, but not in professional services, has on income inequality; I wish Baker had talked about this. (Of course, manufacturing wages haven't gone up at anything like a rate corresponding to productivity gains, which is in a larger sense Baker's point. I could well imagine --- but have no evidence to back this up --- that even if trade does not account for a lot of job-loss in the U.S., the fact that employers could threaten to relocate any particular plant overseas would serve to hold down wages. And the threat would only have to be credible to employees, not actually practical.)
Full-text free online under Creative Commons, intellectual property rights being Exhibit C in Baker's case (after trade and Federal Reserve policy).
Charles C. Mann, 1491: New Revelations of the Americas Before Columbus
Popular account of recent work by anthropologists, archaeologists, geographers, etc., on the nature, extent and impact of pre-Columbian civilizations in the Americas. Well-written, mostly convincing, and good at pointing out where there are controversies and why. It did strike me as reaching in a few places (e.g., on his evidence, I fully buy that the Aztecs had a very sophisticated literary tradition, but that's not philosophy on the level of ancient China, Greece or India). Still, very much recommended.
Stephen King, The Dark Tower
The end of the story of Roland, the last gunslinger, and his quest for the Dark Tower. There are two endings; both made me want to cry. "There I will sing all their names..."

Books to Read While the Algae Grow in Your Fur

Posted by crshalizi at June 30, 2006 23:59 | permanent link

June 27, 2006

Corners Bumped

A year and a half ago, I commenced a major purge of my book collection. A year ago, when I left Ann Arbor for Pittsburgh, I consigned thirty-odd boxes of used books to Corners Bumped, a.k.a. my friends Bill and Barbara Tozier. (This tells you something about the number of books I did end up shipping here.) Recently, they have been refurbishing their online book-selling presence, listing new stuff from my boxes and their own stock, and musing (1, 2, 3, 4) on used books and the trade therein. Go buy something, will you? It will make all of us — you, me, Bill and Barbara, and not least the books — better off.

This concludes today's crass commercial plug.

Linkage; Self-centered

Posted by crshalizi at June 27, 2006 15:45 | permanent link

Problems in the Doctrine of Chances

Paradoxes and fallacies are fun teaching tools, and elementary courses in probability theory are well supplied with them. I have long thought that more advanced probability classes would be more palatable if we presented our students with more opportunities to go "Wait, that can't be right", and sink their teeth into something really just-plain-wrong. Happily, we now have about ten more such morsels to offer them:

Guang-Liang Li and Victor O. K. Li, "Inconsistencies in Current Theories of Real Numbers, Measure, Probability, and Stochastic Processes", math.GM/0606635
Abstract: Current theories of real numbers, measure, probability, and stochastic processes appear to be self-inconsistent. We exemplify the inconsistencies with Poisson processes, Markov chains, continuous random variables, the Lebesgue measure, and the Cantor-Dedekind theory of real numbers.

Anyone taking 36-754 from me next spring should expect to see sections 2 and 3 as assignments.

Enigmas of Chance

Posted by crshalizi at June 27, 2006 14:00 | permanent link

June 23, 2006

Friday Cat Blogging (The Parallelogram Paw Issue of Science Geek Edition)

Kara has a charming habit of secreting herself behind some corner and landing a devastating swipe of her paw on the calves of passers-by. This would be very effective at crippling us, if only she had not been cruelly declawed at some point before we adopted her from the shelter. Today's post, however, will not be about the irreducible intensionality revealed by her desire to bring down prey much larger than herself, nor about the qualia of her phantom claws (which I am convinced she experiences). Rather, I want to talk about the mechanism which brings her paw — and with it her fore-arm and shoulder and the rest of her body — to move just so as to effectively ham-string us. This mechanism is accessible to experimental investigation, and, as it turns out, pretty nearly linear.

Christian Ethier, Laurent Brizzi, Warren G. Darling and Charles Capaday, "Linear Summation of Cat Motor Cortex Outputs", The Journal of Neuroscience 26 (2006): 5574--5581 [Thanks to Prof. Capaday for a reprint]
Abstract: Recruitment of movement-related muscle synergies involves the functional linking of motor cortical points. We asked how the outputs of two simultaneously stimulated motor cortical points would interact. To this end, experiments were done in ketamine-anesthetized cats. When prolonged (e.g., 500 ms) trains of intracortical microstimulation were applied in the primary motor cortex, stimulus currents as low as 10-20 \mu A evoked coordinated movements of the contralateral forelimb. Paw kinematics in three dimensions and the electromyographic (EMG) activity of eight muscles were simultaneously recorded. We show that the EMG outputs of two cortical points simultaneously stimulated are additive. The movements were represented as displacement vectors pointing from initial to final paw position. The displacement vectors resulting from simultaneous stimulation of two cortical points pointed in nearly the same direction as the algebraic resultant vector. Linear summation of outputs was also found when inhibition at one of the cortical points was reduced by GABA_A receptor antagonists. A simple principle emerges from these results. Notwithstanding the underlying complex neuronal circuitry, motor cortex outputs combine nearly linearly in terms of movement direction and muscle activation patterns. Importantly, simultaneous activation does not change the nature of the output at each point. An additional implication is that not all possible movements need be explicitly represented in the motor cortex; a large number of different movements may be synthesized from a smaller repertoire.

To elaborate a little on the experimental procedure: the cats were anesthetized with ketamine, which allowed for surgery to attach sensors measuring electrical activity to eight muscles, as well as brain surgery opening up the motor cortex. They also attached a sensor to the paw, tracking its position in space. Then, maintained on a steady flow of ketamine, the "body of the cat was laid on a cushion with its forelimbs hanging perpendicular to the ground and free to move in all directions against gravity". At this point, they began small-scale, low-current stimulation of the motor cortex, until they identified a number of points each of which, when stimulated, produced a repeatable and distinct motion of the paw and fore-limb. They recorded both the displacement of the paw, and the total activity, over the course of the movement, of each of the eight muscles. Thus, each point in the cortex corresponded to both a three-dimensional vector, in ordinary space, and an eight-dimensional vector, in muscle space.

In a linear system, if you add two inputs, you also add the corresponding outputs; this, and nothing more, is what "linear" means. Vectors add according to the parallelogram rule, so if the cat motor system is linear, then when we stimulate two points simultaneously, the movement of the paw should be the superposition of the movements produced by each point on its own. I wish I could build some suspense at this point, by harping about how neurons are notoriously nonlinear devices, so it's madness to expect any sort of linearity here, but Ethier et al.'s title rather gives the show away. When they simultaneously stimulated pairs of points, they got paw motions which were almost perfect linear sums of the individual movements. This was true of both the paw-displacement vector, and the muscle-activation vectors. (See especially their figures 2 and 5, and accompanying text.) It didn't matter whether they electrically stimulated both points, or stimulated one point while chemically reducing the inhibition at the other. This is very nice, but what I like even more is the experiment summarized in their figure 7, where they took two cortical points, which produced nearly perpendicular movements on their own, and by varying the magnitude of the stimuli at each, got a sequence of movements which smoothly interpolated between them, exactly as one would hope for a linear control system.
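For readers who like to see the parallelogram rule in action, here is a minimal sketch of what linear summation and the figure-7-style interpolation would predict. The two displacement vectors are numbers I made up (they are not taken from Ethier et al.), and the names MOVE_A, MOVE_B, predicted_movement and angle_deg are purely illustrative.

```python
import math

# Hypothetical paw displacements (cm) evoked by two cortical points
# stimulated on their own; invented values, nearly perpendicular.
MOVE_A = (2.0, 0.0, 0.5)
MOVE_B = (0.0, 1.5, 0.5)

def predicted_movement(weight_a, weight_b):
    """Linear prediction: weighted vector sum of the two single-point movements."""
    return tuple(weight_a * a + weight_b * b for a, b in zip(MOVE_A, MOVE_B))

def angle_deg(u, v):
    """Angle between two vectors, in degrees."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(x * x for x in v))
    # Clamp to [-1, 1] to guard against floating-point round-off.
    return math.degrees(math.acos(max(-1.0, min(1.0, dot / (norm_u * norm_v)))))

# Stimulating both points at full strength: plain vector addition.
both = predicted_movement(1.0, 1.0)

# Sweep the relative stimulus strength from "all A" to "all B".
sweep = [predicted_movement(1.0 - t, t) for t in (0.0, 0.25, 0.5, 0.75, 1.0)]
angles_from_a = [angle_deg(MOVE_A, m) for m in sweep]
```

Under these assumptions the predicted direction swings steadily away from the first movement and toward the second as the weights shift, which is the smooth interpolation one would hope to see from a linear controller.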

Now, I should say that the linearity of response wasn't perfect. The largest systematic deviations from linearity occurred when summing the individual motions would have produced the largest displacements — in a word, the muscle response saturated. To avoid this effect, Ethier et al. first established an input-output relationship for each cortical point on its own, and kept the stimulus magnitude low enough to avoid saturation there. They suspect the place where the response became sub-linear was in the spinal cord, but they don't, that I can see, really establish it.
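To see why saturation shows up mainly for the largest summed movements, here is a toy sketch. It assumes (my assumption, not the paper's) that the two drives add before passing through a single saturating stage, modeled with tanh; MAX_DISPLACEMENT and the function names are hypothetical.

```python
import math

MAX_DISPLACEMENT = 4.0  # hypothetical ceiling on paw displacement (cm)

def saturating_response(drive):
    """Toy output stage: nearly linear for small drive, flattening toward the ceiling."""
    return MAX_DISPLACEMENT * math.tanh(drive / MAX_DISPLACEMENT)

def shortfall(drive_a, drive_b):
    """How far the joint response falls below the sum of the single-point responses."""
    linear_sum = saturating_response(drive_a) + saturating_response(drive_b)
    joint = saturating_response(drive_a + drive_b)
    return linear_sum - joint

small = shortfall(0.2, 0.2)  # weak stimuli: very nearly additive
large = shortfall(3.0, 3.0)  # strong stimuli: clearly sub-additive
```

In this sketch the shortfall is negligible for weak stimuli and substantial for strong ones, which is why keeping each stimulus in the near-linear part of its own input-output curve is a sensible precaution.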

There are two different larger morals to be drawn from this story. One has to do with the functional anatomy of the motor cortex (a larger, long-running story nicely presented in an older paper of Capaday's). There is clearly a great deal of localization of function there (this point produces a swiping motion of the paw, that one pulls it back towards the chest, neither makes the tail twitch), but of a peculiar sort. One might well imagine that each point in the motor cortex would correspond to a particular muscle or group of muscles; instead, at least in the part Ethier et al. worked with, they seem to correspond to motions involving many muscles to varying degrees, overlapping from one cortical point to another. Linearity means that a reasonably small set of motions could serve as a basis for a vast range of coordinated actions, without all of those having to be separately stored in the motor cortex.

The other moral has to do with the general principles of neural representation and computation. Neurons are, indeed, horribly nonlinear little things, so it would be entirely reasonable to suppose that neural codes are too; but that would be too quick. One of the few efforts in this area that is general, abstract and predictive enough that it seems to me to be worth calling a theory, the "neural engineering" advanced by Chris Eliasmith and Charles Anderson in a book of that title, takes as its first principle "nonlinear encoding and linear decoding". That is, while the mapping from input to output is hairy and nonlinear, for typical outputs you can recover the input, to high accuracy, using a linear rule. This is especially easy to arrange in neural systems where excitation and inhibition are nearly balanced, so Ethier et al.'s findings on dis-inhibition fit in nicely.
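The simplest illustration I can think of for "nonlinear encoding and linear decoding" is an opposed pair of rectifying units, one responding only to positive inputs and one only to negative. This is a deliberately crude toy, not Eliasmith and Anderson's actual construction; the gain value and the function names are my own inventions.

```python
def encode(x, gain=3.0):
    """Toy nonlinear encoding by an opposed pair of rectifying 'neurons'.

    Each unit is silent over half the input range, so neither firing
    rate is a linear function of x.
    """
    on_rate = max(0.0, gain * x)
    off_rate = max(0.0, -gain * x)
    return on_rate, off_rate

def decode(rates, gain=3.0):
    """Linear decoding: a fixed weighted sum of the two firing rates."""
    on_rate, off_rate = rates
    return (on_rate - off_rate) / gain

inputs = [-1.0, -0.4, 0.0, 0.3, 0.9]
recovered = [decode(encode(x)) for x in inputs]
```

Here the balance between the "on" and "off" units is what lets a fixed pair of weights undo the rectification; with realistic neurons one would expect only an approximate recovery, with decoding weights fitted by something like least squares.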

Nonlinear encoding and linear decoding is not just an assumption of Eliasmith and Anderson, but, e.g., features quite prominently in Spikes, and is implicit in the now-standard "reverse correlation" method. While I am not, usually, one to argue with scientific success, I have reservations about this. William James used to decry, as "the psychologist's fallacy", the "confusion of his own standpoint with that of the mental fact about which he is making his report" (Principles of Psychology, ch. 7). Something similar (the "computational neuroscientist's fallacy", perhaps?) seems to me involved here. Neural representations do not exist to be decoded by scientists, but to be used by other parts of the organism, and ultimately to produce adaptive actions. What is lacking, in most of these studies, is evidence that linear decodings of neural activity are in any way biologically relevant. (One of the nice things about Eliasmith and Anderson is they see at least part of this, since their second principle is that other parts of the brain use a neural representation by applying alternately-weighted linear transformations to it, i.e., biased linear decodings. But they present less evidence for this than for their first principle.)

In this case, however, I don't find much room for doubt: the points in the motor cortex represent actions, and those representations are, when the paw meets the calf, linearly decoded. It's still not clear to me that linear decodings and transformations are any easier for the brain to implement, but at least in this case that's what's going on, and it's an empirical fact we will have to incorporate in our models, or ideally explain in our theories.

And now, if you will excuse me, I'm being attacked.

Friday Cat Blogging; Minds, Brains, and Neurons

Posted by crshalizi at June 23, 2006 23:40 | permanent link

June 08, 2006

Neuropharmacological Foundations of the Public Sphere

The importance of coffee-houses in the Enlightenment, and the rise of the public sphere, is a historical common-place. But it's also puzzling: historians can say a lot of sensible things about how, as a social setting, the cafe was conducive to the give and take of (more or less) rational argument, and (relative) indifference to social standing in favor of persuasion. But I've never heard a good story for why coffee houses had to be run that way, nor that (say) taverns weren't, or couldn't have been, run that way. So, while not denigrating the social factor, it doesn't seem to explain why this connection took hold. Now, at last, scientific proof that Enlightenment had a sound material basis (via Mind Hacks):

Pearl Y. Martin, Jenny Laing, Robin Martin, and Melanie Mitchell, "Caffeine, Cognition, and Persuasion: Evidence for Caffeine Increasing the Systematic Processing of Persuasive Messages", Journal of Applied Social Psychology 35 (2005): 160--182 [full-text PDF free via that link]
Abstract: Caffeine is known to increase arousal, attention, and information processing—all factors implicated in facilitating persuasion. In a standard attitude-change paradigm, participants consumed an orange-juice drink that either contained caffeine (3.5 mg/kg body weight) or did not (placebo) prior to reading a counterattitudinal communication (anti-voluntary euthanasia). Participants then completed a thought-listing task and a number of attitude scales. The first experiment showed that those who consumed caffeine showed greater agreement with the communication (direct attitude: voluntary euthanasia) and on an issue related to, but not contained in, the communication (indirect attitude: abortion). The order in which direct and indirect attitudes were measured did not affect the results. A second experiment manipulated the quality of the arguments in the message (strong vs. weak) to determine whether systematic processing had occurred. There was evidence that systematic processing occurred in both drink conditions, but was greater for those who had consumed caffeine. In both experiments, the amount of message-congruent thinking mediated persuasion. These results show that caffeine can increase the extent to which people systematically process and are influenced by a persuasive communication.

I should perhaps add that the leap from their findings to the rise of modern rationalism is entirely my own.

Manual trackback: Stephen Laniel; Brad DeLong

Update, 13 June: On the basis of my correspondence about this post, I feel like I have to add that I was not serious. "Differential diagnosis, people" (to quote a great sage and eminent junkie): both coffee and the coffee-house were imported into Europe from the Levant, where the coffee-house developed as a recognizable social form, without triggering a local version of the Enlightenment. For that matter, all the physiological studies point to the influence of caffeine as such, not just coffee, so why not tea? (Though, come to think of it, didn't the proto-scientific and industrial development of the Song dynasty coincide with the rise of tea houses?) No doubt patterns of caffeine consumption have had some effect on culture, and it would be nice if historians could study them, but not like this. I'm a reductionist, but when I toss out simplistic biological explanations for complicated, ill-defined social phenomena, I'm joking, unlike some people.

Also: Ruchira Dutta writes with news of a learned investigation into related questions, A. A. Reade's Study and Stimulants; or, the Use of Intoxicants and Narcotics in Relation to Intellectual Life, as Illustrated by Personal Communications on the Subject, from Men of Letters and of Science (Philadelphia: Lippincott, 1883). As the subtitle suggests, it largely consists of letters from various eminences of the day on their own experiences of using drugs, and thoughts on whether and how drugs are of use to intellectuals. Some of them are quite charming, in a rather prim way. Thus H. H. Bancroft's letter, in its entirety: "In my opinion some constitutions are benefited by a moderate use of tobacco and alcohol; others are not. But to touch these things is dangerous." Or, speaking of the rise of rationalism, Mr. W. E. H. Lecky: "I am not a smoker, and am therefore unable to give you any evidence on the subject." Other correspondents were not similarly inhibited by their lack of first person data, such as Keshub Chunder Sen ("The problem you have undertaken to solve is, indeed, one of intense importance and interest, and all who can ought to help its solution in the interests both of science and morality") or Mr. Ivan Tourguéneff.

Minds, Brains, and Neurons; The Collective Use and Evolution of Concepts; The Great Transformation

Posted by crshalizi at June 08, 2006 17:49 | permanent link

June 04, 2006

Reasons to Be Cheerful: Higher Primates Issue

  1. Marc Hauser and Bailey Spaulding, "Wild rhesus monkeys generate causal inferences about possible and impossible physical transformations in the absence of experience", Proceedings of the National Academy of Sciences (USA) 103 (2006): 7181--7185 [PDF reprint. More exactly, they are less surprised if they see an apple being cut in half by a knife than by a glass of water, even if they have not seen knives cutting apples over and over again, so as to build up an expectation of constant conjunction on that basis. The deeper point Hume was making (with al-Ghazzali before him) was that, however we acquire our causal expectations, all they mean empirically is constant conjunction, a "habit of the world". Higher primates can be very good at making guesses about those habits.]
  2. The Second Mrs. Kong, an opera with libretto by Russell Hoban and music by Sir Harrison Birtwistle. [Via John Burke, in e-mail]
  3. Victory Brewing Company's Golden Monkey tripel ale.

Linkage

Posted by crshalizi at June 04, 2006 21:30 | permanent link

May 31, 2006

Books to Read While the Algae Grow in Your Fur, May 2006

Naomi Novik, Throne of Jade
Sequel to His Majesty's Dragon (see here), and much the same remarks apply.
Manuel De Landa, A Thousand Years of Nonlinear History
Take Daniel Dennett's philosophy (mechanical materialism brought up to date), add a course of reading in the better world historians (Braudel, McNeill, Crosby), economists (North, Simon; also the less defensible parts of Jane Jacobs) and sociolinguists (Labov), and then translate into the Deleuze-and-Guattari dialect of post-structuralist, which adds absolutely nothing to the argument. (Fortunately, in a nice display of code-switching, De Landa for the most part writes clearly, if quite abstractly and academically, restricting the Deleuzisms to clearly-delineated sections. This only makes it easier to see that they are completely superfluous.) Deserves a full review (after all, I got this review copy in 1998), not that I'm apt to have the time in the near future...
Tim Powers, Declare
The occult Lovecraftian inner truth of the Cold War, with Kim Philby as emissary to the Old Ones. Superb. (Only: it really doesn't fit well that Roman Catholicism is also supposed to be true.)
Rebecca Solnit, River of Shadows: Eadweard Muybridge and the Technological Wild West
This is about the future we now live, when it wasn't widely distributed yet — and how it began in California, specifically the San Francisco area, circa 1870--1890. (One of the few connections Solnit doesn't make is to William Everdell's theory of modernism as discontinuity and collage, though it would seem to fit her argument perfectly.) One place her argument fails, however, is in persuading me that Muybridge's character had any influence on the subsequent development of cinematography — that if he had been a different person, things would have really turned out differently in any important respect, that much the same process wouldn't've been invented by other people, finding much the same uses, just as, say, television was invented by several very different people more or less simultaneously. (As one of Lem's characters argues in His Master's Voice, science and technology are ergodic processes, in which individuals' influences are transient fluctuations.) But Muybridge's life story (and the intersecting stories Solnit weaves through it) is well worth knowing for its own sake.
M. S. Bartlett, An Introduction to Stochastic Processes, with Special Reference to Methods and Applications
Old-fashioned British (Fisherian) statistics (it's from 1955!), with all the weaknesses (in mathematical sophistication, and attention to rigor) and strengths (attention to empirical applicability, preference for straightforward techniques over abstraction for its own sake) of that tradition. His treatment of the mean-square ergodic theorem (which someone seems to have ripped off), for example, is very nearly saying how to calculate the ergodic limit, and not an exercise in the spectral analysis of unitary operators. And, while he just takes it as obvious that you should do statistical inference for stochastic processes by maximizing the likelihood, he does consider inference for stochastic processes, because ultimately he's a statistician with data to analyze, and not a probabilist with theorems to prove.
Amir Dembo and Ofer Zeitouni, Large Deviations Techniques and Applications
Very nice textbook, probably ideal for a year-long course on large deviations theory. Especially strong on projective limits, and on applications to signal processing and information theory.
Frank den Hollander, Large Deviations
Maybe the best first introduction to large deviations theory I've seen. I'd have preferred a bit more functional analysis and a bit less combinatorics in the first two chapters (really!), but it's excellent, and I've stolen from it shamelessly.
Richard S. Ellis, Entropy, Large Deviations, and Statistical Mechanics
In addition to being an excellent exposition of the rigorous theory of large deviations (especially for physicists, naturally!), this is also one of the most conceptually satisfying approaches to the foundations of statistical mechanics. In particular, it makes good probabilistic sense of the method of maximum entropy, without invoking weird sub-Bayesian ideas about statistical inference. (Namely, maximum Gibbs-Shannon entropy drops out as an approximate consequence of large deviations theory, when considering a small part of a large system, becoming exact only in the thermodynamic limit. As Ellis says, the core of this idea goes back to Boltzmann.)

Books to Read While the Algae Grow in Your Fur

Posted by crshalizi at May 31, 2006 23:59 | permanent link

May 22, 2006

A Scene from the Café Central

Or rather, from Ernest Gellner's Language and Solitude: Wittgenstein, Malinowski and the Habsburg Dilemma (pp. 108--109):

Imagine the conversation of a few Viennese characters, in the Cafe Central, on the assumption that the Tractatus is a correct account of the human condition.

CHARACTER A: High up on the left hand of my visual field, I note a fact in which a 9-value predicate links the appropriate number of things... I don't think I have had this one before, I'd like a snapshot for my collection.
B: I have a much more interesting one right in the middle of my field, a variable with 127 things attached — it has a lot of tentacles, holding those things. I've counted them.
C (to A): Don't you believe him. I have known him for years, he is invariably given to exaggerating the complexity of his facts, just to make himself interesting.
D (to C): I don't think he is deliberately lying, he just drinks too much and then imagines things.
B (furiously, to both C and D): What you have both said is extremely offensive and I have no option but to call you out! My seconds will call on you. That is, assuming you have honour: my fraternity has decreed that Jews have none and we may not duel with them. Is either of you at least half Aryan? That is all one can hope for in Vienna these days.
A: Gentlemen, gentlemen, please calm down. May I remind you first of all that dueling is forbidden by law and, secondly, death not being an event in life, is totally pointless anyway.

The above conversation is not copied out of Die letzten Tage der Menschheit or Der Weg ins Freie or Der Mann ohne Eigenschaften or even Radetzky Marsch. It is all my own work and it is copyright.

[Thanks to Wolfgang for checking my transcription of German titles!]

Philosophy; The Commonwealth of Letters

Posted by crshalizi at May 22, 2006 03:45 | permanent link

May 21, 2006

Partway Down the Danube

Hopefully, it's just jet-lag, but seeing highway signs marked "Wien", "Praha" and "Budapest" makes me feel much the same way one marked "Minas Tirith" would, that something known only from books has become unsettlingly real.

Postcards

Posted by crshalizi at May 21, 2006 12:00 | permanent link

May 12, 2006

Friday Cat Blogging (Staging an Intervention Issue of Non-Science-Geek Edition)

Kara has Views about humans working on Friday nights:

Friday Cat Blogging

Posted by crshalizi at May 12, 2006 17:58 | permanent link

Snowclones in May!

I'm afraid I was a bit premature in announcing my return to posting, since I find my time being taken up by actually doing some science, and being, for the first time, on the sending rather than the receiving end of a qualifying exam. In the meanwhile, let me enthuse about Far from the Madding Gerund, a selection (and annotation) of the best of Mark Liberman's and Geoffrey Pullum's posts from Language Log. Doing a proper review, or even justice to their weblog, is really beyond what I have time for, but this book seems to capture quite perfectly their mixture of whimsy, skepticism, accessible scholarship, and pure good-natured zeal for their subject. If you have been reading them, then you will find these little essays as good as you remembered, if not better, and you'll wonder how you forgot about some; if you have not been reading Language Log, then their book should convince you to start. Either way, reading it will make you better and happier. (Disclaimer: I got an unsolicited review copy of this book this week, which didn't help my productivity any. Also, when I gave a talk at Penn on April Fool's Day last year, Mark was kind enough to let me crash in the hospitality suite at One Language Log Plaza.)

Linkage

Posted by crshalizi at May 12, 2006 09:40 | permanent link

May 05, 2006

The Absorbing Boundary

Two hundred and fifty pages later, it's the last day of classes, and my last lecture in advanced probability is done. (I still need to tex up the one lecture I gave from hand-written notes, but there's no rush.) Looking over the tome, I didn't get to a lot of stuff I wanted to (easily enough for another semester!), it's far from self-contained, and it almost certainly contains mistakes, but I think it's not bad for a first draft. It also took an order of magnitude more of my time than was wise. I'll be teaching the class again in the spring, and hope I'll have the strength of will to leave it alone, rather than succumbing to the temptation to do something.

Posting about science and outrages against common decency will recommence shortly.

Corrupting the Young; Enigmas of Chance

Posted by crshalizi at May 05, 2006 15:00 | permanent link

April 30, 2006

Books to Read While the Algae Grow in Your Fur, April 2006

Naomi Novik, His Majesty's Dragon
Read on the recommendation of Henry Farrell, who, while accurately describing the Hornblower-meets-Pern setting as sounding exquisitely repulsive, said it was actually a lot of fun: which it is. (Cf. Cheryl Morgan's review in Emerald City.) Annoyingly, the first book in a trilogy; less annoyingly, the sequels are coming out this summer.
Adam Gopnik, Paris to the Moon
Cute to charming (though, really, there is such a thing as being too concerned with food). Fulfills any desire I might have to actually live in France.
Jane Haddam, Hardscrabble Road
Mystery novel about a right-wing talk radio idiot caught with an addiction to prescription painkillers --- but who is not Rush Limbaugh, if only because there's obviously a large admixture of Bill O'Reilly. (Similarly, "Jig" Taylor seems to be a cross between Linus Pauling and Noam Chomsky.) As always with Haddam, the delight is in the characters and the dialogue; she should do more with politics.
Karin Slaughter, Faithless
I did not need to stay up all night with this before lecturing on metric transitivity. But, well, read that first page...
Ken MacLeod, Learning the World: A Scientific Romance
This is a lot of fun — as MacLeod always is: a human generation starship makes first contact with the first known alien intelligence, a species of bat-winged humanoids who are in the throes of their own industrial revolution. Complications ensue. About half of the book is told from the contactee's perspective, and that reads like an old-fashioned golden-age-of-SF story, or even (cf. subtitle) something out of Wells. (For that matter, compare MacLeod's aliens to the moon-bats of the Great Moon Hoax of 1835.) And the other half of the book, largely from the perspective of a teenage girl aboard the starship ("Learning the World" being the name of her blog) seems like some kind of friendly settling-of-accounts with Heinlein, perhaps by way of Alexei Panshin's Rite of Passage... But it's all, unmistakably and gloriously, MacLeod.
Phil Rickman, The Smile of a Ghost
Another novel in the Merrily Watkins series, which, amazingly, shows no sign of fading... Also, for once the rational explanation is entirely satisfying, given that one of the characters is very deeply twisted (which is abundantly established in the book), yet not completely unsympathetic.

Books to Read While the Algae Grow in Your Fur

Posted by crshalizi at April 30, 2006 23:59 | permanent link

March 31, 2006

Books to Read While the Algae Grow in Your Fur, March 2006

Terry Pratchett, Thud!
One of the things which makes Terry Pratchett a good satirist is that it is not possible to directly map things (at least not important ones) in his stories on to our world, even though they are obviously connected. But then, he is the contemporary avatar of Anatole France... (And: thank you, "Uncle Jan".)
Patricia Briggs, Raven's Shadow and Raven's Strike
Entertaining and nicely done (if not very deep) fantasy novels. Good rainy-weekend books. (The cover of the second one is appalling, however.)
Don Herzog, Poisoning the Minds of the Lower Orders
Or: the birth of conservatism out of the spirit of contempt — at least in England in the aftermath of the French Revolution.

Books to Read While the Algae Grow in Your Fur

Posted by crshalizi at March 31, 2006 23:59 | permanent link

March 24, 2006

Friday Cat Blogging (Measure-Zero Exception to the Hiatus Issue of Non-Science-Geek Edition)

Kara consults her authorities. (What, you didn't think I was writing this without help, did you?)

Friday Cat Blogging; Enigmas of Chance

Posted by crshalizi at March 24, 2006 12:51 | permanent link

March 18, 2006

Statistical Network Analysis: Call for Papers

As a member of the program committee for the workshop Statistical Network Analysis: Models, Issues and New Directions, part of the 2006 International Conference on Machine Learning, I urge you to submit your best work by 28 April; you can send us the bad stuff after that. (That, in response to hearing endless variants on "First prize, a trip to Pittsburgh; second prize, two trips", which was old in my grandfather's day.)

CALL FOR PAPERS

Statistical Network Analysis:
Models, Issues and New Directions

a workshop at the

23rd International Conference on Machine Learning
(ICML 2006)

Thursday, June 29, 2006, Pittsburgh PA, USA

Overview:

This workshop focuses on probabilistic methods for network analysis, paying special attention to model design and computational issues of learning and inference.

Many modern data analysis problems involve large data sets of artificial, social, and biological networks. In these settings, traditional IID assumptions are blatantly inappropriate; the analyses must take into account the structure of relationships between the data. As a result, there has been increasing research developing techniques for incorporating network structures into machine learning and statistics.

Network modeling is an active area of research in several domains. Statisticians have mostly concentrated on models of static networks. These models are concerned with the existence of edges between individual nodes, but do not attempt to model aggregate properties. In contrast, physicists have addressed global properties of large complex networks. Their models describe average statistics of the network, or properties of typical networks in large ensembles; the links between particular nodes are less meaningful.

This workshop aims to bring together statistical network modeling researchers from different communities, thereby fostering collaborations and intellectual exchange. Our hope is that this will result in novel modeling approaches, diverse applications, and new research directions.

We wish to clarify that in this workshop, the word "relational" carries a different meaning from the usual sense of the word in Probabilistic Relational Models (PRMs). For example, in real life, any two random people may be connected through a complex web of friendships; estimation of interpersonal connections thus cannot be done independently of the rest of the network. We focus on modeling statistical properties of the network, as opposed to different types of probabilistic relations. This differentiates us from the co-located ICML workshop on Statistical Relational Models.

Online Submissions:

We welcome the following types of papers:

  • research papers that introduce new models or apply established models to novel domains,
  • research papers that explore theoretical and computational issues,
  • position papers that discuss shortcomings and desiderata of current approaches, or propose new directions for future research.
We encourage authors to emphasize the role of learning and its relevance to the application domains at hand. In addition, we hope to identify current successes in the area, and will therefore consider papers that apply previously proposed models to novel domains and data sets.

Submissions should be limited to a maximum of 8 pages, and adhere to ICML format. Please email your submissions to: edo [at] cmu.edu.

Deadline for Submissions: Friday, April 28, 2006
Notification of Decision: Friday, May 5, 2006

Format:

This is a one-day workshop. It will consist of several themed sessions targeting methodological and application issues (e.g., estimation in static models, network evolution modeling, and statistical modeling of large scale networks) with talks (invited and contributed) and moderated discussion. Discussions at the workshop will facilitate exchanging of research ideas and help identify other challenging problems in the area. At the end of the workshop, a panel of statisticians, physicists, and computer scientists will discuss the points arising throughout the day and identify the most promising and challenging directions.

Publication:

Accepted papers will be distributed on a CD and made available for download.

Organizers:

Edo Airoldi, Carnegie Mellon University
David Blei, Princeton University
Stephen Fienberg, Carnegie Mellon University
Anna Goldenberg, Carnegie Mellon University
Eric Xing, Carnegie Mellon University
Alice Zheng, Carnegie Mellon University

Program Committee:

David Banks, Duke University
Peter Dodds, Columbia University
Lise Getoor, University of Maryland
Mark Handcock, University of Washington, Seattle
Peter Hoff, University of Washington, Seattle
David Jensen, University of Massachusetts, Amherst
Alan Karr, National Institute of Statistical Sciences
Jon Kleinberg, Cornell University
Andrew McCallum, University of Massachusetts, Amherst
Foster Provost, New York University
Cosma Shalizi, Carnegie Mellon University
Padhraic Smyth, University of California, Irvine
Josh Tenenbaum, Massachusetts Institute of Technology
Stanley Wasserman, Indiana University

Networks; Enigmas of Chance

Posted by crshalizi at March 18, 2006 15:06 | permanent link

February 28, 2006

Books to Read While the Algae Grow in Your Fur, February 2006

Ian Kershaw, Hitler, 1889--1936: Hubris
There is something perversely mesmerizing about a bohemian crank becoming master of Europe, with nothing more to recommend him than his ability to express pathological hatred... But Kershaw is very good at showing at once how Hitler fit the circumstances he was in, without being in any way inevitable: he benefitted enormously, crucially, from choices other people made, and they didn't have to make them — unless being a reactionary idiot constitutes a condition of diminished responsibility.
Robert O. Paxton, The Anatomy of Fascism
Nice comparative history of fascist movements, with sharp observations, especially about how one of their distinguishing characteristics was a desire to maintain wide-scale political mobilization after they'd attained power, and the crucial role of non-fascist conservatives and vested interests in promoting them (which is not to say that the fascists were just instruments of the bourgeoisie). Also, some shrewd-sounding guesses about how future fascist movements might look in, e.g., the US. But not enough comparisons with non-fascist movements...

Appearances to the contrary notwithstanding, I did not actually spend all my time in February reading about fascists. In fact, most of my reading was in these books, though I didn't go cover to cover:

J. L. Doob, Stochastic Processes
Since 1953, pretty much every new textbook on stochastic processes in general (including a certain set of lecture notes) has been a lower and distorted version of this. Re-reading it reminds me why it is very hard to break out of this pattern... (No purchase link, because the price Wiley charges for their flimsy paperback edition is unconscionable. You're better off looking for a used copy of the hardback, and the world would be better off if Wiley would let a decent reprint publisher take it up.)
Michel Loève, Probability Theory
Another one of the classics, which I was revisiting for the very limited purpose of stealing what he had to say about second-order processes (this being the Loève of the Karhunen-Loève theorem). And you have to love a math book dedicated to the author's fellow concentration-camp internees (remembering, in the words of the poet, "Human reason is beautiful and invincible...").
I. I. Gikhman and A. V. Skorokhod, Introduction to the Theory of Random Processes
This is an "Introduction" for people who are already familiar with measure-theoretic probability (though there is a one-chapter summary, intended as a refresher). It is at once quite thorough, and assumes a high level of mathematical maturity and comfort with abstractions, and very practical, and assumes the reader doesn't mind pages of calculations. (This strikes me as very Soviet.) A good value for the money.
Stewart N. Ethier and Thomas G. Kurtz, Markov Processes: Characterization and Convergence
Do not start reading from the beginning, which is several chapters of fairly turbid linear-operator theory, followed by a chapter of utterly opaque weak-convergence-under-the-Skorokhod-topology. Instead, start with the material on actual Markov processes, and then work backwards through the foundational chapters as needed. (The flow-chart at the end of the book, indicating which results depend on which previous ones, is very helpful for this.) There is a wealth of fascinating material here, of profound importance for statistics and for physics, since both rely crucially on extracting nearly-deterministic behavior from large-scale Markov processes, which is a key theme of these results.
Olav Kallenberg, Foundations of Modern Probability
This is intended as something in the same mold as Loève's book, developing probability from basic measure theory up through advanced topics in stochastic processes. Almost everything I want to teach is in here, and, while the proofs are often quite compressed, it is character-building for my students (and me!) to fill things in. Pretty much every topic of contemporary interest to probability theorists gets covered; but coverage is very much dictated by mathematical, rather than by statistical or physical, interest. Kallenberg is fond of revisiting previously-introduced themes, which is pedagogically sound — and would work better, in a reference book, if the index were more detailed.
L. C. G. Rogers and D. Williams, Diffusions, Markov Processes, and Martingales (in two volumes: Foundations and Itô Calculus)
I have to say that this is very much a mathematician's view of stochastic processes, and that I frankly don't see the point, for applications, of a lot of what gets them excited --- though, following them, I can get excited about it as pure math! And they're really very good at logically and comprehensively developing a mathematical theory.
It was striking how ugly Ethier and Kurtz's book is, on the page, and Gikhman and Skorokhod and (it must be said) Doob aren't much better. Loève, and Rogers and Williams, are better designed, and Kallenberg is actually, to my eye, fairly attractive.

Books to Read While the Algae Grow in Your Fur

Posted by crshalizi at February 28, 2006 23:59 | permanent link

February 19, 2006

SUMO versus the Dendritic Claws (Sunday Sibling Blogging)

I occasionally dabble in mathematical aspects of neuroscience. My brother does the real thing:

Aryaman Shalizi, Brice Gaudillière, Zengqiang Yuan, Judith Stegmüller, Takahiro Shirogane, Qingyuan Ge, Yi Tan, Brenda Schulman, J. Wade Harper, and Azad Bonni, "A Calcium-Regulated MEF2 Sumoylation Switch Controls Postsynaptic Differentiation", Science 311 (2006): 1012--1017
Abstract: Postsynaptic differentiation of dendrites is an essential step in synapse formation. We report here a requirement for the transcription factor myocyte enhancer factor 2A (MEF2A) in the morphogenesis of postsynaptic granule neuron dendritic claws in the cerebellar cortex. A transcriptional repressor form of MEF2A that is sumoylated at lysine-403 promoted dendritic claw differentiation. Activity-dependent calcium signaling induced a calcineurin-mediated dephosphorylation of MEF2A at serine-408 and, thereby, promoted a switch from sumoylation to acetylation at lysine-403, which led to inhibition of dendritic claw differentiation. Our findings define a mechanism underlying postsynaptic differentiation that may modulate activity-dependent synapse development and plasticity in the brain.

Also check out the accompanying "perspectives" piece.

Minds, Brains, and Neurons; Incestuous Amplification

Posted by crshalizi at February 19, 2006 14:50 | permanent link

February 17, 2006

European Conference on Complex Systems 2006

will be held at Oxford, 25--29 September. Since I'm on the program committee, I ought to point interested parties to the call for papers (deadline 7 April) and the call for workshop proposals (deadline 5 May).

Complexity

Posted by crshalizi at February 17, 2006 11:45 | permanent link

January 31, 2006

Books to Read While the Algae Grow in Your Fur, January 2006

Ken Bruen, The Guards
I don't usually like the hardboiled PI school of detective novel, and making the PI an Irish drunk didn't promise to make it any more appealing. But this was extremely good, due mostly to Bruen's prose style and ability to avoid sentimentality. I actually read this in one sitting...
Steven Saylor, Murder on the Appian Way
The back-story of Cicero's Pro Milone.
Patricia A. McKillip, Od Magic
Andrew P. Vayda (ed.), Environment and Cultural Behavior: Ecological Studies in Cultural Anthropology
1969 reprint collection. Lots of cool old stuff — Geertz on Indonesian agriculture, Kroeber on North American culture areas, Barth on the Swat Pathans and their neighbors, etc. No doubt tremendously out of date.
Leslie Forbes, Waking Raphael
Wonderfully-written and absorbing literary mystery novel. I'm not sure what to make of the way the darkness of the Fascist past is routed by the combination of an overly-proper Englishwoman and a motley collection of (more-or-less-ex-)Communists.
Martin Meredith, The Fate of Africa: A History of Fifty Years of Independence
While I was reading this, Danny Yee wrote a review that makes the points I would've, except for this: one of the most depressing aspects of this history is its repetitiousness. (Read on the recommendation of Alex Mallet.)
Douglas Preston and Lincoln Child, Dance of Death
When I find myself stuck in an airport with nothing to read (don't laugh! it happens!), I look for something new from these two because, unlike their competitors, they have some brains underneath the formulas. Their latest is fun, though not, I think, one of their best, and probably not very satisfying if you haven't been following their (hitherto loosely connected) series. I give them points for killing off a long-running character, though I won't say which one.
Barbara R. Rossing, The Rapture Exposed: The Message of Hope in the Book of Revelation
A serious, mainline-Protestant (specifically, Lutheran) interpretation of the Book of Revelation, and the apocalyptic parts of the Bible more generally, as a call to make this world, which its Creator loves, a better, a redeemed, place. (Cf. Tuveson's Millennium and Utopia, on the historical roots of the ideal of progress in early modern Europe, not referred to by Rossing.) Explicitly aimed against the Left Behind books and similar nonsense, though not so blunt as Slacktivist's writings to that end. Since vast numbers of our fellow citizens accept that mythology uncritically, it's useful to know arguments against it which start from premises they're more likely to accept. — It's quite correct, and probably helpful, to hammer the point home that the full dispensationalist/Rapture mythology only appeared in the 19th century (though there were medieval predecessors for the dispensations, in Joachim of Fiore and his followers; I don't know if there's a historical connection). There's absolutely no reason to believe it was the doctrine of the early Church. However, it seems very implausible to me that the early Church believed in anything other than the sudden, rapidly-approaching and violent end and replacement of this world. I don't think Rossing (and similar authors) adequately addresses the possibility that the apocalyptic writers were speaking in symbols and had profoundly weird millenarian beliefs, that John of Patmos was ranting against Caesar in code, and that he believed there would be no more sea, but glass. — In any case, whether we should be trying to improve this world does not really depend on the correct reading of Revelation any more than it does on the correct reading of the Satyricon.

Books to Read While the Algae Grow in Your Fur

Posted by crshalizi at January 31, 2006 23:59 | permanent link

January 30, 2006

Self-Commitment

I seem to have locked myself into writing five pages of math textbook three times a week between now and the end of the semester. If I am to pull this off while still doing research and having something like a life, I can't spend any time posting here, or even really reading weblogs. So I'm putting this on hold until after May 5th, and deleting my RSS reader for the duration. (I will keep updating the book log in the side-bar, though.) Regular service will resume once school's out, and in the meanwhile you can get along without me for three months.

(There, that should be sufficiently emphatic to discourage backsliding.)

Self-centered

Posted by crshalizi at January 30, 2006 20:24 | permanent link

January 24, 2006

Graphs, Trees, Materialism, Fishing

Attention conservation notice: Over 6000 words of dubious value. Written as a contribution to the Valve event on Franco Moretti's Graphs, Maps, Trees. Contains advice to literary scholars from someone utterly unqualified to give it, hypothesis testing, rastergrams, many long quotations, and ruminations on materialism and rational history. If this is the sort of thing you're interested in, you'd really be better off reading Moretti yourself.
Cross-posted to the Valve, where there is a comments section. I can't promise to reply...

A few years ago, I wrote a review of Moretti's Atlas of the European Novel, in which I presumed to tell him how to go about his business. When he ran across it, his reaction was not (as mine would've been, had our situation been reversed) to tell me where to get off, but to invite me to a workshop he was organizing at Stanford on new interdisciplinary work on the novel — its motto, the quotation from Brecht about "questions that appear to us completely unsolved", is recycled for this book — where I had a great time. Reading these essays as they came out in New Left Review, I enjoyed them greatly, and recall thinking that Moretti could hardly have done a better job of appealing to my prejudices if he'd tried. (Said prejudices are those of someone almost equally fond of The Extended Phenotype and Main Currents of Marxism.)

With this kind of background, it comes as no surprise, I trust, that I really like this book, and find objecting to what he actually proposes here highly wrong-headed. In what follows, I want to say a bit about "graphs" and a bit about "trees", and explain why this sounds so promising to me. I am not going to say anything about "maps", because I don't think I have anything to add to that discussion, but I will, for the sake of getting an M in there, end with some remarks on "materialism". At no point can I pretend to be competent to evaluate the originality of Moretti's work within literary scholarship, to say how much of a departure, say, the trees really are. In a feeble attempt to pretend that my price is higher than a weekend in California and a review copy, I will make some criticisms, most about tedious extra stuff I wish Moretti had also done. I'd like to think that what I say will also have some value for those who don't share my rather haphazard intellectual trajectory, but my experience with trying to communicate across disciplines means I'll get a warm glow if I'm even comprehensible, never mind persuasive. I am accordingly very grateful to the Valve, and especially to Jonathan Goodwin, for letting someone with my credentials (viz., none) participate in this event.

Graphs

Do Genres Come in Bunches?

Moretti makes a very striking claim in his first chapter: that genres of novels appear together, in clusters, separated by about 25 years, and disappear together too. Looking at his graph, my eye agrees, but my eye also tells me that there are faces in clouds (the East African Plains Ape is an incorrigible pattern-finder), and probability theory tells me that purely random processes can produce a lot of apparent clustering and regularity. What reason is there to think that what looks like genres coming in clusters isn't just coincidence?

Let's be a little more precise about what we'd mean by "chance" and "coincidence" here. One natural possibility is that new genres appear at a constant rate over time, utterly independently of one another. Every year, then, there would be a constant probability of a new genre forming, but whether it did or not would have no bearing on whether the next year saw a new genre. This is our null model — the one which says what things should look like if we're just fooling ourselves, and there are no clusters. To get slightly technical, the distribution of intervals between genre-arrivals should have what's called a geometric distribution. Assuming, for the sake of argument, that that's true, we can use the average time between genre-appearances (3.44 years) to estimate the most likely value for the probability of a new genre appearing in any given year (about 29%).
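For readers who want to see the arithmetic, here is a minimal sketch of that estimate. It assumes the geometric distribution is the version supported on 1, 2, 3, ... years (so that at most one genre appears per year and the mean gap is 1/p); the function names are my own, for illustration only.

```python
def estimate_yearly_probability(mean_gap_years):
    """Maximum-likelihood estimate of p for a geometric distribution
    on {1, 2, 3, ...}, whose mean is 1/p (an assumed parameterization)."""
    return 1.0 / mean_gap_years


def geometric_pmf(gap, p):
    """Probability that the next genre arrives exactly `gap` years
    after the previous one: (gap - 1) empty years, then one arrival."""
    return (1.0 - p) ** (gap - 1) * p


# Mean time between genre appearances, as quoted in the text.
p_hat = estimate_yearly_probability(3.44)
print(f"estimated yearly probability: {p_hat:.3f}")
```

With a mean gap of 3.44 years this gives a probability of roughly 0.29, matching the figure quoted above.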

Once we assume that the inter-arrival distribution is geometric and find the parameter, we can simulate from it, and get examples of what Moretti's graph would look like, if only chance were at play.

The top line shows the appearance dates of Moretti's 44 genres; the next two lines give the results of simulating from a model of uniform random appearance, with the same mean time between genres as the actual history.
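Simulating from the null model is simple enough to sketch. The code below is an illustration under stated assumptions, not the code behind the figure: it draws each gap by flipping a yearly coin with success probability p, and it counts years from the first genre's appearance (year 0 is an arbitrary placeholder, not a date from Moretti's chart). The function names are hypothetical.

```python
import random
from itertools import accumulate


def sample_gap(p, rng):
    """Draw one inter-arrival time in years from a geometric
    distribution on {1, 2, ...}: count years until the first success."""
    gap = 1
    while rng.random() >= p:
        gap += 1
    return gap


def simulate_history(n_genres, p, rng, start_year=0):
    """Return appearance years for n_genres genres whose gaps are
    independent geometric draws; the first genre appears at start_year."""
    gaps = [sample_gap(p, rng) for _ in range(n_genres - 1)]
    return list(accumulate(gaps, initial=start_year))


rng = random.Random(2006)  # fixed seed so the run is repeatable
history = simulate_history(n_genres=44, p=1 / 3.44, rng=rng)
print(history)
```

Plotting a few such histories as tick marks along a time axis gives pictures comparable to the simulated lines described above.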

Is there more clustering in reality than in the results of the null model? I couldn't say, by eye, but I don't have to. I can calculate the probability of generating Moretti's history from the null model: it's somewhat less than 1 in 10^45. This in itself isn't decisive, since any particular history becomes less and less probable as one considers longer and longer intervals of time (cue Stoppard), so we need to know what fraction of all histories of that length are at least that unlikely. I could work this out exactly, if I were willing to do some actual math, but I'm lazy, so I just had the computer simulate a million histories and evaluated all of their likelihoods. If the null model were actually true, we'd see histories like Moretti's only about 0.4% of the time. [1] So this is actually pretty good evidence that the null model is not true, and Moretti's history does show the kind of clustering he thinks it does.
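The general shape of such a simulation test can be sketched as follows. This is one possible setup, not a reconstruction of the calculation reported above: it holds the number of gaps fixed, scores each history by its log-likelihood under the geometric model, and reports the fraction of simulated histories that are at least as unlikely as the observed one. The gap list in the demo is made up purely to exercise the code and is not Moretti's data; all names are hypothetical.

```python
import math
import random


def log_likelihood(gaps, p):
    """Log-probability of a sequence of gaps (each >= 1) under
    independent geometric draws with yearly probability p."""
    n = len(gaps)
    return n * math.log(p) + (sum(gaps) - n) * math.log(1.0 - p)


def sample_gap(p, rng):
    """Count years until the first success of a yearly coin flip."""
    gap = 1
    while rng.random() >= p:
        gap += 1
    return gap


def monte_carlo_p_value(observed_gaps, p, n_sims, rng):
    """Fraction of simulated histories whose log-likelihood is no
    higher than that of the observed history."""
    observed = log_likelihood(observed_gaps, p)
    n = len(observed_gaps)
    at_least_as_unlikely = 0
    for _ in range(n_sims):
        simulated = [sample_gap(p, rng) for _ in range(n)]
        if log_likelihood(simulated, p) <= observed:
            at_least_as_unlikely += 1
    return at_least_as_unlikely / n_sims


# MADE-UP gaps for illustration only; substitute real data to use this.
toy_gaps = [1, 1, 2, 9, 1, 1, 1, 12, 2, 1]
p_value = monte_carlo_p_value(toy_gaps, p=1 / 3.44, n_sims=10_000,
                              rng=random.Random(1835))
print(f"Monte Carlo p-value: {p_value:.4f}")
```

One caveat about this particular setup: with the number of gaps fixed, the log-likelihood depends on the gaps only through their total, so a statistic more sensitive to bunching (the variance of the gaps, say) may be a better choice, and the analysis described above may well have been set up differently. A million simulations is feasible but slow in pure Python, which is why the demo uses ten thousand.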

Of course, this only underlines the question of why Moretti's data is clustered. I can think of a couple of deflating explanations (maybe the clusters match the periods more intensely scrutinized by historians; maybe they tend to adjust when they report genres appearing towards certain focal dates). Or it could be due to some sort of exogenous influence, from war, politics, economic shifts, etc. (I did not try removing the obviously-topical genres, like Chartist novels, and repeating the analysis.) Or it could be due to some sort of endogenous mechanism within the system of literary production and consumption — generational turn-over of authors, of readers, of editors and publishers (suggested by my friend Bill Tozier). Or: maybe there's some space of things-people-like-in-novels, which the popular genres at any one time partition up in various ways; if one genre dies out or another appears, this might destabilize all the others as well. I don't think Moretti's time series, by itself, is enough to begin to let us decide among these mechanisms (some of which are compatible), but I do think it lets us see that some mechanism is called for.

Here is my first reproach: Moretti should have been the one to do this analysis, not me. If testing hypotheses is too banausic and mechanical for the pages of New Left Review, then it should either be in another article, or in the book. Moretti is a shrewd man, and in this case his intuitive analysis of the data was right, but there is no reason to rely on intuition alone for something like this. And, if one is going to go to the trouble to collect quantitative data, one ought to use it quantitatively. Mathematical abstraction (quantitative or otherwise) is not valuable for its own sake, but for the inferences it lets us make, when the proper tools are applied. In this case, those tools are pretty easy to bring to bear. They should be.

Dissolving Genre History

Here is Moretti at the end of "Graphs":

For most literary historians ... there is a categorical difference between 'the novel' and the various 'novelistic (sub)genres': the novel is, so to speak, the substance of the form, and deserves a full general theory; subgenres are more like accidents, and their study, however interesting, remains local in character, without real theoretical consequences. The forty-four genres of figure 9, however, suggest a different historical picture, where the novel does not develop as a single entity—where is 'the' novel, there?—but by periodically generating a whole set of genres, and then another, and another... Both synchronically and diachronically, in other words, the novel is the system of its genres: the whole diagram, not one privileged part of it. Some genres are morphologically more significant, of course, or more popular, or both—and we must account for this: but not by pretending that they are the only ones that exist. And instead, all great theories of the novel have precisely reduced the novel to one basic form only (realism, the dialogic, romance, meta-novels...); and if the reduction has given them their elegance and power, it has also erased nine tenths of literary history. Too much.

On the one hand, this seems to me to be obviously correct. On the other hand, I wonder very much why Moretti stops here. If we look within any one of those forty-four genres, I think we have every reason to suppose that we'd find it composed, in its turn, of sub-genres, and so on, and ultimately of a shifting succession of individual texts. "The" Bildungsroman (to pick one of the forty-four, not entirely at random) is a short-hand way of referring to the most common and enduring features of a historically-changing and always-various population of books, just as "the" bottle-nosed dolphin is an abbreviation for the leading tendencies of a certain population of organisms. What Moretti hints at, in the paragraph I quoted, is that "the" novel is itself a population, either of genres, or of texts structured into genres. But he doesn't say outright what seems very plain to me, and so I'd like to know why, and specifically whether he thinks it's actually wrong, or unhelpful.

The assumptions of population thinking are diametrically opposed to those of the typologist. The populationist stresses the uniqueness of everything in the organic world. What is true for the human species—that no two individuals are alike—is equally true for all other species of animals and plants. Indeed, even the same individual changes continuously throughout its lifetime and when placed into different environments. All organisms and organic phenomena are composed of unique features and can be described collectively only in statistical terms. Individuals, or any kind of organic entities, form populations of which we can determine the arithmetic mean and the statistics of variation. Averages are merely statistical abstractions, only the individuals of which the populations are composed have reality. The ultimate conclusions of the population thinker and of the typologist are precisely the opposite. For the typologist, the type (eidos) is real and the variation an illusion, while for the populationist the type (average) is an abstraction and only the variation is real. [2]

This makes salient the question of how we mark off different populations as distinct. The usual biological criterion is through common descent, and the possibility of inter-breeding, i.e., Mayr's "biological species concept". (There is a vast controversial literature on the details.) Ruth Garrett Millikan has a closely related notion of "reproductively-established families", which doesn't lean so heavily on the details of biology, and which would seem to fit the case of genres of novels. One could also define classes of texts purely morphologically, which might include many unrelated lineages (just as one might consider all streamlined marine predators which live in the water all the time, a class including dolphins, killer whales, sharks, tuna, ichthyosaurs, etc.). Just as such organic forms have appeared in several lineages, morphologically-defined categories could appear in multiple places and periods, the way novels arose, apparently quite independently, in both the Hellenistic world and in China (and elsewhere, for all I know). Historical populations, however, are unique.

Trees

One could ... take evolutionary bibliography as the prototypical evolutionary science and think of biology in terms of bibliographic analogies... [3]

The Cabinet of Horrors

When trying to explain cultural change and cultural variation, people have generally sought to do so by supposing culture is causally driven by something else (the climate, the social structure), or, even more strongly, that it is adapted to something else, or, more strongly yet, that it functions adaptively for the benefit of something else (here social structure, or ruling classes, are favored as suspects over the climate). This has led to an awful lot of (if I may use the phrase) adaptationist just-so stories, and uncritical analogy-mongering on a level with the sort of thinking which leads rhinoceros horn to be prescribed for impotence. Jon Elster is worth quoting at some length:
In his comments on the links among capitalism, Protestantism, and Catholicism Marx set a disastrous precedent for many later writers who have attempted to find "structural homologies" or "isomorphisms" (two fancy terms for "similarities") between economic structures and mental products. Because virtually any two entities can be said to resemble each other in some respect, this practice has no constraints other than the inventiveness and ingenuity of the writer: There are no reality constraints and no reality control.
Marx suggests two inconsistent lines of argument. One is that there is a strong connection between mercantilism and Protestantism, the other that there is an elective affinity between mercantilism and Catholicism. He was confused, apparently, by the fact that money has two distinct features that point to different religious modes. On the one hand, money (gold and silver), unlike credit, can be hoarded. Hoarding easily turns into an obsession, which is related to the fanatical self-denying practices of extreme Protestantism. On the other hand, money can be seen as the "incarnation" or "transubstantiation" of real wealth. In that sense the money fetishism associated with mercantilism is related to the specifically Catholic practice of investing relics and the like with supernatural significance. Both arguments are asserted several times by Marx, each serving to show up the essential arbitrariness of the other. Later attempts to explain the theology of Port Royal, the philosophy of Descartes, or the physics of Newton in terms of similarities with the underlying economic structure are equally arbitrary. Like the analogies between societies and organisms that flourished around the turn of the century, they belong to the cabinet of horrors of scientific thought. Their common ancestor is the theory of "signs" that flourished in the century prior to the scientific revolution inaugurated by Galileo — the idea that there are natural, noncausal correspondences between different parts of the universe. What Keith Thomas refers to as the "short-lived union of science and magic" maintained a subterranean existence of which the doctrine of ideology, in one of its versions, has been one manifestation. [4]

Even if we shutter and lock the Cabinet of Horrors, and go to look for explanations of trends in such cultural products as novels (which is, after all, what Moretti wants), I'm afraid we will find most of them in the capacious Closet of Mildly Appalling Objects. There is no shortage of attempts to give such changes meaning as signs of something else, some aspect of the social or economic structure, of the way we live now (or the way they lived then), but very, very few of them are convincing. In his great book on changing fashions, A Matter of Taste, the sociologist Stanley Lieberson looks at some of the reasons why these attempts at ad hoc explanation are so often bad. (He puts things more politely; I paraphrase.) First, the facts are often just screwy, both about the developments to be explained and about what is supposed to explain them: non-existent trends, non-existent causes, weirdly mis-characterized trends, trends being explained by events which happened long after the former began, etc. (In fairness, such "scholarly misconstruction of reality" is a lot more common than we academics like to think.) Second, the mechanism connecting the explanantia to the explananda is left totally obscure. Third, no attempt is made to test the explanation, by checking that it can account for the magnitude of the observed change, by ruling out alternative explanations, or by much of anything else. The result is a steady stream of claims about how culture works which are advanced with what is, under the circumstances, an astonishing degree of assurance. Lieberson's book provides many fine examples of such cavalier just-so story-telling for names, the decline of hats, etc. [5]

Checking hypotheses about causation, and still more about adaptation, is really hard with just one case, arguably hopeless. What you need is the ability to reliably detect departures from the hypothesis, if they are actually present — "power", in the statisticians' jargon. It is hard to get much power when n=1. If you want to claim that certain aspects of 19th century British novels were the way they were because those features fitted with ideologies of British imperialism — a fairly strong hypothesis about adaptation — I don't see how you can do it just by interpreting Mansfield Park, no matter how subtle and sophisticated your reading. On the other hand, if you look at lots of contemporary novels, and the ones which (say) depict Great Britain's relations with its colonies in the same way as Mansfield Park does are systematically more successful, on average, than those which depict it differently, well then I don't see how that couldn't be good news for your idea, though even that would really only be the beginning of backing it up.

Biologists have given a lot of thought to checking hypotheses about adaptation, and developed many means of doing so. Mutatis mutandis, many of these means could also be applied to literature, or other aspects of culture. Eric Rabkin, Carl Simon and their collaborators have started doing just this with their Genre Evolution Project, looking at short stories from 20th century American science fiction, and no doubt there are others doing this kind of thing too.

One way of checking adaptive hypotheses, especially relevant here, is the "comparative method", or rather methods, which work much, much better when combined with good phylogenies. I think a literary historian who wants to study the evolution of genres and devices would be very well advised to look at the comparative methods biologists employ to study the evolution of qualitative characteristics of organisms. (The major issue would be that literary phylogenies will not be trees but more complicated lattices. But this is analogous to the effects of lateral gene transfer, common among bacteria, and so I'd suspect not only solvable but solved, someplace in the literature. Whether inheritance is by means of discrete-valued, particulate factors, i.e., genes, is not a crucial issue for such methods.) What I really want to see from Moretti (or someone) is a study along these lines of clues in the detective story; I'd be even more interested in one of free indirect discourse.

A crucial aspect of testing hypotheses about adaptation is a contrast with the outcome of a well-crafted neutral model, i.e., a way of saying what to expect if no adaptation were present, or not that adaptation anyway. These often have surprising consequences; for instance, neutral genetic drift will tend to fix some version of a gene in a given population, even if it confers no fitness advantage. (This is described in any book on population genetics.) So I wonder about things like whether we should expect, under a reasonable neutral model, that some formal device should become universal within a genre? If so, did clues take over detective stories any faster than neutrality would predict? (It's hard to imagine a successful genre where every story relies on confessions found by accident, but whether that's intrinsically weirder than actually existing detective stories, I can't say.)
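For readers who have not seen drift in action, here is a minimal sketch of a neutral model of this sort, using Wright-Fisher-style resampling. Everything in it is an illustrative assumption: the function name `drift_to_fixation`, the population of 30 "stories", and the three interchangeable "devices" are hypothetical, and nothing is fitted to any real genre. Each new generation of stories copies its device from a randomly chosen predecessor, with no variant favored over any other.

```python
import random


def drift_to_fixation(pop_size, n_variants, rng, max_gens=100_000):
    """Neutral copying: return (generations, winner) once one variant is universal."""
    # Start with the variants as evenly represented as possible.
    population = [i % n_variants for i in range(pop_size)]
    for gen in range(max_gens):
        if len(set(population)) == 1:
            return gen, population[0]
        # Each new story copies its device from a uniformly random predecessor.
        population = [rng.choice(population) for _ in range(pop_size)]
    return None, None  # no fixation within max_gens


rng = random.Random(1)
runs = [drift_to_fixation(pop_size=30, n_variants=3, rng=rng) for _ in range(200)]
times = [gens for gens, _ in runs if gens is not None]
winners = {variant for _, variant in runs if variant is not None}
print("mean generations to fixation:", sum(times) / len(times))
print("variants that took over in at least one run:", sorted(winners))
```

In a toy like this some device always ends up universal, but which one does so varies from run to run. The distribution of `times` is the kind of baseline against which "faster than neutrality would predict" could be judged, given a neutral model one actually believed.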

The foregoing shouldn't be taken to mean that comparative literature should slavishly imitate comparative biology. There are people who have thought about the application of evolutionary ideas to social and cultural change in ways which are much more sophisticated about psychology, social organization and human interaction than (most) advocates of memetics; I am thinking particularly of David Hull, W. G. Runciman, Dan Sperber, Stephen Toulmin's great The Collective Use and Evolution of Concepts, and even the fragmentary MS. of Adam Westoby. As the economist Richard Nelson writes, we should expect our ideas of general evolution to change as we learn more about cultural evolution. We should also expect to have to develop different methods of data analysis. But, as always, we start with what we already know how to do.

Materialism

I share Moretti's hope for a "materialist sociology of literary form"; Hell, I'd like a materialist sociology of culture generally. But I suspect it won't be able to do everything he wants it to.

When Moretti quotes D'Arcy Thompson on how the form of an object is a diagram of the forces which produced it, I'm happy to go along, and even happy to agree that this gives us some ability to work backwards, from form to force. But this sort of inverse problem generally doesn't have a unique solution, especially if some of the forces were transient and highly contingent... Less metaphorically, something Lieberson argues very convincingly is that we often have to distinguish between the social forces causing there to be a change in some taste, and those which shape the content of the new taste. Often the latter mechanisms are more or less internal to the bit of culture in question, like ratcheting. Or: culture doesn't have to express or reflect the social order. I suspect Moretti would be disappointed if this were the case for, say, genres of novels. Well, so would I. But this needs to be checked. One way would be to try to develop good neutral models, and see whether, and where, they break down.

Dan Sperber has a great essay, in his Explaining Culture, on "how to be a genuine materialist in anthropology", where he complains about treating Capital, the World-System, cultural symbol-systems, mentalities, etc. as reified causal forces, if not self-interested foresightful agents, forgetting that human history, society and culture are actually "real individuals, their activity and the conditions under which they live" (to appropriate a once-famous line). It seems, at least to this interested outsider, that the study of literature in society suffers from this, too. And I think what Sperber advocates there should go here, too: give actual causal accounts of how macroscopic patterns emerge from the interaction of many material bodies (notably, people and books), of the sort we know to exist, endowed with the kinds of abilities we know them to have.

This commitment may sound harmless, because contentless, but it does actually have implications. It means that you have to do a lot of work to justify functionalist explanations (though it's not impossible). It should make you very dubious about ideal types. It should make you more interested in exploring variation, and not dismissing it. It should make you very dubious about "practices" and other shared mental objects, at least as ordinarily conceived. And it suggests a lot of productive directions, investigating communication, cognition, and the collective patterns they produce.

In Graphs, Maps, Trees, as in his Atlas, Moretti is basically looking at the communication end of things. He doesn't say much about cognition, or individual thought more generally. Elsewhere (see e.g. Signs Taken for Wonders) he has dabbled in psychoanalysis, but I hope that's past. A materialist theory of literary form will ultimately have to concern itself with the organic processes of reading and composition, but the way to do this is through empirical study of readers and writers, not more interpretation of texts, or armchair ruminations (whether those are on the primal scene, the environment of evolutionary adaptation, or conceptual blending). Of course literary scholars have been making stabs in this direction at least since Richards's Practical Criticism, but with the advent of cognitive psychology this can be done in a much more systematic way, combining modeling of cognition with experimental tests of the models. [6] Again, many people (e.g., Jerry Hobbs, Herbert Simon) have been proposing this for some little while, but it's only recently, with works like Bortolussi and Dixon's Psychonarratology, that people have begun to actually do it, taking the predictions of various theories of narrative, which say that changing stories in certain ways should affect readers' responses, and seeing whether that's actually right. This, and not desk-bound speculation about analogies, seems to me the proper way to start on a cognitive psychology of literature. It is obviously complementary to what Moretti wants to do, and (this is the sweet part) the two enquiries can be pursued in parallel; neither has to wait for the other.

One thing Moretti does not do, anywhere, is construct models linking interacting individual behavior to aggregate patterns. Economists and sociologists already make such models, and anthropologists are starting to do so. It may be premature here, but ultimately it will be vital. If different social groups have different beliefs, is that because those beliefs express their relations to the mode of production, or is it because they tend to talk more within the group than across group boundaries? Adaptationist theories of culture tend to go for the first choice, but we don't really know whether the latter could account for the specific patterns of cultural difference and change that we see.

How Not to Learn from the Natural Sciences

What I said above about not mindlessly imitating biology deserves some amplification.

Evolution ought to have a bad name in the study of literary history. Reading Rene Wellek's "The Concept of Evolution in Literary History" (or his article for the Dictionary of the History of Ideas) is actually quite depressing. (It brings to mind Kurt Vonnegut's line "they deserved to fail, because they were all so stupid".) The many post-Darwinian ventures in this direction went, essentially, nowhere, at least as far as understanding literature better goes. It surely didn't help that their understandings of biological evolution were often very bad, generally some kind of Spencerian or even Lamarckian belief in tendencies of progressive development — perhaps inspiring, but hopelessly un-explanatory. (This has vitiated far too much evolutionary theorizing about social processes; cf. Toulmin's chapter 5.) As for the more recent wave, since the 1980s, the people who seem to think that literature exists because humanity craves dramatizations of Daly and Wilson's Sex, Evolution and Behavior drive me up the wall. (Their idea makes no sense even if you are very sympathetic to evolutionary psychology, which I am.)

Which said, this is not at all what Moretti is proposing, and I don't see the harm in trying to make this all fit together as another instance of a general pattern, alongside biological evolution, because they have similar causally-relevant features, and so similar mechanisms are at work. Many people have pointed out, in some detail, that explaining biological processes through the joint action of variation and selective transmission in populations is one instance of a general pattern of historical explanation; Toulmin is particularly clear on this [7]. There is a demography of businesses, of interest groups, even of medieval manuscripts of classical works, and so why not one of literary texts? Inheriting discrete, particulate hereditary factors from a small, fixed number of immediate ancestors is not the sine qua non of this form of historical explanation, though the details of the process of inheritance will very strongly affect the character of the resulting dynamics. It might be that theories of literary change cast in this form are too complicated to be useful, or that we just don't know enough yet to find the useful ways to formulate them. But it wouldn't hurt to seriously try, and we'd learn a lot, no matter the eventual outcome.

Varieties of Rational History

One way to take the bit from Braudel about "a more rational history" that Moretti adopts as a motto is simply to hope that literary history will be a rational enterprise. There are various aspects to this — the accumulation of knowledge, a desire to give explanations, a realization that more than one explanation might be possible and a desire to check which one is right, and so on. To do all this, it's important to develop, use and refine reliable methods of inquiry — ones which are unlikely to lead you into error, and where errors are apt to be self-correcting. You want to be able to persuade others, and you want to know that you're not just persuading yourself. As a statistician, my job is to help with that bit, so it looms large for me. I think this is more or less what Moretti has in mind when he talks (elsewhere) about wanting "falsifiable" literary history — for ideas which have enough content that they can not only be communicated from one person to another (without tripping Liberman's detector), but checked. Which said, I wish that here, as in his Atlas, Moretti had done a more systematic job of checking his conclusions. Would it be unfair to suggest that, while he sees the need for data analysis, it will be left to a successor generation to put it into routine practice?

If you want to say that asking literary history to be communicable, testable and reliable is asking it to be scientific and that's icky, well, it's a free country (at least for now). The more I think about what makes something a science, the less that seems like an important question. But whether something is a rational enterprise of inquiry matters. I'm sure it's possible to object to wanting history to be more rational in this sense, but I find that thought so alien and pointless I won't even try to engage it.

Another take on "rational history" is that the vast mass of details in small-scale history are essentially random, or, more exactly, the connections among them are as convoluted and involved as the details themselves. (This is one way to define randomness, mathematically.) But looking at larger scales, the randomness averages out, leaving regularities which are simpler and more nearly comprehensible by finite minds, and more reliable. As a statistical physicist and a statistician, I am the last to disagree: "In fact, all epistemological value of the theory of probability is based on this: that large-scale random phenomena in their collective action create strict, nonrandom regularity." [8] The small-scale details of literature and of human life have an intrinsic interest and value that is missing from the small-scale detail of molecular chaos, so there is certainly all the room in the world for what Moretti would like to do and close reading, and even essayistic appreciation. (But there is not, I am afraid, room enough in the world for Harold Bloom.) Whether there is room in an academy organized around the production of peer-reviewed research findings for all of them, is fortunately not a question I need to have an opinion on.

Finally, you might be tempted to go from the last sense of "rational" to supposing that large-scale history must be the working-out of some scheme which is "rational" in that it's really deterministic, or even teleological. This would be a mistake. It is not at all hard to give examples of stochastic processes which combine random evolution and feedback, which converge on very nice large-scale regularities, but which regularity they converge on is completely random and indeterminate. [9] Brian Arthur, among others, argues that processes like this are important in the evolution of technology. Is literature like that? I have no idea. But I don't see any reason it can't be, and this needs to be borne in mind.

Go Fish

Let me close by quoting the same paragraph twice, once from the version in NLR, and then again from the closing pages of the book. In both cases, he is enumerating themes which stretch across his chapters.

First, a total indifference to the philosophizing that goes by the name of 'Theory' in literature departments. It is precisely in the name of theoretical knowledge that 'Theory' should be forgotten, and replaced with the extraordinary array of conceptual constructions, —theories, plural, and with a lower case 't'—developed by the natural and by the social sciences. 'Theories are nets', wrote Novalis, 'and only he who casts will catch'. Theories are nets, and we should learn to evaluate them for the empirical data they allow us to process and understand: for how they concretely change the way we work, rather than as ends in themselves. Theories are nets; and there are so many interesting creatures that await to be caught, if only we try.
First of all, a somewhat pragmatic view of theoretical knowledge. 'Theories are nets', wrote Novalis, 'and only he who casts will catch'. Yes, theories are nets, and we should evaluate them, not as ends in themselves, but for how they concretely change the way we work: for how they allow us to enlarge the literary field, and re-design it in a better way, replacing the old, useless distinctions (high and low; canon and archive; this or that national literature...) with new temporal, spatial and morphological distinctions.
Whether this pragmatic message is what Novalis meant, I have no idea; I only know the line because Popper used it as the epigraph for The Logic of Scientific Discovery. But that's what Popper meant by it, and I think it's right, and I look forward to seeing the coelacanths and tube-worms and giant squid which will be brought up from the deeps in years to come.

[1]: More on testing the null model of genre appearance, for those into that kind of thing: Really, of course, the most suitable null model for random appearance would be a continuous-time Poisson process. Since the data are discretized by years, however, I'm faking it by using a geometric distribution of inter-arrival intervals. (I also tried simulating from a Poisson process and then discretizing the result; the results weren't much different.) The only parameter of such a process is the mean inter-arrival time, or equivalently the "intensity", the probability per year of producing a new genre. Simple maximum likelihood estimation gives this as 0.2905405, which implies a log-likelihood for the original data of -103.9498. To evaluate the significance, I generated 1,000,000 sample paths, of the same length as Moretti's, and then for each one re-estimated the intensity and used that to evaluate the log-likelihood. (This sort of "bootstrapping" should account for the fact that I fit that parameter to the data in the first place. It wouldn't be appropriate if, say, Moretti had advanced the conjecture that the mean inter-arrival time should be 10 years on independent grounds.) Of the 1,000,000 sample paths, only 3,802 had log-likelihoods as small as or smaller than that of the original data. That is to say, if the null model were correct, we'd see results like this only about 0.38 percent of the time. So we can certainly reject the null model at the conventional 5 percent significance level, or even the 1 percent level, and in fact this is a considerably more severe test than that.
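For the curious, a minimal sketch of this kind of simulation-based test. It is not the code behind the numbers above, and it makes assumptions that should be stated loudly: it follows the variant mentioned in parentheses (simulate a Poisson process, then discretize by year), it scores each history by the Poisson log-likelihood of its yearly counts at the re-estimated rate (one possible choice of likelihood), and both data sets are invented for illustration, not Moretti's. All function names are hypothetical.

```python
import math
import random


def poisson_loglik(counts):
    """Log-likelihood of yearly counts under i.i.d. Poisson, at the fitted rate."""
    total = sum(counts)
    if total == 0:
        return 0.0  # with no events the fitted rate is 0 and the likelihood is 1
    rate = total / len(counts)  # maximum-likelihood estimate: events per year
    return sum(x * math.log(rate) - rate - math.lgamma(x + 1) for x in counts)


def simulate_counts(rate, n_years, rng):
    """Simulate a Poisson process on [0, n_years) and bin the arrivals by year."""
    counts = [0] * n_years
    t = rng.expovariate(rate)
    while t < n_years:
        counts[int(t)] += 1
        t += rng.expovariate(rate)
    return counts


def bootstrap_p_value(observed, n_sims=2000, seed=0):
    """Fraction of simulated histories no more likely than the observed one."""
    rng = random.Random(seed)
    rate_hat = sum(observed) / len(observed)
    ll_obs = poisson_loglik(observed)
    hits = 0
    for _ in range(n_sims):
        sim = simulate_counts(rate_hat, len(observed), rng)
        # poisson_loglik refits the rate on each simulated path.
        if poisson_loglik(sim) <= ll_obs:
            hits += 1
    return hits / n_sims


# Invented 60-year record: 20 genres crammed into four years.
clustered = [0] * 60
for year, n_new in [(5, 6), (20, 5), (38, 5), (52, 4)]:
    clustered[year] = n_new

# Invented contrast: 20 genres, one every third year.
spread = [1 if year % 3 == 0 else 0 for year in range(60)]

p_clustered = bootstrap_p_value(clustered)
p_spread = bootstrap_p_value(spread)
print("clustered:", p_clustered, "spread:", p_spread)
```

The refitting inside the loop is the point of the exercise: each simulated history gets its own estimated rate, just as the real one did, so the comparison does not flatter the observed data for having been used to fit the parameter.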

[2]: Ernst Mayr, What Evolution Is, p. 84, quoting a 1959 paper of his own.

[3]: This is from Sidney Winter's article on "Natural Selection and Evolution" in the New Palgrave Dictionary of Economics (1987), where he works out the analogy in some detail.

[4]: An Introduction to Karl Marx, pp. 183--184.

[5]: "Adventures of a Man of Science", Elif Batuman's wonderfully-titled review of Graphs, Maps, Trees in n+1 magazine, is a quite nice essay, but it also provides what looks like a typical example of the kind of mere plausibility I have in mind:

Perhaps the Holmes stories are not half-baked versions of the "correct" mystery story, but a different kind of mystery story, wherein the nondecodability of clues is not a bug, but a feature. Conan Doyle was writing during the conquest of England by industry and rationalism; perhaps his readers wanted stories about the kinds of magic that are possible within the constraints of science. Holmes categorically rejects the supernatural, not in order to show that the new, rational rules preclude magic, but in order to show that you can still have magic even if you play by the rules. Decodable clues came a "generation" later, with Agatha Christie and the first World War, and became more rigorous after the second—by which time readers wanted to be reminded that the world was still rational. [pp. 146--147]
First of all, it seems bizarre to say that Britain was being conquered by "industry and rationalism" in the 1890s, long after the scientific revolution, the Enlightenment, the Industrial Revolution and all its social consequences, utilitarianism, etc. (Indeed, Mr. Lecky might want to have a few words...) Second, Batuman gives us no reason to think that contemporary readers saw what Holmes did as (pardon the phrase) magic within the bounds of reason alone. Third, even if she were right about the social situation and the cultural product, the hypothesized causal connection is really just another arbitrary analogy, of the sort Elster complained about. Suppose Conan Doyle had been better about using decodable clues than Christie. Would it not then sound just as plausible to say this expresses the triumph of rationalism, followed by a post-war weakening? As it is, Batuman's account seems to appeal, implicitly, to a desire to hang on to older ways of thinking. Either the whole reading public of Britain in the 1890s is being treated, in a grossly anthropomorphic fashion, as a single person, with such a desire, or she is making a quite specific prediction about which readers Conan Doyle appealed to, one which does not seem especially plausible, though it might be tested. (It is utterly unclear whose purposes or needs are invoked by the in-order-to's: Conan Doyle's? his original readers'? society's? But I fear the worst.) Finally, no attempt is made to check that this is the source of the appeal, nor that the later strict decodability of clues really was caused by the World Wars, for the reasons given. I don't know enough to say that this suggestion is false, or that checking it would be impossible. I don't even want to suggest that a book review in a little magazine would be a good place to do such tests. But it doesn't seem to worry Batuman that there is no support for this idea (yet). Let me repeat that I like the essay.

[6]: Incidentally, thinking that cognition is computational, and even that its computational architecture is strongly constrained by organically-evolved developmental processes, in no way commits one to denying that thought is also profoundly cultural and historical. Sperber is very good on this, but also see Frawley's Vygotsky and Cognitive Science, or the papers collected in The Elements of Reason.

[7]: Of course it isn't the only pattern of successful historical explanation. Even within the natural sciences, geology and astronomy provide very different ones.

[8]: Gnedenko and Kolmogorov, Limit Distributions for Sums of Independent Random Variables, p. 1.

[9]: More exactly, there are stochastic processes ("urn schemes") where the relative frequencies of different outcomes are guaranteed to converge, with 100% probability, but the ratio to which they converge is itself a random variable, not determined by the initial set-up in any way. The models of lock-in developed by Brian Arthur and his collaborators in the 1980s are urn models, but actually less indeterministic than the classical ones.
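
For the curious, here is a minimal sketch of the simplest classical urn scheme, the Polya urn, as an illustration of note [9]. It is my own toy example, not code from Arthur's lock-in models; the function name and parameters are made up for the purpose. Start with one red and one black ball, draw a ball at random, and return it along with another of the same color. Within a run the fraction of red balls settles down; across runs it settles on different values, since for this starting composition the limit is uniformly distributed on [0,1].

```python
import random


def polya_urn_path(n_draws, seed, red=1, black=1):
    """Simulate a classical Polya urn (illustrative helper, not from the post).

    Returns the fraction of red balls after each draw.
    """
    rng = random.Random(seed)  # seeded so that each run is reproducible
    path = []
    for _ in range(n_draws):
        # Draw a ball with probability proportional to the current contents,
        # then put it back together with one more ball of the same color.
        if rng.random() < red / (red + black):
            red += 1
        else:
            black += 1
        path.append(red / (red + black))
    return path


if __name__ == "__main__":
    # Same initial set-up every time; only the random draws differ.
    for seed in range(5):
        path = polya_urn_path(20000, seed)
        print(seed, round(path[999], 3), round(path[-1], 3))
```

Comparing the fraction after 1,000 draws with the fraction after 20,000 shows how little each run moves once it has settled, while different seeds will generally settle in quite different places.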


Manual trackback: Reprieved; Crooked Timber; Pedantry; Three Quarks Daily; Idiocentrism; An Unenviable Situation ("deeply offensive").

Update, 7 February: Seth Edenbaum has more on why he dislikes this post so much — and why he dislikes me (or at least my online persona; I don't believe we've ever met). I think he's wrong, both about this and about me, but it's only right to point to criticisms.

The Commonwealth of Letters; Writing for Antiquity; Biology; Enigmas of Chance

Posted by crshalizi at January 24, 2006 16:20 | permanent link

January 18, 2006

Lecture Notes on Stochastic Processes (Advanced Probability II)

I've started putting the notes for my lectures on stochastic processes (36-754) online at the course homepage.

Contents
Table of contents, which gives a running list of definitions, lemmas, theorems, etc. This will be updated with each new lecture.
Lecture 1 (16 January)
Definition of stochastic processes, examples, random functions
Lecture 2 (18 January)
Finite-dimensional distributions (FDDs) of a process, consistency of a family of FDDs, theorems of Daniell and Kolmogorov on extending consistent families to processes
Lecture 3 (20 January)
Probability kernels and regular conditional probabilities, extending finite-dimensional distributions defined recursively through kernels to processes (the Ionescu Tulcea theorem).
Homework Assignment 1 (due 27 January)
Exercise 1.1; Exercise 3.1. Solutions.
Lecture 4 (23 January)
One-parameter processes and their representation by shift-operator semi-groups.
Lecture 5 (25 January)
Three kinds of stationarity, the relationship between strong stationarity and measure-preserving transformations (especially shifts).
Lecture 6 (27 January)
Reminders about filtrations and optional times, definitions of various sorts of waiting times, and Kac's Recurrence Theorem.
Homework Assignment 2 (due 6 February)
Exercise 5.3; Exercise 6.1; Exercise 6.2. Solutions.
Lecture 7 (30 January)
Kinds of continuity, versions of stochastic processes, difficulties of continuity, the notion of a separable random function.
Lecture 8 (1 February)
Existence of separable modifications of stochastic processes, conditions for the existence of measurable, cadlag and continuous modifications.
Lecture 9 (3 February)
Markov processes and their transition-probability semi-groups.
Lecture 10 (6 February)
Markov processes as transformed IID noise; Markov processes as operator semi-groups on function spaces.
Lecture 11 (8 February)
Examples of Markov processes (Wiener process and the logistic map). Overlaps with solutions to the second homework assignment.
10 February
Material from section 2 of lecture 10, plus an excursion into sofic processes.
Lecture 12 (13 February)
Generators of homogeneous Markov processes, analogy with exponential functions.
Lecture 13 (15 February)
The strong Markov property and the martingale problem.
Homework Assignment 3 (due 20 February)
Exercises 10.1 and 10.2.
Lecture 14 (17, 20 February)
Feller processes, and an example of a Markov process which isn't strongly Markovian.
Lecture 15 (24 February, 1 March)
Convergence in distribution of cadlag processes, convergence of Feller processes, approximation of differential equations by Markov processes.
Lecture 16 (3 March)
Convergence of random walks to Wiener processes.
Homework Assignment 4 (due 13 March)
Exercises 16.1, 16.2 and 16.4.
Lecture 17 (6 March)
Diffusions, Wiener measure, non-differentiability of almost all continuous curves.
Lecture 18 (8 March)
Stochastic integrals: heuristic approach via Euler's method, rigorous approach.
Lecture 19 (20, 21, 22 and 24 March)
Examples of stochastic integrals. Ito's formula for change of variables. Stochastic differential equations, existence and uniqueness of solutions. Physical Brownian motion: the Langevin equation, Ornstein-Uhlenbeck processes.
Lecture 20 (27 March)
More on SDEs: diffusions, forward (Fokker-Planck) and backward equations. White noise.
Lecture 21 (29, 31 March)
Spectral analysis; how the white noise lost its color. Mean-square ergodicity.
Lecture 22 (3 April)
Small-noise limits for SDEs: convergence in probability to ODEs, and our first large-deviations calculations.
Lecture 23 (5 April)
Introduction to ergodic properties and invariance.
Lecture 24 (7 April)
The almost-sure (Birkhoff) ergodic theorem.
Lecture 25 (10 April)
Metric transitivity. Examples of ergodic processes. Preliminaries on ergodic decompositions.
Lecture 26 (12 April)
Ergodic decompositions. Ergodic components as minimal sufficient statistics.
Lecture 27 (14 April)
Mixing. Weak convergence of distribution and decay of correlations. Central limit theorem for strongly mixing sequences.
Lecture 28 (17 April)
Introduction to information theory. Relations between Shannon entropy, relative entropy/Kullback-Leibler divergence, expected likelihood and Fisher information.
Lecture 29 (24 April)
Entropy rate. The asymptotic equipartition property, a.k.a. the Shannon-McMillan-Breiman theorem, a.k.a. the entropy ergodic theorem. Asymptotic likelihoods.
Lecture 30 (26 April)
General theory of large deviations. Large deviations principles and rate functions; Varadhan's Lemma. Breeding LDPs: contraction principle, "exponential tilting", Bryc's Theorem, projective limits.
Lecture 31 (28 April)
IID large deviations: cumulant generating functions, Legendre's transform, the return of relative entropy. Cramer's theorem on large deviations of empirical means. Sanov's theorem on large deviations of empirical measures. Process-level large deviations.
Lecture 32 (1 May)
Large deviations for Markov sequences through exponential-family densities.
Lecture 33 (2 May)
Large deviations in hypothesis testing and parameter estimation.
Lecture 34 (3 May)
Large deviations for weakly-dependent sequences (Gartner-Ellis theorem).
Lecture 35 (5 May)
Large deviations of stochastic differential equations in the small-noise limit (Freidlin-Wentzell theory).
References
The bibliography, currently confined to works explicitly cited.
Everything to date

In the staggeringly-unlikely event that anyone wants to keep track of the course by RSS, this should do the trick.

Enigmas of Chance; Corrupting the Young

Posted by crshalizi at January 18, 2006 12:00 | permanent link

January 17, 2006

7x7

The last time I did one of these things, my weblog broke, resulting in a several-month hiatus. But, since you asked, and since this lets me procrastinate working on my lecture notes, my research, my house and my contribution to the Moretti event at the Valve all at once...

1. Seven things to do before I die

omitting the really important ones, which are all private: (i) Learn Persian. (ii) Learn Chinese. (iii) Travel the Silk Road from Chang'an to Constantinople. (iv) Cast bronze. (v) Write that textbook on statistical analysis of complex systems I keep talking about. (vi) Write that novel I keep not talking about. (vii) See New York.

2. Seven things I cannot do

to my regret: (i) Roll my R's. (ii) Speak anything other than English. (iii) Ignore a live television in my field of vision. (iv) Make small talk. (v) Finish things ahead of time. (vi) Understand musicians talking about music. (vii) Let things go.

3. Seven things that attract me to [Pittsburgh]

(i) The abundance of romanesque arches. (ii) Being told so often that it's a lot cleaner than it used to be, or so they've heard. (iii) Living and working in the shadow of a tower which was manifestly built to summon the Dark Gods. (iv) Wholey's, the Pennsylvania Macaroni Company, and the rest of the Strip. (v) The ability to afford a leafy yuppie-scum neighborhood. (vi) The feeling that, at last, I am among my people. (vii) The best job at the best university I could possibly hope for (until I publish more).

4. Seven things I say most often

(i) "Which, clearly, is absurd." (ii) "Take your favorite [mathematical object]". (iii) "There's a neat paper where". (iv) "Have you read [book/paper/story/post]?" (v) "This is a good idea why, exactly?" (vi) "Let me try to say that again, only this time coherently." (vii) "I'm sorry."

5. Seven books (or series) that I love

Seven? Just seven? Couldn't I pick forty-nine instead? Or three hundred and forty-three? Well, by free association then. (i) Science and Civilisation in China (Needham). (ii) The Cartoon History of the Universe (Gonick). (iii) The Pursuit of the Millennium (Cohn). (iv) The City and the Stars (Clarke). (v) Vehicles (Braitenberg). (vi) His Master's Voice (Lem). (vii) Patricia McKillip's fantasy novels. You're sure I can only pick seven?

6. Seven movies that I watch over and over again (or would if I had the time)

or would if I watched movies over and over again. (i) The Seven Samurai. (ii) Yojimbo. (iii) Sanjuro. (iv) The Hidden Fortress. (v) Apocalypse Now. (vi) Repo Man. (vii) The Usual Suspects.

7. Seven people I want to join in, too.

(i) Dave Bacon. (ii) Wolfgang Beirl. (iii) John Burke. (iv) Tim Burke. (v) Aaron Clauset. (vi) The crew at Halfway Down the Danube. (vii) Ken MacLeod.

Linkage; Self-Centered; Heard About Pittsburgh, PA

Posted by crshalizi at January 17, 2006 10:49 | permanent link

January 05, 2006

In Memoriam James Allan Klinkner

My father-in-law passed away last month. Here is his obituary.

I wasn't lucky enough to know Jim before his illness; even so, he was one of the most vital people I've ever known, eager to embrace all life's possibilities. He was a man of considerable talents in many directions, of curiosity and accomplishment. A holder of strong opinions, he respected them in others, even when he disagreed. His love for those who mattered to him was sincere and profound. If I had to bear the same burden, I hope I would have it in me to do so with as much fortitude.

That burden has, at last, been lifted from him. I miss him. I'm going to keep missing him.

Posted by crshalizi at January 05, 2006 10:00 | permanent link

January 03, 2006

The Evolution of Complexity

... is a fascinating area of inquiry, containing a small quantity of outstanding work, surrounded by a vast expanse of rubbish. This is your chance to help improve the ratio, if you can fit your contribution into six pages by February 28th.

Workshop on the Evolution of Complexity
June 3rd, 2006, Bloomington, IN, USA
as a part of the
Tenth International Conference on the Simulation and Synthesis of Living Systems: ALife X

The evolution of complexity is a central theme in Biology. Yet it is not without any ambiguity. Complexity has been used to refer to different things. For instance, complexification has been interpreted as a process of diversification between evolving units or as a scaling process that is related to the idea of transitions between different levels of complexity. Other meanings of complexity have been introduced, both inside and outside the realm of Biology. What concerns most researchers is to get insight into the mechanisms that produce their notion of complexity.

The focus of this workshop will be on biological interpretations of complexity and the driving mechanisms: primarily we want the focus to be on evolutionary and related dynamics as mechanisms for producing complexity. Furthermore, we want to bring together historical and novel research in this context.

Questions to be addressed at the workshop include:

  • What are the environmental constraints of complexity growth in living systems?
  • What is the origin and role of developmental mechanisms in evolution?
  • Are the principles of natural selection, as they are currently understood, sufficient to explain the evolution of complexity?
  • What are the limits at different levels to the evolution of complexity, and which conditions could reduce evolved complexity?
  • <Which models are | What language is> more appropriate to <understand | speak about> the evolution of complexity in living systems?
  • How could complexity growth be measured or operationalised in natural and artificial systems?
  • How can data from nature be brought to bear on the study of this issue?
  • What are the main hypotheses about complexity growth that can actually be tested?
  • Is it possible to <direct|manipulate> the evolution of complexity, or which benefits would bring its understanding?

See the full call for papers for more information about submission, publication, peer-review, etc. (I am on the program committee, which means I will be one of the reviewers.)

Biology; Complexity

Posted by crshalizi at January 03, 2006 17:52 | permanent link

Three-Toed Sloth:   Hosted, but not endorsed, by the Center for the Study of Complex Systems