?>
From the Abandoned Places photo-group I posted about this summer:

The photographer, one Reik, helpfully notes that "It's near St. Petersburg, Russia"; once upon a time, rather too near.
Posted by crshalizi at December 30, 2006 22:27 |
| permanent link
The Ninth European Conference on Artificial Life will be held September 10--14 in Lisbon. The "European" in ECAL just refers to the location. No matter where you are in the world, if you work on any of the following subjects (among others),
|
Origin and synthesis of Life: artificial chemistries, autocatalytic systems, pre-biotic evolution, self-replication, self-organization, molecular self-assembly; Information and Measures of Complexity; Networks and Complex Systems; Evolutionary Robotics and Autonomous Agents: exploitation and evolution of morphologies, 3D rapid prototyping printers, non-holonomic robot control, bio-morphic engineering, self-assembly, evolvable hardware, collective robotics; Models of Brain-Body-Environment interaction: the question of boundaries, the dynamical systems approach, agency, homeostasis and autopoiesis, sensorimotor coordination, action-perception loops; Morphogenesis and Development; Learning, Adaptive Behaviour and Evolution; Social Behavior: language, social interaction, communication, swarm intelligence, ant systems; A-Life Art; Epistemological Issues, Tools and Methodologies; Philosophical Issues; Ethical and Social Issues |
![]() |
Posted by crshalizi at December 29, 2006 00:59 |
| permanent link
The constancy with which the same crimes repeat themselves every year with the same frequency and provoke the same punishment in the same ratios, is one of the most curious facts we learn from the statistics of the courts; I have stressed it in several papers; I have repeated every year: There is an account paid with a terrifying regularity; that of the prisons, the galleys, and the scaffolds. This one must be reduced. And every year the numbers have confirmed my prevision in a way that I can even say: there is a tribute man pays more regularly than those owed to nature or to the Treasury; the tribute paid to crime! Sad condition of human race! We can tell beforehand how many will stain their hands with the blood of their fellow creatures, how many will be forgers, how many poisoners, almost as one can foretell the number of births and deaths.
Thomas Pynchon (to my admittedly-philistine mind, the one worthwhile passage in Vineland):
If patterns of ones and zeroes were "like" patterns of human lives and deaths, if everything about an individual could be represented in a computer record by a long string of ones and zeroes, then what kind of creature would be represented by a long string of lives and deaths? It would have to be up one level at least—an angel, a minor god, something in a UFO. It would take eight human lives and deaths just to form one character in this being's name—its complete dossier might take up a considerable piece of the history of the world.
Randall "xkcd" Munroe:

Posted by crshalizi at December 10, 2006 14:12 |
| permanent link
Posted by crshalizi at December 03, 2006 23:17 |
| permanent link
Attention Conservation Notice: Over 3000 words of technical commentary on a paper on the statistical analysis of networks. Does a poor job of explaining things to those without background knowledge of networks, statistical inference and Markov chains. Includes some geeky jokes and many equations.
Yesterday in the Statistical Learning for Networks seminar, we discussed the following paper; what follows are a mix of my notes from before and after the discussion.
It's perhaps not completely clear from the abstract that their method works for a particular class of network growth models, which they call "duplication-attachment" (DA) models. These are pure growth models, where the network only expands, and never loses nodes or edges. (The network is assumed to be undirected, without self-loops or multiple edges between a given pair of nodes.) At each time step, we add one node. This is randomly chosen, with fixed probability, to be either a duplication or an attachment. If it's an attachment, the new node attaches to an old one, chosen uniformly over the graph (possibly with some fixed probability < 1). If it's a duplication event, we pick an existing node to duplicate, and the new one gets a certain probability of copying each of its model's links (independently), and a different probability of being linked to its model. It is entirely possible that a new node is added with no links to any existing node. Notice that nodes (and edges) have no intrinsic properties in this model; their names are arbitrary. Any two isomorphic graphs should therefore be assigned the same probability.
(The motivation for such models is that gene duplication is apparently fairly common, at least over evolutionary time, which would duplicate the interactions between genes, or their proteins. Attachment, here, is supposed to summarize all the other processes of network growth. There are several models of the evolution of protein interaction networks, e.g. those of Sole et al., 2002, and Vazquez et al., 2003, which are basically of the duplication-attachment type, and yield networks which at least qualitatively match some features of real-world examples, like degree distributions. These papers are not cited here.)
From any starting graph, it is easy to run the model forward and generate
new, larger random graphs; the probabilities involved are all pretty simple
uniform and binomial things. In fact, the current state of the graph
completely determines the distribution of future graphs, so this is a Markov
chain. The transition probabilities are fixed by the duplication and
attachment parameters, collectively
, and
these, together with a distribution over starting graphs, give us
a fully-specified stochastic model.
Normally, statistical inference for Markov chains is fairly straightforward, because most of the classical conveniences which make inference for IID data tractable are still available. (This is, after all, what led Markov to his chains!) So why then does the paper not end on the second page, with a citation "See Billingsley (1961)"? Because normally, when we observe a Markov chain, we observe a sequence of values from the chain, and that lets us estimate the transition parameters. Here, however, we have only the end-state, the final graph, as our data. The Markov chain will let us assign likelihoods to paths, but we don't know where we started from, and we don't know where we went from there, just how we ended up here.
Suppose we did know where we started from, some graph of
size
,
. (Remember here that each step of the chain adds one node,
so t really counts nodes, not clock-time. This is why it's natural to
start at
, as opposed to 0 or 1. The paper does not seem to
explain this point.) If we knew our initial state, then in principle we could
figure out the probability of reaching our final state from it, as a sum over
all possible paths:
is set of all growing sequences of
graphs which start at
and end
at
. This mathematical expression is a
mouthful, admittedly, but it's probably clearer in a picture.
to
the final, observed graph
. The chain tells us the probability
of each such path. Since we had to take one, and only one, of these paths, the
total probability of making the journey is the sum of the probabilities of all
the individual paths.
At this point, any physicists in the audience should be nodding their heads; what I've just said is that the likelihood, from a given starting configuration, is a sum over histories, or a path integral. Along with the authors, I'll return to how to evaluate this path integral presently, but first we need to figure out how to get that starting configuration.
If we had a known distribution over starting graphs, we could (in principle) just evaluate the likelihood conditional on each starting graph, and then take a weighted sum over graphs. This, however, is not what the authors do. (I'm really not sure where one would find such a distribution, other than another model for graph development. Bayesian practice would suggest picking something which led to easy computations, but this makes a mockery of any pretense to either modeling nature, or to representing incomplete prior knowledge.) Instead, they try to use the known dynamics of the DA model to fix on an unambiguous starting point, and do everything conditional on that.
They observe that you can take any graph, and, for each node, identify the
other nodes it could have been copied from. (If A could have been copied
directly from B, then A's neighbors must be a subset of B's [ignoring the link
between A and B, if any].) So, from any starting graph, you can recursively
remove nodes that could have arise through simple duplications. In general, at
each stage in this recursion there will be multiple nodes which could be
removed, and their choice is arbitrary. Remarkably enough, no
matter which choices one makes, the recursion always terminates at
the same graph. (More exactly, any two end-points are isomorphic to
each other, and so identical for statistical purposes.) The proof is basically
a fixed point theorem about a partial order defined on graphs through
deletion-of-duplicates, but they confine it to the supplementary
materials, so you can take it on trust (and they don't use
such lattice-theoretic language even there). This graph
--- the data, minus everything that could be pure duplication --- is what they
take as their starting point. This is the
to the data's
. Everything is then done conditional on
.
OK, we have our initial condition and our final condition, and we have our Markov chain, so all we've got to do so is evaluate the integral over paths linking the two.
Problem: There are too many paths. In the worst case, the number of paths is going to grow factorially with the number of nodes in the observed graph. Even though along each path we've just got to do some straight-forward multiplication, simply enumerating all the paths and summing over them will take us forever. (The authors discuss some algorithmic tricks for speeding up the exact calculation, but still get something super-exponential!) Thus, evaluating the path integral for the likelihood is intractable, even for a single parameter value.
Solution: Don't look at all the paths. Rather, sample some paths, say N of them, evaluate the likelihood along each, and average. Hope that this converges quickly (in N) to the exact value of the integral. This is, after all, how physicists approach many path integrals.
Problem: Even if N is fairly small, we need to examine
many settings of the parameter
. It could still kill us to
have to sample N distinct paths for each parameter value.
Solution: Use importance sampling. Draw a single path,
valid for all parameter values, and evaluate the likelihood in terms of the
value of an "importance weight" along this path. The weight has to be a
function of
, but it should be the only thing which is. We
do this here by writing the likelihood,
, as an
expectation with respect to a reference measure, which the authors write
. This reference measure is given by
another Markov chain, called the "driving chain"; despite its name, it
is not a member of the DA family of chains. The trick here is
that one sample of possible paths, generated according to this chain,
can be used to (approximately) evaluate the likelihood at all
parameter settings of the DA model.
The crucial equation is [3] on p. 7568
,
but this is wrong.) Let's unpack this a bit.
is the probability of producing the
graph
through the addition of the
node
, with parameter
setting
. (N.B.,
must be a "removable"
node, one which could have been added by duplication.)
is this transition probability, summed over all
possible
. The first two factors in S are what we
want, the probability we'd get moving forward along the path according to the
parameter
. The third term is the reciprocal of the
transition probabilities according to the driving chain. Its only job is to
cancel those probabilities out.
The algorithm for generating the ith sample path is then as
follows. Start with the observed graph
. Count backwards,
Pick a
node
to delete, with probability
proportional to
. (Once
again, this limits us to the "removable" nodes, the ones which could have been
produced by duplication.) Set
to be the result of deleting that node. Keep going
back until we hit the irreducible core,
. (We will always hit
this core, by the fixed point theorem proved in the supplementary results.)
Then
.
So, to summarize: We can generate a sample of paths connecting the observed final graph to the unobserved initial graph, according to the driving chain, and then approximate the likelihood for any parameter value by multiplying the importance weights along those paths and summing over paths. (The importance weights themselves even factor nicely.) We have thus solved the problem of evaluating a path integral, when we've got only one end of too many possible paths.
The trick used here to pull this off depended on having a uniquely-defined
starting point for all parameter settings, namely the
defined
through undoing duplications. (According to the authors, they took this from
papers on the coalescent process in population genetics, but I have not been
able to track down their references.) Strictly speaking, everything is
conditional on that starting point. Of this, more below.
Left unaddressed by the above is the question of how many paths we need to sample. Remember, the whole point is to not have to look at every possible path! If it turns out that accurate approximations to the likelihood require us to sample some substantial fraction, then this is all for nothing. However, the authors' figures reveal something quite remarkable. Whether N is 10 or 1000, the approximate likelihood changes very little (at least on a log scale), even with real data. This suggests that we don't, actually, need a lot of paths, but why?
For each i,
is an independent
realization of a random variable, whose distribution depends
only
(holding fixed the driving chain, and the initial and
final graphs). Since they are IID, we can apply the central limit theorem,
which tells us that their mean should converge at rate
.
Since that's noticeably smaller for N=1000 than for N=10, it must be the case
that the variance of the likelihood along the individual sample paths is
already pretty small. Why?
The lame physics answer is, "the principle of least action". There will be
optimal, most-probable paths, and they will dominate the sum, the others
tending to make negligible contributions. With high probability, a random sample
will pick out the most probable paths. Q.E.D. This argument could, perhaps,
be made less lame through an application of large deviations results,
specifically conditional-limit-theorem- (or "Gibbs's conditioning principle"-)
type results for Markov chains, which roughly say that if something improbable
(a passage from
to
) happens, it does so in
the least-improbable possible way, and deviations from that trajectory are
exponentially suppressed. <televangelist>In the name of Cramér,
in the name of Varadhan, in the name of Freidlin and Wentzell, I call on you,
Brother Argument, to be healed! Arise and prove! And, cf. Eyink
(2000).</televangelist>
An information-theoretic answer is to invoke the asymptotic equipartition
principle, a.k.a. the Shannon-McMillan-Breiman theorem. This says that
if
are generated according to a (well-behaved)
stochastic process, whose distribution is
, and
is a sufficiently well-behaved model, then
is the entropy rate of the data-generating process
, and
is the relative entropy rate
between the data source and the model, i.e., the asymptotic growth rate of the
Kullback-Leibler divergence. (For details, see Algoet and Cover, 1988, or
Gray, 1990). So
The biggest unclarity of all, probably, is the role of
.
Recall that we reached this by removing nodes which could have been
added by pure duplication. There is, however, no particular reason to think
that the actual growth of the graph ever passed through this state. It has the
advantage of giving us a unique starting point for the chain, but there are,
potentially, others. One, of course, is the trivial network consisting of a
single node! Another possibility (which came up in the discussion, I think
mostly due to Anna Goldenberg) is to
first remove potential duplicates, as the authors do, and then remove nodes
which have only one link to them, as clear attachments. This process of
unwinding the attachments could potentially be iterated, until no "danglers"
are left. This, too, is a uniquely-defined point. We can then go back to
removing nodes which are, now, potential duplicates, and so on. Someone (I
forget who) suggested that this might always terminate at the one-node network;
it would be nice to either show this or give a counter-example. But if there
is some principled reason, other than tractability, to use
their
, I can't figure out what it is from this paper.
Only using a growing network, and in particular only focusing on growth through duplication, certainly simplifies the computational problem, by reducing the number of possible paths which could terminate in the observed graph. Deletion of nodes and edges is however going to be very important in more biologically-plausible models, to say nothing of models of social or technological networks. Presumably the trick of using a backward-looking chain which stops when it hits a unique starting configuration could still be used with deletions --- I think the authors are hinting as much in their conclusion --- but it's not clear to me that a unique starting point is appropriate. With biological interaction networks, for example, one might argue that, e.g., metazoans have been around for a long time, so the distribution of networks ought to be close to stationary, and so starting configurations should be drawn from an invariant distribution of the appropriate chain...
This raises two further points, which are not un-related: the asymptotics of the DA model, and the biological utility of such a model. Run for a long time, the DA model will produce graphs of unbounded size, but it's not immediately obvious what these graphs will look like. In particular, what will be their degree distribution? The Barabasi-Albert model (Albert and Barabasi, 2002) produces scale-free distributions, <boosterism>because it uses the same mechanism as Herbert Simon's classic paper</boosterism> (Bornholdt and Ebel, 2001). This relies on a rich-get-richer dynamic, where nodes with high degree are more likely to attract new edges. My initial thought was that this wasn't present in the DA model, because targets for attachment and targets for duplication are both chosen uniformly. However, someone in the discussion --- I think, though I may be mis-remembering, that it was Tanzy Love --- pointed out that while high-degree nodes are no more likely to be copied than low-degree nodes, edges into high-degree nodes are more likely to be copied than edges into low-degree nodes. This is because if a node has degree k, there are k other nodes whose duplicated could end up linking to it. It may even be the case that this is falls under the theorems in Simon... Presumably the asymptotics would only become harder to handle if we added events deleting nodes or edges.
As for the biological utility, I'll repeat that none of the nodes have any identity of their own; only their role in the network of relations represented by the edges has any bearing on the model. "If this be structuralism, make the most of it": by turning it into a neutral model for the evolution of biological networks. After all, there is no reason here to duplicate certain nodes or edges, it's all just uniform chance. One key use of neutral models is to provide a background against which to detect adaptation; how could we do that here?
References:
Albert, Réka and Albert-László Barabási (2002),
"Statistical Mechanics of Networks", Reviews of Modern Physics
74 (2002): 47--97 = cond-mat/0106096
Algoet, Paul H., and Thomas M. Cover (1988), "A Sandwich Proof of the
Shannon-McMillan-Breiman Theorem", The Annals of
Probability 16: 899--909
Billingsley, Patrick (1961). Statistical Inference for Markov
Processes. Statistical Research Monographs, vol. 2. Chicago:
University of Chicago Press.
Bornholdt, Stefan and Holger Ebel (2001), "World-Wide Web scaling exponent
from Simon's 1955 model", Physical Review E 64:
035104 = cond-mat/0008465
Eyink, Gregory L. (2000), "A Variational Formulation of Optimal Nonlinear
Estimation", Methodology and Computing in Applied Probability
(submitted) = physics/0011049
Gray, Robert M. (1990), Entropy and Information Theory.
Berlin:
Springer-Verlag. Full text
free online.
Simon, Herbert A. (1955), "On a Class of Skew Distribution Functions",
Biometrika 42: 425--440
Solé Ricard V., Romualdo Pastor-Satorras, Eric Smith and Thomas
B. Kepler (2002), "A model of large-scale proteome evolution", Advances
in Complex Systems 5 (2002):
43--54 = cond-mat/0207311
Vázquez, A. and A. Flammini and A. Maritan and A. Vespignani (2003),
"Modeling of protein interaction networks", Complexus
1:
38--44 = cond-mat/0108043
Posted by crshalizi at December 01, 2006 08:15 |
| permanent link
Speaking, as I was, of this course, I've just spruced up the course homepage in anticipation of the coming semester. Comments, particularly on last year's lecture notes, are most welcome. (Reader M.H. has already won a free copy of the book, if it ever materializes, for his very helpful suggestions.)
Also, I have made a promise to try to post at least once a week, so that certain people know I'm alive (or at least that one of my scripts is still running).
Posted by crshalizi at November 29, 2006 02:40 |
| permanent link
"Concentration of measure" is a phenomenon in probability theory where, roughly speaking, any set which contains a substantial fraction of the probability can be expanded just a little to yield a set containing most of the probability. Another way to say this is that, given any reasonably continuous function, the probability that it deviates from its mean is exponentially small, and the exponential rate does not depend on the precise function. This makes concentration of measure results extremely useful for questions involving the estimation of complicated and ugly functions. The classical work in this area proves concentration-of-measure for various kinds of sequences of independent variables, but for real applications in statistics, machine learning or physics you'd want to be able to handle dependence. The natural way to do this would be to look at mixing processes, which are at least asymptotically independent.
Leo Kontorovich, who was one of the students in my stochastic processes class this past spring, now has a paper summarizing his work on, precisely, concentration of measure for mixing sequences:
Since I'll be teaching stochastic processes again in the spring, I would very much like to claim that Leo wrote these papers as a direct result of having taken my class. But the truth is that Leo knew so much about this already that, so far teaching him everything he knows, I learned almost all I know about concentration from him. But this is one of the real pleasures of teaching...
Posted by crshalizi at November 27, 2006 22:57 |
| permanent link
I've started putting the notes for my lectures on stochastic processes (36-754) online at the course homepage.
In the staggeringly-unlikely event that anyone wants to keep track of the course by RSS, this should do the trick.
Posted by crshalizi at November 27, 2006 22:28 |
| permanent link
Posted by crshalizi at November 27, 2006 11:42 |
| permanent link
Two more publications are now available (but were done some time ago).
One is a fairly straightforward paper, which you can get from arxiv.org.
This paper grows out of work Matthew did in Ann Arbor, when he visited last summer. (Our summer, not his.)
The other is a little more convoluted.
Now, back to work.
Minds, Brains, and Neurons; Complexity; Physics; Philosophy; Self-Centered
Posted by crshalizi at October 22, 2006 16:42 |
| permanent link
When considering slime molds, about which I hope to have more to say soon, we should not forget their natural predators [cache]. (Via Matthew Berryman, in e-mail.)
Posted by crshalizi at October 22, 2006 16:42 |
| permanent link
(Via Matthew Berryman in e-mail)
Posted by crshalizi at October 22, 2006 16:42 |
| permanent link
Posted by crshalizi at October 15, 2006 20:12 |
| permanent link
These handouts are shamelessly ripped off derivative work,
amplifying and expanding those created
by Tom Minka when he
invented this course. (See his
originals here.) Posted
here in response to a number (> 1) of requests.
See here for the first three
weeks' handouts.
Note to students in 36-350: This page will not keep up to date with the handouts, or with other course documents; use Blackboard!
Posted by crshalizi at October 15, 2006 19:57 |
| permanent link
I am sure that I speak for all of us in the Statistics Department at CMU (especially my erstwhile fellow bloggers) when I say we're very pleased and excited to have Prof. Mark Liberman as our seminar speaker on 16 October, a week from tomorrow. Mark is coming to us not as the impressario of LanguageLog (though, in that role, his scourging of sloppy data analysis, whether socio-political or neurosexual, is a joy to behold), nor as the director of the Linguistic Data Consortium, nor even to speak about bibliomics. Rather, he'll be talking about some work which combines the stochastic linear learning models (of the sort pioneered by, among others, Frederick Mosteller) with agent-based modeling of cultural evolution. Which is to say, he's talking about aggregate behaviors of interacting stochastic processes which are more interesting than just the central limit theorem.
Monday, October 16, at 4:30 pm in Baker Hall A51; free and open to the public.
The Collective Use and Evolution of Concepts; Enigmas of Chance
Posted by crshalizi at October 12, 2006 11:42 |
| permanent link
Blog near-silence will continue until bootstrap testing results improve. In the meanwhile, it occurs to me that I haven't mentioned that I have a new paper.
Thanks — over and above the acknowledgments in the paper &mdash are due to Jörg Reichardt for help implementing his algorithm, and to Anna Goldenberg for very thorough (not to mention patient!) editing, resulting in a much better paper.
There are about a zillion possible extensions and applications, which is a good note on which to get back to work.
Posted by crshalizi at October 08, 2006 13:47 |
| permanent link
Paul "arXiv" Ginsparg, evangelizing in the pages of the latest issue of The Journal of Neuroscience, under the rubric "As We May Read":
The e-print arXiv (http://arXiv.org/), initiated in August 1991, has effectively transformed the research communication infrastructure of multiple fields of physics and could play a prominent role in a unified set of global resources for physics, mathematics, and computer science. It has grown to contain >375,000 articles (as of July 2006), with >50,000 new submissions expected in calendar year 2006 and >40,000,000 full-text downloads per year. It is an international project, with dedicated mirror sites in 17 countries and collaborations with United States and foreign professional societies and other international organizations, and it has also provided a crucial lifeline for isolated researchers in developing countries...
The arXiv is entirely scientist driven: articles are deposited by researchers when they choose (either before, simultaneous with, or after peer review), and the articles are immediately available to researchers throughout the world. As a pure dissemination system, it operates at a factor of 100-1000 times lower in cost than a conventionally peer-reviewed system.... This is the real lesson of the move to electronic formats and distribution: not that everything should somehow be free, but that with many of the production tasks automatable or off-loadable to the authors, the editorial costs will then dominate the costs of an unreviewed distribution system by many orders of magnitude. ...
The site has never been a random Usenet newsgroup- or blogspace-like free-for-all. From the outset, arXiv.org relied on a variety of heuristic screening mechanisms ... to ensure insofar as possible that submissions are at least "of refereeable quality." This means that they satisfy the minimal criterion, that they would not be peremptorily rejected by any competent journal editor as nutty, offensive, or otherwise manifestly inappropriate, and they would instead at least in principle be suitable for review. These mechanisms are an important, if not essential, component of why readers find the arXiv site so useful. ...
The arXiv repository functions are flexible enough either to coexist with the preexisting publication system or to help it evolve into something better optimized for researcher needs. Although there are no comprehensive editorial operations administered by the site, the vast majority of the 50,000 new articles per year are nonetheless subject to some form of review, whether by journals, conference organizers, or thesis committees. Physics and astronomy journals have learned to take active advantage of the availability of the materials before journal publication ...
On the one-decade time scale, it is likely that more research communities will join some form of global unified archive system without the current partitioning and access restrictions familiar from the paper medium, for the simple reason that it is the best way to communicate knowledge and hence to create new knowledge. Ironically, it is also possible that the technology of the 21st century will allow the traditional players from a century ago, namely the professional societies and institutional libraries, to return to their dominant role in support of the research enterprise.
Ginsparg's title is a riff on Vannevar Bush's "As We May Think"; hopefully it will not take 48 years for these suggestions to be widely implemented.
Posted by crshalizi at October 02, 2006 17:53 |
| permanent link
By now, everyone and her brother has read, or at least read about, the papers by Albert-László Barabási and co., purporting to show that response times in e-mail, and in Darwin and Einstein's correspondence, follow a power law distribution, and that this is due to queuing processes.
Unfortunately, this is not true; the apparent power law is merely an artifact of a bad analysis of the data, which which is immensely better described by a log-normal distribution. (Via Aaron Clauset.)
As every school-child knows (at least, these school-children do!), adding together many independent random variables, each of which makes a small contribution to the over-all result, generally gives you a Gaussian or normal distribution (unless the contributing variables are, themselves, kind of pathological). This fact is the central limit theorem.
What happens if the inputs are multiplied together, rather than added? Well, take the logarithm: log(XY) = log(X) + log(Y). The logarithm of the product will be the sum of the logarithms of the inputs. The latter will still be independent, so the logarithm of the output will be normally distributed. Undoing the log gives what's imaginative called the log-normal distribution. Log-normals are very common, for the same reasons that normals are. Unlike normals, they are very easy to mistake for power law distributions, especially if your knowledge of statistics is as limited as most theoretical physicists'. (The distribution of links to weblogs, for instance, is much better fit by a log-normal than a power law, as we've seen.) In their comment, Stoffer et al. show that a log-normal actually gives a textbook-quality fit to Barabási's data. (The only change I'd make to their procedure is that I'd report the likelihood ratio directly, and let people work out their own Bayesian posteriors if so inclined.) Looking at the data reported in the new Nature paper on Darwin's and Einstein's correspondence, if it's not log-normal too — well, I'd say I'd eat my hat, but I don't own one; I'll buy a Notre Dame hat and eat it.
Let me turn the microphone over to Francis Galton (as quoted in Ian Hacking's The Taming of Chance):
I know of scarcely anything so apt to impress the imagination as the wonderful form of cosmic order expressed by `the law of error.' A savage, if could understand it, would worship it as a god. It reigns with severity in complete self-effacement amidst the wildest confusion. The huger the mob and the greater the anarchy the more perfect its sway. Let a large sample of chaotic elements be taken and marshalled in order of their magnitudes, and then, however wildly irregular they appeared, an unexpected and most beautiful form of regularity proves to have been present all along.As Hacking notes, on further consideration Galton was even more impressed by the central limit theorem, and accordingly replaced the sentence about savages with "The law would have been personified by the Greeks and deified, if they had known of it." Whether deified by Hellenes or savages, however, the CLT has a message for those doing data analysis, and the message is:
Thou shalt have no other distribution before me, for I am a jealous limit theorem.
I restrain myself from making any observations on the editorial process at Nature, or on the competence of the referees of Barabási's papers. I do wish it to be noted, however, that this post is not an entry in the "Why Oh Why Can't Physicists Learn Better Probability and Statistics?" series, as Amaral and Barabási are both associated with Gene Stanley's school of statistical physics.
Update, Halloween: Suresh Venkatasubramanian, at Geomblog, turns his microphone over to Michael Mitzenmacher, who has some very good comments. (This led me to read Mitzenmacher's nice paper on generating mechanisms for power-laws.) I am more convinced by Mitzenmacher by the difference in the goodness of fits, simply because it is so overwhelmingly large. It hardly seems to make sense, in this case, to say that the data are even approximately power-law distributed...
Update, 23 November: Barabási's group has posted a reply (physics/0511186). To my eyes, the crucial observation by Stouffer et al. was that the fit of the data to a power law is in fact really, really bad, so it's pointless to talk about what mechanism might produce a power law in such situations. The reply's take on this point is that this is "merely" a statistical issue! In short, I don't find the reply at all convincing on the major points, but if you care, by all means read it. (The reply claims that Stouffer et al.'s comment was rejected by "three referees" at Nature; one wonders if they were the referees who approved Barabási's original paper.)
Update, 25 November: To hammer the point home, let's look at Figure 1b from Stouffer et al.'s comment. (Click for a larger version.)

Update, 29 November: Yet more commentary, from Aaron Clauset.
Update, 29 September 2006: In the event you still care about this, see G. Grinstein and R. Linsker, "Biased Diffusion and Universality in Model Queues", Physical Review Letters (2006): 130201. Grinstein and Linsker analytically solve for the asymptotic distribution of Barabási's queueing model, finding either a power law or a power-law with an exponential cut-off; they also show that the result is very sensitive to introducing a cost for switching between different kinds of tasks.
Manual trackback: In Search of 42; Pharyngula; hakank.blogg; Juan de Mairena [v.2.718]; Three Quarks Daily; Metamerist; Zoltán Sylvester; Language Log; Statistical Modeling, Causal Inference, and Social Science
Posted by crshalizi at September 29, 2006 19:06 |
| permanent link
Well, it looks like I have some VIGRE-funded undergraduate research assistants for the semester; specifically, I have takers both for my idea about aggregated network data, and neutral cultural diffusion on assortative networks. The latter, however, leaves me with a puzzle.
The project makes sense (to the extent that it does) against a background assumption that culture is not neutral, but an adaptation to the physical or social environment. This is a pretty pervasive assumption on the part of social scientists, historians, ethnographers, cultural critics, etc., but I'm not sure that my students will have been exposed to this idea as such. So, I need background reading which will rapidly persuade a wholesome, technically-inclined Carnegie Mellon undergrad that a person's beliefs and values ought to be correlated with their status in society. "Rapidly" is important, because there is some technical research to get to, and recapitulating the whole history of the sociology of belief is not an option. Similarly, assigning The German Ideology and/or The Protestant Ethic and the Spirit of Capitalism seems like asking for trouble. (And, with Weber, I'd feel like I'd have to spend a lot of time unteaching the errors.) So, I invoke the collective wisdom of the Web for reading suggestions: please write me at cshalizi [at] cmu [dot] edu [dot] oryx, deleting the name of a genus of antelope.
Posted by crshalizi at September 21, 2006 08:43 |
| permanent link
Via Language Log, there comes this fun post on the distribution of the number of repetitions of "R" in strings of the form "AR+", as in, "Arrr, mateys!" These findings, like Mark Liberman's on "AW+", are in line with the results of the seminal paper in this area, Dennis Chao and Patrik D'haeseleer's "The Distribution of Variable-length Phatic Interjectives on the World Wide Web" (University of New Mexico Computer Science Department Tech Report TR-CS-2001-23). I eagerly await further results in this exciting pico-field.
Being what I am, however, I can't resist pointing out that looking for a straight line on a log-log plot, and even finding one with high r-squared, is simply not a reliable way of checking whether a distribution is a power-law. Please do not do this. (And yes, I should be finishing that paper on the right approach, rather than blogging.)
Posted by crshalizi at September 20, 2006 16:21 |
| permanent link
My copy of Michael Bérubé's What's Liberal About the Liberal Arts? arrived in the mail yesterday (thanks to the unionized workers at Powell's), only to coincide with the release of the graphic novel (4Mb PDF). As a left-liberal professor, I can proudly testify to the complete and utter accuracy of this document's depiction of our revolutionary praxis. (At least, when we are not teaching fluffy courses about enchiladas.) My students should take notes, because this will be on the exam, particularly the bits about camels.
However, What's Liberal About the Liberal Arts? The Graphic Novel shows me that I must also engage in self-criticism. I have been complacent about enforcing political correctness, behaving as though statistics and machine learning could be taught in a neutral manner, when in reality every classroom is a site of conflict. In particular, I have been insufficiently vigorous in struggling against my students' tendencies to embrace borgeois individualism, in the form of the theory of merely personal, subjective and idealist probability associated with a clerical ideologue of British imperialism. I see that I must double and re-double my efforts to indoctrinate them in the principles of frequentist probability, based as it is on collectives and the work of social democrats. Might I be permitted to hope that comrade Gonick (also cited by comrade-professor Burke) will produe a proletarian, graphic version of the great red book epitomizing Peircism-Popperism-Neyman Pearson Thought?
(Via Pharyngula, Crooked Timber and John Burke [in e-mail] more or less at once)
Learned Folly; Corrupting the Young; The Progressive Forces; Enigmas of Chance; Afghanistan and Central Asia
Posted by crshalizi at September 20, 2006 13:28 |
| permanent link
Posted by crshalizi at September 19, 2006 13:49 |
| permanent link
In honor of International Talk Like a Pirate Day, I bring you:
Posted by crshalizi at September 19, 2006 12:47 |
| permanent link
These handouts are shamelessly ripped off derivative work,
amplifying and expanding those created
by Tom Minka when he
invented this course. (See his
originals here.) Posted
here in response to a number (> 1) of requests.
Lecture 5 is also a shameless rip-off explication
of Aleks Jakulin's
"Quantifying and Visualizing Attribute Interactions"
(cs.AI/0308002).
Note to students in 36-350: This page will not keep up to date with the handouts, or with other course documents; use Blackboard!
Posted by crshalizi at September 19, 2006 12:28 |
| permanent link
Hey, kid! Interested in winning eternal intellectual glory and entering the glamorous world of scientific research? Interested in $500 for the semester? Are you an undergrad at Carnegie Mellon University? If so, the statistics department has no less than nine possible projects for you. (Some of them are mine; one began as a blog-post, another as a notebook entry.) Apply now!
Posted by crshalizi at September 16, 2006 12:58 |
| permanent link
In honor of the end of the semester and the arrival of spring, here is my first, and so far only, attempt at writing an exam. This was given as a take-home midterm by a friend teaching a statistics-for-people-who-don't-like-math course at a school which, to protect the innocent, I'll call the University of Winnemac. (It's a long story.) I'm fond of it, but the students at Winnemac hated it, and apparently none of them got any of the jokes. Solutions are available upon request. (Note that Problem 5 is a simplified version of the "Carnival Booth" algorithm due to Samidh Chakrabarti and Aaron Strauss.)
Problems are longer and harder than exercises; they also count for twice as much. Some questions are in multiple choice format, but you should always show your work.
A designer measures the height of a hundred models, randomly chosen from the runways in Milan. The sample mean height is 5.85 feet, with a sample
standard deviation of 0.15 feet.
(a) What is the 95% confidence interval for the mean height, in feet, of
Milanese models?
(b) What is the standard deviation of their height in inches?
(c) What is the confidence interval for their height in inches?
(d) What is the confidence interval (in feet or inches) for their height while
wearing three-inch platform shoes?
(e) The standard deviation of their height in platforms?
Every week the market price for frozen concentrated orange juice (FCOJ) futures either goes up or down, with equal probability. A market analyst obtains a list of 1024 FCOJ traders. At the beginning of the year he sends them a letter announcing a free trial period of his new market prediction service; half the letters say the market will rise that week, and half that it will fall. The next week he discards the names of the traders to whom he made the wrong prediction, and repeats the process. Thus after k weeks, the remaining names on the list have received k correct predictions in a row for free.
(a) What is the probability that any given trader is still on the list after
seven weeks?
(b) How many names are still on the list after seven weeks?
What are the probabilities of getting the following sequences of heads and tails from 30 consecutive tosses of a fair coin?
Mutual fund annual rates of return are normally distributed. What fraction of funds will have returns between one and three standard deviations above the mean in a given year?
A survey of cats in J. Random College Town finds their weight is normally distributed, with a mean of 9 pounds and a standard deviation of 1.5 pounds. The same survey finds that the weight of cat owners is also normally distributed, with a mean of 150 pounds and a standard deviation of 15 pounds. Describe the distribution of the weight of cat-owners holding their cats, assuming feline and human weights are independent random variables.
In Exercises 6--8, A, B and C are three events. P(A) = 0.75, P(B) = 0.65 and P(C) = 0.40. Note: Drawing Venn diagrams is not required to solve these problems, but it may help.
(a) What is the smallest possible value of P(A and B)?
(b) The largest possible value?
Which of the following statements could be true?
(a) If C is (A and B), what is P(B or A)?
(b) If C = (A and B), what is P(B|A)?
In his book I Am Sickened by Your Ignorance, the critic Orpheus Bruno declared that "no more than one poem in ten thousand is truly great; the rest might as well be shopping lists". Bruno was subsequently abducted by renegade experimental psychologists and made to rate a large number of randomly-selected poems from 0 to 100, and also to say which ones were "truly great". He, of course, ignored the 0--100 scale entirely, preferring a boundless scale to mirror his boundless magnificence. His ratings, in fact, were normally distributed (or, as he said, "followed the law of the immortal Gauss"): the mean rating was 45, with a standard deviation of 16, and poems which scored 93 or above were "truly great". What is the proportion of truly great poetry?
Professor Sheila Nagig of the Miskatonic University Department of Statistics
refuses to give tests to her students, saying that most students who get high
scores are just lucky, not knowledgeable, so the test isn't informative. Pressed by the Dean to explain herself, she argues as follows. Consider a test with 100 yes-or-no questions. A student's degree of knowledge of the subject (say, finite-temperature canonical quantum gravity) can be measured by the probability p of their answering a given question correctly. Assuming the questions are independent (which is the case on Prof. Nagig's tests), a student's score is therefore a binomial random variable, B(p, 100). Normally a passing score is 70% correct, or 70 questions out of 100. Note that someone who knows nothing and guesses completely at random has p = 0.5. (In the following, you may use the normal approximation if you wish.)
(a) What is the probability of a student passing if their p = 0.5?
(b) What is the probability of a student passing if their p = 0.7?
(c) Assume that one American in a million has a finite-temperature canonical quantum gravity p of 0.7, and the rest have p = 0.5, i.e., they know nothing about it. What is the probability that a random American who passes a test in the subject knows nothing about it?
(d) Explain what is wrong with Prof. Nagig's argument.
Glenn and Glenda Martingale have a successful angora sweater business, and, in a fit of vertical integration, buy an angora goat farm. As you know, the most important trait of an angora goat is its fuzziness, measured in hairs per square millimeter. The Martingales, being statistically sophisticated, determine the fuzziness of their goats as follows. For each goat in the herd, fuzziness is measured at a random spot on its body, and then averaged across all goats in the herd. These are their results.
| Sample mean fuzziness | 10.30 |
| Sample standard deviation | 1.21 |
| Low end of 95% confidence interval | 9.99 |
| High end of 95% confidence interval | 10.61 |
Assume, like the Martingales, that fuzziness is normally distributed. Calculate the number of goats in the herd, i.e., the number of samples. Round to the nearest goat. You may assume that there are a lot of goats.
Extra credit. Can you re-do the calculation, without assuming the number of samples is large?
The International Group of Angora Fanciers (IGAF) stipulates that only wool
from goats whose fuzziness is at least 9.09 can be used to make angora sweaters. Assume that the fuzziness is normally distributed with mean 10.30 and standard deviation 1.21 (as in the previous problem).
(a) What is the probability that a random goat on the farm is fuzzy enough for IGAF?
(b) What is the probability that at least 25 out of a random group of 30 goats meets the IGAF standard? Calculate this exactly, using the binomial distribution.
(c) Repeat the previous calculation using the normal approximation.
In his book Alchemical Management: Getting the Lead Out and the Gold In, Alex Cagliostro, the famous business consultant, profiles ten companies that achieved excellence after adopting his system of alchemical management (seminars available through appointment with Cagliostro Consulting PLC). The rival consultancy of Hooke, Waterhouse, Comstock and Root points out that there were seventy-two other companies which tried alchemical management without achieving excellence.
(a) Assuming these 82 firms form a representative sample (aside: is that reasonable?), calculate a 95% confidence interval for the proportion of
excellent firms among alchemically-managed companies.
(b) It is known that 12% of all firms achieve excellence. Test HWCR's claim that excellence is no more common among alchemically-managed firms than among non-alchemical companies. State the null and alternative hypotheses. Is this a one-sided or two-sided test? Calculate the p-value.
Airport security cannot give a detailed screening to everybody trying to fly by plane, so they select a fraction for detailed screening. Suppose there are
four kinds of passengers: innocent-looking law-abiding citizens, suspicious-looking law-abiding citizens, suspicious-looking terrorists and innocent-looking terrorists. Security officials decide to screen all suspicious-looking passengers, and a random 2% of all innocent-looking people just to be safe. 10% of the total population is suspicious-looking, but 80% of all terrorists are.
(a) What is the probability that no one in a group of four random terrorists
will be screened the next time they fly?
(b) Say a terrorist has evaded scrutiny if they have taken five flights without being screened once. What is the probability that a terrorist is innocent-looking, given that he has evaded scrutiny?
(c) Supposing one has a group of four terrorists who have all evaded scrutiny, what is the probability that none of them will be screened the next time they fly?
(d) Repeat the calculation in (c), supposing that airport security ignores who looks suspicious or innocent, and screens 12% of all passengers completely at random.
Posted by crshalizi at September 11, 2006 12:00 |
| permanent link

The atmosphere just outside Beijing is hot, humid, and almost unbelievably hazy. The atmosphere of the complex systems summer school where I'm teaching this week is, thankfully, rather different...
Posted by crshalizi at September 11, 2006 12:00 |
| permanent link
Attention conservation notice: An appeal to the reader's knowledge of textbooks on stochastic processes; also a plea not to be thrown into the briar-patch.
In the spring, I'm going to be teaching the department's advanced course on stochastic processes (36-754, for those keeping track at home). The catalogue description of the course reads, in full, as follows:
This course introduces advanced topics in Probability Theory such as Brownian motion, Markov processes, stationary processes, stochastic integration, etc.It's intended for students from math or statistics who've had a first course in measure-theoretic probability, such as our 36-752, which goes up through the laws of large numbers for independent variables, a little martingale theory, and the central limit theorem. Most if not all of them will have already had a course on stochastic processes at the level of Grimmett and Stirzaker. My plan is to take advantage of the "etc." in the description, and teach a course on
I am looking for a textbook which covers all of this, or at least most of it; I'd be willing to change the material to match a good text. The students currently in 752 are using Ash and Doleans-Dade, which is good, and the last two chapters (which they won't get to) introduce a little ergodic theory and a little stochastic calculus, respectively, but not in enough depth. No one book I know seems to fit, and making them buy more than one expensive book doesn't seem right. If you have any suggestions, please mail them to me at cshalizi [at] oryx [dot] cmu [dot] edu (removing the name of a genus of antelope, which is there only to confuse spammers). I am going to have to spend a lot of time on my lecture notes; I really don't want that to have to grow into, in effect, writing my own book.
Update, next day: Thanks to Bill Tozier, Anand Sarwate and Wolfgang Beirl for writing with suggestions. Wolfgang, in particular, pointed me to Alexandre Stefanov's useful collection of online probability texts and notes (part of a bigger collection of mathematics resources). One of these, Robert Gray's Probability, Random Processes and Ergodic Properties, is something I was already planning to mine, along with his Entropy and Information Theory.
Update, Halloween: We will be using Olav Kallenberg's Foundations of Modern Probability as a reference, with the primary text being my lecture notes.
Manual trackback: Nothing Funny About Feldspar
Posted by crshalizi at September 11, 2006 12:00 |
| permanent link
General course information (includes gratuitous eye-candy), syllabus, detailed course outline. To ward off the evil eye, I should say that I do not expect to get through all of what's in the outline. (Suggestions about the outline are still most welcome.) Lecture notes will be posted on the course website as I write them. Thanks, again, to everyone who helped me pick a text.
Update, January 2006: See downstream for links to the lecture notes.
Posted by crshalizi at September 11, 2006 12:00 |
| permanent link
Two hundred and fifty pages later, it's the last day of classes, and my last lecture in advanced probability is done. (I still need to tex up the one lecture I gave from hand-written notes, but there's no rush.) Looking over the tome, I didn't get to a lot of stuff I wanted to (easily enough for another semester!), it's far from self-contained, and it almost certainly contains mistakes, but I think it's not bad for a first draft. It also took an order of magnitude more of my time than was wise. I'll be teaching the class again in the spring, and hope I'll have the strength of will to leave it alone, rather than succumbing to the temptation to do something.
Posting about science and outrages against common decency will recommence shortly.
Posted by crshalizi at September 11, 2006 12:00 |
| permanent link
In his writings on the theory of automata, John von Neumann defines a complex system as one where the best model is, in fact, the system itself, and specifically mentions cats as an example of such complexity (thereby demonstrating not merely sagacity but taste). Since then, I am sorry to say, there has been a sad lack of interest in feline behavior within complex systems theory; but no longer. Going to the pet store the other day, I blundered into an aisle of motorized cat toys. Mindful of Abbas Raza's post on 3QD, I actually looked at them; and of course when I came to one that boasted an Amazing Chaos Wand, I had to look at it; and of course I had to buy it when I read the following ad copy on the side of the box (my links):
Professor Nozawa
CHAOTIC TOYS FACTORY, LTD.The Cat Attack uses the latest research in chaos theory and complex systems to emulate the movements and personality of a cat's favorite prey. This "virtual mouse" technology utilizes algorithms based on a six-dimensional coupled nzmap system modeled on the neural network of a real mouse. What that all means is that the Cat Attack's "virtual mouse" will become your cat's new best friend!
Professor Shimada
NIHON UNIVERSITYDr. Nozawa has developed an amazing and practical use for "deterministic chaos". It was Norbert Wiener who pointed out the importance of random noise in the brain and in automatic control systems. However, Dr. Nozawa showed that is was possible to solve the difficult "Traveling Salesman Problem" of optimization faster by using nonlinear dynamics, rather than random noise. The "Cat Attack" toy developed by Dr. Nozawa's company operates using powerful nonlinear dynamics algorithms, and this leading edge of human knowledge is appreciated by, among others, the Nozawa family cats, Tal, Fu and Phi, who seem to think the "Cat Attack" is truly alive.
I have turned this apparatus over to a collaborator who specializes in these issues, and she reports good results while the batteries last.

Posted by crshalizi at September 09, 2006 22:59 |
| permanent link
Since everyone is going on about Leszek Kolakowski (starting with Tony Judt, and thence Jonathan Goodwin, 3QD, DeLong) or, separately, Althusser (Bérubé, McLemee), I bring you: Kolakowski demolishing Althusser. Read it and weep, or laugh, as hailed. And do read Main Currents of Marxism, now that it's out in one volume ("so handy for pocket or purse", in the words of a certain rootless cosmopolite).
This completes your moment of Marxist harmonic convergence.
Posted by crshalizi at September 08, 2006 11:24 |
| permanent link
This looks useful (via Fresh Tracks).
This is only too accurate.
Posted by crshalizi at September 07, 2006 09:13 |
| permanent link
Posted by crshalizi at September 07, 2006 09:13 |
| permanent link
Constant readers may recall
the advert for a workshop on the
evolution of complexity that ran here in January. Because of the way that
workshop went, the call for papers will be contined until morale
improves has been renewed, with all papers to appear in a special
issue of Artificial Life. I reprint the CFP below. Notice that
this time, the length limits are the ordinary ones of the journal (2000 words
for letters, 12000 words for articles), not the harder limits of the workshop.
This should be good.
Special Issue on the Evolution of Complexity
Artificial Life journal
Call for Papers
Guest Editors:
Carlos Gershenson
Centrum Leo Apostel, Vrije Universiteit Brussel
Krijgskundestraat 33. B-1160, Brussels, Belgium
Tom Lenaerts
SWITCH, Flanders Interuniversity Institute for Biotechnology
Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
Motivation
As a result of the quality of the Evolution of Complexity workshop at ALife X last June in Bloomington and the interest of the attendants; we announce a call for papers for a special issue on this theme for the Artificial Life journal.
The evolution of complexity is a central theme in Biology. Yet it is not without ambiguity. Complexity has been used to refer to different things. For instance, complexification has been interpreted as a process of diversification between evolving units or as a scaling process that is related to the idea of transitions between different levels of complexity. Other meanings of complexity have been introduced, both inside and outside of Biology. In most cases, though, the central concern is to understand what produces complexity.
The focus of this special issue will be on biological interpretations of complexity and on evolutionary and related dynamics as driving mechanisms for producing complexity. Questions to be addressed in the special issue include:
- How could complexity growth be measured or operationalised in natural and artificial living systems?
- How can existing data from nature be brought to bear on the study of this issue?
- What are the main hypotheses about complexity growth that can actually be tested today?
- Are the principles of natural selection as they are currently understood sufficient to explain the evolution of complexity in living systems?
- What are the environmental and other constraints of the evolution of complexity in living systems?
- What is the role of developmental mechanisms in the evolution of complexity in living systems?
- What are conditions could reduce evolved complexity in living systems?
- How factors allow the evolution of complexity in living systems to be manipulated and controlled?
- What models are most appropriate for understanding the evolution of complexity in living systems?
Paper Submission:
Submitted articles and letters should follow the submission guidelines of the Artificial Life Journal, available at http://mitpress.mit.edu/ALIFE. Authors should also include a cover letter describing briefly the relevance of their article to the specific topic of this call.These articles and letters should NOT be submitted to the journal editor, but should be uploaded through the special issue website (single PDF files only, include cover letter as the first page of the paper).
Papers will be judged by members of the Program Committee on their relevance to the call for papers, originality, clarity of the presentation, and overall quality.
Important Dates:
Submission deadline: December 15th, 2006
Notification of acceptance: February 1st, 2007
Camera-ready papers due: March 1st, 2007Programme Committee:
Chris Adami
Lee Altenberg
Mark Bedau
Hugues Bersini
John Bonner
Dominique Chu
Jim Crutchfield
Bruce Edmonds
Carlos Gershenson
Mario Giacobini
Franics Heylighen
Tom Lenaerts
Juan Julián Merelo
Barry McMullin
Chrystopher Nehaniv
Charles Ofria
Jorge Pacheco
Tom Ray
Jon Rowe
Stanley Salthe
Cosma Shalizi
Richard Watson
Larry Yeager
Posted by crshalizi at August 25, 2006 07:40 |
| permanent link
Posting will be light as I gear up for teaching (36-350, data mining for undergraduates, based on this excellent course template by Tom Minka, and Principles of Data Mining by Hand, Mannila and Smyth), and try to finish some papers. I'd advise you to go outside and enjoy what's left of the summer, but since you're reading this, you're not the type of person to do that. Go read Three Quarks Daily instead: they've pre-empted my idle notions of writing posts on deep states and Jack Chick, and they have a better sense of humor and much better taste in art than I do.
The regularly-scheduled cursing of R, LaTeX, time, and an inadequate supply of ibuprofen will now recommence.
Posted by crshalizi at August 23, 2006 21:37 |
| permanent link
This is a joke that never grows old — just more bitter. (The latter via everyone in the known universe; provoked into posting by John Burke in e-mail.)
Posted by crshalizi at August 22, 2006 15:17 |
| permanent link
One of my friends in graduate school had an adviser with a gift for memorable expressions, at least for a theoretical physicist, especially when dismayed by some stupidity. Two of his put-downs which stuck with me were "I could go crazy tomorrow and find an appointment in the sociology department", and "I don't want to criticize you, but this is the way superstring people think". I was never sure which was supposed to be worse, but now I know. Sociologists have many faults, but they do know better than to try explaining a variable with a constant, while string theorists evidently do not. (Via CapitalistImperialistPig, who has better things to write about.)
The fact that Prof. Motl reasons so badly here that he'd fail my freshman stats class is, of course, infinitely less offensive than fact that he's a bigot (of the "we must squarely face the harsh light of my pseudo-scientific prejudices" variety). But I can't help feeling — hoping, even — that the two sorts of idiocy are linked.
Update, next day: Greetings, readers from Reference Frame and Brad DeLong! Just to correct some mis-apprehensions: my Ph.D. is in statistical physics, not sociology; I'm an assistant professor of statistics (not sociology) at Carnegie Mellon (not Michigan, where I was a post-doc); the closest I have ever come to "committing a social science" was drawing a map with some other physicists. If you're actually interested in my qualifications, you can look at my CV, or my research and teaching.
Second Update, 20 August: I'm afraid I was too elliptical above. The occasion for Motl's outpourings was this story in the New York Times on how the fraction of black and Hispanic students at New York's specialized high schools, like the famous Bronx High School of Science, has fallen over the last ten years. Fallen, as in, decreased, as in, changed over time, as in, been variable. To account for this, Motl pointed to the black-white IQ gap, which he proclaims one of the great invariant facts of human life, as in, nothing changes it, as in, constant. Even if one grants him his premise (which I would not), the IQ gap might, with a lot of other assumptions, explain why the number of blacks in these schools is low, but cannot explain why it has fallen. This is why I said he would fail my freshman stats class. A Marxist who tried to use the conflicting interests of capital and labor to explain the wage stagnation of recent decades would be guilty of exactly the same fallacy, and I'd fail them for the same reason.
I hope I have made myself clear.
Manual trackback: Nanopolitan
Posted by crshalizi at August 22, 2006 09:56 |
| permanent link
A cheerful comparison problem, in honor of the day:
Which analogy is more valid? Which one is more inflammatory?
Posted by crshalizi at August 18, 2006 23:45 |
| permanent link
Reading Billmon and Adele Stan reminds me of a question which has been bugging me for a while, but not so much as to actually investigate it. Namely: how much did traditional anti-Semitic stereotypes contribute to the stereotypes associated nowadays with conservative attacks on the "liberal cultural elite", "transational professional class", etc.? (I'm thinking of ideas like: Jews run the media etc. behind the scenes; Jews are clever but shallow; the bankers are all Jews; the Communists are all Jews; the Jews want to do away with our wholesome institutions and religion.) It would seem like a natural translation for someone to have made, but I don't know of any evidence that it did happen that way. Maybe, after all, there are only so many ways of disliking other groups that any pair of negative stereotypes is going to have a lot in common, if you look for it. (Of course, people who harbor or play to such stereotypes about liberals are not necessarily anti-Semites, even if those stereotypes historically developed out of anti-Semitic ones.)
Has anyone with some actual knowledge looked into this?
Update 17 August 2006: Edward Burns writes to point out that before the current fabricated outrage over the "war on Christmas", it was being pushed as a UN plot by the John Birch Society in the 1950s, and before that Henry Ford was warning that the Jews were trying to get rid of Christmas and Easter. (See this good story by Michelle Goldberg in Salon.) It's not clear, however, that there was any actual transmission of ideas from one episode to the next; if I were a historian I'd think it would be worth looking into, though.
Posted by crshalizi at August 17, 2006 15:52 |
| permanent link
Starting about a year ago, I have refused to referee papers for journals owned by Elsevier, since it sticks in my craw to provide free labor for people who turn around and gouge the academic community mercilessly. This reasoning applies, to some degree, to all commercial journal publishers, though Elsevier is unusually exploitative in its pricing. There is however a more substantial reason to dislike them: their — forgive the phrase — mercenary involvement in the international arms trade. Tom Stafford, who blogs at Idiolect, is organizing a petition of academics to try to get Elsevier to stop organizing arms fairs; it's worth signing.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Cross-posted to Crooked Timber.
Now this is what I call "filling the mind with ever new and increasing admiration and awe, the more often and steadily we reflect upon them".
(Via David R. in e-mail.)
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Almost two years ago, I complained that there was no English e-text of Zadig, Voltaire's oriental Enlightenment detective fable. At the time, I thought this was my usual pointless whining into the void; but no. Through the good efforts of Barbara Tozier, and the rest of the people associated with Project Gutenberg and Distributed Proofreading, one of the most charming of M. Voltaire's literary productions is now available for free to the English-reading web, in both plain-text and handsome HTML versions.
You realize this means I'm only going to whine here more.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Cross-posted to Crooked Timber.
Kieran Healy's post about his book on organ donation gives me a hook to write something about the other end of the system, about organ recipients and the institutions which are supposed to match them up with donated organs. More specifically, how one such institution, the Kaiser HMO of Northern California, quite spectacularly failed several thousand people who were depending on them, by not matching them up. The story has been around since early May, when it was broken by Charles Ornstein and Tracy Weber in the Los Angeles Times (cached here), since confirmed by an investigation by Medicare/Medicaid. It doesn't seem to have gotten all that much attention among the blogs, but it's outrageous, and deserves, for that reason alone, to be better known.
Kaiser is a very large and old HMO, with a huge presence in the Bay Area and northern California. (In fact it was one of the very first HMOs, started by Kaiser Steel during WWII as a way of attracting workers to its foundries and shipyards when wages were frozen.) It actually consists of non-profit health plans and hospitals, and for-profit physicians' partnerships, the Permanente Medical Groups; normally, patients in the health plans have to go to physicians in the groups to have their medical expenses covered. One notable exception, up until 2004, was the 1500--2000 Kaiser patients in northern California who needed kidney transplants because of renal failure: those procedures were sub-contracted to the hospitals at UCSF and UC Davis, which also managed the patients' places on the waiting list.
That last bit requires some explanation. Once organs are donated, they need to be matched up to recipients. This is done by an organization called the United Network for Organ Sharing, which tries to trade off urgency, seniority (i.e., time spent waiting for an organ), proximity and compatibility (since the closer the match between the donor's immune system and the recipient's, the less problems from rejection). There is a heavy weight put on seniority, though especially close immunological matches can over-ride it. Each transplant center is responsible for keeping the network up-to-date about their patients who need organs, their immunological profiles, and their time spent on the waiting list.
What seems to have happened is that in 2002, a transplant surgeon named Arturo Martinez proposed to Kaiser that it could save money, and increase the utilization of its hospitals' surgical capacity, by bringing the kidney transplant program in-house, and Kaiser agreed, with Martinez becoming head transplant surgeon. (It would be unfair, at this point, to say that Kaiser did this because it meant more business for the for-profit Permanente Medical Group, but it's hard to imagine that counted against the proposal.) As of mid-2004, Kaiser patients on the waiting list were informed that they would no longer be covered for transplants at UCSF or UC Davis, though they were free to go ahead and have them if they could come up with the money (roughly $100,000).
So far, all this is maybe a little self-serving on Kaiser's part, but not, in itself, appalling. (It's certainly more than legitimate for health-care organizations to try to save money.) What happened, though, was that Kaiser completely screwed up the program. Remember that organs are allocated (basically) through the UNOS system. The patients were being removed from the listings under the university hospitals, and being added to the listings under the new Kaiser transplant program. Unless this was done correctly, this would mean that they'd look like new names on the list, and so all of their accumulated waiting time, one of the main determinants of priority, would vanish. This happened to a huge number of people on the list, basically reducing the chance that they'd get a kidney to next to nothing. This becomes less surprising when one learns that Kaiser never consulted UNOS about the massive transfer of patients it was planning, and "placed responsibility for submitting patient data ... in the hands of a single clerk who had one hour of telephone training on UNOS's database", though not any more excusable. Needless to say, patients were not told that by staying with Kaiser, they were losing their place on the lists, and thereby reducing their odds of survival. Some of them, at least, seem to have been assured that they were keeping their places, when that wasn't true, though this is less clear to me.
Losing seniority on the transplant lists wasn't the only problem. Kaiser did very few transplants, compared to the number of organs which were available. This happened in part because they just didn't have the capacity to keep up with their many patients (at one point they were down to a single nephrologist for the whole program, who was also supposed to be its medical overseer), and in part because of what seems to have been mis-placed perfectionism or caution. These combined to the point of repeatedly turning down "zero mismatch" kidneys, ones where the likely compatibility over-rode considerations of seniority. This happened several dozen times at least — twice for one patient alone. Again, needless to say, patients weren't told about this. In a "it's not a bug, it's a feature" moment, Kaiser initially attempted to defend its program by pointing out how few patients had died after transplants — since they'd done so few.
What strikes me as especially outrageous about all this is that the people being screwed over were people who needed new kidneys. To state the obvious, anyone who needs an organ transplant is very ill. It's maybe less obvious that being that ill is a full-time job. One of the vital parts of the body is no longer working; to substitute for it requires extraordinarily complicated, time-consuming and generally unpleasant procedures. People who need new kidneys are people who are kept alive by dialysis, which is, indeed, complicated, time-consuming, often painful, almost always exhausting, and carries a non-trivial risk of infections, possibly fatal. People who need new kidneys are also often people who are very ill in other ways, since it's not that common for both your kidneys to just stop working if nothing else is going on. (Kidney problems are, for instance, a not-uncommon complication of diabetes, and of high blood pressure. Dialysis, naturally, messes with blood pressure, adding yet another variable to monitor and regulate.) Simply staying alive, when you are multiply-sick person with organ failure, can pretty much demand all the time and attention you have to give, and a fair chunk of your loved ones' as well. (There are good reasons why the families of people in situations like this tend to fall ill themselves.) You are certainly not in a position to check up on whether your medical organization has, through incompetence, messed up your position on the transplant lists. (Some Kaiser patients actually tried to keep up with their place on the lists, but were given the run-around.) And as for switching to another medical organization, do please show me the company which will extend coverage to someone who needs a new kidney, at a price which can be afforded by someone who needs a new kidney.
Since the Times broke the story, there's been some improvement. The doctor who was medical head of the program, and apparently at least partly responsible for snafu of not transferring patients' time, Sharon Inokuchi, has been relieved of her administrative duties. (In fact, if memory serves, she left Kaiser, but now I can't find where I think i read that.) The program has been investigated by the Center for Medicare and Medicaid Services, which basically confirmed the newspaper reports, and forced it to promise major changes; it could still lose its eligibility for funding under those programs. The California state agency which regulates managed care is still, I believe, investigating. There's talk of large fines, and there will certainly be lawsuits. All of this is to the good; it's certainly better than nothing. But still, thinking about this makes me angry: Kaiser had a duty towards many very sick people, who were in a very poor position to look after themselves. It failed in that duty quite dramatically. In any organ transplant program, patients will die while waiting for a match. In most kidney transplant programs, though, about twice as many patients receive transplants as die while waiting; Kaiser managed to reverse that ratio. While it's hard, in the nature of things, to identify any one patient who's died and say "They would have lived, if only Kaiser hadn't done this", it's almost certain that more of these people have died than would have otherwise. I don't have a better remedy to propose than fines or lawsuits or institutional tinkering, but they all seem horribly inadequate.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Cross-posted to Crooked Timber
Via everyone in the profession: the statistician Frederick Mosteller has died. Mosteller was one of the great leaders of the generation of statisticians in which our field went from being an annex of mathematics (as it was when he attended Carnegie Tech) to an autonomous, institutionalized discipline. He had an astonishing range as a researcher, but is perhaps best known for his work on stochastic theories of learning theory and the authorship of the Federalist Papers. He was also a notable teacher, as his essay "Classroom and Platform Performance" suggests, and in the later part of his career tried to bring elementary inferential hygenie to educational research. More anecdotes are available from Tales of the Statisticians, or this brief sketch by his student Stephen Fienberg.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Cross-posted to Crooked TimberAs a good neoclassical, neoliberal economist, Brad DeLong is acutely aware that the market system is not natural at all, but a delicate historical anomaly. He is worried that it is so familiar to his students that they will find alternate modes of social organization almost incredible; accordingly he wants to mess with their heads:
Would making Berkeley's first-year economics Ph.D. graduate students this fall read short biographies of William Gates and William Marshall as a way of getting at the idea that there are non-market societies that work very differently from our own today--would that be a teaching idea of extraordinary brilliance or of total insane lunacy?The rest of the post is an extended excerpt from the New York Review of Books review of a biography of William Marshal (which goes on to my to-read list). The question I have is, what should DeLong make his students read, to give them a vivid sense of just how differently production and distribution could be and have been organized? Argonauts of the Western Pacific, perhaps? Gilgamesh?
And: those of us who teach things other than economics, what books do or should we hand out as ice-axes for our students' frozen seas? (This one is mine.)
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Cross-posted to Crooked Timber.
It's a hot, lazy Sunday, which seems like a good time for browsing through livejournal communities dedicated to photos of peacefully rusting machines, quietly crumbling buildings, and similar modern ruins:
The photographers are all amateurs, so the quality (to the slight extent I can judge) is quite variable, but many manage to capture the suggestion of sunset and sadness, of unhappy stories brought to a close, which fascinates me about such scenes. Some of these photos, in fact, seem as good as, say, those in Terry Evans's book on the former Joliet Arsenal, Disarming the Prairie, bringing to mind the words of the poet:
These are the halls of the dead, where the spiders spin and the great circuits fall quiet, one by one.— But I see I'm getting melodramatic, and it's just too hot and sticky and still to sustain that.
Update: John Burke (who needs to revive his blog) writes to point to Jef Poskanzer's great industrial archaeology page, with many fine pictures and links.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Attention Conservation Notice: Over 1500 words on a wacky quasi-socialist economic scheme, from someone utterly lacking in credentials in economics. The scheme does not respect the sanctity of private enterprise, but at the same time would not reduce the alienation of labor one iota. Includes a lengthy quotation of a game-theoretic impossibility result. Also, you already saw it when it was cross-posted at Crooked Timber.
In the previous installment in this series of modest proposals, I looked at ways of making the incentives of the managers of large, publicly-held corporations align more closely with those of their long-term shareholders. This left alone the question of the beneficiaries of corporate value; assuming that the managers are busily working to maximizing their revenue streams, who benefits from their industry and diligence? Having just read Mark Greif's great essay on redistribution in n+1 (issue 4; sadly, not online), I would like to make a suggestion.
The text for today is Gary Miller's Managerial Dilemmas: The Political Economy of Hierarchy, an excellent book which I learned about from Henry Farrell. Ambitiously, Miller tries to explain why hierarchical corporations exist at all, why they take some of the forms they do, and how, in part, their form relates to their performance. Much of the book, especially the first part, is a partially-successful attempt to find good economic reasons for their features, i.e., efficiency-enhancing ones. (He does not seriously consider the option that enterprises are hierarchical for non-economic reasons, say that some people like bossing others around, which hierarchies let them do, and those people are able to select hierarchies over other, more efficient, forms. After all, it's hardly historically unprecedented for powerful people to prefer institutions which lower aggregate output but give them a bigger share of the product. See, e.g., here.) He also tries to explain why theories of corporate organization that rely solely on economic "mechanism design", i.e., structuring information and material incentives, will actually lead to sub-optimal results, for pretty basic game-theoretic reasons; getting beyond these impasses is fundamentally a political problem. This is potentially quite subversive in its own way, but it's really the first part of the work, about the economic justification of the hierarchical enterprise, that I'm going to twist and abuse.
One of the features of the modern corporation that Miller attempts to rationalize is the existence of shareholders who are passive and, in the overwhelming majority, utterly disconnected from the day-to-day or even year-to-year operations of the company. He does so by means of the following impossibility theorem, attributed to Bengt Holmstrom. Having tried to summarize Holmstrom's theorem better than Miller, and failed, I'll just quote Miller.
Holmstrom assumes that there are n agents whose actions determine a level of revenue x. The actions taken are unobservable and are costly to each of the agents. In particular, we assume the production function is a team production in which the productivity of each individual's action is determined by other individuals' levels of effort.
Holmstrom points out the desirability of three characteristics of an incentive system — and then shows that they are logically inconsistent. First, Holmstrom examines the Nash equilibrium outcome of an incentive system. At such an equilibrium, each individual will find that he or she could not do better by choosing a different effort level, as long as all others do not change their effort levels. Simple marginal analysis tells us that, in such an equilibrium, each person will find that his or her marginal cost of effort is exactly equal to the marginal gain; otherwise, the individual could be better off by working harder or not as hard. Second, Holmstrom stipulates that the outcome be budget balancing — that is, the incentive system should exactly distribute the revenues generated by the actors among the actors. Third, Holmstrom examines Pareto efficiency. This means that the outcome should be such that the individuals in the organization could not find a different outcome that would make them all better off.
Holmstrom shows that no budget-balancing system can create a Nash equilibrium that is also Pareto efficient. In other words, every budget-balancing incentive system will induce a social dilemma among its participants. The reason is that individuals will bring their own marginal costs of effort into equality with their own marginal gain. This means that each individual will not undertake an additional unit of effort that will produce less individual gain than individual cost — even if that extra unit of effort produces more gain for other individuals on the team.
As an example, suppose there is some individual who has a marginal revenue productivity of $12: Each unit of her own effort generates an extra $12 for the team. According to Pareto optimality, she should exert additional effort as long as the cost to her of that effort is less than or equal to $12; each such unit of effort generates more revenue for the team that it costs her as an individual. The only way to motivate her is to make sure that she gets all of the marginal revenue of her last unit of effort. In a team, it is impossible for this to be the case for every individual, as long as the incentive system is budget balancing. If everyone gets all of the last dollar produced, the team will have to pay out more in incentives than it generates. But if the individual gets only one-third of the marginal revenue from her actions, she will work only as long as her effort costs her less than $4 per unit. [pp. 129--130]
This suggests a rather unusual role for shareholders: they provide a money-sink, someplace money can go other than those actually involved in production. This means that the economic mechanism no longer has to be budget-balancing, which actually makes efficiency possible. Miller suggests that this is one reason why the modern public corporation, with its separation between legal ownership (by stockholders) and day-to-day control (by managers) can work, to the extent that it does. It is precisely because the shareholders are passive, with very limited influence over the actual running of the corporation!
Today's modest proposal — and I should make it very clear that Miller suggests nothing of the kind — is to take this separation of functions even further. Shareholders can use their legal ownership to intervene in the running of the company, though it is hard (and managers try to make it harder). By doing so, however, they become players in the team-production game, and lose their useful role as a money-sink. To limit this danger, while retaining the advantages of competitive markets for capital allocation and corporate control, I suggest the following. A substantial fraction — say three-quarters — of all profits of publicly-held corporations are to be paid to a new institution, which we might call the National Mutual Fund. (Close corporations and partnerships are exempt.) Once a year, the Fund would pay out its accumulated profits as dividend checks, giving an equal amount to every adult citizen. And that's it.
Substantially reducing the flow of dividends associated with stock ownership should cause a large one-time shock to the level of the stock market. (Roughly speaking, shares should drop by about 3/4.) However, because the Fund collects uniformly, it should not distort relative prices, which are what matter for purposes of capital allocation. The net worth of stock-holders, likewise, will suffer a one-time drop, but this will be partially compensated for by their receiving payments from the Fund in the future. Anyway, lots of things affect the value of stock holdings; it's not like someone purchased their labor with a promise of future benefits, and then tried to back out of a freely-entered contract when it came time to pay up.
A further wrinkle would be to curb the practice of retained earnings. These account for a huge fraction of corporate capital formation, but they are also one of the ways in which managements escape market discipline. (For some figures on this, see Henwood's Wall Street, pp. 72--76.) I suppose one could make a Hayekian argument in favor of the practice, but, really, if management can make a good case that a pet project will earn at least a normal rate of return, it shouldn't be hard for them to raise funds on the open capital market, and if they can't make such a case, it's hard to see how they'd be discharging their fiduciary duties to shareholders by pursuing it anyway. This reform, I should add, is logically separate from that of instituting the National Mutual Fund. However, since corporations would pay more out in dividends, it would tend to increase the value of shares, reducing the shock to the level of the stock market.
It is hard to see why the actions of the National Mutual Fund could not be at least as rule-bound and de-politicized as those of a central bank run by skilled technocrats. Indeed, it would seem easier to reduce the discretion of the Fund's officials to the vanishing point, and to strictly keep it from meddling with the affairs of any corporation, which would be deeply counter-productive. For their part, the citizens receiving the dividends would get the benefits of "portfolio diversification in their income", but their incentives to meddle politically with individual firms, even quite large firms, would be quite muted. Moreover, they would have a direct and tangible incentive in the health of the corporate sector as a whole, making them less likely to support market-distorting measures to benefit particular firms, geographical regions or industrial sectors. We would move, in short, towards a true ownership society.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Cross-posted to Crooked TimberIn the comments to my post on Onsager, Maynard Handley explains why he finds himself somewhat unsympathetic, as Onsager apparently did not expend the effort necessary to make himself understood by others.
You, the author of the paper, have a responsibility to make your ideas comprehensible. If the first method you choose to explain them fails, then you listen to what people say about where they lost all track of understanding and write a new paper with NEW explanations, not the same explanations that failed last time only renumbered. ... [This is] not something that is drilled into young scientists; that it is YOUR responsibility to make your ideas clear to others, not their responsibility to try to figure out what you are talking about. As science grows ever larger and ever more complex, I think it is time for all scientists to be much more explicit and much more ruthless on this point.Whether this is really a fair criticism of Onsager, I couldn't say, but the general point is true, important, and a perfect hook for the next thing I wanted to post about.
Science is a social, collaborative process, so part of being a good scientist is effective communication. Scientific communication is overwhelmingly written communication (scientific disciplines are, in a sense, literary communities), so part of being a good scientist is being a good writer. Unfortunately, scientists get little training in writing, and much of that consists of being advised to follow the rules found in horrid little compendia. Fortunately, there is some actual research on effective written communication, that is, on how to arrange your words so that their readers tend to acquire clear notions of your ideas. The best practical guide here, I've found, is Joseph William's Style: Towards Clarity and Grace. However, I have just discovered (via Paradise Blogged) a fine essay by George Gopen and Judith Swan, "The Science of Scientific Writing", which gives a clear yet concise presentation of the work. (Gopen and Williams are collaborators.) Here is their own summary of how to be clear:
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Cross-posted to Crooked Timber.
Two of my more public-spirited fellow citizens have recently identified looming threats to our own Commonwealth of Pennsylvania.
Most scientists unfortunately, those that certainly are advocating for this [embryonic stem cell research], and many others feel very little moral compulsion. It's a utilitarian, materialistic view of doing whatever they can do to pursue their desired goals.
I, for one, will be happier voting on Mr. Santorum's re-election in November, knowing that my ballot will play a part in the age-old struggle between utilitarian materialism and deontological idealism, as well as the sagas of human-canine relations and Old Corruption.
More than a third of all Pennsylvanians are native speakers of a language other than English — and many of them have not even tried to learn English since immigrating, or at least prefer to carry out their daily lives in another language, living together in neighborhoods where their native language dominates. Some people worry that the majority status of English is critically endangered. 25 years ago, a major political figure warned that these "aliens ... will never adopt our language or customs, any more than they can acquire our complexion", and so far, his prediction seems to be right on the money.
And let's not forget what they've done to our cooking!
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Cross-posted to Crooked Timber.
A staple of bad movies and trashy novels about scientists (i.e., the kind I read) is the neglected genius whose ideas are rejected with incomprehension by the scientific establishment during his life, because they are simply Too Far Ahead Of His Time to be grasped by lesser mortals, only for the scientific community to rediscover these insights decades later. This sort of thing can make for entertaining fiction (if dreary self-mythologization), but it simply doesn't happen all that often in real life, especially not when the hero is a part of the establishment, and indeed a much-honored one. It certainly doesn't show up, with documentary evidence, in the staid, reliable pages of Reviews of Modern Physics. Nonetheless:
Nobody outside of statistical physics (and maybe physical chemistry) has heard of Onsager, but he was indeed a great physicist, albeit in a very technical, non-flashy way. By the time he did this work on turbulence, he was already well-known in statistical mechanics for the analytical solution of the Ising model, his theory of phase transitions in liquid crystals, and, perhaps most importantly, a pair of classic papers from 1931 which basically founded modern irreversible thermodynamics, for which he would eventually win the Nobel Prize. (Eyink and Sreenivasan give a fuller discussion of his accomplishments, including the Onsager-Machlup theory of non-equilibrium processes, on which Eyink himself has done important work.) We're definitely not talking about some marginal figure cut off from the scientific community.
Nonetheless, his attempts to get people to pay attention to these ideas on turbulence were singularly unsuccessful. The reaction of Theodore von Kármán, a deservedly great name in fluid mechanics, was to describe it (in a letter to his student C. C. Lin) as "somewhat 'screwy' "; Onsager also corresponded with Lin, who replied in the classic manner of someone wanting to put an end to a conversation (quoted on p. 117): "I am sorry to say that I have not made much progress, except that I desire still more to see something done in this line to bring your ideas down to my level of understanding." As for the statistical physicists, Eyink and Sreenivasan describe their reaction as one of "polite incomprehension" (except on the part of von Neumann — in an unpublished report). The fact that one of Onsager's letters describing his ideas (reproduced as Appendix A in this paper) is headed "The little vortices who wanted to play", and begins "Once upon a time there were n vortices of strengths K1, ... , Kn in a two-dimensional frictionless incompressible fluid" probably didn't help, either. Most of all, a combination of discouragement over this reception, a tendency to be a slow and perfectionist author, and having scads of major research projects going simultaneously kept Onsager from even trying to publish any of this material.
The moral, I hope, is clear: statistical physicists who wander into other areas of science, and find their ideas dismissed by the best experts on those subjects, should nonetheless publish in Physical Review, in a "Fools! I'll show them all!" spirit, provided they are Lars Onsager.
(It's interesting that this paper was written by two physicists active in this area, rather than by a historian of science. It seems doubtful to me that a historian, reading the relevant materials in the Onsager archives, would have realized that there was a story here, unless they were familiar with modern work on turbulence at a deeply technical level — unless they had "contributory" as well as "interactional" expertise. And if anyone had gone over those archives around 1990, before these ideas were re-discovered, what would they have made of it?)
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Cross-posted to Crooked Timber, hence the parting shot about comments.
Under the rubric "Can Blogging Derail Your Career?", the Chronicle of Higher Education has seven bloggers discussing Yale's decision to not hire Juan Cole as a professor of history, and the role, if any, played by his blog in that decision: Siva Vaidhyanathan, Dan Drenzer, Brad DeLong, Michael Bérubé (all: yay!), Glenn Reynolds and Ann Althouse (both: hiss), and Erin O'Connor (null result), with a "response" by Cole, which doesn't actually address the others' posts specifically, and reads like a separate essay on the same subject as the others. (Via De Long.)
(Some of the things which were written about Cole as part of the controversy (e.g.,) give the impression of a professor who attains incomprehensibility not through obscurity but through foaming at the mouth. As it happens, though, I sat in on his seminar on millenarian movements when I was a post-doc at Michigan, and nothing could be further from the truth. I suppose I could have missed all the sessions which degenerated into hours-long rants about Zionist Entities... Of course, I don't know why Yale didn't give him the job, but if it was because they thought he was too spittle-flecked to be presentable to parents and alumni, they were misinformed.)
The fact that this post is not filed under "Middle East Politics" isn't going to stop anyone in the comments, is it?
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
The Xanadu region of Titan has rivers:

No images, yet, of "caverns measureless to man", or a "sunless sea" of liquid methane.
(Via Beyond the Beyond.)
Update, 20 July: I should have read the press release first (via Uncertain Principles). More importantly: Venus! (via The Daily Llama).
Update, 21 July: The Quantum Pontiff wonders whether Einstein's work on the physics of rivers will hold on Titan.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Attention conservation notice: 1600 words of pedantic whining about how a book on architecture didn't meet my parochial expectations. Also, it was mostly written and then abandoned to the gnawing criticism of the mice over a year ago.
I give up; it's got me beat.
I asked the good people at the MIT Press for a review copy of this, this thing, years and years ago, back when I was still regularly writing book reviews, because it sounded cool.
We humans owe an immense architectural debt to many other species. Indeed, the first hexagons humans saw may have been in honeycombs, the first skyscrapers termitaries (termite high-rises), and the first tents those of African weaver ants. In The Monumental Impulse, art historian George Hersey investigates many ties between the biological sciences and the building arts. Natural building materials such as wood and limestone, for example, originate in biological processes. Much architectural ornament borrows from botany and zoology. Hersey draws striking analogies between building types and animal species. He examines the relationship between physical structures and living organisms, from bridges to mosques, from molecules to mammals.Insects, mollusks, and birds are given separate chapters, and three final chapters focus on architectural form and biological reproduction. Hersey also discusses architecture in connection with the body's interior processes and shows how buildings may be said to reproduce, adapt, and evolve, like other inanimate or "nonbiotic" entities such as computer programs and robots. The book is both learned and entertaining, and is abundantly illustrated with fascinating visual comparisons.
It would have been cool, too, if Hersey weren't an ignorant idiot. Oh, I'm pretty sure he's OK when it comes to purely architectural matters — though how would I know if he wasn't? What drives me up the wall every time I try to write about this book is that he gets everything else wrong.
Let's take history and languages first. Hersey is a professor of art history, and so I don't think it's unfair to expect him to get these straight. But we are talking about a man who can instance a spiral design from a Byzantine church as an example of an "Islamic spiral", whatever that may be (p. 47). He doesn't seem to realize (pp. 28–29) that the early Greek architects who first defined the classical orders wouldn't have used a Latin word (capitulum) to refer to the top of a column. I'm not even sure he realizes (pp. 7–8) that Hermes Trismegistus was a Hellenistic myth. For that matter, after presenting a fanciful analogy (pp. 17–18) between the ground-plan of Lemba, a Chalcolithic village in Cyprus, c. 3900 B.C., and cross-sectional diagrams of biological cells, he writes:
Certainly the builders of these dwellings would have known similar forms in their immediate surroundings — things that, unlike true body cells, were visible to the naked eye. One prototype would be the egg, which begins as a contained for a single-cell embryo embedded in the nourishing matter it will need in order to reproduce and grow. ... And then there are cell-like beehives, birds' nests, and plants. To the Greeks, moreover, and therefore maybe even to the Chalcolithic residents of Lemba, the word for cell (kutos) also meant uterus, and even the whole human body. So we must not relinquish the thought that the Lemba cells are the extended phenotype of builders whose own bodies, though they did not consciously know this, were put together similarly.Let me try to extract everything that's broken in these sentences, and see what's left.
Bad as he is on history and languages, Hersey is worse at science and mathematics, and the history of science. He has no idea what "a topology" is (p. 51; at best this is a garbling of a curved manifold). He is capable of writing about distances "increasing at a fixed angle" (p. 45; he seems to mean rate), and of "steel molecules" (p. 13). He thinks sharks are not vertebrates (p. xvii). He literally does not know what a virus is (p. 15). (He also doesn't know when they were first discovered, nor when their shapes were first determined [pp. 15–16], and so his suggestion that illustrations of the shapes of viruses influenced some of Gaudí's designs on the Sagrada Família [p. 16] is just wrong.) He doesn't realize that DNA molecules don't actually look like the simple diagrams people draw of double helices, but are bent, folded and twisted, and so resemble spiral staircases not at all (pp. 6–8). And so on, and so forth.
Let me give a last example of the kind of thing which irritates the hell out of me; it comes from very early in the book (pp. xviii-xix), but it's central to whatever attempt at an argument Hersey makes.
Homo sapiens shares something that I don't yet dare call a gene sequence for building — shares it, perhaps homologously, perhaps convergently, with other constructing creatures such as birds, crustaceans, ants, termites, and bees. I will also be claiming, as a corollary, that the shapes of our monumental shelters, whether bicycle sheds or cathedrals, reflect and often derive from the shapes first created by these other species — species that, like us, are subject to the monumental impulse. ...This isn't a paradox for his idea — it's a refutation. The last common ancestor of humans and termites lived before the Cambrian explosion, presumably in the oceans; whatever bizarre wormy thing it may have been, it assuredly didn't build. Even if the genes "for" building in humans and termites are both descended from the same set of genes in that remote common ancestor, they are no more homologous than flight is homologous in birds and pterodactyls, because they both independently modified vertebrate forelimbs into wings. A little reading on comparative methods, and how homologies are actually established, would have kept Hersey from wasting his and his reader's time. (Similarly for the chapter on the reproduction and evolution of architectural designs, which is completely innocent of all actual work on, say, the evolution of technology, or even on the cognitive processes of architectural representation.)
But now comes a paradox: certain ants, termites, honeybees, and birds build elaborate structures. So do humans. But, as humans, we are anomalous in doing this. Only a few other mammals build — most obviously beavers and badgers. Worse still, our own closest cousins, the other primates, hardly build at all. An African termitary might remind us of Wright, of a Gaudí spire, or of a skyscraper by Hermann Obrist. But no such thoughts come to mind when we look at the rudimentary retreats of chimps and gorillas. Thus any genetic homology that brackets us with the other builder-species will have to be very ancient and, also, will have to have bypassed our immediate ancestors and cousins.
The whole book is like that — a series of conceits which a little thought or research would've shown don't work, presented as real scholarship. To be fair, Hersey sometimes allows himself a certain levity of presentation: in chapter eight he claims (basically) that people like domes because they remind them of breasts, which he illustrates by juxtaposing a picture of the Taj Mahal with one he attributes to "D-Cup Superstars, February 1992". But even then, he concludes, on no basis whatsoever, that the Taj is "an architectural thernody to the queen's breasts" (p. 155), and means us to take that seriously.
I won't say that Hersey's book is bullshit, because Hersey buys it. But I will say it's crap. It's crap in the same way that much too much of what I read by scholars in the humanities is crap: Hersey doesn't think carefully and critically, he tries to use ideas he doesn't understand, he's sloppy about facts, and he thinks he's establishing reliable conclusions when he can't argue his way out of a wet paper bag. Lest by saying this I call up the wrathful wraith of Chun the Unavoidable, I hasten to add that (1) many scholars in the humanities are, indeed, excellent and careful scholars, who do not suffer from these debilities, and (2) I think this cannot in the least be blamed on any value of post-*ism, or or "theory" or anything of that sort. Certainly Hersey is not a post-*ist, and I have no reason to believe this sort of crappiness has become more common among humanists in recent years. (If anything, I'd guess that the causal arrows point from crappiness to post-*ism, rather than the other way. For this, too, however, I have no evidence.)
— But, you see, this is what always happens when I try to write a proper review of The Monumental Impulse: I end up wandering hopelessly off topic, in order to avoid having to think of all the ways the book vexed me.
Do not read this book.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Attention conservation notice: 1500 words on psychophysics and the statistical mechanics of disordered excitable media. Also, it was cross-posted to Crooked Timber, where I am guest-blogging this week, so you've seen it already.
First off, I should thank Henry and the rest of the Timberites for the kind invitation to guest-post, and that very warm introduction. In exchange, I'm going to blog more or less as I usually would, only here. This means some big bricks of posts about "complex systems", so called, which is or was my scientific field, more or less; and also any miscellaneous outrages which catch my eye this week. Mounting my usual hobby-horses on this stage is a poor exchange for their generosity, but mounting hobby-horses is why I started blogging in the first place, and anyway I'm big on conscienceless exploitation of cooperators.
Today I want to talk (below the line) about some recent work in the statistical mechanics of disordered systems, which might help explain how our sense organs work, and actually involves some good uses of the self-organized criticality and power laws; tomorrow or the day after I'll get to the smoldering question of "Why Oh Why Can't We Have Better Econophysics?"
Folklore says that the dark-adapted human eye can detect a single photon; this isn't quite true, but we can consciously detect a few tens of photons, and some species are that sensitive. Of course, we can see not only in the dark but also during broad daylight, but then the number of photons falling on every part of the retina is huge; the eye isn't overwhelmed and saturated, though now one or ten photons more or less makes no discernible difference. In the jargon, the eye, and the other sensory organs, have both a large "dynamic range" (we can see in the dark and in the daylight), and "nonlinear response" (changes which are noticeable in the dark aren't against a high-intensity background). Some version of these facts, including the basic (power-law) form of the relationship between physical stimulus intensity and perceived sensory magnitude, have been known since the nineteenth century. This makes it all the more puzzling that sensory neurons show a linear response over a narrow dynamic range, beyond which they saturate.
You could evade this difficulty by having lots of neurons with different operating ranges, so that raising stimulus intensity saturated some but activated others. The problem is that there don't seem be that wide a spectrum of operating ranges for individual neurons. In a recent paper, Osame Kinouchi and Mauro Copelli (who blog together at Semciência) offer another way, which has to do with the way sensory neurons interact with each other in a network.
Neurons, like muscle cells, are "excitable", in that the right stimulus will get them to suddenly expend a lot of energy in a characteristic way — muscle cells twitch, and neurons produce an electrical current called an action potential or spike. Kinouchi and Copelli use a standard sort of model of an excitable medium of such cells, which distinguish between the excited state, a sequence of "refractory" states where the neuron can't spike again after it's been excited, and a resting or quiescent state when the right input could get it to fire. (These models have a long history in neurodynamics, the study of heart failure, cellular slime molds, etc.) Normally, in these models the cells are arrayed in some regular grid, and the probability that a resting cell becomes excited goes up as it has more excited neighbors. This is still true in Kinouchi and Copelli's model, only the arrangement of cells is now a simple random graph. Resting cells also get excited at a steady random rate, representing the physical stimulus.
Kinouchi and Copelli argue that the key quantity in their model is how many cells are stimulated into firing, on average, by a single excited cell. If this "branching ratio" is less than one, an external stimulus will tend to produce a small, short-lived burst of excitation, and there will be no spontaneous activity; the system is sub-critical. If the branching ratio is greater than one, outside stimuli produce very large, saturating waves of excitation, and there's a lot of self-sustained activity, making it hard to use a super-critical network as a detector. At the critical point, however, where each excited cell produces, on average, exactly one more excited cell, waves of excitation eventually die out, but they tend to be very long-lived, and in fact their distribution follows a power law.
(People who teach courses on random processes are very fond of branching processes, because the basic model can be solved exactly with hundred-year-old math, but there are endless ramifications, and some of the applications are very technically sweet. Like most mathematical scientists, Kinouchi has certain tools he tends to return to, and critical branching processes are one of them.)
As Kinouchi and Copelli say in their abstract, the idea that the critical point, where things are just about to go unstable, is a useful place for processing or transmitting information is a persistent theme of complex systems. (You could, arguably, even trace a version of the idea back to William James's Principles of Psychology.) It has also, before this, been one of the weakest of our ideas. The original work from the 1980s on "evolving to the edge of chaos" has proved impossible to replicate, I would even say experimentally refuted. (Why the phrase and idea continue to propagate is another question for another time.) Stu Kauffman's studies of models of gene regulatory networks certainly suggests that information moved through these most easily near their critical point, but I don't think anyone has done a careful information-theoretic analysis of that. In any case, E. coli doesn't care about the bandwidth of its regulatory network: it cares about reliably making lactase when it only has lactose to eat, i.e., specific adaptive functions. Prior to this, I can only think of one situation where the idea has been made precise and has strong evidence to back it up (namely, this paper), but that's a purely mathematical exercise of no biological relevance.
What Kinouchi and Copelli have done is very different: they've actually identified something biologically important which is maximized at the critical branching ratio, namely the dynamic range. The network as a whole responds to the stimulus, and its dynamic range can be many orders of magnitude wider than that of its component cells. It is this enhancement which is maximized at the critical branching ratio, and falls off sharply for networks which are even a little sub- or super- critical. As a bonus, the shape of the response function is of the correct power-law form, though, in their model, the exact exponent isn't right. Modifying the network structure, or some model details, changes the exponent, but the dynamic range is still sharply peaked at the critical branching ratio.
There are a lot of other nice things about this paper, which I won't get in to least I repeat it all, but I will point out one thing: while their central qualitative results are pretty robust to small tweaks, there are some details of their model which make it a fair caricature of some excitable media, but not all. These are quite deliberately matched to properties of the olfactory system and the retina, but wouldn't work in, say, the cortex, where the dynamics of excitation are different. So this isn't an "over-universal" model, but one of particular phenomena produced by particular mechanisms. In fact, looking at olfaction, they are able to make a prediction about the effects of knocking out specific genes which are involved in the fast, symmetrical electric couplings they assume. Nobody seems to have done those experiments yet, but, at least to this non-biologist, it seems feasible, and, now, very interesting.
*: Here's an anecdote illustrating how broken academic publishing is. Kinouchi and Copelli work at the University of Saõ Paulo, which doesn't, for reasons of economy, subscribe to Nature Physics. To get an electronic copy of their own published paper, they were forced to write correspondents at other universities. I couldn't help them, because my school doesn't feel like it can afford to subscribe to Nature Physics either.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Speaking of air travel, as we were, Kara dreams of traveling to the Beijing Complex Systems Summer School.

Alas! She will not be gracing the Fragrant Villa with her presence. Twenty hours in planes and airports would not agree with her temperament, nor would jet-lag be pleasant for one so distressed by any interruption of her accustomed nap cycle.
As for me, I have cleverly worked on my three lectures in reverse order, so the last two are finished...
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
I wrote this about a year and a half ago; I'm cleaning out my drafts folder.
There are reasons why one might think that Turkey should not be admitted to the European Union, but surely the silliest must be that Turkish is not an Indo-European language. Following Phersu, I can just imagine the consequences of taking this seriously. First, the Basque-speaking provinces of France and Spain leave the EU, along with Hungary, Finland, Estonia and Malta. But then, of course, India and Pakistan will submit rival applications to join, closely followed, no doubt, by the Iraqi Kurds. The whole idea is so stupid that I can't believe it was meant seriously, or even guess what Giscard d'Estaing thought "Indo-European" meant.
That said, Turkish does have features which are absent or attenuated in (most) Indo-European languages. (Disclaimer: I do not speak Turkish.) For instance, it's highly agglutinative, forming new words by adding suffixes to roots, and doing so recursively. (German does this too, but to nowhere near the same degree.) This leads to words like yapabilecekdiyseniz, "if you were going to be able to do". (Readers may amuse themselves by analyzing this example using the Turkish Suffix Dictionary.) Moreover, these words are not oddities, like "antidisestablishmentarianistic", but in everyday use. I once heard a talk by a computational linguist specializing in Turkish — Gerjan van Schaaik, who oddly seems to have no web presence — where he mentioned that if one studied the corpus of Turkish daily newspapers, one could easily build a lexicon of 500,000 entries, and still cover only 95% of the words in the corpus. (I can't tell, from my notes, whether van Schaaik was talking about something that had actually been done, or just making a rough estimate.) This property of Turkish becomes very important for a number of technologies, including one without which the modern world would simply grind to a halt: spam filtering.
Özgür et al. do not report on the ability of their classifiers to discriminate between spam, and weirdly pseudo-learned pronouncements from former presidents of France.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
I wrote this about a year ago; I'm cleaning out my drafts folder.
As a paid-up member of the Viridian movement, I it would be a Very Good Thing for industrial civilization to find a power source which will let it run and grow without choking on its own waste. (I'm no good at ranting about this, so if you're curious I refer you to the master.) But it would be foolish to pretend that non-fossil-fuel power sources will not carry their own costs, including the possibility to alter the climate, and something worse than foolish not to consider those costs. After all, one of the Viridian design principles advises us to "Look at the Underside First":
Legions of people are paid large sums to promote the positive aspects of commercially available products. Very few people earn their daily bread by pointing out malfunctions, bugs, screw-ups, design failures, side-effects and the whole sad galaxy of trade-offs and failings that are inherent in any technological artifact. To counteract this gross social imbalance, a wise designer and a wise critic will make it a matter of principle to look at the underside first.
In which spirit, I bring you the following.
There are basically three channels through which wind power would influence climate. The first is direct increase of friction in the vicinity of the turbine, which dissipates kinetic energy in the wind into heat. The second is that the increased drag produced by the turbines will change the speed and direction of prevailing winds, and since these winds carry heat and moisture, can potentially alter the climate over large areas. The third, and most indirect, channel is that increasing use of wind power may reduce the amount of fossil fuel consumption, and interact with climate changes produced by the existing load of greenhouse gases.
The present paper is a first cut at considering all three channels, using two standard global climate models and a variety of different assumptions about where wind-turbines will be situated, how efficient they will be, and the manner and magnitude of the resulting increase in surface-layer drag. There are plenty of crude approximations made here (for instance, that the increase in drag is uniform over the affected area), but they're recognized as such. The results thus need to be treated with caution, but they are interesting. The second channel (changing atmospheric transport) has a much bigger impact than the first (direct friction); together, under plausible assumptions, they give a near-zero impact on the global mean, root-mean-square change in seasonal means of about one twentieth of a degree centigrade at one point, and peak changes of half a degree. Under further plausible, but of course more arguable, assumptions, the third channel (reduced carbon emissions) swamps these effects, too, by about a factor of five. Interestingly, the configurations for wind power they consider tend to cool the poles and warm the lower lattitudes, while carbon dioxide warming has the opposite effect. This suggests (though the authors don't go there) that we might consider massive wind farms as a pure climate-change moderator, even if they weren't a practical power source from an engineering point of view.
It should probably be noted that the first two sponsors listed in the acknowledgments are British Petroleum and the Electric Power Research Institute.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Reading this piece in the New York Times about Farecast, Oren Etzioni & co.'s attempt to forecast airline ticket price movements (through the power of machine learning!) leaves me with three reactions.
First, it's a relief to read something about data-mining and airlines which isn't Yet Another Unconstitutional Step Towards a National Surveillance State. In fact, this would be cool if it works.
Second, I'm professionally curious about how well Farecast's predictions, with over 100 independent variables, would compare to simple alternatives, like low-memory hidden Markov models, or throwing out all the variables except the time remaining until departure. (Since I'm teaching data mining in the fall, I'm also professorially curious.) I suspect that, even if there is some real improvement, it is small. Come to that, the trivial predictor which always forecasts a price increase is going to set a pretty high baseline accuracy...
Third, I'll be surprised if does work, not on technical grounds, but because I don't see how it makes sense for airlines to cooperate. The ideal airline pricing scheme is one which gouges you just enough that you're indifferent between taking the flight and not going at all (or taking some alternate mode of transport, etc.). This is why flights at ungodly hours are cheaper than those on the same route at decent times: by showing up at five in the morning to have your luggage prodded, etc., you signal to the airline that they really can't get any more money out of you. [1] Now suppose that you want to take a certain flight, and there's a maximum amount you'd pay for it. As things currently work, you look up the price and see what the airline is currently charging. If that price is less than its value to you, you buy the ticket, and the difference is your "consumer surplus". Now Farecast comes along and says, in effect "sure, you could do that, but the price is going to drop --- hold on and you'll do even better." So you buy when the price hits its trough, and are better thereby. (Yes, some people act like this now, I'd guess not many, and not very successfully.) From the airlines' point of view, however, every dollar by which your consumer surplus grows is a dollar they could have had. ("And I would have gotten away with it, too, if it hadn't been for you meddling KDDs!") Consumers and airlines are engaged in a zero-sum competition over the potential surplus, and this doesn't help the airlines.
Which, again, makes me very puzzled about why they would cooperate with it. The smart move on their side, I think, would be to systematically undermine the reliability of Farecast. This could be done very simply, without even attempting to reverse-engineer the predictor: monitor its forecasts of your own flights, and, all else being equal, do the opposite. It's true that a reliable forecast of a price increase isn't so bad, for the airlines, as a forecast of a price decrease, but systematically jamming and confusing Farecast should be easier than selectively doing so. But I defer to real economists about the importance of this wrinkle.
The larger moral ought to be a familiar one: in strategic interactions, you have to assume that the other side will adapt to you. This doesn't mean that statistical methods have no place in studying strategic interaction (see, e.g., the second paper here), but it does mean we should be very dubious about the ability of simple data mining to give us an advantage over an opponent as smart and determined as a commercial airline.
(Thanks to K. for sending me the article, and discussing it.)
1: If you want to understand the logic of airline pricing, among much else, a great read is Carl Shapiro and Hal Varian's Information Rules: A Strategic Guide to the Network Economy. [I have been sitting on a draft review for seven years now, and am not about to stop.] Despite the very late-1990s title, this is really about the general economic principles involved in any industry where high first-unit costs and low marginal costs give you positive economies of scale, or where there are strong positive network externalities. Airlines are in the first category, because the cost of getting a jet from New York to LA with 100 passengers is almost the same as getting it there with 101 passengers, and is mostly the cost of getting it there empty. Alas, appreciating the rational essence of the process does not help make the lived experience any more endurable.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
I wrote this about a year ago; I'm cleaning out my drafts folder.
As every schoolchild knows, what became the Internet was at first ARPANET, one of the many projects by which the Pentagon's Advanced Research Projects Agency nurtured the development of computer science and technology in the US. ARPA eventually became DARPA, but for a long time retained its character of a patron of basic and curiosity-driven research. Lately, however, it seems to have lost its touch, what with the whole terrorism futures markets debacle, "total information awareness", and so on. (I myself was for several years sustained, as a graduate student and post-doc, by my adviser's DARPA grant, familiarly known as cooperative agreement F30602-00-2-0583, part of the TASK program. Of course, it's not for me to say where our work fell in the spectrum from foundational to flaky.) The sad but predictable response has been to demand that DARPA de-emphasize basic academic research in favor of working with private-sector contractors for short-term military payoff.
All of which is by way of lead-in for this post to the connectionists mailing list:
Postdoctoral position in neurobiology / engineering in Woods HoleA 4-year DARPA research project, funded annually, to steer the behavior of sharks in the natural environment through stimulation of selected sensory brain areas. Expertise in brain stimulation, multi-electrode recording and neural data analysis most desirable. Interfacing with wireless data transmission and stereotactic electrode positioning.
A year earlier, a Boston Globe story had more information, but lacked the admirably matter-of-fact tone of the job-ad.
I don't see much by way of follow-up, but I haven't looked very hard. For all I know, this work is already well on its way to giving us a beloved childrens' classic.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Nature, in its wisdom, has compiled a list of the fifty most popular weblogs by scientists. I am more than a bit astonished to see this blog on the list, but my faith in the reading public is bolstered by seeing that it's only in the fiftieth place, and everything ahead of it that I recognize definitely deserves to be. (Via Aetiology [#7] and Pharyngula [#1].)
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
You make me want to quote the Medium Lobster's cost-benefit analysis of pre-emptively blowing up the moon in my report.
I hope you appreciate what it costs me to resist this temptation.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
From John Hall and Charles Lindholm's Is America Breaking Apart? (pp. 83--90, omitting footnotes):
One thing that has often been held to characterize Americans is the ambiguity, confusion, and "contagious vagueness" of their understanding of political theory. Americans may know, for example, that citizens have rights, but are extremely unclear about what those rights might be; they know Americans are supposed to be free, but not how freedom is limited, or what freedoms are permitted; they know that "all men are created equal," but cannot reconcile that precept with the protection of property. In other words, most Americans are very proud indeed of the principles that their country is built upon, but for them those principles consist primarily of abstract notions, such as liberty, justice, and equality, rather than a systematic set of specific precepts or practices.They go on to talk about why we are so friendly and nice, and why believe that love is the only legitimate bond between people, but I'm skirting the bounds of fair use as it is, even in the interest of patriotic display.The major sources of "contagious vagueness" are to be discovered in the priority Americans have always given to pragmatic experience over philosophical system-building. Under the fluid circumstances of American life, ambivalence and woolly abstraction have certain advantages. ... While remaining faithful to nebulous abstractions of "freedom," "individualism," and "equality," Americans can still be pliable in action without betraying their integrity or finding themselves immobilized by contradiction. Ideological vagueness thus allows Americans to feel a sense of unity without the trouble of actually considering exactly what that unity is based upon.
The abstraction and ambiguity of taken-for-granted foundational principles also allows Americans easily to "hold contradictory ideas simultaneously without bothering to resolve the potential conflict between them." This, too, is not necessarily a bad thing in a pluralistic society, where central authority is relatively weak. The blurry quality of American assumptions about their shared creed allows them to accept innovations easily, so long as the innovations are metaphorically bathed in the aura of tradition, and to react according to circumstances without too much concern about agreement with prior positions. Most important, an ability to ignore contradictions permits Americans to overlook disputes that might tear a more ideologically consistent society apart.
Corresponding to American abstraction and vagueness in the realm of political philosophy is a positive can-do approach to ordinary problems. ... It is no surprise that the predominant American philosophy is pragmatism.... However, what pragmatism takes for granted as "common sense" is actually a culturally constructed perspective, based in large measure on what has recently been called modular thinking. This is a strategy for instrumental action which assumes that complex wholes can be broken into elementary parts; these parts can then be efficiently recombined according to need. Modular thinking is American to the core: it is an atomistic, flexible, anti-organic, and anti-authoritarian view of the world — one which dispenses with tradition in favor of efficiency, and places all alternatives on an equal footing, subject to personal evaluation by the active innovator, who decides which combination is best.
Modular thinking has had a successful history in America. It is responsible for the development of the assembly line and Taylorist innovations in scientific management, and it provides the foundational principles behind everything from the construction of shopping malls to the planning of school curricula. ... [T]he pervasive pragmatic modular approach to life permits Americans to avoid divisive ideological issues by visualizing the world around them as a machine that can be retooled, or taken apart and rebuilt, in order to achieve maximum efficiency. ... Disagreements are not over principles, but over design. Though this mechanistic instrumental worldview may remove much of the magic from the cosmos, and though it certainly does not grasp complex social realities, it is not likely to arouse great passions either — and so is conducive to social peace.
It is especially striking that for Americans even the self is considered to be a kind of modular entity, capable of being reconfigured to fit into preferred life styles. This malleability is often decried as indicating American shallowness, or else praised as the postmodern triumph of the signifier. But the American emphasis on perpetual self-transformation also serves the cause of unity, though perhaps not in the way Protestant moralists would prefer. This is because the search for identity is a notoriously solipsistic pursuit: such quests do not lead to revolution, but to harmless participation in the therapeutic, self-help, and twelve-step groups that have so mushroomed in America. At the very worst, the search for an authentic self draws the most perplexed seekers towards immersion in the multitude of sects and cults that have always sprouted on American soil. Occasionally, it is true, these groups spiral into psychosis ... but generally these new religions are akin to the "healthy-minded," "once-born" faiths that William James found so characteristic of America. They typically affirm the goodness of all creation and preach accommodation with the world as it is, stressing mental discipline, while applying the optimistic American "can-do" attitude to spiritual uplift and practical self-betterment. Membership in them is no more harmful than membership in any local PTA.
American faith in the power of individuals to change themselves is quite understandable as a product of the immigrant experience in combination with the Protestant ethos. Protestant sects believe that individuals can be spiritually transformed through disciplined, virtuous action in this world. For most of the original settlers immigration to America was just such a transformative action, a voluntary pilgrimage in search of the City on a Hill. In secular garb, this model continues to hold: becoming an American is a kind of conversion experience. The newcomer "is not required to learn a philosophy," Daniel Boorstin notes, "so much as to rid his lungs of the air of Europe." This point is not invalidated by the fact that more recent immigrants, male and female, rid their lungs of the air of China, India, and Africa. For all these newcomers, past and present, America has been the "Mother of Exiles. From her beacon-hand glows world-wide welcome." The content of the glowing welcome offered by the Statue of Liberty is not a dogma, but an opportunity. America presents itself as a place where newcomers can achieve their dreams, free at least from the chains of tradition, class, and history...
For the zealous believer of colonial times, the end sought through migration to America was a passage into heaven.... For the modern entrepreneur, the goal is likely to be far more mundane: owning one's own business and acquiring a house in an exclusive suburb. Whether what they sought was spiritual or material, immigrants to America have worried little about conceptual consistency or a systematic organization of principles. Central instead is a belief that individuals have the capacity, through personal effort, dogged discipline, and creative innovation, to leave the past behind, to pursue happiness, and to become whatever their potential allows. Only in America would the Army call on its recruits to "be all you can be."
We can feel the heady appeal of this transformative aspect of American life in a letter sent by a French migrant to California during the Gold Rush:
In the mist of this world of adventurers, who change their occupations as often as they do their shirt, egad, I did as the others. As mining did not turn out remunerative enough, I left it for the town, where in succession I became a typographer, a slater, plumber, etc. In consequence of thus finding out that I am fit for any sort of work, I feel less of a mollusk and more of a man.For such adventurous souls, America indeed offered — and continues to offer — an opportunity for taking on a new and better identity, for making a mollusk into a human being.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
The importance of coffee-houses in the Enlightenment, and the rise of the public sphere, is a historical common-place. But it's also puzzling: historians can say a lot of sensible things about how, as a social setting, the cafe was conducive to the give and take of (more or less) rational argument, and (relative) indifference to social standing in favor of persuasion. But I've never heard a good story for why coffee houses had to be run that way, nor that (say) taverns weren't, or couldn't have been, run that way. So, while not denigrating the social factor, it doesn't seem to explain why this connection took hold. Now, at last, scientific proof that Enlightenment had a sound material basis (via Mind Hacks):
I should perhaps add that the leap from their findings to the rise of modern rationalism is entirely my own.
Manual trackback: Stephen Laniel; Brad DeLong
Update, 13 June: On the basis of my correspondence about this post, I feel like I have to add that I was not serious. "Differential diagnosis, people" (to quote a great sage and eminent junkie): both coffee and the coffee-house were imported into Europe from the Levant, where the coffee-house developed as a recognizable social form, without triggering a local version of the Enlightenment. For that matter, all the physiological studies point to the influence of caffeine as such, not just coffee, so why not tea? (Though, come to think of it, didn't the proto-scientific and industrial development of the Song dynasty coincide with the rise of tea houses?) No doubt patterns of caffeine consumption have had some effect on culture, and it would be nice if historians could study them, but not like this. I'm a reductionist, but when I toss out simplistic biological explanations for complicated, ill-defined social phenomena, I'm joking, unlike some people.
Also: Ruchira Dutta writes with news of a learned investigation into related questions, A. A. Reade's Study and Stimulants; or, the Use of Intoxicants and Narcotics in Relation to Intellectual Life, as Illustrated by Personal Communications on the Subject, from Men of Letters and of Science (Philadelphia: Lippincott, 1883). As the subtitle suggests, it largely consists of letters from various eminences of the day on their own experiences of using drugs, and thoughts on whether and how drugs are of use to intellectuals. Some of them are quite charming, in a rather prim way. Thus H. H. Bancroft's letter, in its entirety: "In my opinion some constitutions are benefited by a moderate use of tobacco and alcohol; others are not. But to touch these things is dangerous." Or, speaking of the rise of rationalism, Mr. W. E. H. Lecky: "I am not a smoker, and am therefore unable to give you any evidence on the subject." Other correspondents were not similarly inhibited by their lack of first person data, such as Keshub Chunder Sen ("The problem you have undertaken to solve is, indeed, one of intense importance and interest, and all who can ought to help its solution in the interests both of science and morality") or Mr. Ivan Tourguéneff.
Minds, Brains, and Neurons; The Collective Use and Evolution of Concepts; The Great Transformation
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Kara has a charming habit of secreting herself behind some corner and landing a devastating swipe of her paw on the calves of passers-by. This would be very effective at crippling us, if only she had not been cruelly declawed at some point before we adopted her from the shelter. Today's post, however, will not be about the irreducible intensionality revealed by her desire to bring down prey much larger than herself, nor about the qualia of her phantom claws (which I am convinced she experiences). Rather, I want to talk about the mechanism which brings her paw — and with it her fore-arm and shoulder and the rest of her body — to move just so as to effectively ham-string us. This mechanism is accessible to experimental investigation, and, as it turns out, pretty nearly linear.
To elaborate a little on the experimental procedure: the cats were anesthetized with ketamine, which allowed for surgery to attach sensors measuring electrical activity to eight muscles, as well as brain surgery opening up the motor cortex. They also attached a sensor to the paw, tracking its position in space. Then, maintained on a steady flow of ketamine, the "body of the cat was laid on a cushion with its forelimbs hanging perpendicular to the ground and free to move in all directions against gravity". At this point, they began small-scale, low-current stimulation of the motor cortex, until they identified a number of points each of which, when stimulated, produced a repeatable and distinct motion of the paw and fore-limb. They recorded both the displacement of the paw, and the total activity, over the course of the movement, of each of the eight muscles. Thus, each point in the cortex corresponded to both a three-dimensional vector, in ordinary space, and an eight-dimensional vector, in muscle space.
In a linear system, if you add two inputs, you also add two outputs; this, and nothing more, is what "linear" means. Vectors add according to the parallelogram rule, so if the cat motor system is linear, if we stimulate two points simultaneously, the movement of the paw should be the superposition of the movements produced by either point on its own. I wish I could build some suspense at this point, by harping about how neurons are notoriously nonlinear devices, so it's madness to expect any sort of linearity here, but Ethier et al.'s title rather gives the show away. When they simultaneously stimulated pairs of points, they got paw motions which were almost perfect linear sums of the individual movements. This was true of both the paw-displacement vector, and the muscle-activation vectors. (See especially their figures 2 and 5, and accompanying text.) It didn't matter whether they electrically stimulated both points, or stimulated one point while chemically reducing the inhibition at the other. This is very nice, but what I like even more is the experiment summarized in their figure 7, where they took two cortical points, which produced nearly perpendicular movements on their own, and by varying the magnitude of the stimuli at each, got a sequence of movements which smoothly interpolated between them, exactly as one would hope for a linear control system.
Now, I should say that the linearity of response wasn't perfect. The largest systematic deviations from linearity occurred when summing the individual motions would have produced the largest displacements — in a word, the muscle response saturated. To avoid this effect, Ethier et al. first established an input-output relationship for each cortical point on its own, and kept the stimulus magnitude low enough to avoid saturation there. They suspect the place where the response became sub-linear was in the spinal cord, but they don't, that I can see, really establish it.
There are two different larger morals to be drawn from this story. One has to do with the functional anatomy of the motor cortex (a larger, long-running story nicely presented in an older paper of Capaday's). There is clearly a great deal of localization of function there — this point produces a swiping motion of the paw, that that one pulls it back towards the chest, neither makes the tail twitch — but of a peculiar sort. One might well imagine that each point in the motor cortex would correspond to a particular muscle or group of muscles; instead, at least in the part Ethier et al. worked with, they seem to correspond to motions involving many muscles to varying degrees, overlapping from one cortical point to another. Linearity means that a reasonably small set of motions could serve as a basis for a vast range of coordinated actions, without all of those having to be separately stored in the motor cortex.
The other moral has to do with the general principles of neural representation and computation. Neurons are, indeed, horribly nonlinear little things, so it would be entirely reasonable to suppose that neural codes are too; but that would be too quick. One of the few efforts in this area that is general, abstract and predictive enough that it seems to me to be worth calling a theory, the "neural engineering" advanced by Chris Eliasmith and Charles Anderson in a book of that title, takes as its first principle "nonlinear encoding and linear decoding". That is, while the mapping from input to output is hairy and nonlinear, for typical outputs you can recover the input, to high accuracy, using a linear rule. This is especially easy to arrange in neural systems where excitation and inhibition are neural balanced, so Ethier et al.'s findings on dis-inhibition fit in nicely.
Nonlinear encoding and linear decoding is not just an assumption of Eliasmith and Anderson, but, e.g., features quite prominently in Spikes, and is implicit in the now-standard "reverse correlation" method. While I am not, usually, one to argue with scientific success, I have reservation about this. William James used to decry, as "the psychologist's fallacy", the "confusion of his own standpoint with that of the mental fact about which he is making his report" (Principles of Psychology, ch. 7). Something similar (the "computational neuroscientist's fallacy", perhaps?) seems to me involved here. Neural representations do not exist to be decoded by scientists, but to be used by other parts of the organism, and ultimately to produce adaptive actions. What is lacking, in most of these studies, is evidence that linear decodings of neural activity are in any way biologically relevant. (One of the nice things about Eliasmith and Anderson is they see at least part of this, since their second principle is that other parts of the brain use a neural representation by applying alternately-weighted linear transformations to it, i.e., biased linear decodings. But they present less evidence for this than for their first principle.)
In this case, however, I don't find much room for doubt: the points in the motor cortex represent actions, and those representations are, when the paw meets the calf, linearly decoded. It's still not clear to me that linear decodings and transformations are any easier for the brain to implement, but at least in this case that's what's going on, and it's an empirical fact we will have to incorporate in our models, or ideally explain in our theories.
And now, if you will excuse me, I'm being attacked.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
A year and a half ago, I commenced a major purge of my book collection. A year ago, when I left Ann Arbor for Pittsburgh, I consigned thirty-odd boxes of used books to Corners Bumped, a.k.a. my friends Bill and Barbara Tozier. (This tells you something about the number of books I did end up shipping here.) Recently, they have been refurbishing their online book-selling presence, listing new stuff from my boxes and their own stock, and musing (1, 2, 3, 4) on used books and the trade therein. Go buy something, will you? It will make all of us — you, me, Bill and Barbara, and not least the books — better off.
This concludes today's crass commercial plug.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Paradoxes and fallacies are fun teaching tools, and elementary courses in probability theory are well supplied with them. I have long thought that more advanced probability classes would be more palatable if we presented our students with more opportunities to go "Wait, that can't be right", and sink their teeth into something really just-plain-wrong. Happily, we now have about ten more such morsels to offer them:
Anyone taking 36-754 from me next spring should expect to see sections 2 and 3 as assignments.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
I'm afraid I was a bit premature in announcing my return to posting, since I find my time being taken up by actually doing some science, and being, for the first time, on the sending rather than the receiving end of a qualifying exam. In the meanwhile, let me enthuse about Far from the Madding Gerund, a selection (and annotation) of the best of Mark Liberman's and Geoffrey Pullum's posts from Language Log. Doing a proper review, or even justice to their weblog, is really beyond what I have time for, but this book seems to capture quite perfectly their mixture of whimsy, skepticism, accessible scholarship, and pure good-natured zeal for their subject. If you have been reading them, then you will find these little essays as good as you remembered, if not better, and you'll wonder how you forgot about some; if you have not been reading Language Log, then their book should convince you to start. Either way, reading it will make you better and happier. (Disclaimer: I got an unsolicited review copy of this book this week, which didn't help my productivity any. Also, when I gave a talk at Penn on April Fool's Day last year, Mark was kind enough to let me crash in the hospitality suite at One Language Log Plaza.)
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Kara has Views about humans working on Friday nights:
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Hopefully, it's just jet-lag, but seeing highway signs marked "Wien", "Praha" and "Budapest" makes me feel much the same way one marked "Minas Tirith" would, that something from known only from books has become unsettlingly real.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Or rather, from Ernest Gellner's Language and Solitude: Wittgenstein, Malinowski and the Habsburg Dilemma (pp. 108--109):
Imagine the conversation of a few Viennese characters, in the Cafe Central, on the assumption that the Tractactus is a correct account of the human condition.[Thanks to Wolfgang for checking my transcription of German titles!]CHARACTER A: High up on the left hand of my visual field, I note a fact in which a 9-value predicate links the appropriate number of things... I don't think I have had this one before, I'd like a snapshot for my collection.
B: I have a much more interesting one right in the middle of my field, a variable with 127 things attached — it has a lot of tentacles, holding those things. I've counted them.
C (to A): Don't you believe him. I have known him for years, he is invariably given to exaggerating the complexity of his facts, just to make himself interesting.
D (to C): I don't think he is deliberately lying, he just drinks too much and then imagines things.
B (furiously, to both C and D): What you have both said is extremely offensive and I have no option but to call you out! My seconds will call on you. That is, assuming you have honour: my fraternity has decreed that Jews have none and we may not duel with them. Is either of you at least half Aryan? That is all one can hope for in Vienna these days.
A: Gentlemen, gentlemen, please calm down. May I remind you first of all that dueling is forbidden by law and, secondly, death not being an event in life, is totally pointless anyway.The above conversation is not copied out of Die letzten Tage der Menscheit or Der Weg ins Freie or Der Mann ohne Eigenschaften or even Radetzky Marsch. It is all my own work and it is copyright.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
As a member of the program committee for the workshop Statistical Network Analysis: Models, Issues and New Directions, part of the 2006 International Conference on Machine Learning, I urge you to submit your best work by 28 April; you can send us the bad stuff after that. (That, in response to hearing endless variants on "First prize, a trip to Pittsburgh; second prize, two trips", which was old in my grandfather's day.)
CALL FOR PAPERS
Statistical Network Analysis:
a workshop at the
Models, Issues and New Directions23rd International Conference on Machine Learning
Thursday, June 29, 2006, Pittsburgh PA, USA
(ICML 2006)Overview:
This workshop focuses on probabilistic methods for network analysis, paying special attention to model design and computational issues of learning and inference.
Many modern data analysis problems involve large data sets of artificial, social, and biological networks. In these settings, traditional IID assumptions are blatantly inappropriate; the analyses must take into account the structure of relationships between the data. As a result, there has been increasing research developing techniques for incorporating network structures into machine learning and statistics.
Network modeling is an active area of research in several domains. Statisticians have mostly concentrated on models of static networks. These models are concerned with the existence of edges between individual nodes, but do not attempt to model aggregate properties. In contrast, physicists have addressed global properties of large complex networks. Their models describe average statistics of the network, or properties of typical networks in large ensembles; the links between particular nodes are less meaningful.
This workshop aims to bring together statistical network modeling researchers from different communities, thereby fostering collaborations and intellectual exchange. Our hope is that this will result in novel modeling approaches, diverse applications, and new research directions.
We wish to clarify that in this workshop, the word "relational" carries a different meaning from the usual sense of the word in Probabilistic Relational Models (PRMs). For example, in real life, any two random people maybe connected through a complex web of friendships; estimation of interpersonal connections thus cannot be done independently of the rest of the network. We focus on modeling statistical properties of the network, as opposed to different types of probabilistic relations. This differentiates us from the co-located ICML workshop on Statistical Relational Models.
Online Submissions:
We welcome the following types of papers:
We encourage authors to emphasize the role of learning and its relevance to the application domains at hand. In addition, we hope to identify current successes in the area, and will therefore consider papers that apply previously proposed models to novel domains and data sets.
- research papers that introduce new models or apply established models to novel domains,
- research papers that explore theoretical and computational issues,
- position papers that discuss shortcomings and desiderata of current approaches, or propose new directions for future research.
Submissions should be limited to a maximum of 8 pages, and adhere to ICML format. Please email your submissions to: edo [at] cmu.edu.
Deadline for Submissions: Friday, April 28, 2006
Notification of Decision: Friday, May 5, 2006Format:
This is a one-day workshop. It will consist of several themed sessions targeting methodological and application issues (e.g., estimation in static models, network evolution modeling, and statistical modeling of large scale networks) with talks (invited and contributed) and moderated discussion. Discussions at the workshop will facilitate exchanging of research ideas and help identify other challenging problems in the area. At the end of the workshop, a panel of statisticians, physicists, and computer scientists will discuss the points arising throughout the day and identify the most promising and challenging directions.Publication:
Accepted papers will be distributed on a CD and made available for download.Organizers:
Edo Airoldi, Carnegie Mellon University
David Blei, Princeton University
Stephen Fienberg, Carnegie Mellon University
Anna Goldenberg, Carnegie Mellon University
Eric Xing, Carnegie Mellon University
Alice Zheng, Carnegie Mellon UniversityProgram Committee:
David Banks, Duke University
Peter Dodds, Columbia University
Lise Getoor, University of Maryland
Mark Handcock, University of Washington, Seattle
Peter Hoff, University of Washington, Seattle
David Jensen, University of Massachusetts, Amherst
Alan Karr, National Institute of Statistical Sciences
Jon Kleinberg, Cornell University
Andrew McCallum, University of Massachusetts, Amherst
Foster Provost, New York University
Cosma Shalizi, Carnegie Mellon University
Padhraic Smyth, University of California, Irvine
Josh Tenenbaum, Massachusetts Institute of Technology
Stanley Wasserman, Indiana University
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
will be held at Oxford, 25--29 September. Since I'm on the program committee, I ought to point interested parties to the call for papers (deadline 7 April) and the call for workshop proposals (deadline 5 May).
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
I occasionally dabble in mathematical aspects of neuroscience. My brother does the real thing:
Also check out the accompanying "perspectives" piece.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Attention conservation notice: Over 6000 words of dubious value. Written as a contribution to the Valve event on Franco Moretti's Graphs, Maps, Trees. Contains advice to literary scholars from someone utterly unqualified to give it, hypothesis testing, rastergrams, many long quotations, and ruminations on materialism and rational history. If this is the sort of thing you're interested in, you'd really be better off reading Moretti yourself.
Cross-posted to the Valve, where there is a comments section. I can't promise to reply...
A few years ago, I wrote a review of Moretti's Atlas of the European Novel, in which I presumed to tell him how to go about his business. When he ran across it, his reaction was not (as mine would've been, had our situation been reversed) to tell me where to get off, but to invite me to a workshop he was organizing at Stanford on new interdisciplinary work on the novel — its motto, the quotation from Brecht about "questions that appear to us completely unsolved", is recycled for this book — where I had a great time. Reading these essays as they came out in New Left Review, I enjoyed them greatly, and recall thinking that Moretti could hardly have done a better job of appealing to my prejudices if he'd tried. (Said prejudices are those of someone almost equally fond of The Extended Phenotype and Main Currents of Marxism.)
With this kind of background, it comes as no surprise, I trust, that I really like this book, and finding objecting to what he actually proposes here highly wrong-headed. In what follows, I want to say a bit about "graphs" and a bit about "trees", and explain why this sounds so promising to me. I am not going to say anything about "maps", because I don't think I have anything to add to that discussion, but I will, for the sake of getting an M in there, end with some remarks on "materialism". At no point can I pretend to be competent to evaluate the originality of Moretti's work within literary scholarship, to say how much of a departure, say, the trees really are. In a feeble attempt to pretend that my price is higher than a weekend in California and a review copy, I will make some criticisms, most about tedious extra stuff I wish Moretti had also done. I'd like to think that what I say will also have some value for those who don't share my rather haphazard intellectual trajectory, but my experience with trying to communicate across disciplines means I'll get a warm glow if I'm even comprehensible, never mind persuasive. I am accordingly very grateful to the Valve, and especially to Jonathan Goodwin, for letting someone with my credentials (viz., none) participate in this event.
Let's be a little more precise about what we'd mean by "chance" and "coincidence" here. One natural possibility is that new genres appear at a constant rate over time, utterly independently of one another. Every year, then, there would be a constant probability of a new genre forming, but whether it did or not would have no bearing on whether the next year saw a new genre. This is our null model — the one which says what things should look like if we're just fooling ourselves, and there are no clusters. To get slightly technical, the distribution of intervals between genre-arrivals should have what's called a geometric distribution. Assuming, for the sake of argument, that that's true, we can use the average time between genre-appearances (3.44 years) to estimate the most likely value for the probability of a new genre appearing in any given year (about 29%).
Once we assume that the inter-arrival distribution is geoemtric and find the parameter, we can simulate from it, and get examples of what Moretti's graph would look like, if only chance were at play.
|
| The top line shows the appearance dates of Moretti's 44 genres; the next two lines give the results of simulating from a model of uniform random appearance, with the same mean time between genres as the actual history. |
Is there more clustering in reality than in the results of the null model? I couldn't say, by eye, but I don't have to. I can calculate the probability of generating Moretti's history from the null model: it's somewhat less than 1 in 10^45. This in itself isn't decisive, since any particular history becomes less and less probable as one considers longer and longer intervals of time (cue Stoppard), so we need to know what fraction of all histories of that length are at least that unlikely. I could work this out exactly, if I were willing to do some actual math, but I'm lazy, so I just had the computer simulate a million histories and evaluated all of their likelihoods. If the null model were actually true, we'd see histories like Moretti's only about 0.4% percent of the time. [1] So this is actually pretty good evidence that the null model is not true, and Moretti's history does show the kind of clustering he thinks it does.
Of course, this only underlines the question of why Moretti's data is clustered. I can think of a couple of deflating explanations (maybe the clusters match the periods more intensely scrutinized by historians; maybe they tend to adjust when they report genres appearing towards certain focal dates). Or it could be due to some sort of exogeneous influence, from war, politics, economic shifts, etc. (I did not try removing the obviously-topical genres, like Chartist novels, and repeating the analysis.) Or it could be due to some sort of endogenous mechanism within the system of literary production and consumption — generational turn-over of authors, of readers, of editors and publishers (suggested by my friend Bill Tozier). Or: maybe there's some space of things-people-like-in-novels, which the popular genres at any one time partition up in various ways; if one genre dies out or another appears, this might destabilize all the others as well. I don't think Moretti's time series, by itself, is enough to begin to let us decide among these mechanisms (some of which are compatible), but I do think it lets us see that some mechanism is called for.
Here is my first reproach: Moretti should have been the one to do this analysis, not me. If testing hypotheses is too banausic and mechanical for the pages of New Left Review, then it should either be in another article, or in the book. Moretti is a shrewd man, and in this case his intuitive analysis of the data was right, but there is no reason to rely on intuition alone for something like this. And, if one is going to go to the trouble to collect quantitative data, one ought to use it quantitatively. Mathematical abstraction (quantitative or otherwise) is not valuable for its own sake, but for the inferences it lets us make, when the proper tools are applied. In this case, those tools are pretty easy to bring to bear. They should be.
Here is Moretti at the end of "Graphs":
For most literary historians ... there is a categorical difference between 'the novel' and the various 'novelistic (sub)genres': the novel is, so to speak, the substance of the form, and deserves a full general theory; subgenres are more like accidents, and their study, however interesting, remains local in character, without real theoretical consequences. The forty-four genres of figure 9, however, suggest a different historical picture, where the novel does not develop as a single entity—where is 'the' novel, there?—but by periodically generating a whole set of genres, and then another, and another... Both synchronically and diachronically, in other words, the novel is the system of its genres: the whole diagram, not one privileged part of it. Some genres are morphologically more significant, of course, or more popular, or both—and we must account for this: but not by pretending that they are the only ones that exist. And instead, all great theories of the novel have precisely reduced the novel to one basic form only (realism, the dialogic, romance, meta-novels...); and if the reduction has given them their elegance and power, it has also erased nine tenths of literary history. Too much.
On the one hand, this seems to me to be obviously correct. On the other hand, I wonder very much why Moretti stops here. If we look within any one of those forty-four genres, I think we have every reason to suppose that we'd find it composed, in its turn, of sub-genres, and so on, and ultimately of a shift succession of individual texts. "The" Bildungsroman (to pick one of the forty-four, not entirely at random) is a short-hand way of referring to the most common and enduring features of a historically-changing and always-various population of books, just as "the" bottle-nosed dolphin is an abbreviation for the leading tendencies of a certain population of organisms. What Moretti hints at, in the paragraph I quoted, is that "the" novel is itself a population, either of genres, or of texts structured into genres. But he doesn't say outright what seems very plain to me, and so I'd like to know why, and specifically whether he thinks it's actually wrong, or unhelpful.
The assumptions of population thinking are diametrically opposed to those of the typologist. The populationist stresses the uniqueness of everything in the organic world. What is true for the human species—that no two individuals are alike—is equally true for all other species of animals and plants. Indeed, even the same individual changes continuously throughout its lifetime and when placed into different environments. All organisms and organic phenomena are composed of unique features and can be described collectively only in statistical terms. Individuals, or any kind of organic entities, form populations of which we can determine the arithmetic mean and the statistics of variation. Averages are merely statistical abstractions, only the individuals of which the populations are composed have reality. The ultimate conclusions of the population thinker and of the typologist are precisely the opposite. For the typologist, the type (eidos) is real and the variation an illusion, while for the populationist the type (average) is an abstraction and only the variation is real. [2]
This makes salient the question of how we mark off different populations as distinct. The usual biological criterion is through common descent, and the possibility of inter-breeding —- Mayr's "biological species concept". (There is a vast controversial literature on the details.) Ruth Garrett Millikan has a closely related notion of "reproductively-established families", which doesn't lean so heavily on the details of biology, and which would seem to fit the case of genres of novels. One could also define classes of texts purely morphologically, which might include many unrelated lineages (just as one might consider all streamlined marine predators which live in the water all the time, a class including dolphins, killer whales, sharks, tuna, ichthyosaurs, etc.). Just as such organic forms have appeared in several lineages, morphologically-defined categories could appear in multiple places and periods, the way novels arose, apparently quite independently, in both the Hellenistic world and in China (and elsewhere, for all I know). Historical populations, however, are unique.
One could ... take evolutionary bibliography as the prototypical evolutionary science and think of biology in terms of bibliographic analogies... [3]
In his comments on the links among capitalism, Protestantism, and Catholicism Marx set a disastrous precedent for many later writers who have attempted to find "structural homologies" or "isomorphisms" (two fancy terms for "similarities") between economic structures and mental products. Because virtually any two entities can be said to resemble each other in some respect, this practice has no constraints other than the inventiveness of and ingenuity of the writer: There are no reality constraints and no reality control.
Marx suggests two inconsistent lines of argument. One is that there is a strong connection between mercantilism and Protestantism, the other that there is an elective affinity between mercantilism and Catholicism. He was confused, apparently, by the fact that money has two distinct features that point to different religious modes. On the one hand, money (gold and silver), unlike credit, can be hoarded. Hoarding easily turns into an obsession, which is related to the fanatical self-denying practices of extreme Protestantism. On the other hand, money can be seen as the "incarnation" or "transubstantiation" of real wealth. In that sense the money fetishism associated with mercantilism is related to the specifically Catholic practice of investing relics and the like with supernatural significance. Both arguments are asserted several times by Marx, each serving to show up the essential arbitrariness of the other. Later attempts to explain the theology of Port Royal, the philosophy of Descartes, or the physics of Newton in terms of similarities with the underlying economic structure are equally arbitrary. Like the analogies between societies and organisms that flourished around the turn of the century, they belong to the cabinet of horrors of scientific thought. Their common ancestor is the theory of "signs" that flourished in the century prior to the scientific revolution inaugurated by Galileo — the idea that there are natural, noncausal correspondences between different parts of the universe. What Keith Thomas refers to as the "short-lived union of science and magic" maintained a subterranean existence of which the doctrine of ideology, in one of its versions, has been one manifestation. [4]
Even if we shutter and lock the Cabinet of Horrors, and go to look for
explanations of trends in such cultural products as novels (which is, after
all, what Moretti wants), I'm afraid we will find most of them in the capacious
Closet of Mildly Appalling Objects. There is no shortage of attempts to give
such changes meaning as signs of something else, some aspect of the social or
economic structure, of the way we live now (or the way they lived then), but
very, very few of them are convincing. In his great book on changing
fashions, A
Matter of Taste, the sociologist Stanley Lieberson looks at some of
the reasons why these attempts at ad hoc explanation are so often bad.
(He puts things more politely; I paraphrase.) First, the facts are often just
screwy, both about the developments to be explained: non-existent trends,
non-existent causes, weirdly mis-characterized trends, trends being explained
by events which happened long after the former began, etc. (In fairness, such
"scholarly
misconstruction of reality" is a lot more common than we academics like to
think.) Second, the mechanism connecting the explanatia to the
explananda is left totally obscure. Third, no attempt is made to test
the explanation, by checking that it can account for the magnitude of the
observed change, by ruling out alternative explanations, or by much of anything
else. The result is a steady stream of claims about how culture works which
are advanced with what is, under the circumstances, an astonishing degree of
assurance. Lieberson's
book provides many fine examples of such cavalier just-so story-telling for
names, the decline of hats, etc. [5]
Checking hypotheses about causation, and still more about adaptation, is really hard with just one case, arguably hopeless. What you need is the ability to reliably detect departures from the hypothesis, if they are actually present — "power", in the statisticians' jargon. It is hard to get much power when n=1. If you want to claim that certain aspects of 19th century British novels were the way they were because those features fitted with ideologies of British imperialism — a fairly strong hypothesis about adaptation — I don't see how you can do it just by interpreting Mansfield Park, no matter how subtle and sophisticated your reading. On the other hand, if you look at lots of contemporary novels, and the ones which (say) depict Great Britain's relations with its colonies in the same way as Mansfield Park does are systematically more successful, on average, than those which depict it differently, well then I don't see how that couldn't be good news for your idea, though even that would really only be the beginning of backing it up.
Biologists have given a lot of thought to checking hypotheses about adaptation, and developed many means of doing so. Mutatis mutandis, many of these means could also be applied to literature, or other aspects of culture. Eric Rabkin, Carl Simon and their collaborators have started doing just this with their Genre Evolution Project, looking at short stories from 20th century American science fiction, and no doubt there are others doing this kind of thing too.
One way of checking adaptive hypotheses, especially relevant here, is the "comparative method", or rather methods, which work much, much better when combined with good phylogenies. I think a literary historian who wants to study the evolution of genres and devices would be very well advised to look at the comparative methods biologists employ to study the evolution of qualitative characteristics of organisms. (The major issue would be that literary phylogenies will not be trees but more complicated lattices. But this is analogous to the effects of lateral gene transfer, common among bacteria, and so I'd suspect not only solvable but solved, someplace in the literature. Whether inheritance is by means of discrete-valued, particulate factors, i.e., genes, is not a crucial issue for such methods.) What I really want to see from Moretti (or someone) is a study along these lines of clues in the detective story; I'd be even more interested in one of free indirect discourse.
A crucial aspect of testing hypothesis about adaptation is a contrast with the outcome of a well-crafted neutral model — a way of saying what to expect if no adaptation were present, or not that adaptation anyway. These often have surprising consequences; for instance, neutral genetic drift will tend to fix some version of a gene in a given population, even if it confers no fitness advantage. (This is described in any book on population genetics.) So I wonder about things like whether we should expect, under a reasonable neutral model, that some formal device should become universal within a genre? If so, did clues take over detective stories any faster than neutrality would predict? (It's hard to imagine a successful genre where every story relies on confessions found by accident, but whether that's intrinsically weirder than actually existing detective stories, I can't say.)
The foregoing shouldn't be taken to mean that comparative literature should slavishly imitate comparative biology. There are people who have thought about the application of evolutionary ideas to social and cultural change in ways which are much more sophisticated about psychology, social organization and human interaction than (most) advocates of memetics; I am thinking particularly of David Hull, W. G. Runciman, Dan Sperber, Stephen Toulmin's great The Collective Use and Evolution of Concepts, and even the fragmentary MS. of Adam Westoby. As the economist Richard Nelson writes, we should expect our ideas of general evolution to change as we learn more about cultural evolution. We should also expect to have to develop different methods of data analysis. But, as always, we start with what we already know how to do.
I share Moretti's hope for a "materialist sociology of literary form"; Hell, I'd like a materialist sociology of culture generally. But I suspect it won't be able to do everything he wants it to.
When Moretti quotes D'Arcy Thompson on how the form of an object is a diagram of the forces which produced it, I'm happy to go along, and even happy to agree that this gives us some ability to work backwards, from form to force. But this sort of inverse problem generally doesn't have a unique solution, especially if some of the forces were transient and highly contingent... Less metaphorically, something Lieberson argues very convincingly is that we often have to distinguish between the social forces causing there to be a change in some taste, and those which shape the content of the new taste. Often the latter mechanisms are more or less internal to the bit of culture in question, like ratcheting. Or: culture doesn't have to express or reflect the social order. I suspect Moretti would be disappointed if this were the case for, say, genres of novels. Well, so would I. But this needs to be checked. One way would be to try to develop good neutral models, and see whether, and where, they break down
Dan Sperber has a great essay, in his Explaining Culture, on "how to be a genuine materialist in anthropology", where he complains about treating Capital, the World-System, cultural symbol-systems, mentalities, etc. as reified causal forces, if not self-interested foresightful agents, forgetting that human history, society and culture are actually "real individuals, their activity and the conditions under which they live". It seems, at least to this interested outsider, that the study of literature in society suffers from this, too. And I think what Sperber advocates there should go here, too: give actual causal accounts of how macroscopic patterns emerge from the interaction of many material bodies (notably, people and books), of the sort we know to exist, endowed with the kinds of abilities we know them to have.
This commitment may sound harmless, because contentless, but it does actually have implications. It means that you have to do a lot of work to justify functionalist explanations (though it's not impossible). It should make you very dubious about ideal types. It should make you more interested in exploring variation, and not dismissing it. It should make you very dubious about "practices" and other shared mental objects, at least as ordinarily conceived. And it suggests a lot of productive directions, investigating communication, cognition, and the collective patterns they produce.
In Graphs, Maps, Trees, as in his Atlas, Moretti is basically looking at the communication end of things. He doesn't say much about cognition, or individual thought more generally. Elsewhere (see e.g. Signs Taken for Wonders) he has dabbled in psychoanalysis, but I hope that's past. A materialist theory of literary form will ultimately have to concern itself with the organic processes of reading and composition, but the way to do this is through empirical study of readers and writers, not more interpretation of texts, or armchair ruminations (whether those are on the primal scene, the environment of evolutionary adaptation, or conceptual blending). Of course literary scholars have been making stabs in this direction at least since Richards's Practical Criticism, but with the advent of cognitive psychology this can be done in a much more systematic way, combining modeling of cognition with experimental tests of the models. [6] Again, many people (e.g., Jerry Hobbs, Herbert Simon) have been proposing this for some little while, but it's only recently, with works like Bortolussi and Dixon's Psychonarratology, that people have begun to actually do it, taking the predictions of various theories of narrative, which say that changing stories in certain ways should affect readers' responses, and seeing whether that's actually right. This, and not desk-bound speculation about analogies, seems to me the proper way to start on a cognitive psychology of literature. It is obviously complementary to what Moretti wants to do, and (this is the sweet part) the two enquiries can be pursued in parallel; neither has to wait for the other.
One thing Moretti does not do, anywhere, is construct models linking interacting individual behavior to aggregate patterns. Economists and sociologists already make such models, and anthropologists are starting to do so. It may be premature here, but ultimately it will be vital. If different social groups have different beliefs, is that because those beliefs express their relations to the mode of production, or is it because they tend to talk more with in the group than across group boundaries? Adaptationist theories of culture tend to go for the first choice, but we don't really know whether the latter could account for the specific patterns of cultural difference and change that we see.
What I said above about not mindlessly imitating biology deserves some amplification.
Evolution ought to have a bad name in the study of literary history. Reading Rene Wellek's "The Concept of Evolution in Literary History" (or his article for the Dictionary of the History of Ideas) is actually quite depressing. (It brings to mind Kurt Vonnegut's line "they deserved to fail, because they were all so stupid".) The many post-Darwinian ventures in this direction went, essentially, nowhere, at least as far as understanding literature better goes. It surely didn't help that their understandings of biological evolution were often very bad, generally some kind of Spencerian or even Lamarckian belief in tendencies of progressive development — perhaps inspiring, but hopelessly un-explanatory. (This has vitiated far too much evolutionary theorizing about social processes; cf. Toulmin's chapter 5.) As for the more recent wave, since the 1980s, the people who seem to think that literature exists because humanity craves dramatizations of Daly and Wilson's Sex, Evolution and Behavior drive me up the wall. (Their idea makes no sense even if you are very sympathetic to evolutionary psychology, which I am.)
Which said, this is not at all what Moretti is proposing, and I don't see the harm in trying to make this all fit together as another instance of a general pattern, alongside biological evolution, because they have similar causally-relevant features, and so similar mechanisms are at work. Many people have pointed out, in some detail, that explaining biological processes through the joint action of variation and selective transmission in populations is one instance of a general pattern of historical explanation; Toulmin is particularly clear on this [7]. There is a demography of businesses, of interest groups, even of medieval manuscripts of classical works, and so why not one of literary texts? Inheriting discrete, particulate hereditary factors from a small, fixed number of immediate ancestors is not the sine qua non of this form of historical explanation, though the details of the process of inheritance will very strongly affect the character of the resulting dynamics. It might be that theories of literary change cast in this form are too complicated to be useful, or that we just don't know enough yet to find the useful ways to formulate them. But it wouldn't hurt to seriously try, and we'd learn a lot, no matter the eventual outcome.
One way to take the bit from Braudel about "a more rational history" that Moretti adopts as a motto is simply to hope that literary history will be a rational enterprise. There are various aspects to this — the accumulation of knowledge, a desire to give explanations, a realization that more than one explanation might be possible and a desire to check which one is right, and so on. To do all this, it's important to develop, use and refine reliable methods of inquiry — ones which are unlikely to lead you into error, and where errors are apt to be self-correcting. You want to be able to persuade others, and you want to know that you're not just persuading yourself. As a statistician, my job is to help with that bit, so it looms large for me. I think this is more or less what Moretti has in mind when he talks (elsewhere) about wanting "falsifiable" literary history — for ideas which have enough content that they can not only be communicated from one person to another (without tripping Liberman's detector), but checked. Which said, I wish that here, as in his Atlas, Moretti had done a more systematic job of checking his conclusions. Would it be unfair to suggest that, while he sees the need for data analysis, it will be left to a successor generation to put it into routine practice?
If you want to say that asking literary history to be communicable, testable and reliable is asking it to be scientific and that's icky, well, it's a free country (at least for now). The more I think about what makes something a science, the less that seems like an important question. But whether something is a rational enterprise of inquiry matters. I'm sure it's possible to object to wanting history to be more rational in this sense, but I find that thought so alien and pointless I won't even try to engage it.
Another take on "rational history" is that the vast mass of details in small-scale history are essentially random, or, more exactly, the connections among them are as convoluted and involved as the details themselves. (This is one way to define randomness, mathematically.) But looking at larger scales, the randomness averages out, leaving regularities which are simpler and more nearly comprehensible by finite minds, and more reliable. As a statistical physicist and a statistican, I am the last to disagree: "In fact, all epistemological value of the theory of probability is based on this: that large-scale random phenomena in their collective action create strict, nonrandom regularity." [8] The small-scale details of literature and of human life have an intrinsic interest and value that is missing from the small-scale detail of molecular chaos, so there is certainly all the room in the world for what Moretti would like to do and close reading, and even essayistic appreciation. (But there is not, I am afraid, room enough in the world for Harold Bloom.) Whether there is room in an academy organized around the production of peer-reviewed research findings for all of them, is fortunately not a question I need to have an opinion on.
Finally, you might be tempted to go from the last sense of "rational" to supposing that large-scale history must be the working-out of some scheme which is "rational" in that it's really deterministic, or even teleological. This would be a mistake. It is not at all hard to give examples of stochastic processes which combine random evolution and feedback, which converge on very nice large-scale regularities, but which regularity they converge on is completely random and indeterminate. [9] Brian Arthur, among others, argues that processes like this are important in the evolution of technology. Is literature like that? I have no idea. But I don't see any reason it can't be, and this needs to be borne in mind.
Let me close by quoting the same paragraph twice, once from the version in NLR, and then again from the closing pages of the book. In both cases, he is enumerating themes which stretch across his chapters.
First, a total indifference to the philosophizing that goes by the name of 'Theory' in literature departments. It is precisely in the name of theoretical knowledge that 'Theory' should be forgotten, and replaced with the extraordinary array of conceptual constructions, —theories, plural, and with a lower case 't'—developed by the natural and by the social scences. 'Theories are nets', wrote Novalis, 'and only he who casts will catch'. Theories are nets, and we should learn to evaluate them for the empirical data they allow us to process and understand: for how they concretely change the way we work, rather than as ends in themselves. Theories are nets; and there are so many interesting creatures that await to be caught, if only we try.
First of all, a somewhat pragmatic view of theoretical knowledge. 'Theories are nets', wrote Novalis, 'and only he who casts will catch'. Yes, theories are nets, and we should evaluate them, not as ends in themselves, but for how they concretely change the way we work: for how they allow us to enlarge the literary field, and re-design it in a better way, replacing the old, useless distinctions (high and low; canon and archive; this or that national literature...) with new temporal, spatial and morphological distinctions.Whether this pragmatic message is what Novalis meant, I have no idea; I only know the line because Popper used it as the epigraph for The Logic of Scientific Discovery. But that's what Popper meant by it, and I think it's right, and I look forward to seeing the coelacanths and tube-worms and giant squid which will be brought up from the deeps in years to come.
[1]: More on testing the null model of genre appearance, for those into that kind of thing: Really, of course, the most suitable null model for random appearance would be a continuous-time Poisson process. Since the data are discretized by years, however, I'm faking it by using a geometric distribution of inter-arrival intervals. (I also tried simulating from a Poisson process and then discretizing the result; the results weren't much different.) The only parameter of such a process is the mean inter-arrival time, or equivalently the "intensity", the probability per year of producing a new genre. Simple maximum likelihood estimation gives this as 0.2905405, which implies a log-likelihood for the original data of -103.9498. To evaluate the significance, I generated 1,000,000 sample paths, of the same length as Moretti's, and then for each one re-estimated the intensity and used that to evaluate the log-likelihood. (This sort of "bootstrapping" should account for the fact that I fit that parameter to the data in the first place. It wouldn't be appropriate if, say, Moretti had advanced the conjecture that the mean inter-arrival time should be 10 years on independent grounds.) Of the 1,000,000 sample paths, only 3,802 had log-likelihoods as small or smaller than the original data. That is to say, if the null model were correct, we'd see results like this only about 0.38 percent of the time. So we can certainly reject the null model at the conventional 5 percent significance level, or even the 1 percent level, and in fact this is a considerably more severe test than that.
[2]: Ernst Mayr, What Evolution Is, p. 84, quoting a 1959 paper of his own.
[3]: This is from Sidney Winter's article on "Natural Selection and Evolution" in the New Palgrave Dictionary of Economics (1987), where he works out the analogy in some detail.
[4]: An Introduction to Karl Marx, pp. 183--184.
[5]: "Adventures of a Man of Science", Elif Batuman's wonderfully-titled review of Graphs, Maps, Trees in n+1 magazine, is a quite nice essay, but it also provides what looks like a typical example of the kind of mere plausibility I have in mind:
Perhaps the Holmes stories are not half-baked versions of the "correct" mystery story, but a different kind of mystery story, wherein the nondecodability of clues is not a bug, but a feature. Conan Doyle was writing during the conquest of England by industry and rationalism; perhaps his readers wanted stories about the kinds of magic that are possible within the constraints of science. Holmes categorically rejects the supernatural, not in order to show that the new, rational rules preclude magic, but in order to show that you can still have magic even if you play by the rules. Decodable clues came a "generation" later, with Agatha Christie and the first World War, and became more rigorous after the second—by which time readers wanted to be reminded that the world was still rational. [pp. 146--147]First of all, it seems bizarre to say that Britain was being conquered by "industry and rationalism" in the 1890s, long after the scientific revolution, the Enlightenment, the Industrial Revolution and all its social consequences, utilitarianism, etc. (Indeed, Mr. Lecky might want to have a few words...) Second, Batuman gives us no reason to think that contemporary readers saw what Holmes did as (pardon the phrase) magic within the bounds of reason alone. Third, even if she were right about the social situation and the cultural product, the hypothesized causal connection is really just another arbitrary analogy, of the sort Elster complained about. Suppose Conan Doyle had been better about using decodable clues than Christie. Would it not then sound just as plausible to say this expresses the triumph of rationalism, followed by a post-war weakening? As it is, Batuman's account seems to appeal, implicitly, to a desire to hang on to older ways of thinking. Either the whole reading public of Britain in the 1890s is being treated, in a grossly anthropomorphic fashion, as a single person, with such a desire, or she is making a quite specific prediction about which readers Conan Doyle appealed to, one which does not seem especially plausible, though it might be tested. (It is utterly unclear whose purposes or needs are invokes by the in-order-to's — Conan Doyle's? his original readers'? society's? — but I fear the worst.) Finally, no attempt is made to check that this is the source of the appeal, nor that the later strict decodability of clues really was caused by the World Wars, for the reasons given. I don't know enough to say that this suggestion is false, or that checking it would be impossible. I don't even want to suggest that a book review in a little magazine would be a good place to do such tests. But it doesn't seem to worry Batuman that there is no support for this idea (yet). — Let me repeat that I like the essay.
[6]: Incidentally, thinking that cognition is computational, and even that its computational architecture is strongly constrained by organically-evolved developmental processes, in no way commits one to denying that thought is also profoundly cultural and historical. Sperber is very good on this, but also see Frawley's Vygotsky and Cognitive Science, or the papers collected in The Elements of Reason.
[7]: Of course it isn't the only pattern of successful historical explanation. Even within the natural sciences, geology and astronomy provide very different ones.
[8]: Gnedenko and Kolmogorov, Limit Distributions for Sums of Independent Random Variables, p. 1.
[9]: More exactly, there are stochastic processes ("urn schemes") where the relative frequencies of different outcomes are guaranteed to converge, with 100% probability, but the ratio at which they converge is itself a random variable, not determined by the initial set-up in any way. The models of lock-in developed by Brian Arthur and his collaborators in the 1980s are urn models, but actually less indeterministic than the classical ones.
Manual trackback: Reprieved; Crooked Timber; Pedantry; Three Quarks Daily; Idiocentrism; An Unenviable Situation ("deeply offensive").
Update, 7 February: Seth Edenbaum has more on why he dislikes this post so much &emdash; and why he dislikes me (or at least my online persona; I don't believe we've ever met). I think he's wrong, both about this and about me, but it's only right to point to criticisms.
The Commonwealth of Letters; Writing for Antiquity; Biology; Enigmas of Chance
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
I seem to have locked myself into writing five pages of math textbook three times a week between now and the end of the semester. If I am to pull this off while still doing research and have something like a life, I can't spend any time posting here, or even really reading weblogs. So I'm putting this on hold until after May 5th, and deleting my RSS reader for the duration. (I will keep updating the book log in the side-bar, though.) Regular service will resume once school's out, and in the meanwhile you can get along without me for three months.
(There, that should be sufficiently emphatic to discourage backsliding.)
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
The last time I did one of these things, my weblog broke, resulting in a several-month hiatus. But, since you asked, and since this lets me procrastinate working on my lecture notes, my research, my house and my contribution to the Moretti event at the Valve all at once...
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
From Joan Didion's Democracy. Applications to current events are entirely your affair.
At the time I thought that she had decided to talk to me only because Jack Lovett's name was just beginning to leak out of the various investigations into arms and currency and technology dealings on the part of certain former or perhaps even current overt and covert agents of the United States government. There had even been hints about narcotics dealings, which, although they made good copy and were played large in the early coverage (I recall the phrase "Golden Triangle" in many headlines, and a photograph of two blurred figures leaving a house on Victoria Peak, one identified as a "sometimes Lovett business associate" and the other as a "known Hong Kong Triad opium lord"), remained just that, hints, rumors, that would never be substantiated, but the other allegations were solid enough, and not actually surprising to anyone who had bothered to think about what Jack Lovett was doing in that part of the world.There had been the affiliations with interlocking transport and air courier companies devoid of real assets. There had been the directorship of the bank in Vila that put the peculiarities of condominium government to such creative use. There had been all the special assignments and the special consultancies and the special relationships in a fluid world where the collection of information was indistinguishable from the use of information and where national and private interests (the interests of state and non-state actors, Jack Lovett would have said) did not collide but merged into a single pool of exchanged favors.
In order to understand what Jack Lovett did it was necessary only to understand how natural it was for him to do it, how at once entirely absorbing and supremely easy. There had always been that talent for putting the right people together, the right man at the Department of Defense, say, with the right man at Livermore or Los Alamos or Brookhaven, or, a more specific example with a more immediately calculable payout, the Director of Base Development for CINCPAC/MACV with Dwight Christian.
There had always been something else as well.
There had been that emotional solitude, a detachment that extended to questions of national or political loyalty.
It would be inaccurate to call Jack Lovett disloyal, although I suppose some people did at the time.
It would be accurate only to say that he regarded the country on whose passport he traveled as an abstraction, a state actor, one of several to be factored into any given play.
In other words.
What Jack Lovett did was never black or white, and in the long run may even have been (since the principal gain to him was another abstraction, the pyramiding of further information) devoid of ethical content altogether, but since shades of grey tended not to reproduce in the newspapers the story was not looking good on a breaking basis. That Jack Lovett had reportedly made some elusive deals with the failed third force (or fourth force, or fifth force, this was a story on which the bottom kept dropping out) in Phnom Penh in those days after the embassy closed there did not look good. That the London dealer who was selling American arms abandoned in South Vietnam had received delivery from one of Jack Lovett's cargo services did not look good....
This is from the end of chapter 2 in part 4 --- pp. 207--209 of the Pocket Books edition (New York, 1985).
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
My father-in-law passed away last month. Here is his obituary.
I wasn't lucky enough to know Jim before his illness; even so, he was one of the most vital people I've ever known, eager to embrace all life's possibilities. He was a man of considerable talents in many directions, of curiosity and accomplishment. A holder of strong opinions, he respected them in others, even when he disagreed. His love for those who mattered to him was sincere and profound. If I had to bear the same burden, I hope I would have it in me to do so with as much fortitude.
That burden has, at last, been lifted from him. I miss him. I'm going to keep missing him.

Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
... is a fascinating area of inquiry, containing a small quantity of outstanding work, surrounded by a vast expanse of rubbish. This is your chance to help improve the ratio, if you can fit your contribution into six pages by February 28th.
Workshop on the Evolution of Complexity
June 3rd, 2006, Bloomington, IN, USA
as a part of the
Tenth International Conference on the Simulation and Synthesis of Living Systems: ALife XThe evolution of complexity is a central theme in Biology. Yet it is not without any ambiguity. Complexity has been used to refer to different things. For instance, complexification has been interpreted as a process of diversification between evolving units or as a scaling process that is related to the idea of transitions between different levels of complexity. Other meanings of complexity have been introduced, both inside and outside the realm of Biology. What concerns most researchers is to get insight into the mechanisms that produce their notion of complexity.
The focus of this workshop will be on biological interpretations of complexity and the driving mechanisms: primarily we want the focus to be on evolutionary and related dynamics as mechanisms for producing complexity. Furthermore, we want to bring together historical and novel research in this context.
Questions to be addressed at the workshop include:
- What are the environmental constraints of complexity growth in living systems?
- What is the origin and role of developmental mechanisms in evolution?
- Are the principles of natural selection, as they are currently understood, sufficient to explain the evolution of complexity?
- What are the limits at different levels to the evolution of complexity, and which conditions could reduce evolved complexity?
- <Which models are | What language is> more appropriate to <understand | speak about> the evolution of complexity in living systems?
- How could complexity growth be measured or operationalised in natural and artificial systems?
- How can data from nature be brought to bear on the study of this issue?
- What are the main hypotheses about complexity growth that can actually be tested?
- Is it possible to <direct|manipulate> the evolution of complexity, or which benefits would bring its understanding?
See the full call for papers for more information about submission, publication, peer-review, etc. (I am on the program committee, which means I will be one of the reviewers.)
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Larry "All of Statistics/All of Nonparametric Statistics" Wasserman has just started a weblog, The Academic Curmudgeon. By my count, this brings the number of faculty blogs in the department to three (and the proportion to about 0.09); none of our students seem to have any. Larry's second post is about peer review.
Here is a summary of common reasons for rejecting papers:Good Reasons For Rejecting a Paper:
- The results are incorrect (unfixable, critical errors).
- The results are not new.
Bad Reasons For Rejecting a Paper:
- The referee doesn't like the paper.
- The referee doesn't like the author's approach.
Tricky:
- The contribution of the paper is too small.
Numbers 3 and 4 are bad because they are based on the taste of the referee which is far too subjective. Number 5 is problematic. True, we don't necessarily want top journals publishing every small idea that occurs to someone. The problem is this: almost all research, including good research, is incremental. The idea that most papers in top journals are breakthough papers is fantasy. What is too incremental to publish is highly subjective.
I think the basic problem is that most referees have the wrong view of the purpose of publishing. Ideally, publishing is about disseminating knowledge. It should not be regarded as admittance to a high and mighty priesthood.
I am going to use Larry's post as an excuse to ruminate about peer review, partly because I spent yesterday whittling down the stack of manuscripts I'd agreed, in weak moments, to review, and partly because I think Larry might be unhappy with my refereeing. Maybe he gets to see a better class of manuscript than I do (I wouldn't be surprised), but it seems to me that his list of good reasons to reject a paper is seriously incomplete.
I am happy to reject papers on such grounds, because they seem to follow from my understanding of the point of peer review. This is, as Larry says, about the dissemination of knowledge, not initiation into a priesthood. But the people peer-reviewers serve are not the authors but the potential readers. Passing peer review ought to endorse a manuscript, not as correct, but as possibly worthy of attention: not obviously wrong, not disconnected from the field, not out to lunch, not a waste of the reader's time. (This is more modest, and more achievable, than actively picking out the good stuff; it's a type I/type II error issue.) One important wrinkle here is that, if you're already an expert in a given field, the extra value of having somebody else filter the stream of manuscripts in that field is small — part of your expertise is being able to make such judgments reliably and cheaply yourself. But if you need to use results or ideas from another field, then you either need to become an expert there, too, or you need experts there to tell you what's worth attention. Very few scientists never need ideas from other fields, which is to say that most of us will benefit from peer review. (Similarly for hiring and tenure, but let's not dwell on such unpleasant subjects.) For complete non-experts, i.e., the lay public who ultimately support us all, peer review is about the only way they have of telling possibly-legitimate scientists from the cranks and the frauds. (More exactly, because peer review only says "not obviously wrong": anyone who can't get over the peer review barrier is so weak as to not be worth bothering with.)
I rather doubt, however, that the current journal/peer-review system is the ideal way of doing this filtering. Journals can be too conservative. Journals can not be conservative enough, when the topic is fashionable. (Not that I have anything in mind.) Journals can get locked into a vicious cycle in which they become so bad that publication there constitutes an anti-endorsement, so that only really bad scientists publish there, and they in turn become recruited as referees. (Still, an anti-endorsement is not without its own value.) There is something perverse about refereeing for commercial publishers, since publishers charge scientists larcenous rates to bring them the results of free labor on the part of authors and referees.
Ultimately, I hope that we move away from the current system towards
something more like Paul
Ginsparg's ideas. He envisions a system of "tiers" of publication; the
lowest tiers, like the current arxiv.org, would
have nearly-open submission and dissemination, and be most valuable to experts.
Above them, operating more slowly, would be more selective tiers of peer
review, commentary, review papers, etc., which will be more valuable to less
expert readers, and won't try to filter the whole manuscript stream,
which is what peer review now does. Getting there from here will probably
involve lining the war-mongering parasites
at Elsevier
up against a wall a good deal of time and effort, but would lead
to something much more efficient and intellectually valuable.
And now, back to revising our manuscript to please its referees.
Update, 29 November 2005: Dave Feldman, propelled by the burning need to procrastinate which drives so many academics to blogging, suggests that the peer review system would be much improved by free socks. He's right.
Update, 7 December 2005: For once, I wish I had comments here. (I don't feel like committing myself to the endless struggle against spam.) "Cog", of The Abstract Factory, writes in (quoted with permission):
I believe your 3 and 4 are subspecies of "the contribution is too small" (5 on Larry's list). Your 1 and 2 are both subspecies of "the paper does not communicate effectively enough for the contribution to be evaluated", which is a new reason. Number 5 is also a new reason. So you've really come up with two distinct extra reasons.I would actually subsume Larry's reason 1, and your 1+2 under:
1'. The results are not clearly shown to be (probably) correct.
Because, of course, the burden of proof is on the authors to convince the reader that their result is not obviously wrong, not on the reviewer to show that the result is obviously wrong. To discharge this obligation requires both technical soundness and effective, precise communication.
Manual Trackback: An Ergodic Walk
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
NYU's graduate student teaching assistants are unionized, and have been for several years, but the National Labor Relations Board recently reversed itself and ruled that the university did not have to recognize the union or negotiate with it. The teaching assistants are now on strike, apparently with wide support from the faculty (so that the administration has been snooping through course websites to see who the faculty sympathizers are). To break the strike, NYU's president, John Sexton, is threatening to withhold the whole semester's stipend from any TAs who are not back to work by today, and that any TAs who strike next semester will lose their stipend for the whole year. I understand withholding wages during a strike, but this is simply vicious, and so far as I can work out would be illegal in any normal labor dispute. (Of course Sexton's position is that the TAs are not really employees, which is hogwash.) You can sign a petition against this travesty via Faculty Democracy at New York University. Leaving aside the claims of justice and elementary fairness, how many other chances will you have to agree with Andrew Ross and Alan Sokal?
(Surveying the treatment of our graduate student employees from the lofty perch of half a year on the faculty, it seems to me that CMU, at least in the statistics department, treats them pretty well, and much better than we had it at Madison when I was a TA there, and a member of AFT local #3220. But still, if they wanted to unionize, I'd be completely behind them, and I think it's idiotic and reprehensible for universities to refuse to even recognize and negotiate with graduate student unions. Unions can ask for stupid and/or selfish things, of course — which distinguishes them from any other organization how, exactly? — but the merits of particular proposals isn't the issue here; punishing people who attempt to organize to exert their rights is.)
Via John Burke (in e-mail) and Michael Bérubé; reporting and photos of the strike from Majikthise.
Update, 7 December 2005: I now see that Asad Raza, one of the strikers, has been filing dispatches (1, 2) at Three Quarks Daily.
Manual trackback: Crooked Timber
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
It's been a while since I did one of these, hasn't it? But sometimes they just hop onto the keyboard and start meowing in my face.
In this paper, Ryugo et al. are mostly concerned with structures at the synapses in certain nerve fibers, rejoicing in the name of "the endbulbs of Held", which they describe in fairly flowery language: "Endbulbs have a calyxlike appearance that is formed from the main axon as several gnarled branches that arborize repeatedly to enclose the postsynaptic cell in a nest of en passant swellings and terminal boutons." These are abnormal in congenitally deaf animals: they don't branch so much, they're enlarged, they've got a flat rather than an undulating profile, and they've got fewer of the vesicles containing neurotransmitters that make synapses work. Not surprisingly, these endbulbs don't seem to transmit signals very well. This is a problem, especially since the nerve pathways where they tend to be found are the ones which encode precise timing information about sounds, important alike for for predators fond of twilight and leaping in ambush, and for chattering East African Plains Apes ("The critical nature of temporal resolution in facilitating speech recognition is underscored by studies that show speech recognition based on temporal cues while spectral content is systematically degraded").
What they did was to take congenitally deaf cats and, as kittens, give them cochlear implants which restored their capacity to hear. The physical capacity was verified by recording the propagation of neural signals; also by the fact that "we could routinely 'call' implanted cats for a food reward." After several months, they examined the development of the end-bulbs of Held in these cats, compared to matched normal animals, and to congenitally deaf cats which received no implant. (Don't ask how.) The results, photographically, are pretty convincing: the endbulbs look a lot more like those of normal cats than deaf, non-implanted cats, and quantitative comparisons of e.g. size are also fairly persuasive.
It would be interesting to know how old cats can be before simply providing the cochlear implant isn't enough for these synapses to develop properly. They speculate (but don't really show) that the same effect takes place in people, and that this is why congenitally deaf children benefit more from implants the earlier they get them. If that's so, it would just reinforce the importance of making sure all the children who need them get such implants swiftly. It would also make it nice to know what (if anything) could be done in conjunction with such implants, to help gnarl-up children's endbulbs.
Notice, by the way, that, as I've had occasion to remark before, that the whole nature-nurture division is not actually useful to understanding what's going on in processes like the development of hearing in these cats. (Go on, calculate a heritability here and tell me it means something, I dare you.) But this is generally the case with cognition.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Deborah Mayo's great book on Error and the Growth of Experimental Knowledge was published in 1996. In it, she lays out a way of thinking about statistics, and about learning from experience more generally, which is at once principled, powerful and useful. It helps make sense of what we already do, and suggests new ways in which we can improve our practices. I read it in 1998, and it roused me from my dogmatic slumbers about statistical inference. It's hard to imagine another path to where I am now if I hadn't read it. All of which is to say that I was very pleased to find the following in my inbox this morning:
Check out the invited speakers and the call for papers.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
What mysterious civilization carved these designs in the high desert — designs which can only be seen from the air?
Did these so-called primitives unravel the mysteries of electricity by themselves?
For how many aeons have these forbidding mountains born witness to eldritch rites?
What disturbing pre-human legends are engraved on these tablets — products of an advanced metallurgy far beyond any native to the region?
Which unspeakable cult conceals its world-shattering secrets and shambling, amorphous blasphemies behind the walls of its armed compound nearby?
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Marcus Porcius Cato was famous for, among other things, ending all of his speeches in the Senate by saying "that is my opinion. It is further my opinion, that Carthage must be destroyed" (ceterum censeo, Carthago delenda est). Eventually, so the story goes, his fellow senators got so worn down that they launched the Third Punic War, which ended with Carthage destroyed, and the ground sowed with salt.
Senator McCain seems to be adopting a similar tactic, in a much higher cause than establishing hegemony over the western Mediterranean basin:
McCain Vows to Add Detainee-Abuse Provision to All Senate Bills: The U.S. Senate added language barring inhumane treatment of enemy combatants to legislation that sets military policy, the second major defense measure the chamber has amended with this provision.The amendment sponsored by Arizona Republican Senator John McCain passed by a voice vote. It was attached to the Senate's fiscal 2006 defense spending bill Oct. 6 by a vote of 90-9. That bill is being negotiated with members of the U.S. House, including Republicans whose support is in question.
McCain said his intent is to prevent abuses such as those at Abu Ghraib prison in Iraq. He vowed today that his measure would be "on every vehicle that goes through this body" until it's enacted into law. "It's not going away," he said on the Senate floor. "This issue is incredibly harmful to the United States of America and our image throughout the world."
Via Michael Froomkin; I agree with his comment that "I still think he'd be an awful President, but this is good stuff." Unfortunately, the link Froomkin gives to the Bloomberg news story he quotes is broken, so I can't see if anyone else is drawing the classical parallel.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
I used to re-read Something Wicked This Way Comes every Halloween. This year I'm without my copy of both that and A Night in the Lonesome October, so I might as well blog.
The Little Professor offers some Victorian terrors. Actual Victorian-era terrors: the resurrectionists of Ann Arbor.
Barcelona, city of jack-o-lanterns and bull-rings in the air.
Fafblog: the world's only source for haunted Fafblog. (Plus, the Medium Lobster explains the Plame affair in one sentence.)
Mad science: The Annals of Improbable Research re-runs its Halloween Research Review from 2000 (1, 2)
Zombies: Kids! Did you know you can use Rocky Mountain Spotted Fever to make zombies? By Wil McCarthy, who wrote two novels I liked and many others I've not gotten around to. (Via /dev/null, who reproduces a tribute to the Little Prince.) "And I say to any flesh-eating zombies who might be listening to the Factor this evening: Bill O'Reilly is looking out for you." (For more zombies, see Destroy All Bookmarks!.)
Vampires: Carmilla is free online, along with many other good stories by J. Sheridan LeFanu. (The Oxford collection of LeFanu's stories is nice, too.) I've always thought this was a much better-written book than Dracula. It's worth noting, though, that the traditional eastern European vampire was not a pale, skinny, strangely seductive aristocrat, but a fat, red-faced peasant whose carnal designs on the living are limited to their blood, such as drips from his (fangless) mouth, and in many ways corresponded pretty well to what peasants would find if they opened up Uncle Ivan's grave after a few weeks. The whole sexual aspect of vampirism — now, apparently, its main selling point — appears to have been invented by 19th century writers in western Europe (paging Dr. Praz, paging Dr. Mario Praz, to the locked stacks please). — It would be a shame to pass up this opportunity to plug, again, Suzy McKee Charnas's The Vampire Tapestry, unquestionably the most intelligent interpretation of the modern vampire.
Less definable horrors: Probably nobody now producing horror fiction is a better writer than Peter Straub. Here's the beginning of his latest, In the Night Room.
"I can't get that monster out of my mind": If you want to know why we're fascinated by stories of being preyed upon by monsters, Barbara Ehrenreich's observations on the effects of several million years of predation on the hominid psyche is a good place to start. (Here's chapter 1.)
All too definable horrors: Ultimately, as Bradbury's readers know, Halloween is about death; and we turn to magic because grief and loss are intolerable. For a wrenching reminder of just how intolerable, read Joan Didion's Year of Magical Thinking — here is a long, moving excerpt, and here is a review by John Leonard, in the New York Review of Each Others' Books. (How good a writer is Didion? Well, while she was going crazy with grief, she was able to write like this.)
Manual Trackback: The Mystery of the Haunted Vampire.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Back when Wolfram's monstrous tome came out, in the summer of 2002, I wrote
a long review, which I "left to the gnawing criticism of the mice", thinking
I'd get back to it when the cultural moment was safely past. Since I'm
not getting anywhere tonight showing that the error in my filtering scheme is a
non-negative supermartingale, and need to procrastinate some people
who saw the manuscript liked it, and
Wolfram keeps on keeping on, I
felt I might as well brush off the dust, make a few adjustments,
and put it online. Enjoy, if
you're into that kind of thing.
Manual trackback: Bill Tozier; Quantum Pontiff; Danny Yee; Bruce Sterling ("maybe reviewers shouldn't pick on isolated, wealthy math geniuses who have intensely private, highly bonkers-sounding, self-published cosmological schemes. I mean -- what if he comes out of his ivory basement and deliberately DISTURBS THE UNIVERSE? We could be looking at the pixelated rags and tatters of reality by Friday!"); Geekable; Omniorthogonal; Three Quarks Daily; Jonathan Goodwin/The Valve; Crooked Timber; Chrononautic Log; Nonplatonic; East of the Sun, West of the Moon; Paracelsus Rambles; Blog Khoa Hoc Máy Tính; Dubbings and Diversions
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Attention Conservation Notice: a 2100-word, semi-serious proposal for politically inflammatory social-scientific research, worked out by non-social scientists. Does not adequately explain all the technical concepts involved. Contains media criticism and blogging in-jokes.Note: This idea was developed with a close collaborator who prefers, however, not to be named. Using the first person singular just means that I'm the only one doing the actual writing here, not that I deserve the credit for whatever merits this may have. (The blame for its faults, however, is all mine.)
I haven't written anything about Hurricane Katrina and its aftermath, because nothing I could think to say seemed at all adequate in the face of the horrible human disaster and national shame. Others have said what I'd want to say better than I could have. But about the political fall-out, and the cronyism that preceded the catastrophe, and which continues to be relevant day in and day out — there I think I can suggest something.
It's by now abundantly established, at least to all sentient beings, that Michael Brown had no business running much of anything, never mind FEMA, and had the job not because people thought he was the most qualified possible person, or even a reasonably qualified person with the right political ties, but solely and exclusively because of his political ties. This naturally leads to two logically independent questions: (1) How many other people hold important government positions just because of who they know in the GOP apparatus and/or Bush family retinue? (2) Is this any worse than it was under any fairly recent previous president? It could be, after all, that the level and proportion of incompetence is no higher than it ever was, or even lower, but the Bush crew has had the bad luck of having disasters happen which exposed that incompetence; or perhaps the quality and morale of the civil service has been steadily eroded by many decades of serving under incompetent political hacks, and so can no longer adequately compensate for the folly of their masters. (Think of an unreleased episode of Yes, Minister where Sir Humphrey is merely counting the days until he can begin collecting his pension, and Bernard has split for a private-sector consulting gig.)
I've now read a couple of news stories which attempt to address these questions, and I am dissatisfied. Take, for instance, this one in the New York Times. It purports to be about this subject, but, while it reports some striking instances of cronyism and patronage, contains nothing like facts on which a reader could judge whether this problem is any worse than it's ever been, or even worse than it's been recently. It also contains the following sentence about half-way through: "People who have studied the workings of the federal government for years say this administration is no worse than President Bill Clinton's or any other recent ones in the qualifications of political appointees." This is followed by a number of quotes from such experts, none of which, carefully examined, actually say anything of the kind.
For another example, here's a story from Time, again going through the anecdotes, and quoting one of the same experts (Paul C. Light) to the opposite effect, saying that things are now much more "centralized" and politicized than before. So, from reading the reports filed by — one presumes — well-regarded journalists, not only can the concerned citizens learn absolutely nothing about whether the Bush administration is unusually incompetent and cronyist, they can't even learn what one presumptive expert (Prof. Light) thinks about the topic. Citizens might even wonder what "centralized" means here, and the news isn't going to help. The New Republic's list of the administration's fifteen worst hacks, while at once amusing, depressing and frightening, shouldn't actually convince anyone that "no administration has etched the principles of hackocracy into its governing philosophy as deeply as this one". It may be — it is! — unacceptable to treat the government of a free people this way, but it doesn't mean that this is anything new.
Let us not gnash our teeth in despair over the mainstream media, however: social network analysis can come to our rescue! What's wanted — but what the journalists don't provide — is a study where one builds the network of Presidential cronies, cronies' cronies, cronies' cronies' cronies, etc., and then asks questions such as:
Many people have asserted that networks of influence and social connection are important to how the modern GOP works — Henry Farrell reports that this is an important part of Jacob Hacker and Paul Pierson's Off Center, which I'm eager to read, and it's more or less explicit in Michael Lind's Up from Conservatism and John Judis's The Paradox of American Democracy — but nobody seems to have really studied this thoroughly. To do it right, you need to carefully define what you mean by "crony". Since, ultimately, the whole species forms a single human web, you want to only consider ties which are actually meaningful indicators of political alliance and, still more, of nepotism and cronyism. Also, you want to set out your criteria carefully and rigidly before collecting data, otherwise there'll be a lot of temptation to manipulate things as you go along, and the result will be closer to Lyndon LaRouche than to Randall Collins (or even Malcolm Gladwell). At the very least, I'd think you want to include the following kinds of ties:
Having fixed our criteria for which kind of relationships will count as links in the network, it'd then be necessary to build the network. A natural starting point would be the strategy sometimes called "snowball sampling": pick an initial target, say G. W. Bush, and identify everyone who counts as one of his cronies (by our criteria). Then go over each of his cronies, and see who their cronies are — Bush will be one of them, but presumably there will be others. Repeat this until either all the cronies are exhausted, or you're exhausted. Note that, if there really is just one network, then it doesn't matter whether you start with Bush, or Karl Rove, or Tom DeLay, or Jack Abramoff, or any of their other unindicted co-conspirators, except for the people who are so marginal to the network that you might reach them from one starting point but not another.
Once you have people in the network, we need to see whether they've been named to government positions (not necessarily confirmed, just named), and whether they met the legally-defined norms of competence for those positions. A simple scheme would be to code them 0 if they just met the qualifications, +1 if they were clearly more than qualified, and -1 if they had no discernible qualifications. This could be hard to do — some key positions don't seem to actually have any minimum qualifications at the moment — but something like this is necessary to answer questions like, "How much more likely are you to be named to a post if you're qualified than not, controlling for social position?" and "How much more likely are you to be named to a post as a function of social position, controlling for qualifications?" (For aficionados: I'm contemplating logistic regression coefficients here.) It's probably completely unrealistic to imagine having a matrix of qualifications scores for all people for all 3000-odd appointed posts, which would let us see whether favorably-situated cronies get named to posts they might be able to do, or just to any old thing, or what.
Now, to really do this right, we'd need to do it all over again, not just for the current administration, but for another one as a control — the Clinton administration, say, or Bush's father; Reagan or earlier is probably too far back. This seems to be the only way to answer questions like whether this administration is more centralized than its predecessors, or more likely to nominate incompetents. The crucial question, for us, is whether your odds of being nominated are more or less dependent on your distance from the center of the network under this administration than under previous ones.
Even without doubling our workload by doing a comparative study, however, simply seeing the network of cronies would let us answer some interesting questions. Who really are the most central members of the network? Are they people with formal positions of authority? Are they people you've ever even heard of? Or are they comparatively little-known fixers with huge address books, but no officially constituted authority? (Bruce Sterling, back during the Clinton troubles, compared our current mode of government to being ruled by some sort of literary movement, where often the most well-connected and subterraneanly influential people are not the most public figures.) Could we discern factions or communities, in the form of cohesive sub-networks? Is the president — the object of such veneration, verging on idolatry (no, that's not a joke) — actually at the head of his machine?
We could also compare the structure of the crony network to that of other well-studied networks of interest. Sageman looked at al-Qaeda, and while the comparison would be provocative, it's probably not really fair: al-Qaeda is very small, comparatively, and also very hard to study, so issues of missing data are much more serious. Perhaps more interesting would be a comparison with the network of people who sit on boards of directors of corporations, where two people are linked if they serve on the same board. This is a fairly sizeable network — some data sets contain over 7000 people — but one with very little formal structure. (Once you take into account the distribution of the number of boards people serve on, it looks almost perfectly random.) Economic sociologists have established that this network is a very important coordinating mechanism for big business, and, less adaptively for the corporations concerned, a mechanism for cronyism, patronage, and giving responsibility to incompetents. (Despite its coordinating role, the board network is not group which tells, e.g., the gas-station owners of America how much to charge for a gallon of regular unleaded, as apparently imagined by certain rabble-rousers who fear the market system because they don't understand it.) It would be interesting to see, then, whether the presidential crony network can be distinguished, in its broad, structural features, from the board interlock network, or whether they are both, in practice, acephalous.
It will not have escaped the reader's notice that I do not present anything like the kind of network I say we should find. In the immortal words of Stephen Pinker, "Good science is pedantic, expensive, and subversive", and this is certainly all three. We're talking, after all, about collecting and manually processing an immense amount of information on at least 3,000 people, and then doing it all over again on another administration. This would be a lot of work, of a kind to which I am totally unsuited; to get results in less than a year would need a team. Moreover, it is completely unfundable, unless the Ancient and Hermetic Order of the Shrill is now giving out grants to further the study of the "mendacity, malevolence, incompetence, corruption, uselessness, simple idiocy, or sheer disconnection from realty of the George W. Bush administration". Nonetheless, I would very much like somebody to do it, because it seems to me that it could actually answer some important questions about how our country now works.
Manual Trackback: In Search of 42; Green Gabbro; Crooked Timber
Networks; The Running Dogs of Reaction; Modest Proposals; The Beloved Republic
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Recent advances in high-energy physics. (Apropos of the second link, I bought my copy of Streater and Wightman's PCT, Spin & Statistics, and All That at university library booksale, where it was shelved under "politics/current events"; I guess they missed the "T" in "PCT".)
"Pittsburgh Unprepared For Full-Scale Zombie Attack", warns The Onion (via Johnny Logic). Here in Shadyside, we can expect to be attacked by fashionably-dressed yuppie zombies (yombies?), who will serve our freshly-sliced brains in a balsamic reduction, dusted with fennel pollen and accompanied by organic heirloom tomatoes. (I rented Night of the Living Dead shortly after moving here, but found it unwatchably bad.)
The Head Heeb looks at 18th century forensics.
The Abstract Factory turns out "the only debate on Intelligent Design that is worthy of its subject".
Bill Tozier finds tongues in trees. (I had no idea that maples actually fluoresce.) Also, an astute observation on cell-phone hazards.
Why does Leon Kass say modern women are "car-owning, pill-popping, body-piercing, career-oriented, degree-granted, sexually confident, frequent-flyer, atheistic sluts" like it's a bad thing?
Tim Burke on UNESCO, Department of Bad Ideas.
Larry Bartels on "What's the Matter with What's the Matter with Kansas?" (via Phil Klinkner at PolySigh, who somehow forgot to actually give the link).
Mark Liberman elucidates an important linguistic question: when does "fuzzy" mean "smoothed piecewise linear"?
Michael Bérubé looks forward to the Miers court (as does Brad DeLong), and dares anyone to mess with his reading of Thomas Kuhn. (I like Bérubé's writing, generally, and I hope that I'll get around to posting something about The Employment of English, and how complex systems is like cultural studies. But it does bug me that he seems to care so much more about whether he got Kuhn right than about whether Kuhn got it right, since subsequent work has revealed a lot of problems with Kuhn's scheme as an accurate description of scientific change; see e.g. the papers in Scrutinizing Science.)
I have mixed feelings about stuff like this, since things like this, this and this seem pretty well institutionalized. The fact that the career military (like career academia) is socially quite isolated from the rest of the country, and tends to look down on the people it's pledged to serve, is a long-standing problem. (It's a bit more worrisome in the case of the officer corps than the professoriat.) Under the circumstances, one should be encouraging the decent, sane, capable people who are left in the service to do what they can to redeem its honor, rather than shame them into leaving. Speaking of which: Phil Carter of Intel Dump makes his first post after being deployed to Iraq.
Matthew Yglesias on the intellectuals' war, or rather case for war, and its basic folly. (Includes self-criticism.) Also from MY: why blaming declining benefits for American workers on globalization is bullshit (ObKrugman: Pop Internationalism), and why some form of American social democracy is nearly inevitable. (Brad DeLong points out that however much sense that might make, barbarism is always an alternative.) Relatedly, Nathan Newman points out that the only reason GM's workers are getting reduced health benefits rather than none is that they have a strong union; also that Harriet Miers would benefit from restoring the estate tax.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Attention conservation notice: Promotes a technical preprint by some friends. I make no attempt to explain the science, owing to lack of time.
As you know, Bob, in the 1950s J. L. Kelly established that there were intimate connections between optimal strategies for repeated gambling and Shannon's information theory. (For instance, the best achievable growth rate for the gambler's wealth is set by the entropy rate of the random sequence of gambling outcomes.) As you know also know, Bob, the mathematical theory of natural selection is closely connected to that of repeated gambling (so that, e.g., John Holland's Adaptation in Natural and Artificial Systems is in some ways an extended treatise on multi-armed bandits.) This suggests that information theory could be useful in analyzing natural selection, and it would be natural to suppose that information about the environment should manifest itself as increased fitness somehow. There's been sporadic interest in the topic (e.g., J. B. S. Haldane, with his usual prescience had an early paper in this area), but really, in my humble opinion, not enough. By way of rectification, I submit the following for your favorable consideration:
Kussell and Leibler consider Markovian environments (technically, the environmental state is a semi-Markov process), and show that the fitness penalty paid for getting the statistics of environmental changes wrong is proportional to the relative entropy (Kullback divergence) rate between the organism's switch rates and the environments. Bergstrom and Lachmann consider only independent, identically-distributed environments, but go much further in relating the fitness value of signals about the environment to traditional information-theoretic quantities, essentially considering those signals as transmission channels. (They like thinking about the value of signals.) In both cases, my feeling is that, since the Kelly gambling results carry over to general ergodic environments (see the papers of Thomas Cover, especially the ones with Paul Algoet), the evolutionary results should too. I am not, however, volunteering to perform the extensions.
I happen to know that Bergstrom and Lachmann's work is part of a more general program investigating the role of information in evolution, because I've been bugging Michael to publish his results since I heard him talk about them at the first "Science et Gastronomie" workshop two years ago. I won't say any more, for fear of spoiling their surprises, except to say that further exciting revelations are close at hand.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Start with your favorite large Erdos-Renyi random graph. Color all of the nodes, in such a manner that the number of nodes of a given color follows a strongly skewed distribution, perhaps a power law. (Exponential growth easily gives power-law size distributions.) Now form the aggregated graph, with one node for each color, and an edge between colors if any two disaggregated nodes of those colors are linked. Query: What is the degree distribution of the aggregated graph? (Inspired by thinking, while walking home, about attempts to model the structure of the Internet at the autonomous system level. Why I was doing that, I have no idea.)
Update, later that night: Aaron Clauset writes to point me to this paper:
This isn't exactly the model I had in mind; it's more realistic, for the Internet, than aggregating a static random graph. (I'm pleased to see that people who know what they're doing also thought to employ the idea that exponential growth leads to a power-law size distribution; presumably a re-invention, since they don't cite Reed and Hughes.) I remain a bit curious about the effects of aggregating a random network, but now will definitely not pursue it.
Update, 7 October: Aaron was too well-bred to point out his own papers on why many (in fact, almost all) networks seem to have power-law link distributions, when you probe them the wrong way. Fortunately, someone reminded me.
Update, 21 October: This looks relevant, if anyone's interested.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Wolfram Research has now released what is, without question, the most convicing demonstration yet of the power and utility of Stephen Wolfram's New Kind of Science: a cellphone ringtone generator. I will be terribly, terribly disappointed if these don't contain subliminal commands furthering a plan for world domination.
Update, 21 October: By coincidence, I've just run across this paper reviewing the history of using cellular automata to generate music, by Dave Burraston and Ernest Edmonds. (Pulbic-access copies of related papers here, under "cellular automata and music papers".)
Manual trackback: Three Quarks Daily
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Or, the role of the amygdala in the remembrance of things past:
That is all.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Since it is finally sinking in that I teach my first class on Monday, posting will be light for a while. So, read a book! I've updated the book recommondations in the sidebar, and put up the archived recommendations from May, June and July.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
When you want a smart take on how the more robust findings of cognitive psychology relate to the social organization of interdisciplinary knowledge-production, where do you turn but to the Central Intelligence Agency?
I'm feeling lazy, so I'll quote extensively. (All ellipses are mine.)
Intelligence analysis, like other complex tasks, demands considerable expertise. It requires individuals who can recognize patterns in large data sets, solve complex problems, and make predictions about future behavior or events. To perform these tasks successfully, analysts must dedicate a considerable number of years to researching specific topics, processes, and geographic regions. ...The very method by which one becomes an expert explains why experts are much better at describing, explaining, performing tasks, and problem-solving within their domains than are novices, but, with a few exceptions, are worse at forecasting than actuarial tables based on historical, statistical models.
A given domain has specific heuristics for performing tasks and solving problems. These rules are a large part of what makes up expertise. In addition, experts need to acquire and store tens of thousands of cases within their domains in order to recognize patterns, generate and test hypotheses, and contribute to the collective knowledge within their fields. In other words, becoming an expert requires a significant number of years of viewing the world through the lens of one specific domain. It is the specificity that gives the expert the power to recognize patterns, perform tasks, and solve problems.
Paradoxically, it is this same specificity that is restrictive, narrowly focusing the expert's attention on one domain to the exclusion of others. It should come as little surprise, then, that an expert would have difficulty identifying and weighing variables in an interdisciplinary task such as forecasting an adversary's intentions. ...One obvious solution to the paradox of expertise is to assemble an interdisciplinary team. Why not simply make all problem areas or country-specific data available to a team of experts from a variety of domains? ...
Ignoring potential security issues, there are practical problems with this approach. First, each expert would have to sift through large data sets to find data specific to her expertise....
Second, during the act of scanning large data sets, the expert inevitably would be looking for data that fit within her area of expertise. Imagine a chemist who comes across data that show that a country is investing in technological infrastructure, chemical supplies, and research and development.... The chemist recognizes that these are the ingredients necessary for a nation to produce a specific chemical agent, which could have a military application or could be benign. The chemist then meshes the data with an existing pattern, stores the data as a new pattern, or ignores the data as an anomaly.
The chemist, however, has no frame of reference regarding spending trends in the country of interest. The chemist does not know if this is an increase, a decrease, or a static spending pattern—answers that the economist could supply immediately. There is no reason for the chemist to know if a country's ability to produce this chemical agent is a new phenomenon. Perhaps the country in question has been producing the chemical agent for years and these data are part of some normal pattern of behavior.
One hope is that neither expert treats the data set as an anomaly, that both report it as significant. Another hope is that each expert's analysis of the data... will come together at some point. The problem is at what point? Presumably, someone will get both of these reports somewhere along the intelligence chain. Of course, the individual who gets these reports may not be able to synthesize the information. That person is subject to the same three confounding variables described earlier: processing time, pattern bias, and heuristic bias. Rather than solving the paradox of expertise, the problem has merely been shifted to someone else in the organization.
In order to avoid shifting the problem from one expert to another, an actual collaborative team could be built. Why not explicitly put the economist and the chemist together to work on analyzing data? The utilitarian problems with this strategy are obvious. Not all economic problems are chemical and not all chemical problems are economic. Each expert would waste an inordinate amount of time. Perhaps one case in one hundred would be applicable to both experts; during the rest of the day, the experts would drift back to their individual domains, in part because that is what they are best at and in part just to stay busy.
Closer to the real world, the same example may also have social, political, historical, and cultural aspects.... In order for collaboration to work, each team would have to have experts from many domains working together on the same data set.
Successful teams have very specific organizational and structural requirements.... Effective teams require cohesion, formal and informal communication, cooperation, and shared mental models, or similar knowledge structures. While cohesion, communication, and cooperation might be facilitated by specific work practices, creating shared mental models, or similar knowledge structures, is not a trivial task. Creating shared mental models may be possible with an air crew or a tank crew, where an individual's role is clearly identifiable as part of a larger team effort—like landing a plane or acquiring and firing on a target. Creating shared mental models in an intelligence team is less likely, given the vague nature of the goals, the enormity of the task, and the diversity of individual expertise. Moreover, the larger the number of team members, the more difficult it is to generate cohesion, communication, and cooperation. Heterogeneity can also be a challenge: It has a positive effect on generating diverse viewpoints within a team, but requires more organizational structure than does a homogeneous team.
Without specific processes, organizing principles, and operational structures, interdisciplinary teams will quickly revert to being just a room full of experts who ultimately drift back to their previous work patterns. That is, the experts will not be a team at all; they will be a group of experts individually working in some general problem space. ...Intelligence analysis uses a wide variety of expertise to address a multivariate and complex world. Each expert uses his or her own heuristics to address a small portion of that world. Intelligence professionals have the perception that somehow all of that disparate analysis will come together at some point, either at the analytic team level, through the reporting hierarchy, or through some computational aggregation.
The intelligence analyst is affected by the same confounding variables that affect every other expert: processing time, pattern bias, and heuristic bias. This is the crux of the paradox of expertise. Domain experts are needed for describing, explaining, and problem solving; yet, they are not especially good at forecasting because the patterns they recognize are limited to their specific fields of study. They inevitably look at the world through the lens of their own domain's heuristics.
What is needed to overcome the paradox of expertise is a combined approach that includes formal thematic teams with structured organizational principles; technological systems designed with significant input from domain experts; and a cadre of analytic methodologists. Intelligence agencies continue to experiment with the right composition, structure, and organization of analytic teams; they budget significant resources for technological solutions; but comparatively little is being done to advance methodological science.
Advances in methodology are primarily left to the individual domains. But relying on the separate domains risks falling into the same paradoxical trap that currently exists. What is needed is an intelligence-centric approach to methodology, an approach that will include the methods and procedures of many domains and the development of heuristics and techniques unique to intelligence. In short, intelligence analysis needs its own analytic heuristics designed, developed, and tested by professional analytic methodologists. This will require using methodologists from a variety of other domains and professional associations at first, but, in time, the discipline of analytic methodology will mature into its own sub-discipline with its own measures of validity and reliability.
I have to say it's a bit obscure to me how Johnston thinks the development of intelligence-specific methods will rectify the central problem he diagnoses. (He might just mean that it can't possibly be fixed without such methodology.) That said, the whole thing's well worth reading, especially if you're interested in the earlier discussions of heuristic diversity, or interdisciplinary science. According to this, Johnston, a post-doc at the CIA's Center for the Study of Intelligence, is by training an anthropologist, and has a forthcoming book (based on his dissertation?) titled The Culture of Analytic Tradecraft: An Ethnography of the Intelligence Community, which I'd now like to read...
The archive of declassified Studies in Intelligence articles, 1955--1976, has a lot of interesting stuff in it too, though the transcription into HTML is occasionally shaky, and it's not convenient to link directly to articles.
Update, 25 August: Henry Farrell writes to point to a forthcoming paper in Studies in Intelligence, D. Calvin Andrus's "The Wiking and the Blog: Toward a Complex Adaptive Intelligence Community". I haven't had a chance to read it yet, but it might be worthwhile. And, yes, this post was missing for a few days. I could tell you what happened, but then I'd have to...
(Profuse thanks to K. for pointing out Johnston's paper and discussing it with me.)
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Snow leopards (Uncia uncia) are big cats native to the mountains of Central Asia. They are not in fact particularly closely related to leopards, but they are solitary, beautiful animals (the young are intensely cute), and, unsurprisingly, endangered. Hearteningly, their numbers are actually increasing. Somewhat dishearteningly, part of this is due to an "involuntary park" effect: a lot of their habitat lies along the borders of unfriendly states, where armies exclude people who might otherwise want to use that land for grazing. (Obviously this excludes actual areas of continuing hostilities, like Siachen Glacier, site the "war above the clouds", of one of the most extraordinary, and pointless, conflicts of modern times.) You have to be very poor to find land like this desirable, but there's no shortage of really poor people in Central Asia. However, good work is being done by the Snow Leopard Trust in community-based conservation, trying to devise ways of actually making the presence of the animals beneficial to their human neighbors. (Iowa could use some of this.) The latest effort, in conjunction with the International Finance Corporation (a part of the World Bank group) is to bring this approach to the Sary-Chat Ertash nature reserve in Kyrgyzstan, next door to a substantial gold-mining area. Since the park rangers appear to have exactly one jeep, this seems like a good thing. In the meanwhile, if you find yourself yearning for some Central Asian handicrafts (and who doesn't, from time to time?), the Snow Leopard Trust's online store seems like a beneficent way to get them. (Via the Private Sector Development Blog at the World Bank.)
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Due to circumstances which I am at a loss to explain, I find myself on the editorial board of a new, open access, peer-reviewed journal, Structure and Dynamics: eJournal of Anthropological and Related Sciences. I am really happy to report that Volume 1, Issue 1 is now live, though I believe there are still some papers which are being processed for this issue, and the next two issues of volume 1 should follow shortly. While they are all good papers (seriously!), I will, somewhat arbitrarily and unfairly, single out one of them for special mention, on the grounds of popular appeal:
There is much more good stuff in this issue, though, and more to come soon. If you're work falls within the aims and scope of SDEAS, I strongly encourage you to submit.
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Adorable pictures of orphaned baby sloths (and their teddy bears) in Costa Rica (via just about everybody).
(My class continues to eat up all my time. I continue to find certain of Bill's posts uncomfortable reading. What to do, what to do...)
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
It was almost two years ago that I (modestly) proposed to solve our geo-political difficulties by turning the Afghan militias into US auxiliaries, and legalizing drugs. Now I see (via Intel Dump) that the first part of this idea has also occurred to Roberto Bran, who recently gave a presentation sketching out the details on the model of the Gurkhas. When I said this, it was just a sarcastic expression of despair, but Bran is trying to make a constructive policy proposal, has thought carefully about issues like language training, and, again unlike me, has some idea of what he's talking about:
I spent six months as an embedded advisor under Task Force Phoenix with a Quick Reaction force kandak (battalion) and tolei asleyah (weapons company) of the Afghan National Army (ANA). Following that assignment, the 10th Mountain Division returned to Fort Drum and I was sold to Combined Forces Command - Afghanistan (CFC-A) in Kabul, where I served as the Interagency Strategic Plans Officer and worked under some of the smartest men I have ever met. I don't pretend this qualifies me for "expertise," but at least I do have some practical experience.
I'm led to believe that this could be a good thing, if well-implemented. But I really doubt it will happen, because there is simply no domestic political constituency for it. (Thanks to Captain Bran for graciously answering some questions in e-mail.)
Afghanistan and Central Asia; Modest Proposals; The Continuing Crisis
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Behold: The High Weirdness Project, a wiki descendant of the Rev. Mr. Ivan Stang's classic High Weirdness by Mail. As one of the people responsible, along with Mitch Porter, for producing High Weirdness by World Wide Web (back when the Late Chalcolithic of Internet time was just giving way to the present Iron Age), what can I say except "Praise 'Bob'!"? (Thanks to Modemac for the pointer.)
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Does Bush's endorsement of teaching the "controversy" over intelligent design actually surprise anyone? Hopefully not, and let's take some comfort from the fact that, according to a reporter who was there, he didn't seem too eager to discuss the topic. (Full transcript here.) As usual, only Fafblog is capable responding to the news in an adequate manner.
Still, for the record: there is no scientific controversy over intelligent design. The best attempts of the intelligent design movement to produce scientific work are, as we've seen, rubbish (e.g., 1, 2). I read them from time to time, but then, I also read people who claim to have found the lost city of Atlantis in Wisconsin, or unearthed the suppressed secrets of anti-gravity, and many other varieties of crackpot. There are two reasons why the best efforts of the intelligent design movement are rubbish. The first, and most important, is that the theory, to the extent there is a theory, is false. Still, I could make out better arguments for ID than they're managing to do; they are either not trying very hard, or just not very good. Which brings us to the other reason why those best efforts are rubbish: the goal is not to produce scientific work. It is instead to give lay-people the appearance of a controversy — to generate uncertainty and doubt — so as to give excuses to politicians like Bush. Organizations like the Discovery Institute do not exist to make discoveries, or advance knowledge; they are, rather, front organizations. In their less guarded moments, people like William Dembski realize this perfectly well, and say things like "intelligent design is just the Logos theology of John's Gospel restated in the idiom of information theory". (See Larry Arnhart's exchange with Behe and Dembski in First Things. Arnhart, incidentally, is proof that intelligent, conservative evolutionists are possible; he even has an interesting book on Darwinian Natural Right, about which more another time, perhaps.)
(Some other time, I'll talk about the history which links places like the Discovery Institute back to the first wave of right-wing think-tanks like Heritage and the American Enterprise Institute, and what those in turn owe to the intellectual Cold War and ultimately to the Communist Party (USA); but in the meanwhile I'll just recommend that you read Creationism's Trojan Horse, and the chapter on "the triangular trade" in Michael Lind's Up from Conservatism. — Paul Krugman has now written about this, without, however, going all the way back to the CP.)
The thing is, this leads to bad science, and, if an unbeliever can say so, bad religion. The stakes are more serious here than with silly "devotionals with mathematical content", but the issues are not that different. Doing what you must know is shoddy science, in the hope that it will provide cover for propagating the gospel, shows a poor opinion of your fellow creatures, of the gospel, and of God. Of your fellow creatures, because you are resorting to trickery, rather than honest persuasion or the example of your own life, to win converts. Of the gospel, because you do not trust its ability to change lives and win souls. Last and worst, of God, because you are perverting what you believe to be the divine gift of intelligence, and refusing to learn about the Creator from the creation. And for what? To protect your opinion about what measure you think it fitting for God to employ.
One of the greatest passages in the Bible is when "the Lord answered Job out of the whirlwind":
Where was thou when I laid the foundations of the earth? declare, if thou hast understanding. Who hath laid the measures thereof, if thou knowest? or who hath stretched the line upon it? Whereupon are the foundations thereof fastened? or who laid the corner stone thereof; when the morning stars sang together, and all the sons of God shouted for joy?Creationism is a way of responding to this profound challenge by saying "I know! I know! You did it just like I woulda!"
Manual trackback: Crooked Timber; Nanopolitan; Signal + Noise; MoJo; Idiolect
Posted by crshalizi at August 15, 2006 13:22 |
| permanent link
Attention conservation notice: 2800 words and many large figures advertising our latest scientific paper. More than most people will ever want to know about nonlinear filtering, cellular automata and coherent structures.
A year and a day after we began working on the manuscript, here it is:
Rob and Kristina worked out the fundamental theory and algorithms for this paper and its predecessor; Rob also figured out the order parameter for cyclic cellular automata, and Kristina did the actual statistical analysis. Jean-Baptiste implemented all our ideas (in Objective CAML), and ran all the simulations. Cris came up with the idea of local sensitivity, and pushed for harder examples. I pushed for local statistical complexity, and a lot of misconceptions.
OK, assuming anyone's still reading, let me give you an illustration of the kind of thing we're talking about in the abstract.

That is a picture of the time-evolution of a one-dimensional cellular automaton ("ECA rule 54", in the jargon), starting from a random initial condition. Space goes up and down, and time advances from left to right. What you can see is that, most of the system is soon dominated by patches of a single repeating regular pattern, called a "domain". Technically, each domain is defined by a "regular language" (a certain kind of rule describing the pattern), which can extend indefinitely across the lattice, and persist indefinitely in time under the action of the cellular automata rule. ("The regular language is invariant under the time evolution".) There are also things moving through the domains ("particles"), which are another kind of structure. All this is, in this case, reasonably easy to make out by eye. You'll also notice, if you look long enough, that every once in a while the domains are disrupted seemingly out of nowhere. Since the rule is deterministic, there has to be a reason, and it turns out, if you look quite carefully, that there are multiple phases the domain could be in, that the boundaries between regions of different phase act like particles, and you're seeing the collision of those phase defects.
As I've discussed before, understanding the particles and domains of such systems is important in understanding their dynamics, and still more important in grasping their computational properties — particles and their collisions are the components people use to build computational circuits in cellular automata, and appear spontaneously in CA evolved to do non-trivial computations. Accordingly, there's a fair amount of theory now for the regular-language patterns of one-dimensional deterministic cellular automata with known rules. (Important early contributions were made by, inter alia, Wolfram, Grassberger, and Boccara; the most general theory I know of was developed by Hanson and Crutchfield. Ilachinski's textbook on CA has a pretty good review, but it's still a live subject, witness Pivato's recent work on particle kinematics.) But "general theory" here means a general framework, where all the details still have to be filled in by hand, case-by-case, after intensive communion with the pictures like that figure, and with mathematical objects like the regular-language evolution operator induced by the CA. In the end, you can build a little filter which will scan over configurations produced by the system and identify the domains and particles. (Here is a figure showing the domain and the filter, from an old paper I wrote with Wim Hordijk and Jim Crutchfield.)
And then you turn to another system, like this one (ECA rule 110), and you have to do everything over again, because all the work you've done is completely dependent on that particular system. Use your old filter on the new data, and you get nonsense.

Now, unlike my co-authors, I am lazy, which means I don't like putting that much work into figuring out the coherent structures in one system, let alone many. This is the kind of thing I want a computer to do for me, with as little input or insight on my part as possible. (As Larry Wall has said about Perl programming, this sort of laziness implies a negative discount rate: you do work now so your future self won't have to.) More seriously, the primate visual cortex is a remarkable thing, and does a marvelous job of analyzing the kinds of patterns needed to get East African Plains Apes through their natural life-cycles, but it was never supposed to cope with massive collections of high-dimensional multi-variate data, which is what science increasingly is faced with. (Talk to, say, these people if you don't believe me.) Something more automatic and principled is deeply to be desired. We tried to find a generic way of locating the places where interesting, important things were happening, on the grounds that the most interesting and important things in the system are the coherent structures. In fact, we found two ways of doing this, which turn out to be quite distinct. (I was sure, before we actually had any results, that they'd turn out to be two ways of getting at the same aspects of the system, which shows it's a good thing wiser heads were involved.)
The first quantity, which we ended up calling "local sensitivity", tries to quantify interest and importance in the sense of "having a lot of influence on the rest of the system" and "small changes here make a big difference". In classical dynamics, you quantify things like this with the Lyapunov exponents, but for a number of reasons, explored in the paper, we ended up needing something different. Basically, we perturb a small-ish region in the vicinity of a given point, and then see how large an area is affected by the perturbation over a certain interval of time; the bigger that area is, compared to how large it possibly could be, the larger the sensitivity at the point in question. Areas of high sensitivity are ones where small perturbations influence the future evolution of large parts of the system; they tend to drive their neighbors, rather than be driven by them.
The other quantity is the "local statistical complexity", in essence the number of bits of information about the past of a given point needed to optimally predict its future behavior. You might worry that this is not an objectively well-defined quantity, but we'd earlier shown how to put those fears to rest: we showed how to reconstruct the effect state space at each point ("causal state reconstruction"), and then use some information theory and the idea of a minimal sufficient statistic to show this gives an objective forecasting complexity. The details are too technical to go into here (though the connection between physical complexity and statistical inference is pleasing), so if you're really interested I'll refer you to the paper. Complex regions, in this sense, are ones where a great deal of information about the past is required for optimal prediction — where a lot of the past is relevant to the future, and fine distinctions have to be drawn between similar histories.
In practical terms, what we did was take the original CA configurations, and then compute the values of these two fields — local sensitivity and local statistical complexity — at each point in space, at each moment in time, over and over again, and then compare those results to the original field. Here, for example, is what we get looking at the first example (ECA 54) — in order, the original system (repeated here for comparison), the system as filtered for sensitivity, and as filtered for complexity. (The darker points are more sensitive or more complex, respectively.)
Looking at this, all the little defects just pop right out, even though the filters don't know anything about the phase structure of the background domain, or even that there is a background domain. Now, when we apply exactly the same filters to the second example (ECA 110), this is what we get:
In other words, we see the particles cleanly separated from their regular domain backgrounds, and the particle collisions/interactions as well; they're as complex or even more complex than the particles. Since the interactions are what you use to build a universal computer in this CA rule, that's pleasing, but secondary.
Which is all very well, but there are remarkably few one-dimensional pattern-forming systems in nature, and the regular-language story gets weird and unsatisfying in two or more dimensions. Do our filters work in more than one dimension? Well, it depends what you mean by work. The defining math has no problem in higher dimensions, but how do we know that the structures they find are the right, important ones? We needed a two-dimensional dynamical system where people had already figured out what the important structures were. For a number of reasons, we chose cyclic cellular automata (CCA) — partly because they self-organize into cute spiral waves, partly because some clever mathematicians had already thoroughly studied the system, and partly because some of us had already written papers about them and felt comfortable with them.
You really need a movie to appreciate what CCA do, but my skills don't extend to creating one, so I'll just recommend that you download Mcell