Error: I'm afraid this is the first I've heard of a "Rss" flavoured Blosxom. Try dropping the "/+Rss" bit from the end of the URL.
Tue, 24 Nov 2009
Mysteries
I've spun off the list of recommendations
into a separate page. True crime goes here too, for want of a better spot.
To read:
- Boris Akunin, The Winter Queen
- Eric Ambler
- The Dark Frontier
- Dirty Story
- Doctor Frigo
- The Intercom Conspiracy
- A Kind of Anger
- The Siege of the Villa Lipp
- Poul Anderson
- Perish by the Sword
- Murder by the Books
- Sarah Andrews
- Margot Arnold
- Kate Atkinson, Case Histories
- Sandi Ault, Wild Penance
- Robert Barnard
- The Bad Samaritan
- The Corpse at the Haworth Tandoori
- Death of an Old Goat
- Dave Barry, Big Trouble
- Nancy Bartholomew, Strip Poker
- Benjamin Black, Christine Falls
- Roberto Bolano, The Savage Detectives
- Joan Brady, Bleedout
- Ken Bruen, The Killing of the Tinkers
- Dorothy Cannell
- How to Murder the Man of Your Dreams
- How to Murder Your Mother-In-Law
- John Case, The First Horseman [Thriller about a millenarian cult (which sounds like it's based on
Aum Shinrikyo, only with more of an environmentalist ideology) and a
genetically-engineered plague based on the 1918
influenza. Thanks to Mark Worden for the rec.]
- Raymond Chandler
- Douglas Clark [Police procedurals; ingenious, "scientific" ways
to die]
- Max Allan Collins, The History of Mystery
- Martha Conley, Growing Light
- K. C. Constantine, Cranks and Shadows
- Susan Rogers Cooper, Doctors and Lawyers and Such
- Robert Crais
- The Monkey's Raincoat
- L.A. Requiuem
- Sunset Express
- Amanda Cross
- An Imperfect Spy
- The Players Come Again
- The Puzzled Heart
- The Question of Max
- A Trap for Fools
- Bill DeAndrea
- The Detective and the
Toga
- Michael Dibdin
- Vendetta
- Ratking
- Cabal
- Dark Lagoon
- Cosi Fan Tutti
- Margaret Anne Doody
- Aristotle and Poetic Justice
- Poison in Athens
- Mysteries of Eleusis
- James D. Doss, White Shell Woman
- Susan Dunlap
- Cop Out
- Not Exactly a Brahmin
- Jerrilyn Farmer, Immaculate Reception
- Robert Ferrigno
- Dead Man's Dance
- Scavenger Hunt
- Leslie Forbes
- Bombay Ice
- Fish, Blood, and Bone
- Karin Fossum, Don't Look Back
- Christopher Fowler, Ten Second Staircase
- Nicholas Freeling, Cold Iron
- Alan Furst [Many more, too...]
- Blood of Victory
- Dark Voyage
- The Polish Officer
- Red Gold
- The World at Night
- Jim Fusilli, Hard, Hard City
- Luis Alfredo Garcia-Roza, Southwesterly Wind
- Anne George, Murder Shoots the Bull
- Joe Gores
- Laurence Gough, "Willow and Parker" series (except Heartbreaker, which I've read)
- Sarah Graves
- Trap Door
- The Book of Old House
- Annie Griffin, Date with the Perfect Dead Man
- Martha Grimes, Rainbow's End
- Dashiell Hammett, The Maltese Falcon
- Rick Hanson, Body Parts
- George Harrar, The Spinning Man
- Charlaine Harris
- A Fool and His Honey
- Shakespeare's Landlord
- Shakespeare's Champion
- Shakespeare's Trollop
- Lauren Henderson, Pretty Boy
- Hiaasen
- Basket Case
- Powder Burn
- Trap Line
- A Death in China
- Susan Holtzer, The Wedding Game: A Mystery at the
University of Michigan
- Richard Hoyt, Whoo?
- Elizabeth Ironsides
- Death in the Garden
- The Accomplice
- John T. Irwin, The Mystery to a Solution: Poe, Borges,
and the Analytic Detective Story
- "Marshall Jevons"
- A Deadly Indifference
- Fatal Equilibrium
- Catherine Jinks, The Inquisitor [Review by Danny Yee]
- D. J. H. Jones, Murder in the New Age
- Clarence Budington Kelland, The Artless Heiress
- Stephen King, The Colorado Kid
- Henry Kisor, Cache of Corpses
- Gary Krist, Chaos Theory
- Jane Langton
- Emily Dickinson Is Dead
- The Shortest Day
- Gaylord Larson ["writes a series about an agnostic/rationalist
detective who has various jobs in fundamentalist Christian colleges and
solves crimes in those places... really first rate" --- Jane Haddam on
dorothy-L, of happy memory]
- Don Lee, Country of Origin
- Denis Lehane
- A Drink Before the War
- Darkness, Take My Hand
- Sacred
- Gone, Baby, Gone
- Mystic River
- Elmore Leonard [w/ thanks to John Burke for recommendations]
- Bandits
- Cat Chaser
- City Primeval
- Freaky Deaky
- Get Shorty
- Glitz
- Gold Coast
- La Brava
- The Moonshine War
- Out of Sight
- Pagan Babies
- Pronto
- Rum Punch
- Split Images
- Stick
- Swag
- Tishomingo Blues
- Herbert Lieberman, Brilliant Kids
- Laura Lippman
- Every Secret Thing
- To the Power of Three
- What the Dead Know
- Peter Lovesey, The Reaper
- Shaun Maloney, Stiff ["Set in Melbourne amid the
machinations of the Australian Labor Party, this book is extremely funny,
beautifully written and for those of us of the left persuasion, scarily close
to the bone" --- from review in rec.arts.books.reviews by Katrina Beard]
- Henning Mankell, Before the Frost
- Lia Matera
- Seicho Matsumoto, The Voice
- Lise McClendon, The Bluejay Shaman
- Miyuki Miyabe, All She Was Worth
- Bob Morris, Bahamarama
- John Mortimer
- Rumpole on Trial
- The Penge Bungalow Murders
- Walter Mosley, Devil in a Blue Dress
- Magdalen Nabb, Some Bitter Fruit
- Leslie O'Kane, The Cold Hard Fax
- Jill Paton Walsh
- A Piece of Justice
- Debts of Dishonour
- The Bad Quarto
- Barbara Paul, Your Eyelids are Growing Heavy
- Rebecca Pawel, The Law of Return
- George P. Pelecanos
- Arturo Pérez-Reverte, The Queen of the South
- Marissa Piesman
- Close Quarters
- Survival Instincts
- Unorthodox Practices
- Lev Raphael, The Edith Wharton Murder
- Ruth Rendell
- A Demon in My View
- Gallowglass [as "Barbara Vine"]
- The Speaker of Mandarin
- To Fear a Painted Devil
- Robert Rice, The Nature of Midnight
- Phil Rickman, To Dream of the Dead
- Candace Robb, The Lady Chapel
- Rebecca Rothenberg, The Shy Tulip Murders
- Barbara Sarenella
- No Human Involved
- No Offense Intended
- David Schmid, Natural Born Celebrities: Serial Killers in American Culture
- Lisa Scottoline
- Everywhere that Mary Went
- Legal Tender
- Mistaken Identity
- Running from the Law
- Mabel Seeley
- The Beckoning Door
- The Chuckling Fingers
- The Whispering Cup
- Catherine Shaw, The Three Body Problem [That three body problem, no less]
- Jenny Siler, Easy Money
- David "Mr. Laura Lippman" Simon, Homicide
- John Sladek, Invisible Green
- James B. Stewart, Blind Eye [#2 Amazon seller with the
Massachusetts Medical Society, as of 22 Feb. 2000.]
- Dorothy Sucher, Dead Men Don't Marry
- Akimitsu Takagi
- Honeymoon to Nowhere
- The Informer
- James Thompson, Snow Angels
- Stefan Timmermans, Postmortem: How Medical Examiners Explain
Suspicious Deaths [Blurb]
- Marilyn Todd
- I, Claudia
- Virgin Territory
- Man Eater
- Kathy Hogan Trocheck
- Lisa Unger, Beautiful Lies
- Mark Urban, The Linguist
- T. C. Van Adler, St. Agatha's Breast [Review by Danny Yee]
- Fred Vargas [Thanks to Jan Johnson for the recommendation]
- Have Mercy on Us All
- Three Evangelists
- Wash This Blood Clean from My Hand
- Patricia Wentworth
- Out of the Past
- Wicked Uncle
- Donald Westlake
- Cops and Robbers
- Don't Ask
- Put a Lid on It
- Humans [I know, it's not really a mystery]
- Polly Whitney
- Until Death
- Until the End of Time
- Don Winslow, A Long Walk up the Waterslide
- L. R. Wright, A Touch of Panic
- Rafael Yglesias, The Murderer Next Door
#
Recommended Mystery Novels
Including true crime and spy stories, for lack of better places to put them.
See also:
Mysteries to read.
- Bruce Alexander (pseud. of Bruce Alexander Cook)
- Blind Justice
- Murder in Grub Street
- Watery Grave
- Person or Persons Unknown
- Eric Ambler
- The Ability to Kill [Nonfiction]
- Background to Danger
- Cause for Alarm
- A Coffin for Dimitrios
- Journey into Fear
- Judgment on Deltchev
- The Levanter
- The Light of Day [filmed as
Topakpi, unseen by me]
- A Passage of Arms
- The Schermer Inheritance
- State of Siege
- Donna Andrews
- Murder, with Peacocks
- Murder with Puffins
- Revenge of the Wrought-Iron Flamingoes
- Noreen Ayres
- A World the Color of Salt
- Carcass Trade
- Margaret Barrett and Charles Dennis, Given the Crime
- Nancy Bartholomew [Very light, but fun if approached in
the right spirit]
- Miracle Strip
- Drag Strip
- Film Strip
- Cheryl Benard, Moghul Buffet
- John Billheimer, Highway Robbery
- C. J. Box
- Out of Season
- Savage Run
- Winterkill
- Trophy Hunt
- Edgar Box (pseud. of Gore Vidal) [Review]
- Death in the Fifth Position
- Death Likes It Hot
- Death Before Bedtime
- Ken Bruen, The Guards
- Paul Bryers, The Prayer of the Bone
- Chelsea Cain
- Heartsick
- Sweetheart
- Evil at Heart
- Andrea Camilleri [A series, but I've read them out of order without
any discernable loss, so I won't try straighten them out here]
- Excursion to Tindari
- The Heat of August
- The Patience of the Spider
- The Shape of Water
- The Smell of the Night
- The Snack Thief
- The Terra-cotta Dog
- Voice of the Violin
- Rounding the Mark
- The Patience of the Spider
- The Paper Moon
- Sarah Caudwell [Pseud. of the late Sarah Caudwell Cockburn, sister
of the unfortunate leftist columnist Alexander Cockburn.]
- Thus Was Adonis Murdered
- The Sirens Sang of Murder
- The Shortest Route to Hades
- The Sibyl in Her Grave
- Karen Rose Cercone
- Steel Ashes
- Blood Tracks
- Coal Bones
- Amanda Cross (pseud. of Prof. Dr. Carolyn Heilbrun)
- The James Joyce Murder
- Poetic Justice
- The Theban Mysteries
- Avram Davidson [General Review: Avram Davidson's
Afterlife]
- The Enquiries of Dr. Eszterhazy [There's a
later collection, Adventures of Dr. Eszterhazy, with more stories;
I've not yet laid hands upon it.]
- The Investigations of Avram Davidson
[Posthumous collection, ed. Grania Davis and Richard A. Lupoff]
- Lindsay
Davis
- Silver Pigs [Review by Danny Yee]
- Shadows in Bronze
- Venus in Copper
- The Iron Hand of Mars
- Poseidon's Gold
- Last Act in Palmyra
- Time to Depart
- A Dying Light in Corduba
- Three Hands in the Fountain
- Two for the Lions
- One Virgin too Many
- Bill DeAndrea [The "Killed In" books are a series, but I won't
try to sort out order]
- Fatal Elixir
- Five O'Clock Lightning
- The Hog Murders
- Killed in the Ratings
- Killed in Paradise
- Killed on the Rocks
- Killed in the Fog
- Killed in Fringe Time
- Killed in the Act
- The Lunatic Fringe
- Margaret Anne Doody
- Aristotle Detective
- Aristotle and the Secrets of Life
- Aaron Elkins
- Fellowship of Fear
- The Dark Place
- Murder in the Queen's Armes
- Old Bones
- Curses!
- Icy Clutches
- Make No Bones
- Dead Men's Hearts
- Twenty Blue Devils
- Janet Evanonvich [The
series loses interest after a while. Maybe even before the 4th book...]
- One for the Money
- Two for the Dough
- Three to Get Deadly
- Four to Score
- Robert Ferrigno
- Cheshire Moon
- Dead Silent
- Flinch
- Heart Breaker
- The Horse Latitudes
- Leslie Forbes, Waking Raphael
- John M. Ford, Scholars of Night
- Alan Furst [To be completely honest, Furst strikes me as imitation
Eric Ambler, differing mostly in being more explicit about sex. However, it
is good imitation Ambler, and since the original article is no longer
being produced...]
- Dale Furutani
- Death at the Crossroads
- Jade Palace Vendetta
- Nicholas Freeling, Auprès de ma blonde
- Randall Garrett
- Too Many Magicians
- Lord Darcy Investigates
- Murder and Magic
- Nadia Gordon
- Sharpshooter
- Death by the Glass
- Murder Alfresco
- Lethal Vintage [comments]
- Laurence Gough, Heartbreaker
- Sarah Graves
- The Dead Cat Bounce
- Triple Witch
- Wicked Fix
- Repair to Her Grave
- Wicked Fix
- Unhinged
- Mallets Aforethought
- Tool and Die
- Nail Biter
- Jane Haddam (pseud. of
Orania Papazoglou; see below)
- Not a Creature Was Stirring
- Precious Blood
- Acts of Darkness
- Quoth the Raven
- Great Day for the Deadly
- A Stillness in Bethlehem
- Feast of Murder
- Dear Old Dead
- Fountain of Death
- Murder Superior
- Festival of Deaths
- Bleeding Hearts
- And One to Die On
- Deadly Beloved
- Baptism in Blood
- Skeleton Key
- True Believers
- Somebody Else's Music
- Conspiracy Theory
- The Headmaster's Wife
- Hardscrabble Road
- Glasshouses
- Cheating at Solitaire
- Living Witness
- Steve Hamilton, A Cold Day in Paradise
- Charlaine Harris, Last Scene Alive
- Sparkle Hayter (just
looks like a pseud.)
- What's a Girl Gotta Do?
- Nice Girls Finish Last
- Revenge of the Cootie Girls
- The Last Manly Man
- Lauren Henderson
- Black Rubber Dress
- Freeze My Margarita
- Strawberry Tattoo
- Chained
- Carl Hiaasen
- Double Whammy
- Native Tongue
- Striptease
- Stormy Weather
- Tourist Season
- Tony Hillerman
- Peter Hoeg, Smilla's Sense of Snow [Except that the
ending really makes no sense at all.]
- Elizabeth Ironside, A Very Private Enterprise
- Stan Jones
- White Sky, Black Ice
- Shaman Pass
- B. B. Jordan (pseud. of Frances Brodsky), Principal
Investigation [The sequel, Secondary Immunization, was
disappointing.]
- Jane Langton
- Dennis Lehane, Shutter Island
- Jonathan Lethem, Motherless Brooklyn
- Laura Lippman [Baltimore mysteries (except for In Big
Trouble), which are supposed to be really good at conveying the feel of
the city. I can't speak to that, but they did help me recognize some of the
neighborhoods and even restaurants when I went there for a job interview in
2001.]
- Balitmore Blues
- Charm City
- Butcher's Hill
- In Big Trouble
- The Sugar House
- In a Strange City
- The Last Place
- By a Spider's Thread
- No Good Deed
- Margaret Maron
- Bootlegger's Daughter
- Southern Discomfort
- Shooting at Loons
- Up Jumps the Devil
- Killer Market
- Home Fires
- Uncommon Clay
- William Marshall ["Yellowthread Street" mysteries, police
procedurals set in Hong Kong. A weird combination of suspense, detection,
broad humor and really disturbing situations, which works better than it has
any right to, and sometimes (e.g., Perfect End,
Frogmouth, and especially Out of Nowhere) leaves me
twitchy. --- The internal chronology of the books is unimportant, so I've
listed them alphabetically.]
- Frogmouth
- Gelignite
- The Hatchet Man
- Head First
- Inches
- Nightmare Syndrome
- Out of Nowhere
- Perfect End
- Roadshow
- Sci Fi
- Skullduggery
- Thin Air
- War Machine
- Yellowthread Street
- Sujata Massey, The Salaryman's Wife
- Seicho Matsumoto
- Inspector Imanishi Investigates
- Points and Lines
- Henry Mazel Murderously Incorrect [Review by Danny Yee]
- Kirk Mitchell [Police procedurals, set in various parts of Indian
Country. Benefit a bit from reading in order, though it's not strictly
required.]
- Cry Dance
- Spirit Sickness
- Ancient Ones
- Sky Woman Falling
- Dance of the Thunder Dogs
- John Mortimer
- Rumpole of the Bailey
- The Trials of Rumpole
- Rumpole's Return
- Rumpole for the Defense
- Rumpole and the Golden Thread
- Rumpole's Last Case
- Rumpole and the Age of Miracles
- Rumpole à la Carte
- Rumpole and the Angel of Death
- Katy Munger
- Legwork
- Money to Burn
- Bad to the Bone
- Better Off Dead
- Out of Time
- Jake Page
- The Stolen Gods [But maybe only those who have
lived in Santa Fe can fully appreciate the delicious absurdity of the car
chase]
- The Deadly Canyon
- The Knotted Strings
- The Lethal Partner
- Orania Papazoglou
- Charisma [Very good but very disturbing novel]
- Patience McKenna books
- Death's Savage Passion
- Sweet Savage Death
- Wicked Loving Murder
- Rich, Radiant Slaughter
- Once and Always Murder
- I. J. Parker [Narrative order, which is not the same as the
publication order]
- The Dragon Scroll [Remarks]
- Rashomon Gate [Remarks]
- Black Arrow [Remarks]
- Island of Exiles [Remarks]
- The Hell Screen [Remarks]
- The Convict's Sword [Remarks]
- Jill Paton Walsh, The Wyndham Case
- Elizabeth Peters
- Richard Price
- Phil Rickman [Nominally horror novels tracking the career of the
Reverend Merrily Watkins, an Anglican priest working in a particularly
depressed area of the Welsh-English border, they follow the British ghost-story
convention of having a parallel "rational" explanation, so they can be enjoyed
as mysteries with creepy effects. Sometimes, to be honest, Rickman's
non-supernatural explanations are so contrived any thinking person would go for
the ghosts.]
- The Wine of Angels
- Midwinter of the Spirit
- A Crown of Lights
- The Cure of Souls
- The Lamp of the Wicked
- Prayer of the Night Shepherd
- The Smile of a Ghost
- The Remains of An Altar
- Fabric of Sin
- Madeleine E. Robins, Point of Honour [Mini-review]
- Laura Joh Rowland [Historical mysteries in Tokugawa-era Japan.
Series fatigue set in for me around volume 4...]
- Shinju
- Bundori
- The Concubine's Tattoo
- The Samurai's Wife
- C. J. Sansom
- Dissolution
- Dark Fire
- Sovereign
- Revelation
- Dorothy Sayers
- Steven Saylor
- Roman Blood
- A Murder on the Apian Way
- Aileen Schumacher [Engineering mysteries]
- Engineered for Murder
- Framework for Death
- Affirmative Reaction
- Rosewood's Ashes
- Roger L. Simon, Raising the Dead
- Dan Simmons
- Karin Slaughter [Warning: these are extremely disturbing
novels; the first two are about, respectively, serial rape and murder, and
child pornography. The others are not much lighter, if at all. But they
are very well-written.]
- "Grant County" series:
- Blindsighted
- Kisscut
- A Faint Cold Fear
- Indelible
- Faithless
- Beyond Reach
-
- Triptych
- Fractured
- Undone [fuses the two series]
- Julia Spencer-Fleming
- In the Bleak Midwinter
- A Fountain Full of Blood
- Out of the Deep I Cry
- To Darkness and to Death
- Rex Stout
- Peter Straub [Straub normally writes horror novels, and these get
marketed as such, but they're really excellent mystery novels and thrillers,
with no supernatural elements at all, unless I've completely misunderstood
them. Which is possible, since he's tricky.]
- The Hellfire Club
- The Blue Rose books [Three interlocking but
completely incompatible renditions of a story about a killer called "Blue
Rose", and three different disguises for Straub's native city of Milwaukee;
this is impressive, if you notice it, but not distracting in the least if you
don't. Reading Koko and The Throat could convince
you that Straub fought in the Vietnam War, but then, reading his Ghost
Story could convince you that he lived in a haunted town in upstate New
York.]
- Koko
- Mystery
- The Throat
- Dorothy Sucher, Dead Men Don't Give Seminars [Good
observations of what theoretical physicists are like; also a fun novel. But I
can attest, on the basis of my experience as a graduate student, that the title,
however fine, is not strictly true.]
- Leann Sweeney, Pick Your Poison
- Akimitsu Takagi, The Tattoo Murder Case [Nowhere near
as kinky as the cover-blurbs make it sound]
- James Tucker [Pittsburgh settings, but I read and enjoyed
the first two before moving here]
- Abra Cadaver
- Hocus Corpus
- Tragic Wand
- Janwillen van de Wettering, Robert Van Gulik: His
Life, His Work
- Robert Hans van Gulik [Unfortunately, I can't really put these in
narrative order, since I haven't any copies to hand...]
- (trans.) Celebrated Cases of Judge Dee: Dee Gong
An
- The Chinese Nail Murders [Review by Danny Yee]
- The Chinese Gold Murders [Review by Danny Yee]
- The Chinese Maze Murders
- Chinese Lake Murders
- The Moneky and the Tiger
- Judge Dee at Work
- The Phantom of the Temple
- The Haunted Monastery
- Murder in Canton
- The Red Pavillion
- The Willow Pattern
- Poets and Murder
- John Holbrook Vance
- Fred Vargas, Seeking Whom He May Devour
- Donald Westlake
#
Sat, 21 Nov 2009
Evolutionary Economics
See also Learning in Games;
Memes;
QWERTY. The connection to institutional economics is something I want to
understand better, but then, I need to learn a lot more about institutionalism.
Recommended:
- Esben Sloth Andersen, Evolutionary Economics:
Post-Schumpetrian Contributions [The man has this hideous way of
dragging out obscure methodological
arguments, but once you cut through those parts, it's rather good]
- Jenna Bednar and Scott Page, "Games Theory and Culture" [PDF]
- Michel Benaïm and Jörgen W. Weibull, "Deterministic Approximation of Stochastic Evolution in Games", Econometrica 71 (2003): 879--903 [JSTOR]
- Lawrence E. Blume and David Easley
- "If You're So Smart, Why Aren't You Rich? Belief Selection
in Complete and Incomplete Markets," SFI Working Paper 01-06-031
- "Optimality and Natural Selection in Markets," SFI
Working Paper 98-09-0 82
- Samuel Bowles, Microeconomics: Behavior, Institutions, and
Evolution
- Steven N. Durlauf and H. Peyton Young (eds.), Social
Dynamics [Mini-review]
- Richard W. England (ed.), Evolutionary Concepts in
Contemporary Economics [Contains an absolutely appalling essay by a pair
of dyed-in-the-wool Althusserians, and England on entropy isn't much better; but
Hodgson is good, and so is Nelson, and the rest are at least passable]
- Herbert Gintis, Game Theory Evolving: A Problem-Centered
Introduction to Modeling Strategic Interaction
- Alexis Jacquemin, The New Industrial Organization: Market
Forces and Strategic Behavior ["New" in 1985, when this was published in
French as Selection et pouvoir dans la nouvelle economie
industrielle. Still, good on the relationships between evolutionary
processes and strategic behavior, and the weakness of the
evolution-to-optimality arguments. The last sections, on sociobiology and the
meaning of life, are however very weak.]
- Paul Krugman, "What Economists Can Learn
from Evolutionary Theorists"
- Richard R. Nelson, "Evolutionary Theories of Cultural Change: An
Empirical Perspective" ["the standard articulations of a Universal Darwinism
put forth by biologists and philosophers tends to be too narrow, in particular
too much linked to the details of evolution in biology, to fit with what is
known about cultural
evolution." PDF
preprint.]
- Nelson and Winter, An Evolutionary Theory of Economic
Change [To be honest, I've not finished this yet, but I have gotten far
enough to agree that it deserves to be a standard work]
- Larry Samuelson (no relation of the Samuelson),
Evolutionary Games and Equilibrium Selection
- Reinhard
Selten, "Evolution, Learning, and Economic Behavior", Games and
Economic Behavior 3 (1991): 3--24 [In the form of a
dialogue between a Bayesian, an economist, an experimental psychologist, an
adaptationist biologist, a population geneticist, and an ethologist]
- John Sutton [Finally an economist who appreciates (a) evolutionary thinking
and (b) the importance of neutrality and null models]
- Technology and Market Structure: Theory and
History
- "Flexibility, Profitability and Survival in an (Objective)
Model of Knightian Uncertainty"
[PDF
preprint. Decision-making when the crucial variable is the indicator
function of an unmeasurable set, i.e., one which doesn't actually have
a probability.]
- H. Peyton Young, Individual Strategy and Social Structure: An
Evolutionary Theory of Institutions
[Review: A Myopic (and
Sometimes Blind) Eye on the Main Chance, or, the Origins of Custom]
To read:
- Howard Aldrich, Organizations Evolving
- James Bergin and Barton L. Lipman, 1996, "Evolution with
State-Dependent Mutations," Econometrica 64
(1996): 943--956
- Kenneth Boulding, Evolutionary Economics
- Glenn R. Carroll and Michael T. Hannan, The Demography of
Corporations and Industries
- Choi, Paradigms and Conventions
- Day and Chen (eds.), Non-Linear Dynamics and Evolutionary
Economics
- John De La Mothe and Gilles Paquet (eds.), Evolutionary
Economics and the New International Political Economy
- Delmore and Dorfer (eds.), The Political Economy of
Diversity: Evolutionary Perspectives on Economic Order and Disorder
- Kurt Dopfer (ed.), The Evolutionary Foundations of Economics
- Wendell Gordon and John Adams, Economics as Social Science:
An Evolutionary Approach
- Peter Hall, Innovation, Economics and Evolution: Theoretical
Perspectives on Changing Technology in Economic Systems
- David Boyce Hamilton, Evolutionary Economics: A Study of
Change in Economic Thought
- Hardy Hanappi, Evolutionary Economics: The Evolutionary
Revolution in the Social Sciences
- Michael T. Hannan and John Freeman, Organizational
Ecology
- Hodgson
- Evolution and Economics: Bringing Life Back Into
Economics
- Economics and Institutions
- Geoffrey M. Hodgson and Thorbjorn Knudsen, "The firm as an
interactor: firms as vehicles for habits and routines", Journal of
Evolutionary Economics 14 (2004): 281--307
- Richard N. Langlois and Paul L. Robertson, Firms, Markets and
Economic Change: A Dynamic Theory of Business Institutions
- Sten A. O. Thore, The Diversity, Complexity and Evolution of
High Tech Capitalism
- Philippe van Parijs, Evolutionary Explanation in the Social
Sciences
- Thorstein Veblen, "Why Is Economics Not an Evolutionary Science?",
Quarterly Journal of Economics 12 (1898):
373--397[JSTOR; plain
text transcription]
- Jack J. Vromen, Economic Evolution: An Inquiry into the
Foundations of the New Institutional Economics
#
Evolution (of Organisms)
[A proper discussion of evolution will appear here Any Time Now.]
Issues in evolution proper:
adaptation;
complexity;
developmental constraints and the
evolution of development;
ecology and co-evolution;
genetics;
sociobiology (in non-human beasties; in human beings, and what exactly
it can and cannot account for);
units of selection controversies (genes [Dawkins, Maynard Smith,
Williams] vs. gene-complexes [Lewontin, sorta] vs. organisms [Williams the
first time around] vs. groups [Sober?]) and group-selection arguments (when
can traits which benefit a higher level of selection at the expense of the
lower ones evolve? Probably never; the higher level entities don't have enough
coherence and persistence to act as replicators).
Query: What is known about the asymptotic distribution of the
population under (discrete-time) replicator dynamics? What if the space of
types in the replicator dynamics is infinite-dimensional? Or the fitness
function is subject to stochastic shocks? Or both? (This now has
its own notebook.)
Extensions of evolution:
to brain function;
to computer programming ;
to culture (memetics);
to economics;
to epistemology;
to psychology.
Mathematical modeling: classical population genetics à
la Fisher, Haldane and Wright, and its extensions via dynamics; game theory à la John Maynard
Smith. Connections to physics.
Agent-based modeling.
Challenges to neo-Darwinism: Here, as usual, my inclinations are
conservative, in that I really don't see what's wrong with the orthodox theory.
In any case, there don't seem to be any real alternatives yet
advanced. (Neutral mutations by definition explain the origins of neither
adaptations nor species.)
Recommended, non-technical:
- John Tyler Bonner
- On Development [Points out that it's a mistake
to think of just the adult form-and-behavior "evolving"; it's really the
whole life-cycle, complicating matters considerably...]
- On the Evolution of Complexity, by Means of Natural
Selection [Review]
- Richard Dawkins
- The Blind Watchmaker
- Climbing Mount Improbable
- The Selfish Gene
- Daniel Dennett, Darwin's Dangerous
Idea
- Theodosius Dobzhansky, "Nothing in Biology Makes Sense
Except in the Light of Evolution" [first pub. American Biology
Teacher 35 (March 1973): 125--129]
- Stephen Jay Gould, anything. [But be warned that a lot of what he
says about the Burgess Shale fauna in Wonderful Life is disputed
--- by his own sources...]
- François Jacob, The Possible and the Actual [I
really must get around to typing up my notes from his lecture on conserved gene
sequences]
- Charles H. Lineweaver, "Paleontological Tests: Human-like
Intelligence is not a Convergent Feature of
Evolution", arxiv:0711.1751 [A
very nice, and simple, way to see the error of some common ways of looking at
evolution]
- John Maynard Smith [As a god to population genetics; disciple of
Haldane]
- Did Darwin Get It Right? Essays on Games, Sex, and
Evolution
- The Theory of Evolution [An introductory book
which avoids getting tangled up in technical details without sacrificing
intellectual rigor or pushing any really strange ideas]
- and Eörs Szathmáry, The Origins of Life:
From the Birth of Life to the Origin of Language [Review: Major Transitions
Minor]
- Jacques Monod, Chance and Necessity [Review by Danny
Yee]
- Karl Sigmund, Games of Life: Explorations in Ecology,
Evolution and Behavior
- Homer W. Smith, Kamongo, or, The Lungfish and the
Padre
- talk.origins is where Usenet hashes
out creation vs. evolution. (It should be pretty obvious that I am on the side
of the apes.) There's a fine website
with archives of posts to the newsgroups, the excellent FAQs (read 'em
before attempting to convert the heathen), a nifty gallery of
fossils and the University of
Edicara
- Bruce Wallace and Adrian M. Srb, Adaptation
- Jonathan Weiner, The Beak of the Finch [Field
confirmation of neo-Darwinism, from, nicely enough, the Galapagos. (Please
don't start an argument about what it means to confirm a scientific theory
here.)]
Recommended, technical:
- Armen A. Allahverdyan and Chin-Kun Hu, "Replicators in Fine-grained
Environment: Adaptation and
Polymorphism", arxiv:0905.3297 [A
cute use of averaging techniques, though the particular one they employ will I
think break down if the perturbations to fitness are not strictly periodic.]
- Lauren W. Ancel and Walter Fontana, "Plasticity, Evolvability and
Modularity in RNA," Journal of Experimental Zoology (Molecular and
Developmental Evolution), 288 (2000): 242--283 [Reprint]
- Carl T. Bergstrom and Michael Lachmann, "The fitness value of
information", q-bio.PE/0510007
- Charles Darwin, On the Origin of
Species, by Means of Natural Selection, or Preservation of Favored Races in the
Struggle for Life [Get the fascimile of the first edition with the
introduction by Ernst Mayr]
- Richard Dawkins, The Extended
Phenotype [Review: Not
Your Parents' Web of Life]
- R. A. Fisher, The Genetical Theory of Natural
Selection
- Laura F. Galloway and Julie R. Etterson, "Transgenerational
Plasticity Is Adaptive in the Wild", Science 318
(2007): 1134--1136
- John Gerhart and Marc Kirschner, Cells, Embryos and
Evolution: Toward a Cellular and Developmental Understanding of Phenotypic
Variation and Evolutionary Adaptability
- John H. Gillespie, Population Genetics: A Concise
Guide [Review: Darwin's
Equations (II)]
- J. B. S. Haldane, The Causes of Evolution [Review: Darwin's Equations]
- Paul H. Harvey and Mark D. Pagel, The Comparative Method in
Evolutionary Biology [Review: On the
Uses of Knowing Where Birds and Bees Come From]
- Josef Hofbauer and Karl Sigmund, The Theory of Evolution and
Dynamical Systems: Mathematical Aspects of Selection [Mathematically
explicit, i.e., full frontal differentiable manifolds]
- John Holland, Adaptation in Natural and Artificial Systems
- T. H. Huxley, The
Huxley File [Nearly complete works, ed. and put on-line by Charles
Blinderman and David Joyce]
- R. C. Lewontin, The Genetic Basis of Evolutionary
Change [An excellent book; but The
Dialectical Biologist may be safely ignored]
- John Maynard Smith, Evolution and the Theory of Games [Review
by Danny Yee]
- Mark E. J. Newman
- "Simple Models of Evolution and Extinction," adap-org/9910003
- MEJN and Gunther J. Eble, "Decline in Extinction Rates and
Self-Similarity in the Fossil Record," Paleobiology
25 (1999): 434--439 = adap-org/9809004
- MEJN and Richard G. Palmer, "Models of Extinction: A
Review," adap-org/9908002
- Martin Nilsson and Nigel Snoad, "Error Thresholds for Quasispecies
on Dynamic Fitness Landscapes," Physical Review Letters
84 (2000): 191--194
- Mark Pagel, "Inferring the Historical Patterns of Biological
Evolution," Nature 401 (1999): 877--884
[Warning: This paper makes the staggering mistake of confusing likelihood
(probability of a model's generating results like the data) with posterior
probability (probability of the model, given the data). Pagel definitely
doesn't make this elementary blunder elsewhere; I dunno what happened here.]
- David C. Queller, "The Spaniels of St. Marx and the Panglossian
Paradox: A Critique of a Rhetorical Programme," Quarterly Review of
Biology 70 (1995): 485--489
- Erik van Nimwegen, The Statistical Dynamics of Epochal
Evolution [Ph.D. thesis, University of Utrecht,
1999; on-line]
- George C. Willams, Adaptation and Natural Selection; a
Critique of Some Current Evolutionary Thought [Well, current in 1966,
anyway; that it is no longer current is due largerly to this book!]
- Guoping Zhu, G. Brian Golding and Antony M. Dean, "The Selective
Cause of an Ancient Adaptation", Science 307
(2005): 1279--1282 [Where "ancient" means "about 3.5 billion years old".]
To read:
- P. Ao, "Mathematical Structure of Evolutionary Theory", q-bio.QM/0403020 ["Here we
postulate three laws which form a mathematical framework to capture the essence
of Darwinian evolutionary dynamics. The second law is most quantitative and is
explicitly expressed by a unique form of stochastic differential equation."
Color me skeptical, but I haven't read beyond the abstract.]
- Wallace Arthur, Creatures of Accident: The Rise of the
Animal Kingdom
- John C. Avise, Evolutionary Pathways in Nature: A Phylogenetic
Approach [blurb]
- Ellen Baake and Wilfried Gabriel, "Biological Evolution through
Mutation, Selection, and Drift: An Introductory Review," cond-mat/9907372
- Ellen Baake, Hans-Otto Georgii, "Mutation, selection, and ancestry
in branching models: a variational
approach", q-bio.PE/0611018
- Richard K. Belew and Melanie Mitchell (eds.), Adaptive
Individuals in Evolving Populations: Models and Algorithms
- J. B. Beltman and P. Haccou, "Speciation through the learning of
habitat features", Theoretical Population
Biology 67 (2005): 189--202
- Ioana Bena, Michel Droz and Andrzej Pekalski, "Complex population
dynamics as a competition between multiple time-scale
phenomena", q-bio.PE/0703033
- Carl T. Bergstrom and Rustom Antia, "How do adaptive immune
systems control pathogens while avoiding
autoimmunity?", Trends in Ecology and
Evolution 21 (2006): 22--28
[PDF
reprint via Carl]
- Peter Bowler, Evolution: The History of an Idea
- R. A. Blythe and A. J. McKane, "Stochastic Models of Evolution in
Genetics, Ecology and
Linguistics", cond-mat/0703478
- Daniel R. Brooks and Deborah A. McLennan, The Nature of
Diversity: An Evolutionary Voyage of Discovery [Blurb]
- Sean B. Carroll, From DNA to Diversity: Molecular Genetics
and the Evolution of Animal Design
- Nicolas Champagnat, Régis Ferrière, Sylvie
Méléar, "Individual-based probabilistic models of adaptive
evolution and various scaling
approximations", math.PR/0510453 ["We are
interested in modelling Darwinian evolution, resulting from the interplay of
phenotypic variation and natural selection through ecological interactions. Our
models are rooted in the microscopic, stochastic description of a population of
discrete individuals characterized by one or several adaptive traits. The
population is modelled as a stochastic point process whose generator captures
the probabilistic dynamics over continuous time of birth, mutation, and death,
as influenced by each individual's trait values, and interactions between
individuals. An offspring usually inherits the trait values of her progenitor,
except when a mutation causes the offspring to take an instantaneous mutation
step at birth to new trait values. We look for tractable large population
approximations. By combining various scalings on population size, birth and
death rates, mutation rate, mutation step, or time, a single microscopic model
is shown to lead to contrasting macroscopic limits, of different nature:
deterministic, in the form of ordinary, integro-, or partial differential
equations, or probabilistic, like stochastic partial differential equations or
superprocesses. In the limit of rare mutations, we show that a possible
approximation is a jump process, justifying rigorously the so-called trait
substitution sequence."]
- Nicolas Champagnat and Amaury Lambert, "Evolution of discrete
populations and the canonical diffusion of adaptive
dynamics", math.PR/0601643
- Kim Christensen, Simone A. di Collobiano, Matt Hall, and Henrik
J. Jensen, "Tangled Nature: a model of evolutionary ecology," cond-mat/0104116
- Freddy Bugge Christiansen, Theories of Population Variation in
Genes and Genomes [blurb, intro]
- Jens Christian Claussen and Arne Traulsen, "Non-Gaussian
fluctuations arising from finite populations: Exact results for the
evolutionary Moran
process", Physical
Review E 71 (2005): 025101
- Charles C. Cockell, Impossible Extinction: Natural
Catastrophes and the Supremacy of the Microbial World
[blurb]
- Elisheva Cohen, David A. Kessler and Herbert Levine, "Recombination
dramatically speeds up evolution of finite populations", q-bio.PE/0410015
- Claude Combes, The Art of Being a Parasite
[Review by Danny Yee]
- Vincent Courtillot, Evolutionary Catastrophes: The Science of
Mass Extinction
[blurb]
- Depew and Weber, Darwinism Evolving: Systems Dynamics and the
Genealogy of Evolution [Beware of genealogies which aren't actually
about families. Review
by John Maynard Smith.]
- Fabio Dercole and Sergio Rinaldi, Analysis of Evolutionary
Processes: The Adaptive Dynamics Approach and Its Applications
[Blurb, ch. 1]
- Adrian Desmond, Huxley: From Devil's Disciple to Evolution's
High Priest
- Theodosius Dobzhansky
- Mankind Evolving: The Evolution of the Human
Species
- Genetics and the Origin of Species
- Nikolay V. Dokholyan and Eugene I. Shakhnovich, "Understanding
hierarchical protein evolution from first principles," cond-mat/0104469
- Rick Durrett and Jason Schweinsberg, "A coalescent model for the
effect of advantageous mutations on the genealogy of a population", math.PR/0411071
- Gunther Eble
- Manfred Eigen, The Hypercycle, a Principle of Natural Self-organization
- Bryan K. Epperson, Geographical Genetics
- A. Eriksson, B. Haubold, and B. Mehlig, "Statistics of
selectively neutral genetic variation," physics/0111205
- David E. Fastovsky and David B. Weishampel, The Evolution and
Extinction of the Dinosaurs
[blurb]
- Luca Ferraro, Andrea Giansanti, Giovanni Giuliano and Vittorio
Rosato, "Co-expression of statistically over-represented peptides in proteomes:
a key to phylogeny?", q-bio.MN/0410011
- Walter Fontana and Peter Schuster, "Continuity in Evolution: On the
Nature of Transitions," SFI
Working Paper 98-04-030
- Barbara Forrest and Paul R. Gross, Creationism's Trojan
Horse: The Wedge of Intelligent Design
- Sergey Gavrilets, Fitness Landscapes and the Origin of
Species
- Diane P. Genereuz and Carl T. Bergstrom, "Evolution in Action:
Understanding antibiotic resistance" [forthcoming book
chapter; PDF]
- Ulrich Gerland and Terence Hwa, "On the Selection and Evolution of
Regulatory DNA Motifs," physics/0112039
- M. T. Ghiselin, The Triumph of the Darwinian Method
- John H. Gillespie, The Causes of Molecular Evolution
- Thomas J. Givnish and Kenneth J. Sytsma (eds.), Molecular
Evolution and Adaptive Radiation
- Peter Godfrey-Smith, Darwinian Populations and
Natural Selection [blurb]
- Stephen Jay Gould
- Full House
- Ontogeny and Phylogeny
- The Structure of Evolutionary Theory
- B. Rosemary Grant and Peter R. Grant, Evolutionary Dynamics
of a Natural Population: The Large Cactus Finch of the Galapagos
- Dan Graur and William Martin, "Reading the entrails of chickens:
molecular timescales of evolution and the illusion of precision",
Trends in Genetics 20 (2004): 80--86
[PDF
reprint]
- Joachim Hermisson, Oliver Redner, Holger Wagner and Ellen Baake,
"Mutation-Selection Balance: Ancestry, Load, and Maximum Principle," cond-mat/0202432
- Hofbauer and Sigmund, Evolutionary Games and Replicator
Dynamics
- David Hull, Science and Selection: Essays on Biological
Evolution and Philosophy of Science
- G. Evelyn Hutchinson, The Ecological Theater and the
Evolutionary Play
- Eva Jablonka and Marion J. Lamb, Evolution in Four
Dimensions: Genetic, Epigenetic, Behavioral, and Symbolic Variation in the
History of Life [Blurb]
- Kavita Jain and Joachim Krug, "Evolutionary trajectories in rugged
fitness landscapes", q-bio.PE/0501028 = Journal of
Statistical Mechanics: Theory and Experiment (2005): P04008
- P. D. Jarvis, J. D. Bashford and J. G. Sumner, "Path integral
formulation of phylogenetic branching processes", q-bio.PE/0411047
- Marc Kirschner and John Gerhart, The Plausibility of
Life
[Capsule review
by Frenando Pereira]
- Laura Landweber and Erik Winfree (ed.), Evolution as
Computation [The dual subject to evolutionary
computation, as were]
- Daniel J. Lawson and Henrik Jeldtoft Jensen, "Evolution as
Diffusion in type
space", q-bio.PE/0609009
- George Levine, Darwin Loves You: Natural Selection and the
Re-Enchantment of the World
- Konrad Lorenz, Evolution and Modification of Behavior
- Elisabeth Lloyd, The Structure and Confirmation of
Evolutionary Theory
- Long, Rise of the Fishes
- David J. C. MacKay, "Rate of Information Acquisition by a Species
subjected to Natural Selection" [link]
- Marc Mangel, The Theoretical Biologist's Toolbox:
Quantitative Methods for Ecology and Evolutionary Biology
[blurb]
- John Maynard Smith, Evolutionary Genetics [Review by Danny
Yee]
- Ernst Mayr
- Growth of Biological Thought
- Populations, Species and Evolution
- This Is Biology
- What Evolution Is
- Margaret McFall-Ngai, "Adaptive Immunity: Care for the community",
Nature 445
(2007): 153
- Géza Meszéna, Mats Gyllenberg, Frans J. Jacobs and
Johan A. J. Metz, "Link between Population Dynamics and Dynamics of Darwinian
Evolution",
Physical Review
Letters 95 (2005): 078105
- Richard E. Michod, Darwinian Dynamics: Evolutionary
Transitions in Fitness and Individuality
- Jacek Miekisz, "Evolutionary game theory and population dynamics",
q-bio.PE/0703062
[51 pp. lecture notes]
- Stephen P. Miller, Mark Lunzer, and Antony M. Dean, "Direct
Demonstration of an Adaptive Constraint", Science 314 (2006):
458--461
- Mitton, Selection in Natural Populations
- Karl J. Niklas, The Evolutionary Biology of Plants [Ringing endorsement
from Danny Yee --- "the most rewarding book on evolution I have read for
years."]
- Martin Nilsson, "Optimal Mutation Rates on Static Fitness
Landscpes", physics/0005018
- Karen M. Page and Martin A. Nowak, "Unifying Evolutionary
Dynamics", Journal of
Theoretical Biology 219 (2002): 93--98 [Thanks
to David Krakauer]
- John Pepper, "Relatedness in Trait Group Models of Social
Evolution," SFI Working Paper 00-07-034
- Frank J. Poelwijk, Daniel J. Kiviet, Daniel M. Weinreich and Sander J. Tans, "Empirical fitness landscapes reveal accessible evolutionary paths",
Nature 445
(2007): 383--386
- Angela Potochnik, "Optimality Modeling and Explanatory Generality",
phil-sci/3011
- William B. Provine
- The Origins of Theoretical Population Genetics
- Sewall Wright and Evolutionary Biology
- José M. Ranz and Carlos A. Machado, "Uncovering evolutionary
patterns of gene expression using
microarrays", Trends in Ecology and
Evolution 21 (2006): 29--37
- Alex Rosenberg, Darwinian Reductionism: Or, How to Stop
Worrying and Love Molecular Biology [Blurb]
- Scott W. Roy and Walter Gilbert, "Complex early genes",
Proceedings of the
National Academy of Sciences (USA) 102 (2005):
1986--1991
- Tor Schoenmeyr and Claus O. Wilke, "Finite genome size can halt
Muller's ratchet," physics/0109058
- Veit Schämmle and E. Brigatti, "Speciational view of
macroevolution: are micro and macroevolution decoupled?", q-bio.PE/0509032 ["We
introduce a simple computational model that, with a microscopic dynamics driven
by natural selection and mutation alone, allows the description of true
speciation events. A statistical analysis of the so generated evolutionary tree
captures realistic features showing power laws for frequency distributions in
time and size. Albeit these successful predictions, the difficulty in obtaining
punctuated dynamics with mass extinctions suggests the necessity of decoupling
micro and macro-evolutionary mechanisms in agreement with some ideas of Gould's
and Eldredge's theory of punctuated equilibrium."]
- William A. Searcy and Stephen Nowicki, The Evolution of
Animal Communication: Reliability and Deception in Signaling Systems
[Blurb, intro]
- Homer William Smith, From Fish to Philosopher [Any
book which is listed under both "Consciousness" and "Kidneys" in the
library of Congress classification is worth looking up]
- Nigel Snoad, Limits to Evolvability in Changing
Environments [Ph.D. thesis, Australian National University, 2000]
- Elliott Sober
- Ricard V. Solé, Isaac Salazar-Ciudad and Jordi
Garcia-Fernández, "Landscapes, Gene Networks and Pattern Formation: On
the Cambrian Explosion," SFI
Working Paper 00-08-046
- Stanley, Extinction
- Mark Stegeman and Paul Rhode, "Stochastic Darwinian equilibria in
small and large populations", Games and Economic
Behavior 49 (2004): 171--214
- Jun Sun and Michael W. Deem, "Spontaneous Emergence of Modularity
in a Model of Evolving Individuals", Physical Review
Letters 99 (2007): 228107
- Gergely J Szollosi, Imre Derenyi, "Hierarchical meanfield theory of
evolutionary games on structured
populations", arxiv:0704.0357
- Simon Tavaré "Ancestral Inference in Population Genetics"
- Guido Tiana, Boris E. Shakhnovich, Nikolay V. Dokholyan and Eugene
I. Shakhnovich, "Imprint of evolution on protein structures", PNAS
101 (2004): 2846--2851
- Daniel P. Todes, Darwin Without Malthus: The Struggle for
Existence in Russian Evolutionary Thought
- Arne Traulsen, Jens Christian Claussen, and Christoph Hauert,
"Coevolutionary Dynamics: From Finite to Infinite Populations",
Physical Review
Letters 95 (2005): 238701
- Arne Traulsen, Jorge M. Pacheco and Lorens A. Imhof, "Stochasticity
and evolutionary stability", q-bio.PE/0609021
= Physical
Review E 74 (2006): 021905
- Annie E. Tsong, Brian B. Tuch, Hao Li and Alexander D. Johnson,
"Evolution of alternative transcriptional circuits with identical logic",
Nature
443 (2006): 415--420
- J. Scott Turner, The Tinkerer's Accomplice: How Design Emerges From Life Itself
- Geerat J. Vermeij
- Christian von Mering, Evgeny M. Zdobnov, Sophia Tsoka, Francesca D.
Ciccarelli, Jose B. Pereira-Leal, Christos A. Ouzoounis and Peer Bork, "Genome
evolution reveals biochemical networks and functional modules",
PNAS 100
(2003): 15428--15433
- Andreas Wagner, Robustness and Evolvability in Living
Systems [Blurb,
chapter 1]
- Gunter P. Wagner, Chris Amemiya and Frank Ruddle, "Hox cluster
duplications and the opportunity for evolutionary novelties", PNAS 100
(2003): 14603--14606
- Denis M. Walsh, "Fit and Diversity: Explaining Adaptive
Evolution", Philosophy of Science 70 (2003):
280--301
- Richard A. Watson, Compositional Evolution: The Impact of
Sex, Symbiosis, and Modularity on the Gradualist Framework of Evolution
[Blurb]
- Claus O. Wilke
- "Adaptive evolution on neutral networks," physics/0101021
- "Selection for Fitness vs. Selection for Robustness in RNA
Secondary Structure Folding," physics/0103022
- Claus O. Wilke and D. Allan Drummond, "Population genetics of
translational robustness", q-bio.PE/0509031
- Claus O. Wilke, Christopher Ronnewinkel and Thomas Martinetz,
"Dynamic Fitness Landscapes in Molecular Evolution," physics/9912012
#
The Left
The right-thinking people who move us forward.
Whigs, philosophes, Liberals, Radicals, Leftists, Progressives,
etc. History and views; causes of our present decrepitude and ways to fix it.
See also:
Cultural Criticism;
Economics;
the Enlightenment;
Environmentalism;
Feminism;
Socialism;
Revolutions and Revolutionaries;
the Right;
Unions;
the Welfare State
See:
- Akhil Reed Amar and Alan Hirsch, For the People: What the
Constitution Really Says about Your Rights
- Paul Berman, A Tale of Two Utopias: The Political Journey of the Generation of 1968
- Sheri Berman, The Primacy of Politics: Social Democracy and
the Making of Europe's Twentiet Century
- Michael Bérubé, The Left at War
- Robert A. Dahl, A Preface to Economic Democracy
- William O. Douglas, Points of Rebellion [A sermon on
the text that "a little rebellion now and again is a good thing", circa 1970.
To see how both American liberalism and the Republic have declined, recall that
Douglas was a Supreme Court justice at the time.]
- Barbara Ehrenreich
- Susan George, "How to Win
the War of Ideas: Lessons from the Gramscian Right"
- Todd Gitlin, Straight from the Sixties
- Albert Hirschman, "The Rhetoric of Reform" [online]
- Robert Hughes, Culture of Complaint
- Stephen Holmes, An Anatomy of Antiliberalism
- Leszek Kolakowski
- Towards a Marxist Humanism [British title:
Marxism and Beyond]
- Main Currents of Marxism (his magnum opus)
- Modernity on Endless Trial
- Paul Krugman,
Peddling Prosperity
- John Stuart Mill, On
Liberty
- John Roemer, A Future for Socialism [Review: The Red Monday Efficient
Allocation Blues]
- Richard Rorty, "First Projects,
Then Principles" [Even though he's wrong about the effects of international trade]
- Bertrand Russell
- Freedom and Organization, 1814--1914
- Power
- Authority and the Individual
- George Soros, "The
Capitalist Threat" and "Towards a Global
Open Society".
To read:
- Ronald Aronson, After Marxism
- Brian Barry, Why Social Justice Matters
- Becker, Heavenly City of the 18th Century
Philosophers
- David Belkin, The
Left and Limited Government
- Norman Birnbaum, After Progress: American Social Reform and
European Socialism in the Twentieth Century
- Carl Boggs, Social Movements & Political Power: Emerging
Forms of Radicalism in the West
- Derek Bok, The Trouble with Government [blurb]
- Stephen Eric Bronner, Reclaiming the Enlightenment:
Towards a Politics of Radical Engagement
- Peter N. Carroll, Odyssey of the Abraham Lincoln
Brigade
- Coleman, The Liberal Conspiracy (Congress for Cultural
Freedom, CIA)
- Condorcet
- Bogan Denitch, After the Flood: World Politics and Democracy
in the Wake of Communism
- Charles Derber et al., What's Left? Radical
Politics in the Postcommunist Era
- George Eley, Forging Democracy: The History of the Left in
Europe, 1850--2000
- Todd Gitlin, The Twilight of Common Dreams
- Stuart Hall, The Hard Road to Renewal
- Judge Learned Hand, Spirit of Liberty
- Stephen Hart, Cultural Dilemmas of Progressive Politics:
Styles of Engagement among Grassroots Activists
[Blurb]
- Paul Hollander, Political Pilgrims: Western Intellectuals in
Search of the Good Society [Covers only leftists. This is unfortunate,
since the pilgrimages of leftists to the Soviet Union in the 1930s were
paralleled by those of rightists to Italy and Germany...]
- Hughes, Sophisticated Rebels (French left since the
'60s)
- John M. Jordan, Machine-Age Ideology: Social Engineering and
American Liberalism, 1911--1939
- Ira Katznelson, Liberalism's Crooked Circle: Letters to
Adam Michnick
- Khilnani, Arguing Revolution (French leftists)
- Joss Marsh, Word Crimes: Blasphemy, Culture, and Literature
in Nineteenth-Century England ["tells the forgotten stories of more
than two hundred working-class `blasphemers,' whose stubborn refusal to silence
their `hooligan' voices helped secure our rights to speak and write freely
today."]
- Kevin Mattson, Intellectuals in Action: The Origins of
the New Left and Radical Liberalism, 1945--1970
- Gerassimos Moschonas, In the Name of Social Democracy: The
Great Transformation, 1945 to the Present
- Cosma Orsi, "The Political Economy of Solidarity"
[PDF. No relation.]
- C. Owen Peapke, Evolution of Progress
- Rawls
- A Theory of Justice
- Political Liberalism
- John Reed, Ten Days that Shook the World
- John Roemer, Equality of Opportunity
- Saul, Voltaire's Bastards
- Amartya Sen, Inequality Reexamined [Review by Danny
Yee around someplace]
- Victor Serge, Memoirs of a Revolutionist
- Charles Shipman, It Had to Be Revolution: Memoirs of an
American Radical
- Gregory D. Sumner, Dwight Macdonald and the Politics Circle:
The Challenge of Cosmopolitan Democracy
- Eric Voeglin, New Science of Politics [Ouch. You talk
about your reactionaries. "Conservative" is much too pale a word for Voeglin;
he's extremely metaphysical, possibly Thomist, certainly peddling an odd hybrid
of Christianity, Plato and Aristotle, for which the State's "representing" some
sort of "truth" is decidedly more important than representing the actual people
who have to live with it. But very interesting.]
- Ellen Willis
- Don't Think, Smile!
- No More Nice Girls
- Garry Wills, Necessary Evil
#
Learning in Games
See also
Collective Cognition;
Evolutionary Economics;
Machine Learning, Statistical Inference and Induction;
the Minority Game;
Sequential Decisions Under
Uncertainty;
Universal Prediction Algorithms
Recommended:
- Jenna Bednar and Scott Page, "Games Theory and Culture" [PDF]
- Nicolo Cesa-Bianchi and Gabor Lugosi, Prediction, Learning,
and Games [Mini-review]
- Dean P. Foster and H. Peyton Young, "Learning, hypothesis testing,
and Nash equilibrium," Games and Economic
Behavior 45 (2003): 73--96 [pdf]
- Herbert Gintis, Game Theory Evolving: A Problem-Centered
Introduction to Modeling Strategic Interaction
- Ariel Rubinstein, Modeling Bounded Rationality [Review: O docta
simplicitas!]
- Larry Samuelson (no relation of the Samuelson),
Evolutionary Games and Equilibrium Selection
- José M. Vidal and Edmund H. Durfee, "Predicting the Expected
Behavior of Agents That Learn About Agents: The CLRI Framework," cs.MA/0001008
- H. Peyton Young, Individual Strategy and Social Structure: An
Evolutionary Theory of Institutions [Review: A Myopic (and Sometimes
Blind) Eye on the Main Chance, or, the Origins of Custom]
To read:
- Jacob Abernethy, Alekh Agarwal, Peter L. Bartlett, Alexander Rakhlin, "A Stochastic View of Optimal Regret through Minimax Duality", arxiv:0903.5328
- James Bergin and Barton L. Lipman, 1996, "Evolution with
State-Dependent Mutations," Econometrica 64
(1996): 943--956
- Andreas Blume, "A Learning-Efficiency Explanation of Structure in
Language", Theory
and Decision 57 (2004): 265--285
- Lawrence E. Blume and David Easley
- "If You're So Smart, Why Aren't You Rich? Belief Selection
in Complete and Incomplete Markets," SFI Working Paper 01-06-031
- "Optimality and Natural Selection in Markets," SFI
Working Paper 98-09-0 82
- Oliver Board, "Dynamic interactive epistemology", Games and
Economic Behavior 49 (2004): 49--80
- Christophe Chamley, Rational Herds: Economic Models of Social
Learning
- Emilio De Santis and Carlo Marinelli, "Stochastic games with
infinitely many interacting
agents", math.PR/0505608
[Sounds very cool: "study a class of infinite-horizon non-zero-sum
non-cooperative stochastic games with infinitely many interacting agents using
ideas of statistical mechanics.... in the general case of asymmetric
interactions, the existence of a strategy that allows any player to eliminate
losses after a finite random time. In the special case of symmetric
interactions ... as time goes to infinity, the game converges to a Nash
equilibrium. Moreover, assuming that all agents adopt the same strategy, using
arguments related to those leading to perfect simulation algorithms, spatial
mixing and ergodicity are proved ... ergodicity [implies] ``fixation'',
i.e. that players will adopt a constant strategy after a finite
time. ... related to zero-temperature Glauber dynamics on random graphs of
possibly infinite volume."]
- Pradeep Dubey and Ori Haimanko, "Learning with Perfect
Information", Games and Economic
Behavior 46 (2004): 304--324
- Jim Engle-Warnick, William J. McCausland and John H. Miller,
"The Ghost in the Machine: Inferring Machine-Based Strategies from
Observed Behavior" [i.e., inferring stochastic transducers from data; hence
the inclusion here]
- Anders Eriksson and Kristian Lindgren, "A simple model of cognitive
processing in repeated
games", q-bio.PE/0608015
- Fudenberg and Levine, The Theory of Learning in Games
- Douglas Gale and Hamid Sabourian, "Complexity and Competition",
Econometrica 73
(2005): 739--769 ["Extensive-form market games typically have a large
number of noncompetitive equilibria. In this paper, we argue that the
complexity of noncompetitive behavior provides a justification for competitive
equilibrium in the sense that if rational agents have an aversion to complexity
(at the margin), then maximizing behavior will result in simple behavioral
rules and hence in a competitive outcome. For this purpose, we use a class of
extensive-form dynamic matching and bargaining games with a finite number of
agents. In particular, we consider markets with heterogeneous buyers and
sellers and deterministic, exogenous, sequential matching rules, although the
results can be extended to other matching processes. If the complexity costs of
implementing strategies enter players' preferences lexicographically with the
standard payoff, then every equilibrium strategy profile induces a competitive
outcome."]
- Val E. Lambson and Daniel A. Probst, "Learning by Matching
Patterns", Games and Economic
Behavior 46 (2004): 398--409
- Jacek Miekisz
- "Statistical mechanics of spatial evolutionary games", cond-mat/0210094
- "Stochastic stability in spatial games", cond-mat/0409647 = Journal of Statistical Physics 117 (2004): 99--110
- "Long-run behavior of games with many players", cond-mat/0409742
- Gillies Pagès, "A two armed bandit type problem
revisited", math.PR/0502182
- Liviu Panait, Karl Tuyls, Sean Luke, "Theoretical Advantages of
Lenient Learners: An Evolutionary Game Theoretic Perspective",
Journal of
Machine Learning Research 9 (2008): 423--457
- Mark Stegeman and Paul Rhode, "Stochastic Darwinian equilibria in
small and large populations", Games and Economic
Behavior 49 (2004): 171--214
- José M. Vidal, Computational Agents That Learn About
Agents: Algorithms for Their Design and a Predictive Theory of Their
Behavior [Ph.D. thesis, U. Michigan, 1998;
on-line]
#
Economics
I have always felt a certain horror of political economists, since
I heard one of them say that he feared the famine of 1848 in Ireland would not
kill more than a million people, and that would scarcely be enough to do much
good.
---Attrib. to Benjamin Jowett
History of markets. QWERTY. Other parts
of evolutionary economics. Arrow-Debreu model of
general equilibrium. Market failure. Input-output
models. Planning and regulation, esp. during the World Wars. Economic
policy of US, Europe, East Asia, 3rd
World. Development economics, growth
theory. Corporations. Finance.
Claims of analogies between physics and conventional economics (whether with
laudatory or debunking intent) --- not to be confused with applying methods of
theoretical physics to economic questions
("econophysics"). Agent-based
modeling.
See also
decision theory;
historical materialism;
information economy;
institutional economics;
socialism, market socialism
Recommended non-technical works:
- Dean Baker, The Conservative Nanny State: How the Wealthy Use
the Government to Stay Rich and Get Richer
[Full text free online]
- J. Bradford De
Long [Prof. at UC Berkeley; arrived after I graduated, or else his
courses'd be another foolishly wasted opportunity. What's
New page.]
- William Easterly, The Elusive Quest for Growth: Economists'
Adventures and Misadventures in the Tropics
- Barry Eichengreen, Globalizing Capital: A History of the
International Monetary System [Review: Turning the Wheels]
- Robert H. Frank, Passions within Reason: The Strategic Role
of Emotions
- Justin Fox, The Myth of the Rational Market: A History of Risk, Reward, and Delusion on Wall Street [Review by Steve
Laniel]
- John Kenneth Galbraith
- Robert Heilbroner, The Worldly Philosophers [If you
don't have time to actually learn economics, this will help you fake it.
Faking it is very important in a soi-disant
information society, and for the following reason: if you are over your
head in debt, the last thing you want to do is suddenly live frugally.]
- Doug Henwood, Wall Street: How It Works and for Whom
[Free online!]
- John Maynard Keynes
- Economic
Consequences of the Peace [To the best of my knowledge, the
only accurate economic prophecy ever, written in Keynes's usual
immaculate prose.]
- Essays in Persuasion [Especially the last
essay, "Economic Possibilities for Our Grandchildren".]
- Paul Krugman
- The Age of Diminished Expectations
- Peddling Prosperity [Shows, very effectively,
that all the conventional wisdom about America's economic problems --- "the
twin deficits," globalization (for more on which, see Pop
Internationalism), the success of supply-side economics, "high-value
added, high-technology manufacturing jobs," etc., etc., is wrong.]
- Pop Internationalism [Review: International Economics with
the Brothers Grimm, or, Krugman Discovers Ideology]
- Charles Lindblom
- The Market System: What It Is, How It
Works, and What to Make of It [Probably the single best treatment of
the subject I have read. Reviews by Danny Yee and George Scialabba.]
- "The Market as Prison", The Journal of Politics
44 (1982): 324--336
[JSTOR]
- Karl Polanyi, The Great Transformation
- Thomas Schelling, Micromotives and Macrobehavior
- Joseph Schumpeter, Capitalism, Socialism, and
Democracy [Schumpeter was arrogant, elitist, reactionary and brilliant.
Half the time he had me saying "why didn't I see that?" and half the time he
has my blood boiling. These two states are not mutually exclusive. --- My
father suggests that Schumpeter should be counted as the most unorthodox
Marxist ever, but perhaps he must share the honor with Neurath.]
- Amartya Sen, Development as Freedom
- Robert J. Shiller, Irrational Exuberance [Is the stock
market overvalued? Hell yes! (Comment written in late 1999; still true in
April 2001.)]
- Herbert Simon
- The Sciences of the Artificial, [Especially
ch. 2, "Economic Rationality." "As Samuel Johnson said of the dancing dog,
`The marvel is not that it dances well, but that it dances at all' ---- the
marvel is not that markets optimize (they don't) but that they often clear."]
- "Rationality as Process and as Product of Thought",
American Economic Review 68 (1978): 1--16 [JSTOR]
- Tom Slee, No One Makes You Shop at Wal-Mart: The Surprising
Deceptions of Individual Choice [Comments]
- Robert Solow, Work and Welfare [On-line essay
derived therefrom]
- Statistical
Abstract of the United States ["The best book published in the
America.... If more people would get into the habit of checking it, our
politics would be utterly transformed." --- Paul Krugman.]
- Lester Thurow, The Zero-Sum Society [But many of his
later books aren't worth the time, and some are actively misleading]
- Shigeto Tsuru, Japan's Capitalism
Recommended semi-technical works (jargon but no math):
- Samuel Bowles and
Herbert Gintis, "Walrasian
Economics in Retrospect", Quarterly Journal of Economics (2000):
1411--1439 [PDF
reprint]
- David Colander, Hans Follmer, Armin Haas, Michael Goldberg,
Kataraina Juselius, Alan Kirman, Thomas Lux and Brigitte Sloth,
"The Financial Crisis and the Systemic Failure of Academic Economics"
[PDF preprint]
- David Colander, Peter Howitt, Alan Kirman, Axel Leijonhufvud and Perry Mehrling, "Beyond DSGE Models: Towards an Empirically-Based Macroeconomics"
[PDF preprint]
- Jan Elster, "Excessive
Ambitions", Capitalism
and Society 4:2 (2009): 1
- Bennett Harrison,
Lean and Mean [Everything you know about small companies and
business networks is wrong.]
- F. A. Hayek, Individualism and Economic Order [Bearing
in mind the important distinction between Hayek-the-profound-social-scientist,
and his evil twin, Hayek-the-right-wing-ideologue.
"The Use of
Knowledge in Society" is superb, and in
"Economics
and Knowledge" he seems to have been channeling evolutionary game
theory...]
- Frank Levy and Peter Temin, "Inequality and Institutions in 20th
Century America", MIT Economics Working Paper 07-17
[SSRN]
- Alexander Rosenberg, Economics: Mathematical Politics or
Science of Diminishing Returns? [Mini-review]
- Reinhard
Selten, "Evolution, Learning, and Economic Behavior", Games and
Economic Behavior 3 (1991): 3--24 [In the form of a
dialogue between a Bayesian, an economist, an experimental psychologist, an
adaptationist biologist, a population geneticist, and an ethologist]
- Joseph Stiglitz
- Whither Socialism? [This book is
really much broader than its title suggests; effectively it's a primer
on the economics of imperfect information and imperfect competetion, and their
implications. Review by Steve Laniel]
- "Information and the Change in the Paradigm in Economics" [Nobel Prize lecture, 2001. PDF]
- Sweezy and Magdoff [Of course they're Marxists. They're also damn
good economists, and they've supported distinctly fewer dictators than Milton
Friedman. Deal. --- These books, it should be said, are about capitalism, not
its alternatives.]
- Stagnation and the Financial Explosion
- The Irreverisble Crisis
- Richard H. Thaler, The Winner's Curse: Paradoxes and
Anomalies of Economic Life [Why rational-expectations-and-efficiency is
demonstrably not right, though there's no comprehensive replacemen]
Recommended technical works (math and/or heavy jargon):
- Nabil al-Najjar, "Decision Makers as Statisticians:
Diversity, Ambiguity and Learning", Econometrica
forthcoming [preprint. The bits with
finitely-additive probability are weird; I think they could be replaced by
sticking to countably additive probability and considering convergence rates.]
- Theodore Bergstrom and John Miller, Experiments with Economic
Principles [Textbook on experimental economics; very nice]
- Samuel Bowles, Microeconomics: Behavior, Institutions, and
Evolution [In my humble and supremely unqualified opinion, the best book
on microeconomics now available.]
- Samuel Bowles and Herbert Gintis, "The Inheritance of
Inequality", Journal of Economic Perspectives 16
(2002): 3--30 [PDF
reprint]
- Bent Jesper Christensen and Nicholas M. Kiefer, Economic
Modeling and Inference [Good as a summary of current recommended
practices for setting up, solving, and estimating dynamic programming models.
I wanted to like this more than I did. Review: An Optimal Path to a Dead End.]
- Gerard Debreu, Theory of Value: An Axiomatic Analysis of
Economic Equilibrium [One of the stupidest things I ever did was pass up
a chance to take mathematical economics from Debreu. This is neo-classical
economics in its purest form, and absolutely beautiful. PDF]
- Steven N. Durlauf, "How Can Statistical Mechanics Contribute to
Social Science?" Proceedings
of the National Academy of Sciences USA 96 (1999):
10582--10584
- Herbert
Gintis, Game Theory Evolving: A Problem-Centered Introduction to
Modeling Strategic Interaction
[Author's
book-site]
- Trygve Haavelmo, "The Probability Approach in Econometrics",
Econometrica 12 (1944, supplement): iii--115
[JSTOR]
- Jack Hirshleifer, "The Private and Social Value of Information and
the Reward to Inventive Activity", American Economic Review
61 (1971): 561--574
[JSTOR]
- Charles Kenny, "What Does the Eastern European Growth Experience
Tell Us About the Policy and Convergence Debates?"
[PDF
preprint]
- Alan Kirman, "Whom or What Does the Representative Individual
Represent?", Journal of Economic Perspectives 6
(1992): 117--136
[Answer: no one; accordingly it "deserves to be buried". JSTOR]
- Paul Krugman
- Development, Geography and Economic Theory
- Geography and Trade
- The Self-Organizing Economy [Review]
- Aki Lehtinen and Jaakko Kuorikoski, "Computing the Perfect Model:
Why Do Economists Shun
Simulation", Philosophy
of Science 74 (2007): 304--329 [This seems right,
but more like reasons for economists to change their ideals than anything
else.]
- Charles Manski, Identification for Prediction
and Decision [Review:
Better Roughly Right Than Exactly Wrong]
- Rosario N. Mantegna and H. Eugene Stanley, An Introduction to
Econophysics: Correlations and Complexity in Finance [Review: Not Exactly Rocket
Science]
- Oskar Morgenstern, On the Accuracy of Economic
Observations [Read this, and you will never pay any attention to
forecasters, or half of the economic news, ever again: and you'll be better off
for it.]
- The New Palgrave Dictionary of Economics [An
encyclopedia, really, with excellent articles from all over the spectrum, many
from Big Names --- Arrow, Debreu, Nove, Simon, Sweezy, Winter, etc. --- and a
rich feeling for history throughout.]
- Ariel Rubinstein, Modeling Bounded Rationality [Review: O docta
simplicitas!]
- Herbert Simon
- An Empirically-based Microeconomics
- Models of Bounded Rationality
- Robert Solow
- Growth Theory: An Exposition
- Learning from "Learning by Doing": Lessons for
Economic Growth
- Laura Spierdijk and Mark Voorneveld, "Superstars without
Talent? The Yule Distribution Controversy", The Review of Economics and Statistics 91 (2009): 648--652 [PDF preprint]
- John Sutton
- "Gibrat's Legacy", Journal of Economic
Literature 35 (1997): 40--59 [JSTOR]
- Marshall's Tendencies: What Economists Can
Know?
[Micro-review.]
- Technology and Market Structure: Theory and
History
To read:
- George A. Akerlof, Explorations in Pragmatic Economics
- George Akerlof and Robert Shiller, Animal Spirits: How
Human Psychology Drives the Economy, and Why It Matters for Global
Capitalism [Blurb]
- Peter S. Albin, Barriers and Bounds to Rationality: Essays on
Economic Complexity and Dynamics in Interactive Systems [Blurb]
- Armen A. Alchian, Economic Forces at Work
- Michael Alexeev and Robert Conrad, "The Elusive
Curse of Oil", The Review
of Economics and Statistics 91 (2009): 586--598
- Masanao Aoki
- New Approaches to Macroeconomic Modeling:
Evolutionary Stochastic Dynamics, Multiple Equilibria, and Externalities as
Field Effect
[Blurb]
- Modeling Aggregate Behavior and Fluctuations in
Economics: Stochastic Views of Interacting Agents [Blurb]
- Backhouse, The Ordinary Business of Life
- Nicholas Bardsley, Robin Cubitt, Peter Moffatt, Graham Loomes,
Chris Starmer and Robert Sugden, Experimental Economics: Rethinking the
Rules [blurb]
- William J. Baumol
- The Free-Market Innovation Machine
- Good Capitalism, Bad Capitalism
- J.-P. Bénassy, The Macroeconomics of Imperfect
Competition and Nonclearing Markets: A Dynamic General Equilibrium
Approach [Blurb]
- Giuseppe Bertola, Reto Foellmi, and Josef
Zweim¨ller, Income Distribution in Macroeconomic Models
[Blurb, ch. 1]
- Richard Blundell and Thomas M. Stoker, "Heterogeneity
and Aggregation", Journal of Economic Literature
43 (2005): 347--391 [JSTOR]
- Mark Blyth, Great Transformations: Economic Ideas and
Institutional Change in the Twentieth Century
- Tito Boeri and Jan van Ours, The Economics of Imperfect Labor Markets [blurb, ch. 1]
- Samuel Bowles, Steven N. Durlauf and Karla Hoff (eds.),
Poverty Traps
[Blurb, intro]
- Steven Brakman and Ben J. Heijdra (eds.), The Monopolistic
Competition Revolution in Retrospect
- Robert Brenner, The Boom and the Bubble
- George P. Brockway, Economists Can Be Bad for Your Health:
Second Thoughts on the Dismal Science
- Pierre Cahuc and Andre Zylberberg, The Natural Survival of
Work: Job Creation and Job Destruction in a Growing Economy
[Blurb]
- Colin F. Camerer, Behavioral Game Theory: Experiments in
Strategic Interaction
- Colin F. Camerer and Ernst Fehr, "When Does 'Economic Man' Dominate
Social Behavior?", Science 311
(2006): 47--52
- Fred Campano and Dominick Salvatore, Income Distribution
- Card and Krueger, Myth and Measurement [The minimum
wage]
- Richard E. Caves, Creative Industries: Contracts between
Art and Commerce
- Carl Chiarella, Foundations for a Disequilibrium Theory of
the Business Cycle: Qualitative Analysis and Quantitative Assessment
[Blurb]
- Michael P. Clements and David F. Hendry (eds.), Companion
to Economic Forecasting
- Russell W. Cooper, Coordination Games: Complementarities and Macroeconomics
- James Crotty, "Are Keynesian Uncertainty and Macrotheory
Compatible? Conventional Decision Making, Institutional Structures, and
Conditional Stability in Keynesian Macromodels" [PDF preprint]
- G. Cuniberti, A. Valleriani and J. L. Vega, "Effects of regulation
on a self-organized market," cond-mat/0108533 [Conclusion:
"the introduction of regulation on the market contributes to lower the average
fitness of companies." Of course, since "regulation" is interpreted as a
penalty on companies whose fitness improves faster than normal, this is a
completely transparent and completely tendentious result...]
- Sebastian de Grazia, Of Time, Work and Leisure
- David N. DeJong and Chetan Dave, Structural Macroeconometrics [blurb,
sample chapters]
- Marco Del Negro, Frank Schorfheie, Frank Smets and Rafael
Wouters, "On the Fit of New Keynesian Models", Journal of Business and Economic Statistics 25 (2007): 123--162 [Including discussion and reply]
- T. Di Matteo, T. Aste and M. Gallegati, "Innovation flow through
social networks: Productivity distribution", physics/0406091 [Those look an
awful lot like log-normals to me.]
- Avinash Dixit, The Making of Economic Policy: A
Transaction-Cost Politics Perspective
- Kevin Doogan, New Capitalism? The Transformation of Work
- Robert Dorfman, Paul Samuelson and Robert Solow, Linear
Programming and Economic Analysis
- Robert B. Ekelund and Robert F. Hebert, Secret Origins of Modern Microeconomics: Dupuit and the Engineers [Blurb. The book description
raises a nice question, which I'm sure the book deals with. Assume, for the
sake of argument, that these engineers did invent the ideas of modern micro.
before Marshall et al. did — did anyone else pick up on them, or did
everyone actually get them from Marshall? In the later case, in what sense
did modern micro. originate with these people?]
- Yi Feng, Democracy, Governance, and Economic Performance:
Theory and Evidence [Blurb]
- Franklin M. Fisher, Disequilibrium Foundations of Equilibrium
Economics [blurb]
- Marion Fourcade, Economists and Societies: Discipline and Profession in the United States, Britain, and France, 1890s to 1990s
[Blurb]
- Robert H. Frank
- Choosing the Right Pond: Human Behavior and the Quest
for Status
- Luxury Fever
- and P. K. Cook, The Winner-Take-All Society
- Roman Frydman and Michael D. Goldberg, Imperfect Knowledge
Economics: Exchange Rates and Risk [blurb, ch. 1]
- Herbert Gintis, Samuel Bowles, Robert T. Boyd and Ernst Fehr
(eds.), Moral Sentiments and Material Interests: The Foundations of
Cooperation in Economic Life
- Paul Glimcher, Decisions, Uncertainty, and the Brain: Science
of Neuroeconomics
- Dhananjay K. Gode, Shyam Sunder, "Allocative Efficiency of Markets with Zero-Intelligence Traders: Market as a Partial Substitute for Individual Rationality", The Journal of Political Economy 101 (1993): 119--137 [JSTOR]
- Zvi Griliches, "Economic Data Issues" in Handbook of
Econometrics, vol. III. [Opens: "My father would never eat minced meat
patties in the old country. He would not eat them in restaurants because he
didn't know what they were made of, and he wouldn't eat them at home because he
did." --- Incidentally, that a set of four books, each large and solid enough
to be used as catapult ammunition, can call itself a "handbook" is one of the
wonders of the dismal science.]
- Gene M. Grossman, Innovation and Growth in the Global
Economy
- Benedetto Gui and Robert Sugde (eds.), Economics and Social
Interaction: Accounting for Interpersonal Relations
[Blurb]
- Hari M. Gupta and Jose R. Campanha, "Firms Growth Dynamics,
Competition and Power Law Scaling," cond-mat/0201219
- Joseph Y. Halpern, "A computer scientist looks at game theory,"
cs.GT/0201016
- Omar Hamouda and Robin Rowley, Expectations, Equilibrium and
Dynamics
- Zellig Harris, The Transformation of Capitalist
Society [in the direction of giving workers more power over decisions]
- Hayek
- Constitution of Liberty
- Studies in Philosophy, Politics and Economics
and New Studies
- The Counter-Revolution of Science
- Geoffrey Heal
- Valuing the Future: Economic Theory and
Sustainability
- Nature and the Marketplace: Capturing the Value of
Ecosystem Services
- Michael Heller, The Gridlock Economy: How Too Much Ownership
Wrecks Markets, Stops Innovation, and Costs Lives
[Review
by Stephen Laniel]
- Doug Henwood, After the New Economy
- Jack Hirshleifer, The Dark Side of the Force: Economic
Foundations of Conflict Theory [blurb]
- Jack Hirshleifer and John G. Riley, The Analytics of
Uncertainty and Information
- Hobson, Imperialism [Hilferding and Hobson are about
the unprecedented increases in global trade and the mobility of capital,
emerging markets and the export of manufacturing industries to the third world:
so it's more than slightly astonishing to find that they were written before
the Great War.]
- Kevin D. Hoover, Causality in Macroeconomics
- Douglas Irwin, Against the Tide: An Intellectual History of
Free Trade [Review by
Krugman]
- Eric L. Jones, Cultures Merging: A Historical and Economic
Critique of Culture
[Blurb]
- Steve Keen, Debunking Economics: The Naked Emperor
of the Social Sciences [Review by Danny Yee]
- Keynes
- Essays in Biography
- Treatise on Money
- Charles Kindleberger, Historical Economics: Art or
Science? [online]
- Janos Kornai, Anti-Equilibrium
- Roberto Leombruni and Matteo Richiardi, "Why are economists
sceptical about agent-based simulations?", Physica A
355 (2005): 103--109 ["We look at the following
problematic areas: (i) interpretation of the simulation dynamics and
generalization of the results, and (ii) estimation of the simulation model. We
show that there exist solutions for both these issues."]
- Michael Margill and Martine Quinzii, Theopry of Incomplete
Markets [Blurb]
- James G. March, A Primer on Decision Making: How Decisions
Happen
- Bhashkar Mazumder, "Fortunate Sons: New Estimates of
Intergenerational Mobility in the United States Using Social Security Earnings
Data", The Review of
Economics and Statistics 87 (2005): 235--255
- Thomas K. McCraw, Prophet of Innovation: Joseph
Schumpeter and Creative Destruction
- John McMillan, Reinventing the Bazaar: The Natural History of
Markets
- Diana A. Mendes, Vivaldo M. Mendes, J. Sousa Ramos, "Symbolic
Dynamics in a Matching Labour Market
Model",nlin.CD/0608002
- Pierre-Michel Menger, "Artistic Labor Markets and Careers",
Annual Review of Sociology 25 (1999): 541--574
- Franco Modigliani, Adventures of an Economist
- Barrington Moore, Moral Aspects of Economic Growth, and Other
Essays
- Mary S. Morgan, The History of Econometric Ideas
[blurb]
- Gustavus Myers, Hereditary American Fortunes
- Jean-Pierre Nadal, Denis Phan, Mirta B. Gordon and Jean Vannimenus,
"Monopoly Market with Externality: An Analysis with Statistical Physics and
Agent Based Computational Economics," cond-mat/0311096
- Edward J. Nell, Making Sense of a Changing Economy:
Technology, Markets and Morals
- Leland Gerson Neuberg, Conceptual Anomalies in Economics and
Statistics: Lessons from the Social Experiment
[blurb]
- Kiyohiko G. Nishimura, Imperfect Competition, Differential
Information, and Microfoundations of Macroeconomics [Blurb]
- Haim Ofek, Second Nature: Economic Origins of Human
Evolution [blurb]
- Paul Osterman, Securing Prosperity: The American Labor
Market: How It Has Changed and What to Do about It
- Muge Ozman, "Interactions in economic models: Statistical mechanics
and networks", Mind
and Society
4 (2005): 223--238
- Charles Perrow, Organizing America: Wealth, Power, and the
Origins of Corporate Capitalism
- Joel M. Podolny, Status Signals: A Sociological
Study of Market Competition
- Adam Przeworski, States and Markets: A Primer in Political
Economy
- Don Ross, Economic Theory and Cognitive Science:
Microexplanation
[Blurb]
- Alvin E. Roth, "Repugnance as a Constraint on Markets",
Journal of Economic Perspectives forthcoming (2007)
[PDF
preprint. Thanks to reader Nicolas D. P. for pointing this out to me.]
- Emma Rothschild, Economic Sentiments: Adam Smith, Condorcet,
and the Enlightenment
- Gilles Saint-Paul, Innovation and Inequality: How Does Technical Progress Affect Workers? [blurb, intro]
- Paul Samuelson, Foundations of Economic Analysis
- Margaret Schabas, The Natural Origins of Economics
[That is, the debt of economic thought, in its formative period, to ideas about
the natural world. Blurb]
- J. Schumpeter, History of Economic Analysis
- Barry Schwartz, The Paradox of Choice
- Paul Seabright, The Company of Strangers: A Natural History
of Economic Life
- Robert Solow
- An Almost Practical Step Towards
Sustainability
- Monopolistic Competition and Macroeconomic
Theory
- Adam Smith
- Joe Stiglitz
- Susan Strange, Casino Capitalism
- John Sutton, Sunk Costs and Market Structure: Price
Competition, Advertising, and the Evolution of Concentration
- Leigh Tesfatsion and Kenneth L. Judd (eds.), Agent-Based
Computational Economics, vol. 2 of the Handbook of Computational Economics
- Richard Thaler, Quasi-Rational Economics
- Hal Varian, Microeconomic Analysis [The standard
reference for lo, these many years; need to actually read it...]
- Geerat J. Vermeij, Nature: An Economic History [Review
in American Scientist]
- Dejan Vinkovic and Alan Kirman, "A Physical Analogue of the
Schelling Model", Proceedings
of the National Academy of Sciences 103 (2006):
19261--19265
- Steven Vogel, Freer Markets, More Rules: Regulatory Reform in
Advanced Economies
- Kathleen D. Vohs, Nicole L. Mead, and Miranda R. Goode, "The
Psychological Consequences of Money", Science
314 (2006): 1154--1156
- Joel Waldfogel, The Tyranny of the Market: Why You Can't
Always Get What You Want
[Blurb]
#
Thu, 19 Nov 2009
Causality and Causal Inference
There is unfortunately no accepted name for the scientific study of
causality, or of methods for inferring it. "Etiology" suggests itself, but
it's already taken...
Things I need to learn more about: Matched sampling methods.
See also:
Computational Mechanics;
Graphical Models;
Machine Learning,
Statistical Inference, and Induction
Recommended (current big picture):
- Clark Glymour
- The Mind's Arrows: Bayes Nets and Graphical Causal
Models in Psychology
[Mini-review]
- "What Went Wrong? Reflections on Science by Observation
and The Bell Curve", Philosophy of Science
65 (1998): 1--32
[PDF
reprint via Prof. Glymour]
- Sander Greenland, Judea Pearl and James M. Robins,
"Causal Diagrams for Epidemiologic Research", Epidemiology
10 (1999): 37--48 [PDF via Prof. Pearl. Very much not just for
epidemiologists.]
- Judea Pearl
- "Causal Inference in Statistics: An Overview", forthcoming
in Statistics Surveys 3 (2009): 96--146
[PDF]
- Causality: Models, Reasoning and
Inference
- Donald B. Rubin and Richard P. Waterman, "Estimating the Causal
Effects of Marketing Interventions Using Propensity Score
Methodology", math.ST/0609201
= Statistical Science 21 (2006): 206--222 [A good
description of Rubin et al.'s methods for causal inference, adapted to the
meanest understanding. I list this here rather than under "more specialized"
because Rubin and Waterman do a very good job of explaining, in a clear and
concrete problem, just how and why the newer techniques of causal inference are
valuable, with just enough technical detail that it doesn't seem like magic.]
- Peter Spirtes, Clark Glymour and Richard Scheines, Causation,
Prediction and Search
- Christopher Winship
Recommended (historical):
- David Hume
- ibn Rushd (= Averroes)
- Tahafut al-Tahafut [Which, needless to say,
I've only read in translation]
- Barry Kogan, Averroes and the Metaphysics of
Causation
- Bertrand Russell
- The Analysis of Matter
- Human Knowledge: Its Scope and Limits
Recommended (more specialized):
- Tianjiao Chu and Clark Glymour, "Search for Additive Nonlinear Time Series Causal Models", Journal of Machine Learning Research 9 (2008): 967--991
- David Galles and
Judea Pearl
- "Axioms of Causal Relevance" [preprint]
- "An Axiomatic Characterization of Causal Counterfactuals"
[preprint]
- "Testing Identifiability of Causal Effects" [preprint]
- Clark Glymour, "When Is a Brain Like the Planet?",
Philosophy of
Science 74 (2007): 330--347
- Clive Granger [His original paper on what has come to be called
"Granger causality" is actually very interesting — I hadn't realized he
got the idea from reading Norbert Wiener, but in
retrospect that makes sense and explains why he formulated his test in the
frequency domain — but I don't feel energetic enough right now to either
find it in my filing cabinet or look up the exact citation.]
- Dominik Janzing, "On causally asymmetric versions of Occam's Razor and their relation to thermodynamics", arxiv:0708.3411
- Kevin T. Kelly and Conor Mayo-Wilson, "Causation, Retraction,
Simplicity, and Truth" [Unpublished; thanks to Kevin for a preprint]
- Gustavo Lacerda, Peter Spirtes, Joseph Ramsey and Patrik O. Hoyer,
"Discovering Cyclic Causal Models by using Independent Components Analysis"
[PDF draft via
Gustavo]
- Milan Palus and Aneta
Stefanovska, "Direction of coupling from phases of interacting oscillators: An
information-theoretic approach", Physical Review
E 67 (2003): 055201 [Thanks to Prof. Palus for a
reprint. This is a kind of information-theoretic generalization of Granger
causality.]
- Judea Pearl, "On a Class of Bias-Amplifying Covariates that Endanger Effect Estimates", Technical Report R-356, UCLA Cognitive
Systems Lab, 2009 [Those would be instrumental variables (among
others).]
- J. D. Ramsey, S. J. Hanson, C. Hanson, Y. O. Halchenko,
R. A. Poldrack and C. Glymour, "Six Problems for Causal Inference from
fMRI" [Thanks to Prof. Glymour for a preprint]
- James M. Robins, Richard Scheines, Peter Spirtes and Larry
Wasserman, "Uniform Consistency in Causal Inference",
Biometrika 90 (2003): 491--515
[CMU Statistics Tech
Report 725, 2000]
- Wesley Salmon
- Scientific Explanation and the Causal Structure of the World
- Causality and Explanation
- Herbert Simon, "Causal Ordering and
Identifiability"
- Peter Spirtes, "Limits on Causal Inference from
Observational Data" [PostScript preprint]
- Halbert White and Karim Chalak, "A Unified Framework for Defining
and Identifying Causal Effects"
[Preprint
of Jan. 30, 2006; thanks to D. R. White for letting me know about this
paper and sending me a later version. Submitted to Econometrica]
To read:
- Mickel Aickin, Causal Analysis in Biomedicine and
Epidemiology: Based on Minimal Sufficient Causation
- Nicola Ancona, Daniele Marinazzo and Sebastiano Stramaglia,
"Extending Granger causality to nonlinear systems", physics/0405009
- Nihat Ay, "A Refinement of the Common Cause Principle",
SFI Working Paper 08-01-001 [PDF]
- Aron Barbey and Phillip Wolff, "Learning Causal Structure from
Reasoning", phil-sci/3176
- Michael Baumgartner, "Inferring Causal
Complexity", phil-sci/2879
[Identifying causal structures among Boolean variables, handling "both mutually
dependent causes, i.e. causal chains, and multiple effects, i.e. epiphenomena"]
- Aaron P. Blaisdell, Kosuke Sawa, Kenneth J. Leising, and Michael
R. Waldmann, "Causal Reasoning in Rats", Science
311 (2006): 1020--1022
- Blalock, Causal Inferences in Nonexperimental Research
- Hans-Peter Blossfeld and Gotz Rohwer, Techniques of
Event-History Modeling: New Approach to Causal Analysis
- Nancy Cartwright, Hunting Causes and Using Them: Approaches
in Philosophy and Economics
[blurb. Extremely harsh
critiques by Pearl
and Glymour
("All of her critical claims are false or at best fractionally true")]
- Xiaohong Chen, Markus Reiss, "On rate optimality for ill-posed
inverse problems in
econometrics", arxiv:0709.2003
[Non-parametric instrumental variables]
- Yonghong Chen, Steven L. Bressler, and Mingzhou Ding, "Frequency
decomposition of conditional Granger causality and application to multivariate
neural field potential
data", q-bio.NC/0608034
= Journal of Neuroscience Methods 150 (2006):
228--237
- John Collins, Ned Hall, L.A. Paul (eds.), Causation and
Counterfactuals [Forthcoming]
- Daniel Commenges, Anne Gegout-Petit, "A general dynamical statistical model with possible causal interpretation", arxiv:0710.4396
- Rajeev H. Dehejia and Sadek Wahba, "Propensity Score-Matching
Methods for Nonexperimental Causal Studies", The Review of Economics and
Statistics 84 (2002): 151--161
- Mingzhou Ding, Yonghong Chen and Steve L. Bressler,
"Granger Causality: Basic Theory and Application to Neuroscience",
q-bio.QM/0608035 = pp.
451--474 in B. Schelter, M. Winterhalder, and J. Timmer (eds.), Handbook
of Time Series Analysis
- Patrick Doreian, "Causality in Social Network Analysis",
Sociological
Methods and Research 30 (2001): 81--114
- Frederick Eberhardt and Richard Scheines, "Interventions and Causal
Inference", phil-sci/2944
- Ellery Eells, Probabilistic Causality
- Michael Eichler, "Graphical modelling of multivariate time
series", math.ST/0610654
- Adam Elga, "Isolation and Folk Physics", phi-sci/2678
[Ordinary notions of causality as approximations to real physics, under
conditions of near-independence]
- Elena Erosheva, Emily W. Walton and David T. Takeuchi, "Self-Rated
Health among Foreign- and U.S.-Born Asian Americans: A Test of
Comparability", Medical
Care 45 (2007): 80--87 [As an application of
propensity-score matching to a multi-level response]
- David A. Freedman
- "On Specifying Graphical Models for Causation," UCB
Stat. Tech. Rep. 601 [abstract, pdf]
- Statistical Models and Causal Inference: A Dialogue
with the Social Sciences [blurb]
- Galavotti (ed.), Stochastic Causality
- Anne Gegout-Petit and Daniel Commenges, "A general definition of
influence between stochastic processes", arxiv:0905.3619
- Clark Glymour, "Rabbit
Hunting", Synthese 121 (1999): 55--78
[PDF
reprint]
- Glymour and Cooper (eds.), Computation, Causation and
Discovery
- Adam Glynn and Kevin Quinn, "Non-parametric Mechanisms and Causal
Modeling" [PDF
preprint]
- Jorge Goncalves and Sean Warnick, "Dynamical Structure Functions
for the Estimation of LTI Networks with Limited Information", q-bio.MN/0610008
[LTI = "linear, time-invariant"]
- Alison Gopnik and Laura Schulz (eds.), Causal Learning:
Psychology, Philosophy and Computation
- Joseph Y. Halpern and Judea Pearl, "Causes and Explanations: A
Structural-Model Approach", "Part I: Causes", cs.AI/0011012, and "Part II:
Explanations,"
cs.AI/0208034
- Stefan Haufe, Guido Nolte, Klaus-Robert Mueller and Nicole Kraemer,
"Sparse Causal Discovery in Multivariate Time
Series", arxiv:0901.1234 [I am not
altogether happy with defining "causes" as "has a non-zero coefficient in a
vector autoregression"...]
- Yang-Bo He and Zhi Geng, "Active Learning of Causal
Networks with Intervention Experiments and Optimal Designs",
Journal of
Machine Learning Research 9 (2008): 2523--2547
- Joe Henson, "Comparing causality principles",
Studies in
History and Philosophy of Modern Physics
36 (2005): 519--543
- Kevin D. Hoover, Causality in Macroeconomics
- Kosuke Imai, Gary King and Elizabeth Stuart, "Misunderstandings
among Experimentalists and Observationalists about Causal Inference"
[PDF pre-print]
- Dominik Janzing, Xiaohai Sun and Bernhard Schölkopf, "Distinguishing Cause and Effect via Second Order Exponential Models", arxiv:0910.5561
- David D. Jensen, Andrew S. Fast, Brian J. Taylor, Marc E. Maier,
"Automatic Identification of Quasi-Experimental Designs for Discovering Causal
Knowledge", SIGKDD 2008
[PDF
reprint]
- Jack Katz, "From How to Why: On Luminous Description and
Causal Inference in Ethnography"
- Alon Keinan, Ben Sandbank, Claus C. Hilgetag, Isaac Meilijson and
Eytan Ruppin, "Fair Attribution of Functional Contribution in Artificial and
Biological Networks", Neural
Computation 16 (2004): 1887--1915
- Manabu Kuroki, "Bounds on average causal effects in studies with a
latent response variable", Metrika
61 (2005): 63--71
- Junning Li, Z. Jane Wang, "Controlling the False Discovery Rate of
the Association/Causality Structure Learned with the PC
Algorithm", Journal of Machine
Learning Research 10 (2009): 475--514
- Judith J. Lok
- "Mimicking counterfactual outcomes for the
estimation of causal effects", math.ST/0409045
- "Statistical modelling of causal effects in continuous
time", math.ST/0410271
- Daniele Marinazzo, Mario Pellicoro and Sebastiano Stramaglia,
"Nonlinear parametric model for Granger causality of time series",
Physical Review
E 73 (2006): 066216
= cond-mat/0602183
- Vaughn R. McKim and Stephen P. Turner (ed.), Causality in
Crisis? Statistical Methods and the Search for Causal Knowledge in the Social
Sciences
- K. Mengersen, S. A. Moynihan, R. L. Tweedie, "Causality and
Association: The Statistical and Legal
Approaches", arxiv:0710.4459
- Peter Menzies, "A Structural Equations Account of Negative
Causation", phil-sci/2962
- Morgan and Winship, Counterfactuals and Causal Inference:
Methods and Principles for Social Research
[blurb]
- John D. Norton, "Causation as Folk
Science," phil-sci/1214
- Farid Nouioua, "Why did the accident happen? A norm-based reasoning
approach", cs.AI/0610015
- L. A. (Laurie) Paul
- David T. Pegg, "Causality in quantum mechanics",
Physics
Letters A 349 (2006): 411--414
- Jean-Philippe Pellett and Andre Elisseeff, "Using Markov Blankets for Causal Structure Learning", Journal of Machine Learning
Research 9 (2008): 1295--1342
- Jonas Peters, Dominik Janzing and Bernhard Schökopf, "Causal
Inference on Discrete Data using Additive Noise
Models", arxiv:0911.0280
- Huw Price and Richard Corry (eds.), Causation, Physics, and
the Constitution of Reality: Russell's Republic Revisited
- Adam Przeworski, "Is the Science of Comparative Politics Possible?"
[PDF
preprint. On drawing causal conclusions from natural "quasi-experiments".]
- Miklós Rédei and Stephen J. Summers, "Remarks on
Causality in Relativistic Quantum Field Theory", quant-ph/0302115
- Hans Reichenbach, The Direction of Time
- Eva Riccomagno, Jim Q. Smith
- "Algebraic causality: Bayes nets and beyond",
arxiv:0709.3377
- "The causal manipulation of chain event
graphs", 0709.3380
- Donald B. Rubin, Matched Sampling for Causal Effects
[Blurb. Collection of papers by
Rubin and collaborators. Haven't finished them all yet.]
- Anil K. Seth and Gerald M. Edelman, "Distinguishing Causal
Interactions in Neural Populations", Neural
Computation 19 (2007): 910--933
- Glenn Shafer, The
Art of Causal Conjecture [Bought from
an on-line bookstore
which gave the title as The Art of Casual Conjecture; a book which
should be
written. Reviwed
by Glymour (PDF)]
- Ilya Shpitser, Judea Pearl, "Complete Identification Methods for
the Causal
Hierarchy", Journal of
Machine Learning Research 9 (2008): 1941--1979 ["We
consider a hierarchy of queries about causal relationships in graphical models,
where each level in the hierarchy requires more detailed information than the
one below. The hierarchy consists of three levels: associative relationships,
derived from a joint distribution over the observable variables; cause-effect
relationships, derived from distributions resulting from external
interventions; and counterfactuals, derived from distributions that span
multiple "parallel worlds" and resulting from simultaneous, possibly
conflicting observations and interventions. We completely characterize cases
where a given causal query can be computed from information lower in the
hierarchy"]
- Silva, Scheines, Glymour and Spirtes, "Learning the Structure
of Linear Latent Variable Models", Journal of Machine Learning Research 7 (2006): 191--246 [open access]
- Dan Sperber, David
Premack and Ann James Premack (eds.),
Causal Cognition: A Multidisciplinary Debate
- Peter Spirtes, "Graphical models, causal inference, and
econometric models", Journal of Economic Methodology 12 (2005): 1--33 [PDF]
- Patrick Suppes
- Patrick Suppes, Scientific Philosopher
- A Probabilistic Theory of Causality
- Representation and Invariance
- G. A. Svechnikov, Causality and the Relation of States in
Physics
- Tyler J. VanderWeele and James M. Robins, "Minimal sufficient causation and directed acyclic graphs", Annals of Statistics 37 (2009): 1437--1465
- P. F. Verdes, "Assessing causality from multivariate time series",
Physical Review
E 72 (2005): 026222
- Brad Weslake, "Common Causes and The Direction of Causation", phil-sci 2383
- Phillip Wolff, "Representing Causation", phil-sci/3177
- James Woodward, Making Things Happen: A Theory of Causal Explanation [Review by Glymour]
- Jiji Zhang, "Causal Reasoning with Ancestral Graphs",
Journal of
Machine Learning Research
9 (2008): 1437--1474
- Zhang Jiji and Peter Spirtes, "Detection of Unfaithfulness and
Robust Causal
Inference", phil-sc/3188
To write:
- CRS, "Causality in Models of Dynamics"
- CRS, "Homophily, Contagion, Confounding: Pick Any Three"
#
Graphical Models
A.k.a. causal models, causal graphs, Bayes graphs, Bayes networks, Bayesian
networks. (Here "Bayes" is a metonym for "conditional probability". There are
perfectly good frequentist interpretations of these models.) I'm sticking
latent-variable and path-analysis models in here, too, because they all pretty
much work the same way.
Everyone who takes basic statistics has it drilled into them that
"correlation is not causation." (When I took psych. 1, the professor said he
hoped that, if he were to come to us on our death-beds and prompt us with
"Correlation is," we would all respond "not causation.") This is a problem,
because one can infer correlation from data, and would like to be able
to make inferences about causation. There are typically two ways out of this.
One is to perform an experiment, preferably a randomized double-blind
experiment, to eliminate accidental sources of correlation, common causes, etc.
That's nice when you can do it, but impossible with supernovae, and not even
easy with people. The other out is to look for correlations, say that of
course they don't equal causations, and then act as if they did anyway. The
technical names for this latter course of action are "linear regression" and
"analysis of variance," and they form the core of applied quantitative social
science, e.g., The Bell Curve.
Graphical models are, in part, a way of escaping from this impasse.
The basic idea is as follows. You have a bunch of variables, and you want
to represent the causal relationships, or at least the probabilistic
dependencies, between them. You do so by means of a graph. Each node in the
graph stands for a variable. If variable A is a cause of B, then an arrow runs
from A to B. If A is a cause of B, we also say that A is one of B's
parents, and B one of A's children. If there is a causal path
from A to B, then A is an ancestor of B, and B is a
descendant of A. If a variable has no parents in the graph, it is
exogenous, otherwise it is endogenous.
Part of what we mean by "cause" is that, when we know the immediate causes,
the remoter causes are irrelevant --- given the parents, remoter ancestors
don't matter. The standard example is that applying a flame to a piece of
cotton will cause it to burn, whether the flame came from a match, spark,
lighter or what-not. Probabilistically, this is a conditional indepedence
property, or a Markov property: a variable is independent of its ancestors
conditional on its parents. In fact, given its parents, its children, and its
childrens' other parents, a variable is conditionally independent of all other
variables. This is called the graphical or causal Markov property. When this
holds, we can factor the joint probability distribution for all the variables
into the product of the distribution of the exogenous variables, and the
conditional distribution for each endogenous variable given its parents.
(You may be wondering what happens if A is a parent of B and B is a parent
of A, as can happen when there is feedback between the variables. This leads
to difficulties, traditionally dealt with by explicitly limiting the discussion
to acyclic graphs. I shall follow this wise precedent here.)
Now, there are certain rules which let us infer conditional independence
relations from each other. For instance, if X is independent of the
combination of Y and W, given Z, then X is indepdent of Y alone given Z. So,
if we have a graph which obeys the causal Markov condition, there are generally
other conditional independence relations which follow from the basic ones. If
these are the only conditional indepences which hold in the distribution, it is
said to be faithful to the graph (or vice versa); otherwise it is
unfaithful. For a graph to be Markov and unfaithful, there must (as it were)
be an elaborate conspiracy among the conditional distributions, so elaborate
that it will generally be destroyed by any change in any of those
distributions. So faithfulness is a robust property.
This may sound pretty arcane, but that's just because it is arcane.
The point, however, is that if you can make the three assumptions above (no
causal cycles, Markov property, faithfulness), you're in business in a really
remarkable way. There are very powerful statistical techniques that will let
you infer the causal structure connecting your variables. This comes in two
flavors. One is the Bayesian way: cook up a prior distribution over all
possible causal graphs; compute the likelihood of the data under each graph;
update your distribution over graphs; iterate. This is generally
computationally intractable, assuming you can come up with a meaningful prior
in the first place. The other approach is to use tests for conditional
independence to eliminate possible connections between variables, and so to
narrow down the range of candidate structures; it is basically frequentist, and
can be shown, under a broad range of circumstances, to be asymptotically
reliable.
Once you have your causal graph --- whether through estimation or through
simply being handed one --- you can do lots of great things with it, like
predict the effects of manipulating some of the variables, or make backward
inferences from effects to causes. Of course, if the graph is big, doing the
necessary calculations can be very troublesome in itself, and so people work on
approximation methods and even ways of doing statistical inference on models of
statistical distributions...
It's probably obvious I think this is incredibly neat, and even one of the
most important ideas to come out of machine learning. Of course it
doesn't really solve the problem of establishing causal relations, in
the way Hume objected to; it says, assuming there are
causal relations, of a certain stochastic form, and that these are stable, then
they can be learned. But that, and the more general questions of what we ought
to mean by "cause", deserve a notebook of their
own.
Things I want to understand better: frequentist inference procedures.
Computational learning theory for graphical models (the paper by Janzing and
Herrmann is good). How to treat systems with feedback? How to
treat dynamical systems
and time series? How does all of this fit
together with computational
mechanics?
Not even a conjecture. Back in the 1960s, Chow and Liu (reference
below) gave a polynomial algorithm for finding the best approximation to a
global joint probability distribution using only pairwise interactions among
the variables, i.e., the one which minimized the Kullback-Leibler divergence
between the true and the approximating distribution. I have read that
extending this to even three-way interactions is NP, though I don't know if
it's NP-complete. (1) How is the intractability result established? (2) Is
this the same as the computational phase transition one finds in going from
2-SAT to 3-SAT, where the critical point is at two-point-something SAT?
(Presumably the answer to (1) would shed some light on this.) (3) Even if not,
is there an analogous phase transition, perhaps in a different universality
class? (Update in 2009, several years later: Bento and Montanari,
below, sounds relevant, but I haven't read it yet.)
Recommended, more general:
- Clark Glymour, The Mind's Arrows: Bayes Nets and Graphical
Causal Models in Psychology
[Mini-review]
- Michael Irwin
Jordan (ed.), Learning in Graphical Models
- Jordan and Sejnowski (eds.), Graphical Models [Nice
collection of papers from Neural Computation]
- Judea Pearl
- "Causal Inference in Statistics: An Overview", forthcoming
in Statistics Surveys 3 (2009): 96--146
[PDF]
- Causality: Models, Reasoning and
Inference
- Peter Spirtes, Clark Glymour and Richard Scheines, Causation,
Prediction, and Search
Recommended, more specialized:
- C. K. Chow and C. N. Liu, "Approximating Discrete Probability
Distributions with Dependence Trees", IEEE Transactions on Information
Theory 14 (1968): 462--467 [An old but very nice result
on how to get the optimal approximation to a global probability distribution
using only pairwise interactions among the variables]
- Ghahramani, "Learning Dynamic Bayesian Networks," in
Giles and Gori (eds.), Adaptive Processing of Sequences and Data
Structures
- Ghahramani and Jordan, "Factorial Hidden Markov Models,"
Machine Learning 29 (1997): 245--273
- Dominik Janzing and Daniel J. L. Herrmann, "Reliable and
Efficient Inference of Bayesian Networks from Sparse Data by Statistical
Learning Theory", cs.LG/0309015
- Lauritzen, Graphical Models [A fairly abstract
probabilistic/mathematical-statistical treatment; I have to confess I'm only
about half-way done with it.]
- Han Liu, John Lafferty and Larry Wasserman, "The Nonparanormal:
Semiparametric Estimation of High Dimensional Undirected Graphs",
Journal of
Machine Learning Research 10 (2009): 2295--2328
= arxiv:0903.0649
- John C. Loehlin, Latent Variable Models: An Introduction to
Factor, Path, and Structural Analysis [An intro. to old-school linear
latent-variable models, especially of the sort used by psychologists. Good in
its own domain, but does not make enough contact with modern graphical models.]
- Eric Mjolsness, "Stochastic Process Semantics for Dynamical Grammar
Syntax: An Overview", cs.AI/0511073
- Pawel Wocjan, Dominik Janzing, and Thomas Beth, "Required
sample size for learning sparse Bayesian networks with many variables," cs.LG/0204052
To read:
- Francis R. Bach and Michael I. Jordan, "Learning Graphical Models
for Stationary Time Series", UCB Statistics
Tech. Rep. 650
- Onureena Banerjee, Laurent El Ghaoui, Alexandre d'Aspremont, "Model
Selection Through Sparse Maximum Likelihood
Estimation", arxiv:0707.0704
- Jose Bento, Andrea Montanari, "Which graphical models are difficult to learn?", arxiv:0910.5761
- David Brillinger, "Remarks Concerning Graphical Models for
Time Series and Point Processes," Revista de Econometria
16 (1996): 1--23 [PS]
- Michael Chertkov and Vladimir Y. Chernyak
- David Maxwell Chickering, "Optimal Structure Identification
With Greedy Search," Journal of Machine Learning Research
3 (2002): 507--554
- Robert G. Cowell, A. Philip Dawid, Steffen L. Lauritzen and David
J. Spiegelhalter, Probabilistic Networks and Expert Systems
- David Cox and Nanny Warmuth, Multivariate Dependcencies: Models, Analysis, and Interpretation
- Rainer Dahlhaus, "Graphical interaction models for
multivariate time series," Metrika 51
(2000): 157--172
- Luis M. de Campos, "A Scoring Function for Learning Bayesian Networks based on Mutual Information and Conditional Independence Tests",
Journal of
Machine Learning Research 7 (2006): 2149--2187
[Sounds damn cool]
- Amir Dembo and Andrea Montanari, "Gibbs Measures and Phase Transitions on Sparse Random Graphs", arxiv:0910.5460
- Michael Eichler, "Graphical modelling of multivariate time
series", math.ST/0610654
- Gal Elidan, Iftach Nachman and Nir Friedman, "'Ideal Parent'
Structure Learning for Continuous Variable Bayesian
Networks", Journal of
Machine Learning Research 8 (2007): 1799--1833
- Sergi Elizalde and Kevin Woods, "Bounds on the number of inference
functions of a graphical
model", math.CO/0610233
- Juan Ferrándiz, Enrique F. Castillo and Pilar Sanmartin,
"Temporal aggregation in chain graph models", Journal of
Statistical Planning and Inference 133 (2005):
69--93
- Freedman, "On Specifying Graphical Models for Causation,"
UCB Stat. Tech. Rep. 601 [abstract, pdf]
- Frey, Graphical Models in Machine Learning and Data
Communication
- Roland Fried and Vanessa Didelez, "Latent variable analysis and
partial correlation graphs for multivariate time series", Statistics and
Probability Letters 73 (2005): 287--296
- Cyril Furtlehner, Jean-Marc Lasgouttes, Arnaud De La Fortelle,
"Belief Propagation and Bethe approximation for Traffic Prediction",
physics/0703159
- Dan Geiger, David Heckerman, Henry King, and Christopher
Meek, "Stratified exponential families: Graphical models and model selection",
Annals of
Statistics 29 (2001): 505--529
- Christophe Giraud, "Estimation of Gaussian graphs by model
selection", arxiv:0710.2044
- Glymour and Cooper (eds.), Computation, Causation and
Discovery
- Jorge Goncalves and Sean Warnick, "Dynamical Structure Functions
for the Estimation of LTI Networks with Limited Information", q-bio.MN/0610008
[LTI = "linear, time-invariant"]
- Green, Hjort and Richardson (eds.), Highly Structured
Stochastic Systems
- Vikas Hamine and Paul Helman, "Learning Optimal Augmented Bayes
Networks", cs.LG/0509055
- Holger Höfling and Robert Tibshirani, "Estimation of Sparse
Binary Pairwise Markov Networks using
Pseudo-likelihoods", Journal of
Machine Learning Research 10 (2009): 883--906
- Shiro Ikeda, Toshiyuki Tanaka and Shun-ichi Amari, "Stochastic
Reasoning, Free Energy, and Information
Geometry", Neural
Computation 16 (2004): 1779--1810
- Manfred Jaeger & co., Primula [Java implementation
of a modeling language for relational Bayesian networks; released under GPL]
- Markus Kalisch and Peter Bühlmnann, "Estimating
High-Dimensional Directed Acyclic Graphs with the
PC-Algorithm", Journal
of Machine Learning Research 8 (2007): 616--636
- Nicole Kraemer, Juliane Schaefer, Anne-Laure Boulesteix,
"Regularized estimation of large-sacle gene association networks using
graphical Gaussian
models", arxiv:0905.0603
- Sanjiang Li, "Causal models have no complete axiomatic
characterization", arxiv:0804.2401
- Stephen Luttrell, "Adaptive Cluster Expansion (ACE): A Hierarchical
Bayesian Network", cs.NE/0410020
- Dhafer Malouche and Sylvie Sevestre-Ghalila, "Estimating High
dimensional faithful Gaussian graphical Models :
uPC-algorithm", arxiv:0705.1613
- Giovanni M. Marchetti, Nanny Wermuth, "Matrix representations and independencies in directed acyclic graphs", arxiv:0904.0333
- Eric Mjolsness, "Labeled graph notations for graphical models", UCI
School of Information and Computer science Technical Report 04-03 [PDF]
- Jennifer Neville and David Jensen, "Relational Dependency Networks",
Journal
of Machine Learning Research 8 (2007): 653--692
- Lior Pachter and Bernd Sturmfels
- Alessandro Pelizzola, "Cluster variation method in statistical
physics and probabilistic graphical models", Journal of Physics
A: Mathematical and General 38 (2005): R309--R339
= cond-mat/0508216
- Tapani Raiko, Harri Valpola, Markus Harva and Juha Karhunen,
"Building Blocks for Variational Bayesian Learning of Latent Variable
Models", Journal
of Machine Learning Research 8 (2007): 155--201
- Pradeep Ravikumar, Martin J. Wainwright, John D. Lafferty,
"High-Dimensional Graphical Model Selection Using $\ell_1$-Regularized Logistic
Regression", arxiv:0804.4202
- Marco Reale, A Graphical Modelling Approach to Time
Series, Ph.D. thesis, Lancaster University, 1998 [Reale's
website]
- Marco Reale and Granville Tunnicliffe Wilson
- "Identification of vector AR models with recursive
structural errors using conditional independence graphs"
- "The Sampling Properties of Conditional Independence Graphs
for Structural Vector Autoregressions"
- T. Rizzo, B. Wemmenhove, H.J. Kappen, "On Cavity Approximations for
Graphical
Models", cond-mat/0608312
- Philipp Rütimann and Peter Bühlmann, "High
dimensional sparse covariance estimation via directed acyclic graphs",
arxiv:0911.2375
- Marco Scutari, "Learning Bayesian Networks with the bnlearn
Package", arxiv:0908.3817
- Bill Shipley, Cause and Correlation in Biology: A User's
Guide to Path Analysis, Structural Equations and Causal Inference
- Tomi Silander, Teemu Roos, Petri Myllymaki, "Locally Minimax Optimal Predictive Modeling with Bayesian Networks", Journal of Machine
Learning Research Workshop and Conference Proceedings 5 (AISTATS 2009): 504--511
- Milan Studeny, Probabilistic Conditional Independence
Structures ["The main topic of the monograph is a non-graphical
algebraic method for describing probabilistic CI structures. However, one of
the first two chapters in the book recalls and gathers basic mathematical tools
for study of probabilistic conditional independence (CI) and the other one is a
sketchy overview of recent advanced graphical approaches to the desciption of
CI structures. The next four chapters develop the non-graphical method. The
last standard chapter is an attempt to apply the method in practice: it is
devoted to learning Bayesian nets and it is more mathematical (and
'philosophical') revision of some methods for learning Bayesian networks. The
main aim of that chapter is to indicate that an algebraic approach can also be
applied in this area."]
- Charles Sutton, Andrew McCallum and Khashayar Rohanimanesh,
"Dynamic Conditional Random Fields: Factorized Probabilistic Models for
Labeling and Segmenting Sequence Data", Journal
of Machine Learning Research 8 (2007): 693--723
- Vincent Y. F. Tan, Animashree Anandkumar, Lang Tong and Alan
S. Willsky, "A Large-Deviation Analysis of the Maximum-Likelihood Learning of
Markov Tree
Structures", arxiv:0905.0940
[Large deviations for Chow-Liu trees]
- Robert E. Tillman, Arthur Gretton and Peter Spirtes,
"Nonlinear directed acyclic structure learning with weakly additive
noise models" [Thanks to Prof. Spirtes for a preprint]
- Achim Tresch, Florian Markowetz, "Structure Learning in Nested
Effects Models", 0710.4481
- M. J. Wainwright and M. I. Jordan, "Graphical models, exponential
families, & variational inference", UCB Statistics
Tech. Rep. 649
- Xianchao Xie, Zhi Geng, "A Recursive Method for Structural Learning
of Directed Acyclic Graphs", Journal of
Machine Learning Research 9 (2008): 459--483
- Jonathan S. Yedidia, William T. Freeman and Yair Weiss,
"Understanding Belief Propagation and its Generalizations", Mitsubshi Electric Research
Laboratories Tech. Rep. 2001-22
- Marco Zaffalon and Marcus Hutter, "Robust Inference of Trees",
cs.LG/0511087
(Thanks to Gustavo Lacerda for pointing out a goof.)
#
Regression, especially Nonparametric Regression
"Regression", in statistical jargon, is the problem of guessing the average
level of some quantitative response variable from various predictor variables.
Linear regression is perhaps the single most common quantitative tool in
economics, sociology, and many other fields; it's certainly the most common use
of statistics. (Analysis of variance, arguably
more common in psychology and biology, is a disguised form of regression.)
While linear regression deserves a place in statistics, that place
should be nowhere near as large and prominent as it currently is. There are
very few situations where we actually have scientific support for
linear models. Fortunately, very flexible nonlinear regression methods now
exist, and from the user's point of view are just as easy as linear regression,
and at least as insightful. (Regression trees and additive models, in
particular, are just as interpretable.) At the very least, if you do
have a particular functional form in mind for the regression, linear or
otherwise, you should use a non-parametric regression to test the adequacy of
that form.
From a technical point of view, the main drawback of modern regression
methods is that their extra flexibility comes at the price of less "efficiency"
— estimates converge more slowly, so you have less precision for the same
amount of data. There are some situations where you'd prefer to have more
precise estimates from a bad model than less precise estimates from a model
which doesn't make systematic errors, but I don't think that's what most users
of linear regression are chosing to do; they're just taught to
type lm
rather
than gam.
In this day and age, though, I don't understand why not.
(Of course, for the statistician, a lot of the more flexible regression
methods look more or less like linear regression in some disguised form,
because fundamentally all it does is projection. So it's not crazy to make it
a foundational topic for statisticians. We should not, however, give
the rest of the world the impression that the hat matrix is the source of all
knowledge.)
The use of regression, linear or otherwise,
for causal inference, rather than prediction, is a
different, and far more sordid, story.
See also:
Computational Statistics;
Data Mining;
Learning Theory;
Model Selection;
Neural Nets;
Social Science Methodology;
What Is the Right Null Model
for Linear Regression?
Recommended, more general:
- Richard A. Berk
- Regression Analysis: A Constructive Critique
[Mini-review]
- Statistical Learning from a Regression
Perspective
- Julian J. Faraway, Extending the Linear Model with R:
Generalized Linear, Mixed Effects and Nonparametric Regression Models
- Andrew Gelman and Iain Pardoe, "Average predictive comparisons
for models with nonlinearity, interactions, and variance components",
Sociological Methodology forthcoming (2007)
[PDF
preprint,
Gelman's comments]
- Jeffrey D. Hart, Nonparametric Smoothing and Lack-of-Fit
Tests [Mini-review]
- Trevor Hastie and Robert Tibshirani and Jerome Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction
[This is a corner-stone book, but is about much, much more than just
regression.]
- Jeffrey S. Racine, "Nonparametric Econometrics: A Primer",
Foundations and Trends in Econometrics
3 (2008): 1--88 [Good primer of nonparametric techniques
for regression, density estimation and hypothesis testing; next to no economic
content (except for
examples). PDF
reprint]
- Grace Wahba,
Spline Models for Observational Data
- Larry Wasserman
- Weisberg, Applied Linear Regression
Recommended, more specialized:
- Norman H. Anderson and James Shanteau, "Weak inference with linear models", Psychological Bulletin 84 (1977): 1155--1170 [A demonstration of why you should not rely on R2 to back up your claims]
- Raymond J. Carroll, Aurore Delaigle, and Peter Hall, "Nonparametric
Prediction in Measurement Error
Models", Journal of
the American Statistical Association 104 (2009):
993--1003
- Kevin A. Clarke, "The Phantom Menace: Omitted Variables Bias in
Econometric Research"
[PDF. Or: Kitchen-sink
regressions considered harmful. Including extra variables in your linear
regression may or may not reduce the bias in your estimate of any particular
coefficients of interest, depending on the correlations between the added
variables, the predictors of interest, the response, and omitted relevant
variables. Adding more variables always increases the variance of your
estimates.]
- Berthold R. Haag, "Non-parametric Regression Tests Using Dimension
Reduction Techniques", Scandinavian Journal of Statistics 35 (2008): 719--738
- Jon Lafferty and Larry Wasserman, "Rodeo: Sparse Nonparametric
Regression in High Dimensions", math.ST/0506342 ["We present a
method for simultaneously performing bandwidth selection and variable selection
in nonparametric regression."]
- Lukas Meier, Sara van de Geer and Peter Bühlmann,
"High-Dimensional Additive
Modeling", arxiv:0806.4115 =
Annals of Statistics 37 (2009): 3779--3821
- Pradeep Ravikumar, John Lafferty, Han Liu, Larry Wasserman, "Sparse
Additive Models", arxiv:0711.4555
- Sara van de Geer, Empirical Process Theory in
M-Estimation
To read:
- Sylvain Arlot and Pascal Massart, "Data-driven Calibration of Penalties for Least-Squares Regression", Journal of Machine Learning Research 10 (2009): 245--279
- Gilles Blanchard, Nicole Kraemer, "Kernel Conjugate Gradient is
Universally
Consistent", arxiv:0902.4380
["approximate solutions are constructed by projections onto a nested set of
data-dependent subspaces"]
- Borowiak, Model Discrimination for Nonlinear Regression
Models
- Adrian W. Bowman and Adelchi Azzalini, Applied Smoothing
Techniques for Data Analysis: The Kernel Approach with S-Plus
Illustrations
- Thomas Brambor, William Roberts Clark and Matt Golder,
"Understanding Interaction Models: Improving Empirical Analyses",
Political Analysis
14 (2006): 63--82
- Lawrence D. Brown and Mark G. Low, "Asymptotic Equivalence of
Nonparametric Regression and White Noise", Annals of Statistics
24 (1996): 2384--2398
[JSTOR]
- T. Tony Cai, Harrison H. Zhou, "Asymptotic equivalence and adaptive estimation for robust nonparametric regression", Annals of
Statistics 37 (2009): 3204--3235 = arxiv:0909.0343
- Arnak Dalalyan and Alexandre B. Tsybakov, "Sparse Regression Learning by Aggregation and Langevin Monte-Carlo", arxiv:0903.1223
- Sam Efromovich, Nonparametric Curve Estimation
- Andrew Gelman and Jennifer Hill, Data Analysis Using Regression and Multilevel/Hierarchical Models
- Christopher R. Genovese and Larry Wasserman
- Jose M. Gonzalez-Barrios and Silvia Ruiz-Velasco, "Regression
analysis and dependence", Metrica
61 (2005): 73--87
- Emmanuel Guerre and Pascal Lavergne, "Data-driven rate-optimal
specification testing in regression models", math.ST/0505640 = Annals
of Statistics 33 (2005): 840--870
- Laszlo Gyorfi et al., A Distribution-Free Theory of
Nonparametric Regression
- Peter Hall, "On Bootstrap Confidence Intervals in Nonparametric
Regression", Annals of Statistics 20 (1992):
695--711
- Bruce E. Hansen
- Wolfgang Härdle, Applied Nonparametric Regression [blurb; online]
- Wolfgang Härdle, Marlene Müller, Stefan Sperlich and
Axel Werwatz, Nonparametric and Semiparametric Models: An
Introduction [Full text online]
- Salvatore Ingrassia, Simona C. Minotti, Giorgio Vittadini, "Local statistical modeling by cluster-weighted" [sic], arxiv:0911.2634 [Revisiting Gershenfeld et al.'s "cluster-weighted
modeling" from a more properly statistical perspective]
- Sameer M. Jalnapurkar, "Learning a regression function via Tikhonov
regularization", math.ST/0509420
- Estate V. Khmaladze, Hira L. Koul, "Goodness-of-fit problem for errors in nonparametric regression: Distribution free approach", Annals
of Statistics 37 (2009): 3165--3185 = arxiv:0909.0170
- Michael R. Kosorok, Introduction to Empirical Processes and
Semiparametric Inference
[partial PDF
preprint]
- Nicole Kraemer, Anne-Laure Boulesteix, Gerhard Tutz, "Penalized
Partial Least Squares Based on B-Splines
Transformations", math.ST/0608576
- Qi Li and Jeffrey Scott Racine, Nonparametric Econometrics: Theory and Practice
- Oliver Linton and Zhijie Xiao, "A Nonparametric Regression
Estimator That Adapts To Error Distribution of Unknown Form",
Econometric
Theory 23 (2007): 371--413
- Abdelkader Mokkadem, Mariane Pelletier, Yousri Slaoui, "Revisiting
Révész's stochastic approximation method for the estimation of a
regression
function", arxiv:0812.3973
- Philippe Rigollet, "Maximum likelihood aggregation and
misspecified generalized linear models", arxiv:0911.2919
- Cynthia Rudin, "Stability Analysis for Regularized Least Squares
Regression", cs.LG/0502016
- George A. F. Seber and C. J. Wild, Nonlinear Regression
- David Shilane, Richard H. Liang and Sandrine Dudoit, "Loss-Based
Estimation with Evolutionary Algorithms and Cross-Validation",
UC Berkeley Biostatistics Working Paper 227 [Abstract, PDF]
- Jeffrey S. Simonoff, Smoothing Methods in Statistics
- Aris Spanos, "Revisiting the Omitted Variables Argument:
Substantive vs. Statistical Adequacy" [PDF preprint]
- Liangjun Su and Aman Ullah, "Local polynomial estimation of nonparametric simultaneous equations models", Journal of Econometrics 144 (2008): 193--218
- Gerhard Tutz and Jan Ulbricht, "Penalized regression with
correlation-based
penalty", Statistics
and Computing
19 (2008): 239--253
- Daniela M. Witten and Robert Tibshirani, "Covariance-regularized
regression and classification for high dimensional problems", Journal of the Royal
Statistical Society B 71 (2009): 615--636
- Hirokazu Yanagiharaa and Chihiro Ohmoto, "On distribution of AIC in
linear regression models", Journal of
Statistical Planning and Inference 133 (2005):
417--433
- Peng Zhau and Bin Yu, "On Model Selection Consistency of Lasso",
Journal
of Machine Learning Research 7 (2006): 2541--2563
#
Time Series, or Statistics for Stochastic Processes and Dynamical Systems
Rates of convergence of estimators; confidence intervals, analogs to
VC-dimension results (see Meir's paper
below). Large deviation techniques; why
are large deviation rate functionals, when they exist, generally relative
entropies? Prediction schemes. Are there universal schemes which do not
demand exponentially growing volumes of data? Can any of the "universal
algorithm" schemes actually be used for anything?
If you have an ergodic process, then the
sample-path mean for any nice statistic you care to measure will, almost
surely, converge to the distributional mean. This is even true of trajectory
probabilities (i.e., if you want to know the probability of a certain
finite-length trajectory, simply count how often it happens.) So "sit and
count" is a reliable and consistent statistical procedure. If the process
mixes sufficiently quickly, the rate of convergence might even be respectable.
But this doesn't say anything about the efficiency of such procedures, which is
surely a consideration. And what do you do for non-ergodic processes? (Take
multiple runs and hope they're telling you about different ergodic components?)
Non-stationary, even?
I need to learn more about frequency-domain approaches; despite being raised
as a physicist, I find the time domain much more natural. After all, the
frequency domain is effectively just one choice of a function basis, and there
are infinitely many others, which might in some sense be more appropriate to
the process at hand. But that's at least in part a rationalization against
having to learn more math.
LSE econometrics and its "general-to-specific" modeling procedure
is very interesting, and I think possibly even related to stuff I've done, but
I need to understand it much better than I do.
(This notebook probably needs subdivision.)
See also:
Control Theory;
Dynamical Systems;
Ergodic Theory;
Filtering, State Estimation and Signal Processing;
Grammatical Inference;
Information Theory;
Machine Learning, Statistical Inference and Induction;
Markov Models and Hidden Markov Models;
Neural Coding;
Power Law Distributions, 1/f Noise and
Long-Memory Processes;
Recurrence Times of Stochastic Processes (also Hitting, Waiting, and First-Passage Times)
Sequential Decisions Under
Uncertainty;
State-Space
Reconstruction;
Statistical Learning Theory with Dependent Data;
Statistics;
Stochastic Processes;
Symbolic Dynamics;
Universal Prediction Algorithms
Recommended, big picture:
- M. S. Bartlett, An Introduction to Stochastic Processes,
with Special Reference to Methods and Applications [Classic stuff on
likelihood theory for stochastic processes]
- Ishwar V. Basawa and B. L. S. Prakasa Rao, Statistical
Inference for Stochastic Processes [Assumes familiarity with normal
theoretical statistics, i.e., you have to have already been taught to care
about confidence intervals, hypothesis tests, estimation efficiency, etc. But
very nice, given that background.]
- Peter Guttorp, Stochastic Modeling of Scientific Data
[An introduction to statistical inference for many different kinds of dependent
data, not just time series; can be used by scientists and statisticians.]
- Holger Kantz and Thomas Schreiber, Nonlinear Time Series
Analysis [An excellent presentation of the nonlinear dynamical systems
approach, which comes out of physics]
- Judy Klein, Statistical Visions in Time: A History of
Time-Series Analysis, 1662--1938
- Robert Shumway and David Stoffer, Time Series Analysis and
Its Applications: With R Applications [A standard applied statistics
text, but better than many at creating pathways into theory, and realizing that
ARIMA is not the beginning and the end of the subject!]
- Jorma Rissanen, Stochastic Complexity in Statistical
Inquiry [Review: Less Is
More, or, Ecce data!]
- David Ruelle, Chaotic Evolution and Strange Attractors: The
Statistical Analysis of Deterministic Nonlinear Systems [From notes
prepared by Stefano Isola]
- Norbert Wiener, Extrapolation,
Interpolation and Smoothing of Stationary Time Series
Recommended, closeups:
- Markus Abel, K. H. Andersen and Guglielmo Lacorata, "Hierarchical
Markovian modeling of multi-time systems," nlin.CD/0201027
- Miika Ahdesmäki, Harri Lähdesmäki, Ron Pearson,
Heikki Huttunen, and Olli Yli-Harja, "Robust detection of periodic time series
measured from biological systems", BMC
Bioinformatics 6 (2005): 117 [Open access, yay!]
- Jushan Bai, "Testing parametric conditional distributions of
dynamic
models", The
Review of Economics and Statistics 85 (2003):
531--549 [Proposes "a nonparametric test for parametric conditional
distributions of dynamic models. The test is of the Kolmogorov type.... It is
asymptotically distribution-free and has nontrivial power against root-n local
alternatives..."]
- Matthew J. Beal, Zoubin Ghahramani and Carl Edward Rasmussen, "The
Infinite Hidden Markov Model", in NIPS 14 [Link]
- Patrick Billingsley, Statistical Inference for Markov
Processes [Discrete-time and cadlag processes only]
- Denis Bosq, Nonparametric Statistics for Stochastic
Processes
- Denis Bosq and Delphine Blanke, Inference and Prediction in
Large Dimensions
- David Brillinger
- "Remarks concerning graphical models for time series and
point processes," Revista de Econometria 16
(1996): 1--23
- "Second-order moments and mutual information in the
analysis of time series and point processes," Proceedings of the
Conference Statistics 2001 Canada [online]
- "Does anyone know when the correlation coefficient is
useful?: A study of the times of extreme river flows,"
Technometrics 43 (2001), 266-273
- Prabir Burman, Edmond Chow and Deborah Nolan, "A Cross-Validatory
Method for Dependent Data", Biometrika 81
(1994): 351--358 [JSTOR]
- S. Caires and J. A. Ferreira, "On the Non-parametric Prediction of
Conditionally Stationary Sequences", Statistical Inference
for Stochastic Processes 8 (2005): 151--184
- Luca Capriotti
- "A Closed-Form Approximation of Likelihood
Functions for Discretely Sampled Diffusions: the Exponent Expansion",
physics/0703180
- "The Exponent Expansion: An Effective Approximation of
Transition Probabilities of Diffusion Processes and Pricing Kernels of
Financial
Derivatives", physics/0602107
= International Journal of Theoretical and Applied
Finance 9 (2006): 1179--1199
- Tianjiao Chu and Clark Glymour, "Search for Additive Nonlinear Time Series Causal Models", Journal of Machine Learning Research 9 (2008): 967--991
- Jérôme Dedecker, Paul Doukhan, Gabriel Lang,
José Rafael León R., Sana Louhichi and Clémentine Prieur, Weak Dependence: With Examples and Applications
- Piet de Jong and Jeremy Penzer, "ARMA models in state space
form", Statistics
and Probability Letters 70 (2004): 119--125
[preprint]
- Piet De Jong, "A Cross-Validation Filter for Time Series
Models", Biometrika 75 (1988): 594--600
[JSTOR]
- Victor H. de la Pena, Rustam Ibragimov, and Shaturgun Sharakhmetov,
"Characterizations of joint distributions, copulas, information, dependence and
decoupling, with applications to time
series", math.ST/0611166
- Andrew M. Fraser, Hidden Markov Models and Dynamical
Systems [Review: Statistics of
Moving Shadows]
- Neil Gershenfeld, B. Schoner and E. Metois, "Cluster-Weighted
Modelling for Time-Series Analysis," Nature 397
(1999): 329--332 [Also described in Gershenfeld's incredible Nature of
Mathematical Modeling]
- Gershenfeld and Weigend (eds.), Time Series Prediction:
Forecasting the Future and Understanding the Past
- Silvia Goncalves and Halbert White, "Maximum likelihood and the
bootstrap for nonlinear dynamic models",
Journal of
Econometrics 119 (2004): 199--219
- Christian Gouriéroux and Alain Monfort,
Simulation-Based Econometric Methods
[Review: By
Indirection Find Direction Out]
- Kevin D. Hoover and Stephen J. Perez, "Data-Mining Reconsidered:
Encompassing and the General-to-Specific Approach to Specification Search,"
Econometrics Journal 2 (1999): 167--191
- Marc Joannides and Francois Le Gland, "Small Noise Asymptotics of
the Bayesian Estimator in Nonidentifiable Models", Statistical Inference
for Stochastic Processes 5 (2002): 95--130
- M. L. Kleptsyna, A. Le Breton and M.-C. Roubaud, "Parameter
Estimation and Optimal Filtering for Fractional Type Stochastic
Systems", Statistical Inference for Stochastic Processes
3 (2000): 173--182
- Rudolf Kulhavy, Recursive Nonlinear Estimation: A Geometric
Approach [Includes, explicitly, estimation in time-series systems]
- Kevin Judd, "Chaotic-time-series reconstruction by the Bayesian
paradigm: Right results by wrong methods,"
Physical Review E 67 (2003): 026212 [Word.]
- Guglielmo Lacorata, Ruben A. Pasmanter and Angelo Vulpiani,
"Markov-chain approach to a process with long-time memory," nlin.CD/0110010 [A special case
of a more general result encompassed in my paper with Cris Moore]
- Ron Meir, "Nonparametric Time Series Prediction Through Adaptive
Model Selection," Machine Learning 39 (2000):
5--34 [PDF.
Extending the "structural risk minimization" framework due to Vapnik to time
series. Unfortunately Meir's approach demands knowledge of the mixing rate of
the process, which we don't really know how to estimate, but this is a very
encouraging first step.]
- Gusztáv Morvai, Sidney J. Yakowitz and Paul Algoet, "Weakly
Convergent Nonparametric Forecasting of Stationary Time Series," IEEE
Trans. Info. Theory 43 (1997): 483--498
- Martin Nilsson, "Generalized Singular Spectrum Time Series
Analysis," physics/0205094
- Andrey Novikov, "Optimal sequential multiple
hypothesis tests",arxiv:0811.1297
- Maxim Raginsky, Roummel F. Marcia, Jorge Silva and Rebecca M.
Willett, "Sequential Probability Assignment via Online Convex Programming
Using Exponential Families" [ISIT 2009; PDF]
- James Ramsay, Giles Hooker, David Campbell and Jiguo Cao,
"Parameter Estimation for Differential Equations: A Generalized Smoothing
Approach", Journal of the Royal Statistical Society forthcoming
(2007) [PDF
preprint]
- P. A. Robinson, "Interpretation of scaling properties of
electroencephalographic fluctuations via spectral analysis and underlying
physiology," Physical Review
E 67 (2003): 032902 [A polite but devastating
demonstration that "detrended fluctuation analysis", per Gene Stanley & co., is
an obfuscated way of looking at the power spectrum.]
- George G. Roussas, "Asymptotic distribution of the log-likelihood
function for stochastic processes," Zeitschrift für
Wahrscheinlickkeitstheorie und verwandte Gebiete 47
(1979): 31--46 [Elegant solution of a basic problem for a pretty broad class of
processes; extends work in his 1972 book, listed below because I can't lay
hands on it.]
- Daniil Ryabko, "A criterion for hypothesis testing for
stationary processes", arxiv:0905.4937
- Daniil Ryabko and Boris Ryabko, "Testing Statistical Hypotheses
About Ergodic
Processes", arxiv:0804.0510
- Nobusumi Sagara, "Nonparametric maximum-likelihood estimation of
probability measures: existence and consistency", Journal of
Statistical Planning and Inference 133 (2005):
249--271 ["This paper formulates the nonparametric maximum-likelihood
estimation of probability measures and generalizes the consistency result on
the maximum-likelihood estimator (MLE). We drop the independent assumption on
the underlying stochastic process and replace it with the assumption that the
stochastic process is stationary and ergodic. The present proof employs
Birkhoff's ergodic theorem and the martingale convergence theorem. The main
result is applied to the parametric and nonparametric maximum-likelihood
estimation of density functions." Very cool.]
- Statistical Inference for Stochastic Processes
[Journal]
- Christopher C. Strelioff and Alfred W. Hübler, "Medium-Term
Prediction of Chaos", Physical Review
Letters 96 (2006): 044101
- Masanobu Taniguchi and Yoshihide Kakizawa, Asymptotic Theory
of Statistical Inference for Time Series [Finally, a proper statistical
treatment which doesn't confine itself to expletive-deleted ARMA processes.
Neat information geometry too. Expensive but worth it.]
- Albert Vexler, "Martingale Type Statistics Applied to Change Point
Detection", Communications in Statistics - Theory and Methods
37 (2008): 1207--1224
- Wei Biao Wu, "Nonlinear system theory: Another look at
dependence", Proceedings of the
National Academy of Sciences 102 (2005):
14150--14154 ["we introduce previously undescribed dependence measures for
stationary causal processes. Our physical and predictive dependence measures
quantify the degree of dependence of outputs on inputs in physical systems. The
proposed dependence measures provide a natural framework for a limit theory for
stationary processes. In particular, under conditions with quite simple forms,
we present limit theorems for partial sums, empirical processes, and kernel
density estimates. The conditions are mild and easily verifiable because they
are directly related to the data-generating mechanisms."]
To read:
- Luis A. Aguirre, Ubiratan S. Freitas, Christophe Letellier and Jean
Maquet, "Structure-selection techniques applied to continuous-time nonlinear
models," Physica D 158 (2001): 1--18
- Eduardo G. Altmann and Holger Kantz, "Recurrence time analysis,
long-term correlations, and extreme events", physics/0503056
- Shun-ichi Amari, "Estimating Functions of Independent Component
Analysis for Temporally Correlated Signals," Neural Computation
12 (2000): 2083--2107
- Heather M. Anderson, "Choosing Lag Lengths in Nonlinear Dynamic
Models," Monash Econometric Working Paper [online]
- Claudia Angelini, Daniela Cavab, Gabriel Katul, and Brani
Vidakovic, "Resampling hierarchical processes in the wavelet domain: A case
study using atmospheric turbulence", Physica
D 207 (2005): 24--40
- J. A. D. Aston, "Modeling macroeconomic time series via heavy
tailed distributions", math.ST/0702844
- Alexander Aue, Siegfried Hörmann, Lajos Horváth
and Matthew Reimherr, "Break detection in the covariance structure of multivariate time series models", Annals of Statistics 37
(2009): 4046--4087
- Ishwar V. Basawa and D. J. Scott, Asymptotic Optimal
Inference for Non-ergodic Models
- Nathaniel Beck and Jonathan N. Katz
- István Berkes, Lajos Horváth and Shiqing Ling, "Estimation in nonstationary random coefficient autoregressive models",
Journal of Time Series Analysis 30 (2009): 395--416 ["the unit root problem does not exist in the RCA model"!]
- Alain Berlinet and Gérar Biau, "Minimax Bounds in
Nonparametric Estimation of Multidimensional Deterministic Dynamical
Systems", Statistical Inference for Stochastic Processes
4 (2001): 229--248 ["We consider the problem of estimating
a multidimensional discrete deterministic dynamical system from the first n+1
observations. We exhibit the optimal rate function ... the near neighbor
estimator achives this optimal rate.... optimal rate function is defined from
multidimensonal spacings which are edge lengths of simplicies associated with a
triangulation of the Voronoi cells built from the observations." Sounds very
cool!]
- Alain Berlinet and Christian Francq, "On the Identifiability
of minimal VARMA representations", Statistical Inference for Stochastic
Processes 1 (1998): 1--15
- Arturo Berrones, "Knowledge Network Approach to Noise Reduction",
physics/0609048
- Patrice Bertail, Paul Doukhan and Philippe Soulier
(eds.), Dependence in Probability and Statistics ["recent
developments in the field of probability and statistics for dependent
data... from Markov chain theory and weak dependence with an emphasis on some
recent developments on dynamical systems, to strong dependence in times series
and random fields. ... section on statistical estimation problems and specific
applications". Full blurb,
contents]
- D. Blanke, D. Bosq and D. Guegan, "Modelization and Nonparametric
Estimation for Dynamical Systems with Noise", Statistical Inference for
Stochastic Processes 6 (2003): 267--290
- Noelle Bru, Laurence Despres and Christian Paroissin, "A comparison
of statistical models for short categorical or ordinal time series with
applications in
ecology", math.ST/0702706
- Prabir Burman and Robert H. Shumway, "Estimation of trend in state-space models: Asymptotic mean square error and rate of convergence", arxiv:0911.3469 = Annals of Statistics 37
(2009): 3715--3742
- Alexandre X. Carvalho and Martin A. Tanner, "Mixtures-of-Experts of
Autoregressive Time Series: Asymptotic Normality and Model
Specification", IEEE Transactions on Neural
Networks 16 (2005): 39--56
- Kung-Sik Chan and Howell H. Tong, Chaos: A Statistical
Perspective
- J.-R. Chazottes, P. Collet and B. Schmitt, "Statistical
Consequences of Devroye Inequality for Processes. Applications to a Class of
Non-Uniformly Hyperbolic Dynamical Systems", math.DS/0412167
- J.-R. Chazottes, E. Floriani and R. Lima, "Relative Entropy and
Identification of Gibbs Measures in Dynamical Systems," Journal of
Statistical Physics 90 (1998): 697--725
- Zhuo Chen and Yuhong Yan, "Time Series Models for Forecasting:
Testing or Combining?", Studies in Nonlinear Dynamics and
Econometrics 11:1 (2007): 3
- Zhiyi Chi, "Large deviations for template matching between point
processes", Annals of Applied
Probability 15 (2005): 153--174 = math.PR/0503463
- P. Cizek, W. Hardle, V. Spokoiny, "Adaptive pointwise estimation in time-inhomogeneous conditional heteroscedasticity models", arxiv:0903.4620 [I'm more interested in the idea of adaptively estimating non-stationary time series here than the finance application...]
- Michael P. Clements and David F. Hendry, Forecasting Non-Stationary Economic Time
Series
- Michael P. Clements and David F. Hendry (eds.), Companion
to Economic Forecasting
- P. Collet, S. Martinez and B. Schmitt, "Asymptotic distribution of
tests for expanding maps of the interval", Ergodic Theory and Dynamical
Systems 24 (2004): 707--722 [Kolmogorov-Smironov-type
results for the empirical distribution under the invariant measure of a
dynamical system]
- Daniel Commenges and Anne Gegout-Petit, "Likelihood inference for
incompletely observed stochastic processes: ignorability conditions", math.ST/0507151 ["We define a
general coarsening model for stochastic processes. We decribe incomplete data
by means of sigma-fields and we give conditions of ignorability for likelihood
inference."]
- Colleen D. Cutler and Daniel T. Kaplan (eds.), Nonlinear
Dynamics and Time Series: Building a Bridge between the Natural and Statistical
Sciences
- Serguei Dachian, Yury A. Kutoyants
- "Hypotheses Testing: Poisson Versus Self-exciting", arxiv:0903.4636
= Scandinavian Journal of Statistics 33 (2006): 391
- "On the Goodness-of-Fit Tests for Some Continuous Time
Processes", arxiv:0903.4642 ["We
present a review of several results concerning the construction of the
Cramer-von Mises and Kolmogorov-Smirnov type goodness-of-fit tests for
continuous time processes. As the models we take a stochastic differential
equation with small noise, ergodic diffusion process, Poisson process and
self-exciting point processes"]
- Arnak Dalalyan and Markus Reiss, "Asymptotic statistical
equivalence for ergodic diffusions: the multidimensional case", math.ST/0505053
- Youri Davydov, "Remarks on Estimation Problem for Stationary
Processes in Continuous Time", Statistical Inference for Stochastic
Processes 4 (2001): 1--15
- D. Dehay and Yu. A. Kutoyants, "On confidence intervals for
distribution function and density of ergodic diffusion process", Journal of
Statistical Planning and Inference 124 (2004):
63--73
- Miguel A. Delgado, Javier Hidalgo and Carlos Velasco, "Distribution
free goodness-of-fit tests for linear
processes", math.ST/0603043
= Annals of
Statistics 33 (2005): 2568--2609 [i.e.,
goodness-of-fit for the autocorrelation function]
- Thomas G. Dietterich,
"Machine Learning for Sequential Data"
[PDF.
Thanks to Gustavo Lacerda for a pointer.]
- Dmitry Dolgopyat, Vadim Kaloshin, Leonid Koralov, "Sample path
properties of the stochastic flows," math.PR/0111011
- Randal Douc, Eric Moulines and Tobias Ryden, "Asymptotic properties
of the maximum likelihood estimator in autoregressive models with Markov
regime", Annals
of Statistics 32 (2004): 2254--2304 = math.ST/0503681
- K. Dzhaparidze, Parameter Estimation and Hypothesis Testing
in Spectral Analysis of Stationary Time Series
- Pierre Duchesne, "On Testing for Serial Correlation with a
Wavelet-Based Spectral Density Estimator in Multivariate Time Series", Econometric
Theory 22 (2006): 633--676
- Michael Eichler, "Graphical modelling of multivariate time
series", math.ST/0610654
- Jianqing Fan and Qiwei Yao, Nonlinear Time Series:
Nonparametric and Parametric Methods [Blurb]
- Yanqin Fan, Qi Li and Insik Min, "A Nonparametric Bootstrap Test of
Conditional Distributions", Econometric
Theoy 22 (2006): 587--613
- Enrique Figueroa-Lopez and Christian Houdre, "Nonparametric
estimation for Levy processes with a view towards mathematical finance", math.ST/0412351
- D. Florens and H. Pham, "Large Deviations in Estimation of an
Ornstein-Uhlenbeck Model," Journal of Applied Probability
36 (1999): 60--77
- Jurgen Franke, Jens-Peter Kreiss and Enno Mammen,
"Bootstrap of Kernel Smoothing in Nonlinear Time Series",
Bernoulli 8 (2002): 1--37
- Christian Francq and Jean-Michel Zakoian, "Bartlett's formula for
a general class of nonlinear processes", Journal of Time Series Analysis 30 (2009): 449--465
- Cheng-Der Fuh
- Philip Hans Franses and Dick Van Dijk, Non-Linear Time Series
Models in Empirical Finance
- T. D. Frank, "Delay Fokker-Planck equations, perturbation theory,
and data analysis for nonlinear stochastic systems with time delays",
Physical Review
E 71 (2005): 031106
- Roland Fried and Vanessa Didelez, "Latent variable analysis and
partial correlation graphs for multivariate time series", Statistics and
Probability Letters 73 (2005): 287--296
- Irene Gannaz and Olivier Wintenberger, "Adaptative density
estimation with dependent observations", math.ST/0510311 [Using wavelets
to estimate the invariant density of weakly-dependent processes, assumes
geometric ergodicity but not stationarity]
- Jiti Gao, Maxwell King, Zudi Lu and Dag Tjostheim, "Specification
testing in nonlinear and nonstationary time series autoregression",
Annals of Statistics 37 (2009): 3893--3928
- Basillis Gidas and Alejandro Murua, "Optimal transformations for
prediction in continuous-time stochastic processes: finite past and future", Probability Theory and
Related Fields 131 (2005): 479--492
- Ciprian Doru Giurcuaneanu and Jorma Rissanen, "Estimation of AR and
ARMA models by stochastic
complexity", math.ST/0702765
- Georg A. Gottwald and Ian Melbourne, "Testing for chaos in
deterministic systems with
noise", Physica
D 212 (2005): 100--110
- Janez Gradisek, Silke Siegert, Rudolf Friedrich and Igor Grabec,
"Analysis of time series from stochastic processes," Physical Review
E 62 (2000): 3146--3155
- Grassberger and Nadal (eds.), From Statistical Physics to
Statistical Inference and Back
- Grenander and Rosenblatt, Time Series
- David Gubbins, Time Series and Inverse Theory for
Geophysicists
- Laszlo Gyorfi et al., Nonparametric Curve Estimation from
Time Series
- Peter Hall, Soumendra Nath Lahiri and Jorg Polzehl,
"On Bandwidth Choice in Nonparametric Regression with Both Short- and
Long-Range Dependent Errors", Annals of Statistics
23 (1995): 1921--1936
- Wolfgang Hardle, Helmut Lutkepohl, Rong Chen, "A Review of Nonparametric Time Series Analysis", International Statistical Review
65 (1997): 49--72 [JSTOR]
- Jeffrey D. Hart, "Automated Kernel Smoothing of Dependent
Data by using Time Series Cross-Validation", Journal of the
Royal Statistical Society B 56 (1994): 529--542
[JSTOR]
- Andrew Harvey et al (eds.), State Space and Unobserved
Component Models: Theory and Applications
- Stefan Haufe, Guido Nolte, Klaus-Robert Mueller and Nicole Kraemer,
"Sparse Causal Discovery in Multivariate Time
Series", arxiv:0901.1234 [I am not
altogether happy with defining "causes" as "has a non-zero coefficient in a
vector autoregression"...]
- David Hendry, Econometrics: Alchemy or Science?
[Review by
Bruce Hansen]
- David F. Hendry and Bent Nielsen, Econometric Modeling: A
Likelihood Approach
[Blurb, preface,
ch.1 ]
- Junichi Hirukawa and Masanobu Taniguchi, "LAN theorem for
non-Gaussian locally stationary processes and its applications", Journal
of Statistical Planning and Inference 136 (2006):
640--688
- Jinh Hu, Wen-wen Tung, Jianbo Gao and Yinhe Cao, "Reliability of
the 0-1 test for
chaos", Physical
Review E 72 (2005): 056207 [On Gottwald
and Melbourne]
- Jianhua Z. Huang and Lijian Yang, "Identification
of Non-Linear Additive Autoregressive Models", Journal of
the Royal Statistical Society B 66 (2004): 463--477 [JSTOR. Proves
consistency under the assumption that the data-generating process
is strictly stationary and strongly mixing.]
- Stefano M. Iacus, "Statistical analysis of stochastic resonance
with ergodic diffusion noise," math.PR/0111153
- Massimiliano Ignaccolo, P. Allegrini, P. Grigolini, P. Hamilton and
Bruce J. West
- Ching-Kang Ing, Jin-Lung Lin, Shu-Hui Yu, "Toward optimal multistep
forecasts in non-stationary
autoregressions", Bernoulli 15 (2009): 402--437
= arxiv:0906.2266 ["Optimal"
assuming that you know you are facing a linear AR model.]
- Atsushi Inoue and Lutz Kilian, "In-sample or out-of-sample tests of
predictability: which one should we use?", European Central Bank Working Paper
[PDF]
- Akihiko Inoue and Yukio Kasahara, "Explicit representation of
finite predictor coefficients and its applications", math.ST/0405051 = Annals of
Statistics 34 (2006): 973--993
- D. A. Ioannides and D. P. Papanastassiou, "Estimating the
distribution function of a stationary process involving measurement
errors", Statistical Inference for Stochastic
Processes 4 (2001): 181--198
- E. L. Ionides, C. Breto and A. A. King, "Inference for nonlinear
dynamical
systems", Proceedings
of the National Academy of Sciences (USA) 103 (2006):
18438--18443
- S. Ishii and M.-A. Sato, "Reconstruction of chaotic dynamics by
on-line EM algorithm," Neural Networks 14
(2001): 1239--1256
- Joseph Tadjuidje Kamgaing, Hernando Ombao and Richard A. Davis,
"Autoregressive processes with data-driven regime switching",
Journal of Time Series Analysis 30
(2009): 505--533
- George Kapetanios and Massimiliano Marcellino, "A Comparison of
Estimation Methods for Dynamic Factor Models of Large Dimensions"
[PDF]
- Yan Karklin and Michael S. Lewicki, "A Hierarchical Bayesian Model
for Learning Nonlinear Statistical Regularities in Nonstationary Natural
Signals", Neural
Computation 17 (2005): 397--423
- Matthew B. Kennel, "Testing time symmetry in time series
using data compression dictionaries", Physical Review E
69 (2004): 056208
- Tae Yoon Kim and Sangyeol Lee, "Kernel density estimator for strong
mixing processes", Journal of
Statistical Planning and Inference 133 (2005):
273--284
- Jon Kleinberg, "Bursty and Hierarchical Structure in Streams"
[PDF]
- D. Kleinhans, R. Friedrich, "Maximum Likelihood Estimation of Drift
and Diffusion
Functions", physics/0611102
- D. Kleinhans, R. Friedrich, A. Nawroth and J. Peinke, "An iterative
procedure for the estimation of drift and diffusion coefficients of Langevin
processes", Physics Letters
A 346 (2005): 42--46
= physics/0502152 ["The
analysis is based on an iterative procedure minimizing the Kullback-Leibler
distance between measured and estimated two time joint probability
distributions of the process."]
- M. L. Kleptsyna and A. Le Breton, "Statistical Analysis of the
Fractional Ornstein-Uhlenbeck Type Process", Statistical Inference for
Stochastic Processes 5 (2002): 229--248
- Rahul Konnur, "Estimation of all model parameters of chaotic
systems from discrete scalar time series measurements", Physics Letters
A 346 (2005): 275--280
- D. Kugiumtzis, "Statically Transformed Autoregressive Process and
Surrogate Data Test for Nonlinearity," nlin.CD/0110025
- Uwe Küchler and Michael Sørensen, Exponential
Families of Stochastic Processes
- Hans R. Künsch, "State Space and Hidden Markov Models",
pp. 109--173 in Ole E. Barndorff-Nielsen, David R. Cox and Claudia
Klüppelberg (eds.), Complex Stochastic Systems
- Y. A. Kutoyants
- Statistical Inference for Ergodic Diffusion
Processes
- "On the Goodness-of-Fit Testing for Ergodic Diffusion Processes", arxiv:0903.4550
- "Goodness-of-Fit Tests for Perturbed Dynamical Systems", arxiv:0903.4612
- "On Properties of Estimators in non Regular Situations for Poisson Processes", arxiv:0903.4613
- B. Lacaze, "Errorless uniform sampling of complex stationary
processes," Signal
Processing 83 (2003): 913--917
- Stephen M. S. Lee and P. Y. Lai, "Improving coverage accuracy
of block bootstrap confidence intervals", arxiv:0804.4361
- J. Lember and A. Koloydenko, "Adjusted Viterbi training", math.ST/0406237
- Daniel Lemire, "A Better Alternative to Piecewise Linear Time
Series Segmentation", cs.DB/0605103
- N. N. Leonenko and L. M. Sakhno, "On the Whittle estimators for
some classes of continuous-parameter random processes and fields",
Statistics and
Probability Letters 76 (2006): 781--795
- J. K. Lindsey, Statistical Analysis of Stochastic Processes
in Time [old
draft in Postscript; data and R code]
- Shiqing Ling and Howell Tong, "Testing for a linear MA model
against threshold MA
models", math.ST/0603040
= Annals of
Statistics 33 (2005): 2529--2552
- Yu. N. Lin'kov, Asymptotic Statistical Methods for Stochastic
Processes
[Restricted to semi-martingales. blurb]
- E. Locherbach, "Likelihood Ratio Processes for Markovian Particle
Systems with Killing and Jumps", Statistical Inference for Stochastic
Processes 5 (2002): 153--177
- Wei Lu, Namrata Vaswani, "The Wiener-Khinchin Theorem for Non-wide
Sense stationary Random Processes" ["under certain assumptions, the power
spectral density (PSD) of any random process is equal to the Fourier transform
of the time-averaged autocorrelation function"]
- Xiaodong Luo, Tomomichi Nakamura and Michael Small, "Surrogate data
method applied to nonlinear time
series", nlin.CD/0603004
- Xiaodong Luo, Jie Zhang, Junfeng Sun, Michael Small, Irene Moroz,
"Asymptotically pivotal statistic for surrogate testing with extended
hypothesis", nlin.CD/0701008
- Xiaodong Luo, Jie Zhang and Michael Small, "Exact nonparametric
inference for detection of nonlinear determinism", nlin.CD/0507049 [More exactly,
this is an exact test for linear stochasticity --- rejecting the null indicates
either nonlinearity or determinism, or both.]
- Enno Mammen and Swagata Nandi, "Change of the nature of a test when
surrogate data are applied", Physical Review
E 70 (2004): 016121
- Heikki Mannila and Dmitry Rusakov, "Decomposition
of Event Sequences into Independent Components" [short
and long
versions in PS]
- T. K. March, S. C. Chapman and R. O. Dendy, "Recurrence plot
statistics and the effect of embedding", physics/0502042
- Pierre-Francois Marteau, "Time Warp Edit Distances with Stiffness
Adjustment for Time Series
Matching", cs.IR/0703033
- Norbert Marwan and Jurgen Kurths, "Nonlinear analysis of bivariate
data with cross recurrence plots," physics/0201061
- Norbert Marwan, M. Thiel, N. R. Nowaczyk, "Cross Recurrence Plot
Based Synchronization of Time Series," physics/0201062
- Norbert Marwan, N. Wessel, U. Meyerfeldt, A. Schirdewan, J. Kurths,
"Recurrence Plot Based Measures of Complexity and its Application to Heart
Rate Variability Data," physics/0201064
- Ikuo Matsuba, Hiroshi Takahashi and shinya Wakasa, "Stochastically Equivalent Dynamical System Approach to Nonlinear Deterministic Prediction",
International
Journal of Bifurcation and Chaos
16 (2006): 2721--2728 [I can't tell, from the abstract,
if they're proposing to use stochastic systems to predict deterministic ones
or vice versa; it'd be interesting either way!]
- Muneya Matsui, "A characterization of ARMA and Fractional ARIMA
models with infinitely divisible
innovations", math.ST/0703731
- Patrick E. McSharry and Leonard A. Smith, "Consistent nonlinear
dynamics: identifying model inadequacy", nlin.CD/0401024 = Physica
D 192 (2004): 1--22
- Javier R. Movellan, Paul Mineiro, and R. J. Williams, "A Monte
Carlo EM Approach for Partially Observable Diffusion Processes: Theory and
Applications to Neural Networks," Neural Computation
14 (20020: 1507--1544
- Eric Moulines, Pierre Priouret and Francois Roueff, "On recursive
estimation for time varying autoregressive
processes", math.ST/0603047
= Annals of
Statistics 33 (2005): 2610--2654
- Jose M. F. Moura and Sanjoy K. Mitter, "Identification
and Filtering: Optimal Recursive Maximum Likelihood Approach" [1986
technical report from MIT, found looking for something else, original URL now
lost --- presumably long since published]
- George V. Moustakides, "Sequential change detection
revisited", arxiv:0804.0741
= Annals of Statistics 36 (2008): 787--807
- Hans-Georg Muller and Ulrich Stadtmuller, "Generalized functional
linear models", math.ST/0505638 = Annals of
Statistics 33 (2005): 774--805 ["We propose a
generalized functional linear regression model for a regression situation where
the response variable is a scalar and the predictor is a random function. A
linear predictor is obtained by forming the scalar product of the predictor
function with a smooth parameter function, and the expected value of the
response is related to this linear predictor via a link function. If, in
addition, a variance function is specified, this leads to a functional
estimating equation which corresponds to maximizing a functional
quasi-likelihood. This general approach includes the special cases of the
functional linear model, as well as functional Poisson regression and
functional binomial regression. The latter leads to procedures for
classification and discrimination of stochastic processes and functional
data. ... An essential step in our proposal is dimension reduction by
approximating the predictor processes with a truncated Karhunen-Loeve
expansion."]
- Ursula U. Müller, Anton Schick and Wolfgang Wefelmeyer,
"Estimating the innovation distribution in nonparametric autoregression",
Probability Theory and Related Fields
144 (2009): 53--77 ["We prove a Bahadur representation for
a residual-based estimator of the innovation distribution function in a
nonparametric autoregressive model. The residuals are based on a local linear
smoother for the autoregression function."]
- Tomomichi Nakamura, Yoshito Hirata, and Michael Small, "Testing for
correlation structures in short-term variabilities with long-term trends of
multivariate time
series", Physical
Review E 74 (2006): 041114
- Tomomichi Nakamura, Xiaodong Luo, and Michael Small, "Testing for
nonlinearity in time series without the Fourier transform", Physical Review
E 72 (2005): 055201
- Tomomichi Nakamura and Michael Small, "Small-shuffle surrogate
data: Testing for dynamics in fluctuating data with
trends", Physical
Review E 72 (2005): 056216
- Ilia Negri, "Efficiency of a class of unbiased estimators for the
invariant distribution function of a diffusion
process", math.ST/0609590
- Jimmy Olsson, Olivier Cappe, Dandal Douc and Eric Moulines,
"Sequential Monte Carlo smoothing with application to parameter estimation in
non-linear state space
models", math.ST/0609514
- Sorinel Adrian Oprisan, "An application of the least-squares method
to system parameters extraction from experimental data", Chaos
12 (2002): 27--32
- Brahim Ouhbi and Nikolaos Limnios, "Nonparametric estimation for
semi-Markov processes based on its hazard rate functions", Statistical
Inference for Stochastic Processes 2 (1999): 151--173
- P. Palaniyandi and M. Lakshmanan, "Estimation of System Parameters
and Predicting the Flow Function from Time Series of Continuous Dynamical
Systems", nlin.CD/0406027
- Milan Palus, "Coarse-grained
entropy rate for characterization of complex time series", Physica
D 93 (1996): 64--77 [Thanks to Prof. Palus for a
reprint]
- Zacharias Psaradakis
- Zacharias Psaradakis, Martin Sola, Fabio Spagnolo and Nicola Spagnolo, "Selecting nonlinear time series models using information criteria",
Journal of
Time Series Analysis
30 (2009): 369--394
- N. U. Prabhu and Ishawar V. Basawa (eds.), Statistical
Inference in Stochastic Processes (1991)
- B. L. S. Prakasa Rao
- Semimartingales and Their Statistical
Inference
- Statistical Inference for Diffusion-Type
Processes
- E. Racca and A. Porporato, "Langevin equations from time series",
Physical Review
E 71 (2005): 027101
- Ali
Rahimi, Learning to Transform Time Series with a Few Examples
[Ph.D. thesis, MIT dept. of electrical engineering and computer science,
2005. PDF]
- Ali Rahimi, Ben Recht and Trevor Darrell, "Learning to Transform
Time Series with a Few Examples", tech report
[PDF]
- M. M. Rao, Stochastic Processes: Inference Theory
- Ramiro Rico-Martinez, K. Krischer, G. Flaetgen, J.S. Anderson and
I. G. Kevrekidis, "Adaptive Detection of Instabilities: An Experimental
Feasibility Study," nlin.CD/0202057
- Christoph Rieke, Ralph G. Andrzejak, Florian Mormann and Klaus
Lehnertz, "Improved statistical test for nonstationarity using recurrence time
statistics", Physical Review E 69 (2004): 046111
[link]
- John C. Robertson, Ellis W. Tallman and Charles H. Whiteman,
"Forecasting using relative entropy," Federal Reserve Bank of Atlanta Working
Paper 2002-20
[PDF]
- J. W. C. Robinson, J. Rung, A. R. Bulsara and M. E. Inchiosa,
"General measures for signal-noise separation in nonlinear dynamical
systems," Physical Review E 63 2000: 011107
- G. G. Roussas, Contiguity of Probability Measures: Some
Applications in Statistics [1972; "established a modern and elegant
approach for statistical analysis of a Markov proces. Using the concept of
contiguity and LAN {local asymptotic normality}, a concept which goes back to
LeCam, he studied asymptotic optimality of sequences of estimators and tests.
The description is systematic and mathematically rigorous." --- Taniguchi and
Kakizawa, p. 63]
- Boris Ryabko and Jaakko Astola
- "Universal Codes as a Basis for Time Series
Testing", cs.IT/0602084
- "Universal Codes as a Basis for Nonparametric Testing of
Serial Independence for Time Series", cs.IT/0506094
- Daniil Ryabko, "Characterizing predictable classes of processes",
arxiv:0905.4341
- Nicola Scafeta, Patti Hamilton and Paolo Grigolini, "The
Thermodynamics of Social Processes: The Teen Birth Phenomenon," cond-mat/0009020 [Not because
I believe them about sociology, but because they claim to have new and powerful
nonparametric methods for detecting and quantifying memory in time series]
- Thomas Schreiber and Andreas Schmitz, "Surrogate time series," chao-dyn/9909037
- Reiner Schulz and James A. Reggia, "Temporally Asymmetric Learning Supports Sequence Processing in Multi-Winner Self-Organizing Maps",
Neural
Computation 16 (2004): 535--561 [the "model
presented here raises the possibility that SOMs may ultimately prove useful as
visualization tools for temporal sequences and as preprocessors for sequence
pattern recognition systems."]
- Xiaofeng Shao, Wei Biao Wu, "Asymptotic spectral theory for
nonlinear time
series", math.ST/0611029
- Olimjon Sh. Sharipov and Martin Wendler, "Bootstrap
for the Sample Mean and for U-Statistics of Stationary Processes',
arxiv:0911.3083
- M. Siefert, J. Peinke and R. Friedrich, "On a quantitative method
to analyse dynamical and measurement noise," physics/0108034
- Przemyslaw Sliwa and Wolfgang Schmid, "Monitoring the
cross-covariances of a multivariate time series", Metrika
61 (2005): 89--115
- A. Sitz, U. Schwarz, J. Kurths, H. U. Voss, "Estimation of
parameters and unobserved components for nonlinear systems from noisy time
series," Physical Review E 66 (2002): 016210
- Michael Small
- Applied Nonlinear Time Series Analysis:
Applications in Physics, Physiology and Finance
- "Optimal time delay embedding for nonlinear time
series modeling", nlin.CD/0312011
- Michael Small and Kevin Judd, "Detecting periodicity in
experimental data using linear modeling techniques", physics/9810021
- Vadim N. Smelyanskiy and Dmitry G. Luchinsky, "Inference of
stochastic nonlinear oscillators with applications to physiological problems",
physics/0403121 [They
present this as a Bayesian inference issue, but the core of their work appears,
from skimming, to be an efficient method for computing the likelihood, so it'd
apply equally well to maximum likelihood inference, for instance.]
- V. N. Smelyanskiy, D. A. Timucin, A. Bandrivskyy and D. G.
Luchinsky, "Model reconstruction of nonlinear dynamical systems driven by
noise", physics/0310062
[Same as earlier paper --- was this one submitted to PRL?]
- Dmitry A. Smirnov, Vladislav S. Vlaskin and Vladimir I.
Ponomarenko, "Estimation of parameters in one-dimensional maps from noisy
chaotic time series", Physics Letters
A 336 (2005): 448--458
- Eduardo D. Sontag, "For differential equations with r parameters,
2r+1 experiments are enough for identification," math.DS/0111135
- D. Sornette and V. F. Pisarenko, "Properties of a simple bilinear
stochastic model: estimation and predictability", physics/0703217
- Tomoya Suzuki, Tohru Ikeguchi, and Masuo Suzuki, "Effects of data
windows on the methods of surrogate data", Physical Review
E 71 (2005): 056708
- Alexander G. Tartakovsky, "Asymptotic Optimality of Certain
Multihypothesis Sequential Tests: Non-i.i.d. Case", Statistical Inference
for Stochastic Processes 1 (1998): 265--295
- Marco Thiel, M. Carmen Romano and Jurgen Kurths, "How much
information is contained in a recurrence plot?", Physics Letters
A 330 (2004): 343--349
- Tina Toni, David Welch, Natalja Strelkowa, Andreas Ipsen, Michael P.H. Stumpf, "Approximate Bayesian computation scheme for parameter inference and model selection in dynamical systems", arxiv:0901.1925
- Wilson Truccolo, John P. Donoghue, "Nonparametric Modeling of
Neural Point Processes via Stochastic Gradient Boosting Regression", Neural
Computation 19 (2007): 672-705
- Ciprian A. Tuder and Frederi G. Viens, "Statistical Aspects of the
Fractional Stochastic
Calculus", math.ST/0609295
- Masayuki Uchida and Nakahiro Yoshida, "Information Criteria in
Model Selection for Mixing Processes", Statistical Inference for
Stochastic Processes 4 (2001): 73--98 ["The emphasis is
put on the use of the asymptotic expansion of the distribution of an estimator
based on the conditional Kullback-Leibler divergence for stochastic processes.
Asymptotic properties of information criteria and their improvement are
discusssed."]
- Aad van der Vaart and Harry van Zanten, "Donsker theorems for
diffusions: Necessary and sufficient conditions", math.PR/0507412 = Annals of
Probability 33 (2005): 1422--1451
- Harry van Zanten, "On Uniform Laws of Large Numbers for Ergodic
Diffusions and Consistency of Estimators", Statistical Inference for
Stochastic Processes 6 (2003): 199--213 ["In contrast
with uniform laws of large numbers for i.i.d. random variables, we do not need
conditions on the 'size' of the class [of functions] in terms of bracketing or
covering numbers. The result is a consequence of a number of asymptotic
properties of diffusion local time that we derive."]
- J. H. van Zanten, "On the Uniform Convergence of the Empirical
Density of an Ergodic Diffusion", Statistical Inference for
Stochastic Processes 3 (2000): 251--262
- P. F. Verdes, P. M. Granitto and H. A. Ceccatto, "Overembedding
Method for Modeling Nonstationary Systems", Physical Review
Letters 96 (2006): 118701
- Juan M. Vilar-Fernandez and Ricardo Cao, "Nonparametric Forecasting
in Time Series - A Comparative Study", Communications in
Statistics: Simulation and Computation 36 (2007):
311--334
- R. Vilela Mendes, R. Lima and T. Araujo, "A Process-Reconstruction
Analysis of Market Fluctuations," cond-mat/0102301 [I don't care
about the market, but they claim to have a new method for identifying
distributions over entire sequences]
- Zijun Wang, "Finite Sample Performances of the Model Selection Approach in Nonparametric Model Specification for Time Series", Communications in Statistics: Theory and Methods 38
(2009): 2302--2330
- Halbert White
- Asymptotic Theory for Econometricians
- Estimation, Inference and Specification
Analysis [Or, what shall we do with a mis-specified model?]
- Wei Biao Wu and
Jan Mielniczuk, "Kernel Density Estimation for Linear Processes", Annals
of Statistics 30 (2002): 1441--1459 [PDF]
- Herwig Wendt, Patrice Abry and Stephane Jaffard, "Bootstrap for
Empirical Multifractal Analysis", IEEE Signal Processing Magazine
July 2007, pp. 38--48 [+ technical papers by these authors]
- A. Zeileis and G. Grothendieck, "zoo: S3 Infrastructure for Regular
and Irregular Time Series", Journal of Statistical
Software 14 (2005): 1--27 = math.ST/0505527 ["zoo is an R
package providing an S3 class with methods for indexed totally ordered
observations, such as discrete irregular time series. Its key design goals are
independence of a particular index/time/date class and consistency with base R
and the "ts" class for regular time series."]
- M. Zukovic, D. T. Hristopulos, "Spartan Random Processes in Time
Series Modeling", 0709.3418
To write/finish:
- CRS, "Learning Rates and Recurrence Times" [a.k.a. "Wait and see"]
- CRS, "Algorithms for Inferring the Statistical Structure of Symbol
Sequences: History and Review"
#
Model Selection
(Reader, please make your own suitably awful pun about the different senses
of "model selection" here, as a discouragement to those finding this page
through prurient searching. Thank you.)
In statistics
and machine learning, "model
selection" is the problem of picking among different mathematical models which
all purport to describe the same data set. This notebook will not (for now)
give advice on it; as usual, it's more of a place to organize my thoughts and
references...
Classification of approaches to model selection (probably not really
exhaustive but I can't think of others, right now):
- Direct optimization of some measure of goodness of fit or risk on training
data.
- Seems implicit in a lot of work which points to marginal improvements in
"the proportion of variance explained", mis-classification rates, "perplexity",
etc. Often, also, a recipe for over-fitting and chasing snarks. What's wanted
is (almost always) some way of measuring the ability to generalize to new data,
and in-sample performance is a biased estimate of this. Still,
with enough data, if the gods
of ergodicity are kind, in-sample performance
is representative of generalization performance, so perhaps this will work
asymptotically, though in many cases the researcher will never even glimpse
Asymptopia across the Jordan.
- Optimize fit with model-dependent penalty
- Add on a term to each model which supposed indicates its ability to
over-fit. (Adjusted R^2, AIC, BIC, ..., all do this in terms of the number of
parameters.) Sounds reasonable, but I wonder how many actually work better, in
practice, than direct optimization. (See Domingos for some depressing evidence
on this score.)
- Classical two-part minimum description length
methods were penalties; I don't yet understand one-part MDL.
- Penalties which depend on the model class
- Measure the capacity of a class of models to over-fit;
penalize all models in that class accordingly, regardless of their
individual properties. Outstanding example: Vapnik's "structural risk
minimization" (provably consistent under some circumstances). Only
sporadically coincides with *IC-type penalties based on the number of
parameters.
- Cross-validation
- Estimate the ability to generalize to different data by, in fact, using
different data. Maybe the "industry standard" of machine learning. Query, how
are we to know how much different data to use?
- Query, how are we to cross-validate when we have complex, relational data?
That is, I understand how to do it for independent samples, and I even
understand how to do it for time series, but I
do not understand how to do it
for networks, and I don't think I am
alone in this. (Well, I understand how to do it for Erdos-Renyi networks,
because that's back to independent samples...)
- The method of sieves
- Directly optimize the fit, but within a constrained
class of models; relax the constraint as the amount of data grows. If the
constraint is relaxed slowly enough, should converge on the truth. (Ordinary
parametric inference, within a single model class, is a limiting case where the
constraint is relaxed infinitely slowly, and we converge on the pseudo-truth
within that class [provided we have a consistent estimator].)
- Encompassing models
- The sampling distribution of any estimator of any model class is a function
of the true distribution. If the true model clss has been well-estimated, it
should be able to predict what other, wrong model classes will
estimate, but not vice versa. In this sense the true model class "encompasses
the predictions" of the wrong ones. ("Truth is the criterion both of itself
and of error.")
- General or covering models
- Come up with a single model class which includes all the interesting model
classes as special cases; do ordinary estimation within it. Getting a
consistent estimator of the additional parameters this introduces is often
non-trivial, and interpretability can be a problem.
- Model averaging
- Don't try to pick the best or correct model; use them all with different
weights. Chose the weighting scheme so that if one is best, it will tend to be
more and more influential. Often I think the improvement is not so much from
using multiple models as from smoothing, since estimates of
the single best model are going to be more noisy than estimates
of a bunch of models which are all pretty good. (This leads
to ensemble methods.)
- Adequacy testing
- The correct model should be able to encode the data as uniform IID noise.
Test whether "residuals", in the appropriate sense, are IID uniform. Reject
models which can't hack it. Possibly none of the models on offer is adequate;
this, too, is informative. Or: models make specific probabilistic assumptions
(IID Gaussian noise, for example); test those. Mis-specification testing.
The machine-learning-ish literature on model selection doesn't seem to ever
talk about setting up experiments to select among models; or do I just not read
the right papers there? (The statistical literature on experimental design
tends to talk about "model discrimination" rather than "model selection".)
Recommended, big-picture:
- Leo Breiman, "Heuristics of Instability and Stabilization in Model
Selection," Annals of Statistics 24 (1996):
2350--2383
- Gerda Claeskens and Nils Lid Hjort, Model Selection
and Model Averaging
- Pedro
Domingos, "The Role of Occam's Razor in Knowledge Discovery," Data
Mining and Knowledge Discovery, 3 (1999) [Online]
- Trever Hastie, Robert Tibshirani and Jerome Friedman, The
Elements of Statistical Learning: Data Mining, Inference, and Prediction
- C. R. Rao, Y. Wu, Sadanori Konishi and Rahul Mukerjee, "On Model
Selection", in P. Lahiri (ed.), Model Selection, pp. 1--64
[Thorough review paper, if from a rather old-school statistical-theory
perspective. The rest of the volume is too Bayesian to be of interest to
me. JSTOR]
- Aris Spanos, "Curve-Fitting, the Reliability of Inductive
Inference and the Error-Statistical Approach" [PDF
preprint]
- V. N. (=Vladimir Naumovich) Vapnik, The Nature of
Statistical Learning Theory [Review:
A Useful Biased Estimator]
- Quang H. Vuong, "Likelihood Ratio Tests for Model Selection and
Non-Nested Hypotheses", Econometrica 57 (1989):
307--333
Recommended, close-ups:
- Sylvain Arlot, "V-fold cross-validation improved: V-fold
penalization",
arxiv:0802.0566 [Seeing
cross-validation as a penalization method, and improving it accordingly by
strengthening the penalty term]
- A. C. Atkinson and A. N. Donev, Optimum Experimental
Design [Review]
- Leo Breiman and Philip Spector, "Submodel Selection and Evaluation
in Regression: The X-Random Case", International
Statistical Review 60 (1992): 291--319
[JSTOR]
- Prabir Burman, Edmond Chow and Deborah Nolan, "A cross-validatory method for dependent data", Biometrika 81 (1994): 351--358 [JSTOR]
- Patrick S. Carmack, William R. Schucany, Jeffrey S. Spence, Richard
F. Gunst, Qihua Lin and Robert W. Haley, "Far Casting Cross Validation"
[Leave-one-out CV, with a constant-radius window skipped around each hold-out
point as well; this is designed to deal with correlations in time or in
space. PDF
preprint]
- Nicolo Cesa-Bianchi and Gabor Lugosi, Prediction, Learning,
and Games
[Mini-review. For
avoiding model selection in favor of adaptively-weighted combinations of
models.]
- Snigdhansu Chatterjee, Nitai D. Mukhopadhyay, "Risk and resampling
under model
uncertainty", arxiv:0805.3244 [an
interesting approach to model averaging with provably good frequentist
properties, via bootstrapping --- for a trivial linear-Gaussian problem; not
clear to me how to generalize]
- Bruce E. Hansen, "Challenges for Econometric Model
Selection", Econometric Theory 21 (2005): 60--68
["Standard econometric model selection methods are based on four fundamental
errors in approach: parametric vision, the assumption of a true
[data-generating process], evaluation based on fit, and ignoring the impact of
model uncertainty on inference. Instead, econometric model selection methods
should be based on a semiparametric vision, models should be viewed as
approximations, models should be evaluated based on their purpose, and model
uncertainty should be incorporated into inference
methods." PDF]
- Marcus Hutter, "The Loss Rank Principle for Model Selection",
math.ST/0702804 [This is a
simplified form of Deborah Mayo's
"severity".]
- Pascal Lavergne and Quang H. Vuong, "Nonparametric Selection of
Regressors: The Nonnested Case", Econometrica 64
(1996): 207--219 [Picking which variables belong in a regression, by looking at
the error of non-parametric kernel
regressions. JSTOR]
- Charles Mitchell and Sara van de Geer, "General Oracle Inequalities
for Model
Selection", Electronic
Journal of Statistics 3 (2009): 176--204 [Analyzes
a data-set splitting scheme (like cross-validation with only one "fold")]
- Jeffrey S. Racine
- "Feasible Cross-Validatory Model Selection for General Stationary Processes", Journal of Applied Econometrics
12 (1997): 169--179
[JSTOR. This is closely
related to (maybe algebraically just a special case of?) the familiar trick
from splines of writing the CV criterion in terms of the
hat/influence/projection matrix.]
- "Consistent cross-validatory model-selection for dependent
data: hv-block
cross-validation", Journal
of Econometrics 99 (2000): 39--61
- David J. Spiegelhalter, Nicola G. Best, Bradley P. Carlin and
Angelika van der Linde, "Bayesian Measures of Model Complexity and
Fit", Journal of the Royal Statistical Society
B 64 (2002): 583--639
[PDF reprint]
- Ryan J. Tibshirani and Robert Tibshirani, "A bias correction for
the minimum error rate in
cross-validation", Annals
of Applied Statistics 3 (2009): 822--829
= arxiv:0908.2904
- Sara van de Geer, Empirical Process Theory in
M-Estimation
- Mark J. van der Laan and Sandrine Dudoit, "Unified Cross-Validation
Methodology for Selection Among Estimators and a General Cross-Validated
Adaptive Epsilon-Net Estimator: Finite Sample Oracle Inequalities and Examples"
[PDF working paper,
i.e., a 100-page tome. The first part proves that multi-fold cross-validation
and the like will work for selecting the best estimator out of a finite set of
estimators (provided the loss function is nicely bounded and the data are IID).
The second part ingeniously turns this into a complete estimation procedure, by
effectively creating a discrete sieve and then using CV to say which part of
the sieve to use. This is a very cool set of results, but (1) the limitations
to bounded loss functions make me nervous, and (2) the formulas appearing in
the finite-sample and even asymptotic bounds are ugly. On the other
hand, they have finite-sample bounds! — I wonder if the
bounded-and-IID restrictions could be lifted using the techniques in Jiang's
"On Uniform Deviation Bounds" (link and description
under Learning Theory), or those
in Dedecker et al.'s Weak
Dependence.]
- Aad W. van der Vaart, Sandrine Dudoit and Mark J. van der Laan,
"Oracle inequalities for multi-fold cross
validation", Statistics
and Decisions 24 (2006): 351--371 [Thanks to Prof.
van der Vaart for a reprint]
To read:
- Sylvain Arlot
- "Model selection by resampling penalization",
arxiv:0906.3124 =
Electronic Journal of Statistics 3 (2009):
557--624
- "Suboptimality of penalties proportional to the dimension
for model selection in heteroscedastic regression", arxiv:0812.3141
- Sylvain Arlot and Pascal Massart, "Data-driven Calibration of
Penalties for Least-Squares
Regression", Journal
of Machine Learning Research 10 (2009): 245--279
- Maria Maddalena Barbieri and James O. Berger, "Optimal Predictive
Model Selection", math.ST/0406464 = Annals
of Statistics 32 (2004): 870--897 [Unfortunately,
Bayesian]
- Andrew Barron, Lucien Birgé, and Pascal Massart, "Risk
bounds for model selection via penalization", Probability Theory and
Related Fields<./cite> 113 (1999): 301--413
- Lucien Birgé
- "The Brouwer Lecture 2005: Statistical estimation with
model
selection", math.ST/0605187
- "Model selection for Poisson processes",
math/0609549
- Lucien Birgŕ and Pascal Massart
"Minimal Penalties for Gaussian
Model Selection", Probability Theory and
Related Fields 138 (2007): 33--73
- "From model selection to adaptive estimation", pp. 55--87
in Pollard, Torgersen and Yang (eds.), Fetschrift for Lucien Le Cam:
Research Papers in Probability and Statistics (1997)
- Borowiak, Model Discrimination for Nonlinear Regression
Models
- P. Burman, "A comparative study of ordinary cross-validation,
v-fold cross-validation and the repeated learning-testing methods",
Biometrika 76 (1989): 503--514
- Alain Celisse, "Model selection in density estimation via
cross-validation", arxiv:0811.0802
- A. E. Clark and C. G. Troskie, "Time Series and Model Selection",
Communications in Statistics: Simulation and computing
37 (2008): 766--771 [Simulation study of the accuracy of
different information criteria]
- Kevin A. Clarke, "A Simple Distribution-Free
Test for Nonnested Hypotheses" [PDF preprint]
- Guilhem Coq, Olivier Alata, Marc Arnaudon and Christian Olivier,
"An improved method for model selection based on Information Criteria",
math.ST/0702540
- Pedro Domingos
- "Process-Oriented Estimation of Generalization Error" [PDF]
- "A Process-Oriented Heuristic for Model Selection"
[PDF]
- Sandrine Dudoit and Mark J. van der Laan, "Asymptotics of Cross-Validated Risk Estimation in Estimator Selection and Performance Assessment",
Statistical Methodology 2 (2005): 131--154
[preprint]
- Hugo Jair Escalante, Manuel Montes, Luis Enrique Sucar, "Particle
Swarm Model Selection",
Journal
of Machine Learning Research 10 (2009): 405--440
- Jianqing Fan and Runze Li, "Variable Selection via Nonconcave
Penalized Likelihood and its Oracle Properties", Journal of
the American Statistical Association 96 (2001): 1348--1360 [PDF reprint via Prof. Fan]
- Magalie Fromont, "Model selection by bootstrap penalization for
classification", Machine
Learning
66 (2007): 165--207
- Christophe Giraud, "Estimation of Gaussian graphs by model
selection", arxiv:0710.2044
- Alexander Goldenshluger and Eitan Greenshtein, "Asymptotically
minimax regret procedures in regression model selection and the magnitude of
the dimension
penalty", Annals of
Statistics 28 (2000): 1620--1637 [Hmmm. Not sure
how relevant this will be to anything I'd need to do, given the assumptions
they load on. Via Kevin Kelly.]
- Christian Gourieroux and Alain Monfort, "Testing, Encompassing, and
Simulating Dynamic Econometric Models", Econometric Theory
11 (1995): 195--228 [JSTOR]
- Michael Kearns and Dana Ron, "Algorithmic Stability and
Sanity-Check Bounds for Leave-One-Out Cross-Validation," Neural
Computation 11 (1999): 1427--1453
- Nicholas M. Kiefer and Hwan-Sik Choi, "Robust Model Selection in
Dynamic Models with an Application to Comparing Predictive Accuracy"
[SSRN]
- Sadanori Konishi and Genshiro Kitagawa, "Asymptotic theory for
information crteria in model selection --- functional approach," Journal of
Statistical Planning and Inference 114 (2003):
45--61
- Hannes Leeb,
"Conditional Predictive Inference Post Model Selection", Annals of
Statistics 37 (2009): 2838--2876
= arxiv:0908.3615 [I heard Leeb
give a talk on this, but I should read the paper]
- Hannes Leeb and Benedikt M. Poetscher
- F. Liang and A. Barron, "Exact Minimax Strategies for Predictive
Density Estimation, Data Compression, and Model Selection", IEEE Transactions on
Information Theory 50 (2004): 2708--2726
- Pascal Massart, Concentration Inequalities and Model
Selection
[PDF preprint
version (large!)]
- Abraham Meidan and Boris Levin, "Choosing from Competing Theories
in Computerised Learning", Minds and Machines 12
(2002): 119--129
- Nicolai Meinshausen and Peter Buehlmann, "Stability Selection",
arxiv:0809.2932 ["Estimation of
structure, such as in graphical modeling, cluster analysis or variable
selection, is notoriously difficult, especially for high-dimensional data. We
introduce the new method of stability selection."]
- Grayham E. Mizon and Massimiliano Marcellino (eds.),
Progressive Modelling: Non-nested Testing and Encompassing
[Blurb, table of contents]
- Ali Mohammad-Djafari, "Model selection for inverse problems: Best
choice of basis functions and model order selection," physics/0111020
- M. Pavlic and M. J. van der Laan, "Fitting of mixtures with
unspecified number of components using cross validation distance
estimate", Computational Statistics and Data
Analysis 41 (2003): 413--428
- Zacharias Psaradakis, Martin Sola, Fabio Spagnolo and Nicola Spagnolo, "Selecting nonlinear time series models using information criteria",
Journal of
Time Series Analysis
30 (2009): 369--394
- Pradeep Ravikumar, Martin J. Wainwright, John D. Lafferty,
"High-Dimensional Graphical Model Selection Using $\ell_1$-Regularized Logistic
Regression", arxiv:0804.4202
- Douglas Rivers and Quang H. Vuong, "Model selection tests for
nonlinear dynamic
models", The
Econometrics Journal
5 (2002): 1--39
- Yiyuan She, "Thresholding-based Iterative Selection
Procedures for Model Selection and Shrinkage", arxiv:0812.5061
- David Shilane, Richard H. Liang and Sandrine Dudoit, "Loss-Based
Estimation with Evolutionary Algorithms and Cross-Validation",
UC Berkeley Biostatistics Working Paper 227 [Abstract, PDF]
- Aris Spanos
- "Statistical Induction, Severe Testing, and Model
Validation" [Preprint]
- "Statistical Model Specification vs. Model Selection: Akaike-type Criteria and the Reliability of Inference" [preprint kindly
provided by Prof. Spanos]
- Tina Toni and Michael P. H. Stumpf
- "Parameter Inference and
Model Selection in Signaling Pathway Models", arxiv:0905.4468
- "Simulation-based model selection for dynamical systems in systems and population biology", arxiv:0911.1705
- Masayuki Uchida and Nakahiro Yoshida, "Information Criteria in
Model Selection for Mixing Processes", Statistical Inference for
Stochastic Processes 4 (2001): 73--98 ["The emphasis is
put on the use of the asymptotic expansion of the distribution of an estimator
based on the conditional Kullback-Leibler divergence for stochastic processes.
Asymptotic properties of information criteria and their improvement are
discussed."]
- Tim van Erven, Peter Grunwald and Steven de Rooij, "Catching Up Faster by Switching Sooner: A Prequential Solution to the AIC-BIC Dilemma", arxiv:0807.1005
- Geert Verbeke, Geert Molenberghs, Caroline Beunckens, "Formal and
Informal Model Selection with Incomplete Data", Statistical
Science 23 (2008): 201--218
= arxiv:0808.3587
- Zijun Wang, "Finite Sample Performances of the Model Selection Approach in Nonparametric Model Specification for Time Series", Communications in Statistics: Theory and Methods 38
(2009): 2302--2330
- Hirokazu Yanagiharaa and Chihiro Ohmoto, "On distribution of AIC in
linear regression models", Journal of
Statistical Planning and Inference 133 (2005):
417--433
- Peng Zhau and Bin Yu, "On Model Selection Consistency of Lasso",
Journal
of Machine Learning Research 7 (2006): 2541--2563
#
Ensemble Methods in Machine Learning
Boosting, bagging, binning, stacking, mixtures of experts, ...
Value of diversity.
See also:
Collective Cognition;
Learning Theory;
Model Selection
Recommended (totally inadequate, what happened to come to mind cleaning
up my files):
- Sanjeev Arora, Elad Hazan and Satyen Kale, "The Multiplicative
Weights Update Method: a Meta Algorithm and Applications "
[PDF
preprint. This is an interesting kind of result, which promises
performance which comes to close that achieved by any strategy within a fixed
class, no matter what sequence of data is observed --- but it's
performance on that sequence, which, as the saying goes, "is no
guarantee of future results". Cesa-Bianchi and Lugosi's book has a lot more
along these lines.]
- Nicolo Cesa-Bianchi and Gabor Lugosi, Prediction, Learning,
and Games [Mini-review]
- Gerda Claeskens and Nils Lid Hjort, Model Selection
and Model Averaging
- Pedro
Domingos, "The Role of Occam's Razor in Knowledge Discovery," Data
Mining and Knowledge Discovery, 3 (1999) [Online.
Ensemble methods as an apparent violation of Occam's Razor.]
- A. Juditsky, P. Rigollet, A. B. Tsybakov, "Learning by mirror averaging", arxiv:math/0511468 =
Annals of Statistics 36 (2008): 2183--2206
- G. Langer and U. Parlitz, "Modeling parameter dependence from time
series", Physical
Review E 70 (2004): 056217 [Interesting use of
ensemble methods in state space modeling]
- Laurence K. Saul and Michael I. Jordan, "Mixed Memory Markov
Models: Decomposing Complex Stochastic Processes as Mixtures of Simpler
Ones", Machine Learning 37 (1999): 75--87
- Robert E. Schapire, Yoav Freund, Peter Bartlett and Wee Sun Lee,
"Boosting the Margin: A New Explanation for the Effectiveness of Voting
Methods", Annals of Statistics 26 (1998):
1651--1686
To read:
- Ran Avnimelech and Nathan Intrator, "Boosted Mixture of Experts: An
Ensemble Learning Scheme", Neural Computation 11
(1999): 483--497
- Larry M. Bartels, "Specification Uncertainty and Model
Averaging", American Journal of Political
Science 41 (1997): 641--674
- Gérard Biau, Luc Devroye and Gábor Lugosi,
"Consistency of Random Forests and Other Averaging Classifiers",
Journal of Machine Learning Research 9
(2008): 2015--2033 ["In the last years of his life, Leo Breiman promoted
random forests for use in classification. He suggested using averaging as a
means of obtaining good discrimination rules. The base classifiers used for
averaging are simple and randomized, often based on random samples from the
data. He left a few questions unanswered regarding the consistency of such
rules. In this paper, we give a number of theorems that establish the universal
consistency of averaging rules. We also show that some popular classifiers,
including one suggested by Breiman, are not universally consistent."]
- Gavin Brown, Jeremy L. Wyatt and Pter Tino, "Managing Diversity
in Regression Ensembles", Journal of Machine Learning Research
6 (2005): 1621--1650
- Bruno Caprile, Cesare Furlanello and Stefano Merler, "The Dynamics
of AdaBoost Weights Tells You What's Hard to Classify," cs.LG/0201014
- Zhuo Chen and Yuhong Yan, "Time Series Models for Forecasting:
Testing or Combining?", Studies in Nonlinear Dynamics and
Econometrics 11:1 (2007): 3
- M. Di Marzio and C. C. Taylor, "Kernel density classification and
boosting: an L2 analysis", Statistics and
Computing 15 (2005): 113--123
- Yoav Freund, Yishay Mansour and Robert E. Schapire, "Generalization
bounds for averaged classifiers", Annals of
Statistics 32 (2004): 1698--1722 = math.ST/0410092
- Yoav Freund, Robert E. Schapire, Yoram Singer and Manfred
K. Warmuth, "Using and combining predictors that specialize" [PDF
preprint]
- Jerome H. Friedman, Bogdan E. Popescu, "Predictive learning via rule ensembles", arxiv:0811.1679
- G. Fumera and F. Roli, "A Theoretical and Experimental Analysis of
Linear Combiners for Multiple Classifier Systems", IEEE Transactions on
Pattern Analysis and Machine Intelligence 27 (2005):
942--956
- Nicolas Garcia-Pedrajas, Cesar Garcia-Osorio and Colin Fyfe,
"Nonlinear Boosting Projections for Ensemble Construction",
Journal
of Machine Learning Research 8 (2007): 1--33
- Etienne Grossmann, "A Theory of Probabilistic Boosting, Decision
Trees and Matryoshki", cs.LG/0607110
- Jakob Vogdrup Hansen, Combining Predictors: Meta Machine
Learning Methods and Bias/Variance & Ambiguity Decompositions [Ph.D.
thesis, University of Aarhus, 2000;
on-line]
- Geoffrey E. Hinton, "Training Products of Experts by Minimizing
Contrastive Divergence," Neural Computation 14
(2002): 1771--1800.
- Marcus Hutter and Jan Poland, "Adaptive Online Prediction by
Following the Perturbed Leader", cs.AI/0504078 = Journal of
Machine Learning Research 6 (2005): 639--660
- Robert A. Jacobs, "Bias/Variance Analyses of Mixtures-of-Experts
Architectures", Neural Computation 9 (1997):
369--383 ["This article investigates the bias and variance of
mixtures-of-experts (ME) architectures. The variance of an ME architecture can
be expressed as the sum of two terms: the first term is related to the
variances of the expert networks that comprise the architecture and the second
term is related to the expert networks' covariances. One goal of this article
is to study and quantify a number of properties of ME architectures via the
metrics of bias and variance. A second goal is to clarify the relationships
between this class of systems and other systems that have recently been
proposed. It is shown that in contrast to systems that produce unbiased experts
whose estimation errors are uncorrelated, ME architectures produce biased
experts whose estimates are negatively correlated."]
- Wenxin Jiang, "Boosting with Noisy Data: Some Views from
Statistical
Theory", Neural
Computation 16 (2004): 789--810
- Ludmila I. Kuncheva, Combining Pattern Classifiers: Methods
and Algorithms
- Nicole Kraemer, "Boosting for Functional
Data", math.ST/0605751
- Guillaume Lecu&eaucte;, "Lower Bounds and Aggregation in Density
Estimation", Journal of
Machine Learning Research
7 (2006): 971--981
- David Mease, Abraham J. Wyner and Andreas Buja, "Boosted
Classification Trees and Class Probability/Quantile
Estimation", Journal
of Machine Learning Research 8 (2007): 409--439
- Nicolai Meinshausen, "Forest
Garrote", arxiv:0906.3590
- David J. Miller and Siddharth Pal, "Transductive Methods for the
Distributed Ensemble Classification Problem", Neural
Computation 19 (2007): 856--884
- Seiji Miyoshi, Kazuyuki Hara, and Masato Okada, "Analysis of
ensemble learning using simple perceptrons based on online learning theory",
Physical Review
E 71 (2005): 036116
- L. Nunes and E. Oliveira, "On Learning by Exchanging
Advice," cs.LG/0203010
- Frenando C. Pereira and Yoram Singer, "An Efficient Extension to
Mixture Techniques for Prediction and Decision Trees", Machine
Learning 36 (1999): 183--199
- Evgueni Petrov, "Constraint-based analysis of composite
solvers," cs.AI/0302036
- Philippe Rigollet, "Maximum likelihood aggregation and
misspecified generalized linear models", arxiv:0911.2919
- Yoram Singer, "Adaptive Mixtures of Probabilistic Transducers", Neural
Computation 9 (1997): 1711--1733 [PS.gz preprint]
- Eiji Takimoto and Akira Maruoka, "Top-down decision tree learning
as information based boosting," Theoretical
Computer Science 292 (2002): 447-464
- Héla Zouari, Laurent Heutte and Yves Lecourtier,
"Controlling the diversity in classifier ensembles through a measure of
agreement", Pattern
Recognition 38 (2005): 2195--2199
#
Sat, 14 Nov 2009
Assortative Social Networks and Neutral Cultural Evolution
It is a common-place observation that there are strong relationships between
cultural traits and social attributes; that different social groups accept and
transmit different bits of culture. Most attempts to explain this from within
the social sciences (emphatically including historical materialism and its variants)
argue that this is due to some causal influence of social organization on
culture. ("Social being determines consciousness" --- or, once the Hegelian
gas has been released, social life shapes
thought.) In these views, culture varies with social
position because it's an adaptation to social position, or a
reflection thereof, or an expression thereof. However, it is not clear to me
that this can only be explained by a causal linkage.
A simple model to test this would be as follows. Imagine a population where
each individual has a couple of social traits, which can take discrete values,
and a cultural trait, which can likewise take a number of discrete values.
Social traits are fixed. Now form a social
network that's
assortative, i.e., two
individuals are more likely to be directly linked the more social traits they
have in common. The cultural trait is variable over time. We start with some
initial random distribution, but then, at each point in time, randomly pick one
individual, who randomly copies one of their neighbors. Thus, culture is
completely socially-neutral, and every cultural trait is just as well adapted
as every other. My prediction is that, for reasonable-looking assortative
networks, we'll see a good degree of correlation between social and cultural
traits, just because people will be mostly learning from those close
to them socially.
A slight refinement would be to make people uniformly more likely
to adopt certain values of the cultural trait than others, independently of
their social position. Then I predict that the less-popular cultural values
will be concentrated in the smaller sub-networks.
(One could argue that this is still "social position" shaping thought,
namely one's position in the social network. But now network structure
screens-off and renders causally irrelevant
the content of that social position.)
I hasten to add that such a model would be perfectly compatible with the
pious hope that people have good reasons for their actions and
beliefs; all that's really assumed is that there is no systematic
relation between those reasons and social position. (So I'm not denying
agency, rationality, etc.)
Needless to say, this would massively complicate the interpretation of
opinion surveys. The typical practice
of regressing responses on attributes of the
responders will give you results which are weird hybrids of actual links
between social status and beliefs, and the residue of diffusion.
The day after writing this, I found Hidalgo, Claro and Marquet's "Simple
Dynamics on Complex Networks" (cond-mat/0411295). This looks
at exactly the kind of random copying dynamic I have in mind, but divides the
network into "guilds", in which all members have the same in-degree. Their
surprising (to me) result is that, in equilibrium, the distribution of states
(i.e., cultural traits) has to be same for all guilds. However, their guilds
do not, in general, correspond to socially-defined groups, so I still have some
hope my intuition is not totally and completely wrong.
Update, 21 March 2005: I should also mention (now that I've
read it) V. Sood and S. Redner, "Voter Model on Heterogeneous Graphs", cond-mat/0412599 (= PRL 94
(2005): 178701). This paper's starting point is the easily-seen fact that,
under the pure case of the copy-a-random-neighbor dynamics I'm considering (and
which is one of several very different things called "the voter model"),
everyone must come to share the same opinion. That is, the consensus states
are absorbing states. Sood and Redner try to calculate the mean time to
consensus as a function of properties of the social network. This is going to
be useful to me, but it's not quite the same thing.
While I'm updating this, I should maybe say expand on what I hinted at
above, about network structure "screening off" social status from cultural
traits. There are several ways of expressing this formally, but the one I have
in mind relies on our ability to decompose networks into "communities",
sub-networks whose members are more closely tied to one another than to
outsiders. (There are many ways of
doing that, too, but I like the
Newman-Girvan approach, not
just because Mark is a good friend whom I can persuade to share code, but also
because their algorithms make sense.) So, formally, what I'm proposing is that
the dynamics I'm considering will (1) lead to strong statistical dependence
between social position and cultural traits, but (2) social position and
cultural traits will be (nearly) independent, conditional on community
membership. (These statistical dependencies can be measured in any convenient
way, e.g. through mutual information, or perhaps chi-squared to get p-values.)
Of course, in the pure-copying case, this will be a transient effect, since
ultimately everyone will share the same opinion. One thing I'm not sure of yet
is whether it's better to just look at the transients (which Sood and Redner
indicate might be very long), or to introduce some amount of perturbation
(e.g., through copying errors) which will lead to a non-trivial statistical
equilibrium. Maybe I should just try both and see.
22 April 2005: In conversation, Eric Smith suggests that Bill Labov's work
on phonological changes in American English might have enough data to actually
test such a neutral model.
Update, 16 October 2007: It works. Two social types
(equiprobably), binary cultural trait (initially equiprobable). Nodes form
ties with probability p if they are of the same type and probability q if they
are of different types. Cultural traits change by random copying, as outlined
above. I've plotted the chi-squared statistic for the association between
social type and cultural trait as a function of time. The black line is a run
where p=0.09, q=0.01, and the assortativity coefficient of the resulting
network was r = 0.80. The grey line is a run where p=q=0.05, giving a graph
with r = 0.045.
Modesty forbids me to recommend:
- CRS, "Social Media as Windows on the Social Life of the
Mind", arxiv:0710.4911, for the
2008 AAAI symposium on "Social Information Processing"
To read:
- T. Antal, S. Redner and V. Sood, "Evolutionary Dynamics on
Degree-Heterogeneous Graphs", Physical Review Letters 96 (2006): 188104
- G. J. Baxter, R. A. Blythe and A. J. McKane, "Fixation and
Consensus Times on a Network: A Unified
Approach", Physical Review
Letters
101 (2008): 258701
- I. J. Benczik, S. Z. Benczik, B. Schmittmann, R. K. P. Zia, "Lack
of consensus in social
systems", arxiv:0709.4042
- Claudio Castellano, "Effect of network topology on the ordering
dynamics of voter models", cond-mat/0504522
- Claudio Castellano, Vittorio Loreto, Alain Barrat, Federico Cecconi
and Domenico Parisi, "Comparison of voter and Glauber oridering dynamics on
networks", Physical Review
E 71 (2005): 066107
- Damon Centola, Juan Carlos Gonzalez-Avella, Victor M. Eguiluz and
Maxi San Miguel, "Homophily, Cultural Drift and the Co-Evolution of Cultural
Groups", physics/0609213
- Noah E. Friedkin, A Structural Theory of Social
Influence [Blurb]
- Aram Galstyan and Paul Cohen, "Cascading dynamics in modular
networks", Physical Review
E 75 (2007): 036109
- Aram Galstyan, Vahe Musoyan and Paul Cohen, "Maximizing Influence
Propagation in Networks with Community Structure", arxiv:0905.1108
- Laszlo Gulyas and Elenna R. Dugundji, "Emergent Opinion Dynamics on
Endogeneous
Networks", physics/0610125
- Petter Holme and Andreas Gronlund, "Modelling the dynamics of youth
subcultures", physics/0504181
- Petter Holme and M. E. J. Newman, "Nonequilibrium phase transition in the coevolution of networks and opinions", Physical Review E 74 (2006): 056108
- Robert Huckfeldt, Paul E. Johnson and John Sprague, Political
Disagreement: The Survival of Diverse Opinions Within Communication
Networks [Sounds very cool. Blurb]
- Elihu Katz and Paul Lazarsfeld, Personal Influence: The Part
Played by People in the Flow of Mass Communications
- Jason Kaufman, "Endogenous explanation in the sociology of
culture", Annal
Review of Sociology 30 (2004): 335--357
- Marcelo N. Kuperman, "Cultural propagation on social networks",
nlin.AO/0509004 [The
Axelrod model]
- R. Lambiotte and M. Ausloos, "Coexistence of opposite opinions in a
network with communities", physics/0703266 =
Journal of
Statistical Mechanics (2007) P08026
- R. Lambiotte, M. Ausloos, and J. A. Hoyst, "Majority model on a
network with communities", Physical Review
E 75 (2007): 030101
- Emanuele Pugliese and Claudio Castellano, "Heterogeneous pair approximation for voter models on networks", arxiv:0903.5489
- Usha Nandini Raghavan, Reka Albert, Soundar Kumara, "Near linear
time algorithm to detect community structures in large-scale
networks", arxiv:0709.2938 ["every
node is initialized with a unique label and at every step each node adopts the
label that most of its neighbors currently have". I suspect using this
definition of "community" would make it a tautology that community membership
makes social position irrelevant for culture.]
- Fabio
Rojas and Tom Howe, "Contact Patterns and Aggregate Opinion Levels: Results
from a Simulation Study"
[PDF
preprint]
- Anja Sturm and Jan Swart, "Voter models with heterozygosity
selection", math.PR/0701555
- Krzysztof Suchecki, Víctor M. Eguíluz, and Maxi San
Miguel, "Voter model dynamics in complex networks: Role of dimensionality,
disorder, and degree distribution", Physical Review
E 72 (2005): 036132
- F. Vazquez, V.M. Eguiluz, M. San Miguel, "Generic absorbing transition in coevolution dynamics", arxiv:0710.4910 ["We study a coevolution voter model on a network that evolves according to the state of the nodes. In a single update, a link between opposite-state nodes is rewired with probability $p$, while with probability $1-p$ one of the nodes takes its neighbor's state." In what sense is this generic, however?]
- John R. Zaller, The Nature and Origins of Mass Opinion
[Blurb]
- Alan S. Zuckerman, Josip Dasovic, Jennifer Fitzgerald,
Partisan Families: The Social Logic of Bounded Partisanship in Germany
and Britain [blurb]
#
Social Networks
See also:
Community Discovery;
Complex Networks;
Institutions and Organizations;
Network Data Analysis;
Networks of Political
Actors;
Sociology;
Sociology of Science;
Terrorism
Recommended (very misc. and inadequate):
- R. Alberich, J. Miro-Julia and F. Rossello, "Marvel Universe looks
almost like a real social network,"
cond-mat/0202174 [The small
world of superhero comic books. Of course, in the end, we are all connected
via Death --- whoops, wrong mythos.]
- John Arquilla, David Ronfeldt, Networks and Netwars: The
Future of Terror, Crime, and Militancy [From RAND, the people who
brought you the American strategy in Indochina. But nonetheless interesting.
Online.]
- Wayne
E. Baker, Achieving Success through Social Capital: Tapping the
Hidden Resources in Your Personal and Business Networks [Don't snicker
so. Baker is actually very good on social networks, and does a nice job of
explaining the ideas here, in the service of helping people do better in their
professional lives. The first chapter, "What Is Social Capital and Why Should
You Care About It?", is available for free as
a PDF]
- Wayne E. Baker and Robert R. Faulkner, "Social Networks and Loss
of Capital", Social Networks 26 (2004): 91--111
[If you must invest in a dodgy company, be friends with the management.
PDF]
- Sumit Basu, Tanzeem Choudhury, Brian Clarkson and Alex (Sandy)
Pentland, "Learning Human Interactions with the Influence
Model", Media
Lab Vision and Modeling Technical Report 539 (June 2001) [This is an
interesting but rather special model for social influence: basically, one fits
a model of pairwise influence for each dyad, and then predicts the behavior of
a given individual by taking a weighted sum of the predictions of those models.
So one needs to learn the pairwise model parameters and the prediction weights.
Not at all obvious how to do specification testing in this framework... Thanks
to Gustavo Lacerda and Kevin Murphy for the pointer]
- Vaughan Bell, C. Maiden, A. Munoz-Solomando and V. Reddy, "'Mind control experiences' on the internet: Implications for the psychiatric diagnosis of delusions", Psychopathology 39 (2006): 87--91 [pdf;
my comments]
- Randall Collins, The Sociology of Philosophies: A Global
Theory of Intellectual Change [Many very interesting observations on how
social network structure can facilitate and shape intellectual development,
backed up by a massive, global acquaintance with the history of philosophy.
His own philosophical conclusions, e.g. about realism, seem to me however
astonishingly bad --- naive
social constructionism.]
- Thomas X. Hammes, "Countering Evolved Insurgent
Networks", Military
Review (July-August 2006): 18--26 ["Insurgency is a competition
between human networks. We must understand that salient fact before can we
develop and execute a plan to defeat the insurgents."]
- Shin-Kap Han, "Tribal regimes in academia: a comparative analysis
of market structure across disciplines", Social
Networks 25 (2003): 251--280
- Judith Kleinfeld, "Could It Be a Big World After All? What the
Milgram Papers in the Yale Archive Reveal About the Original Small World Study"
[Six degrees of separation, for the general population, is quite
unsupported empirically. Of course it works for other kinds of networks, e.g.,
people in a common profession, or participating in a common institution; but
that's different. Preprint.]
- Roger Th. A. J. Leenders, Structure and Influence:
Statistical Models for the Dynamics of Actor Attributes, Network Structure and
Their Interdependence [mini-review]
- Miller McPherson, Lynn Smith-Loving and James M. Cook, "Birds of a
Feather: Homophily in Social Networks", Annual Review of
Sociology 27 (2001): 415--444
- James Moody
and Douglas R. White,
"Social Cohesion and Embeddedness: A Hierarchical Conception of Social
Groups", American Sociological Review 68 (2003):
103--127 [PDF
preprint via Doug's website]
- John F. Padgett and Christopher K. Ansell, "Robust Action
and the Rise of the Medici, 1400--1434", American Journal of
Sociology 98 (1993): 1259--1319
[JSTOR]
- Robin Pemantle and Brian Skyrms, "Network formation by
reinforcement learning: the long and medium run", math.PR/0404106
- David A. Siegel,
"When Does Repression Work? Collective Behavior Under the Threat of Violence"
[Detailed model involving adaptive social learning, shaped by the network
structure, targeted repression, and mass media, with some applications to the
Iraqi elections at the start of 2005. One wonders if there isn't some way of
extracting analytical results, rather than just
simulations... PDF
preprint]
- Brian Skyrms and Robin Pemantle, "A Dynamic Model of Social Network
Formation", math.PR/0404101
= Proceedings of the National Academy of
Sciences 97 (2000): 9340--9346
- Troy
Tassier's work on labor markets and social networks is very cool, but I
can't recommend particular papers because he explained it to me while we were
office mates...
- Charles Tilly, Trust and Rule [Mini-review]
- S. Wasserman and K. Faust, Social Network Analysis
[blurb]
- Douglas R. White, Natasa Kejzar, Constantino
Tsallis, Doyne Farmer and Scott White, "A generative model for feedback
networks", cond-mat/0508028
= Physical Review E 73 (2006): 016119 [I find the
growth model here very interesting, because it breaks with the now-usual
"preferential attachment" mechanism. I think this model would repay very
careful attention, both dynamically (could one map this onto
preferential attachment in some meaningful way?) and statistically
(what is the limiting degree distribution, and how does it vary with
the growth parameters?).]
To read:
- A.-L. Barabasi, H. Jeong, Z. Neda, Erzsebet Ravasz, A. Schubert and
T. Vicsek, "Evolution of the social network of scientific collaborations,"
cond-mat/0104162
- M. J. Barber, A. Krueger, T. Krueger, T. Roediger-Schluga,
"The Network of European Research and Development Projects",
physics/0509119
- Vilna Francine Bashi, Survival of the Knitted: Immigrant
Social Networks in a Stratified World
[Blurb]
- N. Berger, C. Borgs, J. T. Chayes, R. M. D'Souza and R. D.
Kleinberg, "Competition-Induced Preferential Attachment", cond-mat/0402268
- Marian Boguna, Romualdo Pastor-Satorras, Albert Diaz-Guilera and
Alex Arenas, "Models of social networks based on social distance
attachment",
Physical Review
E 70 (2004): 056122
- Samuel Bowles and Herbert Gintis, "Persistent Parochialism: Trust
and Exclusion in Ethnic Networks", Journal of Economic Behavior and
Organization (2004)
[Abstract,
with link to full text]
- Ronald S. Burt, Brokerage and Closure: An Introduction to
Social Capital
- Horacio Castellini and Lilia Romanelli, "Social network from
communities of electronic
mail", nlin.CD/0509021
- Damon Centola and Michael W. Macy, "Complex Contagion and the
Weakness of Long Ties", American Journal of Sociology submitted
[PDF preprint
via Macy]
- Yen-Sheng Chiang, "Birds of Moderately Different Feathers: Bandwagon Dynamics and the Threshold Heterogeneity of Network Neighbors",
Journal of
Mathematical Sociology 31 (2006): 47--69
- David Chisholm, Coordination without Hierarchy: Informal
Structures in Multiorganizational Systems
[Blurb]
- Miriam Cooke and Bruce B. Lawrence (eds.), Muslim Networks from Hajj to Hip Hop [Blurb]
- Dora L. Costa and Matthew E. Kahn, Heroes and Cowards: The
Social Face of War [Blurb, ch. 1]
- Darren P. Croft, Richard James and Jens Krause, Exploring
Animal Social Networks [blurb, ch. 1]
- Joern Davidsen, Holger Ebel, and Stefan Bornholdt, "Emergence of a
small world from local interactions: Modeling acquaintance networks,"
cond-mat/0108302
- G. F. Davis and H. R. Greve, "Corporate elite networks and
governance changes in the 1980s", American Journal of Sociology
103 (1997): 1--37
- G. F. Davis, M. Yoo and W. E. Baker, "The small world of the
corporate elite"
- Mario Diani and Doug McAdam (eds.), Social Movements and Networks: Relational Approaches to Collective Action
- T. Di Matteo, T. Aste and M. Gallegati, "Innovation flow through
social networks: Productivity distribution", physics/0406091 [Those look an
awful lot like log-normals to me.]
- Patrick Doreian
- George C. M. A. Ehrhardt, Matteo Marsili, and Fernando
Vega-Redondo, "Emergence and resilience of social networks: a general
theoretical framework", physics/0504124
- Claude S. Fischer, To Dwell among Friends: Personal Networks in Town and City
- James Fowler,
the why-people-vote papers
- James H. Fowler and Nicholas A. Christakis, "Cooperative Behaviour
Cascades in Human Social
Networks", arxiv:0908.3497
- Linton C. Freeman, The Development of Social
Network Analysis
- Noah E. Friedkin, A Structural Theory of Social
Influence [Blurb]
- T.L. Goedeke and S. Rikoon, "Otters as Actors: Scientific
Controversy, Dynamism of Networks, and the Implications of Power in Ecological
Restoration", Social
Studies of Science 38 (2008): 111--132
- Sanjeev Goyal, Connections: An Introduction to the Economics
of Networks [Blurb, introduction]
- Mark Granovetter, Getting a Job: A Study of Contacts
and Careers
- Matthew Haag and Roger Lagunoff, "Social Norms, Local Interaction,
and Neighborhood Planning,"
ewp-game/9907004
- Robert Hobbs, Mark Lombardi: Global Networks [New
York: Independent Curators International, 2003, ISBN 0-916365-67-0. Lombardi
produced more-than-slightly paranoid network diagrams of
political-financial-intelligence malfeasance, which seem less than perfectly
reliable, but of some artistic value...]
- Petter Holme, Christofer R. Edling and Frederik Liljeros,
"Structure and time evolution of an Internet dating community", Social
Networks 26 (2004): 155-174
- Robert Huckfeldt, Paul E. Johnson and John Sprague, Political
Disagreement: The Survival of Diverse Opinions within Communication
Networks
- Eiko Ikegami, Bonds of Civility: Aesthetic Networks and the
Political Origins of Japanese Culture [This sounds very cool: "uncovers a
complex history of social life in which aesthetic images became central to
Japan's cultural identities. The people of premodern Japan built on earlier
aesthetic traditions in part for their own sake, but also to find space for
self-expression in the increasingly rigid and tightly controlled Tokugawa
political system. In so doing, they incorporated the world of the beautiful
within their social life which led to new modes of civility. They explored
horizontal and voluntary ways of associating while immersing themselves in
aesthetic group activities."]
- Matthew O. Jackson, Social and Economic
Networks [blurb, ch. 1]
- Charles Kadushin
- Elihu Katz and Paul Lazarsfeld, Personal Influence
- Michael Kenney, From Pablo to Osama: Trafficking and
Terrorist Networks, Government Bureaucracies, and Competitive Adaptation
[Blurb]
- Martin Kilduff and David Krackhardt, Interpersonal Networks in Organizations: Cognition, Personality, Dynamics, and Culture [blurb]
- Konstantin Klemm, Victor M. Eguiluz, Raul Toral and Maxi San
Miguel, "Nonequilibrium transitions in complex networks: A model of social
interaction," Physical Review E 67 (2003): 026120
- Geuorgi Kossinets and Duncan J. Watts, "Empirical Analysis of an
Evolving Social Network", Science 311
(2006): 88--90
- Pamela Walker Laird, Pull: Networking and Success since Benjamin Franklin [blurb]
- Edward O. Laumann, Stephen Ellingson, Jenna Mahay, and Anthony Paik
(eds.), The Sexual Organization of the City ["The city" being
Chicago. Blurb,
intro]
- Nan Lin, Social Capital: A Theory of Social Structure and
Action
- James R. Lincoln and Michael L. Gerlach, Japan's Network
Economy: Structure, Peristence, and Change [Blurb]
- David Lusseau, "Evidence for social role in a dolphin social
network", q-bio/PE/0607048
- David Lusseau and M. E. J. Newman, "Identifying the role that
individual animals play in their social
network", q-bio.PE/0403029
- John Levi Martin, Social Structures [blurb]
- Seth A. Marvel, Steven H. Strogatz, Jon M. Kleinberg, "The Energy Landscape of Social Balance", arxiv:0906.2893
- Cathleen McGrath and David Krackhardt, "Network Conditions for
Organizational Change", The Journal of Applied Behavioral
Science 39 (2003): 324--336
[PDF
reprint]
- P. K. McGregor (ed.), Animal Communication Networks
[blurb]
- Miller McPherson, Lynn Smith-Lovin and Matthew E. Brashears,
"Social Isolation in America: Changes in Core Discussion Networks over Two
Decades", American Sociological Review 71 (2006):
353--375
[PDF; weblog
commentary by Kieran Healy]
- M. S. Mizurchi, The American Corporate Network,
1904--1974
- Philippa Pattison, Algebraic Models for Social
Networks [Blurb]
- Sean Safford, Why the Garden Club Couldn't Save Youngstown:
The Transformation of the Rust Belt [blurb]
- Ozgur Simsek and David Jensen, "Navigating networks by
using homophily and degree", Proceedings of the National
Academy of Sciences (USA) 105 (2008): 12758--12762
[Open access]
- Camille Roth, "Measuring Generalized Preferential Attachment in
Dynamic Social Networks", nlin.AO/0507021
- Deidre A. Royster, Race and the Invisible Hand: How
White Networks Exclude Black Men from Blue-Collar Jobs
- B. Ruyu and M. N. Kuperman, "Affinity driven social networks",
nlin.AO/0703038
- Recep Senturk, Narrative Social Structure: Anatomy of the
Hadith Transmission Network, 610--1505 [blurb]
- David A. Siegel,
"The Media as Spur and Spoiler: A Theory of Multiple Influences on Collective
Behavior" [Abstract: "I present a model of interdependent collective
behavior under the influence of both local social networks and a mass media.
Individual interests are heterogeneous, and people choose whether or not to
participate in the behavior based on a comparison of subjective costs and
benefits. Costs are updated in response to the activities of both their social
neighbors and the population as a whole; people obtain information about the
latter from the media. I find that, contrary to conventional wisdom, neither
increased connectivity in local networks nor an increased role for the media
uniformly increases participation in collective behavior: in many cases both
can decrease participation rates. Social elites who are unified in their
interests can play an outsized role in determining participation, as can a
biased media. The model I develop to derive these results additionally
provides a powerful methodological tool for analyzing the impact that
qualitative network structures can have on mass outcomes.
" PDF
preprint]
- Grahame F. Thompson, Between Hierarchies and Markets:
The Logic and Limits of Network Forms of Organization
- Charles Tilly
- Identities, Boundaries, and Social Ties
- Stories, Identities, and Political Change
- Namatié Traoré, "Networks and Rapid Technological
Change: Novel Evidence from the Canadian Biotech
Industry", Industry
and Innovation 13 (2006): 41--68
- Federico Varese, "How Mafias Migrate: The Case of the`Ndrangheta
in Northern Italy", Law and Society Review 40
(2006): 411--444 [PDF reprint]
- Katherine Cramer Walsh, Talking about Politics: Informal
Groups and Social Identity in American Life [Blurb]
- Frank E. Walter, Stefano Battiston, Frank Schweitzer,
"A Model of a Trust-based Recommendation System on a Social Network",
nlin.AO/0611054
- Duncan J. Watts, Peter S. Dodds and Mark E. J. Newman, "Identity
and Search in Social Networks,"
cond-mat/0205383 =
Science 296 (2002): 1302--1305
- Harrison White
- Hal Whitehead, Analyzing Animal Societies: Quantitative Methods for Vertebrate Social Analysis [From the blurb and table of contents, it's not obvious to me why he has the adjective "vertebrate"...]
- David Wilkinson, "Civilizations as Networks: Trade, War, Diplomacy,
and Command-Control", Complexity 8 (2002):
82--86
- H. Peyton Young, "The diffusion of innovations in social
networks", in L. E. Blume and S. N. Durlauf (eds.), The Economy as
an Evolving Complex System III (2003)
- Li Zhang, Strangers in the City: Reconfigurations of Space, Power, and Social Networks Within China's Floating Population
- W.-X. Zhou, D. Sornette, R. A. Hill and R. I. M. Dunbar, "Discrete
Hierarchical Organization of Social Group Sizes", cond-mat/0403299
#
Sociology of Science
Raymond Aron says somewhere that "science is inseparable from the republic
of scholars." This is substantially true, though I can imagine odd exceptions.
(R. Crusoe, FRS, could have done astronomy or botany or algebra before meeting
Friday, though I don't think he could have invented them.) In any event,
science is an activity which groups do vastly better, and easier, than isolated
individuals. In saying this, I trust I shan't have to defend myself against
suspicion
of social-constructionist
heresy. The practical recognition of this truth goes back to the founders of
the first academies during the scientific
revolution, and it was explicitly recognized in
the Enlightenment, for instance in
d'Alembert's "Preliminary Discourse" to the Encyclopedie. An
investigation into science which doesn't recognize, and account for, its social
nature is on all fours with one which doesn't recognize, and account for, the
fact that it produces reliable knowledge, which is to say much like an
investigation of agriculture which doesn't realize it produces food. These
should be "every schoolchild knows" truths, though sadly they're anything but.
Every schoolchild also knows that differences in social organization don't
completely explain why statistical mechanics is
fruitful, but UFOlogy is not — that matter really
is made out of molecules, and people really aren't abducted by aliens, has, to
say the least, something to do with it. But the sciences started from beliefs
about as wacko as anything today's kooks can produce —
say, alchemy — but haven't stayed there,
whereas the kooks have, and this deserves explanation. More: a proper
understanding of this could help
improve scientific method, something
eagerly to be desired.
Of course there are already lots of people engaged in this undertaking;
sociology of science is, in general, more sensible than most scientists
suppose. (Also more sensible than most of the rest
of sociology, but that's another story for another
time.) Even the noise in the management
literature recently about "learning organizations" and the like is not
unrelated, and might even be promising. (On the one hand, lots of problems get
cracked once people see that lots of money could be made from the solution. On
the other hand, we are talking about the management witch-doctors.)
There are, however, two potentially fruitful lines of research which nobody, so
far as I know, has bothered to undertake. One is straightforward comparative
sociology, contrasting genuine intellectual disciplines (including, besides the
natural sciences, things like history or philology) with the half-disciplines,
the pseudosciences, and the simple crackpots.
The other is to take some of the descriptions of how scientists act and
interact with each other from the existing sociological literature, throw them
on the computer, and see if they produce something which looks like the science
we know; also if they produce the results their authors claim they do. (My
suspicion is that most of them will not.)
See also:
Collective Cognition;
Evolutionary Epistemology;
History of Science;
Science;
Scientific Method
Recommended:
- Arthur Donovan, Larry Laudan and Rachel Laudan (eds.),
Scrutinizing Science: Empirical Studies of Scientific Change
- Ronald N. Giere, Explaining Science: A Cognitive
Approach
- Philip Kitcher, The Advancement of Science: Science without
Legend, Objectivity without Illusions [Formal, and at least
semi-plausible if abstract, modeling of just how messy, "sullied"
social groups, e.g. real scientific disciplines, can achieve genuine cognitive
progress — can even be more progressive than less sullied ones. Not so
well-written as Toulmin, but probably closer to his ideas than either of them
would like to admit.]
- Larry Laudan, Progress and Its Problems
- Robert K. Merton, Sociology of Science
- Mark Newman
[Many fine papers on co-authorship networks, produced more rapidly than I feel
like updating this notebook]
- Nienke Oomes, "Market Failures in the Economy of Science" [Chapter
in Nienke's dissertation, hopefully appearing soon as a paper]
- Derek J. de Solla Price, Little Science, Big Science
- Mark Risjord, "Scientific Change as Political Action: Franz Boas
and the Anthropology of
Race", Philosophy of
the Social Sciences 37 (2007): 24--45 [This is an
interesting case-study of how some intellectual work can be at once properly
scientific and carry ethical and political implications. However, I
think that Risjord is actually not very astutte about the philosophy here. (I
realize it takes some gall for me to say this.) On the one hand, what made
Boas's work compelling was that it appealed to purely cognitive
considerations, and did so validly. The motives which may or may
not have impelled Boas to undertake this work were simply irrelevant. For that
matter, it does not compel anyone to take up any position on the
ethical worth of human beings. Someone who had, as a fundamental part of their
system of values, a belief that the races have an intrinsic order of merit
could, with perfect consistency, accept all of Boas's arguments. I think any
such person would be crazy, but sputtering incredulity is not a
logical argument.]
- Camille Roth and Paul Bourgine [Commented on
elsewhere]
- Stephen Toulmin, Human Understanding, vol. 1:
The Collective Use and Evolution of Concepts [I think he's wrong
about some of the strictly philosophical implications of his approach, and
about formalization, but this is the best all-around consideration of how
science — and other proper intellectual disciplines, for that matter —
functions as a collective, social enterprise that I've ever seen; and this was
published in 1972. Volume 2, incidentally, was supposed to address individual
judgment, but I don't think it was ever written]
- Susan Trawek, Beam-times and Life-Times [Ethnographic
study of the "natives" at high-energy
accelerator labs. Remarkably for any ethnography, the natives don't, by
and large, think the depiction demeaning or bone-headed.]
- John Ziman, Real Science: What It Is, and What It
Means [An extremely sound synthesis by a good theoretical physicist
turned eminent science-studier]
Recommended, if somewhat tangential:
- Gross and Levitt, Higher Superstition [What to avoid]
- Richard F. Hamilton, The Social Misconstruction of Reality:
Validity and Verification in the Scholarly Community [How scholars in
the humanities and social sciences manage to repeat and elaborate on sheer
myths for generations; more of a social-psychology approach than a strictly
sociological one.]
- Noretta Koertege (ed.), A House Built on Sand: Flaws in the
Cultural Studies Account of Science [See especially Kitcher's apologia
for well-done sociological studies of science]
- Cass R. Sunstein, "Academic Fads and Fashions (with Special
Reference to Law)" [More social psychology of scholarship. Online]
To read:
- Albert-Laszlo Barabasi, H. Jeong, Zoltan Neda, Erzsebet Ravasz,
A. Schubert and Tamas Vicsek, "Evolution of the social network of scientific
collaborations," cond-mat/0104162
- M. J. Barber, A. Krueger, T. Krueger, T. Roediger-Schluga,
"The Network of European Research and Development Projects",
physics/0509119
- Elazar Barkan, The Retreat of Scientific Racism: Changing
Concepts of Race in Britain and the United States between the World Wars
[blurb]
- Stephen R. Barley and Julian Orr (eds.), Between Craft and
Science: Technical Work in the United States
- Mario Biagioli and Peter Galison (eds.), Scientific
Authorship: Credit and Intellectual Property in Science
- Geoffrey C. Bowker, Memory Practices in the Sciences
[Blurb]
- James Robert Brown, Who Rules in Science: An Opinionated
Guide to the Wars
- Carlos Cotta and Juan J. Merelo, "The Complex Network of
Evolutionary Computation Authors: an Initial Study", physics/0507196
- Diana Crane, Invisible Colleges: Diffusion of Knowledge in
Scientific Communities
- H.-D. Daniel, Guardians of Science: Fairness and Reliability
of Peer Review
- Bruce Edmonds, "Artificial science — a simulation test-bed for
studying the social processes of science", cogprints/4263
- Henry Etzkowitz, Carol Kemelgor and Brian Uzzi, Athena
Bound: The Advancement of Women in Science and Technology
- James A. Evans, "Electronic Publication and the Narrowing of
Science and Scholarship", Science
321 (2008): 395--399
- Ying Fan, Menghui Li, Jiawei Chen, Liang Gao, Zengru Di and Jinshan
Wu, "Network of Econophysicists: a weighted network to investigate the
development of Econophysics", cond-mat/0401054
- Trevor Fenner, Mark Levene and George Loizou, "A Model for
Collaboration Networks Giving Rise to a Power Law Distribution with an
Exponential Cutoff", physics/0503184
- Marion Fourcade, Economists and Societies: Discipline and
Profession in the United States, Britain, and France, 1890s to 1990s
[Blurb. Let us leave
to one side the question of whether economics
really qualifies as a science.]
- N. Gilbert, A Simulation of
the Structure of Academic Science
- C. Lee Giles and Isaac G. Councill, "Who gets acknowledged:
Measuring scientific contributions through automatic acknowledgment indexing",
Proceedings of the
National Academy of Sciences (USA) 101 (2004):
17599--17604
- T.L. Goedeke and S. Rikoon, "Otters as Actors: Scientific
Controversy, Dynamism of Networks, and the Implications of Power in Ecological
Restoration", Social
Studies of Science 38 (2008): 111--132
- Michel L. Goldstein, Steven A. Morris and Gary G. Yen, "Group-based
Yule model for bipartite author-paper networks", Physical Review
E 71 (2005): 026108
- Victor V. Kryssanov, Evgeny L. Kuleshov, Frank J. Rinaldo, and
Hitoshi Ogawa, "We cite as we communicate: A communication model for the
citation
process", cs.DL/0703115
- Timothy Lenoir, Instituting Science: The Cultural Production
of Scientific Disciplines
- Menghui Li, Jinshan Wu, Dahui Wang, Tao Zhou, Zengru Di and Ying
Fan, "Evolving Model of Weighted Networks Inspired by Scientific Collaboration
Networks", cond-mat/0501655
[Only "qualtitively consistent behavior with the empirical results" is claimed;
I should read it to see if that's because they haven't checked quantatitively,
or if it fails when it comes to actual numbers.]
- P. D. Magnus, "Distributed Cognition and the Task of Science",
Social Studies of
Science 37 (2007): 297-310
- Matti Peltomaki and Mikko Alava, "Correlations in Bipartite
Collaboration Networks", physics/0508027
- Michael Polanyi, "The republic of science: its political and
economic theory", Minerva 1 (1962): 54--73
- Anne E. Preston, Leaving Science: Occupational Exit from
Scientific Careers
- José J. Ramasco, S. N. Dorogovtsev and Romualdo
Pastor-Satorras, "Self-organization of collaboration networks", Physical Review
E 70 (036106) [It's not clear to me from the
abstract just what they mean by "self-organization", but of course it piques my interest]
- Jose J. Ramasco and Steven A. Morris, "Social inertia in
collaboration
networks", physics/0509247
- Jerome R. Ravetz, Scientific Knowledge and Its Social
Problems
- Camille Roth
- Nancy Rothwell, Who Wants to be a Scientist?: Choosing Science As a Career [Blurb]
- James D. Savage, Funding Science in America: Congress, Universities, and the Politics of the Academic Pork Barrel
- Mikhail V. Simkin and V. P. Roychowdhury
- "Read before you cite!"
cond-mat/0212043 [I can't
resist quoting the whole abstract. "We report a method of estimating what
percentage of people who cited a paper had actually read it. The method is
based on a stochastic modeling of the citation process that explains empirical
studies of misprint distributions in citations (which we show follows a Zipf
law). Our estimate is only about 20% of citers read the original."]
- "Copied citations create renowned papers?"
cond-mat/0305150 ["Recently
we discovered (cond-mat/0212043) that the majority of scientific citations are
copied from the lists of references used in other papers. Here we show that a
model, in which a scientist picks three random papers, cites them,and also
copies a quarter of their references accounts quantitatively for empirically
observed citation distribution. Simple mathematical probability, not genius,
can explain why some papers are cited a lot more than the other."]
- "A mathematical theory of citing", physics/0504094 ["Here we
propose a modified model: when a scientist writes a manuscript, he picks up
several random recent papers, cites them and also copies some of their
references. The difference with the original model is the word recent. We solve
the model using methods of the theory of branching processes, and find that it
can explain [certain] features of citation distribution, which our original
model couldn't account for. The model can also explain 'sleeping beauties in
science', i.e., papers that are little cited for a decade or so, and later
'awake' and get a lot of citations. Although much can be understood from purely
random models, we find that to obtain a good quantitative agreement with
empirical citation data one must introduce Darwinian fitness parameter for the
papers."]
- Miriam Solomon, Social Empiricism
- Kent W. Staley, Evidence for the Top Quark: Objectivity and
Bias in Collaborative Experimentation
- Carol Tenopir and Donald W. King, Communication Patterns of
Engineers
- Gordon Tullock, The Organization of Inquiry
- Alexei Vazquez, "Statistics of citation networks," cond-mat/0105031
- Walter G. Vincentin, What Engineers Know and How They Know
It: Analytical Studies from Aeronautical History.
- Matthew L. Wallace, Yves Gingras, Russell Duhon, "A new approach for detecting scientific specialties from raw cocitation networks", arxiv:0807.4903
- Etienne Wenger, Communities of Practice [Related but
not identical subject]
- K. Brad Wray, "The Epistemic Significance of Collaborative
Research", Philosophy of Science 69 (2002):
150--168
- John H. Zammito, A Nice Derangement of Epistemes:
Post-Positivism in the Study of Science from Quine to Latour
- Jesús P. Zamora Bonilla, "Scientific Inference and the
Pursuit of Fame: A Contractarian Approach", Philosophy of Science
69 (2002): 300--323
- John Ziman
- An Introduction to Science Studies: The Philosophical
and Social Aspects of Science and Technology [Blurb]
- Of One Mind: The Collectivization of Science
- Prometheus Bound: Science in a Dynamic Steady
State [Blurb]
- Public Knowledge: An Essay Concerning the Social
Dimension of Science
- Reliable Knowledge: An Exploration of the Grounds for
Belief in Science
#
Community Discovery Methods for Complex Networks
Given: a network, especially a large one, directed or not, weighted
or not. Desired: a sensible decomposition of the graph into
sub-graphs, where in some reasonable sense the nodes in each sub-graph have
more to do with each other than with outsiders, i.e., form communities.
This is also called "module detection".
This seems like a really useful idea to apply to problems I'm interested in,
in neural synchronization; also a place where
there could stand to be more interchange
between statistics
and complex-network-wallahs.
Some of the methods in this area remind me of stuff
Christopher Alexander did in his 1964
book Notes on the Synthesis of Form, but it's been a long time
since I read that, so my memory may be faulty.
See also:
Ecology;
Neuroscience;
Signal Transduction, Gene Regulation
and Control of Metabolism;
Social Networks;
Sociology of Science;
Statistical Mechanics;
Synchronization
Recommended:
- Aaron Clauset, "Finding local community structure in networks", physics/0503036 =
Physical Review
E 72 (2005): 026132 [Clever; but then, Aaron is
clever.]
- Aaron Clauset, M. E. J. Newman and Cristopher Moore, "Finding
Community Structure in Very Large Networks", cond-mat/0408187
= Physical Review E 70 (2004): 066111
- J.-J. Daudin, F. Picard and S. Robin, "A Mixture Model for Random
Graphs", Statistics
and Computing 18 (2008): 173--183
- Michelle Girvan and M. E. J. Newman, "Community structure in
social and biological networks,"
cond-mat/0112110
= Proceedings of the National Academy of Sciences
(USA) 99 (2002): 7821--7826
- Roger Guimera, Marta Sales-Pardo and Luis A. N. Amaral, "Modularity
from Fluctuations in Random Graphs", cond-mat/0403660
= Physical Review E 70 (2004): 025101
- Jake M. Hofman, Chris H. Wiggins, "A Bayesian Approach to Network
Modularity", arxiv:0709.3512
[For "Bayesian", read "smoothed maximum likelihood". But nonetheless: cool.]
- Andrea Lancichinetti, Santo Fortunato, Janos Kertesz, "Detecting
the overlapping and hierarchical community structure of complex networks",
arxiv:0802.1218 [An interesting
approach, but not quite as novel as they claim --- cf. Reichardt and
Bornholdt --- and I'd really like to see more evidence of superior accuracy
and/or robustness]
- E. A. Leicht, M. E. J. Newman, "Community structure in directed
networks", arxiv:0709.4500
- M. E. J. Newman
- M. E. J. Newman and Michelle Girvan
- "Mixing patterns and community
structure in networks", cond-mat/0210146
- "Finding and evaluating community structure in
networks", Physical Review E 69 (2003): 026113
= cond-mat/0308217
- Jörg Reichardt and Stefan Bornholdt [Code is available
by e-mail from Reichardt, who was very helpful to me when I needed to
implement their algorithm.]
- Jörg Reichardt and Douglas R. White, "Role models for complex
networks", arxiv:0708.0958
[Discussion]
- M. Sales-Pardo, R. Guimera, A. Moreira, L. Amaral, "Extracting the
hierarchical organization of complex
systems", arxiv:0705.1679
To read:
- Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and
Eric P. Xing, "Mixed membership stochastic blockmodels", arxiv:0705.4485
- Nelson Augusto Alves, "Unveiling community structures in weighted
networks", physics/0703087
- Leonardo Angelini, Stefano Boccaletti, Daniele Marinazzo, Mario
Pellicoro, and Sebastiano Stramaglia, "Fast identification of network modules
by optimization of ratio association", cond-mat/0610182
- L. Angelini, D. Marinazzo, M. Pellicoro and S. Stramaglia, "Natural
clustering: the modularity
approach", cond-mat/0607643
- A. Arenas, J. Duch, A. Fernandez, S. Gomez,
"Size reduction of complex networks preserving modularity",
physics/0702015
[Do you really need all those links? Wouldn't your life be simpler if
you could just ignore some of them?]
- Alex Arenas, Alberto Fernandez, Sergio Gomez, "Multiple resolution
of the modular structure of complex
networks", physics/0703218
- Alex Arenas, Alberto Fernandez, Santo Fortunato, Sergio Gomez,
"Motif-based communities in complex
networks", arxiv:0710.0059
- Jim Bagrow and Erik Bollt, "A Local Method for Detecting
Communities", cond-mat/0412482
- James Bagrow, Erik Bollt, Luciano da F. Costa, "Network Structure
Revealed by Short
Cycles", cond-mat/0612502
- S. Boccaletti, M. Ivanchenko, V. Latora, A. Pluchino and
A. Rapisarda, "Dynamical clustering methods to find community
structures", physics/0607179
- Michael James Bommarito II, Daniel Martin Katz, Jon Zelner, "On the
Stability of Community Detection Algorithms on Longitudinal Citation
Data", arxiv:0908.0449
- U. Brandes, D. Delling, M. Gaertler, R. Goerke, M. Hoefer,
Z. Nikoloski, and D. Wagner, "Maximizing Modularity is
hard", physics/0608255
[i.e., maximizing Newman's Q is NP hard. I haven't read beyond the
abstract yet, so I don't know if they address the question of what makes it
hard in the hard cases, and whether those are properties we should expect to
see in real-world networks. Conceivably, actual social networks are, on
average, easy to modularize...]
- Andrea Capocci, Vito D. P. Servedio, Guido Caldarelli, Francesca
Colaiori, "Detecting communities in large networks", cond-mat/0402499
- Horacio Castellini and Lilia Romanelli, "Social network from
communities of electronic
mail", nlin.CD/0509021
- Leon Danon, Albert Díaz-Guilera, and Alex Arenas, "The
effect of size heterogeneity on community identification in complex
networks", Journal of
Statistical Mechanics: Theory and Experiment (2006): P11010
= physics/0601144
- Leon Danon, Albert Díaz-Guilera, Jordi Duch and Alex Arenas,
"Comparing community structure
identification", Journal of
Statistical Mechanics: Theory and Experiment (2005): P09008
= cond-mat/0505245
- Jordi Duch and Alex Arenas, "Community detection in complex
networks using extremal
optimization", Physical Review
E 72 (2005): 027104
- Illes J. Farkas, Daniel Abel, Gergely Palla, Tamas Vicsek,
"Weighted network
modules", cond-mat/0703706
- S. Feldt, J. Waddell, V. L. Hetrick, J. D. Berke, and M. Zochowski,
"Functional clustering algorithm for the analysis of dynamic network
data", Physical
Review E
79 (2009): 056104
- Daniel J. Fenn, Mason A. Porter, Mark McDonald, Stacy Williams, Neil F. Johnson, Nick S. Jones, "Dynamic communities in multichannel data: An application to the foreign exchange market during the 2007--2008 credit crisis", arxiv:0811.3988
- Daniel J. Fenn, Mason A. Porter, Peter J. Mucha, Mark McDonald, Stacy Williams, Neil F. Johnson, Nick S. Jones, "Dynamical Clustering of Exchange Rates", arxiv:0905.4912
- Sam Field, Kenneth A. Frank, Kathryn Schiller, Catherine
Riegle-Crumb and Chandra Muller, "Identifying positions from affiliation
networks: Preserving the duality of people and
events", Social
Networks
28 (2006): 97--123
- G. W. Flake, S. R. Lawrence, C. L. Giles and F. M. Coetzee,
"Self-organization and identification of Web communities", IEEE
Computer 36 (2002): 66--71
- Santo Fortunato, "Community detection in
graphs", arxiv:0906.0612
- Santo Fortunato and Marc Bathélemy, "Resolution limit in
community
detection", physics/0607100
= cite>Proceedings of the
National Academy of Sciences (USA) 104 (2007):
36--41
- Santo Fortunato and Claudio Castellano, "Community Structure in Graphs", arxiv:0712.2716 [Review
paper; thanks to Ed Vielmetti for the pointer]
- Santo Fortunato, Vito Latora and Massimo Marchiori, "A Method to
Find Community Structures Based on Information Centrality", cond-mat/0402522
- Kenneth A. Frank, "Identifying Cohesive Subgroups",
Social Networks 17 (1995): 27--56
- David Gfeller, Jean-Cédric Chappelier, and Paolo De Los Rios,
"Finding instabilities in the community structure of complex networks",
Physical Review
E 72 (2005): 056135
- Rumi Ghosh, Kristina Lerman, "Structure of Heterogeneous Networks",
arxiv:0906.2212
- V. Gol'dshtein and G. A. Koganov, "An indicator for community
structure", physics/0607159
- Benjamin H. Good, Yves-Alexandre de Montjoye, Aaron Clauset, "The performance of modularity maximization in practical contexts", arxiv:0910.0165
- Mark S. Handcock, Adrian E. Raftery and Jeremy Tantrum, "Model-Based Clustering for Social Networks" Journal of the Royal Statistical Society A 170 (2007): 301--354
[PDF preprint]
- M. B. Hastings, "Community detection as an inference problem",
Physical Review
E 74 (2006): 035102
= cond-mat/0604429
- Erik Holmström, Nicolas Bock and Joan Brännlund, "Density
Analysis of Network Community Divisions", cond-mat/0608612
- I. Ispolatov, I. Mazo, A. Yuryev, "Finding mesoscopic communities
in sparse
networks", q-bio.MN/0512038
= Journal of Statistical Mechanics (2006): P09014
- Brian Karrer, Elizaveta Levina, M. E. J. Newman, "Robustness of
community structure in
networks", arxiv:0709.2108
- Jussi M. Kumpula, Jari Saramaki, Kimmo Kaski, and Janos Kertesz,
"Resolution limit in complex network community detection with Potts model
approach",cond-mat/0610370
- Andrea Lancichinetti, Santo Fortunato, "Benchmarks for testing
community detection algorithms on directed and weighted graphs with overlapping
communities", arxiv:0904.3940
- Pierre Latouche, Etienne Birmelé, Christophe Ambroise, "Overlapping Stochastic Block Models", arxiv:0910.2098
- Sune Lehmann, Martin Schwartz, Lars Kai Hansen, "Bi-clique
Communities", arxiv:0710.4867
- Michele Leone, Sumedha, Martin Weigt, "Clustering by
soft-constraint affinity propagation: Applications to gene-expression
data", arxiv:0705.2646
- Jure Leskovec, Kevin J. Lang, Anirban Dasgupta and Michael
W. Mahoney, "Community Structure in Large Networks: Natural Cluster Sizes and
the Absence of Large Well-Defined
Clusters", arxiv:0810.1355
- Ian X.Y. Leung, Pan Hui, Pietro Lio', Jon Crowcroft, "Towards Real
Time Community Detection in Large
Networks", arxiv:0808.2633
- Claire P. Massen, Jonathan P. K. Doye, "Thermodynamics of Community
Structure", cond-mat/0610077
- A. D. Medus and C. O. Dorso, "Alternative approach to community
detection in networks", Physical Review E 79 (2009): 066111
- Peter J. Mucha, Thomas Richardson, Kevin Macon, Mason A. Porter,
Jukka-Pekka Onnela, "Community Structure in Time-Dependent, Multiscale, and
Multiplex
Networks", arxiv:0911.1824
- Stefanie Muff, Francesco Rao, and Amedeo Caflisch, "Local
modularity measure for network clusterizations", Physical Review
E 72 (2005): 056107
- Andreas Noack, "Modularity clustering is force-directed
layout", arxiv:0807.4052
- Gergely Palla, Imre Derenyi, Illes Farkas and Tamas Vicsek,
"Uncovering the overlapping community structure of complex networks in nature
and society", Nature 435
(2005): 814--818 = physics/0506133
- Gergely Palla, Illes J. Farkas, Peter Pollner, Imre Derenyi, Tamas
Vicsek, "Directed network
modules", physics/0703248
- Nicolas Pissard and Houssem Assadi, "Detecting overlapping
communities in linear time with P&A
algorithm", physics/0509254
- Pascal Pons, "Post-Processing Hierarchical Community Structures:
Quality Improvements and Multi-scale
View", cs.DS/0608050
- Mason A. Porter, Jukka-Pekka Onnela, Peter J. Mucha, "Communities
in Networks", arxiv:0902.3788
- Josep M. Pujol, Javier Béjar, and Jordi Delgado, "Clustering
algorithm for determining community structure in large networks", Physical Review
E 74 (2006): 016107
- Francisco A. Rodrigues, Gonzalo Travieso, Luciano da F. Costa,
"Fast Community Identification by Hierarchical
Growth", physics/0602144
- Huaijun Qiu and Edwin R. Hancock, "Graph matching and clustering
using spectral
partitions", Pattern
Recognition 39 (2006): 22--34 [In this context, for
the ideas on hierarchical decomposition, which sounds like it might work
for community discovery, if in fact it's not equivalent to some existing
community-discovery algorithm.]
- Usha Nandini Raghavan, Reka Albert, Soundar Kumara, "Near linear
time algorithm to detect community structures in large-scale
networks", arxiv:0709.2938 ["every
node is initialized with a unique label and at every step each node adopts the
label that most of its neighbors currently have"]
- Jörg Reichardt and Stefan Bornholdt, "When are networks truly
modular?", cond-mat/0606220
- Jörg Reichardt and Michele Leone, "(Un)detectable cluster
structure in sparse
networks", arxiv:0711.1452
- Martin Rosvall and Carl T. Bergstrom
- Erin N. Sawardecker, Marta Sales-Pardo, Luís A. Nunes Amaral, "Detection of node group membership in networks with group overlap", arxiv:0812.1243
- Chayant Tantipathananandh, Tanya Berger-Wolf and David Kempe,
"A Framework For Community Identification in Dynamic Social Networks" [PDF]
- Joshua R. Tyler, Dennis M. Wilkinson and Bernardo A. Huberman,
"Email as Spectroscopy: Automated Discovery of Community Structure within
Organizations," cond-mat/0303264
- I. Vragovic and E. Louis, "Network community structure and loop
coefficient method", Physical
Review E 74 (2006): 016105
- Matthew L. Wallace, Yves Gingras, Russell Duhon, "A new approach for detecting scientific specialties from raw cocitation networks", arxiv:0807.4903
- Huijie Yang, Wenxu Wang, Tao Zhou, Binghong ang and Fangcui Zhao,
"Reconstruct the Hierarchical Structure in a Complex Network", physics/0508026 ["Based upon
the eigenvector centrality (EC) measure, a method is proposed to reconstruct
the hierarchical structure of a complex network. It is tested on the Santa Fe
Institute collaboration network, whose structure is well known."]
- Hugo Zanghi, Franck Picard, Vincent Miele, Christophe Ambroise, "Strategies for Online Inference of Model-Based Clustering in large Networks",
arxiv:0910.2034
- Haijun Zhou
- "Distance, dissimilarity index, and network community
structure," physics/0302032
- "Network Landscape from a Brownian Particle's
Perspective," physics/0302030
- Etay Ziv, Manuel Middendorf and Chris Wiggins, "An
Information-Theoretic Approach to Network Modularity", q-bio.QM/0411033
To finish writing:
- "Functional Community Discovery II"
#
The United States Congress, How It Works and For Whom
I'm starting to do research on this, Heaven help me, as an exercise
in network analysis.
See also
Campaign Finance;
Democracy;
Political Elites;
Networks of Political
Actors;
Political Decision Making
Recommended:
- James Fowler
- "Legislative Cosponsorship Networks in the U.S. House and
Senate," Social Networks forthcoming [Short version of the
conference paper "Who is the Best Connected
Congressperson?" PDF]
- "Who is the Best Connected Congressperson? A Study of
Legislative Cosponsorship Networks"
[Long version of the journal paper. PDF]
- Jacob Hacker and Paul Pierson, Off Center
- Kenneth T. Poole, "Recent Developments in Analytical Models
of Voting in the U.S. Congress", Legislative Studies
Quarterly 13 (1988): 117--133
- Mason A. Porter, Peter J. Mucha, M. E. J. Newman and Casey M.
Warmbrand, "A network analysis of committees in the U.S. House of
Representatives", Proceedings of the National Academy of Sciences
(USA) 102 (2005): 7057--7062
[PDF
reprint via Mark]
- Frank J. Sorauf, Inside Campaign Finance: Myths and
Realities [Good, but I found it vexing to read this 1992-vintage book in
2006 — I kept wanting to know what he thought about the last 14 years!]
To read:
- Brad Alexander, "Good Money and Bad Money: Do Funding Sources
Affect Electoral Outcomes?", Political Research Quarterly 58 (2005): 353--358
- E. Scott Adler, Why Congressional Reforms Fail: Reelection
and the House Committee System [Blurb]
- E. Scott Adler and John S. Lapinski (eds.), Macropolitics of Congress [Blurb]
- R. Douglas Arnold
- Congress and the Bureaucracy
- Congress, the Press, and Political
Accountability
[Blurb, ch. 1]
- The Logic of Congressional Action
- Joseph M. Bessette, The Mild Voice of Reason: Deliberative
Democracy and American National Government [Blurb. I
must confess that, like many Americans, my inclination is to scoff at the idea
of Congress as a deliberative body. But Bessette supposedly has empirical
evidence, which trumps cynicism. On the other hand, this was published in
1994, when I was, well, embarrassingly young; things may have been different
then.]
- Thomas L. Brunell, "The Relationship Between Political Parties
and Interest Groups: Explaining Patterns of PAC Contributions to Candidates
for Congress", Political Research Quarterly 58
(2005): 681--688
- Barry C. Burden, Gregory A. Caldeira and Tim Groseclose, "Measuring
Ideologies of U.S. Senators: The Song Remains the Same", Legislative
Studies Quarterly 25 (2000): 237--258
- Gary Cox and Matthew McCubbins, Legislative LeviathanMichael X. Delli Carpini, What Americans Know About Politics
and Why It Matters
- Elizabeth Drew, The Corruption of American Politics
- Amihai Glazer, "Predicting Committe Action" [Abstract:
"Success of a policy often requires both that a good policy be adopted, and
that the public or firms correctly anticipate what policy government will
adopt. This paper models a relation between committee size and the
effectiveness of policy, with a focus on how the accuracy of the public's
expectations varies with the size of the governmental committee setting
policy. The paper also argues that the demand for access by special interest
groups may arise not from a desire to influence policy, but from a desire to
learn about government's likely actions." PDF
preprint
via RePEc,
via Bill
Tozier.]
- Tim Groseclose, "An Examination of the Market for Favors and Votes
in Congress", Economic Inquiry 34 (1996):
320--340
- Richard L. Hall, Participation in Congress
- John R. Hibbing and Elizabeth Theiss-Morse, Congress as
Public Enemy: Public Attitudes Toward American Political Institutions
- Kristin Kanthak, "Top-Down Divergence: The Effect of
Party-Determined Power on Candidate Ideological Placement", journal
Of Theoretical Politics 14 (2002): 301--323
- Roderick Kiewiet and Matthew McCubbins,
- John W. Kingdon, "Models of Legislative Voting",
Journal of Politics 39 (1977): 563--595
[JSTOR]
- Keith Krehbiel, Information and Legislative
Organization [blurb]
- Steven D. Levitt, "How Do Senators Vote? Disentangling the
Role of Voter Preferences, Party Affiliation, and Senator Ideology",
The American Economic Review 86 (1996): 425--441
- David R. Mayhew, Congress: The Electoral Connection
- Gary Mucciaroni and Paul J. Quirk, Deliberative Choices:
Debating Public Policy in Congress [Blurb]
- Anthony Nownes, Total Lobbying: What Lobbyists Want (and How
They Try to Get It)
[blurb]
- Nelson W. Polsby, How Congress Evolves: The Social Bases
of Institutional Change
- Kenneth W. Shotts, "Does Racial Redistricting Cause Conservative
Policy Outcomes? Policy Preferences of Southern Representatives in the 1980s
and 1990s", The Journal of Politics 65 (2003):
216--226
- James M. Snyder, Jr. and Tim Groseclose, "Estimating Party
Influence in Congressional Roll-Call Voting", American Journal of
Political Sceince 44 (2000): 193--211
[JSTOR
link]
- Michele L. Swers, The Difference Women Make: The Policy
Impact of Women in Congress [Blurb]
- Andrew Scott Waugh, Liuyi Pei, James H. Fowler, Peter J. Mucha,
Mason Alexander Porter, "Party Polarization in Congress: A Social Networks Approach", SSRN/1437055
- Julian E. Zelizer, On Capital Hill: The Struggle to Reform
Congress and its Consequences, 1948--2000
[Blurb]
#
Networks of Political Actors
One of the things I'm interested in is understanding how network forms of
organization emerge among political actors, how they affect
decision-making, and how they
interact with other social networks
and institutions. I have
a ridiculously over-ambitious
research project, about networks of cronyism, that I'd like to do, but in
the meanwhile I;m settling for small steps. Presumably, like other social
networks, they serve as platforms for information exchange, deliberation, and
other forms of collective cognition.
Formal political organizations can also serve these functions, but it seems
easier to make organizations democratically
accountable than it is networks --- is this a problem? How does their
structure compare to that of
other networks?
Recommended:
- James Fowler
- "Legislative Cosponsorship Networks in the U.S. House and
Senate," Social Networks forthcoming [Short version of the
conference paper "Who is the Best Connected
Congressperson?" PDF]
- "Who is the Best Connected Congressperson? A Study of
Legislative Cosponsorship Networks"
[Long version of the journal paper. PDF]
- Mason A. Porter, Peter J. Mucha, M. E. J. Newman and Casey M.
Warmbrand, "A network analysis of committees in the U.S. House of
Representatives", Proceedings of the National Academy of Sciences
(USA) 102 (2005): 7057--7062
[PDF
reprint via Mark]
To read:
- Mariam Abou Zahab and Olivier Roy, Islamist Networks: The Afghan-Pakistan Connection
- Daniel
P. Carpenter, The Forging of Bureaucratic Autonomy: Reputations,
Networks, and Policy Innovation in Executive Agencies, 1862--1928
[Blurb]
- Gerald
M. Easter, Reconstructing the State: Personal Networks and Elite
Identity in Soviet Russia
[Review from H-Russia]
- Maarten A. Hajer and Hendrik Wagenaar (eds.), Deliberative
Policy Analysis: Understanding Governance in the Network Society
[blurb]
- Michael T. Heaney
- Identity, Coalitions, and Influence: The Politics of
Interest Group Networks in Health Policy
[PDF,
2.8Mb]
- "Issue Networks, Information, and Interest Group
Alliances: The Case of Wisconsin Welfare Politics, 1993--99",
State Politics and Policy Quarterly, 4:3 (Fall
2004): 237--270 [PDF reprint]
- Michael T. Heaney and Scott D. McClurg (eds.), Social
Networks and American Politics, special issue
(vol. 37, no. 5 =
September 2009) of American Politics Research
- Margaret E. Keck and Kathryn Sikkink, Activists Beyond
Borders: Advocacy Networks in International Politics ["Examines the
networks of activists that operate across national borders, including such
alliances as anti-slavery movements and woman suffrage campaigns in the past
and today's transnational activism in human rights and environmental
politics."]
- David Knoke, Political Networks: The Structural
Perspective
- Anne-Marie Slaughter, A New World Order
[Blurb, with link
to full text of introduction]
- Jasmien Van Daele, "Engineering Social Peace: Networks, Ideas, and
the Founding of the International Labour Organization", International Review
of Social History 50 (2005): 435--466 [From the
abstract: "In 1919 a pioneering generation of scholars, social policy experts, and politicians designed an unprecedented international organizational framework for labour politics. The majority of the founding fathers of this new institution, the International Labour Organization (ILO), had made great strides in social thought and action before 1919. The core members all knew one another from earlier private professional and ideological networks, where they exchanged knowledge, experiences, and ideas on social policy. In this study, one key question is the extent to which prewar 'epistemic communities' ... and political networks, such as the Second International, were a decisive factor in the institutionalization of international labour politics. In the postwar euphoria, the idea of a 'makeable society' was an important catalyst behind the social engineering of the ILO architects. ... This article also deals with how the utopian idea(l)s of the founding fathers --- social justice and the right to decent work --- were changed by diplomatic and political compromises made at the Paris Peace Conference...."]
- Andrew Scott Waugh, Liuyi Pei, James H. Fowler, Peter J. Mucha,
Mason Alexander Porter, "Party Polarization in Congress: A Social Networks Approach", SSRN/1437055
#
Fri, 13 Nov 2009
Phase Transitions and Critical Phenomena
One of the central areas of statistical
mechanics for the last, oh, forty years, to the point where it has
seriously shaped --- one might even say, warpped --- how those of us trained in
that tradition look at the world in general.
(See power laws and
especially self-organized criticality.)
Things I want to understand better. Rigorously separated phases
seem to only exist in infinite-system limits; what are the large-but-finite
regimes like? Connections between phase transitions and changes in the
topology of the phase space. Do there exist ways of deducing the order
parameter from either microscopic Hamiltonians or from macroscopic
observations? Is there a way of detecting phase transitions from macroscopic
observables other than the order parameter and the thermodynamic potential?
Why are there so few fixed points to the renormalization group?
Connections between power law distributions and critical
fluctuations. While I understand the physical arguments for why we see
power-law-distributed fluctuations at the critical point, I find myself wanting
a more probabilistic explanation as well. A crude sketch would go as follows.
Far from the critical point, the microscopic dynamics are rapidly mixing in
space and time --- and mixing in the
technical, ergodic theory sense, so that the
central limit theorem applies, and averages over spatio-temporal regions large
compared to the mixing scales are approximately Gaussian.
(Cf. Rosenblatt, 1956.)
As one approaches the critical point, however, giant, correlated fluctuations
begin to appear, i.e., the mixing scales diverge, and one is dealing with a
process with long-range memory (in both space and time). Under these
circumstances, averaging can deliver a non-Gaussian but still self-similar
distribution, which is where the power-law tails come from. The stable
distributions, including the Gaussian, emerge from the central limit theorem
for independent variables because they are unchanged under convolution
(averaging) with themselves --- there are ways, in renormalization group
theory, of trading off infinite variance (as in the non-Gaussian stable limits)
for infinite range-correlation. This, I should understand better. (The review
paper by Jona-Lasinio is a start, but does not leave me with enough intuition
that I feel entirely comfortable with what's going on — in part, I think,
because nobody entirely understands things.)
Recommended (big picture):
- P. W. Anderson, Basic Notions of Condensed Matter Physics
- L. D. Landau and E. M. Lifshitz, Statistical Physics
- Joel L. Lebowitz, "Statistical mechanics: A selective Review of Two
Central Issues", Reviews of Modern Physics 71
(1999):
S346--S357 = math-ph/0010018
[One of the two issues is first-order phase transitions.]
- James Sethna, "Order Parameters, Broken Symmetry, and Topology",
pp. 243--265 in Lynn Nadel and Daniel L. Stein (eds.), 1990 Lectures in
Complex Systems
- Geoffrey Sewell, Quantum Mechanics and Its Emergent
Macrophysics
- Julia Yeomans, The Statistical Mechanics of Phase Transitions
To read:
- N. G. Antoniou, F. K. Diakonos, E. N. Saridakis, and G. A. Tsolias,
"An efficient algorithm simulating a macroscopic system at the critical point",
physics/0607038 [Getting
around critical-slowing down, using the fact that "dynamics in the order
parameter space is simplified significantly ... due to the onset of
self-similarity in the [fluctuations]. ... [T]he effective action at the
critical point obtains a very simple form. ... [T]his simplified action can be
used in order to simulate efficiently the statistical properties of a
macroscopic system exactly at the critical point"]
- Amir Dembo and Andrea Montanari, "Gibbs Measures and Phase Transitions on Sparse Random Graphs", arxiv:0910.5460
- Cyril Domb, The Critical Point: A Historical Introduction to
the Modern Theory of Critical Phenomena
- Roberto Franzosi and Marco Pettini, "Topology and Phase
Transitions"
- and Lionel Spinelli, "Theorem on a necessary relation", math-ph/0505057
- "Entropy and Topology", math-ph/0505058
- Leo Kadanoff, "More is the Same; Phase Transitions and Mean Field Theories", arxiv:0906.0653
- Michael Kastner, "Phase transitions and configuration space
topology", cond-mat/0703401
- O. C. Martin, R. Monasson and R. Zecchina, "Statistical mechanics
methods and phase transitions in optimization problems," cond-mat/0104428
- Oliver Muelken, Heinrich Stamerjohanns, and Peter Borrmann, "The
Origins of Phase Transitions in Small Systems," cond-mat/0104307
- Marco Pettini, Roberto Franzosi and Lionel Spinelli, "Topology and
Phase Transitions: towards a proper mathematical definition of finite N
transitions," cond-mat/0104110
- Javier Rodriguez-Laguna, "Real Space Renormalization Group
Techniques and Applications," cond-mat/0207340
- Martin Weigel and Wolfhard Janke, "Cross Correlations in Scaling Analyses of Phase Transitions", Physical
Review Letters 102 (2009): 100601 = arxiv:0811.3097
- Ji-Feng Yang, "Renormalization group equations as 'decoupling'
theorems", hep-th/0507024
#
Foundations and History of Statistical Mechanics
Technical issues: things like, what exactly is a C* algebra?
Role of large deviations.
Conceptual issues: Why is it legitimate to treat deterministic
mechanical systems with many unstable degrees of freedom
as stochastic processes? (My impulse is
to appeal to ergodic theory.) When and why
do we get convergence to equilibria characterized by only a few macroscopic
degrees of freedom? (That sounds like a central limit theorem, some
kind of result about how the large-scale limit is insensitive to all but a few
aspects of the small scales.)
Historical issues: It's interesting to know how people have
argued about this stuff.
See also:
Statistical Mechanics;
Nonequilibrium Statistical Mechanics;
Maximum Entropy;
Tsallis Statistics
Recommended:
- David Z. Albert, Time and Chance
- Jean Bricmont, "Science of Chaos or Chaos in Science?", chao-dyn/9603009
- Stephen G. Brush, "Foundations of Statistical Mechanics 1845--1915",
Archive for the History of Exact Sciences 4
(1966): 145--183
- E. G. D. Cohen, "Entropy, Probability and Dynamics", arxiv:0807.1268
- W. De Roeck, Christian Maes and Karel Netocny, "H-Theorems from
Autonomous Equations", cond-mat/0508089
= Journal of
Statistical Physics 123 (2006): 571--584 ["If
for a Hamiltonian dynamics for many particles, at all times the present
macrostate determines the future macrostate, then its entropy is non-decreasing
as a consequence of Liouville's theorem. That observation, made since long, is
here rigorously analyzed with special care to reconcile the application of
Liouville's theorem (for a finite number of particles) with the condition of
autonomous macroscopic evolution (sharp only in the limit of infinite scale
separation); and to evaluate the presumed necessity of a Markov property for
the macroscopic evolution."]
- Richard S. Ellis, Entropy, Large Deviations and
Statistical Mechanics
- A. I. Khinchin, Mathematical Foundations of Statistical
Mechanics
- Joel L. Lebowitz, "Statistical mechanics: A selective Review of Two
Central Issues", Reviews of Modern Physics 71
(1999):
S346--S357, math-ph/0010018
[Abstract: "I give a highly selective overview of the way statistical mechanics
explains the microscopic origins of the time-asymmetric evolution of
macroscopic systems towards equilibrium and of first-order phase transitions in
equilibrium. These phenomena are emergent collective properties not discernible
in the behavior of individual atoms. They are given precise and elegant
mathematical formulations when the ratio between macroscopic and microscopic
scales becomes very large."]
- Michael C. Mackey, Time's Arrow: The Origins of Thermodynamic Behavior [This is a very valuable short introduction to the
ergodic theory of Markov operators, which is
highly relevant to the origins of irreversibility, etc., but I don't think his
approach works, because he focuses on the relative entropy
(Kullback-Leibler divergence from the invariant distribution), rather than the
Boltzmann entropy or even the Gibbs entropy.]
- Benoit Mandelbrot, "The Role of Sufficiency and of Estimation in
Thermodynamics", Annals of Mathematical
Statistics 33 (1962): 1021--1038 [JSTOR; free PDF reprint.
Extensive thermodynamic variables
as sufficient statistics for the
conjugate intensive variables; Gibbs canonical form arising from natural
requirements on finite-dimensional sufficient statistics, which can only be
achieved for exponential families of probability distributions. Very clever.]
- Sandu Popescu, Anthony J. Short, and Andreas Winter, "Entanglement
and the Foundations of Statistical Mechanics", quant-ph/0511225 [Roughly
speaking: due to environmental entanglement, most states of a sub-system look
"thermalized", no matter what the real state of the whole system is]
- Steven Savitt (ed.), Time's Arrows Today: Recent Physical and
Philosophical Work on the Direction of Time
- Geoffrey Sewell
- Quantum Mechanics and Its Emergent
Macrophysics [blurb, ch. 1]
- "On the Question of Temperature Transformations under
Lorentz and Galilei
Boosts", arxiv:0808.0803
[Punch-line: "there is no law of temperature transformation under either
Lorentz or Galilei boosts, and so the concept of temperature stemming from the
Zeroth Law is restricted to states of bodies in their rest frames."]
- Lawrence Sklar, Physics and Chance: Philosophical Issues in
the Foundations of Statistical Mechanics
- W. H. Zurek, "Algorithmic Randomness, Physical Entropy,
Measurements, and the Demon of Choice," quant-ph/9807007
Modesty forbids:
- CRS and Cristopher Moore, "What Is a Macrotate?" cond-mat/0303625
To read:
- Walid K. Abou Salem and Jürg Fröhlich, "Status of the
Fundamental Laws of Thermodynamics", Journal of Statistical
Physics 126 (2007): 1045-1068 ["We describe recent
progress towards deriving the Fundamental Laws of thermodynamics (the 0th, 1st,
and 2nd Law) from nonequilibrium quantum statistical mechanics in simple, yet
physically relevant models."]
- A. E. Allahverdyan and Th. M. Nieuwenhuizen, "Explanation of the
Gibbs paradox within the framework of quantum thermodynamics", Physical Review
E 73 (2006): 066119
= quant-ph/0507145 [The
abstract says many things with which I am sympathetic, most notably coming out
against "a direct association of physical irreversibility with lack of
information", but I don't know if I'll ever find time to read this...]
- Massimiliano Badino
- "The Foundational Role of Ergodic Theory",
phil-sci/2277
- "Probability and Statistics in Boltzmann's Early Papers on
Kinetic Theory", phil-sci/2276
- "Was there a statistical Turn? The Interaction between
Mechanics and Probability in Boltzmann's Theory of Non Equilibrium (1872-1877)",
phil-sci/2878
- Robert W. Batterman, "Why Equilibrium Statistical Mechanics Works:
Universality and the Renormalization Group", Philosopy of
Science 65 (1998): 183--208
[JSTOR]
- Battimelli et al., (eds.), Proceedings of the Int'l
Symposium on Ludwig Boltzmann
- Joseph Berkovitz, Roman Frigg and Fred Kronz, "The Ergodic
Hierarchy, Randomness and Hamiltonian Chaos", phil-sci/2927
- Ludwig Boltzmann, Lectures on Gas Theory [Get the
Dover reprint]
- Michele Campisi, "Mechanical Proof of the Second Law of
Thermodynamics Based on Volume
Entropy", arxiv:0704.2567 [i.e.,
Boltzmann entropy]
- Michele Campisi and Donald H. Kobe, "Derivation of Boltzmann
Principle", arxiv:0911.2070
- Miguel Carrion-Alvarez, "Variations on a theme of Gelfand and
Naimark", math.FA/0402150
[Algebras of observables, including C* algebras as a special case]
- P. Castiglione, M. Falcioni, A. Lesne and A. Vulpiani,
Chaos and Coarse Graining in Statistical Mechanics
[Blurb, Review in
J. Stat. Phys.]
- Hasok Chang, Inventing Temperature: Measurement
and Scientific Progress
- Marius Costeniuc, Richard S. Ellis, Hugo Touchette and Bruce
Turkington, "The Generalized Canonical Ensemble and Its Universal Equivalence
with the Microcanonical Ensemble", Journal of Statistical
Physics 119 (2005): 1283--1329
- Stefano Curtarolo and Gerbrand Ceder, "Dynamic of a non
homogeneously coarse grained system," cond-mat/0106263
- N. D. Hari Dass, S. Kalyana Rama and B. Sathiapalan, "On the
Emergence of the Microcanonical Description from a Pure State," cond-mat/0112439
- Paul and Tatiana Ehrenfest, The Conceptual Foundations of the
Statistical Approach in Mechanics
- Richard S. Ellis, Kyle Haven and Bruce Turkington, "The Large
Deviation Principle for Coarse-Grained Processes," math-ph/0012023
- Denis J. Evans, Debra J. Searles, Stephen R. Williams, "A simple mathematical proof of Boltzmann's equal a priori probability hypothesis", arxiv:0903.1480
- Roman Frigg, "Probability in Boltzmannian Statistical
Mechanics", phil-sci/3489
- Sheldon Goldstein, "Boltzmann's Approach to Statistical
Mechanics," cond-mat/0105242 ["most
twentieth-century innovations are thoroughly misguided"]
- Sheldon Goldstein, Joel L. Lebowitz, Roderich Tumulka, and Nino
Zanghi, "Canonical
Typicality", Physical
Review Letters 96 (2006): 050403
- H. Grad, "The many faces of entropy", Communications on Pure
and Applied Mathematics 14 (1961): 323--354 [Apparently
makes the point that the correct entropy function is dependent on the level of
description. This is important for revising my paper with Cris Moore...]
- A. Greven, G. Keller and G. Warnecke (eds.), Entropy
- D. H. E. Gross
- "Geometric Foundation of Thermo-Statistics, Phase
Transitions, Second Law of Thermodynamics, but without Thermodynamic Limit,"
cond-mat/0201235
- "The microcanonical entropy is multiply differentiable. No
dinosaurs in microcanonical gravitation: No special 'microcanonical phase
transitions'," cond-mat/0403582
- "On the Microscopic Foundation of Thermo-Statistics," cond-mat/0209482
- "A New Thermodynamics,From Nuclei to Stars," cond-mat/0302267
- "Second Law of Thermodynamics, Macroscopic Observables
within Boltzmann's Principle but without Thermodynamic Limit," cond-mat/0101281
- "Thermo-Statistics or Topology of the Microcanonical
Entropy Surface," cond-mat/0206341
- Meir Hemmo and Orly Shenker, "Quantum Decoherence and the Approach
to Equilibrium", Philosophy of Science 70 (2003):
330--358
- Dragi Karevski, "Foundations of Statistical Mechanics: in and out
of Equilibrium", cond-mat/0509595 ["The first
part of the paper is devoted to the foundations, that is the mathematical and
physical justification, of equilibrium statistical mechanics. It is a
pedagogical attempt, mostly based on Khinchin's presentation, which purpose is
to clarify some aspects of the development of statistical mechanics. In the
second part, we discuss some recent developments that appeared out of
equilibrium, such as fluctuation theorem and Jarzynski equality."]
- Martin Krieger, Constitutions of Matter: Mathematically
Modeling the Most Everyday of Physical Phenomena
- Juraj Kumicak, "Irreversibility in a simple reversible
model", Physical
Review E 71 (2005): 016115
= nlin.CD/0510016
- David A. Lavis
- Chuang Liu, "Approximations, Idealizations, and Models in
Statistical Mechanics," PITT-PHIL-SCI00000365
- Benoit Mandelbrot, "On the Derivation of Statistical Thermodynamics
from Purely Phenomenological Principles", Journal of Mathematical
Physics 5 (1964): 164--171 [PDF reprint]
- Oliver Penrose, Foundations of Statistical Mechanics: A
Deductive Treatment
[blurb]
- A. Perez-Madrid, "Gibbs Entropy and Irreversibility", cond-mat/0401532
- E. A. J. F. Peters, "Projection operator formalism and entropy",
cond-mat/0703672
- Denes Petz, "Entropy, von Neumann and the von Neumann Entropy,"
math-ph/0102013
- Peter Reimann, "Foundation of Statistical Mechanics under
Experimentally Realistic
Conditions", Physical Review
Letters 101 (2008): 190403
- David Ruelle
- Statistical Mechanics: Rigorous Results
- Thermodynamic Formalism
- Orly R. Shenker and Meir Hemmo
- "The Von Neumann Entropy: A
Reconsideration", phil-sci/2256
- "Von Neumann's Entropy Does Not Correspond to Thermodynamic Entropy", phil-sci/3716
- Hal Tasaki
- "From Quantum Dynamics to the Second Law of
Thermodynamics," cond-mat/0005128
- "The second law of Thermodynamics as a theorem in quantum
mechanics," cond-mat/0011321
- Jos Uffink
#
Power Law Distributions, 1/f Noise, Long-Memory Time Series
Why do physicists care about power laws so much?
I'm probably not the best person to speak on behalf of our tribal obsessions
(there was a long debate among the faculty at my thesis defense as to whether
"this stuff is really physics"), but I'll do my best. There are two parts
to this: power-law decay of correlations, and power-law size distributions.
The link is tenuous, at best, but they tend to get run together in our heads,
so I'll treat them both here.
The reason we care about power law correlations is that we're conditioned to
think they're a sign of something interesting and complicated happening. The
first step is to convince ourselves that in boring situations, we don't see
power laws. This is fairly easy: there are pretty good and rather generic
arguments which say that systems in thermodynamic equilibrium, i.e. boring
ones, should have correlations which decay exponentially over space and time;
the reciprocals of the decay rates are the correlation length and the
correlation time, and say how big a typical fluctuation should be. This is
roughly first-semester graduate statistical mechanics. (You can find those
arguments in, say, volume one of Landau and Lifshitz's Statistical
Physics.)
Second semester graduate stat. mech. is where those arguments break down ---
either for systems which are far from equilibrium
(e.g., turbulent flows), or in equilibrium but
very close to a critical point (e.g., the transition from a solid to liquid
phase, or from a non-magnetic phase to a magnetized
one). Phase transitions have fluctuations
which decay like power laws, and many non-equilibrium systems do too. (Again,
for phase transitions, Landau and Lifshitz has a good discussion.) If you're a
statistical physicist, phase transitions
and non-equilibrium processes define the
terms "complex" and "interesting" --- especially phase transitions, since we've
spent the last forty years or so developing a very successful theory of
critical phenomena. Accordingly, whenever we see power law correlations, we
assume there must be something complex and interesting going on to produce
them. (If this sounds like the fallacy of affirming the consequent, that's
because it is.) By a kind of transitivity, this makes power laws interesting
in themselves.
Since, as physicists, we're generally more comfortable working in the
frequency domain than the time domain, we often transform the autocorrelation
function into the Fourier spectrum. A power-law decay for the correlations as
a function of time translates into a power-law decay of the spectrum as a
function of frequency, so this is also called "1/f noise".
Similarly for power-law distributions. A simple use of the Einstein
fluctuation formula says that thermodynamic variables will have Gaussian
distributions with the equilibrium value as their mean. (The usual version of
this argument is not very precise.) We're also used to seeing
exponential distributions, as the probabilities of microscopic states. Other
distributions weird us out. Power-law distributions weird us out even more,
because they seem to say there's no typical scale or size for the variable,
whereas the exponential and the Gaussian cases both have natural scale
parameters. There is a connection here with fractals, which also lack typical
scales, but I don't feel up to going into that, and certainly a lot of the
power laws physicists get excited about have no obvious connection to any kind
of (approximate) fractal geometry. And there are lots of power law
distributions in all kinds of data, especially social data --- that's why
they're also called Pareto distributions, after the sociologist.
Physicists have devoted quite a bit of time over the last two decades to
seizing on what look like power-laws in various non-physical sets of data, and
trying to explain them in terms we're familiar with, especially phase
transitions. (Thus "self-organized criticality".) So
badly are we infatuated that there is now a huge, rapidly growing literature
devoted to "Tsallis statistics" or "non-extensive
thermodynamics", which is a recipe for modifying normal statistical
mechanics so that it produces power law distributions; and this, so far as I
can see, is its only good feature. (I will not attempt, here, to
support that sweeping negative verdict on the work of many people who have more
credentials and experience than I do.) This has not been one of our more
successful undertakings, though the basic motivation --- "let's see what we can
do!" --- is one I'm certainly in sympathy with.
There have been two problems with the efforts to explain all power laws
using the things statistical physicists know. One is that (to mangle Kipling)
there turn out to be nine and sixty ways of constructing power laws,
and every single one of them is right, in that it does indeed produce
a power law. Power laws turn out to result from a kind of central limit
theorem for multiplicative growth processes, an observation which apparently
dates back to Herbert Simon, and which has been rediscovered by a number of
physicists (for instance, Sornette). Reed and Hughes have established an even
more deflating explanation (see below). Now, just because these simple
mechanisms exist, doesn't mean they explain any particular case, but
it does mean that you can't legitimately argue "My favorite mechanism
produces a power law; there is a power law here; it is very unlikely there
would be a power law if my mechanism were not at work; therefore, it is
reasonable to believe my mechanism is at work here." (Deborah Mayo would say
that finding a power law does not constitute a severe test of your hypothesis.) You need to do
"differential diagnosis", by identifying other, non-power-law
consequences of your mechanism, which other possible explanations don't share.
This, we hardly ever do.
Similarly for 1/f noise. Many different kinds of stochastic process, with
no connection to critical phenomena, have power-law correlations.
Econometricians and time-series analysts have
studied them for quite a while, under the general heading of "long-memory"
processes. You can get them from things as simple as a superposition of
Gaussian autoregressive processes. (We have begun to awaken to this fact,
under the heading of "fractional Brownian motion".)
The other problem with our efforts has been that a lot of the power-laws
we've been trying to explain are not, in fact, power-laws. I should perhaps
explain that statistical physicists are called that, not because we know a lot
of statistics, but because we study the large-scaled, aggregated effects of the
interactions of large numbers of particles, including, specifically, the
effects which show up as fluctuations and noise. In doing this we learn,
basically, nothing about drawing inferences from
empirical data, beyond what we may remember about curve fitting and
propagation of errors from our undergraduate lab courses. Some of us,
naturally, do know a lot of statistics, and
even teach it --- I might
mention Josef Honerkamp's superb Stochastic Dynamical Systems.
(Of course, that book is out of print and hardly ever cited...)
If I had, oh, let's say fifty dollars for every time I've seen a slide (or a
preprint) where one of us physicists makes a log-log plot of their data, and
then reports as the exponent of a new power law the slope they got from doing a
least-squares linear fit, I'd at least not grumble. If my colleagues had gone
to statistics textbooks and looked up how to estimate the parameters of a
Pareto distribution, I'd be a happier man. If any of them had actually tested
the hypothesis that they had a power law against alternatives like stretched
exponentials, or especially log-normals, I'd think
the millennium was at hand. (If you want to
know how to do these things, please
read this paper, whose merits are
entirely due to my co-authors.) The situation for 1/f noise is not so dire,
but there have been and still are plenty of abuses, starting with the fact that
simply taking the fast Fourier transform of the autocovariance function
does not give you a reliable estimate of the power
spectrum, particularly in the tails. (On that point, see, for
instance, Honerkamp.)
See also:
Chaos and Dynamical Systems;
Complex Networks;
Self-Organized Criticality;
Time Series;
Tsallis Statistics
Recommended, bigger picture:
- Michael Mitzenmacher, "A Brief History of Generative Models for
Power Law and Lognormal Distributions", Internet Mathematics
1 (2003): 226--251
[PDF]
- M. E. J. Newman, "Power laws, Pareto distributions and Zipf's
law", cond-mat/0412004 [If
you read one other thing on power laws, read this]
- Manfred Schroeder, Fractals, Chaos, Power Laws: Minutes from
an Infinite Paradise
Recommended, more technical or more specialized:
- Robert J. Adler, Raise E. Feldman and Murad S. Taqqu
(eds.), A Practical Guide to Heavy Tails [Presumes that you
already know something about statistics and stochastic processes, so not
suitable for beginners.]
- Barry C. Arnold, Pareto Distributions [Fine guide
to the statistical literature, as it was in 1983; still valuable, though
many things which were nasty computations then are easy now.]
- Aaron Clauset, Maxwell Young, and Kristian Skrede Gleditsch, "Scale
Invariance in the Severity of
Terrorism", physics/0606007
[Surprising, but well-supported]
- F. Clementi, T. Di Matteo, M. Gallegati, "The Power-law Tail
Exponent of Income Distributions", physics/0603061
= Physica
A 370 (2006): 49--53 [An interesting way to improve
the accuracy of Hill-type (tail-conditional maximum likelihood) estimates of
the scaling parameter. Written with few concessions to those who are neither
statisticians nor econometricians. Not directly suitable for determining
the range of the scaling region. Income distribution is used only as
an example.]
- Andrew M. Edwards, Richard A. Phillips, Nicholas W. Watkins, Mervyn
P. Freeman, Eugene J. Murphy, Vsevolod Afanasyev, Sergey V. Buldyrev,
M. G. E. da Luz, E. P. Raposo, H. Eugene Stanley and Gandhimohan
M. Viswanathan, "Revisiting Lévy flight search patterns of wandering
albatrosses, bumblebees and
deer", Nature
449 (2007): 1044--1048
- Paul Embrechts and Makoto Maejima, Selfsimilar
Processes
- Michel L. Goldstein, Steven A. Morris and Gary G. Yen, "Fitting to
the Power-Law Distribution", cond-mat/0402322 [Pedestrian,
but accurate, exposition in terms physicists and engineers are likely to
understand. Insufficiently sourced to the statistical literature; e.g., their
calculation of the maximum likelihood estimator was first published in 1952.]
- Josef Honerkamp, Stochastic Dynamical Systems: Concepts,
Numerical Methods, Data Analysis
- Yuji Ijiri and Herbert Simon, Skew Distributions and the
Sizes of Business Firms [Collects Simon and co.'s pioneering papers on
power laws and related distributions --- including "On a Class of Skew
Distribution Functions", below --- as well as considering the limitations,
alternatives, modifications to match data, statistical issues, the connection
to Bose-Einstein statistics, the importance of going beyond just staring at
distributional plots if you want to learn about mechanisms, etc., etc. This
was all published in 1977...]
- A. James and M. J. Plank, "On fitting power laws to ecological
data", arxiv:0712.0613
- Raya Khanin and Ernst Wit, "How Scale-Free Are Biological
Networks?",
Journal of Computational Biology
13 (2006): 810--818 [Ans.: not very scale-free at all.]
- Joel Keizer, Statistical Thermodynamics of Nonequilibrium
Processes [Has a good discussion of critical fluctuations in chapter
8. Review: Molecular Fluctuations for Fun and
Profit]
- Paul Krugman, The Self-Organizing Economy [Has a nice
discussion of power-law size distributions in economics. Review]
- Michael LaBarbera, "Analyzing Body Size as a Factor in Ecology and
Evolution", Annual Review of Ecology and
Systematics 20 (1989): 91--117 [Statistical problems in
many studies of power-law scaling in biology, their effects on the conclusions
of those studies (ranging from "wrong, but correctable" to "meaningless"), and
how to do it right. JSTOR]
- J. Laherrère and D. Sornette, "Stretched exponential
distributions in nature and economy: 'fat tails' with characteristic scales",
The European Physical Journal B 2 (1998):
525--539
- L. D. Landau and E. M. Lifshitz, Statistical Physics
[For the theory of fluctuations in statistical mechanics, and for critical
phenomena in equilibrium]
- Elliott W. Montroll and Michael F. Shlesinger, "On 1/f noise and
other distributions with long
tails", Proceedings
of the National Academy of Sciences (USA) 79 (1982):
3380--3383
- V. F. Pisarenko and D. Sornette, "New statistic for financial
return distributions: power-law or exponential?", physics/0403075 [Actually, two
new statistics: one converges to a constant if the distribution you're sampling
from is an exponential, independent of the exponent, and the other converges to
a constant if the distribution is a power law, independent of the power. They
even have some indications of the sampling distributions, so you can at least
gauge the statistical signifcance, i.e., the probability of deviations from the
ideal value, even though the distribution really is of the appropriate type. I
don't recall anything about the power of these statistics, however (i.e., the
probability that a power law will look like an exponential, or vice-versa).]
- William J. Reed and Barry D. Hughes, "From Gene Families and Genera
to Incomes and Internet File Sizes: Why Power Laws are so Common in
Nature", Physical
Review E 66 (2002): 067103 [This is, as I said,
perhaps the most deflating possible explanation for power law size
distributions. Imagine you have some set of piles, each of which grows,
multiplicatively, at a constant rate. New piles are started at random times,
with a constant probability per unit time. (This is a good model of my
office.) Then, at any time, the age of the piles is exponentially distributed,
and their size is an exponential function of their age; the two exponentials
cancel and give you a power-law size distribution. The basic combination of
exponential growth and random observation times turns out to work even if it's
only the mean size of piles which grows exponentially.]
- M. V. Simkin and V. P. Roychowdhury, "Re-inventing Willis",
physics/0601192 [The comical,
yet pathetic, history of the innumerable re-inventions of basic mechanisms
which plague this area]
- Herbert Simon, "On a Class of Skew Distribution Functions",
Biometrika 42 (1955): 425--440 [JSTOR]
- Didier Sornette
- "Multiplicative Processes and Power Laws" cond-mat/9708231
= Physical Review E 57 (1998): 4811--4813
- "Mechanism for Powerlaws without Self-Organization"
cond-mat/0110426
- Stilian A. Stoev, George Michailidis, and Murad S. Taqqu,
"Estimating heavy-tail exponents through max self-similarity", math.ST/0609163
- Bruce J. West and Bill Deering, The Lure of Modern Science:
Fractal Thinking [Despite the painful title, this is actually a
very good book. I disagree with some of the more philosophical positions they
take, but on the actual science and math they're quite sound.]
- Damian H. Zanette, "Zipf's law and the creation of musical
context", cs.CL/0406015 [This
sounds bizarre, and I'd not have bothered to even note it if I didn't
know Zanette's work in other areas, which shows him to be a good and careful
scientist. And this is actually an interesting and meaningful little paper,
which has something non-trivial to say about music. It's worth noting,
perhaps, that the distribution he actually ends up fitting isn't a pure power
law, but a modification inspired by Simon's paper. Thanks to John Burke for
prodding me to actually read it.]
Not altogether recommended (without being actively dis-recommended either):
- R. Alexander Bentley, Paul Ormerod, Michael Batty, "An evolutionary
model of long tailed distributions in the social
sciences", arxiv:0903.2533 [This
is a minor modification of the classical Yule/Simon mechanism for random
growth, with the main advantage being that (with the right parameter tweaking)
it allows for more turn-over of which values are most common.
Unsurprisingly, this is done by adding extra parameters, and so the family of
distributions is more flexible. The statistical procedures used are however
bad, and the "regularity" that the estimated power law exponent grows as the
amount of data held in the tail shrinks has a very simple explanation: the
tails aren't really power laws.]
Modesty forbids me to recommend:
- Aaron Clauset, CRS and M. E. J. Newman, "Power-law distributions in
empirical data", SIAM Review 51 (2009): 661--703
= arxiv:0706.1062 [with commentary
by Aaron
and myself]
To read:
- Eduardo G. Altmann and Holger Kantz, "Recurrence time analysis,
long-term correlations, and extreme events", physics/0503056
- J. A. D. Aston, "Modeling macroeconomic time series via heavy
tailed distributions", math.ST/0702844
- Katarzyna Bartkiewicz, Adam Jakubowski, Thomas Mikosch, Olivier Wintenberger, "Infinite variance stable limits for sums of dependent random variables", arxiv:0906.2717
- Michael Batty, "Rank Clocks", Nature
444 (2006): 592--596
- P. Besbeas and B. J. T. Morgan, "Improved estimation of the stable
laws", Statistics
and Computing 18 (2008): 219--231
- Thierry Bochud and Damien Challet, "Optimal approximations of
power-laws with exponentials", physics/0605149 ["We propose an
explicit recursive method to approximate a power-law with a finite sum of
weighted exponentials. Applications to moving averages with long memory are
discussed in relationship with stochastic volatility models." The last part
sounds like a rediscovery of Granger.]
- Laurent E. Calvet and
Adlai J. Fisher, Multifractal Volatility: Theory, Forecasting, and
Pricing [Thanks to Prof. Calvet for bringing this to my attention]
- Anna Carbone and Giuliano Castelli, "Scaling Properties of
Long-Range Correlated Noisy Signals,"
cond-mat/0303465
- C. Cattuto, V. Loreto and V. D. P. Servedio, "A Yule-Simon process
with memory", cond-mat/0608672 [Memo
to self: compare this to the auto-correlated Yule-Simon process in
Ijiri and Simon's book.]
- Anirban Chakraborti, Marco Patriarca, "A Variational Principle for
Pareto's power
law", cond-mat/0605325
- Ali Chaouche and Jean-Noel Bacro, "Statistical Inference for
the Generalized Pareto Distribution: Maximum Likelihood Revisited", Communications in Statistics: Theory and Methods 35
(2006): 785--802
- F. Clementi, M. Gallegati, "Pareto's Law of Income Distribution:
Evidence for Germany, the United Kingdom, and the United
States", physics/0504217
- Cline, heavy-tailed noise, 1983 (?)
- B. Conrad and M. Mitzenmacher, "Power Laws for Monkeys Typing
Randomly: The Case of Unequal Probabilities", IEEE Transactions on
Information Theory 50 (2004): 1403--1414
- Bikramjit Das and Siddney I. Resnick, "QQ plots, Random sets and
data from a heavy tailed distribution", math.PR/0702551
- Anirban Dasgupta, John Hopcroft, Jon Kleinberg and
Mark Sandler, "On Learning Mixtures of Heavy-Tailed Distributions"
- T. Di Matteo, T. Aste and M. Gallegati, "Innovation flow through
social networks: Productivity distribution", physics/0406091 [Those look an
awful lot like log-normals to me.]
- Paul Doukhan, George Oppenheim and Murad S. Taqqu
(eds.), Theory and Applications of Long-Range Dependence
- Rick Durrett and Jason Schweinsberg, "Power laws for family sizes
in a duplication model", math.PR/0406216
= Annals of Probability 33 (2005): 2094--2126
- R. Fox and M. S. Taqqu
- "Noncentral Limit Thorems for Quadratic Forms in Random
Variables Having Long-Range Dependence," Annals of Probability
13 (1985) 428--446
- "Central Limit Theorems for Quadratic Forms in Random
Variables Having Long-Range Dependence," Probability Theory and Related
Fields 74 (1987): 213--240
- G. Frenkel, E. Katzav, M. Schwartz and N. Sochen, "Distribution of
Anomalous Exponents of Natural Images", Physical Review
Letters 97 (2006): 103902
- U. Frisch and D. Sornette, "Extreme Deviations and Applications",
J. Phys. I France 7 (1997): 1155--1171
- Akihiro Fujihara, Toshiya Ohtsuki and Hiroshi Yamamoto
- Akihiro Fujihara, Satoshi Tanimoto, Toshiya Ohtsuki, Hiroshi
Yamamoto, "Log-normal distribution in growing systems with weighted
multiplicative interactions", cond-mat/0511625
- Yoshi Fujiwara, Corrado Di Guilmi, Hideaki Aoyama, Mauro Gallegati,
Wataru Souma, "Do Pareto-Zipf and Gibrat laws hold true? An analysis with
European
Firms", cond-mat/0310061
- Xavier Gabaix, "Power Laws in Economics and Finance"
[PDF preprint]
- M. Ivette Gomes, M. Isabel Fraga Alves, Paulo Araujo Santos,
"PORT Hill and Moment Estimators for Heavy-Tailed Models",
Communications in Statistics:
Simulation and Computation 37
(2008): 1281--1306
- Alexander Gnedin, Ben Hansen, Jim Pitman, "Notes on the occupancy
problem with infinitely many boxes: general asymptotics and power
laws", math.PR/0701718
- J. A. Gubner, "Theorems and Fallacies in the Theory of
Long-Range-Dependent Processes", IEEE Transactions on
Information Theory 51 (2005): 1234--1239
- Alexandra Guerrero and Leonard A. Smith, "A maximum likelihood
estimator for long-range persistence", Physica
A 355 (2005): 619--632
- Rudolf Hanel and Stefan Thurner, "On the Derivation of power-law
distributions within standard statistical mechanics", cond-mat/0412016
- Bruce M. Hill and Michael Woodroofe, "Stronger Forms of Zipf's
Law", Journal of the American Statistical
Association 70 (1975): 212--219 [JSTOR]
- Byoung Hee Hong, Kyoung Eun Lee, Jae Woo Lee, "Power Law in Firms
Bankruptcy", physics/0701302
- Y. Hosoya
- "The quasi-likelihood approach to statistical inference on
multiple time-series with long-range dependence," Journal of
Econometrics 73 (1996): 217--236
- "A limit theory for long-range dependence and statistical
inference on related models," Annals of Statistics
25 (1997): 105--137
- Takashi Ichinomiya, "Power-law distribution in Japanese racetrack
betting", physics/0602165
- Milton Jara, Tomasz Komorowski and Stefano Olla,
"Limit theorems for additive functionals of a Markov chain", arxiv:0809.0177 [Convergence to alpha-stable
distributions]
- Predrag R. Jelenkovic, Jian Tan, "Modulated Branching Processes,
Origins of Power Laws and Queueing
Duality", 0709.4297
- Taisei Kaizoji, "Power laws and market
crashes", physics/0603138
- Imen Kammoun, Vernoique Billat and Jean-Marc Bardet, "A new
stochastic process to model Heart Rate series during exhaustive run and an
estimator of its fractality
parameter", arxiv:0803.3675
[Includes statistical criticism of the common, but deeply unsatisfying,
"detrended fluctuation analysis" method of estimating the Hurst exponent.]
- B. Kaulakys and J. Ruseckas, "Stochastic nonlinear
differential equation generating 1/f noise", Physical Review E
70 (2004): 020101 = cond-mat.0408507
- K. Kiyani, S. C. Chapman and B. Hnat, "A method for extracting the
scaling exponents of a self-affine, non-Gaussian process from a finite length
timeseries", physics/0607238
- Francois M. Longin, "The Asymptotic Distribution of Extreme Stock
Market Returns", The Journal of Business 69
(1996): 383--408
[JSTOR]
- Bruce D. Malamud, James D. A. Millington and George L. W. Perry,
"Characterizing wildfire regimes in the United States", Proceedings of the
National Academy of Sciences (USA) 102 (2005):
4694--4699
- Y. Malevergne, V.F. Pisarenko, D. Sornette, "Empirical
Distributions of Log-Returns: between the Stretched Exponential and the Power
Law?", physics/0305089
- Natalia Markovich, Nonparametric Analysis of Univariate
Heavy-Tailed Data: Research and Practice
- Joseph L. McCauley, Gemunu H. Gunaratne, Kevin E. Bassler, "Hurst
Exponents, Markov Processes, and Fractional Brownian
motion", cond-mat/0609671
- Richard Metzler, "Comment on 'Power-law correlations in the
southern-oscillation-index fluctuations characterizing El
Nino'", Physical
Review E 67 (2003): 018201
- Edoardo Milotti, "Model-based fit procedure for power-law-like
spectra", physics/0510011
- Elliott W. Montroll and Michael Shlesinger, "Maximum entropy
formalism, fractals, scaling phenomena and 1/f noise: A tale of tails",
Journal of Statistical Physics 32 (1983):
209--230
- Eric Moulines, Francois Roueff, Murad S. Taqqu, "A Wavelet Whittle
estimator of the memory parameter of a non-stationary Gaussian time
series", math/0601070
- Newton J. Moura Jr. and Marcelo B. Ribeiro, "Zipf Law for Brazilian
Cities", physics/0511216
- J. F. Muzy, E. Bacry and A. Kozhemyak, "Extreme values and fat
tails of multifractal
fluctuations", Physical Review
E 73 (2006): 066114
= cond-mat/0509357
["problem of the estimation of extreme event occurrence probability for data
drawn from some multifractal process. We also study the heavy (power-law) tail
behavior of probability density function associated with such data. We show
that because of strong correlations, standard extreme value approach is not
valid and classical tail exponent estimators should be interpreted cautiously"]
- Richard Perline, "Strong, Weak and False Inverse Power Laws",
Statistical
Science 20 (2005): 68--88
- Sidney I. Resnick, Heavy-Tail Phenomena: Probabilistic and
Statistical Modeling
[Blurb]
- Sidney Resnick and Catalin Starica, "Tail Index Estimation for
Dependent Data", The Annals of Applied
Probability 8 (1998): 1156--1183
[JSTOR]
- Massimo Riccaboni, Fabio Pammolli, Sergey V. Buldyrev, Linda Ponta,
H. Eugene Stanley , "The Size Variance Relationship of Business Firm Growth
Rates", arxiv:0904.1404
= Proceedings of the National Academy of Sciences
(USA) 105 (2008): 19595--19600
- Alexander Roitershtein, "One-dimensional linear recursions with
Markov-dependent
coefficients", math/0409335
= Annals of
Applied Probability 17 (2007): 572--608 [To
summarize the abstract, suppose S(n) = A(n) + B(n)*S(n-1), where A(n) and B(n)
are Markov sequences. Then "the distribution tail of its stationary solution
has a power law decay." This sounds like Simon's argument made fully general.]
- Holger Rootzen, M. Ross Leadbetter and Laurens de Haan, "On the
distribution of tail array sums for strongly mixing stationary
sequences", Annals of Applied Probability 8
(1998): 868--885
[JSTOR]
- Gennady Samorodnitsky and Murad S. Taqqu, Stable
Non-Gaussian Random Processes
- D. Sornette and V. F. Pisarenko, "Properties of a simple bilinear
stochastic model: estimation and predictability", physics/0703217
- Stilian A. Stoev and Murad S. Taqqu, "Limit Theorems for Sums of Heavy-tailed Variables with Random Dependent Weights",
Methodology and
Computing in Applied Probability 9 (2007): 55--87
- Ciprian Tudor and Frederi Viens, "Variations and estimators for the selfsimilarity order through Malliavin calculus", arxiv:0709.3896
- Caglar Tuncay, "A universal model for languages and cities, and
their
lifetimes", physics/0703144
- John Vandermeer and Ivette Perfecto, "A Keystone Mutualism Drives
Pattern in a Power Function", Science
311 (2006): 1000--1002 [I don't think their title
is grammatical!]
- Sergio Venturini, Francesca Dominici, Giovanni Parmigiani, "Gamma
shape mixtures for heavy-tailed distributions", Annals of Applied
Statistics 2 (2008): 756--776
= arxiv:0807.4663
- Rafal Weron
- "Estimating long range dependence: finite sample
properties and confidence intervals," cond-mat/0103510
- "Measuring long-range dependence in electricity prices,"
cond-mat/0103621
- T. S. T. Wong and W. K. Li, "A note on the estimation of extreme
value distributions using maximum product of spacings",
math.ST/0702830
- Wei Biao
Wu, Xiaofeng Shao, "Invariance principles for fractionally integrated
nonlinear
processes", math.PR/0608223
- Seokhoon Yun, "The Extremal Index of a Higher-Order
Stationary Markov Chain", The Annals of Applied Probability
8 (1998): 408--437
[JSTOR]
- Damian H. Zanette, "Zipf's law and city sizes: A short tutorial
review on multiplicative processes in urban
growth", arxiv:0704.3170
#
Neural Modeling and Data Analysis
Especially, but not exclusively, modeling of spike trains (which is
important for neural coding, and overlaps therewith).
Things to investigate: How easy would it be to adapt spike-sorting
algorithms to cluster or classify other kinds of time series? Easy or not,
would there be any point?
See also:
Neural Coding;
Synchronization in Neural Systems;
Neuroscience in general
Recommended (also look at the recommendations
under coding and
synchronization):
- David Brillinger, "Nerve Cell Spike Train Data Analysis: A
Progression of Technique," Journal of the American Statistical
Association
87 (1992): 260--270
- Emery N. Brown, Robert E. Kass and Partha P. Mitra, "Multiple
Neural Spike Train Data Analysis: State-of-the-art and Future
Challanges", Nature
Neuroscience 7 (2004): 456--461
[PDF reprint via
Rob]
- Sami El Boustani, Alain Destexhe, "Does brain activity stem from high-dimensional chaotic dynamics? Evidence from the human electroencephalogram, cat cerebral cortex and artificial neuronal networks", arxiv:0904.4217
- Chris Eliasmith and Charles Anderson, Neural Engineering:
Computation, Representation, and Dynamics in Neurobiological Systems
- Matthew T. Harrison and Stuart Geman, "A Rate and
History-Preserving Resampling Algorithm for Neural Spike Trains",
Neural Computation 21 (2009): 1244--1258
- Yoshito Hirata, Kevin Judd and Kazuyuki Aihara, "Characterizing
chaotic response of a squid axon through generating
partitions", Physics Letters
A 346 (2005): 141--147 [The obvious approach
to symbolic dynamics for spike trains
works.]
- Robert E. Kass, Valerie Ventura and Emery N. Brown, "Statistical
Issues in the Analysis of Neuronal
Data", Journal of
Neurophysiology 94 (2005): 8--25
[PDF reprint via
Rob]
- Shinsuke Koyama and Shiegeru Shinomoto, "Empirical Bayes
interpretations of random point
events", Journal
of Physics A: Mathematical and General
38 (2005): L531--L537
- Martin A. Lindquist, "The Statistical Analysis of fMRI
Data", Statistical Science 23 (2008): 439--464
= arxiv:0906.3662
- Murat Okatan, Matthew A. Wilson and Emery N. Brown,
"Analyzing Functional Connectivity Using a Network
Likelihood Model of Ensemble Neural Spiking Activity", Neural
Computation 17 (2005): 1927--1961
- Liam Paninski, Jonathan Pillow, and Jeremy Lewi, "Statistical
models for neural encoding, decoding, and optimal stimulus design", to appear
in P. Cisek, T. Drew and J. Kalaska (eds.), Computational Neuroscience:
Progress in Brain Research
[PDF
preprint]
- Sommer and Wichert (eds.), Exploratory Analysis and Data
Modeling in Functional Neuroimaging [Conference proceedings, so uneven and not too tightly integrated, but covers a lot of ground.]
Modesty forbids me to recommend:
- Robert Haslinger, Kristina Lisa Klinkner and CRS, "The
Computational Structure of Spike
Trains", Neural
Computation forthcoming
To read:
- Pierre Baldi, "Probabilistic Models of Neuronal Spike Trains," in
Giles and Gori (eds.), Adaptive Processing of Sequences and Data
Structures
- Peter beim Graben, J. Douglas Saddy, Matthias Schlesewsky and
Jürgen Kurths, "Symbolic Dynamics of Event-Related Brain Potentials,"
Physical Review E 62 (2000): 5518--5541
- William Bialek, "Thinking about the brain," physics/0205030
- Hemant Bokil, Bijan Pesaran, R. A. Andersen and Partha P. Mitra, "A
framework for detection and classification of events in neural activity", q-bio.NC/0507045
- Romain Brette and Wulfram Gerstner, "Adaptive Exponential
Integrate-and-Fire Model as an Effective Description of Neuronal
Activity", Journal of
Neurophysiology 94 (2005): 3637--3642
- R. Brette, M. Rudolph, T. Carnevale, M. Hines, D. Beeman,
J. M. Bower, M. Diesmann, A. Morrison, P. H. Goodman, F. C. Harris Jr.,
M. Zirpe, T. Natschlager, D. Pecevski, B. Ermentrout, M. Djurfeldt,
A. Lansner, O. Rochel, T. Vieville, E. Muller, A. P. Davison, S. El Boustani,
and A. Destexhe, "Simulation of networks of spiking neurons: A review of tools
and strategies", q-bio.NC/0611089
- David Brillinger, "Some statistical methods for random process
data from seismology and neurophysiology", Annals of Statistics
16 (1988): 1--54
- David Cai, Louis Tao and David W. McLaughlin, "An Embedded Network
Approach for Scale-Up of Fluctuation-Driven Systems with Preservation of Spike
Information", Proceedings of the
National Academy of Sciences (2004) [Abstract: " address
computational 'scale-up' issues in modeling large regions of the cortex, many
coarse-graining procedures have been invoked to obtain effective descriptions
of neuronal network dynamics. However, because of local averaging in space and
time, these methods do not contain detailed spike information and, thus, cannot
be used to investigate, e.g., cortical mechanisms that are encoded through
detailed spike-timing statistics. To retain high-order statistical information
of spikes, we develop a hybrid theoretical framework that embeds a subnetwork
of point neurons within, and fully interacting with, a coarse-grained network
of dynamical background. We use a newly developed kinetic theory for the
description of the coarse-grained background, in combination with a Poisson
spike reconstruction procedure to ensure that our method applies to the
fluctuation-driven regime as well as to the mean-driven regime. This
embedded-network approach is verified to be dynamically accurate and
numerically efficient. As an example, we use this embedded representation to
construct 'reverse-time correlations' as spiked-triggered averages in a ring
model of orientation-tuning dynamics." ]
- Hock Peng Chan and Wei-Liem Loh, "Some theoretical results on
neural spike train probability
models", math.ST/0703829
- Yonghong Chen, Steven L. Bressler, and Mingzhou Ding, "Frequency
decomposition of conditional Granger causality and application to multivariate
neural field potential
data", q-bio.NC/0608034
= Journal of Neuroscience Methods 150 (2006):
228--237
- Zhiyi Chi, "Large deviations for template matching between point
processes", Annals of Applied
Probability 15 (2005): 153--174 = math.PR/0503463
- Carson Chow, Boris Gutkin, David Hansel, Claude Meunier and Jean
Dalibard (eds.), Methods and Models in Neurophysics: Lecture Notes of the
Les Houches Summer School 200
- Mauro Copelli and Osame Kinouchi, "Intensity Coding in
Two-Dimensional Excitable Neural Networks", q-bio.NC/0409032
[Greenberg-Hastings cellular automata as a toy model of visual response!]
- Luciano da F. Costa and Olaf Sporns, "Hierarchical Features of
Large-Scale Cortical Connectivity", q-bio.NC/0508007
- J. Davidsen and H. G. Schuster, "Simple model for 1/f noise,"
cond-mat/0201198 [a null
model]
- Peter Dayan and Larry Abbott, Theoretical Neuroscience
[website]
- Matthieu Delescluse and Christophe Pouzat, "Efficient spike-sorting
of multi-state neurons using inter-spike intervals information", q-bio.QM/0505053
- Mingzhou Ding, Yonghong Chen and Steve L. Bressler,
"Granger Causality: Basic Theory and Application to Neuroscience",
q-bio.QM/0608035 = pp.
451--474 in B. Schelter, M. Winterhalder, and J. Timmer (eds.), Handbook
of Time Series Analysis
- Victor M. Eguiluz, Dante R. Chialvo, Guillermo A. Cecchi, Marwan
Baliki and A. Vania Apkarian, "Scale-free brain functional networks",
Physical Review
Letters 94 (2005): 018102 = cond-mat/0309092
- Michael D. Fox, Abraham Z. Snyder, Justin L. Vincent, Maurizio
Corbetta, David C. Van Essen and Marcus E. Raichle, "The human brain is
intrinsically organized into dynamic, anticorrelated functional networks", Proceedings of the National
Academy of Sciences 102 (2005): 9673--9678
- Wulfram Gerstner, Spiking Neuron Models
- Gail Gilboa, Ronen Chen, and Naama Brenner, "History-Dependent
Multiple-Time-Scale Dynamics in a Single-Neuron Model", Journal of
Neuroscience 25 (2005): 6479--6489
- Paul Glimcher, Decisions, Uncertainty, and the Brain:
The Science of Neuroeconomics
- Norma V. S. Graham, Visual Pattern Analyzers
- Andreas Grönlund, "The difference in directed structure of
Neural and Transcriptional Regulation Networks", cond-mat/0406268
- Richard H. R. Hahnloser, "Stationary transmission distribution of
random spike trains by dynamical synapses," Physical Review E
67 (2003) 022901
- Ronald M. Harris-Warrick, Eve Marder, Allen I. Selverston and
Maurice Moulins (eds.), Dynamic Biological Networks: The Stomatogastric
Nervous System [Blurb]
- H. R. Heekeren, S. Marrett, P. A. Bandettini and L. G. Ungerleider,
"A general mechanism for perceptual decision-making in the human brain",
Nature 431
(859--862)
- Kim L. Hoke, Michael J. Ryan, and Walter Wilczynski, "Social cues
shift functional connectivity in the hypothalamus", PNAS 102
(2005): 10712--10717
- Kazushi Ikeda, "Information Geometry of Interspike Intervals in
Spiking Neurons", Neural
Computation 17 (2005): 2719--2735
- Eugene M. Izhikevich, Dynamical Systems in Neuroscience: The
Geometry of Excitability and Bursting
[Blurb]
- Alon Keinan, Ben Sandbank, Claus C. Hilgetag, Isaac Meilijson and
Eytan Ruppin, "Fair Attribution of Functional Contribution in Artificial and
Biological Networks", Neural
Computation 16 (2004): 1887--1915
- Alexei A. Koulakov, Dmitry Rinberg and Dmitry N. Tsigankov, "How to
find decision makers in neural circuits?", q-bio.NC/0401005
- Li Zhaoping, Alex Lewis and Silvia Scarpetta, "Mathematical
Analysis and Simulations of the Neural Circuit for Locomotion in Lamprey", q-bio.NC/0404012
- Steven J. Luck, An Introduction to the Event-Related
Potential Technique
[Blurb]
- Wolfgang Maass and Eduardo D. Sontag, "Neural Systems as Nonlinear
Filters," Neural Computation 12 (2000):
1743--1772
- Roy Mukamel, Hagar Gelbard, Amos Arieli, Uri Hasson, Itzhak Fried
and Rafael Malach, "Coupling Between Neuronal Firing, Field Potentials, and
fMRI in Human Auditory Cortex", Science
309 (2005): 951--954 ["als in auditory cortex of two
neurosurgical patients and compared them with the fMRI signals of 11 healthy
subjects during presentation of an identical movie segment. The predicted fMRI
signals derived from single units and the measured fMRI signals from auditory
cortex showed a highly significant correlation (r = 0.75, P < 10^-47). Thus,
fMRI signals can provide a reliable measure of the firing rate of human
cortical neurons."]
- Murat Okatan, Matthew A. Wilson and Emery N. Brown, "Analyzing
Functional Connectivity Using a Network Likelihood Model of Ensemble Neural
Spiking
Activity", Neural
Computation 17 (2005): 1927--1961
- Liam Paninski, Jonathan W. Pillow and Eero P. Simoncelli, "Maximum
Likelihood Estimation of a Stochastic Integrate-and-Fire Neural Encoding
Model", Neural
Computation 16 (2004): 2533--2561
- G. Pola, R. S. Petersen, A. Thiele, M. P. Young and S. Panzeri,
"Data-Robust Tight Lower Bounds to the Information Carried by Spike Times of a
Neuronal Population", Neural
Computation
17 (2005): 1962--2005
- R. Quian Quiroga, Z. Nadasdy and Y. Ben-Shaul, "Unsupervised Spike
Detection and Sorting with Wavelets and Superparamagnetic
Clustering", Neural
Computation 16 (2004): 1661--1687
- Rajesh P. N. Rao (ed.), Probabilistic Models of the Brain:
Perception and Neural Function
- George N. Reeke and Allan D. Coop, "Estimating the Temporal
Interval Entropy of Neuronal
Discharge", Neural
Computation 16 (2004): 941--970 [From the abstract,
I'm skeptical. They're assuming that successive inter-spike intervals are
all independent samples from a fixed distribution
of known parametric form, and then using maximum likelihood to
estimate the parameters, which of course gives them an entropy estimate and
confidence intervals. But all of the italicized points seem dubious to me.
Still, I need to read it.]
- Hermann Riecke, Alex Roxin, Santiago Madruga and Sara A. Solla,
"Multiple attractors, long chaotic transients, and failure in small-world
networks of excitable
neurons", Chaos
17 (2007): 026110
- P. A. Robinson, "Propagator theory of brain dynamics", Physical Review
E 72 (2005): 011904
- Naoki Saito, "The Generalized Spike Process, Sparsity, and
Statistical Independence,"
math.PR/0110103
- P. S. Sastry and K. P. Unnikrishnan, "Conditional probability based
significance tests for sequential patterns in multi-neuronal spike
trains", arxiv:0808.3511
- Silvia Scarpetta, Zhaoping Li and John Hertz, "Hebbian imprinting
and retrieval in oscillatory neural networks," cond-mat/0111034
- Margaret Euphrasia Sereno, Neural Computation of Pattern
Motion
- Anil K. Seth, Gerald M. Edelman, "Distinguishing Causal
Interactions in Neural Populations", Neural
Computation 19 (2007): 910--933
- Xilin Shen, Francois G. Meyer, "Low Dimensional Embedding of fMRI
datasets", arxiv:0709.3121
["embedding optimally preserves the local functional coupling between fMRI time
series, and provides a low-dimensional coordinate system for detecting
activated voxels. To compute the embedding, we build a network of functionally
connected voxels and represent it with a graph. A spectral decomposition of the
graph probability transition matrix produces a set of eigenvectors that are
used to define the embedding"]
- Lavi Shpigelman, Yoram Singer, Rony Paz and Eilon Vaadia,
"Spikernels: Predicting Arm Movements by Embedding Population Spike Rate
Patterns in Inner-Product Spaces",
Neural
Computation 17 (2005): 671--690 ["Inner-product
operators, often referred to as kernels in statistical learning, define a
mapping from some input space into a feature space. The focus of this letter is
the construction of biologically motivated kernels for cortical activities. The
kernels we derive, termed Spikernels, map spike count sequences into an
abstract vector space in which we can perform various prediction tasks. We
discuss in detail the derivation of Spikernels and describe an efficient
algorithm for computing their value on any two sequences of neural population
spike counts. We demonstrate the merits of our modeling approach by comparing
the Spikernel to various standard kernels in the task of predicting hand
movement velocities from cortical recordings. All of the kernels that we tested
in our experiments outperform the standard scalar product used in linear
regression, with the Spikernel consistently achieving the best performance."]
- Terence R. Stanford, Stephan Quessy and Barry E. Stein, "Evaluating
the Operations Underlying Multisensory Integration in the Cat Superior
Colliculus", Journal of
Neuroscience 25 (2005): 6499--6508
- Akaysha C. Tang, Barak A. Pearlmutter, Natalie A. Malaszenko, Dan
B. Phung, and Bethany C. Reeb, "Independent Components of
Magnetoencephalography: Localization," Neural Computation
14 (2002): 1827--1858 [Reprinted in Sommer and Wichert?]
- Wilson Truccolo, John P. Donoghue, "Nonparametric Modeling of
Neural Point Processes via Stochastic Gradient Boosting Regression", Neural
Computation 19 (2007): 672-705
- Arjen vanOoyen (ed.), Modeling Neural Development
- Valérie Ventura, "Testing for and Estimating Latency Effects
for Poisson and Non-Poisson Spike
Trains", Neural
Computation 16 (2004): 2323--2349
- T. Verechtchaguina, L. Schimansky-Geier and I. M. Sokolov, "Spectra
and waiting-time distributions in firing resonant and non-resonant neurons", q-bio.NC/0401013 [Need to see
whether their ability to determine response properties from interspike-interval
distributions is limited to FitzHugh-Nagumo neurons, or is more general.]
- Hugh R. Wilson, Spikes, Decisions and Actions: The Dynamical
Foundations of Neuroscience
- Tor D. Wager and Tomas E. Nichols, "Optimization of experimental
design in fMRI: A general framework using a genetic
algorithm", Neuroimage 18 (2003): 293--309
- Masahiko Yoshioka, "The spike-timing-dependent learning rule to
encode spatiotemporal patterns in a network of spiking neurons," cond-mat/0110070
#
The Enlightenment
Voltaire, Diderot, Hume, La Mettrie, Smith,
Gibbon. Origins of the revolution, the Left. Relations to science, superstition,
Romanticism, the industrial revolution.
Connections and attitudes to classical antiquity, the Renaissance.
Recommended:
- Ernst Cassirer, Philosophy of the
Enlightenment [This is and fully deserves to be a classic work, but he
has far to much on obscure people who either thought they were
building on Leibniz, or whom Cassirer thought were making straight the way for
Kant and Hegel, i.e., his approach to history is still too teleological.]
- Jean Le Rond D'Alembert, Preliminary Discourse to the
Encyclopédie
- Robert Darnton, Mesmerism and the End of the Enlightenment in
France
- Peter Gay, The Enlightenment: An Interpretation
[Dividing through for the Freudian mish-mash, fortunately under pretty good
control here]
- Ernest Gellner, Thought and
Change [The philosophes as the first modernizing intellectuals
worked up about under-development.]
- Jonathan I. Israel [Or, how Spinoza overthrew the old regime.]
- Radical Enlightenment: Philosophy and
the Making of Modernity, 1650--1750
- A Revolution of the Mind:
Radical Enlightenment and the Intellectual Origins of Modern Democracy
- Steven Johnson, The Invention of Air
- Roy Porter, The Creation of the Modern World: The Untold
Story of the British Enlightenment
To read:
- C. B. A. Behrens, Society, Government and the Enlightenment:
The Experiences of Eighteenth-Century France and Prussia.
- Becker, The Heavenly City of the 18th-Century
Philosopher
- Stephen Eric Bronner, Reclaiming the Enlightenment:
Towards a Politics of Radical Engagement
- Condorcet, Sketch of the Progress of the Human Mind
- Robert Darnton, The Forbidden Best-Sellers of
Pre-Revolutionary France
- Sarah Ellenzweig, The Fringes of Belief English Literature,
Ancient Heresy, and the Politics of Freethinking, 1660-1760
[blurb]
- Tore Frangsmyr, J. L. Heilbron and Robin E. Rider (eds.), The
Quantifying Spirit in the Eighteenth Century
[online]
- Don Garrett and Edward Barbanell (eds.), Encyclopedia of
Empiricism [Focusing on the 17th and 178th centuries]
- Peter Gay, The Party of Humanity [I hope this
pre-dates Gay's days of, pardon the phrase, flaming Freudianism]
- Jurgen Habermas, The Structural Transformation of the
Public Sphere
- Paul Ilie, The Age of Minerva
- Ulrich Im Hof, The Enlightenment
- Jonathan Israel, Enlightenment Contested
- Margaret C. Jacob
- Radical Enlightenment
- Living the Enlightenment
- Sarah Maza, Private Lives and Public Affairs: The Causes
Celebres of Prerevolutionary France [Blurb]
- James Van Horn Melton, The Rise of the Public in Enlightenment Europe
- Thomas Munck, The Enlightenment: A Comparative Social
History, 1721--1794
- Sankar Muthu, Enlightenment against Empire
[Blurb]
- Roy Porter
- English Society in the Eighteenth Century
- The Enlightenment
- Flesh in the Age of Reason: The Modern Foundations of
Body and Soul
- Mind-Forg'd Manacles: A History of Madness in England
from the Restoration to the Regency
- Giuliano Pancaldi, Volta: Science and Culture in the Age of
Enlightenment [Blurb]
- Jessica Riskin, Science in the Age of Sensibility: The
Sentimental Empiricists of the French Enlightenmen [Blurb]
- Emma Rothschild, Economic Sentiments: Adam Smith and
Condorcet
- Robert E. Schofield, The Lunar Society of Birmingham
- Mary D. Sheriff, The Exceptional Woman: Elisabeth
Vigée-Lebrun and the Cultural Politics of Art
- David Sorkin, The Religious Enlightenment:
Protestants, Jews, and Catholics from London to Vienna [blurb, introduction]
- Geoffrey V. Sutton, Science for a Polite Society: Gender,
Culture, and the Demonstration of Enlightenment [He's a good writer, but
really, how far can one trust someone writing about the Enlightenment who
thinks that "no ought from is" is a discovery of the
deconstructionists?]
- Murad Wahbah and Mona Abousenna (eds.), Averroes and the
Enlightenment
- T. H. White, The Age of Scandal
#
Frequentist Consistency of Bayesian Procedures
"Bayesian consistency" is usually taken to mean showing that, under Bayesian
updating, the posterior probability concentrates on the true model. That is,
for every (measurable) set of hypotheses containing the truth, the posterior
probability goes to 1. (In practice one shows that the posterior probability
of any set not containing the truth goes to zero.) There is a basic result
here, due to Doob, which essentially says that the Bayesian learner is
consistent, except on a set of data of prior probability zero. That
is, the Bayesian is subjectively certain they will converge on the
truth. This is not as reassuring as one might wish, and showing Bayesian
consistency under the true distribution is harder. In fact, it
usually involves assumptions under which non-Bayes procedures will
also converge. These are things like the existence of very powerful consistent
hypothesis tests (an approach favored by Ghosal, van der Vaart, et al.,
supposedly going back to Le Cam), or, inspired
by learning theory, constraints on the
effective size of the hypothesis space which are gradually relaxed as the
sample size grows (as in Barron et al.). If these assumptions do not hold, one
can construct situations in which Bayesian procedures are inconsistent.
Concentration of the posterior around the truth is only a preliminary. One
would also want to know that, say, the posterior mean converges, or even better
that the predictive distribution converges. For many finite-dimensional
problems, what's called the "Bernstein-von Mises theorem" basically says that
the posterior mean and the maximum likelihood estimate converge, so if one
works the other will too. This breaks down for infinite-dimensional problems.
(PAC-Bayesian results don't fit into this picture particularly neatly.
Essentially, they say that if you find a set of classifiers which all classify
correctly in-sample, and ask about the average out-of-sample performance, the
bounds on the latter are tighter for big sets than for small ones. This is for
the unmysterious reason that it takes a bigger coincidence for many
bad classification rules to happen to all work on the training data
than for a few bad rules to get lucky. The actual Bayesian machinery
of posterior updating doesn't really come into play.)
I believe I have contributed a Result to this area, on what happens when the
data are dependent and all the models are mis-specified, but some are more
mis-specified than others.
Query: are there any situations where Bayesian methods are
consistent but no non-Bayesian method is? (My recollection is that John
Earman, in Bayes or Bust, provides a negative answer, but I forget
how.)
Recommended:
- Andrew Barron, Mark J. Schervish and Larry Wasserman, "The
Consistency of Posterior Distributions in Nonparametric Problems", Annals
of Statistics 27 (1999): 536--561 [While I am biased
— Mark and Larry are senior faculty here — I think this is
definitely one of the best-written papers on the topic.]
- Robert H. Berk [Old but quite nice papers on the
effect of mis-specification, though with IID data assumed, and stronger
assumptions about the models than modern writers are comfortable with.]
- Taeryon Choi, R. V. Ramamoorthi, "Remarks on consistency of posterior distributions", arxiv:0805.3248
- Ronald Christensen, "Inconsistent Bayesian Estimation",
Bayesian Analysis 4 (2009): 413--416 [An extremely simple example of how inconsistency can be generated]
- Persi Diaconis and David
Freedman, "On the
Consistency of Bayes Estimates", The Annals of
Statistics 14 (1986): 1--26 [With accompanying
discussion; the latter is worth reading if only to fully savor the academic
snark in Diaconis and Freedman's reply.]
- David Freedman, "On the Bernstein-von Mises Theorem with
Infinite-Dimensional Parameters", Annals of
Statistics 27 (1999): 1119--1140 [As you know, Bob, the
Bernstein-von Mises theorem asserts that, "under the usual conditions", in the
large sample limit the distribution of the maximum likelihood estimate is
basically the same as the Bayesian posterior distribution, so you can take
credible intervals as approximate confidence intervals and vice versa. It
turns out that the usual conditions can fail drastically even for very simple
infinite-dimensional problems.]
- Subhashis Ghosal, "A review of consistency and convergence rates of
posterior distribution"
[PDF]
- Subhashis Ghosal, Jayanta K. Ghosh and R. V. Ramamoorthi,
"Consistency Issues in Bayesian Nonparametrics" [Review of the IID case, on
Ghosal's website someplace]
- Subhashis Ghosal, Jayanta K. Ghosh and Aad W. van der Vaart,
"Convergence Rates of Posterior
Distributions", Annals
of Statistics 28 (2000): 500--531
- Subhashis Ghosal and Yongqiang Tang, "Bayesian Consistency for
Markov Processes", Sankhya 68 (2006): 227--239
[This is slick, but I think the cuteness of the proof of the main theorem is
achieved at the cost of the ugliness of verifying the main conditions, as in
their example. (That may just be jealousy
speaking.) PDF]
- Subhashis Ghosal and Aad van der Vaart, "Convergence Rates of
Posterior Distributions for Non-IID
Observations", Annals of
Statistics 35 (2007): 192--223
- J. K. Ghosh and R. V. Ramamoorthi, Bayesian
Nonparametrics [Mini-review]
- B. J. K. Kleijn and A. W. van der Vaart, "Misspecification in
infinite-dimensional Bayesian
statistics", Annals of
Statistics 34 (2006): 837--877
- Antonio Lijoi, Igor Prunster and Stephen G. Walker, "Bayesian
Consistency for Stationary
Models", Econometric
Theory 23 (2007): 749--759 [Gives a Doob-style
result, that the prior probability of failing to converge is zero.]
- David A. McAllester, "Some PAC-Bayesian
Theorems", Machine
Learning 37 (1999): 355--363
- Lorraine Schwartz, "On Bayes
Procedures", Z. Wahrsch. Verw. Gebiete 4
(1965): 10--26 [The journal now known as Probability Theory and Related
Fields]
- X. Shen and Larry Wasserman, "Rates of convergence of posterior
distributions", Annals of Statistics 29 (2001): 687--714
- Stephen Walker, "New Approaches to Bayesian Consistency",
Annals of Statistics 32 (2004): 2028--2043 = math.ST/0503672 [Clever
martingale tricks.]
- Yang Xing, "Convergence rates of posterior distributions for
observations without the iid
structure", arxiv:0811.4677
- Yang Xing and Bo Ranneby, "Both necessary and sufficient conditions
for Bayesian exponential
consistency", arxiv:0812.1084
[Essentially, a unifying presentation of several existing conditions for IID
samples.]
- Tong Zhang,
"From $\epsilon$-entropy to KL-entropy: Analysis of minimum information complexity density estimation", Annals of Statistics 34 (2006): 2180--2210 = arxiv:math.ST/0702653
To read:
- Dennis D. Cox, "An Analysis of Bayesian Inference for Nonparametric
Regression", Annals of Statistics 21 (1993):
903--923
- J. L. Doob, "Application of the theory of martingales", pp. 23--27
in Colloques Internationaux du Centre National de la Recherche
Scientifique, no. 13, Centre National de la Recherche Scientifique,
Paris, 1949 [Summary
in Mathematical
Reviews by William Feller]
- Subhashis Ghosal, Jüri Lember and Aad van der Vaart, "Nonparametric Bayesian model selection and averaging", Electronic Journal of Statistics 2 (2008): 63--89
- Marcus Hutter, "Exact Non-Parametric Bayesian Inference on Infinite Trees", arxiv:0903.5342
- John Langford, "Tutorial on Practical Prediction Theory for
Classification", Journal of Machine Learning Research
6 (2005): 273--306 [For the PAC-Bayesian result]
- Lucien LeCam, "On the Speed of Convergence of Posterior
Distributions"
[PDF]
- David A. McAllester, "PAC-Bayesian Stochastic Model
Selection", Machine
Learning 51 (2003): 5--21
To write:
- CRS, "Bayesian Learning, Information Theory, and Evolutionary
Search"
#
Democracy
And science. Export from Europe. Indigenous outside Europe? (Yes: see
Muhlberg.) In tribal and especially in nomadic cultures (like the
proto-Indo-Europeans)? And non-European philosophies. Pluralism, secularism,
liberty. Representative and direct. And telecommunications. Democratic
deliberation as a mechanism for
collective cognition.
Recommended (painfully inadequate):
- Philip Age, "Supporting the Intellectual Life of a Democratic
Society," Ethics and Information Technology, 3:4
(2001): 289--298 [draft]
- Gianpaolo
Baiocchi, "The Citizens
of Porto Alegre", Boston Review March-April 2006
- Robert A. Dahl, A Preface to Economic Democracy
- Michael X. Delli Carpini, Fay Lomax Cook and Lawrence R. Jacobs,
"Public Deliberation, Discursive Participation, and Citizen Engagement: A
Review of the Empirical Literature", Annual
Review of Political Science 7 (2004): 315--344
- John Dewey, The Public and Its Problems [Mini-review]
- Jonathan Israel, A Revolution of the Mind:
Radical Enlightenment and the Intellectual Origins of Modern Democracy
[blurb]
- Jack Knight and James Johnson, "Inquiry into Democracy: What
Might a Pragmatist Make of Rational Choice Theories?", American
Journal of Political Science 43 (1999): 566--589
[JSTOR]
- Charles Lindblom
- The Intelligence of Democracy
- The Market System: What It Is, How It Works, and What
to Make of It
- Steve Muhlberg, The
World History of Democracy and Democracy
in Ancient India
- Karl Popper, The Open Society and
Its Enemies
- Jennifer Tolbert Roberts, Athens on Trial: The Antidemocratic
Tradition in Western Thought
- Amartya Sen, "Democracy and Its Global Roots" [link]
- Joseph Stiglitz
- Charles Tilly Democracy
[Mini-review]
To read:
- Daron Acemoglu and James A. Robinson, Economic Origins of
Dictatorship and Democracy
- Daniele Archibugi, The Global Commonwealth of Citizens:
Toward Cosmopolitan Democracy
[Blurb, ch. 1]
- Gianpaolo Baiocchi, Militants and Citizens: The Politics of
Participation in Porto Alegre [blurb]
- Nancy Bermeo, Ordinary People in Extraordinary Times: The
Citizenry and the Breakdown of Democracy [Argues that "democratic
collapses are caused less by changes in popular preferences than by the actions
of political elites who polarize themselves and mistake the actions of a few
for the preferences of the many." Introduction.]
- William T. Bernhard and David Leblang, Democratic Processes
and Financial Markets: Pricing Politics
[Blurb]
- James Bohman
- Public Deliberation: Pluralism, Complexity,
and Democracy [Blurb]
- Democracy across Borders: From Demos to
Demoi
[Blurb]
- Xavier de Souza Briggs, Democracy as Problem Solving:
Civic Capacity in Communities Across the Globe [blurb]
- Robert Alan Dahl
- How Democratic Is the American
Constitution?
- Polyarchy
- On Democracy
- Matthew A. Crenson and Benjamin Ginsberg, Downsizing
Democracy: How America Sidelined Its Citizens and Privatized Its Public
- Barbara Cruikshank, The Will to Empower: Democratic Citizens and Other Subjects ["Considers the question of how liberal democracies produce citizens who are capable of governing themselves, rethinking the relationship between welfare and citizenship, democracy and despotism, and subjectivity and subjection."]
- Patrick Deneen, Democratic Faith
[Blurb, intro]
- Eley, Forging Democracy
- David M. Estlund, Democratic Authority: A Philosophical
Framework [blurb, ch. 1]
- Yi Feng, Democracy, Governance, and Economic Performance:
Theory and Evidence [Blurb]
- Archon Fung
- Archon Fung and Erik Olin Wright (eds.), Deepening
Democracy: Institutional Innovations in Empowered Participatory
Governance
- John David Funge, "Journal of New Democratic Methods: An
Introduction", cs.CY/0408048
[Probably crazy, but deserves a look at some point]
- John Gastil, By Popular Demand: Revitalizing Representative
Democracy through Deliberative Elections [Free online]
- Paul Edward Gottfried, After Liberalism: Mass Democracy
in the Managerial State
- William Graebner, The Engineering of Consent: Democracy and
Authority in Twentieth-Century America
- Judith Gruber, Controlling Bureaucracies: Dilemmas in
Democratic Governance [Online]
- Thom Hartmann, What Would Jefferson Do? [Great
title, if nothing else]
- David Held, Democracy and the Global Order: From the Modern
State to Cosmopolitan Governance
- Robert Huckfeldt, Paul E. Johnson and John Sprague, Political
Disagreement: The Survival of Diverse Opinions within Communication
Networks
- Vincent Hutchings, Public Opinion and Democratic
Accountability: How Citizens Learn about Politics
[blurb]
- Charles Kurzman, Democracy Denied, 1905-1915: Intellectuals
and the Fate of Democracy
[blurb]
- Leif Lewin, Democratic Accountability: Why Choice in Politics
Is Both Possible and Necessary
[Blurb]
- Charles Lipson, Reliable Partners: How Democracies Have Made
a Separate Peace
[blurb]
- Arthur Lupia and Matthew McCubbins, The Democratic Dilemma:
Can Citizens Learn What They Need to Know?
- Arthur Lupia and John G. Matsusaka, "Direct Democracy: New
Approaches to Old Questions", Annual
Review of Political Science 7 (2004): 463--482
[Abstract: "Until recently, direct democracy scholarship was primarily
descriptive or normative. Much of it sought to highlight the processes'
shortcomings. We describe new research that examines direct democracy from a
more scientific perspective. We organize the discussion around four 'old'
questions that have long been at the heart of the direct democracy debate: Are
voters competent? What role does money play? How does direct democracy affect
policy? Does direct democracy benefit the many or the few? We find that recent
breakthroughs in theory and empirical analysis paint a comparatively positive
picture of the initiative and referendum. For example, voters are more
competent, and the relationship between money and power in direct democracy is
less nefarious, than many observers allege. More new studies show that the mere
presence of direct democracy induces sitting legislatures to govern more
effectively."]
- James Macdonald, A Free Nation Deep in Debt: The Financial
Roots of Democracy
[Blurb, intro]
- Gerry Mackie, Democracy Defended
[blurb]
- Giandomenico Majone, Evidence, Argument, and Persuasion in
the Policy Process
- Pierre Manent, A World Beyond Politics? A Defense of the
Nation-State [Blurb,
intro]
- Michael Mann, The Dark Side of Democracy: Explaining
Ethnic Cleansing
- J. Mansbridge, Beyond Adversary Democracy
- McMahon, Authority and Democracy
- T. Mendelberg, "The deliberative citizen: theory and evidence", in
M. X. Delli Carpini, L. Huddy and R. Shapiro, eds., Research in
Micropolitics: Political Decisionmaking, Deliberation and
Participation 6 (2002): 151--93 [Review of work on
social psychology of group decision-making and argumentation relevant to
democratic deliberation]
- Ahmed Mushfiq Mobarak, "Democracy, Volatility and Economic
Development", The
Review of Economics and Statistics 87 (2005):
348--361
- Diana C. Mutz, Hearing the Other Side: Deliberative versus Participatory Democracy [blurb]
- Neil Netanel, "Is the Commercial Mass Media Necessary, or Even
Desirable, for Liberal Democracy?" cs.CY/0109092
- Beth Simone
Noveck, "A
Democracy of Groups", First Monday November 2005
- Josiah Ober
- Athenian Legacies: Essays of the Politics of
Going on Together
[Blurb, ch. 1]
- Democracy and Knowledge: Innovation and Learning
in Classical Athens [Blurb, ch. 1]
- Conor O'Dwyer, Runaway State-Building: Patronage Politics and
Democratic Development
- Marina S. Ottaway, Democracy Challenged: The Rise of
Semi-Authoritarianism
- Adam Przeworski, Sustainable Democracy
[blurb]
- Adam Przeworski et al., Democracy and Development: Political
Institutions and Well-Being in the World, 1950--1990
[Blurb]
- Darius Rejali, Torture and Democracy
[blurb]
- Henry S. Richardson, Democratic Automony: Public Reasoning
about the Ends of Policy
- John E. Roemer, Democracy, Education, and Equality [blurb]
- Michael Saward (ed.), Democratic Innovation: Deliberation,
Representation, and Association
- Ian Shapiro
- Democracy's Place
- The State of Democratic Theory
- Cass R. Sunstein, Why Societies Need Dissent
- Charles Tilly, Contention and Democracy in Europe,
1650--2000 [blurb]
- Daniel Treisman, The Architecture of Government: Rethinking
Political Decentralization
[blurb]
- Nadia Urbinati, Representative Democracy: Principles and
Genealogy
[blurb]
- Frank Vibert, The Rise of the Unelected: Democracy and the
New Separation of Powers
[blurb]
- Leonard Wantchekon, "The Paradox of Warlord Democracy: A
Theoretical Investigation", American Political Science
Review 98 (2004): 17--33 [When is liberty born
from the quarrels of tyrants?]
- Mark E. Warren, Democracy and Association [Chapter 1]
- Donald A. Wittman, The Myth of Democratic Failure: Why
Political Institutions Are Efficient [Blurb]
- Sheldon S. Wolin, Democracy Incorporated: Managed Democracy
and the Specter of Inverted Totalitarianism
- David Zaret, Origins of Democratic Culture: Printing,
Petitions, andthe Public Sphere in Early-Modern England
#
Tue, 10 Nov 2009
Statistics
An application of probability, with intimate
ties to machine learning,
non-demonstrative inference and induction.
Since June 2005, I have been a (very, very junior)
professor of statistics. This
made me interested in how to teach it.
See
also: Properties
vs. principles in defining "good statistics"
Things I need to learn more about:
- Dependent data
- Statistical inference for stochastic
processes, a.k.a. time-series analysis. Signal
processing and filtering.
Spatial statistics.
- Model selection
- Gets its own notebook.
- Adapting statistical procedures to data without losing
validity
- Sequential inference, adaptive sampling, bandwidth selection.
- Model discrimination
- That is, designing experiments so as to discriminate between
competing classes of model. Adaptation to data issues here, too.
- Rates of convergence of estimators to true values
- Empirical process
theory. (Cf. some questions in
ergodic theory).
- Estimating distribution functions
- And estimating entropies, or
other functionals of distributions.
- Non-parametric methods
- Both those that are genuinely distribution-free, and
those that would more accurately be mega-parametric (even
infinitely-parametric) methods, such as neural
networks
- Regression
- Resampling methods
- Including distribution-free resampling methods, especially for
dependent data
- Sufficient statistics
- Get their own notebook.
- Decision theory
- Conventional, and the sorts with
some connection to how real decisions are
made.
- Graphical models
- Monte Carlo and other simulation
methods
- "De-Bayesing"
- Ways of taking Bayesian procedures and eliminating dependence on
priors, either by replacing them by initial point-estimates, or by showing the
prior doesn't matter, asymptotically or hopefully sooner.
See: Frequentist consistency of Bayesian
procedures.
- Information Geometry
- Partial identification
of parametric statistical models
- Causal Inference
- Computational Statistics
- Statistics of structured data
- Grammatical Inference
Recommended, non-technical:
- Francis Galton, "Statistical Inquiries into the Efficacy of
Prayer," Fortnightly Review 12 (1872): 125--135
[online]
- Larry Gonick and Woollcott Smith, The Cartoon Guide to
Statistics
- Ian Hacking, The Taming of Chance [Putting chance to
work in the 19th century]
- D. Huff, How to Lie with Statistics
- Theodore Porter, The Rise of Statistical Thinking,
1820--1900
- Constance Reid, Neyman from Life [Biography of Jerzy
Neyman, one of the makers of modern statistical theory, and, I am
happy to say, among the brighter lights of my alma mater. Reid does an
excellent job of explaining Neyman's work in terms accessible to the
general reader. There is a new edition, titled simply Neyman, but
otherwise unchanged.]
Recommended, technical, big pictures:
- M. S. Bartlett, "Inference and Stochastic Processes",
Journal of the Royal Statistical Society A 130
(1967): 457--478 [JSTOR]
- Richard A. Berk, Regression Analysis: A Constructive
Critique [My comments]
- Leo Breiman, "Statistical Modeling: The Two Cultures",
Statistical Science 16 (2001): 199--231 [very
much including the discussion by others and the reply by Breiman. Thanks to
Chris Wiggins for alerting me to this.]
- Harald Cramér, Mathematical Methods of
Statistics [Review]
- Earman, Bayes or Bust? A Critical Account of Bayesian
Confirmation Theory
- C. David Garson, Statnotes: An Online Textbook
- Peter Guttorp, Stochastic Modeling of Scientific Data
[Good introduction to using dependent data]
- Tony Lin [Prof. Dr. Lin was working on his doctorate when I was an
undergrad at Berkeley; we became friends at the I-House, if that is the word I want for
someone who offered to keep my brain alive in a jigger-glass and subject it to
random electrical shocks ("Jzzt! Jzzt!"). But despite his questionable
tastes in acquaintances, he's a damn good statistician and a model teacher.]
- Deborah Mayo, Error and the Growth of Experimental
Knowledge [Review: We Have Ways of Making
You Talk, or, Long Live Peircism-Popperism-Neyman-Pearson Thought!]
- NIST, Electronic Handbook of Statistical Methods
[Full text
free online]
- E. J. G. Pitman, Some Basic Theory for Statistical
Inference [Review: Intermediate Statistics from an Advanced Point of View]
- Jorma Rissanen, Stochastic Complexity in Statistical
Inquiry [Review: Less Is
More, or, Ecce data!]
- Mark Schervish, Theory of Statistics
- John R. Taylor, An Introduction to Error Analysis: The Study
of Uncertainties in Physical Measurements [a.k.a. "the book with the
train-wreck on the cover"]
- Edward R. Tufte
- The Visual Display of Quantitative Information
- Visual Explanations
- Larry Wasserman
- All of Statistics
- All of Nonparametric Statistics
Recommended, technical, close-ups:
- A. C. Atkinson and A. N. Donev, Optimum Experimental
Design [Review]
- F. Bacchus, H. E. Kyburg and M. Thalos, "Against
Conditionalization," Synthese 85 (1990):
475--506 [Why "Dutch book" arguments do not, in fact, mean that rational
agents must be Bayesian reasoners]
- M. S. Bartlett, "The Statistical Significance of Odd Bits of
Information", Biometrika 39 (1952): 228--237 [A
goodness-of-fit test based on fluctuations of the
entropy. JSTOR]
- David Blackwell and M. A. Girshick, Theory of Games and
Statistical Decisions
- Leo Breiman, "No Bayesians in Foxholes", IEEE Expert:
Intelligent Systems and Their Applications 12 (1997):
21--24
[PDF
reprint; comments
by Andy Gelman]
- Jochen Brocker, "A Lower Bound on Arbitrary f-Divergences in
Terms of the Total Variation" arxiv:0903.1765
- Hwan-sik Choi and Nicholas M. Kiefer, "Differential Geometry and
Bias Correction in Nonnested Hypothesis Testing"
[PDF preprint
via Kiefer]
- A. C. Davison and D. V. Hinkley, Bootstrap Methods and
their Applications
- J. Bradford DeLong and Kevin Lang, "Are All Economic Hypotheses
False?", Journal of Political Economy 100 (1992):
1257--1272 [PDF
preprint. The point is about abuses of hypothesis testing, not economic
hypotheses as such. Note that the preprint, at least, systematically swaps the
label of "type I" and "type II" errors.]
- Devorye and Lugosi, Combinatorial Methods in Density
Estimation
- Bradley Efron
- Andrew Gelman and Iain Pardoe, "Average predictive comparisons
for models with nonlinearity, interactions, and variance components",
Sociological Methodology forthcoming (2007)
[PDF
preprint,
Gelman's comments]
- Christian Gouriéroux and Alain Monfort,
Simulation-Based Econometric Methods
[Review: By
Indirection Find Direction Out]
- Trygve Haavelmo, "The Probability Approach in Econometrics",
Econometrica 12 (1944, supplement): iii--115
- Peter Hall, Jeff Racine and Qi Li, "Cross-Validation and the Estimation of Conditional Probability Densities", Journal of the American Statistical Association 99 (2004): 1015--1026 [PDF]
- Mark S. Handcock and Martina Morris, Relative Distribution
Methods in the Social Sciences
[Review: Beyond Mean and
Deviance]
- Bruce E. Hansen, "The Likelihood Ratio Test Under Nonstandard
Conditions: Testing the Markov Switching Model of GNP", Journal of
Applied Econometrics
7 (1992): S61--S82 [I very much like the approach of treating
the likelihood ratio as an empirical
process; why haven't I seen it before? (Also, the state-of-the-art in
simulating Gaussian processes must be much better now than what Hansen had in
'92, which would make this even more
practical. PDF
reprint.]
- Gary King, A Solution to the Ecological Inference Problem:
Reconstructing Individual Behavior from Aggregate Data [Review]
- Solomon W. Kullback, Information Theory and Statistics
- Michael Lavine and Mark J. Schervish, "Bayes Factors: What They
Are and What They Are Not" [PS preprint]
- J. F. Lawless and Marc Fredette, "Frequentist prediction intervals
and predictive distributions", Biometrika
92 (2005): 529--542 ["Frequentist predictive distributions
are defined as confidence distributions .... A simple pivotal-based approach
that produces prediction intervals and predictive distributions with
well-calibrated frequentist probability interpretations is introduced, and
efficient simulation methods for producing predictive distributions are
considered. Properties related to an average Kullback-Leibler measure of
goodness for predictive or estimated distributions are given."]
- Lucien Le Cam
- "Neyman and Stochastic Models"
[PDF.
Some vignettes of Neyman putting together models, and his model-building
process.]
- "Maximum Likelihood; An Introduction"
[PDF.
Not really an introduction, but rather a collection of examples of where it
just does not work, or at least doesn't work well.]
- Erich L. Lehmann, "On likelihood ratio tests", math.ST/0610835
- Bing Li, "A minimax
approach to consistency and efficiency for estimating equations," Annals
of Statistics 24 (1996): 1283--1297
[online version]
- Charles Manski, Identification for Prediction
and Decision [Mini-review]
- Deborah G. Mayo and D. R. Cox, "Frequentist statistics as a theory
of inductive
inference", math.ST/0610846
- M. B. Nevel'son and R. Z. Has'minskii, Stochastic
Approximation and Recursive Estimation
- Andrey Novikov, "Optimal sequential multiple
hypothesis tests", arxiv:0811.1297
- David Pollard
- "Asymptotics via Empirical Processes",
Statistical Science 4 (1989): 341--354
- Empirical Processes: Theory and Applications
- Jeffrey S. Racine, "Nonparametric Econometrics: A Primer",
Foundations and Trends in Econometrics
3 (2008): 1--88 [Good primer of nonparametric techniques
for regression, density estimation and hypothesis testing; next to no economic
content (except for examples). Presumes reasonable familiarity with parametric
statistics. PDF
reprint]
- C. Scott and R. Nowak, "A Neyman-Pearson Approach to Statistical
Learning", IEEE
Transactions on Information Theory 51 (2005):
3806--3819
- Spyros Skouras, "Decisionmetrics: Towards a Decision-Based
Approach to Econometrics," SFI
Working Paper 2001-11-064 [Applies far outside econometrics. If what you
really want to do is to minimize a known loss function, optimizing a
conventional accuracy measure, e.g. least squares, can be highly
counterproductive.]
- Aris Spanos
- "The Curve-Fitting Problem, Akaike-type Model
Selection, and the Error Statistical Approach" [Or: could your model
selection tell you that Kepler is better than Ptolemy? Technical report,
economics dept., Virginia Tech,
2006. PDF]
- "Where do statistical models come from? Revisiting the
problem of specification", math.ST/0610849
- Sara van de Geer, Empirical Process Theory in
M-Estimation [Finding non-asymptotic rates of convergence for common
estimators]
- Quang H. Vuong, "Likelihood Ratio Tests for Model Selection and
Non-Nested Hypotheses", Econometrica 57 (1989):
307--333
- Grace Wahba,
Spline Models for Observational Data
- Michael E. Wall, Andreas Rechtsteiner and Luis M. Rocha, "Singular
Value Decomposition and Principal Component Analysis,"
physics/0208101
- Halbert White, Estimation, Inference and Specification
Analysis [mini-review]
- Achilleas Zapranis and Apostolos-Paul Refenes, Principles of
Neural Model Identification, Selection and Adequacy, with Applications to
Financial Econometrics
To read, textbooks, reviews, etc.:
- Hirotugu Akaike, Selected Papers
- R. Harald Baayen, Analyzing Linguistic Data: A Practical Introduction to Statistics Using R [blurb]
- Ole E. Barndorff-Nielsen and David R. Cox, Inference
and Asymptotics
- Vic Barnett, Comparative Statistical Inference
- Bucklew, Large Deviation Techniques in Decision, Simulation,
and Estimation
- Michael R. Chernick, Bootstrap Methods: A Practitioner's
Guide
- A. C. Davison, Statistical Models
- Steve Fienberg, The Analysis of Cross-Classified Categorical
Data
- Peter J. Huber, Robust Statistics
- Alan Julian Izenman, Modern Multivariate Statistical
Techniques: Regression, Classification, and Manifold Learning
- Erich L. Lehmann, Elements of Large-Sample Theory
- National Institute of Standards and Technology, Engineering Statistics
Handbook [All the sections I've looked at have been quite good.]
- Yudi Pawitan, In All Likelihood: Statistical Modeling
and Inference Using Likelihood
- Aris Spanos, Probability Theory and Statistical Interference:
Econometric Modeling with Observational Data
To read, history and philosophy:
- Carolina Armenteros, "From Human Nature to Normal Humanity: Joseph de Maistre, Rousseau, and the Origins of Moral Statistics", Journal of the
History of Ideas 68 (2007): 107--130 [Abstract, text links]
- Ian Hacking, "The Theory of Probable Inference: Neyman, Peirce and
Braithwaite," in Science, Belief and Behavior: Essays in Honor of
R. B. Braithwaite ed. D. H. Mellor
- Anders Hald, A History of Parametric Statistical Inference from Bernoulli to Fisher, 1713--1935 [Blurb]
- Kendall and Plackett (eds.), Studies in the History of
Statistics and Probability
- Kyburg, Uncertain Inference
- Mayo and Hollander (eds.), Acceptable Evidence: Science and
Values in Risk Management
- Leland Gerson Neuberg, Conceptual Anomalies in Economics and
Statistics: Lessons from the Social Experiment
[blurb]
- Theodore Porter, Trust in Numbers
- Stephen M. Stigler
- The History of Statistics: The Measure of Uncertainty
before 1900
- Statistics on the Table: The History of Statistical
Concepts and Methods
- S. L. Zabell, Symmetry and Its Discontents: Essays on the
History of Inductive Probability
[blurb]
To read, research literature:
- Felix Abramovich, Yoav Benjamini, David L. Donoho and Iain
M. Johnstone, "Adapting to Unknown Sparsity by controlling the False Discovery
Rate", math.ST/0505374 [I
don't really care about sparsity, but they promise novel relations between the
FDR control and asymptotic minimaxity and complexity-penalized model
selection.]
- Sophie Achard, "A quadratic measure of
dependence", math.ST/0609259
["Asymptotic properties of a dimension-robust dependence measure are
investigated. It is related to those used in independence tests, but is
derivable, thus suitable for independent component analysis. An adjustable
kernel allows to accelerate the convergence of the estimator without affecting
the bias."]
- Shotaro Akaho, "A kernel method for canonical correlation
analysis", cs.LG/0609071
- R. A. Bailey, Design of Comparative Experiments
[Blurb]
- Ole E. Barndorff-Nielsen and David R. Cox, "Prediction and
Asymptotics", Bernoulli
2 (1996): 319--340
- Ole E. Barndorff-Nielsen, David R. Cox and Claudia Klüppelberg
(eds.), Complex Stochastic Systems
- Roger Barlow, "Asymmetric Errors", physics/0401042
- Zvika Ben-Haim and Yonina C. Eldar, "The Cramer-Rao Bound for
Sparse Estimation", arxiv:0905.4378
- Alain Berlinet, Gérard Biau and Laurent Rouvière,
"Optimal L1 Bandwidth selection for variable kernel density estimates",
Statistics and
Probability Letters 74 (2005): 116--128 ["[O]ne can
improve performance of kernel density estimates by varying the bandwidth with
the location and/or the sample data at hand. Our interest in this paper is in
the data-based selection of a variable bandwidth... an automatic selection
procedure inspired by the combinatorial tools developed in Devroye and
Lugosi... the expected L1 error of the corresponding selected estimate is up to
a given constant multiple of the best possible error plus an additive term
which tends to zero under mild assumptions"]
- Alberto Bernacchia and Simone Pigolotti, "Self-consistent
method for density estimation", arxiv:0908.3856 [Physicists discover non-parametric density estimation. (Hey,
I've been there.)]
- Patrice Bertail, Paul Doukhan and Philippe Soulier
(eds.), Dependence in Probability and Statistics ["recent
developments in ... probability and statistics for dependent data... from
Markov chain theory and weak dependence with an emphasis on ... dynamical
systems, to strong dependence in times series and random fields. ... section on
statistical estimation problems and specific
applications". Full blurb,
contents]
- Rabi Bhattacharya and Vic Patrangenaru, "Large sample theory of
intrinsic and extrinsic sample means on manifolds--II", math.ST/0507423 = Annals of
Statistics 33 (2005): 1225--1259 [I need a notebook
on statistics on manifolds, as opposed to statistical
manifolds]
- G. Biau and L. Gyorfi, "On the Asymptotic Properties of a
Nonparametric $-Test Statistic of
Homogeneity", IEEE
Transactions on Information Theory 51 (2005):
3965--3973
- David R. Bickel and Rudolf Fruehwirth, "On a Fast, Robust Estimator
of the Mode: Comparisons to Other Robust Estimators with Applications", math.ST/0505419
- Peter J. Bickel, C. A. J. Klaassen, Y. Ritov and J. A. Wellner,
Efficient and Adaptive Estimation for Semiparametric Models
- Peter J. Bickel and Bo Li, "Regularization in Statistics",
Test 15 (2006): 271--344
[PDF
reprint]
- Peter J. Bickel and Y. Ritov, "Non-Parametric Estimators Which Can
Be `Plugged-In' " UCB Stat. Tech. Rep. 602 [abstract, pdf]
- L. Birge, "A New Lower Bound for Multiple Hypothesis Testing",
IEEE Transactions on
Information Theory 51 (2005): 1611--1615
- Michael Blum, "Approximate Bayesian Computation: a non-parametric
perspective", arxiv:0904.0635
- Yu. I. Bogdanov, "Statistical Inverse Problem,"
physics/0211109 [A new
density estimator]
- Ingwer Borg and Patrick J. F. Groenen, Modern Multidimensional Scaling: Theory and Application
- A. R. Brazzale and A. C. Davison, "Accurate Parametric Inference for Small Samples", Statistical Science 23
(2008): 465--484 [Apparently, a preview for the book.]
- A. R. Brazzale, A. C. Davison and N. Reid, Applied
Asymptotics: Case Studies in Small-Sample Statistics
[blurb]
- Trevor S. Breusch, "Hypothesis Testing in Unidentified Models",
Review of Economic Studies 53 (1986): 635--651
[JSTOR]
- Florentina Bunea, Alexandre B. Tsybakov, Marten H. Wegkamp, "Spades and Mixture Models", arxiv:0901.2044
- Dizza Bursztyn and david M. Steinberg, "Comparison of designs for
computer experiments", Journal
of Statistical Planning and Inference 136 (2006):
1103--1119
- T. Tony Cai and Mark G. Low, "An adaptation theory for
nonparametric confidence intervals", Annals of
Statistics 32 (2004): 1805--1840 = math.ST/0503662
- Emmanuel Candes and Terence Tao, "Near Optimal Signal Recovery from
Random Prjoections and Universal Encoding Strategies", math.CA/0410542
- Herve Cardot, Andre Mas and Pascal Sarda, "CLT in Functional Linear
Regression Models", math.ST/0508073
- Djalil Chafai and Didier Concordet, "On the strong consistency of
approximated M-estimators", math.ST/0507102 [Sounds
cool...]
- In Hong Chang and Rahul Mukerjee, "Asymptotic results on the
frequentist mean squared error of generalized Bayes point predictors", Statistics and
Probability Letters 67 (2004): 65--71 [Note to
self: file this one under "de-Bayesing".]
- Sandra Chapman, George Rowlands and Nicholas Watkins
- "Extremum statistics: A framework for data analysis," cond-mat/0106015
- "Extremum Statistics and Signatures of Long Range
Correlations," cond-mat/0106015
- "The relationship between extremum statistics and
universal fluctuations," cond-mat/0007275
- Snigdhansu Chatterjee and Arup Bose, "Generalized bootstrap for
estimating equations", math.ST/0504515 = Annals of
Statistics 33 (2005): 414--436
- Fateh Chebana, "On the optimization of the weighted
Bickel-Rosenblatt test", Statistics and
Probability Letters 68 (2004): 333--345
- Xiaohong Chen, Markus Reiss, "On rate optimality for ill-posed
inverse problems in
econometrics", arxiv:0709.2003
[Non-parametric instrumental variables]
- N. N. Chentsov, Statistical Decision Rules and Optimal
Inference
- H. Chernoff, "A Measure of Asymptotic Efficiency for Tests
of a Hypothesis Based on the Sum of Observations," Annals of
Mathematical Statistics 23 (1952): 493--507
- Arthur Cohen and Harold B. Sackrowitz, "Decision theory results for
one-sided multiple comparison procedures", math.ST/0504505 = Annals of
Statistics 33 (2005): 126--144
- Arthur Cohen, Harold B. Sackrowitz, Minya Xu, "A new multiple
testing method in the dependent
case", arxiv:0906.3082
= Annals of Statistics 37 (2009) 1518--1544
- Daniel Commenges, Helene Jacqmin-Gadda, Cecile Proust, and Jeremie
Guedj, "A Newton-Like Algorithm for Likelihood Maximization: The
Robust-Variance Scoring
Algorithm", math.ST/0610402
- J. Conrad, O. Botner, A. Hallgren and Carlos P. de los Heros,
"Including Systematic Uncertainties in Confidence Interval Construction for
Poisson Statistics," hep-ex/0202013
- Anirban DasGupta, Asymptotic Theory of Statistics and
Probability [Blurb]
- Alexandre d'Aspremont, Onureena Banerjee, Laurent El Ghaoui,
"First-order methods for sparse covariance
selection", math.OC/0609812
["Given a sample covariance matrix, we solve a maximum likelihood problem
penalized by the number of nonzero coefficients in the inverse covariance
matrix. Our objective is to find a sparse representation of the sample data and
to highlight conditional independence relationships between the sample
variables."]
- David L. Donoho, "Estimation by epsilon-nets" (Le Cam
Lecture, 2003; find citation)
- David L. Donoho and Richard C. Liu, "The ``Automatic'' Robustness
of Minimum Distance
Functionals", Annals
of Statistics 16 (1988): 552--586
- David L. Donoho and Jared Tanner, "Observed Universality of Phase
Transitions in High-Dimensional Geometry, with Implications for Modern Data
Analysis and Signal
Processing", arxiv:0906.2530
- Mathias Drton and Seth Sullivant, "Algebraic statistical
models", math.ST/0703609
- Sam Efromovich
- Bradley Efron and Robert Tibshirani, "Using Specially Designed Exponential Families for Density Estimation", Annals of Statistics
24 (1996): 2431--2461 [JSTOR]
- Jianqing Fan and Jian Zhang, "Sieve empirical likelihood ratio
tests for nonparametric functions", Annals of
Statistics
32 (2004): 1858--1907 = math.ST/0503667
- Thomas S. Ferguson, A Course in Large Sample Theory
- Jean-David Fermanian and Bernard Salanié "A Nonparametric
Simulated Maximum Likelihood Estimation Method", Econometric
Theory 20 (2004): 701--734
- Ana K. Fermin and Carenne Ludena, "A Statistical view of Iterative
Methods for Linear Inverse Problems", math.ST/0504064
- S. E. Fienberg, P. Hersh, A. Rinaldo and Y. Zhou, "Maximum Likelihood Estimation in Latent Class Models For Contingency Table Data", arxiv:0709.3535
- Magalie Fromont and Béatrice Laurent, "Adaptive
goodness-of-fit tests in a density model", Annals of
Statistics 34 (2006): 680--720
= math.ST/0607013
- Stephane Gaiffas, "Rates of convergence for pointwise curve
estimation with a degenerate design", math.ST/0410354
- Seymour Geisser, Predictive Inference
- Andrew Gelman, Jennifer Hill and Masanao Yajima, "Why we (usually)
don't have to worry about multiple comparisons"
[PDF
preprint]
- Christopher R. Genovese and Larry Wasserman