Error: I'm afraid this is the first I've heard of a "Rss" flavoured Blosxom. Try dropping the "/+Rss" bit from the end of the URL.
Sun, 05 Feb 2012
Time Series, or Statistics for Stochastic Processes and Dynamical Systems
Rates of convergence of estimators; analogs to VC-dimension results (see
Meir's paper below). Large deviation
techniques. Prediction schemes. Are there universal schemes which do not
demand exponentially growing volumes of data?
If you have an ergodic process, then the
sample-path mean for any nice statistic you care to measure will, almost
surely, converge to the distributional mean. This is even true of trajectory
probabilities (i.e., if you want to know the probability of a certain
finite-length trajectory, simply count how often it happens). So "sit and
count" is a reliable and consistent statistical procedure. If the process
mixes sufficiently quickly, the rate of convergence might even be respectable.
But this doesn't say anything about the efficiency of such procedures, which is
surely a consideration. And what do you do for non-ergodic processes? (Take
multiple runs and hope they're telling you about different ergodic components?)
Non-stationary, even?
I need to learn more about frequency-domain approaches; despite being raised
as a physicist, I find the time domain much more natural. After all, the
frequency domain is effectively just one choice of a function basis, and there
are infinitely many others, which might in some sense be more appropriate to
the process at hand. But that's at least in part a rationalization against
having to learn more math.
LSE econometrics and its "general-to-specific" modeling procedure
is very interesting, and I think possibly even related to stuff I've done, but
I need to understand it much better than I do.
(This notebook really needs subdivision.)
See also:
Bootstrapping;
Change-Point Problems;
Control Theory;
Dynamical Systems;
Ergodic Theory;
Filtering, State Estimation and Signal Processing;
Grammatical Inference;
Information Theory;
Machine Learning, Statistical Inference and Induction;
Markov Models and Hidden Markov Models;
Neural Coding;
Power Law Distributions, 1/f Noise and
Long-Memory Processes;
Recurrence Times of Stochastic Processes (also Hitting, Waiting, and First-Passage Times)
Sequential Decisions Under
Uncertainty;
State-Space
Reconstruction;
Statistical Learning Theory with Dependent Data;
Statistics;
Stochastic Processes;
Symbolic Dynamics;
Universal Prediction Algorithms
Recommended, big picture:
- M. S. Bartlett
- An Introduction to Stochastic Processes,
with Special Reference to Methods and Applications [Classic stuff on
likelihood theory for stochastic processes]
- "Inference and Stochastic Processes", Journal
of the Royal Statistical Society A 130 (1967): 457--478
[JSTOR]
- "Chance or Chaos?", Journal of the Royal
Statistical Society A 153 (1990): 321--347 [JSTOR]
- Ishwar V. Basawa and B. L. S. Prakasa Rao, Statistical
Inference for Stochastic Processes [Assumes familiarity with normal
theoretical statistics, i.e., you have to have already been taught to care
about confidence intervals, hypothesis tests, estimation efficiency, etc. But
good, given that background.]
- Jianqing Fan and Qiwei Yao, Nonlinear Time Series:
Nonparametric and Parametric Methods [Review:
Everyone Their Own Oracle]
- Peter Guttorp, Stochastic Modeling of Scientific Data
[An introduction to statistical inference for many different kinds of dependent
data, not just time series; can be used by scientists and statisticians.]
- Holger Kantz and Thomas Schreiber, Nonlinear Time Series
Analysis [An excellent presentation of the nonlinear dynamical systems
approach, which comes out of physics]
- Judy Klein, Statistical Visions in Time: A History of
Time-Series Analysis, 1662--1938
- Robert Shumway and David Stoffer, Time Series Analysis and
Its Applications: With R Applications [A standard applied statistics
text, but better than many at creating pathways into theory, and realizing that
ARIMA is not the alpha and omega of the subject!]
- Jorma Rissanen, Stochastic Complexity in Statistical
Inquiry
[Review: Review:
Less Is More, or, Ecce data!]
- David Ruelle, Chaotic Evolution and Strange Attractors: The
Statistical Analysis of Deterministic Nonlinear Systems [From notes
prepared by Stefano Isola]
- Norbert Wiener, Extrapolation,
Interpolation and Smoothing of Stationary Time Series
Recommended, closeups:
- Markus Abel, K. H. Andersen and Guglielmo Lacorata, "Hierarchical
Markovian modeling of multi-time systems," nlin.CD/0201027
- Miika Ahdesmäki, Harri Lähdesmäki, Ron Pearson,
Heikki Huttunen, and Olli Yli-Harja, "Robust detection of periodic time series
measured from biological systems", BMC
Bioinformatics 6 (2005): 117 [Open access, yay!]
- Francesco Audrino and Peter Bühlmann, "Splines for Financial
Volatility", Journal of the Royal
Statistical Society B 71 (2009): 655--670
- Jushan Bai, "Testing parametric conditional distributions of
dynamic
models", The
Review of Economics and Statistics 85 (2003):
531--549 [Proposes "a nonparametric test for parametric conditional
distributions of dynamic models. The test is of the Kolmogorov type.... It is
asymptotically distribution-free and has nontrivial power against root-n local
alternatives..."]
- Matthew J. Beal, Zoubin Ghahramani and Carl Edward Rasmussen, "The
Infinite Hidden Markov Model", in NIPS 14 [Link]
- Patrick Billingsley, Statistical Inference for Markov
Processes [Discrete-time and cadlag processes only]
- Denis Bosq, Nonparametric Statistics for Stochastic
Processes
- Denis Bosq and Delphine Blanke, Inference and Prediction in
Large Dimensions [Mini-review]
- David Brillinger
- "Remarks concerning graphical models for time series and
point processes," Revista de Econometria 16
(1996): 1--23
- "Second-order moments and mutual information in the
analysis of time series and point processes," Proceedings of the
Conference Statistics 2001 Canada [online]
- "Does anyone know when the correlation coefficient is
useful?: A study of the times of extreme river flows,"
Technometrics 43 (2001): 266--273 [JSTOR]
- Prabir Burman, Edmond Chow and Deborah Nolan, "A Cross-Validatory
Method for Dependent Data", Biometrika 81
(1994): 351--358 [JSTOR]
- S. Caires and J. A. Ferreira, "On the Non-parametric Prediction of
Conditionally Stationary Sequences", Statistical Inference
for Stochastic Processes 8 (2005): 151--184
- Luca Capriotti
- "A Closed-Form Approximation of Likelihood
Functions for Discretely Sampled Diffusions: the Exponent Expansion",
physics/0703180
- "The Exponent Expansion: An Effective Approximation of
Transition Probabilities of Diffusion Processes and Pricing Kernels of
Financial
Derivatives", physics/0602107
= International Journal of Theoretical and Applied
Finance 9 (2006): 1179--1199
- Tianjiao Chu and Clark Glymour, "Search for Additive Nonlinear Time Series Causal Models", Journal of Machine Learning Research 9 (2008): 967--991
- George Cybenko and Valentino Crespi, "Learning Hidden Markov Models Using
Nonnegative Matrix Factorization", IEEE
Transactions on Information Theory 57 (2011): 3963--3970, arxiv:0809.4086
[Though it contains an error, at least in the preprint version, about the
capacities of our CSSR algorithm --- we can get model structures right with
much less data than they think, though we presented examples using more data
than was strictly needed.]
- R. Dahlhaus, "Fitting Time Series Models to Nonstationary
Processes",
Annals of Statistics 25 (1997): 1--37
- Jérôme Dedecker, Paul Doukhan, Gabriel Lang,
José Rafael León R., Sana Louhichi and Clémentine Prieur, Weak Dependence: With Examples and Applications [Mini-review]
- David Degras, "Nonparametric inference of a trend using functional data", arxiv:0812.2749
- Piet de Jong and Jeremy Penzer, "ARMA models in state space
form", Statistics
and Probability Letters 70 (2004): 119--125
[preprint]
- Piet De Jong, "A Cross-Validation Filter for Time Series
Models", Biometrika 75 (1988): 594--600
[JSTOR]
- Victor H. de la Pena, Rustam Ibragimov, and Shaturgun Sharakhmetov,
"Characterizations of joint distributions, copulas, information, dependence and
decoupling, with applications to time
series", math.ST/0611166
- Andrew M. Fraser, Hidden Markov Models and Dynamical
Systems [Review: Statistics of
Moving Shadows]
- Neil Gershenfeld, B. Schoner and E. Metois, "Cluster-Weighted
Modelling for Time-Series Analysis," Nature 397
(1999): 329--332 [Also described in Gershenfeld's incredible Nature of
Mathematical Modeling]
- Gershenfeld and Weigend (eds.), Time Series Prediction:
Forecasting the Future and Understanding the Past
- Silvia Goncalves and Halbert White, "Maximum likelihood and the
bootstrap for nonlinear dynamic models",
Journal of
Econometrics 119 (2004): 199--219
- Christian Gouriéroux and Alain Monfort,
Simulation-Based Econometric Methods
[Review: By
Indirection Find Direction Out]
- Kevin D. Hoover and Stephen J. Perez, "Data-Mining Reconsidered:
Encompassing and the General-to-Specific Approach to Specification Search,"
Econometrics Journal 2 (1999): 167--191
- Aapo Hyvärinen, Kun Zhang, Shohei Shimizu, Patrik O. Hoyer, "Estimation of a Structural Vector Autoregression Model Using Non-Gaussianity", Journal of Machine Learning Research 11
(2010): 1709--1731
- Marc Joannides and Francois Le Gland, "Small Noise Asymptotics of
the Bayesian Estimator in Nonidentifiable Models", Statistical Inference
for Stochastic Processes 5 (2002): 95--130
- M. L. Kleptsyna, A. Le Breton and M.-C. Roubaud, "Parameter
Estimation and Optimal Filtering for Fractional Type Stochastic
Systems", Statistical Inference for Stochastic Processes
3 (2000): 173--182
- Rudolf Kulhavy, Recursive Nonlinear Estimation: A Geometric
Approach [Includes, explicitly, estimation in time-series systems]
- Guglielmo Lacorata, Ruben A. Pasmanter and Angelo Vulpiani,
"Markov-chain approach to a process with long-time memory," nlin.CD/0110010 [A special case
of a more general result encompassed in my paper with Cris Moore]
- S. N. Lahiri, Resampling Methods for Dependent Data
[Mini-review]
- R. Dean Malmgren, Jake M. Hofman, Luis A. N. Amaral, Duncan J. Watts, "Characterizing Individual Communication Patterns", arxiv:0905.0106
- Ron Meir, "Nonparametric Time Series Prediction Through Adaptive
Model Selection," Machine Learning 39 (2000):
5--34 [PDF.
Extending the "structural risk minimization" framework due to Vapnik to time
series. Unfortunately Meir's approach demands knowledge of the mixing rate of
the process, which we don't really know how to estimate, but this is a very
encouraging first step.]
- Gusztáv Morvai, Sidney J. Yakowitz and Paul Algoet, "Weakly
Convergent Nonparametric Forecasting of Stationary Time Series," IEEE
Trans. Info. Theory 43 (1997): 483--498
- Martin Nilsson, "Generalized Singular Spectrum Time Series
Analysis," physics/0205094
- Andrey Novikov, "Optimal sequential multiple
hypothesis tests",arxiv:0811.1297
- Maxim Raginsky, Roummel F. Marcia, Jorge Silva and Rebecca M.
Willett, "Sequential Probability Assignment via Online Convex Programming
Using Exponential Families" [ISIT 2009; PDF]
- James Ramsay, Giles Hooker, David Campbell and Jiguo Cao,
"Parameter Estimation for Differential Equations: A Generalized Smoothing
Approach", Journal of the Royal Statistical Society forthcoming
(2007) [PDF
preprint]
- P. A. Robinson, "Interpretation of scaling properties of
electroencephalographic fluctuations via spectral analysis and underlying
physiology," Physical
Review E 67 (2003): 032902 [A polite but
devastating demonstration that "detrended fluctuation analysis", per Gene
Stanley and co., is an obfuscated way of looking at the power spectrum.]
- George G. Roussas
- Contiguity of Probability Measures: Some Applications
in Statistics [Asymptotic theory of approximation, estimation and
testing, for discrete-time Markov processes on fairly general
state-spaces. Mini-review]
- "Asymptotic distribution of the log-likelihood
function for stochastic processes," Zeitschrift für
Wahrscheinlickkeitstheorie und verwandte Gebiete 47
(1979): 31--46 [Elegant solution of a basic problem for a pretty broad class of
processes; extends work in his book.]
- Daniil Ryabko, "A criterion for hypothesis testing for
stationary processes", arxiv:0905.4937
- Daniil Ryabko and Boris Ryabko, "Testing Statistical Hypotheses
About Ergodic
Processes", arxiv:0804.0510
[Appears to be the same as their "Nonparametric Statistical Inference for
Ergodic
Processes", IEEE
Transactions on Information Theory 56 (2010):
1430--1435
- Nobusumi Sagara, "Nonparametric maximum-likelihood estimation of
probability measures: existence and consistency", Journal of
Statistical Planning and Inference 133 (2005):
249--271 ["This paper formulates the nonparametric maximum-likelihood
estimation of probability measures and generalizes the consistency result on
the maximum-likelihood estimator (MLE). We drop the independent assumption on
the underlying stochastic process and replace it with the assumption that the
stochastic process is stationary and ergodic. The present proof employs
Birkhoff's ergodic theorem and the martingale convergence theorem. The main
result is applied to the parametric and nonparametric maximum-likelihood
estimation of density functions." Very cool.]
- Christopher C. Strelioff and Alfred W. Hübler, "Medium-Term
Prediction of Chaos", Physical Review
Letters 96 (2006): 044101
- Masanobu Taniguchi and Yoshihide Kakizawa, Asymptotic Theory
of Statistical Inference for Time Series [Finally, a proper statistical
treatment which doesn't confine itself to expletive-deleted ARMA processes.
Neat information geometry too.]
- Halbert White, Estimation, Inference and Specification
Analysis [Review]
- Andrew Gordon Wilson and Zoubin Ghahramani, "Copula Processes",
arxiv:1006.1350 [Theoretically
interesting, though on the real data example it does at most marginally better
than the off-the-shelf GARCH model, at considerably higher computational cost]
- Wei Biao Wu, "Nonlinear system theory: Another look at
dependence", Proceedings of the
National Academy of Sciences 102 (2005):
14150--14154 ["we introduce previously undescribed dependence measures for
stationary causal processes. Our physical and predictive dependence measures
quantify the degree of dependence of outputs on inputs in physical systems. The
proposed dependence measures provide a natural framework for a limit theory for
stationary processes. In particular, under conditions with quite simple forms,
we present limit theorems for partial sums, empirical processes, and kernel
density estimates. The conditions are mild and easily verifiable because they
are directly related to the data-generating mechanisms."]
Things modesty forbids me to recommend:
- CRS, Causal Architecture, Complexity and
Self-Organization in Time Series and Cellular Automata [Ph.D.
thesis, UW-Madison, 2001]
- CRS, Abigail Z. Jacobs, Kristina Lisa Klinkner and Aaron Clauset, "Adapting to Non-stationarity with Growing Expert Ensembles", arxiv:1103.0949
- CRS and Kristina
Lisa Klinkner, "Blind Construction of Optimal Nonlinear Predictors for
Discrete Sequences", cs.LG/0406011 = pp. 504--511
of Uncertainty in Artificial Intelligence: Proceedings of the
Twentieth Conference (UAI 2004)
To read:
- Luis A. Aguirre, Ubiratan S. Freitas, Christophe Letellier and Jean
Maquet, "Structure-selection techniques applied to continuous-time nonlinear
models," Physica D 158 (2001): 1--18
- Eduardo G. Altmann and Holger Kantz, "Recurrence time analysis,
long-term correlations, and extreme events", physics/0503056
- Pierre Alquier and Olivier Wintenberger, "Model selection and randomization for weakly dependent time series forecasting", arxiv:0902.2924
- Shun-ichi Amari, "Estimating Functions of Independent Component
Analysis for Temporally Correlated Signals," Neural Computation
12 (2000): 2083--2107
- Heather M. Anderson, "Choosing Lag Lengths in Nonlinear Dynamic
Models," Monash Econometric Working Paper [online]
- Claudia Angelini, Daniela Cavab, Gabriel Katul, and Brani
Vidakovic, "Resampling hierarchical processes in the wavelet domain: A case
study using atmospheric turbulence", Physica
D 207 (2005): 24--40
- J. A. D. Aston, "Modeling macroeconomic time series via heavy
tailed distributions", math.ST/0702844
- Alexander Aue, Siegfried Hörmann, Lajos Horváth
and Matthew Reimherr, "Break detection in the covariance structure of multivariate time series models", Annals of Statistics 37
(2009): 4046--4087
- Ishwar V. Basawa and D. J. Scott, Asymptotic Optimal
Inference for Non-ergodic Models
- Nathaniel Beck and Jonathan N. Katz
- Alain Berlinet and Gérar Biau, "Minimax Bounds in
Nonparametric Estimation of Multidimensional Deterministic Dynamical
Systems", Statistical Inference for Stochastic Processes
4 (2001): 229--248 ["We consider the problem of estimating
a multidimensional discrete deterministic dynamical system from the first n+1
observations. We exhibit the optimal rate function ... the near neighbor
estimator achives this optimal rate.... optimal rate function is defined from
multidimensonal spacings which are edge lengths of simplicies associated with a
triangulation of the Voronoi cells built from the observations." Sounds very
cool!]
- Alain Berlinet and Christian Francq, "On the Identifiability
of minimal VARMA representations", Statistical Inference for Stochastic
Processes 1 (1998): 1--15
- Patrice Bertail, Paul Doukhan and Philippe Soulier
(eds.), Dependence in Probability and Statistics ["recent
developments in the field of probability and statistics for dependent
data... from Markov chain theory and weak dependence with an emphasis on some
recent developments on dynamical systems, to strong dependence in times series
and random fields. ... section on statistical estimation problems and specific
applications". Full blurb,
contents]
- D. Blanke, D. Bosq and D. Guegan, "Modelization and Nonparametric
Estimation for Dynamical Systems with Noise", Statistical Inference for
Stochastic Processes 6 (2003): 267--290
- Tadeusz Bednarski, "Fréchet differentiability in statistical inference for time series", Statistical Methods and Applications 19 (2010): 517--528
- Noelle Bru, Laurence Despres and Christian Paroissin, "A comparison
of statistical models for short categorical or ordinal time series with
applications in
ecology", math.ST/0702706
- Prabir Burman and Robert H. Shumway, "Estimation of trend in state-space models: Asymptotic mean square error and rate of convergence", arxiv:0911.3469 = Annals of Statistics 37
(2009): 3715--3742
- Alexandre X. Carvalho and Martin A. Tanner, "Mixtures-of-Experts of
Autoregressive Time Series: Asymptotic Normality and Model
Specification", IEEE Transactions on Neural
Networks 16 (2005): 39--56
- Carlos M. Carvalho, Michael S. Johannes, Hedibert F. Lopes, and Nicholas G. Polson, "Particle Learning and Smoothing", Statistical Science 25 (2010): 88--106
- Kung-Sik Chan and Howell H. Tong, Chaos: A Statistical
Perspective
- J.-R. Chazottes, P. Collet and B. Schmitt, "Statistical
Consequences of Devroye Inequality for Processes. Applications to a Class of
Non-Uniformly Hyperbolic Dynamical Systems", math.DS/0412167
- J.-R. Chazottes, E. Floriani and R. Lima, "Relative Entropy and
Identification of Gibbs Measures in Dynamical Systems," Journal of
Statistical Physics 90 (1998): 697--725
- Zhuo Chen and Yuhong Yan, "Time Series Models for Forecasting:
Testing or Combining?", Studies in Nonlinear Dynamics and
Econometrics 11:1 (2007): 3
- Zhiyi Chi, "Large deviations for template matching between point
processes", Annals of Applied
Probability 15 (2005): 153--174 = math.PR/0503463
- P. Cizek, W. Hardle, V. Spokoiny, "Adaptive pointwise estimation in time-inhomogeneous conditional heteroscedasticity models", arxiv:0903.4620 [I'm more interested in the idea of adaptively estimating non-stationary time series here than the finance application...]
- Michael P. Clements and David F. Hendry (eds.), Companion
to Economic Forecasting
- Todd P. Coleman and Sridevi S. Sarma, "A Computationally Efficient Method for Nonparametric Modeling of Neural Spiking Activity with Point Processes", Neural Computation 22 (2010): 2002--2030
- P. Collet, S. Martinez and B. Schmitt, "Asymptotic distribution of
tests for expanding maps of the interval", Ergodic Theory and Dynamical
Systems 24 (2004): 707--722 [Kolmogorov-Smironov-type
results for the empirical distribution under the invariant measure of a
dynamical system]
- Daniel Commenges and Anne Gegout-Petit, "Likelihood inference for
incompletely observed stochastic processes: ignorability conditions", math.ST/0507151 ["We define a
general coarsening model for stochastic processes. We decribe incomplete data
by means of sigma-fields and we give conditions of ignorability for likelihood
inference."]
- Colleen D. Cutler and Daniel T. Kaplan (eds.), Nonlinear
Dynamics and Time Series: Building a Bridge between the Natural and Statistical
Sciences
- Sophie Dabo-Niang, Ali Laksaci, "Conditional mode regression: Application to functional time series prediction", arxiv:0812.4882
- Sophie Dabo-Niang, Christian Francq and Jean-Michel Zakoïan,
"Combining Nonparametric and Optimal Linear Time Series Predictions",
Journal of the American Statistical
Association 104 (2010): 1554--1565
- Serguei Dachian, Yury A. Kutoyants
- "Hypotheses Testing: Poisson Versus Self-exciting", arxiv:0903.4636
= Scandinavian Journal of Statistics 33 (2006): 391
- "On the Goodness-of-Fit Tests for Some Continuous Time
Processes", arxiv:0903.4642 ["We
present a review of several results concerning the construction of the
Cramer-von Mises and Kolmogorov-Smirnov type goodness-of-fit tests for
continuous time processes. As the models we take a stochastic differential
equation with small noise, ergodic diffusion process, Poisson process and
self-exciting point processes"]
- Rainer Dahlhaus and Wolfgang Polonik, "Empirical spectral processes for locally stationary time series", Bernoulli 15
(2009): 1--39, arxiv:902.1448
- Arnak Dalalyan and Markus Reiss, "Asymptotic statistical
equivalence for ergodic diffusions: the multidimensional case", math.ST/0505053
- Youri Davydov, "Remarks on Estimation Problem for Stationary
Processes in Continuous Time", Statistical Inference for Stochastic
Processes 4 (2001): 1--15
- A. De Gregorio and S. M. Iacus, "Adaptive Lasso-type estimation
for ergodic diffusion processes", arxiv:1002.1312
- D. Dehay and Yu. A. Kutoyants, "On confidence intervals for
distribution function and density of ergodic diffusion process", Journal of
Statistical Planning and Inference 124 (2004):
63--73
- Miguel A. Delgado, Javier Hidalgo and Carlos Velasco, "Distribution
free goodness-of-fit tests for linear
processes", math.ST/0603043
= Annals of
Statistics 33 (2005): 2568--2609 [i.e.,
goodness-of-fit for the autocorrelation function]
- Holger Dette, Philip Preuss, and Mathias Vetter, "A Measure of Stationarity in Locally Stationary Processes With Applications to Testing", Journal of the American Statistical Association
106 (2011): 1113--1124
- Vanessa Didelez, "Graphical models for marked point processes based on local independence", arxiv:0710.5874
- Thomas G. Dietterich,
"Machine Learning for Sequential Data"
[PDF.
Thanks to Gustavo Lacerda for a pointer.]
- Dmitry Dolgopyat, Vadim Kaloshin, Leonid Koralov, "Sample path
properties of the stochastic flows," math.PR/0111011
- Randal Douc, Eric Moulines and Tobias Ryden, "Asymptotic properties
of the maximum likelihood estimator in autoregressive models with Markov
regime", Annals
of Statistics 32 (2004): 2254--2304 = math.ST/0503681
- Holger Drees, "Some aspects of extreme value theory under serial dependence", arxiv:0710.5879
- Pierre Duchesne, "On Testing for Serial Correlation with a
Wavelet-Based Spectral Density Estimator in Multivariate Time Series", Econometric
Theory 22 (2006): 633--676
- K. Dzhaparidze, Parameter Estimation and Hypothesis Testing
in Spectral Analysis of Stationary Time Series
- Michael Eichler, "Graphical modelling of multivariate time
series", math.ST/0610654
- Enrique Figueroa-Lopez and Christian Houdre, "Nonparametric
estimation for Levy processes with a view towards mathematical finance", math.ST/0412351
- D. Florens and H. Pham, "Large Deviations in Estimation of an
Ornstein-Uhlenbeck Model," Journal of Applied Probability
36 (1999): 60--77
- Christian Francq and Jean-Michel Zakoian, "Bartlett's formula for
a general class of nonlinear processes", Journal of Time Series Analysis 30 (2009): 449--465
- Marc K. Francke, Siem Jan Koopman, Aart F. De Vos, "Likelihood functions for state space models with diffuse initial conditions",
10.1111/j.1467-9892.2010.00673.xJournal of Time Series Analysis 31 (2010): 407--414
- Jurgen Franke, Jens-Peter Kreiss and Enno Mammen,
"Bootstrap of Kernel Smoothing in Nonlinear Time Series",
Bernoulli 8 (2002): 1--37
- T. D. Frank, "Delay Fokker-Planck equations, perturbation theory,
and data analysis for nonlinear stochastic systems with time delays",
Physical Review
E 71 (2005): 031106
- Philip Hans Franses and Dick Van Dijk, Non-Linear Time Series
Models in Empirical Finance
- Roland Fried and Vanessa Didelez, "Latent variable analysis and
partial correlation graphs for multivariate time series", Statistics and
Probability Letters 73 (2005): 287--296
- Cheng-Der Fuh, "Efficient likelihood estimation in state space models", Annals of Statistics 34 (2006): 2026--2068, arxiv:0611376
- Irene Gannaz and Olivier Wintenberger, "Adaptative density
estimation with dependent observations", math.ST/0510311 [Using wavelets
to estimate the invariant density of weakly-dependent processes, assumes
geometric ergodicity but not stationarity]
- Jiti Gao, Maxwell King, Zudi Lu and Dag Tjostheim, "Specification
testing in nonlinear and nonstationary time series autoregression",
Annals of Statistics 37 (2009): 3893--3928, arxiv:0911.3736
- Basillis Gidas and Alejandro Murua, "Optimal transformations for
prediction in continuous-time stochastic processes: finite past and future", Probability Theory and
Related Fields 131 (2005): 479--492
- German Gomez-Herrero, Wei Wu, Kalle Rutanen, Miguel C. Soriano, Gordon Pipa, Raul Vicente, "Assessing coupling dynamics from an ensemble of time series", arxiv:1008.0539
- Georg A. Gottwald and Ian Melbourne, "Testing for chaos in
deterministic systems with
noise", Physica
D 212 (2005): 100--110
- Janez Gradisek, Silke Siegert, Rudolf Friedrich and Igor Grabec,
"Analysis of time series from stochastic processes," Physical Review
E 62 (2000): 3146--3155
- Grassberger and Nadal (eds.), From Statistical Physics to
Statistical Inference and Back
- Robert L. Grossman and Richard G. Larson, "State Space
Realization Theorems for Data Mining", arxiv:0901.2735
- Diego Guarin, Alvaro Orozco, Edilson Delgado, "A new surrogate data method for nonstationary time series", arxiv:1008.1804
- David Gubbins, Time Series and Inverse Theory for
Geophysicists
- Laszlo Gyorfi et al., Nonparametric Curve Estimation from
Time Series
- Peter Hall, Soumendra Nath Lahiri and Jorg Polzehl,
"On Bandwidth Choice in Nonparametric Regression with Both Short- and
Long-Range Dependent Errors", Annals of Statistics
23 (1995): 1921--1936
- Niels Richard Hansen, "Penalized maximum likelihood estimation for generalized linear point processes", arxiv:1003.0848
- Wolfgang Hardle, Helmut Lutkepohl, Rong Chen, "A Review of Nonparametric Time Series Analysis", International Statistical Review
65 (1997): 49--72 [JSTOR]
- Jeffrey D. Hart, "Automated Kernel Smoothing of Dependent
Data by using Time Series Cross-Validation", Journal of the
Royal Statistical Society B 56 (1994): 529--542
[JSTOR]
- David Harte, "PtProcess: An R Package for Modelling Marked Point
Processes Indexed by
Time", Journal of Statistical
Software 35 (2010): 8
- Andrew Harvey et al (eds.), State Space and Unobserved
Component Models: Theory and Applications
- Stefan Haufe, Guido Nolte, Klaus-Robert Mueller and Nicole Kraemer,
"Sparse Causal Discovery in Multivariate Time
Series", arxiv:0901.1234 [I am not
altogether happy with defining "causes" as "has a non-zero coefficient in a
vector autoregression"...]
- David Hendry, Econometrics: Alchemy or Science?
[Review by
Bruce Hansen]
- David F. Hendry and Bent Nielsen, Econometric Modeling: A
Likelihood Approach
[Blurb, preface,
ch.1 ]
- Nadine Hilgert, Vivien Rossi, Jean-Pierre Vila, Verene Wagner,
"Identification, Estimation, and Control of Uncertain Dynamic Systems: A
Nonparametric
Approach", Communications
in Statistics: Theory and Methods 36 (2007):
2509--2525
- Junichi Hirukawa and Masanobu Taniguchi, "LAN theorem for
non-Gaussian locally stationary processes and its applications", Journal
of Statistical Planning and Inference 136 (2006):
640--688
- Scott H. Holan, Robert Lund, and Ginger Davis, "The ARMA alphabet soup: A tour of ARMA model variants", Statistics Surveys 4 (2010): 232--274
- Jinh Hu, Wen-wen Tung, Jianbo Gao and Yinhe Cao, "Reliability of
the 0-1 test for
chaos", Physical
Review E 72 (2005): 056207 [On Gottwald
and Melbourne]
- Jianhua Z. Huang and Lijian Yang, "Identification
of Non-Linear Additive Autoregressive Models", Journal of
the Royal Statistical Society B 66 (2004): 463--477 [JSTOR. Proves
consistency under the assumption that the data-generating process
is strictly stationary and strongly mixing.]
- Stefano M. Iacus
- "Statistical analysis of stochastic resonance
with ergodic diffusion noise," math.PR/0111153
- Simulation and Inference for Stochastic Differential
Equations
- "On Lasso-type estimation for dynamical systems with
small noise", arxiv:0912.5078
- Ching-Kang Ing, "Accumulated prediction errors, information criteria and optimal forecasting for autoregressive time series", Annals of
Statistics 35 (2007): 1238--1277, arxiv:0708.2373
- Atsushi Inoue and Lutz Kilian, "In-sample or out-of-sample tests of
predictability: which one should we use?", European Central Bank Working Paper
[PDF]
- Akihiko Inoue and Yukio Kasahara, "Explicit representation of
finite predictor coefficients and its applications", math.ST/0405051 = Annals of
Statistics 34 (2006): 973--993
- D. A. Ioannides and D. P. Papanastassiou, "Estimating the
distribution function of a stationary process involving measurement
errors", Statistical Inference for Stochastic
Processes 4 (2001): 181--198
- E. L. Ionides, C. Breto and A. A. King, "Inference for nonlinear
dynamical
systems", Proceedings
of the National Academy of Sciences (USA) 103 (2006):
18438--18443
- S. Ishii and M.-A. Sato, "Reconstruction of chaotic dynamics by
on-line EM algorithm," Neural Networks 14
(2001): 1239--1256
- Christine Jacob, "Conditional least squares estimation in nonstationary nonlinear stochastic regression models", Annals of Statistics 38 (2010): 566--597
- Òscar Jordà, "Simultaneous Confidence Regions for
Impulse Responses", The Review of Economics
and Statistics 91 (2009): 629--647
- C. T. Jose, B. Ismail, S. Jayasekhar, "Trend, Growth Rate, and Change Point Analysis: A Data Driven Approach", Communications in Statistics: Simulation and Computation 37 (2008): 498--506
- Joseph Tadjuidje Kamgaing, Hernando Ombao and Richard A. Davis,
"Autoregressive processes with data-driven regime switching",
Journal of Time Series Analysis 30
(2009): 505--533
- Matthew B. Kennel, "Testing time symmetry in time series
using data compression dictionaries", Physical Review E
69 (2004): 056208
- Tae Yoon Kim and Sangyeol Lee, "Kernel density estimator for strong
mixing processes", Journal of
Statistical Planning and Inference 133 (2005):
273--284
- Jon Kleinberg, "Bursty and Hierarchical Structure in Streams"
[PDF]
- D. Kleinhans, R. Friedrich, "Maximum Likelihood Estimation of Drift
and Diffusion
Functions", physics/0611102
- D. Kleinhans, R. Friedrich, A. Nawroth and J. Peinke, "An iterative
procedure for the estimation of drift and diffusion coefficients of Langevin
processes", Physics Letters
A 346 (2005): 42--46
= physics/0502152 ["The
analysis is based on an iterative procedure minimizing the Kullback-Leibler
distance between measured and estimated two time joint probability
distributions of the process."]
- M. L. Kleptsyna and A. Le Breton, "Statistical Analysis of the
Fractional Ornstein-Uhlenbeck Type Process", Statistical Inference for
Stochastic Processes 5 (2002): 229--248
- Clemens Kreutz, Andreas Raue, Jens Timmer, "Likelihood based observability analysis and confidence intervals for predictions of dynamic models", arxiv:1107.0013
- D. Kugiumtzis, "Statically Transformed Autoregressive Process and
Surrogate Data Test for Nonlinearity," nlin.CD/0110025
- Uwe Küchler and Michael Sørensen, Exponential
Families of Stochastic Processes
- Hans R. Künsch, "State Space and Hidden Markov Models",
pp. 109--173 in Ole E. Barndorff-Nielsen, David R. Cox and Claudia
Klüppelberg (eds.), Complex Stochastic Systems
- Y. A. Kutoyants
- Statistical Inference for Ergodic Diffusion
Processes
- "On the Goodness-of-Fit Testing for Ergodic Diffusion Processes", arxiv:0903.4550
- "Goodness-of-Fit Tests for Perturbed Dynamical Systems", arxiv:0903.4612
- "On Properties of Estimators in non Regular Situations for Poisson Processes", arxiv:0903.4613
- B. Lacaze, "Errorless uniform sampling of complex stationary
processes," Signal
Processing 83 (2003): 913--917
- Guy Lebanon, Yang Zhao, and Yanjun Zhao, "Modeling temporal text streams using the local multinomial model", Electronic Journal of Statistics 4 (2010): 566--584
- J. Lember and A. Koloydenko, "Adjusted Viterbi training", math.ST/0406237
- Daniel Lemire, "A Better Alternative to Piecewise Linear Time
Series Segmentation", cs.DB/0605103
- Matthieu Lerasle, "Adaptive density estimation for stationary
processes", Mathematical Methods of Statistics 18
(2009): 59--83, arxiv:0909.0999
- J. K. Lindsey, Statistical Analysis of Stochastic Processes
in Time [old
draft in Postscript; data and R code]
- Yu. N. Lin'kov, Asymptotic Statistical Methods for Stochastic
Processes
[Restricted to semi-martingales. blurb]
- Weidong Liu and Wei Biao Wu, "Simultaneous nonparametric inference
of time
series", Annals
of Statistics
38 (2010): 2388--2421 ["kernel estimation of marginal
densities and regression functions of stationary processes. It is shown that
for a wide class of time series, with proper centering and scaling, the maximum
deviations of kernel density and regression estimates are asymptotically
Gumbel. Our results substantially generalize earlier ones which were obtained
under independence or beta mixing assumptions. The asymptotic results can be
applied to assess patterns of marginal densities or regression functions via
the construction of simultaneous confidence bands for which one can perform
goodness-of-fit tests"]
- E. Locherbach, "Likelihood Ratio Processes for Markovian Particle
Systems with Killing and Jumps", Statistical Inference for Stochastic
Processes 5 (2002): 153--177
- Wei Lu, Namrata Vaswani, "The Wiener-Khinchin Theorem for Non-wide
Sense stationary Random Processes" ["under certain assumptions, the power
spectral density (PSD) of any random process is equal to the Fourier transform
of the time-averaged autocorrelation function"]
- Zudi Lu, Dag Johan Steinskog, Dag Tjostheim and Qiwei Yao,
"Adaptively Varying-Coefficient Spatiotemporal Models", Journal of the Royal Statistical Society B 71 (2009): 859--880 [PDF preprint]
- Xiaodong Luo, Tomomichi Nakamura and Michael Small, "Surrogate data
method applied to nonlinear time
series", nlin.CD/0603004
- Xiaodong Luo, Jie Zhang, Junfeng Sun, Michael Small, Irene Moroz,
"Asymptotically pivotal statistic for surrogate testing with extended
hypothesis", nlin.CD/0701008
- Xiaodong Luo, Jie Zhang and Michael Small, "Exact nonparametric
inference for detection of nonlinear determinism", nlin.CD/0507049 [More exactly,
this is an exact test for linear stochasticity --- rejecting the null indicates
either nonlinearity or determinism, or both.]
- Enno Mammen and Swagata Nandi, "Change of the nature of a test when
surrogate data are applied", Physical Review
E 70 (2004): 016121
- Heikki Mannila and Dmitry Rusakov, "Decomposition
of Event Sequences into Independent Components" [short
and long
versions in PS]
- T. K. March, S. C. Chapman and R. O. Dendy, "Recurrence plot
statistics and the effect of embedding", physics/0502042
- Inés P. Mariño, Joaquín Míguez, and Riccardo Meucci, "Monte Carlo method for adaptively estimating the unknown parameters and the dynamic state of chaotic systems", Physical Review E 79 (2009): 056218
- Pierre-Francois Marteau, "Time Warp Edit Distances with Stiffness
Adjustment for Time Series
Matching", cs.IR/0703033
- Norbert Marwan and Jurgen Kurths, "Nonlinear analysis of bivariate
data with cross recurrence plots," physics/0201061
- Norbert Marwan, M. Thiel, N. R. Nowaczyk, "Cross Recurrence Plot
Based Synchronization of Time Series," physics/0201062
- Norbert Marwan, N. Wessel, U. Meyerfeldt, A. Schirdewan, J. Kurths,
"Recurrence Plot Based Measures of Complexity and its Application to Heart
Rate Variability Data," physics/0201064
- Ikuo Matsuba, Hiroshi Takahashi and shinya Wakasa, "Stochastically Equivalent Dynamical System Approach to Nonlinear Deterministic Prediction",
International
Journal of Bifurcation and Chaos
16 (2006): 2721--2728 [I can't tell, from the abstract,
if they're proposing to use stochastic systems to predict deterministic ones
or vice versa; it'd be interesting either way!]
- Muneya Matsui, "A characterization of ARMA and Fractional ARIMA
models with infinitely divisible
innovations", math.ST/0703731
- Brendan P. M. McCabe, Gael M. Martin, David Harris, "Efficient probabilistic forecasts for counts", Journal
of the Royal Statistical Society B 73 (2011): 253--272
- Emma J. McCoy, Sofia C. Olhede, David A. Stephens, "Non-Regular Likelihood Inference for Seasonally Persistent Processes", arxiv:0709.0139
- Patrick E. McSharry and Leonard A. Smith, "Consistent nonlinear
dynamics: identifying model inadequacy", nlin.CD/0401024 = Physica
D 192 (2004): 1--22
- Jan Mielniczuk, Zhou Zhou and Wei Biao Wu, "On nonparametric prediction of linear processes", Journal of
Time Series Analysis 30 (2009): 652--673
- Javier R. Movellan, Paul Mineiro, and R. J. Williams, "A Monte
Carlo EM Approach for Partially Observable Diffusion Processes: Theory and
Applications to Neural Networks," Neural Computation
14 (20020: 1507--1544
- Eric Moulines, Pierre Priouret and Francois Roueff, "On recursive
estimation for time varying autoregressive
processes", math.ST/0603047
= Annals of
Statistics 33 (2005): 2610--2654
- Jose M. F. Moura and Sanjoy K. Mitter, "Identification
and Filtering: Optimal Recursive Maximum Likelihood Approach" [1986
technical report from MIT, found looking for something else, original URL now
lost --- presumably long since published]
- Hans-Georg Muller and Ulrich Stadtmuller, "Generalized functional
linear models", math.ST/0505638 = Annals of
Statistics 33 (2005): 774--805 ["We propose a
generalized functional linear regression model for a regression situation where
the response variable is a scalar and the predictor is a random function. A
linear predictor is obtained by forming the scalar product of the predictor
function with a smooth parameter function, and the expected value of the
response is related to this linear predictor via a link function. If, in
addition, a variance function is specified, this leads to a functional
estimating equation which corresponds to maximizing a functional
quasi-likelihood. This general approach includes the special cases of the
functional linear model, as well as functional Poisson regression and
functional binomial regression. The latter leads to procedures for
classification and discrimination of stochastic processes and functional
data. ... An essential step in our proposal is dimension reduction by
approximating the predictor processes with a truncated Karhunen-Loeve
expansion."]
- Ursula U. Müller, Anton Schick and Wolfgang Wefelmeyer,
"Estimating the innovation distribution in nonparametric autoregression",
Probability Theory and Related Fields
144 (2009): 53--77 ["We prove a Bahadur representation for
a residual-based estimator of the innovation distribution function in a
nonparametric autoregressive model. The residuals are based on a local linear
smoother for the autoregression function."]
- Tomomichi Nakamura, Yoshito Hirata, and Michael Small, "Testing for
correlation structures in short-term variabilities with long-term trends of
multivariate time
series", Physical
Review E 74 (2006): 041114
- Tomomichi Nakamura, Xiaodong Luo, and Michael Small, "Testing for
nonlinearity in time series without the Fourier transform", Physical Review
E 72 (2005): 055201
- Tomomichi Nakamura and Michael Small, "Small-shuffle surrogate
data: Testing for dynamics in fluctuating data with
trends", Physical
Review E 72 (2005): 056216
- Sahand Negahban, , Pradeep Ravikumar, Martin J. Wainwright, Bin Yu, "A unified framework for high-dimensional analysis of $M$-estimators with decomposable regularizers", arxiv:1010.2731
- Ilia Negri, "Efficiency of a class of unbiased estimators for the
invariant distribution function of a diffusion
process", math.ST/0609590
- Ilia Negri and Yoichi Nishiyama, "Goodness of fit test for ergodic
diffusions by tick time sample scheme", Statistical Inference for stochastic Processes 13
(2010): 81--95
- Yoichi Nishiyama, "Goodness-of-fit test for a nonlinear time
series", Journal of Time Series Analysis 30 (2009): 674--681
- Jimmy Olsson, Olivier Cappe, Dandal Douc and Eric Moulines,
"Sequential Monte Carlo smoothing with application to parameter estimation in
non-linear state space
models", math.ST/0609514
- Sorinel Adrian Oprisan, "An application of the least-squares method
to system parameters extraction from experimental data", Chaos
12 (2002): 27--32
- Brahim Ouhbi and Nikolaos Limnios, "Nonparametric estimation for
semi-Markov processes based on its hazard rate functions", Statistical
Inference for Stochastic Processes 2 (1999): 151--173
- P. Palaniyandi and M. Lakshmanan, "Estimation of System Parameters
and Predicting the Flow Function from Time Series of Continuous Dynamical
Systems", nlin.CD/0406027
- Milan Palus, "Coarse-grained
entropy rate for characterization of complex time series", Physica
D 93 (1996): 64--77 [Thanks to Prof. Palus for a
reprint]
- Christoph Pamminger and Sylvia Fruühwirth-Schnatter, "Model-based Clustering of Categorical Time
Series", Bayesian Analysis 5 (2010): 345--368
- Angeliki Papana and Dimitris Kugiumtzis, "Evaluation of Mutual Information Estimators for Time Series", arxiv:0904.4753
- Efstathios Paparoditis, "Validating Stationarity Assumptions in
Time Series Analysis by Rolling Local
Periodograms", Journal
of the American Statistical Association 105 (2010):
839--851
- Zoltán Prekopcsák, Daniel Lemire, "Time Series Classification by Class-Based Mahalanobis Distances", arxiv:1010.1526
- Zacharias Psaradakis
- Zacharias Psaradakis, Martin Sola, Fabio Spagnolo and Nicola Spagnolo, "Selecting nonlinear time series models using information criteria",
Journal of
Time Series Analysis
30 (2009): 369--394
- N. U. Prabhu and Ishawar V. Basawa (eds.), Statistical
Inference in Stochastic Processes (1991)
- B. L. S. Prakasa Rao
- Semimartingales and Their Statistical
Inference
- Statistical Inference for Diffusion-Type
Processes
- E. Racca and A. Porporato, "Langevin equations from time series",
Physical Review
E 71 (2005): 027101
- Ali
Rahimi, Learning to Transform Time Series with a Few Examples
[Ph.D. thesis, MIT dept. of electrical engineering and computer science,
2005. PDF]
- Ali Rahimi, Ben Recht and Trevor Darrell, "Learning to Transform
Time Series with a Few Examples", tech report
[PDF]
- M. M. Rao, Stochastic Processes: Inference Theory
- Ramiro Rico-Martinez, K. Krischer, G. Flaetgen, J.S. Anderson and
I. G. Kevrekidis, "Adaptive Detection of Instabilities: An Experimental
Feasibility Study," nlin.CD/0202057
- Christoph Rieke, Ralph G. Andrzejak, Florian Mormann and Klaus
Lehnertz, "Improved statistical test for nonstationarity using recurrence time
statistics", Physical Review E 69 (2004): 046111
[link]
- Ricardo Ríos, Luis-Angel Rodríguez,
"Penalized estimate of the number of states in Gaussian linear AR with Markov regime", Electronic Journal of Statistics 2 (2008): 1111--1128, arxiv:0807.2726
- John C. Robertson, Ellis W. Tallman and Charles H. Whiteman,
"Forecasting using relative entropy," Federal Reserve Bank of Atlanta Working
Paper 2002-20
[PDF]
- J. W. C. Robinson, J. Rung, A. R. Bulsara and M. E. Inchiosa,
"General measures for signal-noise separation in nonlinear dynamical
systems," Physical Review E 63 2000: 011107
- Brois Ryabko, "Applications of Universal Source Coding to Statistical Analysis of Time Series", arxiv:0809.1226
- Boris Ryabko and Jaakko Astola
- "Universal Codes as a Basis for Time Series
Testing", cs.IT/0602084
- "Universal Codes as a Basis for Nonparametric Testing of
Serial Independence for Time Series", cs.IT/0506094
- Daniil Ryabko
- Manuel S. Santos, "Consistency properties of a simulation-based estimator for dynamic processes", Annals of Applied Probability
20 (2010): 196--213
- Suchi Saria, Daphne Koller, Anna Penn, "Discovering shared and individual latent structure in multiple time series", arxiv:1008.2028
- Joao R. Sato, Sergi Costafreda, Pedro A. Morettin, Michael John Brammer, "Measuring Time Series Predictability Using Support Vector Regression",
Communications in Statistics: Simulation
and Computation 37 (2008): 1183--1197
- Nicola Scafeta, Patti Hamilton and Paolo Grigolini, "The
Thermodynamics of Social Processes: The Teen Birth Phenomenon," cond-mat/0009020 [Not because
I believe them about sociology, but because they claim to have new and powerful
nonparametric methods for detecting and quantifying memory in time series]
- Thomas Schreiber and Andreas Schmitz, "Surrogate time series," chao-dyn/9909037
- Reiner Schulz and James A. Reggia, "Temporally Asymmetric Learning Supports Sequence Processing in Multi-Winner Self-Organizing Maps",
Neural
Computation 16 (2004): 535--561 [the "model
presented here raises the possibility that SOMs may ultimately prove useful as
visualization tools for temporal sequences and as preprocessors for sequence
pattern recognition systems."]
- Xiaofeng Shao, "A self-normalized approach to confidence interval construction in time series", Journal of the
Royal Statistical Society B 72 (2010): 343--366,
arxiv:1005.2137
[Arxiv version includes an important correction to Assumption 2 and related
theorems]
- Xiaofeng Shao, Wei Biao Wu, "Asymptotic spectral theory for
nonlinear time
series", math.ST/0611029
- M. Siefert, J. Peinke and R. Friedrich, "On a quantitative method
to analyse dynamical and measurement noise," physics/0108034
- Przemyslaw Sliwa and Wolfgang Schmid, "Monitoring the
cross-covariances of a multivariate time series", Metrika
61 (2005): 89--115
- A. Sitz, U. Schwarz, J. Kurths, H. U. Voss, "Estimation of
parameters and unobserved components for nonlinear systems from noisy time
series," Physical Review E 66 (2002): 016210
- Michael Small and Kevin Judd, "Detecting periodicity in
experimental data using linear modeling techniques", physics/9810021
- Vadim N. Smelyanskiy and Dmitry G. Luchinsky, "Inference of
stochastic nonlinear oscillators with applications to physiological problems",
physics/0403121 [They
present this as a Bayesian inference issue, but the core of their work appears,
from skimming, to be an efficient method for computing the likelihood, so it'd
apply equally well to maximum likelihood inference, for instance.]
- V. N. Smelyanskiy, D. A. Timucin, A. Bandrivskyy and D. G.
Luchinsky, "Model reconstruction of nonlinear dynamical systems driven by
noise", physics/0310062
[Same as earlier paper --- was this one submitted to PRL?]
- Dmitry A. Smirnov, Vladislav S. Vlaskin and Vladimir I.
Ponomarenko, "Estimation of parameters in one-dimensional maps from noisy
chaotic time series", Physics Letters
A 336 (2005): 448--458
- Song Song and Peter J. Bickel, "Large Vector Auto Regressions",
arxiv:1106.3915
- Eduardo D. Sontag, "For differential equations with r parameters,
2r+1 experiments are enough for identification," math.DS/0111135
- D. Sornette and V. F. Pisarenko, "Properties of a simple bilinear
stochastic model: estimation and predictability", physics/0703217
- Jean-Pierre Stockis, Jurgen Franke and Joseph Tadjuidje Kamgaing,
"On geometric ergodicity of CHARME
models", Journal
of Time Series Analysis>cite> 31 (2010): 141--152
- Tomoya Suzuki, Tohru Ikeguchi, and Masuo Suzuki, "Effects of data
windows on the methods of surrogate data", Physical Review
E 71 (2005): 056708
- Alexander G. Tartakovsky, "Asymptotic Optimality of Certain
Multihypothesis Sequential Tests: Non-i.i.d. Case", Statistical Inference
for Stochastic Processes 1 (1998): 265--295
- Marco Thiel, M. Carmen Romano and Jurgen Kurths, "How much
information is contained in a recurrence plot?", Physics Letters
A 330 (2004): 343--349
- Madeleine B. Thompson, "A Comparison of Methods for Computing Autocorrelation Time", arxiv:1011.0175
- Tina Toni, David Welch, Natalja Strelkowa, Andreas Ipsen, Michael P.H. Stumpf, "Approximate Bayesian computation scheme for parameter inference and model selection in dynamical systems", arxiv:0901.1925
- Wilson Truccolo, John P. Donoghue, "Nonparametric Modeling of
Neural Point Processes via Stochastic Gradient Boosting Regression", Neural
Computation 19 (2007): 672-705
- Masayuki Uchida and Nakahiro Yoshida, "Information Criteria in
Model Selection for Mixing Processes", Statistical Inference for
Stochastic Processes 4 (2001): 73--98 ["The emphasis is
put on the use of the asymptotic expansion of the distribution of an estimator
based on the conditional Kullback-Leibler divergence for stochastic processes.
Asymptotic properties of information criteria and their improvement are
discusssed."]
- Aad van der Vaart and Harry van Zanten, "Donsker theorems for
diffusions: Necessary and sufficient conditions", math.PR/0507412 = Annals of
Probability 33 (2005): 1422--1451
- Harry van Zanten, "On Uniform Laws of Large Numbers for Ergodic
Diffusions and Consistency of Estimators", Statistical Inference for
Stochastic Processes 6 (2003): 199--213 ["In contrast
with uniform laws of large numbers for i.i.d. random variables, we do not need
conditions on the 'size' of the class [of functions] in terms of bracketing or
covering numbers. The result is a consequence of a number of asymptotic
properties of diffusion local time that we derive."]
- J. H. van Zanten, "On the Uniform Convergence of the Empirical
Density of an Ergodic Diffusion", Statistical Inference for
Stochastic Processes 3 (2000): 251--262
- Paolo Vidoni, "A simple procedure for computing improved
prediction intervals for autoregressive models", Journal of Time Series Analysis 30 (2009): 577--590
- Juan M. Vilar-Fernandez and Ricardo Cao, "Nonparametric Forecasting
in Time Series - A Comparative Study", Communications in
Statistics: Simulation and Computation 36 (2007):
311--334
- R. Vilela Mendes, R. Lima and T. Araujo, "A Process-Reconstruction
Analysis of Market Fluctuations," cond-mat/0102301 [I don't care
about the market, but they claim to have a new method for identifying
distributions over entire sequences]
- Divakar Viswanath, Xuan Liang, Kirill Serkh, "Metric Entropy and the Optimal Prediction of Chaotic Signals", arxiv:1102.3202
- Zijun Wang, "Finite Sample Performances of the Model Selection Approach in Nonparametric Model Specification for Time Series", Communications in Statistics: Theory and Methods 38
(2009): 2302--2330
- Halbert White ,Asymptotic Theory for Econometricians
- Wei Biao Wu
- Herwig Wendt, Patrice Abry and Stephane Jaffard, "Bootstrap for
Empirical Multifractal Analysis", IEEE Signal Processing Magazine
July 2007, pp. 38--48 [+ technical papers by these authors]
- Simon N. Wood, "Statistical inference for noisy nonlinear ecological dynamic systems", Nature 466 (2010): 1102--1104
- Hongqi Xue, Hongyu Miao, and Hulin Wu, "Sieve estimation of constant and time-varying coefficients in nonlinear ordinary differential equation models by considering both numerical error and measurement error", Annals of Statistics 38 (2010): 2351--2387
- A. Zeileis and G. Grothendieck, "zoo: S3 Infrastructure for Regular
and Irregular Time Series", Journal of Statistical
Software 14 (2005): 1--27 = math.ST/0505527 ["zoo is an R
package providing an S3 class with methods for indexed totally ordered
observations, such as discrete irregular time series. Its key design goals are
independence of a particular index/time/date class and consistency with base R
and the "ts" class for regular time series."]
- Zhou Zhou, "Nonparametric inference of quantile curves for nonstationary time series", Annals of Statistics 38 (2010): 2187--2217
- M. Zukovic, D. T. Hristopulos, "Spartan Random Processes in Time
Series Modeling", 0709.3418
To write/finish:
- CRS, "Learning Rates and Recurrence Times" [a.k.a. "Wait and see"]
- CRS, "Algorithms for Inferring the Statistical Structure of Symbol
Sequences: History and Review"
#
Sat, 04 Feb 2012
Recommended Science Fiction
These range from merely good reads to really outstanding books. A raw ranking
of them would be of little use to others, unless I explained why I gave them
the ranks I did, and anyway I'd probably give different rankings by the time
your read this. (When I know of an on-line review about a book which I agree
with --- e.g., because I wrote it --- I've included a link; also some
exceedingly short remarks about interesting cases.)
See also
Fantasy and Horror Recommendations;
Science Fiction
- Brian Aldiss, Helliconia Spring [I don't include the
other two books of the trilogy --- Helliconia Summer and
Winter --- simply because I haven't read them yet...]
- Poul Anderson, The Man who Counts
- Taylor Anderson [These could be summed up, entirely
fairly, as "What these lemurs need is a boatload of vintage honkies".]
- Into the Storm
- Crusade
- Maelstrom
- Distant Thunders
- Rising Tides
- Isaac Asimov
- Foundation Trilogy [not the much-later sequels]
- Foundation
- Foundation and Empire
- Second Foundation
- I, Robot [Stories]
- Iain M. Banks
- Inversions
- Look to Windward
- Greg Bear
- Forge of God [One of the scariest books I've
ever read; the sequel, Anvil of Stars, didn't work anywhere near
as well]
- Heads
- Queen of Angels and / [Review: De nos fabula]
- Tangents [Stories]
- The Wind from a Burning Woman [Stories]
- Vitals [Thanks, Cris!]
- Cherly Benard, Turning on the Girls [Not very
imaginative as SF, but very funny]
- Alfred Bester
- The Demolished Man
- The Stars My Destination
- James Blish, A Case of Conscience [Theologically
inaccurate Catholic first-contact story]
- Ray Bradbury
- I Sing the Body Electric [Stories]
- The Martian Chronicles [Linked stories]
- R Is for Rocket [Stories]
- David Brin
- The Uplift books [space-opera, but good space opera, except
for the highly unfortunate last book in the series, which I shan't list]
- Sundiver
- Startide Rising
- The Uplift War
- Brightness Reef
- Infinity's Shore
- Damien Broderick, The Black Grail
- John Brunner
- The Atlantic Abomination [Lovecraftian
monsters vs. early '60s technological optimism; not great but fun. Dated
gender roles are dated.]
- Double, Double
- The Shockwave Rider
- The Squares of the City [Perhaps the best
novel ever written about urban planning]
- Stand on Zanzibar
- Steven Brust, My Own Kind of Freedom [Free online]
- Lois McMaster
Bujold [Normally, I have a special place in my heart
for military SF, and it's run by the ghost of Felix Dzherzhinsky; but these are
such well-written books (just think of the scene in Warrior's
Apprentice where Miles ad libs the Dendarii into existence) that one
hardly notices one's reading about utter monsters --- and the shock is all
the greater when the realization does penetrate, which always happens. I
literally read all I could lay hands upon in a week, have now gone through all
of them, and wait impatiently for more.]
- Shards of Honor
- Barrayar
- Cordelia's Honor [=the first two collected]
- The Warrior's Apprentice
- The Vor Game
- Young Miles [=Warrior's
Apprentice plus Vor Game plus a story from Borders of
Infinity which falls between them]
- Cetaganda
- Borders of Infinity
- Brothers in Arms
- Mirror Dance
- Memory
- Komarr
- A Civil Campaign
- Diplomatic Immunity
- Cryoburn
- Falling Free [In the same universe, but a long time earlier]
- Jack Campbell, The Lost Fleet series [Horrible
cover-art, titles I'm embarrassed to name, compelling stories]
- Dauntless
- Fearless
- Courageous
- Valiant
- Relentless
- Victorious
- Dreadnaught
- Karel Capek, War with the
Newts
- Adam-Troy Castro, Emissaries from the Dead
- Suzy McKee Charnas, The Vampire Tapestry
- C. J. Cherryh
- Cyteen
- Downbelow Station
- Foreigner, Invader,
Inheritor [There is a notable downward gradient in these books,
and I do not recommend the further sequels]
- Heavytime and Hellburner
- Arthur Clarke
- 2001 and 2010
- The City and the Stars
- Imperial Earth
- Tales from the White Hart
- The Other Side of the Sky
- Rendezvous with Rama [emphatically not any of
the sequels]
- Fountains of Paradise
- The Nine Billion Names of God
- Helen Collins, Mutagenesis
- Sara Creasy, Song of Scarabaeus
- Brian Daley [Comic space opera]
- Reqiuem for a Ruler of Worlds
- Jinx on a Terran Inheritance
- The Fall of the White Ship Avatar
- Avram Davidson, The Avram Davidson Treasury [Davidson
was mainly a short-story writer of genius; this is the only collection of his
stories still in print, but it's very recent and very good. General review: Avram Davidson's
Afterlife]
- Jenny Davidson, The Explosionist [Warning: the story
is not finished here, something none of the publicity for the book
leads one to expect.]
- L. Sprague de Camp
- George Alec Effinger
- When Gravity Fails
- Fire in the Sun
- The Exile Kiss
- Schrödinger's Kitten
- Greg Egan
- Warren Ellis and Darick
Robertson, Transmetropolitan [Yes, they're comic books. They're
also brilliant: imagine Hunter Thompson and John Brunner collaborating
on the script for a movie to be filmed by Fritz Lang, Capra and John Carpenter,
rendered by a Hogarth who has seen the future and loathes it.]
- Back on the Street
- Lust for Life
- Year of the Bastard
- The New Scum
- Lonely City
- Gouge Away
- Spider's Trash
- Dirge
- The Cure
- One Last time
- Harlan "I Am Not a Science Fiction Writer" Ellison
[Note: many of these short-story collections overlap. I suggest starting with
The Essential Ellison. Website: Ellison Webderland]
- Angry Candy
- Deathbird Stories
- An Edge in My Voice [essay collection]
- The Essential Ellison
- Paingod and Other Delusions
- Shatterday
- Strange Wine
- John M. Ford
- Growing Up Weightless
- How Much Just for the Planet?
- The Princes of the Air
- Web of Angels
- Karen Joy Fowler, Artificial Things
- R. García y Robertson, The Virgin and the
Dinosaur
- Randall Garrett
- Alexis A. Gilliland, The Revolution from Rosinante
- Phylis Gotlieb, Flesh and Gold
- Thomas Harlan
- Wasteland of Flint
- House of Reeds
- Land of the Dead
- Robert Heinlein, The Moon Is a Harsh Mistress
- John G. Hemry, A Just Determination
- Frank Herbert, Dune [I think I've read all the fiction
he ever published; I cannot now imagine why]
- Theodore Judson, Fitzpatrick's War [A re-telling of
the story of Alexander the Great, making him out to be the psychopathic
catatstrophe in human form he really was]
- Ernst Jünger, The Glass Bees
[Parallel, distributed
robotic artificial intelligence, as seen by an immensely
cultivated, immensely reactionary German novelist and cavalry officer around
1957. Which sounds like it should be in a science fiction novel.
The New York Review edition has, aptly enough, an introduction by
Bruce Sterling.]
- Janet Kagan, Mirabile
- Rosemary Kirstein
- Steerswoman's Road [=Steerswoman + The Outskirter's Secret]
- The Lost Steersman
- The Language of Power
- Sharon Lee and Steven Miller, Partners in Necessity [=
omnibus edition of A Conflict of Honors, Agent of
Change and Carpe Diem. Light entertainments, which owe
more than I'd usually like to romance novels, but fun reads.]
- Ursula Le Guin
- The Dispossessed
- Four Ways to Forgiveness
- Three Hainish Novels
- Fritz Leiber
- Gather, Darkness
- A Sepcter Is Haunting Texas
- Stanislaw
Lem [Lem was one of the great thinkers of the 20th century. People didn't
realize this, because he put his social philosophy and epistemology in science
fiction, but it's there all the same.]
- The Cyberiad
- The Futurological Congress [Review by Danny
Yee]
- His Master's Voice [The Great Information Theory Novel]
- Imaginary Magnitude [Introductions to
imaginary books]
- The Invincible
- Peace on Earth
- A Perfect Vacuum [Reviews of imaginary books,
including a harsh take on Stanislaw Lem's A Perfect Vacuum.
Brilliant and funny; includes what I think is an entirely new probabilistic
fallacy, and several weird cosmologies, all hard to refute and utterly
incompatible.]
- Solaris
- Paul J. McAuley
- Confluence [A mutant member of the Dying Earth sub-genre.]
- Child of the River
- Ancients of Days
- Shrine of Stars
- Fairyland
- Pasquale's Angel [Alternate history where
Leonardo started the industrial revolution during
the Renaissance. I introduced Danny Yee to this
book, producing this
review.]
- The Quiet War
- Wil McCarthy
- Bloom
- Murder in the Solid State
- Ian
McDonald
- Out on Blue Six
- Scissor Cut Paper Wrap Stone
- Sandra McDonald, The Outback Stars
- Maureen McHugh
- China Mountain Zhang
- Half the Day Is Night
- Patricia McKillip, Fool's Run
- Ken MacLeod
- George R. R. Martin, Tuf Voyaging
- Elizabeth Moon
["Familias Regnant" books.]
- Hunting Party
- Sporting Chance
- Winning Colors
- Once a Hero
- Rules of Engagement
- Change of Command
- Against the Odds
[A different series, in a different universe]
- Trading in Danger
- Marque and Reprisal
- Chris Moriarty, Spin State
- Larry Niven [All from back when Niven was good. Narrative order,
though it's not necessary to read them that way]
- The Long Arm of Gil Hamilton [Stories]
- World of Ptaavs
- A Gift from Earth
- Neutron Star [Stories]
- Ringworld
- Alexei Panshin
- Larry Niven and Jerry Pournelle, The Mote in God's Eye
- Frederik Pohl [Pohl has written a few excellent books, a very large
number of good ones, and none (that I've read) which wasn't an at least OK way
to pass the time. At the moment, I'd say his three best
are Gateway, The Space Merchants and the Eschaton
Sequence (taken as a whole), but I'm open to persuasion.]
- Black Star Rising
- The Eschaton Sequence [Series premised on Frank Tipler's
Omega Point Theory, only correcting a serious defect in Tipler's formulation,
viz., realizing that there's no reason why Omega should be benevolent.]
- The Other End of Time
- The Siege of Eternity
- The Far Shore of Time
- Gateway [The sequels are OK]
- Homegoing
- Mining the Oort
- Narabedla Ltd.
- O Pioneer!
- Outnumbering the Dead
- The World at the End of Time
- and C. M. Kornbluth, The Space Merchants
- Laura E. Reeve
- Peacekeeper
- Vigilante
- Pathfinder
- Alastair Reynolds [Narrative order. Extremely hard
science space opera.]
- Revelation Space [Or, Some of My Best Friends
Are Monstrous Chimeras of Tortured Flesh and Nanomechanical Viruses]
- Chasm City
- Redemption Ark
- Chris Roberson, Paragaea
- Kim Stanley Robinson, The Years of Rice and Salt [This
is clearly the most probable path for world
history to have taken; we are a very unlikely fluctuation.]
- Matt Ruff, Sewer Gas & Electric
- John Scalzi
- Old Man's War
- The Ghost Brigades
- The Last Colony
- Zoe's Tale
- James Schmitz
- Mike Shepherd, Kris Longknife [I'm obscurely ashamed
of myself for finding this series compulsively readable, but I do.]
- Mutineer
- Deserter
- Defiant
- Resolute
- Audacious
- Intrepid
- Robert Silverberg
- Dying Inside
- Nightwings
- Thorns
- Tower of Glass
- Dan Simmons
- Children of the Night [A vampire story; but
really SF]
- The Hyperion series [Hyperion and
Fall of Hyperion are really a single novel, and that novel is one
of the best I've read. It has all the virtues: plot, character,
world-building, neat ideas, description, suspension of disbelief, prose style
(at least eight, all handled expertly), nuance of language and allusion and
construction. "The Scholar's Tale" in Hyperion reduced me to
tears; "The Detective's Tale", immediately following, is a hilarous dead-pan
satire of hardboiled detective stories. Simmons must have cackled
while writing this book, thinking of what he was doing to the reader's mind.
It deserved every award which was thrown at it and more. The second set of
novels, Endymion and Rise of Enydmion, are however
vastly inferior.]
- Hyperion
- Fall of Hyperion
- John Sladek
- Kristine Smith, Code of Conduct
- Olaf Stapeldon
- The Last and the First Men
- A Last Man in London
- Neal Stephenson [Stephenson is funny, he tells a good story, he's a
sucker for neat techy ideas which he does really well (when I teach theory of computation I'm going to use excerpts
from the Young Lady's Illustrated Primer), and he couldn't write a
decent ending to save his life.]
- The Big U [Monstrous campus architecture meets the bicameral mind. Very plainly a first novel, but
very amusing to anyone who's attended an American college of over, say, 20,000
students. The new edition has a cover it is not embarrassing to be seen with.]
- Cryptonomicon [No visible SF
elements. I liked the historical part, during the Second World War, far
better than the modern story about the descendants of those characters. Said
characters are, principally, twerps and idiots. (Will somebody tell me why
the cornflakes scene is supposed to be funny?) Plus, Stephenson rode various
unruly hobby-horses with them, showing profound cluelessness about economics.
And the ending was bizarre and pointless --- whereas normally his endings are
strained, abrupt and unsatisfying. But the WWII story was brilliant.]
- The Diamond Age, or, A Young Lady's Illustrated
Primer
- Snowcrash
- Bruce Sterling
- Crystal Express [Stories]
- Distraction
- A Good Old-Fashioned Future [Stories]
- The Hacker Crackdown [Nonfiction, actually]
- Heavy Weather
- Holy Fire
- Islands in the Net
- Schismatrix [Now available in
Schismatrix Plus, the plus being a new introduction and about half
the stories earlier collected in Crystal Express, which are set in
the same universe]
- Zeitgeist [Taking the social construction of reality
seriously is so science fiction]
- S. M. Stirling
- Charles
Stross
The Merchant Princes [This is science fiction;
who are you going to believe, me or your own eyes looking at the cover?]
- The Family Trade
- The Hidden Family
- The Clan Corporate
- Singularity Sky
- Michael Swanwick
- Stations of the Tide
- Vacuum Flowers
- William Tenn, anything
- Maggy Thomas, Broken Time
- Jack Vance deserves his own page
- Kurt "I'm not a science fiction writer either" Vonnegut
- Cat's Cradle
- Palm Sunday [Not fiction]
- Player Piano
- Slaughterhouse Five
- Scott Westerfeld
- Walter Jon Williams
- Ambassador of Progress
- Aristoi
- Day of Atonement
- Dread Empire's Fall [Space opera, with ugly
cover art and highly misleading blurbs; also really good novels. For
instance, I don't think I've ever read a better portrayal of the special
intoxication which comes when sexual love coincides with intellectual
collaboration. And The Praxis, in particular, contains an
embedded novella about identity, ambition, friendship and betrayal which is
simply devastating.]
- The Praxis
- The Sundering
- Conventions of War
- Facets [Stories]
- Knight Moves
- Implied Spaces
- Voice of the Whirlwind
- The near-future-if-not-present alternate reality game series:
- This Is Not a Game
- Deep State
- The Fourth Wall
- His interview
(sort of) with Hot Wired is also worthwhile, if only for the bits where he
discusses his suit against Wired Ventures Inc.
- Connie Willis
- And Not Forgetting the Dog [Sequel to
Doomsday Book, but can be enjoyed independently]
- Impossible Things [Stories]
- Doomsday Book [Prequel to the title story in
Fire Watch, but can be enjoyed --- if that is the word ---
independently]
- Fire Watch [Stories]
- Remake
- Gene Wolfe
- John Wyndham
- Consider Her Ways [Stories]
- Day of the Triffids
- Gooseflesh and Laughter [Stories]
- Kraken
- The Midwich Cuckoos
- Roger Zelazny
- Creatures of Light and Darkness
- The Doors of His Face, the Lamp of His Mouth
[Stories]
- Four for Tomorrow [Four novellas]
- Lord of Light [Review by Danny Yee]
- Sarah Zettel
#
Recommended Fantasy Books
These range from merely good reads to really outstanding books; but rather than
trying to rate each one, or (what would be more to the point) explain my
ratings, I've merely listed them without any particular indication of rank.
Horror novels are included here for want of anyplace better to put them.
Titles are added as they occur to me.
Links on titles are generally to my review or briefer comments, if I have any.
See also
Fantasy;
Science Fiction Recommendations.
Joe Abercrombie, The First Law [comments with spoilers]
- The Blade Itself
- Before They Are Hanged
- Last Argument of Kings
- Kage Baker
- The Anvil of the World
- The House of the Stag
- The Bird of the River
- John Barnes, One for the Morning Glory
- Peter Beagle
- I See by My Outfit [Not fantasy, and I have no
idea what the title means, but lovely]
- The Last Unicorn
- Elizabeth Bear, New Amsterdam
- Robert Jackson Bennett, Mr. Shivers
[Author's
comments]
- K. J. Bishop, The Etched City
- Ray Bradbury, Something Wicked This Way Comes
- Patricia Briggs
- Dragon Bones and Dragon Blood
- Raven's Shadow and Raven's Strike
- "Mercy Thompson" series
- Moon Called
- Blood Bound
- Iron Kissed
- Bone Crossed
- Silver Borne
- River Marked
- Cry Wolf
- Max Brooks, World War Z: An Oral History of the Zombie
War
- Steven Brust
- The Vlad Taltos books [Enjoyable so far --- there're a good
many more in the series, which hasn't ended, and will probably amount to
seventeen books in all. Brust may be the last writer of real talent to be a
follower of Trotsky.]
- To Reign in Hell [Making sense of Milton; see
under Demonology]
- The Khaavren books [Faithful pastiches of Dumas,
consistently fit into a world Brust had already made up for the Vlad Taltos
books --- and more fun than I remember Dumas being.]
- The Phoenix Guards
- Five Hundred Years After
- Steven Brust and Megan Lindholm, The Gypsy
- Lois McMaster Bujold
- The Curse of Chalion
- The Paladin of Souls
- The Spirit Ring
- The Hallowed Hunt
- The Sharing Knife:
- Beguilement
- Legacy
- Passage
- Horizon
- Emma
Bull, War for the Oaks
- Robert W. Chambers, The King in Yellow [online]
- Suzy McKee Charnas, The Vampire Tapestry
- C. J. Cherryh
- The Dreaming Tree [=The
Dreamstone, plus The Tree of Swords and Jewels, plus
Cherryh's corrections and ending]
- Fortress in the Eye of Time [I think the story
ends best here, with the first book; certainly the sequels go downhill, and the
last one I read, Fortress of Dragons, had me wondering whether my
copy was defectively printed and missing about, oh, two hundred pages. Sadly,
no.]
- The Gate of Ivrel
- The Paladin
- Susanna Clarke, Jonathan Strange and Mr. Norrell
- B. W. Clough, The Dragon of Mishbil
- Seamus Cooper, The Mall of Cthulhu
- Avram Davidson [General Review: Avram Davidson's
Afterlife]
- Vergil Magus
(unfinished):
- The Phoenix and the Mirror
- Vergil in Averno [Anyone have a copy
they'd be willing to sell?]
- Peregrine: Primus and Secundus
- The Enquiries of Dr. Estzerhazy
- Adventures in Unhistory [Essays on legendary
subjects]
- The Avram Davidson Treasury [The best of his
numerous short-story collections]
- Avram Davidson and Grania Davis, The Boss in the Wall: A
Treatise on the House Devil
- Pamela Dean, Tam
Lin
- L. Sprague de Camp
- Stephen R. Donaldson [I read all his Thomas Covenant books, and
cannot know imagine why, except for adolescent masochism; these are utterly
different, and, to my mind, vastly better]
- The Mirror of Her Dreams
- A Man Rides Through
- Doyle and MacDonald
- Lord Dunsany
- The King of Elfland's Daughter
- The Pegana books:
- The Shadow Valley books:
- Don Rodriguez
- The Charwoman's Shadow
- Rosemary Edghill
(a.k.a. eluki bes shahar)
- The Sword of Maidens' Tears
- The Cup of Morning Shadows
- The Cloak of Night and Daggers
- Harlan Ellison, see under science
fiction
- Charles Finney, The Circus of Dr. Lao
- John M. Ford
- Casting Fortune
- The Dragon Waiting [Almost an alternate
history of Renaissance Europe, except that magic works.]
- The Last Hot Time
- Diana Pharaoh Francis
- The Cipher
- The Black Ship
- Esther Friesner, Yesterday We Saw Mermaids
- Neil Gaiman
- American Gods
- The Sandman
- Randall Garrett
- Murder and Magic
- Too Many Magicians
- Lord Darcy Investigates
- Mary Gentle
- Rats and Gargoyles
- The Book of Ash [Owing to the manifold crimes
and wickedness of American publishers, I've only seen the first half of this]
- A Secret History: The Book of Ash, #1
- Carthage Ascendant: The Book of Ash,
#2
- Felix Gilman
- Charlaine Harris, Dead Until Dark [series fatigue
sets in rapidly after that]
- P. C. Hodgell
- God Stalk
- Dark of the Moon
- Seeker's Mask
- To Ride a Rathorn
- Barry Hughart
- Bridge of Birds
- The Story of the Stone
- Eight Skilled Gentlemen
- Diana Wynne Jones
- Dark Lord of Derkholm
- Deep Secret
- Hexwood
- The Tough Guide to Fantasyland [Review]
- Howard Andrew Jones, Desert of Souls
- Guy Gavriel Kay, The Lions of Al-Rassan
- Brian Keene and Nick Mamatas, The Damned Highway: Fear and Loathing in Arkham: A Savage Journey into the Heart of the American Nightmare
- Caitlín R. Kiernan
- Stephen King [What can I say? When he's good, he is good;
but his novels, increasingly, need ruthless editing, which they do not, sadly,
receive. The short stories remain excellent.]
- The Dark Half
- The Dark Tower:
- The Gunslinger
- The Drawing of the Three
- The Wastelands
- Wizard and Glass [Review]
- Wolves of the Calla
- Song of Susannah
- The Dark Tower ["There I will sing all
their names..."]
- Desperation
- Everything's Eventual
- Eyes of the Dragon
- Four Past Midnight
- Misery
- Nightshift
- Salem's Lot
- The Shining
- The Stand
- Danse Macabre [Reflections on the horror
genre]
- Stephen King and Peter Straub
- The Talisman
- The Black House
- Rosemary Kirstein
- Steerswoman's Road [=Steerswoman + The Outskirter's Secret]
- The Lost Steersman
- The Language of Power
- Ellen
Kushner, Swordspoint
- Jay Lake
- Green
- Endurance
- Sarah Langan
- Sheridan LeFanu, Carmilla
[After whom I once named a computer]
- Ursula K. Le Guin
- A Wizard of Earthsea
- The Tombs of Atuan
- The Farthest Shore
- Tehanu [This book caps the series, but it was
written much later, and is very different from the others; I'm not
sure how I feel about it.]
- Fritz Leiber
- Conjure Wife
- Our Lady of Darkness
- Megan Lindholm
- Jane Lindskold, The Buried Pyramid
- Jeff Long, The Descent [A case could be made for this
being science fiction, but anything with possession and Satan in it is fantasy
in my book. I hesitate to recommend it, because it's got plot- and world-building- holes big enough to drive tanks through, but in the end I couldn't
put it down, or put it out of my mind once I was done.]
- H. P. Lovecraft
- At the Mountains of Madness [Arguably,
however, this is pure science fiction]
- The Dream-Quest of Unknown Kadath
- There are now many short story collections. One of the
nicest is the one from the Library of America (edited by the great Peter
Straub), which includes At the Mountains of Madness, but I have a
soft-spot for the old one edited by August Derleth, rejoicing in the
title Blood-curdling Tales of Horror and the Macabre, which I
re-read so much as a boy that it literally disintegrated...
- Scott Lynch
- The Lies of Locke Lamora
- Red Seas Under Red Skies
- Elizabeth A. Lynn, Dragon's Winter
- R. A. MacAvoy
- Tea with the Black Dragon [The sequel,
Twisting the Rope, is good, but not nearly so good as
Tea, and hard to find]
- The Lens of the World trilogy:
- The Lens of the World
- King of the Dead
- Belly of the Wolf
- George R. R. Martin [This is the kind of multi-volume,
multi-threaded fantasy epic which is supposed to be absolute dreck,
at least by the end of the second lap-breaking book. Sadly, it is excellent
fun. He just needs to write more!]
- A Game of Thrones
- A Clash of Kings
- A Storm of Swords
- A Feast for Crows
- A. Lee Martinez, Gil's All Fright Diner
- Misty Massey, Mad Kestrel
- Patricia A. McKillip
- Alphabet of Thorn
- The Bell at Sealey Head
- The Book of Atrix Wolfe
- The Changeling Sea
- The Forgotten Beasts of Eld
- In the Forest of Serre
- Od Magic
- Ombria in Shadow
- Riddle-Master [Originally published as a
trilogy]
- Solstice Wood
- Song for the Basilisk
- The Sorceress and the Cygnet and The Cygnet and the Firebird
- The Tower at Stony Wood
- Winter Rose
- Robin McKinley, Sunshine [As a wise man has said, "nearly perfect". I
have a problem with the ending, but it'd be a massive spoiler to say what it
is, so I'll bite my tongue.]
- Brian McNaughton, Throne of Bones [Imagine an even
more twisted and pervy version of Clark Ashton Smith]
- Christopher Moore [No
relation of my friend and sometime co-author, Cristopher Moore.]
- Richelle Mead, Succubus Blues
- Mike Mignola and Christopher Golden, Baltimore, or, The
Steadfast Tin Soldier and the Vampire: An Illustrated Novel
- Moira J. Moore, Heroes at Risk
- James Morrow
- Bible Stories for Adults
- Only Begotten Daughter [A smart Jewish girl
from New Jersey, of course]
- Towing Jehovah [When God dies, what happens to
His Corpse?]
- Justine Musk, Blood Angel
- Garth Nix, Sabriel
- Naomi Novik
- His Majesty's Dragon
- Throne of Jade
- Black Powder War
- Empire of Ivory
- Paul Parks, A Princess of Roumania [Recommended
by Henry Farrell]
- Victor Pelevin, The Sacred Book of the Werewolf
- Tim Powers, Declare
- Terry Pratchett
- Discworld novels [merely my favorites among the ones I've
read; alphabetical order]
- The Amazing Maurice and His Educated
Rodents
- Carpe Jugulum
- Feet of Clay
- Guards! Guards!
- Going Postal
- A Hat Full of Sky
- Hogfather [A delight "on many levels,
not least of which is a deep reminder of the old blood-on-snow huge sweating
bristly virile boar nightmare ice age aspects of Christmas" --- Bill Tozier]
- Interesting Times
- Jingo
- Maskerade [Not for opera-lovers]
- Monstrous Regiment
- Mort
- Men at Arms
- Night Watch
- Reaper Man
- Small Gods
- Soul Music
- The Truth
- Witches Abroad
- Wyrd Sisters
- Pyramids
- Thief of Time
- Thud!
- The Wee Free Men
- Nation
- and Neil Gaiman, Good Omens
- Cherie Priest, Four and Twenty Blackbirds
- Christopher Priest, The Prestiege
- Philip Pullman
- The Golden Compass = The Northern
Lights
- The Subtle Knife
- Cat Rambo and Jeff VanderMeer, The Surgeon's Tale: And Other Stories
- Kat Richardson
- Greywalker
- Poltergeist
- Underground
- Vanished
- Labyrinth
- Phil Rickman
- Curfew = Crybbe [Part rural
comedy, part New Age satire, entirely a well-written, scary horror novel]
- December [Way too much about John Lennon, but
still good]
- The Man in the Moss
- Merrily Watkins series [Mysteries about a
contemporary Church of England exorcist; addictive]
- The Wine of Angels
- Midwinter of the Spirit
- A Crown of Lights
- The Cure of Souls
- The Lamp of the Wicked
- Prayer of the Night Shepherd
- The Smile of a Ghost
- The Remains of An Altar
- Fabric of Sin
- Diana Rowland [Enjoyable hybrids of police procedurals and
contemporary fantasy, with local color for southern Louisiana.]
- Michelle Sagara
- Cast in Shadow
- Cast in Courtlight
- Cast in Secret
- Cast in Fury
- Cast in Silence
- Cast in Chaos
- Cast in Ruin
- Brandon Sanderson, The Final Empire [first book of
a trilogy which I haven't finished]
- Charles Saunders,Imaro
- Will Shetterly
- Cats Have No Lord
- Elsewhere
- Witchblood
- Trey Shiels, The Dread Hammer
- Susan Shwartz, Shards of Empire
- Dan Simmons
- Carrion Comfort
- Children of the Night [Really SF, but it's
about vampires, and told like a horror novel]
- Fires of Eden [Comic horror novel]
- Song of Kali [Scary as hell, but also
borderline racist]
- The Terror
- Clark Ashton Smith, Complete
Online Works
- Lucy A. Snyder
- Alexandra Sokoloff, The Harrowing
- Nancy Springer, Apocalypse
- Brian Stableford, The Last Days of the Edge of the
World [" 'Vanity,' said the mirror in tones of mild reproof, 'is
not nice.' "]
- Caroline Stevermer
- A College of Magics
- When the King Comes Home
- Peter Straub
- Charles
Stross [Lovecraftian spy fiction]
- The "Laundry" series
- The Atrocity Archive [Basically
upbeat, with happy ending. The second half
is free online
now, actually.]
- The Jennifer Morgue
- The Fuller Memorandum
- A Colder War
[Deeply horrifying]
- J. R. R. Tolkien, The Lord of the Rings [Does it need
to be said?]
- Henry Turtledove, Between the Rivers
- Catherynne Valente, The Orphan's Tales: In the Night Garden
- Jack Vance [Deserves his own page]
- Carrie Vaughn
- "Kitty Norville" series
- Kitty and the Midnight Hour
- Kitty Goes to Washington
- Kitty Takes a Holiday
- Kitty and the Silver Bullet
- Kitty and the Dead Man's Hand
- Kitty Raises Hell
- Kitty's House of Horrors
- Kitty Goes to War
- After the Golden Age
- Paula Volsky [Volsky might --- a bit unfairly --- be called a poor
person's Jack Vance. Vance writes this sort of thing better, and has for about
as long as Volsky's been alive, but she's still quite good, and, besides, what
is one to do in the intervals between acquiring new books by Vance?]
- The Curse of the Witch-Queen
- The Gates of Twilight
- The Grand Ellipse
- Illusion
- The Luck of Relian Kru
- The White Tribunal
- The Wolf of Winter
- A trilogy (with no title that I know of) set in Lanthi Ume
(roughly, Venice with the government of Naples, plus magicians):
- The Sorceror's Lady
- The Sorceror's Heir
- The Sorceror's Curse
- Lawrence Watt-Evans
- The Lords of Dûs
- The Lure of the Basilisk
- The Seven Altars of Dûsarra
- The Sword of Bheleu
- The Book of Silence
- The Misenchanted Sword
- Touched by the Gods
- Martha Wells
- City of Bones
- The Wheel of the Infinite
- A series of loosely connected novels in a common universe:
- The Element of Fire [now
free online]
- The Death of the Necromancer
- The Wizard Hunters
- The Ships of Air
- The Gate of Gods
- Elizabeth Willey
- A Sorceror and a Gentleman
- The Price of Blood and Honor
- A Well-Favored Man
- Tad Williams
- Tailchaser's Song
- Memory, Sorrow, Thorn:
- The Dragonbone Chair
- The Stone of Farewell
- To Green Angel Tower
- Walter Jon Williams [Not what is usually meant by "urban
fantasy," but rather much better: this has brains]
- Metropolitan
- City on Fire
- Gene Wolfe
- The Devil in a Forest
- Free Live Free
- There Are Doors
- Gene Wolfe and Neil Gaiman, A Walking Tour of the
Shambles (volume 16 of Little Walks for Sightseers)
- N. Lee Wood, Bloodrights
- Patricia Wrede [Wrede's books are all more or less pitched at
younger readers (except The Seven Towers). Lucky kids.]
- The Book of Enchantments
- The Seven Towers
- Duology in a Regency England with period squalor, and
scholarly magic:
- Mairelon the Magician
- Magician's Ward
- Lyra books [Now back in print in an omnibus, Shadows
over Lyra, except for Raven]:
- Shadow Magic
- Daughter of Witches
- The Harp of Imach Thyssel
- The Raven Ring
- Roger Zelazny
- The first Amber series (of five books) is the only one
worth bothering with, and even then the first two books are by far the best:
- Nine Princes in Amber
- The Guns of Avalon
- A Night in Lonesome October
- The Unicorn Variations
- Roger Zelazny and Jane Lindskold, Lord Demon
#
Fri, 03 Feb 2012
Dynamic Stochastic General Equilibrium Models in Macroeconomics (DSGEs)
Pretend that the national economy consists of a single person, the
"representative agent". This agent owns all the goods, especially all the
capital goods, and does all the work in the economy. The agent is greedy for
material consumption, and lazy. To consume, which it likes, it must produce,
which is a matter of indifference, except that to produce it must work, which
it dislikes. If it produces more now than it consumes, it can save the
difference as capital goods, which make its future labor more productive.
There are also shocks to "technology", i.e., to how effectively it can use
capital to turn labor into consumption goods; rather bizarrely, these shocks
are both negative and positive, which means that it regularly forgets
productive technologies, and not because better replacements have come along.
In addition to being greedy and lazy, the agent is is determined to act now
so as to maximize not present utility, but the discounted future stream of
utility at all times (since it is also immortal). Fortunately, it is
incredibly foresighted, and knows the exact distribution of future shocks to
technology. (This distribution is not changed by anything the agent does; or,
if you like, it always acts in such a way that its expectations are exactly
fulfilled.) Possessing unlimited cognitive resources, it is easy for the agent
to solve the resulting dynamic programming
problem optimally. This will not lead to a smooth pattern of production,
investment and consumption; if, for instance, there is a big negative shock to
technology, and shocks are persistent, it becomes rational to slack off now,
and enjoy leisure; extra work will be more rewarded later when the agent will
have remembered how to do stuff. These fluctuations are, supposedly, the
fluctuations of the macroeconomy, the business cycle.
I have sketched this sort of model in a deliberately hostile way, because I
think such things are remarkably silly. But many very eminent economists
regard them very highly indeed. Mostly I think this reflects badly on the
discipline of macroeconomics, but it does raise some interesting technical
problems, like:
- How do people evaluate the fit of these models?
- Do those evaluation methods make any sense?
- How many of the parameters of these models are actually identifiable?
- How much data would it take to estimate one of these models to a given degree of precision?
- How much data would it take to detect that one of these models was wrong?
- How well would these models fit in-sample if they were wrong about the structure of the economy?
(You might well ask "where is the equilibrium, let alone the general
equilibrium, in a model with one agent and no trade?" You might very well ask
that.)
Recommended, examples and textbooks:
- Bent Jesper Christensen and Nicholas M. Kiefer, Economic
Modeling and Inference [Review: An Optimal Path to a Dead End.]
- David N. DeJong and Chetan Dave, Structural
Macroeconometrics [Review]
- Finn E. Kydland and Edward C. Prescott, "Time to Build and
Aggregate Fluctuations", Econometrica 50
(1982): 1345--1370 [JSTOR. Pretty much the origins of the approach, and of "real business cycle theory".]
- Frank Smets and Rafael Wouters, "Shocks and Frictions in US
Business Cycles: A Bayesian DSGE Approach", American Economic
Review 97 (2007): 586--606 [Perhaps the best-regarded
current DSGE of the US
economy. Preprint
version]
Recommended, statistical aspects (in addition to the books by Christen and Kiefer, and by DeJong and Dave):
- Marco Del Negro, Frank Schorfheie, Frank Smets and Rafael
Wouters, "On the Fit of New Keynesian Models", Journal of Business and Economic Statistics 25 (2007): 123--162 [Including discussion and reply]
- Rochelle M. Edge and Refet S. Gurkaynak, "How Useful are Estimated
DSGE Model Forecasts?", ssrn/1810075 [Recommended for actually
going through the exercise of comparing out-of-sample forecasts, and including
simple baseline models. But the methodological ideas here are suspect. It is
true that there is not much to predict about an in-control system, and what is
happening is largely random and so unpredictable, so that even the true model
would show low forecasting ability. The question however is why we are supposed
to think that the DSGE does give us good information about
counterfactuals. If you could show that it had much better predictive
performance than baselines like constants or random walks
during out-of-control periods, that would be something; but they
don't.]
- Lars Peter Hansen and James J. Heckman, "The Empirical
Foundations of Calibration", Journal of Economic Perspectives
10 (1996): 87--104 [Or, rather, the lack thereof.
JSTOR]
- Katarina Juselius and Massimo Franchi, "Taking a DSGE Model to the
Data
Meaningfully", Economics 1
(2007): 4 [There are places where Juselius and Franchi write as though a
VAR, or co-integrated VAR, were automatically
a sufficient statistic for any time
series. This I think is mere minor carelessness; the general strategy here, of
seeing what implications the generative model has for phenomenological models,
and testing those implications, is quite sound (it's obviously close
to indirect inference). It is
particularly interesting to try to translate specific pieces of the
generative model to specific observable hypotheses. — The fact that the
DSGE model is a miserable failure at matching the data is, of course, just a
bonus.]
- Ivana Komunjer and Serena Ng, "Dynamic Identification of
DSGE Models" [Preprint available via Prof. Komunjer]
- Mark Watson, "Measures of Fit for Calibrated Models", Journal
of Political Economy 101 (1993): 1011--1041
Recommended, criticisms:
- Willem Buiter, "The unfortunate uselessness of most `state of the art' academic monetary economics", Financial Times 3 March 2009 [The point about needing to impose the transversality condition is particularly interesting]
- Ricardo J. Caballero, "Macroeconomics after the Crisis: Time to Deal with the Pretense-of-Knowledge Syndrome", SSRN/1683617
- David Colander, Peter Howitt, Alan Kirman, Axel Leijonhufvud and Perry Mehrling, "Beyond DSGE Models: Towards an Empirically-Based Macroeconomics"
[PDF preprint]
- Alan Kirman, "Whom or What Does the Representative Individual
Represent?", Journal of Economic Perspectives 6
(1992): 117--136
[Answer: No one and nothing; accordingly it "deserves to be buried". JSTOR]
- James Morley, "The Emperor Has No Clothes", Macro Focus 5:2 (March 2010) [PDF]
- Robert Solow, "The State of
Macroeconomics", Journal
of Economic Perspectives 22 (2008): 243--249 ["the
claim that `modern macro' somehow has the special virtue of following the
principles of economic theory is tendentious and misleading... The other
possible defense of modern macro is that, however special it may seem, it is
justified empirically. This too strikes me as a delusion."]
- Lawrence H. Summers, "Some Skeptical Observations on
Real Business Cycle Theory" [1986; PDF]
Recommended, miscellaneous:
- James K. Galbraith, Olivier Giovanni and Ann J. Russo, "The
Fed's Real Reaction Function: Monetary Policy, Inflation, Unemployment,
Inequality — and Presidential Politics" [Not directly about DSGEs, but
since so many of them incorporate some version of the Taylor rule, it amuses me
to think about writing one using this
instead. University of Texas Inequality
Project working
paper 42, 2007]
- Herbert Gintis, "The Dynamics of General Equilibrium",
The Economic Journal 117 (2007): 1280--1309
[PDF reprint courtesy of Prof. Gintis]
- Aapo Hyvärinen, Kun Zhang, Shohei Shimizu, Patrik O. Hoyer, "Estimation of a Structural Vector Autoregression Model Using Non-Gaussianity", Journal of Machine Learning Research 11
(2010): 1709--1731
To read:
- George-Marios Angeletos and Jennifer La'O, "Animal Spirits"
[PDF. From a quick
examination, epi-cycle adding --- or, rather, shock-adding.]
- Jean Boivin and Marc P. Giannoni, "DSGE Models in a Data-Rich Environment" [PDF preprint]
- Fabio Canova and Luca Sala, "Back to square one: identification issues in DSGE models" [PDF preprint]
- David Colander (ed.), Post Walrasian Macroeconomics: Beyond the Dynamic Stochastic General Equilibrium Model [blurb]
- Olivier Coibion and Yuriy Gorodnichenko, "Strategic Interaction among Heterogeneous Price-Setters in an Estimated DSGE Model", The Review of Economics and Statistics 93 (2011): 920--940
- Roger E. A. Farmer, Expectations, Employment and Prices
[]
- Lance Taylor, Maynard's Revenge: The Collapse of Free Market Macroeconomics [Blurb]
To write:
- Co-conspirators to be named later + CRS, "Your Favorite DSGE
Sucks"
#
Mon, 23 Jan 2012
Literary Criticism and Theory of Criticism
There are a great many books to read; there are many place to travel to.
Travellers are often much better for advice --- where to go, where to avoid,
what to know and what to do to get the most out of their trip. It is my humble
opinion that works of literary criticism are the travel books of the written world --- sometimes
guides (and it is, for instance, a rash traveller who visits Wallace Stevens without one), sometimes
reportage, travelogues, impressions. This is, or can be, a worthwhile
enterprise, but it does not sound like one which needs or would benefit from a
vast and obscure body of theory, nor one whose successful practioners are
likely to be able theorists.
What, then, accounts for the current deluge of theory of criticism, as
opposed to criticism proper (and as opposed to critical theory, a different beast
altogether)? I have no idea, but I feel licensed by the subject matter to
speculate as to the causes.
- Disaffection. Mencken observed seventy or eighty years
ago: "Every now and then, a sense of the futility of their daily endeavors
falling suddenly upon them, the critics of Christendom turn to a somewhat sour
and depressing consideration of the nature and objects of their own craft."
This however merely backs things up one stage: why should critics feel that
criticism is not enough, and practice it? Failing to practise criticism, why
don't they give up and become actual novelists, poets, etc.? (Frank
Lentricchia has finally taken this honorable course.)
- Vicious cycle. Suppose that, for whatever reason, theory
of criticism came to be prized more highly than criticism itself. Then it
would be to the benefit of fledgling literary scholars to turn to theory, and
to continue to place a high value upon it. (This last is important, since the
study of literature, at least in the West, is close to self-governing.) Selection can take it from there, though that is
not a guarantee that the result will be sustainable. Obvious query:
why should theory be more valued than criticism? Second obvious query: what
are the coefficients of selection?
- Professional deformation. During this century, and
especially since the Second World War, criticism, and literary culture
generally, have migrated into academia in the most striking way. The qualities
needed by a good critic --- "intelligence, toleration, wide information,
genuine hospitality to ideas," to keep with Mencken --- are hard to inculcate
in a lecture or seminar, and make very poor dissertation material. But theory
of criticism, however appalling (perhaps especially if appalling) can be
lectured on and debated endlessly and published. (And cited.
Criticism of, say, Milton, is unlikely to be cited by anyone but other Milton
scholars; but theory of criticism can be cited by other theorists and by
critics.) Because they no longer need appeal to any public other than
themselves, the usual concentration of mutants and anomalies found in small,
in-bred populations may be expected.
- Physics Envy. Modesty forbids me to elaborate on this.
- Spirit of the Age. It has sometimes been claimed that
"we" are now much more self-conscious and reflexive than our predecessors.
This would seem to fit with critics preferring to theorize about criticism to
criticizing, but the exact relationship is obscure. Would the general increase
in self-consciousness explain the shift to theory, or would the shift
be part of what is meant by the general increase in self-consciousness?
But at this point a doubt arises. Has higher-order writing grown
faster than direct, first-order literature or its immediate, second-order
criticism? I know of no statistics on this, so I made some very crude ones of
my own, by counting the number of titles in the UW-Madison on-line catalog
assigned to various Library of Congress call numbers. Books in the category
PN, which are about literature in general, grew at 4.1 +- 0.2 percent
between January 1950 and April 1998; the PS, PR and PZ categories, which
roughly comprise literature in English (with some translations in PZ, and
criticism in PS and PR) at only 2.9 +- 0.1 percent. By way of comparison, the
QC category, which is (almost all of) physics grew at 4.8 +- 0.4, and the
combination of PG, PQ and PT (literature in modern European languages other
than English) at 3.9 +- 0.2. (The numbers are from a least-squares fit to a
simple exponential curve, so the error bars should be taken with grains of
salt.) The growth of non-English literature is probably mostly a change in our
acquisition policy, but the difference between English literature and writing
about literature is clearly statistically significant. Going from the number
of books to the number of writers and so to something like relative fitnesses
for different sorts of literary writers would, however, be pretty difficult.
(Thanks to Jason Hsu for pointing out an unfortunate ambiguity of wording.)
At some point I should use this space to record some thoughts about what a
natural history of literature would look like, and how it would differ from
hitherto-existing literary criticism; but really I should be working now, and
you can probably figure out what I'd say from my contribution to
the
Valve's symposium on
Moretti. At the same time, because I seem to have been unclear about this,
I should emphasize that I don't think that sort of natural history is
the only sort of literary scholarship, much less the only sort of
literary criticism, worth pursuing.
See also
Analogy and Metaphor;
Books and Their History;
Cognitive Science;
Cultural Criticism;
Epics and Oral Poetry;
Fantasy;
Sigmund Freud and Psychoanalysis;
Intellectual Standards and
Competence;
Intellectuals;
Linguistics;
Modernity and All That;
Mysteries;
Myths;
Narratives;
Novels;
Poetry, Poets;
Postmodernism, Poststructuralism, etc.;
the Romanticists;
Rhetoric;
Science Fiction;
Semiotics;
Structuralism;
Universal signs, images and symbols
Recommended:
- Aristotle, Poetics
- Marissa Bortolussi and Peter Dixon, Psychonarratology:
Foundations for the Empirical Study of Literary Response [Experimental
cognitive psychology applied to analyzing
questions of narrative theory and response to
literature. This is brilliant; why doesn't everyone know about it?]
- Brophy, Levey and Osborne, Fifty Works of English (and
American) Literature We Could Do Without [Amusing, even if you don't
agree with all their choices.]
- Kenneth Burke, The Philosophy of Literary Form
- A. S. Byatt, Passions of the Mind
- John Carey, What Good are the Arts?
- Steven Cassedy, Flight from Eden: The Origins of Modern
Literary Criticism and Theory [Online. --- It's not
relevant to his argument, but, contrary to what he says on p. 137, the integers
are not a group under multiplication (merely a monoid). Otherwise I can't
detect any errors of fact.]
- Frederick Crews
- The Critics Bear It Away: American Fiction and the
Academy ["The New Americanists" and
"Whose American
Renaissance?" are on-line]
- The Pooh Perplex
- Postmodern Pooh
- Skeptical Engagements
- Out of My System: Psychoanalysis, Ideology, and
Critical Method
- John M. Ellis, The Theory of Literary Criticism: A Logical
Analysis [An extremely appealing account by somebody who actually knows
something about logic, methodology, etc., and cares about literature.
In particular, he's convinced me that literature consists of those texts which
people use in a certain way, and while some properties of
those texts make them more or less well-suited to that role, it's the usage
that's defining, not the properties. I'm less completely convinced he's
correctly characterized that usage, and much of the rest of his argument rests
on that. But he actually argues rigorously from his definitions! —
More comments along these lines.]
- E. D. Hirsch, The Aims of Interpretation [See comments
under Interpretation.]
- John
Holbo, "The Advantages and Disadvantages of Theory for Life" [PDF]
- Frank Lentriccha, "Last Will and Testament of an Ex-Literary
Critic", Lingua Franca, Sept.--Oct. 1996 [Now reprinted in
the Lingua Franca anthology, Quick Studies]
- John Leonard [Literary editor for The Nation. Reading his essays
makes me sick with envy and despair, but I can't help myself, because they're
so ridiculously well-done.]
- This Pen for Hire [Reviews from when he worked
for the New York Times, which either cramped his style a bit or
before he really found it]
- The Last Innocent White Man in America and Other
Writings [The mature style begins here, it seems]
- Smoke and Mirrors: Violence, Television and Other
American Cultures
- When the Kissing Had to Stop: Cult Studs, Khmer Newts, Langley Spooks, Techno-Geeks, Video Drones, Author Gods, Serial
Killers, Vampire Media, Alien Sperm-Suckers, Satanic Therapists, and Those of
Us Who Hold a Left-Wing Grudge in the Post Toasties New World
Hip-Hop
- Lonesome Rangers: Homeless Minds, Promised Lands,
Fugitive Cultures
- John Livingston Lewes, The Road to Xanadu
- Carol Lloyd, I Was Michel
Foucault's Love Slave
- H. L. Mencken, "Criticism of Criticism
of Criticism"
- Franco Moretti [Struggling towards a naturalistic, evolutionary
theory of literature --- a "materialist sociology of literary forms," as he
says. I think his efforts in this direction are entirely laudable, but, at
least in the books below, he's still too much given to type-thinking (as
opposed to population thinking), to psychoanalysis, and to
teleology. --- Graphs, Maps, Trees, published 2005, collects
three recent articles from New Left Review, and largely corrects
these deviations from proper materialist thinking, though he is, in places,
excessively tactful towards those who still are in the thrall of error.]
- Atlas of the European Novel,
1800--1900 [Review:
One Effort More, Litterateurs, if You Would Be Empiricists!]
- Graphs, Maps, Trees: Abstract Models for Literary
History [Discussed at great length in my
weblog.]
- Modern Epic: The World-System from Goethe to
Garcia Marquez
- Signs Taken for Wonders: Essays on the Sociology of
Literary Forms
- "The Slaughterhouse of Literature", Modern
Language Quarterly 61 (2000): 207--227
- Ezra Pound, The ABC of Reading [Pound is frustrating,
because he has some interesting and insightful things to say, mixed from page
to page --- even from sentence to sentence --- with rubbish. Fortunately, here
the rubbish (about, e.g., the nature of Chinese characters) is entertaining,
and his fascism is not on display in this book.]
- I. A. Richards
- Practical Criticism: A Study of Literary
Judgment
- Principles of Literary Criticism
- Science and Poetry
- Alan Richardson, Literature, Cognition and the
Brain --- includes a very useful annotated
bibliography
- Levin L. Schücking, The Sociology of Literary
Taste [The specific examples are both German and dated; but with a
little search-and-replace it becomes universal.]
- Herbert Simon, "Literary Criticism: A
Cognitive Approach," Stanford Humanities Review, 1994 [My
discussion, with links]
- René Wellek, Concepts of Criticism [Paper
collection. The attempts at periodization are especially interesting, though
ultimately they leave me unpersuaded that one can identify dominant
clusters of ideas in the way Wellek proposes. Similarly, the attack on
"evolutionism" only convinces me of the awful lack of understanding of
evolution among Wellek's literary-theoretical predecessors.]
- René Wellek and Austin Warren, Theory of
Literature [Sensible and acute, but they give a hopelessly circular
definition of the function and value of literature.]
To read:
- M. H. Abrams, The Mirror and the Lamp: Romantic Theory and
the Critical Tradition.
- Auerbach, Mimesis: The Representation of Reality in Western
Literature
- Bakhtin
- Mark Bauerlein, Literary Criticism: An Autopsy
[Recommended, without comment, by John Holbo. I've only read the introduction
so far, and it inspires very mixed feelings --- I alternate, sometimes within a
single sentence, between "right on!", "this needs saying?" and "that's just
wrong". Also, I cannot determine if Bauerlein gets his underlying epistemology
from Althusser, or a certain strand of American pragmatism...]
- Michael Bérubé [Has a nice blog, and some very agreeable essays, so I feel like
reading his actual work.]
- The Employment of English: Theory, Jobs, and the
Future of Literary Studies
- Public Access: Literary Theory and American Cultural
Politics
- What's Liberal About the Liberal Arts?
- Wayne C. Booth, The Rhetoric of Fiction
- A. S. Byatt, On Histories and Stories
- David Carroll, French Literary Fascism: Nationalism,
Antisemitism, and the Ideology of Culture [I had a copy of this
very interesting book, but it was destroyed by the post office. Blurb]
- Antoine Compagnon, Literature, Theory, and Common
Sense [Blurb]
- John
Constable
- David Damrosch, What Is World Literature? [Blurb, introduction]
- Peter Dear (ed.), The Literary Structure of Scientific
Argument: Historical Studies
- Mark Edmundson, Literature Against Philosophy, Plato to
Derrida: A Defense of Poetry
- John M. Ellis, Literature Lost: Social Agendas and the
Corruption of the Humanities
- Raplh Ellison, Shadow and Act
- Empson
- Seven Types of Ambiguity
- Some Versions of Pastoral
- Angus Fletcher, Allegory: The Theory of a Symbolic
Mode
- Donald Freeman, Cognitive Metaphor
and Literary Theory: Towards the New Philology
- Girard, Fiction and Diction
- John Guillory, Cultural Capital: The Problem of Literary
Canon Formation
- Giles Gunn, The Culture of Criticism and the Criticism of
Culture
- Stephen Halliwell, Aesthetics of Mimesis: Ancient Texts and Modern Problems
- Geoffrey G. Harpham, Shadows of Ethics: Criticism
and the Just Society
- Jerry R. Hobbs, Literature and Cognition [Fulltext as a free
PDF. Large, because it's scanned images, rather than electronically-set
text. But very cool!]
- Patrick Colm Hogan, Empire and Poetic Voice: Cognitive and
Cultural Studies of Literary Tradition and Colonialism
- Norman N. Holland, The Brain of Robert Frost: A Cognitive
Approach to Literature
- Irving Howe
- A Critic's Notebook
- Politics and the Novel
- Selected Writings, 1950--1990
- Stanley Edgar Hyman
- The Armed Vision
- The Tangled Bank
- Charles Kaplan, The Overwrought Urn: A Potpourri of Parodies
of Critics Who Triumphantly Present the Real Meaning of Authors from
Jane Austen to J. D. Salinger
- Paisley Livingston, Literary Knowledge: Humanistic Inquiry
and the Philosophy of Science
- Deidre Shauna Lynch, The Economy of Character: Novels, Market
Culture, and the Business of Inner Meaning
- Nick Montfort, Twisty Little Passages: An Approach to
Interactive Fiction [Blurb]
- Franco Moretti [NLR = New Left Review]
- "Conjectures on World
Literature", NLR 1 (2000): 54--68 [online]
- "More Conjectures", NLR 20
(2003): 73--81 [online]
- "Markets of the Mind", NLR 5
(2003): 111--115
- "MoMA 2000: The
Capitulation", NLR 4 (2000): 98--102
- "New York Times Obituaries", NLR
2 (2000): 104--108
- "Planet Hollywood", NLR 9
(2001): 90--101
- The Way of the World
- Franco Moretti (ed.), The Novel [Multi-volume survey:
five volumes in the Italian original, but we're only getting a two-volume
selection in English
translation. Blurb and
samples for volume
1; for volume 2]
- William Paulson, Literary Culture in a World Transformed: A
Future for the Humanities
- John Press, The Chequer'd Shade: Reflections on Obscurity in
Poetry
- Kenneth Quinn, How Literature Works [modest, ain't
he?]
- Stephen Ramsay, Reading Machines: Toward an
Algorithmic Criticism [Blurb]
- Phil Roberts, How Poetry Works [More modesty]
- William Elford Rogers, Interpreting Interpretation: Textual
Hermeneutics as an Ascetic Discipline
- Shawn James Rosenheim, The Cryptographic Imagination
- Edward Said, Orientalism
- Gordon E. Slethaug, Beautiful Chaos: Chaos Theory and
Metachaotics in Recent American Fiction
- George Steiner [James Wood's marvellous polemic against Steiner,
"Toppling the Monument" (Prospect, no. 14, December 1996) is sadly
no longer available on-line. But it's dead on.]
- No Passion Spent
- Errata: An Examined Life
- Peter Steiner, Russian Formalism: A Metapoetics
- Peter Stockwell
- Cognitive Poetics: An Introduction
- The Poetics of Science Fiction
- Robert Storey, Mimesis and the Human Animal: On the
Biogenetic Foundations of Literary Representation
- Peter Swirski, Between Literature and Science: Poe, Lem, and
Explorations of Aesthetics, Cognitive Science, and Literary Knowledge
- Tzvetan Todorov, Literature and Its Theorists
- Mark Turner
- The Literary Mind
- Reading Minds
- René Wellek, Discriminations
- Edmund Wilson
- Classics and Commercials
- A Piece of My Mind
- The American Earthquake: A Chronicle of the
Roaring Twenties, the Great Depression, and the Dawn of the New Deal
- Martha Woodmansee, The Author, Art and the Market: Rereading
the History of Aesthetics
- Virginia Woolf
- The Common Reader
- The Second Common Reader
- Lisa Zunshine, Strange Concepts and the Stories They Make Possible: Cognition, Culture, Narrative [Blurb]
- Lisa Zunshine (ed.), Introduction to Cognitive Cultural Studies [Blurb]
#
Graphical Models
[Update, 12 March 2010: On re-reading I am less than happy with this, because
I have come appreciate the uses of graphical models in non-causal
modeling more, and this slights them unnecessarily. I will try to re-write
this soon.]
A.k.a. causal models, causal graphs, Bayes graphs, Bayes networks, Bayesian
networks. (Here "Bayes" is a metonym for "conditional probability". There are
perfectly good frequentist interpretations of these models.) I'm sticking
latent-variable and path-analysis models in here, too, because they all pretty
much work the same way.
Everyone who takes basic statistics has it drilled into them that
"correlation is not causation." (When I took psych. 1, the professor said he
hoped that, if he were to come to us on our death-beds and prompt us with
"Correlation is," we would all respond "not causation.") This is a problem,
because one can infer correlation from data, and would like to be able
to make inferences about causation. There are typically two ways out of this.
One is to perform an experiment, preferably a randomized double-blind
experiment, to eliminate accidental sources of correlation, common causes, etc.
That's nice when you can do it, but impossible with supernovae, and not even
easy with people. The other out is to look for correlations, say that of
course they don't equal causations, and then act as if they did anyway. The
technical names for this latter course of action are "linear regression" and
"analysis of variance," and they form the core of applied quantitative social
science, e.g., The Bell Curve.
Graphical models are, in part, a way of escaping from this impasse.
The basic idea is as follows. You have a bunch of variables, and you want
to represent the causal relationships, or at least the probabilistic
dependencies, between them. You do so by means of a graph. Each node in the
graph stands for a variable. If variable A is a cause of B, then an arrow runs
from A to B. If A is a cause of B, we also say that A is one of B's
parents, and B one of A's children. If there is a causal path
from A to B, then A is an ancestor of B, and B is a
descendant of A. If a variable has no parents in the graph, it is
exogenous, otherwise it is endogenous.
Part of what we mean by "cause" is that, when we know the immediate causes,
the remoter causes are irrelevant --- given the parents, remoter ancestors
don't matter. The standard example is that applying a flame to a piece of
cotton will cause it to burn, whether the flame came from a match, spark,
lighter or what-not. Probabilistically, this is a conditional indepedence
property, or a Markov property: a variable is independent of its ancestors
conditional on its parents. In fact, given its parents, its children, and its
childrens' other parents, a variable is conditionally independent of all other
variables. This is called the graphical or causal Markov property. When this
holds, we can factor the joint probability distribution for all the variables
into the product of the distribution of the exogenous variables, and the
conditional distribution for each endogenous variable given its parents.
(You may be wondering what happens if A is a parent of B and B is a parent
of A, as can happen when there is feedback between the variables. This leads
to difficulties, traditionally dealt with by explicitly limiting the discussion
to acyclic graphs. I shall follow this wise precedent here.)
Now, there are certain rules which let us infer conditional independence
relations from each other. For instance, if X is independent of the
combination of Y and W, given Z, then X is indepdent of Y alone given Z. So,
if we have a graph which obeys the causal Markov condition, there are generally
other conditional independence relations which follow from the basic ones. If
these are the only conditional indepences which hold in the distribution, it is
said to be faithful to the graph (or vice versa); otherwise it is
unfaithful. For a graph to be Markov and unfaithful, there must (as it were)
be an elaborate conspiracy among the conditional distributions, so elaborate
that it will generally be destroyed by any change in any of those
distributions. So faithfulness is a robust property.
This may sound pretty arcane, but that's just because it is arcane.
The point, however, is that if you can make the three assumptions above (no
causal cycles, Markov property, faithfulness), you're in business in a really
remarkable way. There are very powerful statistical techniques that will let
you infer the causal structure connecting your variables. This comes in two
flavors. One is the Bayesian way: cook up a prior distribution over all
possible causal graphs; compute the likelihood of the data under each graph;
update your distribution over graphs; iterate. This is generally
computationally intractable, assuming you can come up with a meaningful prior
in the first place. The other approach is to use tests for conditional
independence to eliminate possible connections between variables, and so to
narrow down the range of candidate structures; it is basically frequentist, and
can be shown, under a broad range of circumstances, to be asymptotically
reliable.
Once you have your causal graph --- whether through estimation or through
simply being handed one --- you can do lots of great things with it, like
predict the effects of manipulating some of the variables, or make backward
inferences from effects to causes. Of course, if the graph is big, doing the
necessary calculations can be very troublesome in itself, and so people work on
approximation methods and even ways of doing statistical inference on models of
statistical distributions...
It's probably obvious I think this is incredibly neat, and even one of the
most important ideas to come out of machine learning. Of course it
doesn't really solve the problem of establishing causal relations, in
the way Hume objected to; it says, assuming there are
causal relations, of a certain stochastic form, and that these are stable, then
they can be learned. But that, and the more general questions of what we ought
to mean by "cause", deserve a notebook of their
own.
Things I want to understand better: frequentist inference procedures.
Computational learning theory for graphical models (the paper by Janzing and
Herrmann is good). How to treat systems with feedback? How to
treat dynamical systems
and time series? How does all of this fit
together with computational
mechanics?
Not even a conjecture. Back in the 1960s, Chow and Liu (reference
below) gave a polynomial algorithm for finding the best approximation to a
global joint probability distribution using only pairwise interactions among
the variables, i.e., the one which minimized the Kullback-Leibler divergence
between the true and the approximating distribution. I have read that
extending this to even three-way interactions is NP, though I don't know if
it's NP-complete. (1) How is the intractability result established? (2) Is
this the same as the computational phase transition one finds in going from
2-SAT to 3-SAT, where the critical point is at two-point-something SAT?
(Presumably the answer to (1) would shed some light on this.) (3) Even if not,
is there an analogous phase transition, perhaps in a different universality
class? (Update in 2009, several years later: Bento and Montanari,
below, sounds relevant, but I haven't read it yet.)
Recommended, more general:
- Clark Glymour, The Mind's Arrows: Bayes Nets and Graphical
Causal Models in Psychology
[Mini-review]
- Michael Irwin
Jordan (ed.), Learning in Graphical Models
- Jordan and Sejnowski (eds.), Graphical Models [Nice
collection of papers from Neural Computation]
- Judea Pearl
- "Causal Inference in Statistics: An Overview", forthcoming
in Statistics Surveys 3 (2009): 96--146
[PDF]
- Causality: Models, Reasoning and
Inference
- Peter Spirtes, Clark Glymour and Richard Scheines, Causation,
Prediction, and Search [Comments]
Recommended, more specialized:
- C. K. Chow and C. N. Liu, "Approximating Discrete Probability
Distributions with Dependence Trees", IEEE Transactions on Information
Theory 14 (1968): 462--467 [An old but very nice result
on how to get the optimal approximation to a global probability distribution
using only pairwise interactions among the variables]
- Finale Doshe-Velez, David Wingate, Joshua Tenenbaum and Nicholas Roy, "Infinite Dynamic Bayesian Networks", ICML 2011 [PDF]
- Ghahramani, "Learning Dynamic Bayesian Networks," in
Giles and Gori (eds.), Adaptive Processing of Sequences and Data
Structures
- Ghahramani and Jordan, "Factorial Hidden Markov Models,"
Machine Learning 29 (1997): 245--273
- Aapo Hyvärinen, Kun Zhang, Shohei Shimizu, Patrik O. Hoyer, "Estimation of a Structural Vector Autoregression Model Using Non-Gaussianity", Journal of Machine Learning Research 11
(2010): 1709--1731
- Dominik Janzing and Daniel J. L. Herrmann, "Reliable and
Efficient Inference of Bayesian Networks from Sparse Data by Statistical
Learning Theory", cs.LG/0309015
- Steffen Lauritzen, Graphical Models [A fairly abstract
probabilistic/mathematical-statistical treatment; elegant, if you're into such
things.]
- Han Liu, John Lafferty and Larry Wasserman, "The Nonparanormal:
Semiparametric Estimation of High Dimensional Undirected Graphs",
Journal of
Machine Learning Research 10 (2009): 2295--2328
= arxiv:0903.0649
- John C. Loehlin, Latent Variable Models: An Introduction to
Factor, Path, and Structural Analysis [An intro. to old-school linear
latent-variable models, especially of the sort used by psychologists. Good in
its own domain, but does not make enough contact with modern graphical models.]
- Eric Mjolsness, "Stochastic Process Semantics for Dynamical Grammar
Syntax: An Overview", cs.AI/0511073
- Christopher J. Quinn, Todd P. Coleman, Negar Kiyavash, "Causal Dependence Tree Approximations of Joint Distributions for Multiple Random Processes", arxiv:1101.5108
- Thomas Richardson
- Pawel Wocjan, Dominik Janzing, and Thomas Beth, "Required
sample size for learning sparse Bayesian networks with many variables," cs.LG/0204052
To read:
- Edoardo M. Airoldi, "Getting started in probabilistic graphical models", arxiv:0706.2040 [Tutorial
aimed at biologists]
- Animashree Anandkumar, Kamalika Chaudhuri, Daniel Hsu, Sham M. Kakade, Le Song, Tong Zhang, "Spectral Methods for Learning Multivariate Latent Tree Structure", arxiv:1107.1283
[This sounds very much like Spearman's "tetrad equations" from 100 years ago!]
- Animashree Anandkumar, Vincent Y.F. Tan, Alan. S. Willsky, "High-Dimensional Gaussian Graphical Model Selection: Tractable Graph Families", arxiv:1107.1270
- Erik Aurell, Magnus Ekeberg, "Inverse Ising inference using all the data", arxiv:1107.3536
["We show that a method based on logistic regression, using all the data, solves the inverse Ising problem far better than mean-field calculations relying only on sample pairwise correlation functions, while still computationally feasible for numbers of nodes in the range of hundreds. The largest improvement in reconstruction occurs for strong interactions..."]
- Francis R. Bach and Michael I. Jordan, "Learning Graphical Models
for Stationary Time Series", UCB Statistics
Tech. Rep. 650
- Onureena Banerjee, Laurent El Ghaoui, Alexandre d'Aspremont, "Model
Selection Through Sparse Maximum Likelihood
Estimation", arxiv:0707.0704
- Jose Bento, Morteza Ibrahimi and Andrea Montanari, "Learning
Networks of Stochastic Differential Equations", NIPS 23 (2010) [PDF]
- Jose Bento, Andrea Montanari, "Which graphical models are difficult to learn?", arxiv:0910.5761
- Danny Bickson, Carlos Guestrin, "Linear Characteristic Graphical Models: Representation, Inference and Applications", arxiv:1008.5325
- David Brillinger, "Remarks Concerning Graphical Models for
Time Series and Point Processes," Revista de Econometria
16 (1996): 1--23 [PS]
- Clive G. Bowsher, "Stochastic kinetic models: Dynamic independence,
modularity and
graphs", Annals
of Statistics
38 (2010): 2242--2281
- Matthias Brocheler and Lise Getoor, "Computing Marginal Distributions over Continuous Markov Networks for Statistical Relational Learning", NIPS 23 (2010) [PDF]
- Elias Chaibub Neto, Mark P. Keller, Alan D. Attie, and Brian S. Yandell, "Causal graphical models in systems genetics: A unified framework for joint inference of causal network and genetic architecture for correlated phenotypes", Annals of Applied Statistics 4
(2010): 320--339
- Venkat Chandrasekaran, Pablo A. Parrilo, Alan S. Willsky, "Latent Variable Graphical Model Selection via Convex Optimization", arxiv:1008.1290
- Michael Chertkov and Vladimir Y. Chernyak
- David Maxwell Chickering, "Optimal Structure Identification
With Greedy Search," Journal of Machine Learning Research
3 (2002): 507--554
- Diego Colombo, Marloes H. Maathuis, Markus Kalisch, Thomas S. Richardson, "Learning high-dimensional directed acyclic graphs with latent and selection variables", arxiv:1104.5617
- Robert G. Cowell, A. Philip Dawid, Steffen L. Lauritzen and David
J. Spiegelhalter, Probabilistic Networks and Expert Systems
- David Cox and Nanny Warmuth, Multivariate Dependcencies: Models, Analysis, and Interpretation
- Mihai Cucuringu, Jesus Puente, David Shue, "Model Selection in Undirected Graphical Models with the Elastic Net", arxiv:1111.0559
- Rainer Dahlhaus, "Graphical interaction models for
multivariate time series," Metrika 51
(2000): 157--172
- Gabriel C. G. de Abreu, Rodrigo Labouriau, David Edwards, "High-dimensional Graphical Model Search with gRapHD R Package", Journal of
Statistical Software 37 (2010), arxiv:0909.1234
- Luis M. de Campos, "A Scoring Function for Learning Bayesian Networks based on Mutual Information and Conditional Independence Tests",
Journal of
Machine Learning Research 7 (2006): 2149--2187
[Sounds damn cool]
- Amir Dembo and Andrea Montanari, "Gibbs Measures and Phase Transitions on Sparse Random Graphs", arxiv:0910.5460
- Amir Dembo, Andrea Montanari, Nike Sun, "Factor models on locally tree-like graphs", arxiv:1110.4821
- Pedro Domingos and Daniel Lowd, Markov Logic: An Interface Layer for Artificial Intelligence
- Vanessa Didelez, "Graphical models for marked point processes based on local independence", arxiv:0710.5874
- Mathias Drton, Rina Foygel, and Seth Sullivant, "Global identifiability of linear structural equation models", Annals of
Statistics 39 (2011): 865--886
- Michael Eichler
- Seif Eldawlatly, Yang Zhou, Rong Jin
and Karim G. Oweiss, "On the Use of Dynamic Bayesian Networks in Reconstructing Functional Neuronal Networks from Spike Train Ensembles", Neural Computation 22 (2010): 158--189
- Gal Elidan, Iftach Nachman and Nir Friedman, "'Ideal Parent'
Structure Learning for Continuous Variable Bayesian
Networks", Journal of
Machine Learning Research 8 (2007): 1799--1833
- Sergi Elizalde and Kevin Woods, "Bounds on the number of inference
functions of a graphical
model", math.CO/0610233
- Juan Ferrándiz, Enrique F. Castillo and Pilar Sanmartin,
"Temporal aggregation in chain graph models", Journal of
Statistical Planning and Inference 133 (2005):
69--93
- G. David Forney, Jr., Pascal O. Vontobel, "Partition Functions of Normal Factor Graphs", arxiv:1102.0316
- Freedman, "On Specifying Graphical Models for Causation,"
UCB Stat. Tech. Rep. 601 [abstract, pdf]
- Frey, Graphical Models in Machine Learning and Data
Communication
- Roland Fried and Vanessa Didelez, "Latent variable analysis and
partial correlation graphs for multivariate time series", Statistics and
Probability Letters 73 (2005): 287--296
- Cyril Furtlehner, Jean-Marc Lasgouttes, Arnaud De La Fortelle,
"Belief Propagation and Bethe approximation for Traffic Prediction",
physics/0703159
- Dan Geiger, David Heckerman, Henry King, and Christopher
Meek, "Stratified exponential families: Graphical models and model selection",
Annals of
Statistics 29 (2001): 505--529
- Christophe Giraud, "Estimation of Gaussian graphs by model
selection", arxiv:0710.2044
- Glymour and Cooper (eds.), Computation, Causation and
Discovery
- Vibhav Gogate, William Austin Webb, and Pedro Domingos, "Learning
Efficient Markov Networks", NIPS 23 (2010) [PDF]
- Jorge Goncalves and Sean Warnick, "Dynamical Structure Functions
for the Estimation of LTI Networks with Limited Information", q-bio.MN/0610008
[LTI = "linear, time-invariant"]
- Green, Hjort and Richardson (eds.), Highly Structured
Stochastic Systems
- Vikas Hamine and Paul Helman, "Learning Optimal Augmented Bayes
Networks", cs.LG/0509055
- Patrick L. Harrington, Jr., and Alfred O. Hero III, "Spatio-Temporal Graphical Model Selection", arxiv:1004.2304
- Alain Hauser, Peter Bühlmann, "Characterization and Greedy Learning of Interventional Markov Equivalence Classes of Directed Acyclic Graphs", arxiv:1104.2808
- Raymond Hemmecke, Silvia Lindner, Milan Studeny, "Learning restricted Bayesian network structures", arxiv:1011.6664
- Holger Höfling and Robert Tibshirani, "Estimation of Sparse
Binary Pairwise Markov Networks using
Pseudo-likelihoods", Journal of
Machine Learning Research 10 (2009): 883--906
- Shiro Ikeda, Toshiyuki Tanaka and Shun-ichi Amari, "Stochastic
Reasoning, Free Energy, and Information
Geometry", Neural
Computation 16 (2004): 1779--1810
- Manfred Jaeger & co., Primula [Java implementation
of a modeling language for relational Bayesian networks; released under GPL]
- Ali Jalali, Chris Johnson, Pradeep Ravikumar, "On Learning Discrete Graphical Models Using Greedy Methods", arxiv:1107.3258
- Markus Kalisch and Peter Bühlmnann
- Mladen Kolar, Eric P. Xing, "Estimating Networks With Jumps", arxiv:1012.3795
- Nicole Kraemer, Juliane Schaefer, Anne-Laure Boulesteix,
"Regularized estimation of large-sacle gene association networks using
graphical Gaussian
models", arxiv:0905.0603
- John Lafferty, Han Liu, Larry Wasserman, "Sparse Nonparametric Graphical Models", arxiv:1201.0794
- Sanjiang Li, "Causal models have no complete axiomatic
characterization", arxiv:0804.2401
- Shai Litvak and Shimon Ullman, "Cortical Circuitry Implementing Graphical Models", Neural Computation
21 (2009): 3010--3056 [or rather, models of cortical circuitry implementing graphical models]
- Han Liu, Xi Chen, John Lafferty, Larry Wasserman, "Graph-Valued Regression", arxiv:1006.3972
- Han Liu, John Lafferty and Larry Wasserman, "Tree
Density Estimation", arxiv:1001.1557
- Han Liu, Kathryn Roeder, Larry Wasserman, "Stability Approach to Regularization Selection (StARS) for High Dimensional Graphical Models", arxiv:1006.3316 == NIPS 23 (2010)
- Han Liu, Min Xu, Haijie Gu, Anupam Gupta, John Lafferty, Larry Wasserman, "Forest Density Estimation", Journal of Machine
Learning Research 12 (2011): 907--951
- Stephen Luttrell, "Adaptive Cluster Expansion (ACE): A Hierarchical
Bayesian Network", cs.NE/0410020
- Marloes H. Maathuis, Markus Kalisch, Peter Bühlmann, "Estimating high-dimensional intervention effects from observational data", Annals
of Statistics 37 (2009): 3133--31654, arxiv:0810.4214
- Dhafer Malouche and Sylvie Sevestre-Ghalila, "Estimating High
dimensional faithful Gaussian graphical Models :
uPC-algorithm", arxiv:0705.1613
- Giovanni M. Marchetti, Nanny Wermuth, "Matrix representations and independencies in directed acyclic graphs", arxiv:0904.0333
- Lilyana Mihalkova, Lise Getoor, "Lifted Graphical Models: A Survey", arxiv:1107.4966
- Eric Mjolsness, "Labeled graph notations for graphical models", UCI
School of Information and Computer science Technical Report 04-03 [PDF]
- Andrea Montanari, "Graphical Models Concepts in Compressed Sensing", arxiv:1011.4328
- Jennifer Neville and David Jensen, "Relational Dependency Networks",
Journal
of Machine Learning Research 8 (2007): 653--692
- Lior Pachter and Bernd Sturmfels
- Alessandro Pelizzola, "Cluster variation method in statistical
physics and probabilistic graphical models", Journal of Physics
A: Mathematical and General 38 (2005): R309--R339
= cond-mat/0508216
- Jose M. Pena, "Reading Dependencies from Covariance Graphs", arxiv:1010.4504
- Tapani Raiko, Harri Valpola, Markus Harva and Juha Karhunen,
"Building Blocks for Variational Bayesian Learning of Latent Variable
Models", Journal
of Machine Learning Research 8 (2007): 155--201
- Johannes Rauh, Nihat Ay, "Robustness and Conditional Independence Ideals", arxiv:1110.1338
- Pradeep Ravikumar, Martin J. Wainwright, and John D. Lafferty,
"High-dimensional Ising model selection using $\ell_1$-regularized logistic
regression", Annals
of Statistics 38 (2010):
1287--1319, arxiv:0804.4202,
also arxiv:1010.0311
- Marco Reale, A Graphical Modelling Approach to Time
Series, Ph.D. thesis, Lancaster University, 1998 [Reale's
website]
- Marco Reale and Granville Tunnicliffe Wilson
- "Identification of vector AR models with recursive
structural errors using conditional independence graphs"
- "The Sampling Properties of Conditional Independence Graphs
for Structural Vector Autoregressions"
- T. Rizzo, B. Wemmenhove, H.J. Kappen, "On Cavity Approximations for
Graphical
Models", cond-mat/0608312
- Joshua W. Robinson, Alexander J. Hartemink, "Learning Non-Stationary Dynamic Bayesian Networks", Journal of Machine Learning Research 11 (2010): 3647--3680
- Philipp Rütimann and Peter Bühlmann, "High
dimensional sparse covariance estimation via directed acyclic graphs",
arxiv:0911.2375 = Electronic
Journal of Statistics 3 (2009): 1133--1160
- Kayvan Sadeghi, Steffen L. Lauritzen, "Markov Properties for Loopless Mixed Graphs", arxiv:1109.5909
- Federico Schlüter, "A survey on independence-based Markov networks learning", arxiv:1108.2283
- Marco Scutari
- "Learning Bayesian Networks with the bnlearn
Package", arxiv:0908.3817
- "Measures of Variability for Bayesian Network Graphical Structures", arxiv:1005.4214
- Bill Shipley, Cause and Correlation in Biology: A User's
Guide to Path Analysis, Structural Equations and Causal Inference
- Tomi Silander, Teemu Roos, Petri Myllymaki, "Locally Minimax Optimal Predictive Modeling with Bayesian Networks", Journal of Machine
Learning Research Workshop and Conference Proceedings 5 (AISTATS 2009): 504--511
- Milan Studeny, Probabilistic Conditional Independence
Structures ["The main topic of the monograph is a non-graphical
algebraic method for describing probabilistic CI structures. However, one of
the first two chapters in the book recalls and gathers basic mathematical tools
for study of probabilistic conditional independence (CI) and the other one is a
sketchy overview of recent advanced graphical approaches to the desciption of
CI structures. The next four chapters develop the non-graphical method. The
last standard chapter is an attempt to apply the method in practice: it is
devoted to learning Bayesian nets and it is more mathematical (and
'philosophical') revision of some methods for learning Bayesian networks. The
main aim of that chapter is to indicate that an algebraic approach can also be
applied in this area."]
- Charles Sutton, Andrew McCallum, "An Introduction to Conditional Random Fields", arxiv:1011.4088
- Charles Sutton, Andrew McCallum and Khashayar Rohanimanesh,
"Dynamic Conditional Random Fields: Factorized Probabilistic Models for
Labeling and Segmenting Sequence Data", Journal
of Machine Learning Research 8 (2007): 693--723
- Vincent Y. F. Tan, Animashree Anandkumar, Lang Tong and Alan
S. Willsky, "A Large-Deviation Analysis of the Maximum-Likelihood Learning of
Markov Tree
Structures", IEEE Transactions on Information Theory 57 (2011): 1714--1735, arxiv:0905.0940
[Large deviations for Chow-Liu trees]
- Robert E. Tillman, Arthur Gretton and Peter Spirtes,
"Nonlinear directed acyclic structure learning with weakly additive
noise models" [Thanks to Prof. Spirtes for a preprint]
- Achim Tresch, Florian Markowetz, "Structure Learning in Nested
Effects Models", 0710.4481
- Vivian Viallon, Onureena Banerjee, Gregoire Rey, Eric Jougla, Joel Coste, "An empirical comparative study of approximate methods for binary graphical models; application to the search of associations among causes of death in French death certificates", arxiv:1004.2287
- Martin J. Wainwright and Michael I. Jordan, "Graphical Models,
Exponential Families, and Variational Inference", Foundations and Trends in Machine Learning 1 (2008): 1--305 [PDF reprint via Prof. Jordan]
- Nanny Wermuth, "Probability distributions with summary graph structure", Bernoulli 17 (2011): 845--879, arxiv:1003.3259
- Xianchao Xie, Zhi Geng, "A Recursive Method for Structural Learning
of Directed Acyclic Graphs", Journal of
Machine Learning Research 9 (2008): 459--483
- Gui-Bo Ye, Yuanfeng Wang, Yifei Chen, Xiaohui Xie, "Efficient Latent Variable Graphical Model Selection via Split Bregman Method", arxiv:1110.3076
- Jonathan S. Yedidia, William T. Freeman and Yair Weiss,
"Understanding Belief Propagation and its Generalizations", Mitsubshi Electric Research
Laboratories Tech. Rep. 2001-22
- Marco Zaffalon and Marcus Hutter, "Robust Inference of Trees",
cs.LG/0511087
- Shuheng Zhou, John Lafferty, Larry Wasserman, "Time Varying Undirected Graphs", arxiv:0802.2758
- Piotr Zwiernik, "An Asymptotic Behaviour of the Marginal Likelihood for General Markov Models", Journal of Machine Learning Research 12 (2011): 3283--3310 [where "general Markov models" == "binary graphical tree models where all the inner nodes of a tree represent binary hidden variables"]
(Thanks to Gustavo Lacerda for pointing out a goof.)
#
Model Selection
(Reader, please make your own suitably awful pun about the different senses
of "model selection" here, as a discouragement to those finding this page
through prurient searching. Thank you.)
In statistics
and machine learning, "model
selection" is the problem of picking among different mathematical models which
all purport to describe the same data set. This notebook will not (for now)
give advice on it; as usual, it's more of a place to organize my thoughts and
references...
Classification of approaches to model selection (probably not really
exhaustive but I can't think of others, right now):
- Direct optimization of some measure of goodness of fit or risk on training
data.
- Seems implicit in a lot of work which points to marginal improvements in
"the proportion of variance explained", mis-classification rates, "perplexity",
etc. Often, also, a recipe for over-fitting and chasing snarks. What's wanted
is (almost always) some way of measuring the ability to generalize to new data,
and in-sample performance is a biased estimate of this. Still,
with enough data, if the gods
of ergodicity are kind, in-sample performance
is representative of generalization performance, so perhaps this will work
asymptotically, though in many cases the researcher will never even glimpse
Asymptopia across the Jordan.
- Optimize fit with model-dependent penalty
- Add on a term to each model which supposed indicates its ability to
over-fit. (Adjusted R^2, AIC, BIC, ..., all do this in terms of the number of
parameters.) Sounds reasonable, but I wonder how many actually work better, in
practice, than direct optimization. (See Domingos for some depressing evidence
on this score.)
- Classical two-part minimum description length
methods were penalties; I don't yet understand one-part MDL.
- Penalties which depend on the model class
- Measure the capacity of a class of models to over-fit;
penalize all models in that class accordingly, regardless of their
individual properties. Outstanding example: Vapnik's "structural risk
minimization" (provably consistent under some circumstances). Only
sporadically coincides with *IC-type penalties based on the number of
parameters.
- Cross-validation
- Estimate the ability to generalize to different data by, in fact, using
different data. Maybe the "industry standard" of machine learning. Query, how
are we to know how much different data to use? — .
- Query, how are we to cross-validate when we have complex, relational data?
That is, I understand how to do it for independent samples, and I even
understand how to do it for time series, but I
do not understand how to do it
for networks, and I don't think I am
alone in this. (Well, I understand how to do it for Erdos-Renyi networks,
because that's back to independent samples...)
- The method of sieves
- Directly optimize the fit, but within a constrained
class of models; relax the constraint as the amount of data grows. If the
constraint is relaxed slowly enough, should converge on the truth. (Ordinary
parametric inference, within a single model class, is a limiting case where the
constraint is relaxed infinitely slowly, and we converge on the pseudo-truth
within that class [provided we have a consistent estimator].)
- Encompassing models
- The sampling distribution of any estimator of any model class is a function
of the true distribution. If the true model class has been well-estimated, it
should be able to predict what other, wrong model classes will
estimate, but not vice versa. In this sense the true model class "encompasses
the predictions" of the wrong ones. ("Truth is the criterion both of itself
and of error.")
- General or covering models
- Come up with a single model class which includes all the interesting model
classes as special cases; do ordinary estimation within it. Getting a
consistent estimator of the additional parameters this introduces is often
non-trivial, and interpretability can be a problem.
- Model averaging
- Don't try to pick the best or correct model; use them all with different
weights. Chose the weighting scheme so that if one is best, it will tend to be
more and more influential. Often I think the improvement is not so much from
using multiple models as from smoothing, since estimates of
the single best model are going to be more noisy than estimates
of a bunch of models which are all pretty good. (This leads
to ensemble methods.)
- Adequacy testing
- The correct model should be able to encode the data as uniform IID noise.
Test whether "residuals", in the appropriate sense, are IID uniform. Reject
models which can't hack it. Possibly none of the models on offer is adequate;
this, too, is informative. Or: models make specific probabilistic assumptions
(IID Gaussian noise, for example); test those. Mis-specification testing.
The machine-learning-ish literature on model selection doesn't seem to ever
talk about setting up experiments to select among models; or do I just not read
the right papers there? (The statistical literature on experimental design
tends to talk about "model discrimination" rather than "model selection".)
Recommended, big-picture:
- Gerda Claeskens and Nils Lid Hjort, Model Selection
and Model Averaging [Review: How Can You Choose Just One?]
- Bruce E. Hansen, "Challenges for Econometric Model
Selection", Econometric Theory 21 (2005): 60--68
["Standard econometric model selection methods are based on four fundamental
errors in approach: parametric vision, the assumption of a true
[data-generating process], evaluation based on fit, and ignoring the impact of
model uncertainty on inference. Instead, econometric model selection methods
should be based on a semiparametric vision, models should be viewed as
approximations, models should be evaluated based on their purpose, and model
uncertainty should be incorporated into inference
methods." PDF]
- Trevor Hastie, Robert Tibshirani and Jerome Friedman, The
Elements of Statistical Learning: Data Mining, Inference, and Prediction
- C. R. Rao, Y. Wu, Sadanori Konishi and Rahul Mukerjee, "On Model
Selection", in P. Lahiri (ed.), Model Selection, pp. 1--64
[Thorough review paper, if from a rather old-school statistical-theory
perspective. The rest of the volume is too Bayesian to be of interest to
me. JSTOR]
- Brian Ripley, "Selecting Amongst Large Classes of Models"
[Talk slides (PDF), but informative, chatty, and approvable]
- Aris Spanos, "Curve-Fitting, the Reliability of Inductive
Inference and the Error-Statistical Approach", Philosophy of Science 74 (2007): 1046--1066 [PDF preprint]
Recommended, close-ups:
- Alekh Agarwal, John C. Duchi, Peter L. Bartlett, Clement Levrard, "Oracle inequalities for computationally budgeted model selection" [COLT 2011]
- Sylvain Arlot
- "V-fold cross-validation improved: V-fold
penalization",
arxiv:0802.0566 [Seeing
cross-validation as a penalization method, and improving it accordingly by
strengthening the penalty term]
- "Model selection by resampling penalization",
arxiv:0906.3124 =
Electronic Journal of Statistics 3 (2009):
557--624
- A. C. Atkinson and A. N. Donev, Optimum Experimental
Design [Review]
- Leo Breiman, "Heuristics of Instability and Stabilization in Model
Selection," Annals of Statistics 24 (1996):
2350--2383
- Leo Breiman and Philip Spector, "Submodel Selection and Evaluation
in Regression: The X-Random Case", International
Statistical Review 60 (1992): 291--319
[JSTOR]
- Peter Bühlmann, M. Kalisch and M. H. Maathuis, "Variable selection in high-dimensional linear models: partially faithful distributions and the PC-simple algorithm", Biometrika 97 (2010): 261--278
- Peter Bühlmann and Sara van de Geer, >Statistics for High-Dimensional Data: Methods, Theory and Applications [State-of-the
art (2011) compendium of what's known about using the Lasso, and related methods,
for model selection. Mini-review]
- George Casella and Guido Consonni, "Reconciling Model Selection and
Prediction", arxiv:0903.3620 ["It
is known that there is a dichotomy in the performance of model selectors. Those
that are consistent (having the "oracle property") do not achieve the
asymptotic minimax rate for prediction error. We look at this phenomenon
closely, and argue that the set of parameters on which this dichotomy occurs is
extreme, even pathological, and should not be considered when evaluating model
selectors. We characterize this set, and show that, when such parameters are
dismissed from consideration, consistency and asymptotic minimaxity can be
attained simultaneously." Comment: I agree; they show that you need a truly
bizarre sequence of local alternatives to get this behavior.]
- Nicolo Cesa-Bianchi and Gabor Lugosi, Prediction, Learning,
and Games
[Mini-review. For
avoiding model selection in favor of adaptively-weighted combinations of
models.]
- Snigdhansu Chatterjee, Nitai D. Mukhopadhyay, "Risk and resampling
under model
uncertainty", arxiv:0805.3244 [an
interesting approach to model averaging with provably good frequentist
properties, via bootstrapping --- for a trivial linear-Gaussian problem; not
clear to me how to generalize]
- D. R. Cox, "Tests of Separate Families of
Hypotheses", Proceedings of the Fourth
Berkeley Symposium on Mathematical Statistics and Probability, Vol. 1 (Univ. of Calif. Press,
1961), 105-123 [The origins of Cox's test for non-nested hypotheses]
- Pedro
Domingos, "The Role of Occam's Razor in Knowledge Discovery," Data
Mining and Knowledge Discovery, 3 (1999) [PDF]
- Marcus Hutter, "The Loss Rank Principle for Model Selection",
math.ST/0702804 [This is a
simplified form of Deborah Mayo's
"severity".]
- Pascal Lavergne and Quang H. Vuong, "Nonparametric Selection of
Regressors: The Nonnested Case", Econometrica 64
(1996): 207--219 [Picking which variables belong in a regression, by looking at
the error of non-parametric kernel
regressions. JSTOR]
- Pascal Massart, Concentration Inequalities and Model
Selection [Using empirical process theory to get finite-sample, i.e.,
non-asymptotic, risk bounds for various forms
of model selection. Available for free as
a large PDF
preprint. Mini-review]
- Charles Mitchell and Sara van de Geer, "General Oracle Inequalities
for Model
Selection", Electronic
Journal of Statistics 3 (2009): 176--204 [Analyzes
a data-set splitting scheme (like cross-validation with only one "fold")]
- David J. Spiegelhalter, Nicola G. Best, Bradley P. Carlin and
Angelika van der Linde, "Bayesian Measures of Model Complexity and
Fit", Journal of the Royal Statistical Society
B 64 (2002): 583--639
[PDF reprint. However,
see Claeskens and Hjort, especially p. 92, for a discussion of how this just
turns into the Takeuchi (= "model-robust" Akaike) IC in the large-sample
limit.]
- Sara van de Geer, Empirical Process Theory in
M-Estimation
- V. N. (=Vladimir Naumovich) Vapnik, The Nature of
Statistical Learning Theory [Review:
A Useful Biased Estimator]
- Quang H. Vuong, "Likelihood Ratio Tests for Model Selection and
Non-Nested Hypotheses", Econometrica 57 (1989):
307--333
To read:
- Pierre Alquier and Olivier Wintenberger, "Model selection and randomization for weakly dependent time series forecasting", arxiv:0902.2924
- Animashree Anandkumar, Vincent Y.F. Tan, Alan. S. Willsky, "High-Dimensional Gaussian Graphical Model Selection: Tractable Graph Families", arxiv:1107.1270
- Sylvain Arlot, "Choosing a penalty for model selection in heteroscedastic regression", arxiv:0812.3141
- Sylvain Arlot and Alain Celisse, "A survey of cross-validation
procedures for model
selection", Statistics
Surveys
4 (2010): 40--79
- Sylvain Arlot and Pascal Massart, "Data-driven Calibration of
Penalties for Least-Squares
Regression", Journal
of Machine Learning Research 10 (2009): 245--279
- Francis Bach, "Model-Consistent Sparse Estimation through the
Bootstrap", arxiv:0901.3202 ["if
we run the Lasso for several bootstrapped replications of a given sample, then
intersecting the supports of the Lasso bootstrap estimates leads to consistent
model selection"]
- A. R. Baigorri, C. R. Goncalves, P. A. A. Resende, "Markov Chain Order Estimation and Relative Entropy", arxiv:0910.0264
- Maria Maddalena Barbieri and James O. Berger, "Optimal Predictive
Model Selection", math.ST/0406464 = Annals
of Statistics 32 (2004): 870--897 [Unfortunately,
Bayesian]
- Andrew Barron, Lucien Birgé, and Pascal Massart, "Risk
bounds for model selection via penalization", Probability Theory and
Related Fields 113 (1999): 301--413
- Lucien Birgé
- "The Brouwer Lecture 2005: Statistical estimation with
model
selection", math.ST/0605187
- "Model selection for Poisson processes",
math/0609549
- Lucien Birgŕ and Pascal Massart
"Minimal Penalties for Gaussian
Model Selection", Probability Theory and
Related Fields 138 (2007): 33--73
- "From model selection to adaptive estimation", pp. 55--87
in Pollard, Torgersen and Yang (eds.), Fetschrift for Lucien Le Cam:
Research Papers in Probability and Statistics (1997)
- Gilles Blanchard, Olivier Bousquet, Pascal Massart, "Statistical performance of support vector machines", Annals of Statistics 36 (2008): 489--531, arxiv:0804.0551
- Borowiak, Model Discrimination for Nonlinear Regression
Models
- Daniel R. Cavagnaro, Jay I. Myung, Mark A. Pitt and Janne V. Kujala,
"Adaptive Design Optimization: A Mutual Information-Based Approach to Model Discrimination in Cognitive Science", Neural
Computation 22 (2010): 887--905
- Gavin C. Cawley, Nicola L. C. Talbot, "On Over-fitting in Model
Selection and Subsequent Selection Bias in Performance
Evaluation", Journal
of Machine Learning Research 11 (2010): 2079--2107
- Xin Chen, Changliang Zou, and R. Dennis Cook, "Coordinate-independent sparse sufficient dimension reduction and variable selection",
Annals of Statistics 38 (2010): 3696--3723
- A. E. Clark and C. G. Troskie, "Time Series and Model Selection",
Communications in Statistics: Simulation and computing
37 (2008): 766--771 [Simulation study of the accuracy of
different information criteria]
- Kevin A. Clarke, "A Simple Distribution-Free
Test for Nonnested Hypotheses" [PDF preprint]
- Guilhem Coq, Olivier Alata, Marc Arnaudon and Christian Olivier,
"An improved method for model selection based on Information Criteria",
math.ST/0702540
- Pedro Domingos
- "Process-Oriented Estimation of Generalization Error" [PDF]
- "A Process-Oriented Heuristic for Model Selection"
[PDF]
- Hugo Jair Escalante, Manuel Montes, Luis Enrique Sucar, "Particle
Swarm Model Selection",
Journal
of Machine Learning Research 10 (2009): 405--440
- Jianqing Fan and Runze Li, "Variable Selection via Nonconcave
Penalized Likelihood and its Oracle Properties", Journal of
the American Statistical Association 96 (2001): 1348--1360 [PDF reprint via Prof. Fan]
- Jianqing Fan, Richard Samworth, Yichao Wu, "Ultrahigh dimensional
variable selection: beyond the linear
model", arxiv:0812.3201
- Magalie Fromont, "Model selection by bootstrap penalization for
classification", Machine
Learning
66 (2007): 165--207
- Christophe Giraud, "Estimation of Gaussian graphs by model
selection", arxiv:0710.2044
- Alexander Goldenshluger and Eitan Greenshtein, "Asymptotically
minimax regret procedures in regression model selection and the magnitude of
the dimension
penalty", Annals of
Statistics 28 (2000): 1620--1637 [Hmmm. Not sure
how relevant this will be to anything I'd need to do, given the assumptions
they load on. Via Kevin Kelly.]
- Christian Gourieroux and Alain Monfort, "Testing, Encompassing, and
Simulating Dynamic Econometric Models", Econometric Theory
11 (1995): 195--228 [JSTOR]
- Sonja Greven and Thomas Kneib, "On the behaviour of marginal and
conditional AIC in linear mixed models", Biometrika 97 (2010): 773--789
- Jenny Häggström and Xavier de Luna, "Estimating
Prediction Error: Cross-Validation vs. accumulated Prediction Error",
Communications in Statistics: Simulation
and Computation 39 (2010): 880--898
- Benjamin Hofner, Torsten Hothorn, Thomas Kneib, and Matthias Schmid, "A Framework for Unbiased Model Selection Based on Boosting", Journal of Computational and Graphical Statistics
forthcoming (2011)
- Ching-Kang Ing, "Accumulated prediction errors, information criteria and optimal forecasting for autoregressive time series", Annals of
Statistics 35 (2007): 1238--1277, arxiv:0708.2373
- Paul Kabaila and Khageswor Giri, "Upper bounds on the minimum coverage probability of confidence intervals in regression after variable selection", arxiv:0711.0993
- Nicholas M. Kiefer and Hwan-Sik Choi, "Robust Model Selection in
Dynamic Models with an Application to Comparing Predictive Accuracy"
[SSRN]
- Sadanori Konishi and Genshiro Kitagawa, "Asymptotic theory for
information crteria in model selection --- functional approach," Journal of
Statistical Planning and Inference 114 (2003):
45--61
- Hannes Leeb
- "Conditional Predictive Inference Post Model
Selection", Annals of Statistics 37 (2009):
2838--2876, arxiv:0908.3615 [I
heard Leeb give a talk on this, but I should read the paper]
- "Evaluation and selection of models for out-of-sample prediction when the sample size is small relative to the complexity of the data-generating process", Bernoulli 14 (2008): 661--690,
arxiv:0802.3364
- Hannes Leeb and Benedikt M. Pötscher
- Matthieu Lerasle, "Optimal model selection for density estimation of stationary data under various mixing conditions", Annals of Statistics 39 (2011): 1852--1877, arxiv:0911.1497
- Chenlei Leng, "The Residual Information Criterion, Corrected",
arxiv:0711.1918
- F. Liang and A. Barron, "Exact Minimax Strategies for Predictive
Density Estimation, Data Compression, and Model Selection", IEEE Transactions on
Information Theory 50 (2004): 2708--2726
- Han Liu, Kathryn Roeder, Larry Wasserman, "Stability Approach to Regularization Selection (StARS) for High Dimensional Graphical Models", arxiv:1006.3316
- Abraham Meidan and Boris Levin, "Choosing from Competing Theories
in Computerised Learning", Minds and Machines 12
(2002): 119--129
- Nicolai Meinshausen and Peter Buehlmann, "Stability Selection",
arxiv:0809.2932 ["Estimation of
structure, such as in graphical modeling, cluster analysis or variable
selection, is notoriously difficult, especially for high-dimensional data. We
introduce the new method of stability selection."]
- Hugh Miller and Peter Hall, "Local polynomial regression and variable selection", arxiv:1006.3342
- Grayham E. Mizon and Massimiliano Marcellino (eds.),
Progressive Modelling: Non-nested Testing and Encompassing
[Blurb, table of contents]
- Ali Mohammad-Djafari, "Model selection for inverse problems: Best
choice of basis functions and model order selection," physics/0111020
- Samuel Mueller and A. H. Welsh, "Robust model selection
in generalized linear models", arxiv:0711.2349
- Benedikt M. Pötscher
- "The distribution of model averaging
estimators and an impossibility result regarding its estimation", arxiv:math/0702781
- "Confidence sets based on sparse estimators
are necessarily large", arxiv:0711.1036
- Zacharias Psaradakis, Martin Sola, Fabio Spagnolo and Nicola Spagnolo, "Selecting nonlinear time series models using information criteria",
Journal of
Time Series Analysis
30 (2009): 369--394
- Pradeep Ravikumar, Martin J. Wainwright, and John D. Lafferty,
"High-dimensional Ising model selection using $\ell_1$-regularized logistic
regression", Annals
of Statistics 38 (2010):
1287--1319, arxiv:0804.4202
- Douglas Rivers and Quang H. Vuong, "Model selection tests for
nonlinear dynamic
models", The
Econometrics Journal 5 (2002): 1--39
- Aris Spanos
- "Statistical Induction, Severe Testing, and Model
Validation" [Preprint]
- "Statistical Model Specification vs. Model Selection: Akaike-type Criteria and the Reliability of Inference" [preprint kindly
provided by Prof. Spanos]
- Tina Toni and Michael P. H. Stumpf
- "Parameter Inference and
Model Selection in Signaling Pathway Models", arxiv:0905.4468
- "Simulation-based model selection for dynamical systems in systems and population biology", arxiv:0911.1705
- Masayuki Uchida and Nakahiro Yoshida, "Information Criteria in
Model Selection for Mixing Processes", Statistical Inference for
Stochastic Processes 4 (2001): 73--98 ["The emphasis is
put on the use of the asymptotic expansion of the distribution of an estimator
based on the conditional Kullback-Leibler divergence for stochastic processes.
Asymptotic properties of information criteria and their improvement are
discussed."]
- Tim van Erven, Peter Grunwald and Steven de Rooij, "Catching Up Faster by Switching Sooner: A Prequential Solution to the AIC-BIC Dilemma", arxiv:0807.1005
- Ramon van Hanel, "On the minimal penalty for Markov order estimation", Probability Theory and Related Fields 150 (2011): 709--738, arxiv:0908.3666
- Geert Verbeke, Geert Molenberghs, Caroline Beunckens, "Formal and
Informal Model Selection with Incomplete Data", Statistical
Science 23 (2008): 201--218
= arxiv:0808.3587
- Junhui Wang, "Consistent selection of the number of clusters via crossvalidation", Biometrika 97 (2010): 893--904
- Zijun Wang, "Finite Sample Performances of the Model Selection Approach in Nonparametric Model Specification for Time Series", Communications in Statistics: Theory and Methods 38
(2009): 2302--2330
- Lan Xue, annie Qu, Jianhui Zhou, "Consistent Model Selection for Marginal Generalized Additive Model for Correlated Data", Journal of the American Statistical Association forthcoming
- Yiyun Zhang, Runze Li and Chih-Ling Tsai, "Regularization Parameter
Selections via Generalized Information
Criterion", Journal
of the American Statistical Association 105 (2010):
312--323
- Piotr Zwiernik, "An Asymptotic Behaviour of the Marginal Likelihood for General Markov Models", Journal of Machine Learning Research 12 (2011): 3283--3310 [where "general Markov models" == "binary graphical tree models where all the inner nodes of a tree represent binary hidden variables"]
#
Neural Modeling and Data Analysis
Especially, but not exclusively, modeling of spike trains (which is
important for neural coding, and overlaps therewith).
Things to investigate: How easy would it be to adapt spike-sorting
algorithms to cluster or classify other kinds of time series? Easy or not,
would there be any point?
What's up with all the papers on using Ising models (and their variants) to
model neural interactions? Some very respectable people are involved, but just
saying the words makes me dubious. What's been done on
using graphical-model structure learning
for neural data?
See also:
Neural Coding;
Synchronization in Neural Systems;
Neuroscience in general
Recommended (also look at the recommendations
under coding and
synchronization):
- David Brillinger, "Nerve Cell Spike Train Data Analysis: A
Progression of Technique," Journal of the American Statistical
Association
87 (1992): 260--270
- A. E. Brockwell, A. L. Rojas and R. E. Kass, "Recursive
Bayesian Decoding of Motor Cortical Signals by Particle Filtering",
Journal of
Neurophysiology 91 (2004): 1899--1907 [Very nice,
especially since they've combining data from multiple experiments. It is
a little disappointing that they set up a state-space model, but then
only use the state to enforce a kind of weak continuity constraint on the
decoding, rather than trying to capture the actual computations going on. But
I should talk to them about that... Appendix A gives a very clear and compact
explanation of particle filtering.]
- Emery N. Brown, Robert E. Kass and Partha P. Mitra, "Multiple
Neural Spike Train Data Analysis: State-of-the-art and Future
Challanges", Nature
Neuroscience 7 (2004): 456--461
[PDF reprint via
Rob]
- Sami El Boustani, Alain Destexhe, "Does brain activity stem from high-dimensional chaotic dynamics? Evidence from the human electroencephalogram, cat cerebral cortex and artificial neuronal networks", arxiv:0904.4217
- Chris Eliasmith and Charles Anderson, Neural Engineering:
Computation, Representation, and Dynamics in Neurobiological Systems
- Matthew T. Harrison and Stuart Geman, "A Rate and
History-Preserving Resampling Algorithm for Neural Spike Trains",
Neural Computation 21 (2009): 1244--1258
- Robert Haslinger, Gordon Pipa and Emery Brown,
"Discrete Time Rescaling Theorem: Determining Goodness of Fit for
Discrete Time Statistical models of Neural Spiking",
Neural Computation 22 (2010): 2477--2506
- Yoshito Hirata, Kevin Judd and Kazuyuki Aihara, "Characterizing
chaotic response of a squid axon through generating
partitions", Physics Letters
A 346 (2005): 141--147 [The obvious approach
to symbolic dynamics for spike trains
works.]
- Robert E. Kass, Valerie Ventura and Emery N. Brown, "Statistical
Issues in the Analysis of Neuronal
Data", Journal of
Neurophysiology 94 (2005): 8--25
[PDF reprint via
Rob]
- Shinsuke Koyama and Shiegeru Shinomoto, "Empirical Bayes
interpretations of random point
events", Journal
of Physics A: Mathematical and General
38 (2005): L531--L537
- Martin A. Lindquist, "The Statistical Analysis of fMRI
Data", Statistical Science 23 (2008): 439--464
= arxiv:0906.3662
- Murat Okatan, Matthew A. Wilson and Emery N. Brown,
"Analyzing Functional Connectivity Using a Network
Likelihood Model of Ensemble Neural Spiking Activity", Neural
Computation 17 (2005): 1927--1961
- Liam Paninski, Jonathan Pillow, and Jeremy Lewi, "Statistical
models for neural encoding, decoding, and optimal stimulus design", to appear
in P. Cisek, T. Drew and J. Kalaska (eds.), Computational Neuroscience:
Progress in Brain Research
[PDF
preprint]
- J. D. Ramsey, S. J. Hanson, C. Hanson, Y. O. Halchenko,
R. A. Poldrack and C. Glymour, "Six Problems for Causal Inference from
fMRI", NeuroImage 49 (2010): 1545--1558
[PDF via
Prof. Hanson; thanks to Prof. Glymour for having shared a preprint with me]
- Sommer and Wichert (eds.), Exploratory Analysis and Data
Modeling in Functional Neuroimaging [Conference proceedings, so uneven and not too tightly integrated, but covers a lot of ground.]
To read:
- Shun-ichi Amari, "Conditional Mixture Model for Correlated Neuronal
Spike
Trains", Neural
Computation 22 (2010): 1718--1736
- Pierre Baldi, "Probabilistic Models of Neuronal Spike Trains," in
Giles and Gori (eds.), Adaptive Processing of Sequences and Data
Structures
- Peter beim Graben, J. Douglas Saddy, Matthias Schlesewsky and
Jürgen Kurths, "Symbolic Dynamics of Event-Related Brain Potentials,"
Physical Review E 62 (2000): 5518--5541
- William Bialek, "Thinking about the brain," physics/0205030
- Hemant Bokil, Bijan Pesaran, R. A. Andersen and Partha P. Mitra, "A
framework for detection and classification of events in neural activity", q-bio.NC/0507045
- Romain Brette and Wulfram Gerstner, "Adaptive Exponential
Integrate-and-Fire Model as an Effective Description of Neuronal
Activity", Journal of
Neurophysiology 94 (2005): 3637--3642
- R. Brette, M. Rudolph, T. Carnevale, M. Hines, D. Beeman,
J. M. Bower, M. Diesmann, A. Morrison, P. H. Goodman, F. C. Harris Jr.,
M. Zirpe, T. Natschlager, D. Pecevski, B. Ermentrout, M. Djurfeldt,
A. Lansner, O. Rochel, T. Vieville, E. Muller, A. P. Davison, S. El Boustani,
and A. Destexhe, "Simulation of networks of spiking neurons: A review of tools
and strategies", q-bio.NC/0611089
- A. Brezger, L. Fahrmeir, A. Hennerfeind, "Adaptive Gaussian Markov random fields with applications in human brain mapping", Journal of the Royal Statistical Society C 56
(2007): 327--345
- David Brillinger, "Some statistical methods for random process
data from seismology and neurophysiology", Annals of Statistics
16 (1988): 1--54
- David Cai, Louis Tao and David W. McLaughlin, "An Embedded Network
Approach for Scale-Up of Fluctuation-Driven Systems with Preservation of Spike
Information", Proceedings of the
National Academy of Sciences (2004) [Abstract: " address
computational 'scale-up' issues in modeling large regions of the cortex, many
coarse-graining procedures have been invoked to obtain effective descriptions
of neuronal network dynamics. However, because of local averaging in space and
time, these methods do not contain detailed spike information and, thus, cannot
be used to investigate, e.g., cortical mechanisms that are encoded through
detailed spike-timing statistics. To retain high-order statistical information
of spikes, we develop a hybrid theoretical framework that embeds a subnetwork
of point neurons within, and fully interacting with, a coarse-grained network
of dynamical background. We use a newly developed kinetic theory for the
description of the coarse-grained background, in combination with a Poisson
spike reconstruction procedure to ensure that our method applies to the
fluctuation-driven regime as well as to the mean-driven regime. This
embedded-network approach is verified to be dynamically accurate and
numerically efficient. As an example, we use this embedded representation to
construct 'reverse-time correlations' as spiked-triggered averages in a ring
model of orientation-tuning dynamics." ]
- Hock Peng Chan and Wei-Liem Loh, "Some theoretical results on
neural spike train probability
models", math.ST/0703829
- Yonghong Chen, Steven L. Bressler, and Mingzhou Ding, "Frequency
decomposition of conditional Granger causality and application to multivariate
neural field potential
data", q-bio.NC/0608034
= Journal of Neuroscience Methods 150 (2006):
228--237
- Zhiyi Chi, "Large deviations for template matching between point
processes", Annals of Applied
Probability 15 (2005): 153--174 = math.PR/0503463
- Todd P. Coleman and Sridevi S. Sarma, "A Computationally Efficient Method for Nonparametric Modeling of Neural Spiking Activity with Point Processes", Neural Computation 22 (2010): 2002--2030
- Mauro Copelli and Osame Kinouchi, "Intensity Coding in
Two-Dimensional Excitable Neural Networks", q-bio.NC/0409032
[Greenberg-Hastings cellular automata as a toy model of visual response!]
- Luciano da F. Costa and Olaf Sporns, "Hierarchical Features of
Large-Scale Cortical Connectivity", q-bio.NC/0508007
- J. Davidsen and H. G. Schuster, "Simple model for 1/f noise,"
cond-mat/0201198 [a null
model]
- Peter Dayan and Larry Abbott, Theoretical Neuroscience
[website]
- Matthieu Delescluse and Christophe Pouzat, "Efficient spike-sorting
of multi-state neurons using inter-spike intervals information", q-bio.QM/0505053
- Mingzhou Ding, Yonghong Chen and Steve L. Bressler,
"Granger Causality: Basic Theory and Application to Neuroscience",
q-bio.QM/0608035 = pp.
451--474 in B. Schelter, M. Winterhalder, and J. Timmer (eds.), Handbook
of Time Series Analysis
- Alexander J. Dubbs, Brad A. Seiler and Marcelo O. Magnasco,
"A Fast Lp Spike Alignment Metric",
Neural Computation 22 (2010): 2785--2808
- Jean-Pierre Eckmann, Ofer Feinerman, Leor Gruendlinger, Elisha Moses, Jordi Soriano, Tsvi Tlusty, "The Physics of Living Neural Networks", arxiv:1007.5465
- Victor M. Eguiluz, Dante R. Chialvo, Guillermo A. Cecchi, Marwan
Baliki and A. Vania Apkarian, "Scale-free brain functional networks",
Physical Review
Letters 94 (2005): 018102 = cond-mat/0309092
- Seif Eldawlatly, Yang Zhou, Rong Jin
and Karim G. Oweiss, "On the Use of Dynamic Bayesian Networks in Reconstructing Functional Neuronal Networks from Spike Train Ensembles", Neural Computation 22 (2010): 158--189
- Nicholas Fisher and Arunava Banerjee, "A Novel Kernel for Learning a Neuron Model from Spike Train Data", NIPS 23 (2010) [PDF]
- Michael D. Fox, Abraham Z. Snyder, Justin L. Vincent, Maurizio
Corbetta, David C. Van Essen and Marcus E. Raichle, "The human brain is
intrinsically organized into dynamic, anticorrelated functional networks", Proceedings of the National
Academy of Sciences 102 (2005): 9673--9678
- Yun Gao, Ioannis Kontoyiannis, Elie Bienenstock, "From the entropy to the statistical structure of spike trains", arxiv:0710.4117
- Wulfram Gerstner, Spiking Neuron Models
- Gail Gilboa, Ronen Chen, and Naama Brenner, "History-Dependent
Multiple-Time-Scale Dynamics in a Single-Neuron Model", Journal of
Neuroscience 25 (2005): 6479--6489
- Paul Glimcher, Decisions, Uncertainty, and the Brain:
The Science of Neuroeconomics
- Andreas Grönlund, "The difference in directed structure of
Neural and Transcriptional Regulation Networks", cond-mat/0406268
- Richard H. R. Hahnloser, "Stationary transmission distribution of
random spike trains by dynamical synapses," Physical Review E
67 (2003) 022901
- Ronald M. Harris-Warrick, Eve Marder, Allen I. Selverston and
Maurice Moulins (eds.), Dynamic Biological Networks: The Stomatogastric
Nervous System [Blurb]
- H. R. Heekeren, S. Marrett, P. A. Bandettini and L. G. Ungerleider,
"A general mechanism for perceptual decision-making in the human brain",
Nature 431
(859--862)
- Kim L. Hoke, Michael J. Ryan, and Walter Wilczynski, "Social cues
shift functional connectivity in the hypothalamus", PNAS 102
(2005): 10712--10717
- Kazushi Ikeda, "Information Geometry of Interspike Intervals in
Spiking Neurons", Neural
Computation 17 (2005): 2719--2735
- Eugene M. Izhikevich, Dynamical Systems in Neuroscience: The
Geometry of Excitability and Bursting
[Blurb]
- Alon Keinan, Ben Sandbank, Claus C. Hilgetag, Isaac Meilijson and
Eytan Ruppin, "Fair Attribution of Functional Contribution in Artificial and
Biological Networks", Neural
Computation 16 (2004): 1887--1915
- Alexei A. Koulakov, Dmitry Rinberg and Dmitry N. Tsigankov, "How to
find decision makers in neural circuits?", q-bio.NC/0401005
- Danial Lashkari, Ramesh Sridhara and Polina Golland, "Categories and Functional Units: An Infinite Hierarchical Model for Brain Activations" [NIPS 2010]
- Li Zhaoping, Alex Lewis and Silvia Scarpetta, "Mathematical
Analysis and Simulations of the Neural Circuit for Locomotion in Lamprey", q-bio.NC/0404012
- Steven J. Luck, An Introduction to the Event-Related
Potential Technique
[Blurb]
- Wolfgang Maass and Eduardo D. Sontag, "Neural Systems as Nonlinear
Filters," Neural Computation 12 (2000):
1743--1772
- Olivier Marre, Sami El Boustani, Yves Fregnac and Alain
Destexhe, "Prediction of spatio-temporal patterns of neural activity from pairwise correlations", arxiv:0903.0127
= Physical Review Letters 102 (2009):
138101
- Roy Mukamel, Hagar Gelbard, Amos Arieli, Uri Hasson, Itzhak Fried
and Rafael Malach, "Coupling Between Neuronal Firing, Field Potentials, and
fMRI in Human Auditory Cortex", Science
309 (2005): 951--954 ["als in auditory cortex of two
neurosurgical patients and compared them with the fMRI signals of 11 healthy
subjects during presentation of an identical movie segment. The predicted fMRI
signals derived from single units and the measured fMRI signals from auditory
cortex showed a highly significant correlation (r = 0.75, P < 10^-47). Thus,
fMRI signals can provide a reliable measure of the firing rate of human
cortical neurons."]
- Aatira G. Nedungadi, Govindan Rangarajan, Neeraj Jain and
Mingzhou Ding, "Analyzing multiple spike trains with nonparametric granger causality", Journal of Computational
Neuroscience 27 (2009): 55--64
- Murat Okatan, Matthew A. Wilson and Emery N. Brown, "Analyzing
Functional Connectivity Using a Network Likelihood Model of Ensemble Neural
Spiking
Activity", Neural
Computation 17 (2005): 1927--1961
- Liam Paninski, Jonathan W. Pillow and Eero P. Simoncelli, "Maximum
Likelihood Estimation of a Stochastic Integrate-and-Fire Neural Encoding
Model", Neural
Computation 16 (2004): 2533--2561
- G. Pola, R. S. Petersen, A. Thiele, M. P. Young and S. Panzeri,
"Data-Robust Tight Lower Bounds to the Information Carried by Spike Times of a
Neuronal Population", Neural
Computation
17 (2005): 1962--2005
- R. Quian Quiroga, Z. Nadasdy and Y. Ben-Shaul, "Unsupervised Spike
Detection and Sorting with Wavelets and Superparamagnetic
Clustering", Neural
Computation 16 (2004): 1661--1687
- Rajesh P. N. Rao (ed.), Probabilistic Models of the Brain:
Perception and Neural Function
- George N. Reeke and Allan D. Coop, "Estimating the Temporal
Interval Entropy of Neuronal
Discharge", Neural
Computation 16 (2004): 941--970 [From the abstract,
I'm skeptical. They're assuming that successive inter-spike intervals are
all independent samples from a fixed distribution
of known parametric form, and then using maximum likelihood to
estimate the parameters, which of course gives them an entropy estimate and
confidence intervals. But all of the italicized points seem dubious to me.
Still, I need to read it.]
- Hermann Riecke, Alex Roxin, Santiago Madruga and Sara A. Solla,
"Multiple attractors, long chaotic transients, and failure in small-world
networks of excitable
neurons", Chaos
17 (2007): 026110
- P. A. Robinson, "Propagator theory of brain dynamics", Physical Review
E 72 (2005): 011904
- Yasser Roudi, Joanna Tyrcha and John Hertz, "The Ising Model for Neural Data: Model Quality and Approximate Methods for Extracting Functional Connectivity", arxiv:0902.2885
= Physical Review E 79 (2009): 051915
- Naoki Saito, "The Generalized Spike Process, Sparsity, and
Statistical Independence,"
math.PR/0110103
- P. S. Sastry and K. P. Unnikrishnan, "Conditional probability based
significance tests for sequential patterns in multi-neuronal spike
trains", Neural Computation 22 (2010): 1025--1059, arxiv:0808.3511
- Silvia Scarpetta, Zhaoping Li and John Hertz, "Hebbian imprinting
and retrieval in oscillatory neural networks," cond-mat/0111034
- Margaret Euphrasia Sereno, Neural Computation of Pattern
Motion
- Anil K. Seth, Gerald M. Edelman, "Distinguishing Causal
Interactions in Neural Populations", Neural
Computation 19 (2007): 910--933
- Xilin Shen, Francois G. Meyer, "Low Dimensional Embedding of fMRI
datasets", arxiv:0709.3121
["embedding optimally preserves the local functional coupling between fMRI time
series, and provides a low-dimensional coordinate system for detecting
activated voxels. To compute the embedding, we build a network of functionally
connected voxels and represent it with a graph. A spectral decomposition of the
graph probability transition matrix produces a set of eigenvectors that are
used to define the embedding"]
- Svetlana V. Shinkareva, Vladimir Gudkov, Jing Wang, "A Network Analysis Approach to fMRI Condition-Specific Functional Connectivity", arxiv:1008.0590
- Lavi Shpigelman, Yoram Singer, Rony Paz and Eilon Vaadia,
"Spikernels: Predicting Arm Movements by Embedding Population Spike Rate
Patterns in Inner-Product Spaces",
Neural
Computation 17 (2005): 671--690 ["Inner-product
operators, often referred to as kernels in statistical learning, define a
mapping from some input space into a feature space. The focus of this letter is
the construction of biologically motivated kernels for cortical activities. The
kernels we derive, termed Spikernels, map spike count sequences into an
abstract vector space in which we can perform various prediction tasks. We
discuss in detail the derivation of Spikernels and describe an efficient
algorithm for computing their value on any two sequences of neural population
spike counts. We demonstrate the merits of our modeling approach by comparing
the Spikernel to various standard kernels in the task of predicting hand
movement velocities from cortical recordings. All of the kernels that we tested
in our experiments outperform the standard scalar product used in linear
regression, with the Spikernel consistently achieving the best performance."]
- Sean L. Simpson, Satoru Hayasaka, Paul J. Laurienti, "Selecting an exponential random graph model for complex brain networks", arxiv:1007.3230
- Terence R. Stanford, Stephan Quessy and Barry E. Stein, "Evaluating
the Operations Underlying Multisensory Integration in the Cat Superior
Colliculus", Journal of
Neuroscience 25 (2005): 6499--6508
- Wilson Truccolo, John P. Donoghue, "Nonparametric Modeling of
Neural Point Processes via Stochastic Gradient Boosting Regression", Neural
Computation 19 (2007): 672-705
- Arjen vanOoyen (ed.), Modeling Neural Development
- J. C. Vasquez, B. Cessac and T. Viéville, "Entropy-based
parametric estimation of spike train statistics", arxiv:1003.3157 [From a first glance, here "entropy-based"
just means "exponential-family distribution"]
- Valérie Ventura, "Testing for and Estimating Latency Effects
for Poisson and Non-Poisson Spike
Trains", Neural
Computation 16 (2004): 2323--2349
- T. Verechtchaguina, L. Schimansky-Geier and I. M. Sokolov, "Spectra
and waiting-time distributions in firing resonant and non-resonant neurons", q-bio.NC/0401013 [Need to see
whether their ability to determine response properties from interspike-interval
distributions is limited to FitzHugh-Nagumo neurons, or is more general.]
- Hugh R. Wilson, Spikes, Decisions and Actions: The Dynamical
Foundations of Neuroscience
- Tor D. Wager and Tomas E. Nichols, "Optimization of experimental
design in fMRI: A general framework using a genetic
algorithm", Neuroimage 18 (2003): 293--309
- Masahiko Yoshioka, "The spike-timing-dependent learning rule to
encode spatiotemporal patterns in a network of spiking neurons," cond-mat/0110070
#
Forecasting Non-Stationary Processes
Some non-stationary processes are in fact easy to forecast: periodic ones,
for example, are strictly speaking not stationary. An ergodic Markov chain
started far from its invariant distribution is also non-stationary, but easy to
predict (it will approach the stationary distribution). Both of these cases
are conditionally stationary, which I think is all that's really needed.
What's more interesting is the problem of so to speak really
non-stationary processes. It's hard to imagine that there is any way to truly
predict an arbitrary non-stationary process. (Basically: as soon as
you think you have established a trend-line, the Adversary can always reverse
the trend, without creating any problems of consistency with earlier data.) If
you can constrain the class of allowable non-stationary processes, however,
then something might be possible. Alternately, one might lower expectations,
not to actually predicting well, but to predicting with low regret.
I actually have an Idea about using model averaging here, but need to find
the time to work on it.
See also:
Ensemble Methods in Machine Learning;
Time Series;
Universal Prediction
Recommended (very misc):
- S. Caires and J. A. Ferreira, "On the Non-parametric Prediction of
Conditionally Stationary Sequences", Statistical Inference
for Stochastic Processes 8 (2005): 151--184
- R. Dahlhaus, "Fitting Time Series Models to Nonstationary
Processes",
Annals of Statistics 25 (1997): 1--37
- Mark Herbster and Manfred K. Warmuth, "Tracking the Best
Expert", Machine Learning 32 (1998): 151--178
[PS version
via Dr. Herbster]
- Elad Hazan and Satyen Kale, "Extracting certainty from uncertainty: regret bounded by variation in costs", Machine Learning 80 (2010): 165--188
- Jeremy Zico Kolter and Marcus A. Maloof
- Claire Monteleoni and Tommi S. Jaakkola,
"Online Learning of Non-stationary Sequences", pp. 1093--1100 in
NIPS 2003 (vol. 16) [Figuring out at what rate to switch between experts]
- Maxim Raginsky, Roummel F. Marcia, Jorge Silva and Rebecca M.
Willett
- "Sequential Probability Assignment via Online Convex Programming
Using Exponential Families" [ISIT 2009; PDF]
- "Sequential anomaly detection in the presence of noise and limited feedback", arxiv:0911.2904
- Kyupil Yeon, Moon Sup Song, Yongdai Kim, Hosik Choi, Cheolwoo
Park, "Model averaging via penalized regression for tracking concept
drift", Journal of Computational and Graphical
Statistics online before print (2010)
Modesty forbids me to recommend:
- CRS, Abigail Z. Jacobs, Kristina Lisa Klinkner and Aaron Clauset, "Adapting to Non-stationarity with Growing Expert Ensembles", arxiv:1103.0949
To read:
- István Berkes, Lajos Horváth and Shiqing Ling, "Estimation in nonstationary random coefficient autoregressive models",
Journal of Time Series Analysis 30 (2009): 395--416 ["the unit root problem does not exist in the RCA model"!]
- Satish T. S. Bukkapatnam and Changqing Cheng, "Forecasting the evolution of nonlinear and nonstationary systems using recurrence-based local Gaussian process models", Physical Review E
82 (2010): 056206
- Alexey Chernov, Vladimir Vovk, "Prediction with Advice of Unknown Number of Experts", arxiv:1006.0475
- Michael P. Clements and David F. Hendry, Forecasting Non-Stationary Economic Time
Series
- Rainer Dahlhaus and Wolfgang Polonik, "Empirical spectral processes for locally stationary time series", Bernoulli 15
(2009): 1--39, arxiv:902.1448
- Ching-Kang Ing, Jin-Lung Lin, Shu-Hui Yu, "Toward optimal multistep
forecasts in non-stationary
autoregressions", Bernoulli 15 (2009): 402--437
= arxiv:0906.2266 ["Optimal"
assuming that you know you are facing a linear AR model.]
- C. T. Jose, B. Ismail, S. Jayasekhar, "Trend, Growth Rate, and Change Point Analysis: A Data Driven Approach", Communications in Statistics: Simulation and Computation 37 (2008): 498--506
- Yan Karklin and Michael S. Lewicki, "A Hierarchical Bayesian Model
for Learning Nonlinear Statistical Regularities in Nonstationary Natural
Signals", Neural
Computation 17 (2005): 397--423
- Zudi Lu, Dag Johan Steinskog, Dag Tjostheim and Qiwei Yao,
"Adaptively Varying-Coefficient Spatiotemporal Models", Journal of the Royal Statistical Society B 71 (2009): 859--880 [PDF preprint]
- Joaquin Quinonero-Candela, Masashi Sugiyama, Anton Schwaighofer and
Neil D. Lawrence (eds.), Dataset Shift in Machine Learning
- Joshua W. Robinson, Alexander J. Hartemink, "Learning Non-Stationary Dynamic Bayesian Networks", Journal of Machine Learning Research 11 (2010): 3647--3680
- P. F. Verdes, P. M. Granitto and H. A. Ceccatto, "Overembedding
Method for Modeling Nonstationary Systems", Physical Review
Letters 96 (2006): 118701
- Ou Zhao, Michael Woodroofe, "Estimating a monotone trend",
arxiv:0812.3188
- Shuheng Zhou, John Lafferty, Larry Wasserman, "Time Varying Undirected Graphs", arxiv:0802.2758
To write:
- CRS + co-conspirators to be named later, "This Time Is Different"
#
Relational Learning
That is, learning models of mathematical relations and relational structures
from data, not learning in a relational manner.
See also:
Graphical Models;
Network Data Analysis;
Statistics of Structured
Data;
Data Mining;
Machine Learning, Statistical Inference, and Induction;
Mathematical Logic
To read:
- Oliver Schulte, Hassan Khosravi, Flavia Moser, Martin Ester, "Learning Class-Level Bayes Nets for Relational Data", arxiv:0811.4458
- Katsuhiko Ishiguro, Tomoharu Iwata, Naonori Ueda, Joshua Tenenbaum, "Dynamic Infinite Relational Model for Time-varying Relational Data Analysis"
[NIPS 2010]
- Ingo Thon, Niels Landwehr and Luc De Raedt, "Stochastic relational processes: Efficient inference and applications", Machine Learning 82 (2011): 239--272
- Shuheng Zhou, John Lafferty, Larry Wasserman, "Time Varying Undirected Graphs", arxiv:0802.2758
#
Analysis of Network Data
That is, of data on the form of networks --- I don't (as such) care about
packet flow or other aspects of computer networks...
Things I wish I knew how to
do: bootstrap a network, non-parametrically.
(The model with a fixed degree sequence is a start, but what's the equivalent
of the block bootstraps used for time series, which preserve dependence?)
Cross-validation on networks. (You could
say that link prediction is leave-one-out CV, but how about k-fold CV?)
Estimate a distribution over networks by somehow smoothing an adjacency matrix.
— These may or may not be three aspects of a single problem.
Community discovery is an
important sub-topic, and I like exponential family random
graph models enough to give them their own notebook.
— Although
many of the relevant papers appear in the journal Social Networks,
published by Elsevier, the company responsible for deliberately publishing
pseudo-journals such as The Australasian Journal of Bone and Joint
Medicine, I know of no particular reason to believe that their
findings are problematic. It would, however, be good if the community
could shift to a journal whose publishers do not subvert the peer-review
process whenever they find it profitable to do so.
See also:
Complex networks;
Community discovery;
Exponential families of random graph models;
Homophily vs. influence;
Relational learning
Social networks;
Statistics in general;
Statistics of structured
data;
Recommended, big picture:
- Anna Goldenberg, Alice X. Zheng, Stephen E. Fienberg, Edoardo M. Airoldi, "A survey of statistical network models", Foundations and Trends in Machine Learning 2 (2009): 1--117 = arxiv:0912.5410
- Eric D. Kolaczyk, Statistical Analysis of Network Data:
Methods and Models [Best available up-to-date textbook on the subject.
Mini-review.]
- John Scott, Social Network Analysis: A Handbook [Short
introductory text. Good on the sociology, but the implied reader is not at all
comfortable with math, which can be tedious if you are.]
Recommended, close-ups:
- Nesreen K. Ahmed, Jennifer Neville, Ramana Kompella, "Reconsidering the Foundations of Network Sampling" [PDF preprint]
- Edo Airoldi, David M. Blei, Stephen E. Fienberg, Anna Goldenberg,
Eric P. Xing and Alice X. Zheng (eds.), Statistical Network Analysis:
Models, Issues, and New Directions [Disclaimer:
contains one of my papers.]
- Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg, Eric
P. Xing, "Mixed Membership Stochastic
Blockmodels", Journal of
Machine Learning Research 9 (2008): 1981--2014
- Peter J. Bickel, Aiyou Chen, and Elizaveta Levina, "The method of moments and degree distributions for network models", Annals
of Statistics 39 (2011): 38--59
- Peter J. Carrington, John Scott and Stanley Wasserman (eds.),
Models and Methods in Social Network Analysis [Best thought of as
a supplement to Wasserman and Faust, bringing it more up to
date. Blurb]
- Sourav Chatterjee, Persi Diaconis and Allan Sly,
"Random graphs with a given degree sequence", Annals of Applied Probability 21 (2011): 1400--1435, arxiv:1005.1136 [Interesting application of the new technology
of graph limits to a classic model. May not be terribly practical yet but definitely promising.]
- Aaron Clauset and Cristopher Moore, "Accuracy and Scaling Phenomena
in Internet
Mapping", cond-mat/0410059
= Physical Review Letters 94 (2005): 018701
- Aaron Clauset, Cristopher Moore and M. E. J. Newman, "Structural
Inference of Hierarchies in
Networks", physics/0610051
- Hoda Eldaridry and Jennifer Neville, "A Resampling Technique for
Relational Data Graphs", SNA-KDD 2008
[PDF
reprint via Prof. Neville]
- Linton C. Freeman and Douglas R. White (2003), "Using Galois
Lattices to Represent Network Data", Sociological Methodology 23:
127--146 [PDF
reprint]
- Wenjie Fu, Le Song, Eric P. Xing, "A State-Space Mixed Membership Blockmodel for Dynamic Network Tomography", arxiv:0901.0135
- Krista Gile and Mark S. Handcock, "Model-based Assessment of the
Impact of Missing Data on Inference for Networks" [Working Paper 66, Center for
Statistics and the Social Sciences, University of Washington
(2006). PDF
preprint.]
- Steven M. Goodreau, James A. Kitts and Martina Morris,
"Birds of a Feather, Or Friend of a Friend?: Using Exponential Random Graph Models to Investigate Adolescent Social Networks", Demography 46 (2009): 103--125 [In addition to the
substantive findings, this is a great introduction to the "exponential-family random graph model" (ERGM) approach to modeling complex networks.]
- Mark S. Handcock and Krista J. Gile, "Modeling social networks from
sampled
data", Annals
of Applied Statistics
4 (2010): 5--25, arxiv:1010.0891
- Mark S. Handcock, David R. Hunter, Carter T. Butts,
Steven M. Goodreau, and Martina Morris (eds.), "Statistical Modeling
of Social Networks with 'statnet'", special volume (24) of the Journal
of Statistical Software (2008) [Introduction to a whole issue on
the ERGM approach]
- J. A. Henderson and P. A. Robinson, "Geometric Effects on Complex Network Structure in the Cortex", Physical
Review Letters 107 (2011): 018102
- Peter D. Hoff,
Adrian E. Raftery and Mark S. Handcock, "Latent Space Approaches to Social
Network Analysis", Journal of the American Statistical
Association 97 (2002): 1090--1098
[PDF
preprint]
- Jake Hofman, "Large-scale social media analysis with Hadoop"
[Tutorial at ICWSM 2010. The content is not really specific to social media...]
- David R. Hunter,
Steven M. Goodreau and Mark
S. Handcock, "Goodness of Fit of Social Network Models", Journal of
the American Statistical Association 103 (2008):
248--258
[PDF]
- Gueorgi Kossinets, "Effects of Missing Data in Social Networks",
Social Networks 28 (2006):
247--268, arxiv:cond-mat/0306335
- Giuseppe Jurman, Samantha Riccadonna, Roberto Visintainer, Cesare Furlanello, "Biological network comparison via Ipsen-Mikhailov distance", arxiv:1109.0220 [The metric, to be honest,
is not especially compelling, but it's nice to see this done at all.]
- Eric D. Kolaczyk and Pavel N. Krivitsky, "On the question of effective sample size in network modeling", arxiv:1112.0840
- Mladen Kolar, Le Song, Amr Ahmed, and Eric P. Xing, "Estimating time-varying networks", Annals of Applied Statistics 4 (2010): 94--123,
arxiv:http://arxiv.org/abs/0812.5087
- Mahendra Mariadassou, Stéphane Robin, Corinne Vacher, "Uncovering latent structure in valued graphs: A variational approach", Annals
of Applied Statistics 4 (2010): 715--742, arxiv:1011.1813
- Manul Middendorf, Etay Ziv and Chris Wiggins, "Inferring Network
Mechanisms: The Drosophila melanogaster Protein Interaction
Network", q-bio.QM/0408010
[Machine learning meets complex
networks: specifically, learning decision trees to accurately classify networks
by the process which grew them. Neat.]
- M. E. J. Newman, Steven H. Strogatz and Duncan J. Watts,
"Random graphs with arbitrary degree distributions and their applications",
Physical Review E 64 (2001): 026118
= cond-mat/0007235
[Though they don't quite put it this way, these methods are very naturally
employed to generate surrogate network data, which keeps the degree distribution
of the original but is otherwise randomized.]
- Art B. Owen and Dean G. Eckles, "Bootstrapping data arrays of arbitrary order", arxiv:1106.2125
- Jörg Reichardt and Douglas R. White, "Role models for complex
networks", arxiv:0708.0958
- Purnamrita Sarkar and Andrew W. Moore, "Dynamic Social Network
Analysis using Latent Space Models", forthcoming in Advances in Neural
Information Processing Systems 18 (NIPS 2005)
[Abstract,
link to PDF]
- Michael P. H. Stumpf, Carsten Wiuf and Robert M. May, "Subnets of
scale-free networks are not scale-free: Sampling properties of networks", PNAS 102
(2005): 4221--4224
- Andrew C. Thomas, "Censoring Out-Degree Compromises Inferences of Social Network Contagion and Autocorrelation", arxiv:1008.1636
- Andrew C. Thomas and Joseph K. Blitzstein, "Valued Ties Tell Fewer Lies: Why Not To Dichotomize Network Edges With Thresholds", arxiv:1101.0788
- S. Wasserman and K. Faust, Social Network Analysis
[This was, for a long time, the Bible of the field. Like the Bible, it is not
without value, especially if approached as a historical document, but at the
same time, much of it is over-detailed, boring, and filled with prescriptions
that no longer make much sense.]
- Carsten Wiuf, Markus Brameier, Oskar Hagberg and Michael P. H.
Stumpf, "A likelihood approach to analysis of network
data", Proceedings of
the National Academy of Sciences (USA) 103 (2006):
7566--7570 [My comments. Shorter: A nice
piece of work, though limited to what they call "duplication attachment"
models, a limitation which is not really made clear by the abstract.]
- Douglas R. White and Vincent Duquenne, eds. (1996), special issue
on "Social Network and Discrete Structure Analysis", Social
Networks 18: 169--318
- Rongjing Xiang and Jennifer Neville, "Relational Learning with One Network: An Asymptotic Analysis", AI Stats 2011 [PDF reprint]
- Yang Yang, Ira M. Longini Jr, M. Elizabeth Halloran, "A resampling-based test to detect person-to-person transmission of infectious disease",
Annals of Applied Statistics 1 (2007):
211--228, arxiv:0709.0406 [Though
the null they are comparing it to is one of IID disease onset times, which is,
I think, only appropriate when there is no assortative mixing in the social
network for traits which influence onset times for a non-contagious disease.]
To read:
- Alexandre H. Abdo and A. P. S. de Moura, "Clustering as a measure
of the local topology of
networks", physics/0605235
- Elizabeth S. Allman, Catherine Matias, John A. Rhodes, "Parameter identifiability in a class of random graph mixture models", arxiv:1006.0826
- Gerrit Ansmann and Klaus Lehnertz, "Constrained randomization of weighted networks", Physical Review E
84 (2011): 026103
- Tomaso Aste, Ruggero Gramatica, T. Di Matteo, "Exploring complex networks via topological embedding on surfaces", arxiv:1107.3456
- Yves F. Atchade, "Estimation of Network structures from partially observed Markov random fields", arxiv:1108.2835
- Pierre Baldi et al., Modeling the Internet and the Web:
Probabilistic Methods and Algorithms
- Kim Baskerville and Maya Paczuski, "Subgraph ensembles and motif
discovery using an alternative heuristic for graph
isomorphism", Physical Review
E 74 (2006): 051903
- Etienne Birmele, "Detection of network motifs by local concentration", arxiv:0904.0365
- Cristiano Bocci, Luca Chiantini, Fabio Rapallo, "Max-plus objects to study the complexity of graphs", arxiv:1111.1352
- Stephen P. Borgatti, Kathleen M. Carley and David Krackhardt,
"On the robustness of centrality measures under conditions of imperfect data",
Social
Networks 28 (2006): 124--136
- Ulrik Brandes, Natalie Indekofer and Martin Mader, "Visualization methods for longitudinal social networks and stochastic actor-oriented modeling",
Social Networks forthcoming (2011)
- Andrea Capocci, G. Caldarelli and P. De Los Rios, "Quantitative
description and modeling of real networks,"
cond-mat/0206336
- Federica Cerina, Vincenzo De Leo, Marc Barthelemy, Alessandro Chessa, "Spatial correlations in attribute communities", arxiv:1112.3308
- Vittoria Colizza, Alessandro Flammini, M. Angeles Serrano,
Alessandro Vespignani, "Detecting rich-club ordering in complex
network", physics/0602134
- Luciano da F. Costa, Francisco A. Rodrigues, Gonzalo Travieso and
P. R. Villas Boas, "Characterization of complex networks: A survey of
measurements", cond-mat/0505185
- Leon Danon, Ashley P. Ford, Thomas House, Chris P. Jewell, Matt J. Keeling, Gareth O. Roberts, Joshua V. Ross, Matthew C. Vernon, "Networks and the Epidemiology of Infectious Disease", arxiv:1011.5950
- Anton Dries, Siegfried Nijssen, "Mining Patterns in Networks using Homomorphism", arxiv:1110.3225
- Daniel M. Dunlavy, Tamara G. Kolda, Evrim Acar, "Temporal Link Prediction using Matrix and Tensor Factorizations", arxiv:1005.4006
- Ernesto Estrada, "Quantifying network heterogeneity", Physical
Review E 82 (2010): 066102
- Jacob G. Foster, David V. Foster, Peter Grassberger and Maya
Paczuski, "Link likelihoods in random networks with fixed and partially fixed
degree
sequence", cond-mat/0610446
- Birgitte Freiesleben de Blasio, Taral Guldahl Seierstad, Odd O. Aalen, "Frailty effects in networks: comparison and identification of individual heterogeneity versus preferential attachment in evolving networks", Journal of
the Royal Statistical Society C forthcoming (2011)
- Rumi Ghosh and Kristina Lerman, "Parameterized centrality metric for network analysis", Physical Review E 83 (2011): 066118
- Gourab Ghoshal, Vinko Zlatic, Guido Caldarelli, M. E. J. Newman, "Random hypergraphs and their applications", Physical Review E
79 (2009): 066118, arxiv:0903.0419
- Reid Ginoza and Andrew Mugler, "Network motifs come in sets: Correlations in the randomization process", Physical Review E 82 (2010): 011921
- Benjamin Golub and Matthew O. Jackson, "Using selection bias to explain the observed structure of Internet diffusions", Proceedings of the National Academy of Sciences (USA) 107 (2010): 10833--10836
- Neil Zhenqiang Gong, Ameet Talwalkar, Lester Mackey, Ling Huang, Eui Chul Richard Shin, Emil Stefanov, Elaine (Runting) Shi, Dawn Song
"Predicting Links and Inferring Attributes using a Social-Attribute Network (SAN)", arxiv:1112.3265
- Manuel Gomez-Rodriguez, Jure Leskovec, Andreas Krause, "Inferring Networks of Diffusion and Influence", arxiv:1006.0234
- Daniel Grady, Christian Thiemann, Dirk Brockmann, "Parameter-free identification of salient features in complex networks", arxiv:1110.3864
- Roger Guimera and Marta Sales-Pardo, "Missing and spurious
interactions and the reconstruction of complex networks", Proceedings of the National Academy of
Sciences (USA) 106 (2009): 22073--22078
- Mark S. Handcock, Krista J. Gile, "On the Concept of Snowball Sampling", arxiv:1108.0301
- Robert A. Hanneman and Mark Riddle, Introduction to Social
Network Methods
[Online textbook, looks
decent.]
- Nicholas A. Heard, David J. Weston, Kiriaki Platanioti, David J. Hand, "Bayesian anomaly detection methods for social networks", Annals
of Applied Statistics 4 (2010): 645--662, arxiv:1011.1788
- Peter D. Hoff, "Modeling homophily and stochastic equivalence in symmetric relational data", arxiv:0711.1146
- Petter Holme, "Local symmetries in complex networks", cond-mat/0608695
- Rui Jiang, Zhidong Tu, Ting Chen and Fengzhu Sun, "Network motif identification in stochastic networks", Proceedings of the National Academy of Sciences (USA) 103 (2006): 9404--9409
- Brian Karrer and M. E. J. Newman, "Random graphs containing
arbitrary distributions of
subgraphs", Physical
Review E 82 (2010): 066118, arxiv:1005.1659
- Eric D. Kolaczyk, David B. Chua, Marc Barthelemy, "Co-Betweenness:
A Pairwise Notion of
Centrality", arxiv:0709.3420
- Mladen Kolar, Eric P. Xing, "Estimating Networks With Jumps", arxiv:1012.3795
- Gueorgi Kossinets and Duncan J. Watts
- Vassilis Kostakos, Eamonn O'Neill, Alan Penn, "Brief encounter
networks", 0709.0223 [Networks
defined by brief transactions, rather than persistent ties.]
- Mark A. Kramer, Uri T. Eden, Sydney S. Cash, Eric D. Kolaczyk,
"Network inference - with confidence - from multivariate time series",
arxiv:0903.2210
- Martin Krzywinski, Inanc Birol, Steven J. M. Jones and Marco
A. Marra, "Hive plots: rational approach to visualizing networks",
Briefings in
Bioinformatics
forthcoming (2011) [But really the action is
on the webpage]
- Jerome Kunegis, Ernesto W. De Luca, Sahin Albayrak, "The Link Prediction Problem in Bipartite Networks", arxiv:1006.5367
- Matthieu Latapy and Clemence Magnien, "Measuring Fundamental
Properties of Real-World Complex Networks", cs.NI/0609115 [How
asymptotic are we?]
- Pierre Latouche, Etienne Birmelé, and Christophe Ambroise, "Overlapping stochastic block models with application to the French political blogosphere", Annals of Applied Statistics 5 (2011): 309--336, arxiv:0910.2098
- Sang Hoon Lee, Pan-Jun Kim, and Hawoong Jeong, "Statistical
properties of sampled
networks", cond-mat/0505232
- Jure Leskovec, Deepayan Chakrabarti, Jon Kleinberg, Christos Faloutsos, Zoubin Ghahramani, "Kronecker Graphs: An Approach to Modeling Networks", arxiv:0812.4905
- Manuel Lima, Visual Complexity: Displaying Complex
Networks and Data Sets
- Han Liu, Xi Chen, John Lafferty and Larry Wasserman, "Graph-Valued
Regression", NIPS 23 (2010) [PDF], arxiv:1006.3972
- Linyuan Lu, Tao Zhou, "Link Prediction in Complex Networks: A Survey", arxiv:1010.0725
- David Lusseau, Hal Whitehead, Shane Gero, "Incorporating uncertainty into the study of animal social networks", arxiv:0903.1519 [From a quick look, nothing in this depends on
animals]
- Ben D. MacArthur, Rubén J. Sánchez-García, James
W. Anderson, "On Automorphism Groups of Networks", Discrete
Applied Mathematics 156 (2008): 3525--3531, arxiv:0705.3215
- Sofus A. Macskassy, Foster Provost, "Classification in Networked Data: A Toolkit and a Univariate Case Study", Journal of
Machine Learning Research 8 (2007): 935--983
- Yoshiharu Maeno, Yukio Ohsawa, "Node discovery problem for a social
network", arxiv:0710.4975
- Arun S. Maiya, Tanya Y. Berger-Wolf, "Benefits of Bias: Towards Better Characterization of Network Sampling", arxiv:1109.3911
- Sebastian Moreno, Sergey Kirshner, Jennifer Neville, S.V.N. Vishwanathan, "Tied Kronecker Product Graph Models to Capture Variance in Network Populations" [PDF reprint]
- Seth A. Myers and Jure Leskovec, "On the Convexity of Latent Social Network Inference", NIPS 23 (2010) [PDF]
- Jennifer Neville, Brian Gallaghr, Tina Eliassi-Rad and Tao Wang,
"Correcting evaluation bias of relational classifiers with network cross validation", Knowledge and Information Systems online before print (2011) [Open access]
- Benjamin P. Olding, Patrick J. Wolfe, "Inference for graphs and networks: Extending classical tools to modern data", arxiv:0906.4980
- Henry Pao, Glen A. Coppersmith and Carey E. Priebe, "Statistical
Inference on Random Graphs: Comparative Power Analysis",
Journal of Computational and Graphical Statistics forthcoming
(2011)
- Patrick O. Perry, Patrick J. Wolfe, "Point process modeling for directed interaction networks", arxiv:1011.1703
- Leonid Peshkin, "Structure induction by lossless graph compression",
cs.DS/0703132
- Art F. Y. Poon, Kimberly C. Brouwer, Stefannie A. Strathdee,
Michelle Firestone-Cruz, Remedios M. Lozada, Sergei L. Kosakovsky Pond, Douglas
D. Heckathorn, Simon D. W. Frost, "Parsing Social Network Survey Data from
Hidden Populations Using Stochastic Context-Free
Grammars", PLoS
One 4 (2009): 6777
- Mathias Raschke, Markus Schlapfer and Roberto Nibali, "Measuring
degree-degree association in networks", arxiv:1003.1634
- Mathias Raschke, Markus Schlapfer and Konstantinous Trantopoulos,
"Copula-based generation of degree-associated networks", arxiv:1012.0201
- Pradeep Ravikumar, Martin J. Wainwright, and John D. Lafferty,
"High-dimensional Ising model selection using $\ell_1$-regularized logistic
regression", Annals
of Statistics 38 (2010):
1287--1319, arxiv:0804.4202
- E. S. Roberts, A. C. C. Coolen, T. Schlitt, "Tailored graph ensembles as proxies or null models for real networks II: results on directed graphs", arxiv:1101.6022
- Karl Rohe, Sourav Chatterjee, Bin Yu, "Spectral clustering and the high-dimensional Stochastic Block Model", arxiv:1007.1684
- Martin Rosvall and Carl T. Bergstrom, "Mapping Change in Large
Networks", PLoS
One 5 (2010): e8694
- Camille Roth, "Measuring Generalized Preferential Attachment in
Dynamic Social Networks", nlin.AO/0507021 [Applies
more generally than to social networks]
- Areejit Samal, Olivier C. Martin, "Randomizing genome-scale metabolic networks", arxiv:1012.1473
- M. Angeles Serrano, Marian Boguna, Romualdo Pastor-Satorras,
"Correlations in weighted
networks", cond-mat/0609029
- Mahdi Shafiei, Hugh Chipman, "Mixed-Membership Stochastic Block-Models for Transactional Networks", arxiv:1010.1437
- Srinivas Gorur Shandilya, Marc Timme, "Inferring Network Topology from Complex Dynamics", arxiv:1007.1640
- Nino Shervashidze, Pascal Schweitzer, Erik Jan van Leeuwen, Kurt Mehlhorn, Karsten M. Borgwardt, "Weisfeiler-Lehman Graph Kernels", Journal of Machine Learning Research 12 (2011): 2539--2561
- Mile Sikic, Alen Lancic, Nino Antulov-Fantulin, Hrvoje Stefancic, "Epidemic centrality and the underestimated epidemic impact on network peripheral nodes", arxiv:1110.2558
- Aarti Singh, Robert D. Nowak, Robert Calderbank, "Detecting Weak but Hierarchically-Structured Patterns in Networks", Journal
of Machine Learning Research proceedings 9 (2010): 749--756,
arxiv:1003.0205
- Michael P. H. Stumpf, P. J. Ingram, I. Nouvel and Carsten Wiuf,
"Statistical model selection methods applied to biological
networks", Transactions in Computational Systems
Biology forthcoming (2005) = q-bio.MN/0506013
- Michael P. H. Stumpf and Carsten Wiuf, "Sampling properties of
random graphs: the degree
distribution", cond-math/0507345
= Physical Review
E 72 (2005): 036118
- Tiziano Squartini, Diego Garlaschelli, "Analytical maximum-likelihood method to detect patterns in real networks", New Journal of Physics
13 (2011): 083001, arxiv:1103.0701
- Lionel Tabourier, Camille Roth, Jean-Philippe Cointet, "Generating constrained random graphs using multiple edge switches", arxiv:1012.3023
- Andrew C. Thomas, Hierarchical Models for Relational Data
[Ph.D. thesis, Harvard statistics dept.,
2009; PDF]
- S.V.N. Vishwanathan, Nicol N. Schraudolph, Risi Kondor, Karsten
M. Borgwardt, "Graph
Kernels", Journal
of Machine Learning Research 11 (2010): 1201--1242
["Graphs become ever so much easier to understand when you project
them into a Hilbert space." (Not an actual quote.)]
- Sebastian Weber, Markus Porto, "Generation of arbitrarily two-point
correlated random
networks", arxiv:0708.4161
- Hal Whitehead, Analyzing Animal Societies: Quantitative Methods for Vertebrate Social Analysis [blurb]
- Ya Xu, Justin S. Dyer, Art B. Owen, "Empirical stationary correlations for semi-supervised learning on graphs", Annals of Applied Statistics 4 (2010): 589--614, arxiv:1011.1766
- Hyokun Yun, S. V. N. Vishwanathan, "Quilting Stochastic Kronecker Product Graphs to Generate Multiplicative Attribute Graphs", arxiv:1110.5383
- Hugo Zanghi, Franck Picard, Vincent Miele, and Christophe Ambroise, "Strategies for online inference of model-based clustering in large and growing networks", Annals of Applied Statistics
4 (2010): 687--714
- An Zeng, Giulio Cimini, "Removing spurious interactions in complex networks", arxiv:1110.5186
- Shuheng Zhou, John Lafferty, Larry Wasserman, "Time Varying Undirected Graphs", arxiv:0802.2758
To write:
- CRS, "Indirect Inference of Network Growth Models"
- CRS and Shawn Mankad, "Statistical Properties of Aggregated
Random Graphs"
- Co-conspirators to be named later + CRS, "Smoothing Adjacency Matrices" [if we can figure out how to do it!]
#
Sun, 22 Jan 2012
Finance, Banking, "the Markets"
Probably for as long as there has been money, there have been people who had
more of it than they wanted to spend right away, and many more people who
wanted to spend more money than they had. If money could somehow pass from the
first group to the second, people would be better off; the function of
financial markets is to ease this passage. Their point is to keep excess funds
from sitting idle, by allocating them among the different people and projects
asking for money. Ideally, just as ordinary markets allocate goods and
services to those for whom they are most "valuable" (i.e., have the highest
combination of desire and ability to pay), financial markets should
allocate money — which is, after all, a claim on the resources of the
community — to its most valuable, most productive uses.
Such markets are necessarily strange: those who have the money, the savers,
by definition do not want any tangible good or service those on the other side,
the borrowers, can currently sell. (Otherwise, it would be an ordinary
commercial transaction and not finance.) The trick is that borrowers sell
savers promises of more money in the future, in return for which they
get money now. Etymiologically, at least, to extend credit is to believe
(Latin credere) this promise. All this has been going on since
Gilgamesh was king in Uruk.
To give some concrete examples: A corporate bond is a promise by the
corporation to make regular interest payments for a number of years, ending
with a lump-sum principal payment of the bond's face value. A common stock is
a promise to get a fixed share of a firm's profits, along with a vote in how it
is run. The once-standard home mortgage was a promise to make regular payments
over, say, 30 years at a fixed interest rate, with the house, and a down
payment on it, as hostages for the fulfillment of this promise.
Because financial instruments are promises, there is an intimate connection
between them and predictions. All else being equal, how much you should pay
for a bond depends on how much you prefer money now to money later, but also on
how likely it is that the company will fulfill its promises. Turned around, a
company which is widely believed to be able to keep its promises can offer to
pay back less than one which is widely predicted to have trouble coming.
Similarly for stock: if you just buy and hold on to a stock, the price to pay
for a share depends on your prediction of the firm's future profits.
Since, as the saying goes, "prediction is difficult, especially of the
future", this makes pricing financial instruments hard enough, but there are
further complications. There are often times when lenders wish they had money
now, rather than just a promise. Demanding immediate repayment from their
debtors, while it certainly happens, often yields disappointingly little, and
can disrupt or even crush a useful enterprise that could have kept making
ordinary payments. The second big trick of financial markets is to make
promises of payment re-sellable, so that the payments go to whoever currently
owns the promise, not the original lender; the promise becomes a marketable
"security". When we speak of "the financial markets", we usually mean the
secondary markets in promises. When one of these secondary markets exists, the
value of a security depends not just on the direct, promised payments, but also
on the resale price of the security --- most notably, the value of a share of
stock depends not just on what the firm's profits will be, but also on what
other people will be willing to pay for a share of them. The latter price,
will of course, depend on the same things, but even further into the future,
and so on.
At this point, it might seem that the original objective of figuring out
good uses for the community's capital has fallen out of view; this is
superficial impression is, of course, correct. A large and able school of
economists has created a great deal of confusion on this score by pushing
something they call "the efficient markets hypothesis", which holds that it is
basically impossible to anticipate the evolution of financial market prices.
This is not quite true — it is merely very difficult and
hazardous — but in any case it is very much a separate issue from whether
securities prices actually are reliable signals about the relative value of
different uses for capital. Nonetheless, in this age of the world we have,
collectively, come to decide that financial markets beat any conceivable
alternative at this, and accordingly loaded them with more and more power and
responsibility; with what results, you can see around you.
— This does not explain how a socialist
with no formal training in economics came to write
for Quantitative
Finance and teach in
a computational
finance program, but another time.
See also:
Corporations and Corporate Finance;
Economics;
Globalization;
Time Series
Recommended (big pictures):
- Peter Bernstein, Capital Ideas
- Barry Eichengreen, Globalizing Capital: A History of the
International Monetary System [Review: Turning the Wheels]
- Justin Fox, The Myth of the Rational Market: A History of Risk, Reward, and Delusion on Wall Street [Review: Twilight of the Efficient Markets]
- John Kenneth Galbraith
- 1929: The Great Crash
- A Brief History of Financial Euphoria
- Money: Whence It Came, Where It Went
- Doug Henwood, Wall Street: How It Works and for Whom
[Free online!]
- Michael Lewis
- Liar's Poker
- The Money Culture
- Roger Lowenstein
- When Genius Failed: The Rise and Fall of Long-Term
Capital Management
- Origins of the Crash: The Great Bubble and Its
Undoing [This refers to the crash of 2002 or so, not the crash
of 2007--. It's hard to keep up.]
- Donald MacKenzie
- An Engine, Not a Camera: How Financial Models Shape
Markets
[Blurb. My comments.]
- "End-of-the-World Trade", London Review of
Books 30:9 (8 May 2008)
[Online. Nice
sociological/popular-scientific piece about credit derivatives, and broader
issues in risk assesment and the institutional
infrastructure of the financial markets.]
- Mantegna and Stanley, An Introduction to Econophysics
[Review: Not Exactly
Rocket Science]
- Karl Polanyi, The Great Transformation
- Robert J. Shiller, Irrational Exuberance
- Gillian Tett, Fool's Gold: How the Bold Dream of a Small
Tribe at J. P. Morgan Was Corrupted by Wall Street Greed and Unleashed a
Catastrophe [Review: The Tragedy
of Getting What You Want]
Recommended (close-ups):
- Francesco Audrino and Peter Bühlmann, "Splines for Financial
Volatility", Journal of the Royal
Statistical Society B 71 (2009): 655--670
- Amar Bhidé, A Call for Judgment: Sensible Finance for a Dynamic Economy [Review: Hayek contra Chicago]
- Alan Blinder, Central Banking in Theory and Practice
- Roland Bénabou, "Groupthink: Collective Delusions in
Organizations and Markets"
[PDF
preprint. A really brilliant paper on "individually rational collective
reality denial in groups, organizations and markets".]
- Joshua D. Coval, Jakub Jurek and Erik Stafford, "The Economics
of Structured Finance" (2009) [PDF preprint]
- James Crotty
- "Structural Causes of the Global Financial Crisis:
A Critical Assessment of the 'New Financial Architecture'" [PDF
preprint, August 2008. Summary: it was crazy to expect that minimally
regulated financial markets would be safe and efficient, and they weren't.]
- "If Financial Market Competition is so Intense, Why are Financial Firm Profits so High? Reflections on the Current 'Golden Age' of Finance",
Competition and Change12 (2008): 167--183
[A question of intense interest. PDF preprint]
- Marcus G. Daniels, J. Doyne Farmer, Laszlo Gillemot, Giulia Iori,
and Eric Smith, "A quantitative model of trading and price formation in
financial
markets", cond-mat/0112422
= "Quantitative Model of Price Diffusion and Market Friction Based on Trading
as a Mechanistic Random
Process", Physical Review
Letters 90 (2003): 108102
- J. Bradford DeLong
- J. Bradford DeLong and Konstantin Magin, "The U.S. Equity Return
Premium: Past, Present, and Future"
[PDF of preliminary
draft]
- J. Bradford DeLong, Andrei Shleifer, Lawrence H. Summers and Robert
J. Waldmann
- "Noise Trader Risk in Financial Markets", Journal of
Political Economy 98 (1990): 703--738 [PDF
preprint]
- "The Survival of Noise Traders in Financial Markets",
Journal of Business 64 (1991): 1--20 [PDF
preprint]
- James Dow and Gary Gorton, "Stock Market Efficiency and Economic
Efficiency: Is There a Connection?" Journal of
Finance 52 (1997): 1087--1129 [To summarize: No, there
is no connection. JSTOR]
- Eric Falkenstein, "Value-at-Risk and Derivatives Risk" ["An optimal
risk manangement process should work more at getting relevant risks on the
radar screen than measuring what appears on the screen more
precisely." PDF]
- Sanford J. Grossman and Joseph E. Stiglitz, "On the Impossibility
of Informationally Efficient Markets", American Economic
Review 70 (1980): 393--408 [This should have been the
end of the whole efficient-markets myth; alas, they failed to drive the stake
through the heart. JSTOR.]
- Kirill Ilinsky, Physics of Finance [Review: Gauge Connections for Fun and
(More Importantly) Profit]
- Simon Johnson and James Kwak, "Finance: Before the Next
Meltdown", Democracy 13 (Summer 2009)
- Magdoff and Sweezy
- The Irreversible Crisis
- Stagnation and the Financial Explosion
- Thomas Mikosch, "Copulas: Tales and Facts"
[PDF
preprint]
- Moody's Global Risk Analysis Group, "Archaeology of the Crisis"
[January 2008; link to free PDF, registration required;
some commentary by Andrew Leonard]
- Heinz Pagels, "The Quick Buck Becomes
Quicker"
- Frank Partnoy, F.I.A.S.C.O.: Blood in the Water on Wall
Street [Life as a credit derivative salesman at Morgan Stanley in the
early 1990s]
- Ser-Huang Poon and Clive W. J. Granger, "Forecasting Volatility in
Financial Markets: A Review", Journal of Economic Literature
41 (2003): 478--539 [JSTOR]
- William Poundstone, Fortune's Formula
[The saga of Kelly gambling]
- Riccardo Rebonato, Plight of the Fortune Tellers: Why We Need
to Manage Financial Risk Differently [Very good, except that what
Rebonato thinks is his philosophical position about the foundations of
statistics is wrong, wrong, wrong. Fortunately, his recommendations
are not actually based on his stated position.
(Further to this point.) Hopefully to be
reviewed...]
- John Roemer, A Future for Socialism [Why the
Revolution needs a stock market. Review: The Red Monday Efficient
Allocation Blues]
- Robert J. Shiller, Market Volatility
- A. N. Shiryaev, Essentials of Stochastic Finance
[Review]
- David Skeel and Frank Partnoy, "The Promise and Perils
of Credit Derivatives", 2008 [SSRN]
- Eric Smith, J. Doyne Farmer, Laszlo Gillemot, and Supriya
Krishnamurthy, "Statistical theory of the continuous double
auction", cond-mat/0210475
= Quantitative
Finance 3 (2003): 481--514
Not altogether recommended:
- Richard Bookstaber, A Demon of Our Own Design: Markets, Hedge Funds, and the Perils of Financial Innovation [Comments]
- George Cooper, The Origin of Financial Crises: Central Banks,
Credit Bubbles and the Efficient Market Fallacy [Comments]
- Charles P. Kindleberger, Manias, Panics and Crashes: A
History of Financial Crises [Comments]
- Andrew Lo, Hedge Funds: An Analytic Perspective
[Really a not-especially-well-integrated selection of Lo's recent papers. Some
interesting material but definitely for specialists only (who may well have
read the papers already). Some truly unholy linear
and logistic regressions.]
- Terence C. Mills and Raphael N. Markellos, The Econometric
Modelling of Financial Time Series [It's no worse than any other textbook
on the subject I've seen, but I can't make myself be any more enthusiastic.]
- Robert Shiller, The New Financial Order: Risk in the
Twenty-First Century [I agree with everything in
the eviscerating
review by Steve Laniel]
Pride compels me to recommend:
- Linqiao Zhao, A Model of Limit-Order
Book Dynamics and a Consistent Estimation
Procedure, Ph.D. thesis, Statistics Department, Carnegie
Mellon University, 2010 [PDF draft]
To read:
- Alessandro Andreoli, Francesco Caravenna, Paolo Dai Pra, Gustavo Posta, "Scaling and multiscaling in financial indexes: a simple model", arxiv:1006.0155
- Roy E. Bailey, The Economics of Financial Markets
[blurb]
- George P. Baker and George David Smith, The New Financial
Capitalists: Kohlberg Kravis Roberts and the Creation of Corporate Value
[Blurb]
- Ole E. Barndorff-Nielsen and Neil Shephard, "Econometric analysis
of realised covariation: high frequency covariance, regression and correlation
in financial economics"
[PDf]
- Kevin E. Bassler, Joseph L. McCauley, Gemunu H. Gunaratne,
"Nonstationary Increments, Scaling Distributions, and Variable Diffusion
Processes in Financial Markets",
physics/0609198
- Erhan Bayraktar, Ulrich Horst and Ronnie Sircar
- "Queueing Theoretic Approaches to Financial Price Fluctuations",
math.PR/0703832
- "A Limit Theorem for Financial Markets with Inert Investors",
math.PR/0703831
- Ricardo Bebczuk, Asymmetric Information in Financial Markets: Introduction and Applications [Blurb]
- William T. Bernhard and David Leblang, Democratic Processes
and Financial Markets: Pricing Politics
[Blurb]
- Lucy Bernholz, Creating Philanthropic Capital Markets: The
Deliberate Evolution [Author's book site]
- Paul Blustein, And the Money Kept Rolling in (and Out): Wall
Street, the IMF, and the Bankrupting of Argentina
- Peter Bossaerts, The Paradox of Asset Pricing
[Blurb, ch. 1]
- Jean-Philippe Bouchaud
- "An introduction to statistical finance,"
Physica A 313 (2002): 238--251
[PDF]
- "The subtle nature of financial random walks",
Chaos 15
(2005): 026104
- Reuven Brenner, Force of Finance: Triumph of the Capital
Markets
- Laurent E. Calvet and
Adlai J. Fisher, Multifractal Volatility: Theory, Forecasting, and
Pricing [Thanks to Prof. Calvet for bringing this to my attention]
- Youssef Cassis, Capitals of Capital: A History of
International Financial Centres, 1780--2005 [blurb
- Carl Chiarella and Giulia Iori
- P. Cizek, W. Hardle, V. Spokoiny, "Adaptive pointwise estimation in time-inhomogeneous conditional heteroscedasticity models", arxiv:0903.4620 [I'm more interested in the idea of adaptively estimating non-stationary time series here than the finance application...]
- Michel M. Dacorogna, Ramazan Gencay, Ulrich A. Müller, Richard
B. Olsen and Olivier V. Pictet, An Introduction to High-Frequency
Finance
- Satyajit Das, Traders Guns and Money: Knowns and Unknowns
in the World of Derivatives
- E. Philip Davis and Benn Steil, Institutional
Investors [Blurb]
- Gerald F. Davis, Managed by the Markets: How Finance
Re-Shaped America
- Davis, Duffie, Fleming and Shreve (eds.), Mathematical
Finance
- Paul De Grauwe and Marianna Grimaldi, The Exchange Rate in a
Behavioral Finance Framework
[Blurb, ch. 1]
- Emanuel Derman
- "The Perception of Time, Risk and Return During
Periods of Speculation," cond-mat/0201345
- My Life as a Quant
- Wen-Qi Duan and H. Eugene Stanley, "Volatility, irregularity, and predictable degree of accumulative return series", Physical Review E
81 (2010): 066116
- Gerard Dumenil and Dominique Levy, Capital Resurgent: Roots
of the Neoliberal Revolution [Blurb]
- Barry Eichengreen
- Capital Flows and Crises
- Toward a New International Financial Architecture: A
Practical Post-Asia Agenda
- Robert Engle
- Cheoljun Eom, Gabjin Oh, Woo-Sung Jung, "Relationship between
degree of efficiency and prediction in stock price
changes", arxiv:0708.4178 [I
should read this before dismissing it, but it seems from the abstract that
they're almost missing the point...]
- Todd Feldman, "Portfolio Manager Behavior and Global
Financial Crises" [PDF]
- Dean P. Foster and H. Peyton Young, "Gaming Performance Fees by
Portfolio
Managers", The
Quarterly Journal of Economics 125 (2010):
1435--1458 ["there exists no compensation mechanism that separates skilled
from unskilled managers solely on the basis of their returns histories."]
- Philip Hans Franses and Dick Van Dijk, Non-Linear Time Series
Models in Empirical Finance
- Anne Goldgar, Tulipmania: Money, Honor, and Knowledge in the Dutch Golden Age
- Vygintas Gontis and Bronislovas Kaulakys
- Martin D. Gould, Mason A. Porter, Stacy Williams, Mark McDonald, Daniel J. Fenn, Sam D. Howison, "The Limit Order Book: A Survey", arxiv:1012.0349
- Christian Gourieroux and Joann Jasiak, The Econometrics of
Individual Risk: Credit, Insurance, and Marketing
[Blurb, ch. 1]
- Richard S. Grossman, Unsettled Account:
The Evolution of Banking in the Industrialized World since 1800
[Blurb, ch. 1]
- Larry Harris, Trading and Exchanges: Market Microstructure
for Practitioners
- Jasmina Hasanhodzic, Andrew W. Lo, Emanuele Viola, "A Computational View of Market Efficiency", arxiv:0908.4580
- Christopher Hoag, "The Atlantic Telegraph Cable and Capital Market
Information
Flows", The Journal
of Economic History 66 (2006): 342--353 ["an event
study on the introduction of the Atlantic Cable in July 1866. Using daily data
on one security with a dual listing on the New York and London stock exchanges
... the information lag between the two markets shortened from ten days to zero
days. Cointegration analysis confirms the result. Historical markets priced
securities so well that transatlantic steamship crossing times can be recovered
from stock prices."]
- Jacques Janssen, Semi-Markov Risk Models for Finance,
Insurance and Reliability
- Eric Jondeau, Ser-Huang Poon and Michael Rockinger, Financial
Modeling Under Non-Gaussian Distributions
- Taisei Kaizoji, "Power laws and market
crashes", physics/0603138
- Ethan B. Kapstein, Governing the Global Economy:
International Finance and the State
- Michael Kearns, Alex Kulesza and Yuriy Nevmyvaka, "Empirical Limitations on High Frequency Trading Profitability", arxiv:1007.2593
- Dan Krier, Speculative Management: Stock Market Power and
Corporate Change
- Greta R. Krippner,
Capitalizing on Crisis:
The Political Origins of the Rise of Finance [Blurb]
- Edward LiPuma and Benjamin Lee, Financial Derivatives
and the Globalization of Risk [Blurb]
- James Macdonald, A Free Nation Deep in Debt: The Financial
Roots of Democracy
[Blurb, intro]
- Randy Martin, Financialization of Daily Life
- Hilton McCann, Offshore Finance
[blurb]
- Perry Mehrling
- The New Lombard Street:
How the Fed Became the Dealer of Last Resort [Blurb, introduction]
- Fischer Black and the Revolutionary Idea of Finance
- Ross M. Miller, "Don't Let Your Robots Grow Up To Be Traders:
Artificial Intelligence, Human Intelligence, and Asset-Market Bubbles"
[PDF]
- Bernadette A. Minton, Rene M. Stulz and Rohan Williamson,
"How Much Do Banks Use Credit Derivatives to Reduce Risk?"
[SSRN/785364. But data are from 1999--2003.]
- J. F. Muzy, E. Bacry and A. Kozhemyak, "Extreme values and fat
tails of multifractal
fluctuations", Physical Review
E 73 (2006): 066114
= cond-mat/0509357
[Abstract promises financial applications]
- Heikki Patomaki, Democratizing Globalization: The Leverage
of the Tobin Tax
- Louis M. Pauly, Who Elected the Bankers? Surveillance and
Control in the World Economy
- Huyen Pham, "Some applications and methods of large deviations in
finance and
insurance",math.PR/0702473
- Jocelyn Pixley, Emotions in Finance: Distrust and Uncertainty
in Global Markets [Blurb]
- Riccardo Rebonato, Coherent Stress Testing: A Bayesian
Approach to the Analysis of Financial Stress
[Review by
Danny Yee]
- Carmen M. Reinhart
and Kenneth
S. Rogoff, This Time Is Different: Eight Centuries of Financial
Folly
[blurb. This
paper on the now-contemporary US financial crisis is a bit of a preview,
apparently.]
- Mark J. Roe, Strong Managers, Weak Owners: The Political
Roots of American Corporate Finance
- Bertrand M. Roehner, Patterns of Speculation: A Study in
Observational Econophysics
[Blurb]
- Frank Schwed, Where Are the Customers' Yachts? or a Good Hard
Look at Wall Street
- Kenneth J. Singleton, Empirical Dynamic Asset Pricing: Model
Specification and Econometric Assessment
[Blurb, with links to
PDFs of first three chapters]
- D. Sornette, "Critical Market Crashes,"
cond-mat/0301543 [90 page
summary of his book Why Stock Markets Crash, and innumerable
other papers]
- David Strang, Learning by Example:
Imitation and Innovation at a Global Bank [Blurb, intro]
- Susan Strange, Mad Money: When Markets Outgrow
Governments
- Torsten Strulik and Helmut Willke (eds.), Towards a Cognitive
Mode in Global Finance?: The Governance of a Knowledge-Based Financial
System
[Blurb]
- Stephen J. Taylor, Asset Price Dynamics, Volatility, and
Prediction [Blurb,
introduction; author's
book-site]
- Kostas Triantafyllopoulos, Giovanni Montana, "Dynamic modeling of mean-reverting spreads for statistical arbitrage", arxiv:0808.1710
- Theodoros Tsagaris, Ajay Jasra, Niall Adams, "Robust and Adaptive Algorithms for Online Portfolio Selection", arxiv:1005.2979
- Samuel E. Vazquez, Simone Farinelli, "Gauge Invariance, Geometry
and Arbitrage", arxiv:0908.3043
- R. Vilela Mendes, R. Lima and T. Araujo, "A Process-Reconstruction
Analysis of Market Fluctuations," cond-mat/0102301
- Xavier Vives, Information and Learning in Markets: The
Impact of Market Microstructure [Blurb, ch. 1, ch. 7, lecture slides]
- David Weiss, After the Trade Is Made: Processing Securities
Transactions
- Biao Wu, "Interacting Agent Feedback Finance Model",
math.PR/0703827
- Caitlin Zaloom, Out of the Pits: Traders and Technology from Chicago to London [blurb]
#
Ensemble Methods in Machine Learning
Boosting, bagging, binning, stacking, mixtures of experts, ...
I have an Idea about how to use model averaging to cope with non-stationary
time series forecasting, but need to find time
to work on it.
Value of diversity.
See also:
Collective Cognition;
Learning Theory;
Model Selection
Recommended (totally inadequate, what happened to come to mind cleaning
up my files):
- Sanjeev Arora, Elad Hazan and Satyen Kale, "The Multiplicative
Weights Update Method: a Meta Algorithm and Applications "
[PDF
preprint. This is an interesting kind of result, which promises
performance which comes to close that achieved by any strategy within a fixed
class, no matter what sequence of data is observed --- but it's
performance on that sequence, which, as the saying goes, "is no
guarantee of future results". Cesa-Bianchi and Lugosi's book has a lot more
along these lines.]
- Nicolo Cesa-Bianchi and Gabor Lugosi, Prediction, Learning,
and Games [Mini-review]
- Gerda Claeskens and Nils Lid Hjort, Model Selection
and Model Averaging
- Pedro
Domingos, "The Role of Occam's Razor in Knowledge Discovery," Data
Mining and Knowledge Discovery, 3 (1999) [Online.
Ensemble methods as an apparent violation of Occam's Razor.]
- Elad Hazan and Satyen Kale, "Extracting certainty from uncertainty: regret bounded by variation in costs", Machine Learning 80 (2010): 165--188
- Robert Kleinberg, Alexandru Niculescu-Mizil, Yogeshwer Sharma,
"Regret Bounds for Sleeping Experts and Bandits", Machine Learning 80 (2010): 245--272
- J. Zico Kolter and Marcus A. Maloof, "Dynamic Weighted Majority: An
Ensemble Method for Drifting
Concepts", Journal
of Machine Learning Research 8 (2007): 2755--2790
- A. Juditsky, P. Rigollet, A. B. Tsybakov, "Learning by mirror averaging", arxiv:math/0511468 =
Annals of Statistics 36 (2008): 2183--2206
- G. Langer and U. Parlitz, "Modeling parameter dependence from time
series", Physical
Review E 70 (2004): 056217 [Interesting use of
ensemble methods in state space modeling]
- Laurence K. Saul and Michael I. Jordan, "Mixed Memory Markov
Models: Decomposing Complex Stochastic Processes as Mixtures of Simpler
Ones", Machine Learning 37 (1999): 75--87
- Robert E. Schapire, Yoav Freund, Peter Bartlett and Wee Sun Lee,
"Boosting the Margin: A New Explanation for the Effectiveness of Voting
Methods", Annals of Statistics 26 (1998):
1651--1686
- Kyupil Yeon, Moon Sup Song, Yongdai Kim, Hosik Choi, Cheolwoo
Park, "Model averaging via penalized regression for tracking concept
drift", Journal of Computational and Graphical
Statistics online before print (2010)
To read:
- Pierre Alquier and Olivier Wintenberger, "Model selection and randomization for weakly dependent time series forecasting", arxiv:0902.2924
- Ran Avnimelech and Nathan Intrator, "Boosted Mixture of Experts: An
Ensemble Learning Scheme", Neural Computation 11
(1999): 483--497
- Larry M. Bartels, "Specification Uncertainty and Model
Averaging", American Journal of Political
Science 41 (1997): 641--674
- Gérard Biau, Luc Devroye and Gábor Lugosi,
"Consistency of Random Forests and Other Averaging Classifiers",
Journal of Machine Learning Research 9
(2008): 2015--2033 ["In the last years of his life, Leo Breiman promoted
random forests for use in classification. He suggested using averaging as a
means of obtaining good discrimination rules. The base classifiers used for
averaging are simple and randomized, often based on random samples from the
data. He left a few questions unanswered regarding the consistency of such
rules. In this paper, we give a number of theorems that establish the universal
consistency of averaging rules. We also show that some popular classifiers,
including one suggested by Breiman, are not universally consistent."]
- Gavin Brown, Jeremy L. Wyatt and Pter Tino, "Managing Diversity
in Regression Ensembles", Journal of Machine Learning Research
6 (2005): 1621--1650
- Bruno Caprile, Cesare Furlanello and Stefano Merler, "The Dynamics
of AdaBoost Weights Tells You What's Hard to Classify," cs.LG/0201014
- Kamalika Chaudhuri, Yoav Freund, Daniel Hsu, "A parameter-free hedging algorithm", arxiv:0903.2851 [Doing about as well as a given fraction of the ensemble]
- Zhuo Chen and Yuhong Yan, "Time Series Models for Forecasting:
Testing or Combining?", Studies in Nonlinear Dynamics and
Econometrics 11:1 (2007): 3
- Matthieu Cornec, "Estimating Subbagging by cross-validation", arxiv:1011.5142
- M. Di Marzio and C. C. Taylor, "Kernel density classification and
boosting: an L2 analysis", Statistics and
Computing 15 (2005): 113--123
- Yoav Freund, "A more robust boosting algorithm", arxiv:0905.2138
- Yoav Freund, Yishay Mansour and Robert E. Schapire, "Generalization
bounds for averaged classifiers", Annals of
Statistics 32 (2004): 1698--1722 = math.ST/0410092
- Yoav Freund, Robert E. Schapire, Yoram Singer and Manfred
K. Warmuth, "Using and combining predictors that specialize" [PDF
preprint]
- Jerome H. Friedman, Bogdan E. Popescu, "Predictive learning via rule ensembles", arxiv:0811.1679
- G. Fumera and F. Roli, "A Theoretical and Experimental Analysis of
Linear Combiners for Multiple Classifier Systems", IEEE Transactions on
Pattern Analysis and Machine Intelligence 27 (2005):
942--956
- Stéphane Gaïffas and Guillaume Lecué,
"Hyper-Sparse Optimal Aggregation", Journal of Machine
Learning Research 12 (2011): 1813--1833
- Nicolas Garcia-Pedrajas, Cesar Garcia-Osorio and Colin Fyfe,
"Nonlinear Boosting Projections for Ensemble Construction",
Journal
of Machine Learning Research 8 (2007): 1--33
- Alexander Goldenshluger, "A universal procedure for
aggregating estimators", arxiv:0704.2500 = Annals of Statistics 37 (2009): 542--568
- Etienne Grossmann, "A Theory of Probabilistic Boosting, Decision
Trees and Matryoshki", cs.LG/0607110
- S. Gualdi, A. De Martino, "How does informational heterogeneity affect the quality of forecasts?", arxiv:0906.0552
- Jakob Vogdrup Hansen, Combining Predictors: Meta Machine
Learning Methods and Bias/Variance & Ambiguity Decompositions [Ph.D.
thesis, University of Aarhus, 2000;
on-line]
- Geoffrey E. Hinton, "Training Products of Experts by Minimizing
Contrastive Divergence," Neural Computation 14
(2002): 1771--1800.
- Benjamin Hofner, Torsten Hothorn, Thomas Kneib, and Matthias Schmid, "A Framework for Unbiased Model Selection Based on Boosting", Journal of Computational and Graphical Statistics
forthcoming (2011)
- Marcus Hutter and Jan Poland, "Adaptive Online Prediction by
Following the Perturbed Leader", cs.AI/0504078 = Journal of
Machine Learning Research 6 (2005): 639--660
- Robert A. Jacobs, "Bias/Variance Analyses of Mixtures-of-Experts
Architectures", Neural Computation 9 (1997):
369--383 ["This article investigates the bias and variance of
mixtures-of-experts (ME) architectures. The variance of an ME architecture can
be expressed as the sum of two terms: the first term is related to the
variances of the expert networks that comprise the architecture and the second
term is related to the expert networks' covariances. One goal of this article
is to study and quantify a number of properties of ME architectures via the
metrics of bias and variance. A second goal is to clarify the relationships
between this class of systems and other systems that have recently been
proposed. It is shown that in contrast to systems that produce unbiased experts
whose estimation errors are uncorrelated, ME architectures produce biased
experts whose estimates are negatively correlated."]
- Wenxin Jiang, "Boosting with Noisy Data: Some Views from
Statistical
Theory", Neural
Computation 16 (2004): 789--810
- Jeremy Z. Kolter and Marcus A. Maloof, "Using Additive Expert
Ensembles to Cope with Concept Drift", ICML 2005
[PDF
reprint via Kolter]
- Nicole Kraemer, "Boosting for Functional
Data", math.ST/0605751
- Ludmila I. Kuncheva, Combining Pattern Classifiers: Methods
and Algorithms
- Guillaume Lecu&eaucte;, "Lower Bounds and Aggregation in Density
Estimation", Journal of
Machine Learning Research
7 (2006): 971--981
- David Mease, Abraham J. Wyner and Andreas Buja, "Boosted
Classification Trees and Class Probability/Quantile
Estimation", Journal
of Machine Learning Research 8 (2007): 409--439
- Nicolai Meinshausen, "Forest
Garrote", arxiv:0906.3590
- David J. Miller and Siddharth Pal, "Transductive Methods for the
Distributed Ensemble Classification Problem", Neural
Computation 19 (2007): 856--884
- Andriy Norets, "Approximation of conditional densities by smooth mixtures of regressions", Annals of Statistics 38
(2010): 1733--1766, arxiv:1010.0581
- L. Nunes and E. Oliveira, "On Learning by Exchanging
Advice," cs.LG/0203010
- Frenando C. Pereira and Yoram Singer, "An Efficient Extension to
Mixture Techniques for Prediction and Decision Trees", Machine
Learning 36 (1999): 183--199
- Evgueni Petrov, "Constraint-based analysis of composite
solvers," cs.AI/0302036
- Benedikt M. Pötscher, "The distribution of model averaging
estimators and an impossibility result regarding its estimation", arxiv:math/0702781
- Philippe Rigollet, "Maximum likelihood aggregation and
misspecified generalized linear models", arxiv:0911.2919
- Robert E. schapire and Yoav Freund, Boosting: Foundations
and Algorithms [Blurb]
- Yoram Singer, "Adaptive Mixtures of Probabilistic Transducers", Neural
Computation 9 (1997): 1711--1733 [PS.gz preprint]
- David S. Siroky, "Navigating Random Forests and related advances in algorithmic modeling", Statistics Surveys 3
(2009): 147--163
- Eiji Takimoto and Akira Maruoka, "Top-down decision tree learning
as information based boosting," Theoretical
Computer Science 292 (2002): 447-464
- Peter Welinder, Steve Branson, Serge Belongie and Pietro Perona,
"The Multidimensional Wisdom of Crowds", NIPS 2011 (NIPS 23) [PDF reprint]
- Héla Zouari, Laurent Heutte and Yves Lecourtier,
"Controlling the diversity in classifier ensembles through a measure of
agreement", Pattern
Recognition 38 (2005): 2195--2199
To write:
- CRS, "Adapting to non-stationarity with growing predictor ensembles"
#
Thu, 19 Jan 2012
Collective Cognition
Rather than repeating myself about what I mean by "collective cognition", I
refer you to my review of Ed
Hutchins's
Cognition in the Wild, and
the introduction
to the 2002 SFI Workshop on
Collective Cognition I co-organized (that introduction is primarily based
on an essay I wrote as a distraction from finishing my dissertation). I stole
the phrase from Philip
Agre, who told me he doesn't remember whence he got it. (This is fitting.)
The workshop was my first experience of helping to organizing a scientific
meeting, and quite enlightening. The focus shifted quite a bit from what I
originally had in mind, but I still think the papers presented were good; many
of them are available via the link for the workshop above.
Prediction markets, which I think are horribly over-rated, probably
deserve a notebook of their own.
See also:
Computational Models of
Linguistic Evolution;
Duality between Knowledge Centralization and Market Completeness;
Emergent Properties;
Ensemble Methods in Machine Learning;
Evolving Local Rules to Perform Global Computations;
Flocking and Swarms;
Institutions;
Sociology of Science
Recommended, non-academic:
- John Dewey, The Public and Its Problems [My
Mini-review]
- Malcolm Gladwell, "Group Think"
[online]
- Steve Berlin Johnson
- James Surowiecki, The Wisdom of Crowds: Why the Many Are
Smarter Than the Few and How Collective Wisdom Shapes Business, Economics,
Societies, and Nations [A pop-science book on precisely this subject,
which disappoints me, because I'd entertained fantasies of writing one myself.
It's interesting and well-written, and I certainly recommend it. But it's
limited by the fact that Surowiecki has essentially one picture of how
collective cognition could work, namely averaging a lot of guesses which are
randomly and independently distributed around the true answer --- in other
words, the law of large numbers. This makes the cases where collective
cognition depends very strongly on social interactions
(science and
democracy especially) unduly puzzling to him.
Also, he is entirely too credulous about prediction markets. There's a good review
by Scott McLemee, and another one by Cass
Sunstein.]
Recommended, academic:
- Philip Agre
- "Growing a Democratic Culture: John Commons on the
Wiring of Civil Society" [draft]
- "Supporting the Intellectual Life of a Democratic
Society", Ethics and Information Technology, 3:4
(2001): 289--298 [draft]
- Elizabeth Anderson, "The Epistemology of
Democracy", Episteme:
Journal of Social Epistemology 3 (2006): 8--22 [See
comments under Democracy]
- Roland Bénabou, "Groupthink: Collective Delusions in
Organizations and Markets"
[PDF
preprint. A really brilliant paper on "individually rational collective
reality denial in groups, organizations and markets".]
- David Braybrooke and Charles E. Lindblom, A Strategy of
Decision: Policy Evaluation as a Social Process [Mini-review]
- Christophe Chamley, Rational Herds: Economic Models of Social
Learning
- Andy Clark, Natural-Born Cyborgs: Minds, Technologies, and
the Future of Human Intelligence [The whole book is very good and
relevant to the topic, but chapter 6 is especially salient.]
- Robert S. Erikson and Christopher Wlezien, "Are Political Markets Really Superior to Polls as Election Predictors?" [Ans.:
No. PDF
preprint]
- F. A. Hayek, Individualism and Economic Order
[Especially the essays "Economics
and Knowledge"
and "The Use of
Knowledge in Society"]
- Dante R. Chialvo and Mark M. Millonas, "How Swarms Build
Cognitive Maps", [SFI Working
Paper 95-03-033]
- L. Conradt and T. J. Roper, "Group decision-making in animals", Nature 421
(2003): 155--158
- Esther Herrmann, Josep Call, María Victoria Hernàndez-Lloreda, Brian Hare and Michael Tomasello,
"Humans Have Evolved Specialized Skills of Social Cognition: The Cultural Intelligence Hypothesis", Science 317 (2007): 1360--1366
- Lu Hong and Scott
E. Page, "Groups of diverse problem solvers can outperform groups of
high-ability problem
solvers", Proceedings
of the National Academy of Sciences (USA) 101 (2004):
16385--16389 [free
PDF]
- Edwin Hutchins, Cognition in the Wild [Review: Naval Collective Intelligence]
- Ali Jadbabaie, Alvaro Sandroni and Alireza Tahbaz-Salehi,
"Non-Bayesian social Learning", SSRN/1550809
- Rob Johnston, "Integrating Methodologists into Teams of Substantive
Experts", Studies in
Intelligence 47:1 (2003): 6
[My comments/excerpts]
- Stephen Judd, Michael Kearns, and Yevgeniy Vorobeychik, "Behavioral dynamics and influence in networked coloring and consensus", Proceedings of the National Academy of Sciences
107 (2010): 14978--14982 [The collective-level results
here are extremely interesting; however, I find the way they
measure "individual influence" odd,
and am reluctant to conclude much from it.]
- Philip Kitcher, The Advancement of Science: Science without
Legend, Objectivity without Illusions
- Patrick R. Laughlin, Group Problem Solving
[Mini-review]
- David Lazer and Allan Friedman, "The Network Structure of Exploration and Exploitation", Administrative Science Quarterly
52 (2007): 667--694
[PDF
reprint via Prof. Lazer]
- Winter A. Mason, Andy Jones and Robert L. Goldstone, "Propagation
of Innovations in Networked
Groups", Journal of
Experimental Psychology: General 137 (2008):
427--433 [What kind of network works best for spreading useful discoveries
depends on how hard the problem being solved is; ones requiring more
exploration actually benefit from making it harder to make long-range
connections. It would be interesting to see what happens with a more
hierarchical network structure than the one they explored...]
- Winter Mason and Duncan J. Watts, "Collaborative Learning in Networks", Proceedings of the National Academy of Sciences (USA) 109 (2012): 764--769
- Neil Mercer, Words and Minds: How We Use Language to Think
Together [Mini-review]
- Josiah Ober
- Nienke Oomes, "Market Failures in the Economics of Science", [ch. 3
of Dr. Oomes's dissertation (Essays on Network Externalities and
Aggregate Persistence, Economics Dept., UW-Madison, 2001), hopefully
appearing in journal form sometime soon]
- Camille Roth and Paul Bourgine [My
comments]
- Thomas Schelling, Micromotives and Macrobehavior
- Dan Sperber, Explaining Culture: A Naturalistic
Approach [Review: How to Catch
Insanity from Your Kids (Among Others); or, Histoire naturelle de
l'infame]
- Dan Sperber and Hugo Mercier, "Reasoning as a Social
Competence", forthcoming in H. Landemore and J. Elster (eds.),
Collective Wisdom [preprint]
- William P. Thurston, "On proof and progress in mathematics",
arxiv:math.HO/9404236
[Comments
by Jordan Ellenberg]
- Stephen Toulmin, Human Understanding: The Collective Use and
Evolution of Concepts
- Lev Vygotsky, Mind in Society: The Development of Higher Psychological
Processes [Mini-review]
- David H. Wolpert and Kagan Tumer, "An Introduction to
Collective Intelligence", cs.LG/9908014
- Monika Wulz, "Collective Cognitive Processes around 1930. Edgar Zilsel's Epistemology of Mass Phenomena", phil-sci/4740
- H. Peyton Young, Individual Strategy and Social Structure: An
Evolutionary Theory of Institutions [Review: A Myopic (and Sometimes Blind) Eye on the Main Chance, or, the Origins of
Custom]
To read:
- Mark Ackerman, Volkmar Pipek and Volker Wulf (eds.), Sharing
Expertise: Beyond Knowledge Management [Preface, 59k
PDF]
- Daron Acemoglu, Asuman Ozdaglar, Ali ParandehGheibi, "Spread of Misinformation in Social Networks", arxiv:0906.5007
- Michael Bacharach (ed. Natalie Gold and Robert
Sugden), Beyond Individual Choice: Teams and Grames in Game Theory
[Blurb, ch. 1]
- David Barton and Karin Tusting, Beyond Communities of Practice: Language, Power and Social Context
- J. B. Batista and L. da F. Costa, "Knowledge acquisition by
networks of interacting agents in the presence of observation
errors", Physical
Review E 82 (2010): 016103
- Eric Baum and Igor Durdanovic, "Evolution of Cooperative Problem
Solving in an Artificial Economy", Neural
Computation 12 (2000): 2743--2775
- Bahador Bahrami, Karsten Olsen, Peter E. Latham, Andreas
Roepstorff, Geraint Rees and Chris D. Frith, "Optimally
Interacting Minds", Science
329 (2010): 1081--1085
- Marcel Blattner, Alexander Hunziker, Paolo Laureti, "When are
recommender systems
useful?", arxiv:0709.2562
- Xavier de Souza Briggs, Democracy as Problem Solving:
Civic Capacity in Communities Across the Globe [blurb]
- William A. ("Buzz") Brock and Steven N. Durlauf, "A
Formal Model of Theory Choice in Science", Economic Theory
14 (1999): 113--130 [PDF preprint]
- Henrik Bruun and Seppo Sierla, "Distributed Problem Solving in
Software Development: The Case of an Automation
Project", Social
Studies of Science 38 (2008): 133--158
- Pascal Boyer and James V. Wertsch (eds.), Memory in
Mind and Culture [blurb]
- Michel Callon and Fabian Muniesa, "Economic Markets as Calculative
Collective Devices" [Online preprint, but one is told there to quote "Les
marchés économiques comme dispositifs collectifs de
calcul", Réseaux 21(122), pp. 189-233.]
- Myong-Hun Chang and Joseph E. Harrington, "Discovery and Diffusion of Knowledge in an Endogenous Social Network", American Journal of Sociology 110 (2005): 937--976
- Kay-Yut Chen, Leslie R. Fine and Bernardo A. Huberman
- David Chisholm, Coordination without Hierarchy: Informal
Structures in Multiorganizational Systems
[Blurb]
- Herbert H. Clark, Using Language
[Blurb]
- Larissa Conradt and Timothy J. Roper, "Consensus decision making in
animals", Trends
in Ecology and Evolution 20 (2005): 449--456
- Mauro Copelli, Antonio C. Roque, Rodrigo F. Oliveira and Osame
Kinouchi, "Enhanced dynamic range in a sensory network of excitable
elements",
cond-mat/0112395 [Hey,
it's a start]
- Robin Cowan and Nicolas Jonnard, "Network structure and the
diffusion of knowledge", Journal of Economic
Dynamics and Control 28 (2004): 1557--1575
- Fred D'Agostino, Free Public Reason: Making It Up As We
Go
- Paul A. David, "Communication Norms and the Collective Cognitive
Performance of 'Invisible Colleges' ", in Creation and Transfer of
Knowledge: Institutions and Incentives, eds. G. Barba Navaretti, P.
Dasgupta and K.G. Maler, Berlin, Springer Verlag (1998)
- Rogier De Langhe and Matthias Greiff, "Standards and the
distribution of cognitive labour", phil-sci/4967
- Itiel Dror and Stevan Harnad, "Offloading Cognition onto Cognitive Technology", arxiv:0808.3569
- Darrell Duffie, Gaston Giroux, Gustavo Manso, "Information
Percolation", arxiv:0811.3024
- Darrell Duffie, Semyon Malamud, Gustavo Manso, "Information Percolation with Equilibrium Search Dynamics", arxiv:0811.3023
- Michael P. Farrell, Collaborative Circles: Friendship
Dynamics and Creative Work [Blurb]
- Jose F. Fontanari, "Social interaction as a heuristic for combinatorial
optimization problems", Physical Review E strong>82 (2010): 056118
- John Forester, The Deliberative Practitioner: Encouraging
Participatory Planning Processes [blurb]
- David Gamarnik, David Goldberg, Theophane Weber, "Correlation Decay in Random Decision Networks", arxiv:0912.0338
- Simon Garrod and Martin J. Pickering, "Why is conversation so
easy?", Trends in
Cognitive Sciences 8 (2004): 8--11
- Rishab Aiyer Ghosh (ed.), CODE: Collaborative Ownership and
the Digital Economy
[Blurb]
- Luc-Alain Giraldeau and Thomas Caraco, Social Foraging
Theory [Blurb]
- Alvin Goldman, Knowledge in a Social World
- Benjamin Golub, Matthew O. Jackson, "How Homophily Affects Diffusion and Learning in Networks", arxiv:0811.4013
- Robert L. Goldstone, Michael E. Roberts and Todd M. Gureckis,
"Emergent Processes in Group
Behavior", Current
Directions in Psychological Science 17 (2008):
10--15
- Patrick Grim, Paul St. Denis and Trina Kokalis, "Information and
Meaning: Use-Based Models in Arrays of Neural Nets", Minds and
Machines 14 (2004): 43--66 [From the abstract: "What we
offer here are simple computational models that show emergence of meaning and
information transfer in randomized arrays of neural nets. These we take to be
formal instantiations of a tradition of theories of meaning as use. What they
offer, we propose, is a glimpse into the origin and dynamics of at least simple
forms of meaning and information transfer as properties inherent in behavioral
coordination across a community." Or: Wittigenstein mechanized.]
- S. Gualdi, A. De Martino, "How does informational heterogeneity affect the quality of forecasts?", arxiv:0906.0552
- Maurice Halbwachs, On Collective Memory
- Steven Harnad, "Distributed Processes, Distributed Cognizers and
Collaborative Cognition", Pragmatics and
Cognition 13 (2005): 501--514
= cogprints/4765 ["there is no such
thing as distributed cognition, only collaborative cognition"]
- Stephan Hartmann, Gabriella Pigozzi and Jan Sprenger, "Reliable
Methods of Judgment Aggregation", phil-sci/4610
- Stephan Hartmann and Jan Sprenger, "Judgment Aggregation and the Problem of Tracking the Truth", phil-sci/4765
- Ming-Feng He, Cheng-Rui Deng, Lin Feng and Bo-Wen Tian, "A Cellular
Automata Model for a Learning
Process", Advances
in Complex Systems 7 (2004): 433--439 [From the
abstract: "Ideas on educational psychology suggest that a learning process
occurs when people participate within social communities. A model is
constructed based on two primary factors in the learning process: knowledge
storage and interactive ability of each person. Results of simulations are
consistent with some actual phenomena including the average knowledge achieved
and different educational effects under different conditions." I confess to a
certain skepticism, not having read any more than this.]
- Pamela J. Hinds and Sara Kiesler, Distributed Worked
[Blurb]
- Tad Hogg, Kristina Lerman, "Stochastic Models of User-Contributory Web Sites", arxiv:0904.0016
- H. Kargupta, B. Park, D. Hershberger and E. Johnson,
"Collective Data Mining: A New Perspective Toward Distributed Data
Mining", in Kargupta and Chan, eds., Advances in Distributed and
Parallel Knowledge Discovery [online]
- James Kennedy, Russell C. Eberhart and Yuhui Shi, Swarm
Intelligence
- Norbert L. Kerr, Robert J. MacCoun and Geoffrey P. Kramer, "Bias in
judgment: Comparing individuals and groups", Psychological
Review 103 (1996): 687--719
[Very large PDF
reprint]
- Norbert L. Kerr and R. Scott Tindale, "Group Performance and
Decision Making", Annual
Review of Psychology 55 (2004): 623--655
- Helene E. Landemore, "Democratic Reason: The Mechanisms of Collective Intelligence in Politics", ssrn/1845709 [Forthcoming in Landemore and Elster, eds., Collective Wisdom: Principles and Mechanisms]
- Sungmin Lee, Verónica C. Ramenzoni, Petter Holme, "Emergence of collective memories", arxiv:1008.2489
- Christian List, "Group knowledge and group rationality: a judgment aggregation perspective", Episteme: Journal of Social Epistemology 2 (2005): 25--38
- Christian List and Robert E. Goodin, "Epistemic Democracy: Generalizing the Condorcet jury theorem", Journal of Political Philosophy 9 (2001): 277--306
- P. D. Magnus, "Distributed Cognition and the Task of Science",
Social Studies of
Science 37 (2007): 297-310
- Naoki Masuda, N. Gilbert and S. Redner, "Heterogeneous voter
models", Physical Review E 82 (2010): 010103, arxiv:1003.0768
- Naoki Masuda and S. Redner, "Can Partisan Voting Lead to Truth?",
arxiv:1012.2462 [I heard Redner
give an excellent talk about this in the fall of 2010 at SAMSI, but I would
like to read the details]
- Cathleen McGrath and David Krackhardt, "Network Conditions for
Organizational Change", The Journal of Applied Behavioral
Science 39 (2003): 324--336
[PDF
reprint]
- Christopher McMahon, Collective Rationality and Collective
Reasoning [Review
in Notre Dame Philosophical Reviews, which I should also read
carefully]
- Peter B. Meyer, "Episodes of Collective Invention"
[Working Paper 368, Bureau
of Labor Statistics, 2003]
- Piotr Migdal, Michal Denkiewicz, Joanna Raczaszek-Leonardi, Dariusz Plewczynski, "Information-sharing and aggregation models for interacting minds", arxiv:1109.2044
- Mehdi Moussaid, Simon Garnier, Guy Theraulaz, Dirk Helbing, "Collective Information Processing and Pattern Formation in swarms, flocks and crowds",
Topics in Cognitive Science 1 (2009): 469--497, arxiv:1005.3507
- Lisa M. Osbeck, Nancy J. Nersessian, Kareen R. Malone, and Wendy C. Newstetter, Science as Psychology: Sense-Making and Identity in
Science Practice [Blurb;
review by Ronald Giere makes it sound more useful as source
material than for insights]
- L. Nunes and E. Oliveira, "On Learning by Exchanging
Advice", cs.LG/0203010
- Pettit, The Common Mind
- Gabriella Pigozzi, "Belief Merging and the Discursive Dilemma: An
Argument-Based Account to Paradoxes of Judgment
Aggregation", phil-sci/2882
- Stephen C. Pratt and David J. T. Sumpter, "A tunable algorithm for
collective decision-making", Proceedings of the
National Academy of Sciences (USA) 103 (2006):
15906--15910
- Joel B. Predd, Sanjeev R. Kulkarni and H. Vincent Poor,
"Distributed Regression in Sensor Networks: Training Distributively with
Alternating Projections", cs.LG/0507039
- Yaron Rachlin, Rohit Negi and Pradeep Khosla, "Sensing Capacity for
Markov Random Fields", cs.IT/0508054
- Roy Radner, "The Evaluation of Information in Organizations",
Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, Vol. 1, 491--530
- Vitorino Ramos, Carlos Fernandes and Agostinho C. Rosa, "Social
Cognitive Maps, Swarm Collective Perception and Distributed Search on Dynamic
Landscapes", submitted to Brains, Minds and Media: Journal of New Media
in Neural and Cognitive Science [PDF
preprint]
- Vitorino Ramos and Ajith Abraham, "Evolving a Stigmeric
Self-Organized Data-Mining", cs.AI/0403001
- Vikas C. Raykar, Shipeng Yu, Linda H. Zhao, Gerardo Hermosillo Valadez, Charles Florin, Luca Bogoni, Linda Moy, "Learning From Crowds",
Journal of Machine Learning Research 11
(2010): 1297--1322
- Michel Regenwetter, Bernard Grofman, A. A. J. Marley and Ilia
Tsetlin, Behavioral Social Choice: Probabilistic Models, Statistical
Inference, and Applications
[Blurb]
- P. Resnick and H. R. Varian, "Recommender Systems", Comm.
ACM 40 (1997): 56--58
- Diana Richards, Whitman A. Richards and Brendan D. McKay,
"Collective Choice and Mutual Knowledge Structures," SFI Working
Paper 98-04-032
- Henry S. Richardson, Democratic Automony: Public Reasoning
about the Ends of Policy
- Marko A. Rodriguez, "Social Decision Making with Multi-Relational
Networks and Grammar-Based Particle
Swarms", cs.CY/0609034
- Marko A. Rodriguez and Jennifer H. Watkins, "Revisiting the
Age of Enlightenment from a Collective Decision Making Systems Perspective",
arxiv:0901.3929
= First Monday 14 (2009)
- Barbara Rogoff, Apprenticeship in Thinking: Cognitive
Development in Social Context
- Camille Roth, "Co-evolution in Epistemic Networks: Reconstructing
Social Complex Systems", Structure
and Dynamics: eJournal of Anthropological and Related
Sciences 1 (2006): 3:2
- Eduardo Salas and Stephen M. Fiore (eds.), Team Cognition:
Understanding the Factors That Drive Process and Performance
- Husain Sarkar, Group Rationality in Scientific Research
[blurb]
- R. Keith Sawyer, Group Genius: The Creative Power of
Collaboration
[book's
website]
- Robert E. schapire and Yoav Freund, Boosting: Foundations
and Algorithms [Blurb]
- Frank Schweitzer, Joerg Zimmermann and Heinz Muehlenbein,
"Coordination of Decisions in a Spatial Agent Model",
cond-mat/0109121
- S. M. D. Seaver, A. A. Moreira, M. Sales-Pardo, R. D. Malmgren, D. Diermeier, L. A. N. Amaral, "Micro-bias and macro-performance", European
Physical Journal B 67 (2009): 369--375, arxiv:0908.4261
- U. Shardanand and P. Maes, "Social information filtering:
Algorithms for automating 'word of mouth' ", in Proceedings of ACM
Conference on Human Factors and Computing Systems (1995), pp. 210--217
- Gerry Stahl, Group Cognition: Computer Support for Building
Collaborative Knolwedge
[Blurb]
- Kent W. Staley, Evidence for the Top Quark: Objectivity and
Bias in Collaborative Experimentation
- Dan Steinbock, Craig Kaplan, Marko Rodrigues, Juana Diaz, Newton
Der and Suzanne Garcia, "Collective Intelligence Quantified for
Computer-Mediated Group Problem Solving", cs.CY/0412064
- Quentin F. Stout, "Using Clerks in Parallel Processing",
pp. 272--279 in Proceedings of the 23rd IEEE Symposium on Foundations of
Computer Science (1982) [Abstract: "Some models of parallel
computers consist of copies of a single finite state automaton connected
together in a regular fashion. In such computers a self-organizing structure
called clerks can be useful, enabling one to simulate a more powerful
computer for which optimal algorithms are easier to design. The computation
proceeds by having the cellular automata organize themselves into clerks, and
then a stepwise simulation of the more powerful computer is performed. For a
system of n automata, each clerk contains \Theta(log n) automata, so first they
need to determine log(n), despite the fact that no single automata can count
higher than a fixed
number." Link]
- Torsten Strulik and Helmut Willke (eds.), Towards a Cognitive
Mode in Global Finance?: The Governance of a Knowledge-Based Financial
System
[Blurb]
- David J. T. Sumpter, Collective Animal Behavior
[Blurb]
- Ron Sun (ed.)
- Mikaela Sundberg, "The dynamics of coordinated comparisons: How simulationists in astrophysics, oceanography and meteorology create standards for results", Social Studies of Science 41
(2011): 107--125
- Cass R. Sunstein
- "The Law of Group Polarization" [online]
- Why Societies Need Dissent
- Infotopia: How Many Minds Produce Knowledge
- Tarja Susi and Tom Ziemke, "Social Cognition, Artefacts, and
Stigmergy: A Comparative Analysis of Theoretical Frameworks for the
Understanding of Artefact-mediated Collaborative Activity",
Cognitive Systems Research 2 (2001): 273--290 [Online]
- J. A. K. Suykens, J. Vandewalle and B. De Moor, "Intelligence
and Cooperative Search by Coupled Local Minimizers", cs.AI/0210030
- Robert Thompson, Frans N. Stokman and Rene Torenvlied (eds.),
Models of Collective Decision-Making [Special issue (vol. 15,
no. 1, 2003) of Rationality and Society]
- Paul Vogy and Evert Haasdijk, "Modeling Social Learning of Language and Skills", Artificial
Life 16 (2010): 289--309
- Frank E. Walter, Stefano Battiston, Frank Schweitzer,
"A Model of a Trust-based Recommendation System on a Social Network",
nlin/0611054
- Peter Welinder, Steve Branson, Serge Belongie and Pietro Perona,
"The Multidimensional Wisdom of Crowds", NIPS 2011 (NIPS 23) [PDF reprint]
- A. L. Wilkes, Knowledge in Minds: Individual and Collective
Processes in Cognition
- Anita William Woolley, Christopher F. Chabris, Alex Pentland, Nada
Nashmi and Thomas W. Malone, "Evidence for a Collective Intelligence Factor in
the Performance of Human
Groups", Science 330
(2010): 686--688 [Unfortunately, the sort of factor analysis they are
relying on to detect a single driving cause is incapable of distinguishing
between that, and the presence of immense numbers of independent causes,
haphazardly related to the tasks. (See.) In
other words, if your measurement procedures are sufficiently unrelated to the
real structure, it looks like you have a common factor. Perhaps they have some
way of ruling this out here.]
- Jesus P. Zamora Bonilla, "Optimal Judgment Aggregation", phil-sci/2945
- Eviatar Zerubavel, Social Mindscapes: An Invitation to
Cognitive Sociology
- Kevin
Zollman
- "Talking to Neighbors: The Evolution of Regional
Meanings", Philosophy of Science 72 (2005):
69--85[PDF
reprint]
- "The Communication Structure of Epistemic Communities" [An
extremely interesting presentation at PSA
2006; not yet published]
To write:
- CRS, The Social Life of the Mind
#
Wed, 18 Jan 2012
Decision Theory
By which I mean the various mathematical theories of optimal decison-making;
a division of both statistics
and economics. This is a fairly distinct topic
from actual human decision-making, since people do
not seem to conform very well to any of the theoretical ideals. This sometimes
leads to much wailing and gnashing of teeth over our irrationality; if
anything, however, it leads me to doubt that these theories are good
formalizations of rationality. Nonetheless, they're mathematically
interesting, and they do have certain very nice properties in the situations
where you can actually get them to work.
See also:
Sequential Decision Making Under Stochastic Uncertainty
Recommended, big picture:
- David Blackwell and M. A. Girshick, Theory of Games and
Statistical Decisions
- Herbert Gintis, Game Theory Evolving
- Luce and Raiffa, Games and Decisions
Recommended, close-ups:
- F. Bacchus, H. E. Kyburg and M. Thalos, "Against
Conditionalization," Synthese 85 (1990): 475--506
[Why "Dutch book" arguments do not, in fact, mean that rational agents must be
Bayesian
reasoners. PDF
preprint]
- Truman F. Bewley, "Knightian decision theory. Part I",
Decisions in Economics and Finance 25 (2002): 79--110 [Thanks to Maxim
Raginsky for the pointer]
- Ken Binmore, Making Decisions in Large Worlds ["This
paper argues that we need to look beyond Bayesian decision theory for an answer
to the general problem of making rational decisions under
uncertainty." PDF
manuscript; thanks to Nicolas Della Penna for the pointer]
- James Crotty, "Are Keynesian Uncertainty and Macrotheory
Compatible? Conventional Decision Making, Instititional
Structures and Conditional Stability in Keyneian Macromodels", pp. 105--142
in G. Dymski and R. Pollin (eds.), New Perspectives in
Monetary Macroeconomics: Explorations in the Tradition of Hyman Minsky
[PDF preprint. Repeats Davidson's errors on ergodicity (see below), but has many suggestive remarks about the uses of conventions
and institutions to reduce uncertainty.]
- Paul Davidson, "Is Probability Theory Relevant for Uncertainty? A Post Keynesian Perspective",
The Journal of Economic Perspectives 5 (1991):
129--143 [JSTOR. An extremely
interesting discussion of the distinctions between "objective probability",
i.e. an actual stochastic process,
"subjective probability" (in a degrees-of-belief sense), and genuine
uncertainty, when one doesn't have a clue, and the implications of the latter
for economics, especially macroeconomics.
However, he does make some annoying mistakes
about ergodic theory (especially on and
around p. 132, especially fn. 3, which asserts "Nonstationarity is a
sufficient, but not a necessary condition, for nonergodicity."). In
particular: (i) non-stationary processes can certainly be ergodic, e.g.,
asymptotically mean stationary ones are (see ch. 23 on the almost-sure ergodic
theorem
in Almost None
of the Theory of Stochastic Processes); (ii) non-stationarity is a
necessary condition for (practical) non-ergodicity, as all stationary processes
are mixtures of ergodic ones (ibid.); (iii) non-stationary,
non-ergodic processes can perfectly well be extrapolated
statistically if the form of the non-stationarity is known, as in the
case (to give a trivial example) of a random walk.
(Much, much more about this.) I
find this sort of mistake extra annoying because has arguments could still work
if he fixed this!]
- Tilmann Gneiting, "Making and Evaluating Point Forecasts",
Journal of the American Statistical Association 106 (2011): 746--762, arxiv:0912.0902
- Daniel M. Hausman, "Mistakes about Preferences in the Social Sciences", Philosophy of the Social Sciences
41 (2011): 3--25
- Charles Manski
- Mark E. J. Newman, Michelle Girvan, and J. Doyne Farmer, "Optimal
design, robustness, and risk aversion," cond-mat/0202330
- Spyros Skouras, "Decisionmetrics: Towards a Decision-Based
Approach to Econometrics," SFI
Working Paper 2001-11-064 [Applies far outside econometrics. If what you
really want to do is to minimize a known loss function, optimizing a
conventional accuracy measure, e.g. least squares, can be highly
counterproductive.]
- Amartya Sen, "Internal Consistency of Choice",
Econometrica 61 (1993): 495--521
[JSTOR]
- John Sutton, "Flexibility, Profitability and Survival in an
(Objective) Model of Knightian Uncertainty"
[PDF
preprint. Decision-making when the crucial variable is the indicator
function of an unmeasurable set, i.e., one which doesn't actually have
a probability.]
To read:
- Peter Bernstein, Against the Gods: The Remarkable Story of
Risk
- Ken Bimore, Rational Decisions
[Blurb]
- N. N. Chentsov, Statistical Decision Rules and Optimal
Inference
- H. Chernoff and Moses, Elementary Decision Theory
- Peter D. Grunwald and A. Philip Dawid, "Game theory, maximum
entropy, minimum discrepancy and robust Bayesian decision theory", Annals
of Statistics 32 (2004): 1367--1433, math.ST/0410076
- Claire Hill, "The Rationality of Preference Construction (and the Irrationality of Rational Choice)", ssrn/1288652
- Isaac Levi, "Money Pumps and Diachronic Books", Philosophy
of Science 69 (2002): S235--S247
- Jean-Pierre Ponssard, "On the Concept of the Value of
Information in Competitive Situations", Management Science
22 (1976): 739--747 [JSTOR]
- Joel Predd, Robert Seiringer, Elliott H. Lieb, Daniel Osherson, Vincent Poor, Sanjeev Kulkarni, "Probabilistic coherence and proper scoring rules",
IEEE Transactions on Information Theory 55 (2009): 4786, arxiv:0710.3183
- Tim Rakow, "Risk, uncertainty and prophet: The psychological
insights of Frank H. Knight", Judgment and Decision
Making 5 (2010): 458--466
- Rustem and Howe, Algorithms for Worst-Case Design and
Applications to Risk Management
- Kristin S. Shrader-Frechette, Risk and Rationality:
Philosophical Foundations for Populist Reforms [On the philosophy of
risk assessment; blurb;
full text]
- Cass R. Sunstein, Worst-Case Scenarios [Blurb]
- Paul Weirich, Equilibrium and Rationality: Game Theory
Revised by Decision Rules
[blurb]
- Richard Wilson and Edmund A. C. Crouch, Risk/Benefit
Analysis [Blurb]
#
Sun, 08 Jan 2012
Power Law Distributions, 1/f Noise, Long-Memory Time Series
Why do physicists care about power laws so much?
I'm probably not the best person to speak on behalf of our tribal obsessions
(there was a long debate among the faculty at my thesis defense as to whether
"this stuff is really physics"), but I'll do my best. There are two parts
to this: power-law decay of correlations, and power-law size distributions.
The link is tenuous, at best, but they tend to get run together in our heads,
so I'll treat them both here.
The reason we care about power law correlations is that we're conditioned to
think they're a sign of something interesting and complicated happening. The
first step is to convince ourselves that in boring situations, we don't see
power laws. This is fairly easy: there are pretty good and rather generic
arguments which say that systems in thermodynamic equilibrium, i.e. boring
ones, should have correlations which decay exponentially over space and time;
the reciprocals of the decay rates are the correlation length and the
correlation time, and say how big a typical fluctuation should be. This is
roughly first-semester graduate statistical mechanics. (You can find those
arguments in, say, volume one of Landau and Lifshitz's Statistical
Physics.)
Second semester graduate stat. mech. is where those arguments break down ---
either for systems which are far from equilibrium
(e.g., turbulent flows), or in equilibrium but
very close to a critical point (e.g., the transition from a solid to liquid
phase, or from a non-magnetic phase to a magnetized
one). Phase transitions have fluctuations
which decay like power laws, and many non-equilibrium systems do too. (Again,
for phase transitions, Landau and Lifshitz has a good discussion.) If you're a
statistical physicist, phase transitions
and non-equilibrium processes define the
terms "complex" and "interesting" --- especially phase transitions, since we've
spent the last forty years or so developing a very successful theory of
critical phenomena. Accordingly, whenever we see power law correlations, we
assume there must be something complex and interesting going on to produce
them. (If this sounds like the fallacy of affirming the consequent, that's
because it is.) By a kind of transitivity, this makes power laws interesting
in themselves.
Since, as physicists, we're generally more comfortable working in the
frequency domain than the time domain, we often transform the autocorrelation
function into the Fourier spectrum. A power-law decay for the correlations as
a function of time translates into a power-law decay of the spectrum as a
function of frequency, so this is also called "1/f noise".
Similarly for power-law distributions. A simple use of the Einstein
fluctuation formula says that thermodynamic variables will have Gaussian
distributions with the equilibrium value as their mean. (The usual version of
this argument is not very precise.) We're also used to seeing
exponential distributions, as the probabilities of microscopic states. Other
distributions weird us out. Power-law distributions weird us out even more,
because they seem to say there's no typical scale or size for the variable,
whereas the exponential and the Gaussian cases both have natural scale
parameters. There is a connection here with fractals, which also lack typical
scales, but I don't feel up to going into that, and certainly a lot of the
power laws physicists get excited about have no obvious connection to any kind
of (approximate) fractal geometry. And there are lots of power law
distributions in all kinds of data, especially social data --- that's why
they're also called Pareto distributions, after the sociologist.
Physicists have devoted quite a bit of time over the last two decades to
seizing on what look like power-laws in various non-physical sets of data, and
trying to explain them in terms we're familiar with, especially phase
transitions. (Thus "self-organized criticality".) So
badly are we infatuated that there is now a huge, rapidly growing literature
devoted to "Tsallis statistics" or "non-extensive
thermodynamics", which is a recipe for modifying normal statistical
mechanics so that it produces power law distributions; and this, so far as I
can see, is its only good feature. (I will not attempt, here, to
support that sweeping negative verdict on the work of many people who have more
credentials and experience than I do.) This has not been one of our more
successful undertakings, though the basic motivation --- "let's see what we can
do!" --- is one I'm certainly in sympathy with.
There have been two problems with the efforts to explain all power laws
using the things statistical physicists know. One is that (to mangle Kipling)
there turn out to be nine and sixty ways of constructing power laws,
and every single one of them is right, in that it does indeed produce
a power law. Power laws turn out to result from a kind of central limit
theorem for multiplicative growth processes, an observation which apparently
dates back to Herbert Simon, and which has been rediscovered by a number of
physicists (for instance, Sornette). Reed and Hughes have established an even
more deflating explanation (see below). Now, just because these simple
mechanisms exist, doesn't mean they explain any particular case, but
it does mean that you can't legitimately argue "My favorite mechanism
produces a power law; there is a power law here; it is very unlikely there
would be a power law if my mechanism were not at work; therefore, it is
reasonable to believe my mechanism is at work here." (Deborah Mayo would say
that finding a power law does not constitute a severe test of your hypothesis.) You need to do
"differential diagnosis", by identifying other, non-power-law
consequences of your mechanism, which other possible explanations don't share.
This, we hardly ever do.
Similarly for 1/f noise. Many different kinds of stochastic process, with
no connection to critical phenomena, have power-law correlations.
Econometricians and time-series analysts have
studied them for quite a while, under the general heading of "long-memory"
processes. You can get them from things as simple as a superposition of
Gaussian autoregressive processes. (We have begun to awaken to this fact,
under the heading of "fractional Brownian motion".)
The other problem with our efforts has been that a lot of the power-laws
we've been trying to explain are not, in fact, power-laws. I should perhaps
explain that statistical physicists are called that, not because we know a lot
of statistics, but because we study the large-scaled, aggregated effects of the
interactions of large numbers of particles, including, specifically, the
effects which show up as fluctuations and noise. In doing this we learn,
basically, nothing about drawing inferences from
empirical data, beyond what we may remember about curve fitting and
propagation of errors from our undergraduate lab courses. Some of us,
naturally, do know a lot of statistics, and
even teach it --- I might
mention Josef Honerkamp's superb Stochastic Dynamical Systems.
(Of course, that book is out of print and hardly ever cited...)
If I had, oh, let's say fifty dollars for every time I've seen a slide (or a
preprint) where one of us physicists makes a log-log plot of their data, and
then reports as the exponent of a new power law the slope they got from doing a
least-squares linear fit, I'd at least not grumble. If my colleagues had gone
to statistics textbooks and looked up how to estimate the parameters of a
Pareto distribution, I'd be a happier man. If any of them had actually tested
the hypothesis that they had a power law against alternatives like stretched
exponentials, or especially log-normals, I'd think
the millennium was at hand. (If you want to
know how to do these things, please
read this paper, whose merits are
entirely due to my co-authors.) The situation for 1/f noise is not so dire,
but there have been and still are plenty of abuses, starting with the fact that
simply taking the fast Fourier transform of the autocovariance function
does not give you a reliable estimate of the power
spectrum, particularly in the tails. (On that point, see, for
instance, Honerkamp.)
See also:
Chaos and Dynamical Systems;
Complex Networks;
Self-Organized Criticality;
Time Series;
Tsallis Statistics
Recommended, bigger picture:
- Michael Mitzenmacher, "A Brief History of Generative Models for
Power Law and Lognormal Distributions", Internet Mathematics
1 (2003): 226--251
[PDF]
- M. E. J. Newman, "Power laws, Pareto distributions and Zipf's
law", cond-mat/0412004 [If
you read one other thing on power laws, read this]
- Manfred Schroeder, Fractals, Chaos, Power Laws: Minutes from
an Infinite Paradise
Recommended, more technical or more specialized:
- Robert J. Adler, Raise E. Feldman and Murad S. Taqqu
(eds.), A Practical Guide to Heavy Tails [Presumes that you
already know something about statistics and stochastic processes, so not
suitable for beginners.]
- Barry C. Arnold, Pareto Distributions [Fine guide
to the statistical literature, as it was in 1983; still valuable, though
many things which were nasty computations then are easy now.]
- Arijit Chakrabarty, "Effect of truncation on large deviations for heavy-tailed random vectors", arxiv:1107.2476
- Aaron Clauset, Maxwell Young, and Kristian Skrede Gleditsch, "Scale
Invariance in the Severity of
Terrorism", physics/0606007
[Surprising, but well-supported]
- F. Clementi, T. Di Matteo, M. Gallegati, "The Power-law Tail
Exponent of Income Distributions", physics/0603061
= Physica
A 370 (2006): 49--53 [An interesting way to improve
the accuracy of Hill-type (tail-conditional maximum likelihood) estimates of
the scaling parameter. Written with few concessions to those who are neither
statisticians nor econometricians. Not directly suitable for determining
the range of the scaling region. Income distribution is used only as
an example.]
- Andrew M. Edwards, Richard A. Phillips, Nicholas W. Watkins, Mervyn
P. Freeman, Eugene J. Murphy, Vsevolod Afanasyev, Sergey V. Buldyrev,
M. G. E. da Luz, E. P. Raposo, H. Eugene Stanley and Gandhimohan
M. Viswanathan, "Revisiting Lévy flight search patterns of wandering
albatrosses, bumblebees and
deer", Nature
449 (2007): 1044--1048
- Paul Embrechts and Makoto Maejima, Selfsimilar
Processes
- Michel L. Goldstein, Steven A. Morris and Gary G. Yen, "Fitting to
the Power-Law Distribution", cond-mat/0402322 [Pedestrian,
but accurate, exposition in terms physicists and engineers are likely to
understand. Insufficiently sourced to the statistical literature; e.g., their
calculation of the maximum likelihood estimator was first published in 1952.]
- Josef Honerkamp, Stochastic Dynamical Systems: Concepts,
Numerical Methods, Data Analysis
- Yuji Ijiri and Herbert Simon, Skew Distributions and the
Sizes of Business Firms [Collects Simon and co.'s pioneering papers on
power laws and related distributions --- including "On a Class of Skew
Distribution Functions", below --- as well as considering the limitations,
alternatives, modifications to match data, statistical issues, the connection
to Bose-Einstein statistics, the importance of going beyond just staring at
distributional plots if you want to learn about mechanisms, etc., etc. This
was all published in 1977...]
- A. James and M. J. Plank, "On fitting power laws to ecological
data", arxiv:0712.0613
- Raya Khanin and Ernst Wit, "How Scale-Free Are Biological
Networks?",
Journal of Computational Biology
13 (2006): 810--818 [Ans.: not very scale-free at all.]
- Joel Keizer, Statistical Thermodynamics of Nonequilibrium
Processes [Has a good discussion of critical fluctuations in chapter
8. Review: Molecular Fluctuations for Fun and
Profit]
- Paul Krugman, The Self-Organizing Economy [Has a nice
discussion of power-law size distributions in economics. Review]
- Michael LaBarbera, "Analyzing Body Size as a Factor in Ecology and
Evolution", Annual Review of Ecology and
Systematics 20 (1989): 91--117 [Statistical problems in
many studies of power-law scaling in biology, their effects on the conclusions
of those studies (ranging from "wrong, but correctable" to "meaningless"), and
how to do it right. JSTOR]
- J. Laherrère and D. Sornette, "Stretched exponential
distributions in nature and economy: 'fat tails' with characteristic scales",
The European Physical Journal B 2 (1998):
525--539
- L. D. Landau and E. M. Lifshitz, Statistical Physics
[For the theory of fluctuations in statistical mechanics, and for critical
phenomena in equilibrium]
- Adrián López García de Lomana, Qasim K. Beg,
G. de Fabritiis and Jordi Villà-Freixa, "Statistical Analysis of Global Connectivity and Activity Distributions in Cellular Networks", arxiv:1004.3138
- R. Dean Malmgren, Daniel B. Stouffer, Adilson E. Motter, Luis A.N. Amaral, "A Poissonian explanation for heavy-tails in e-mail communication",
Proceedings of the
National Academy of Sciences (USA) 105 (2008): 18153--18158, arxiv:0901.0585
- Elliott W. Montroll and Michael F. Shlesinger, "On 1/f noise and
other distributions with long
tails", Proceedings
of the National Academy of Sciences (USA) 79 (1982):
3380--3383
- V. F. Pisarenko and D. Sornette, "New statistic for financial
return distributions: power-law or exponential?", physics/0403075 [Actually, two
new statistics: one converges to a constant if the distribution you're sampling
from is an exponential, independent of the exponent, and the other converges to
a constant if the distribution is a power law, independent of the power. They
even have some indications of the sampling distributions, so you can at least
gauge the statistical signifcance, i.e., the probability of deviations from the
ideal value, even though the distribution really is of the appropriate type. I
don't recall anything about the power of these statistics, however (i.e., the
probability that a power law will look like an exponential, or vice-versa).]
- William J. Reed and Barry D. Hughes, "From Gene Families and Genera
to Incomes and Internet File Sizes: Why Power Laws are so Common in
Nature", Physical
Review E 66 (2002): 067103 [This is, as I said,
perhaps the most deflating possible explanation for power law size
distributions. Imagine you have some set of piles, each of which grows,
multiplicatively, at a constant rate. New piles are started at random times,
with a constant probability per unit time. (This is a good model of my
office.) Then, at any time, the age of the piles is exponentially distributed,
and their size is an exponential function of their age; the two exponentials
cancel and give you a power-law size distribution. The basic combination of
exponential growth and random observation times turns out to work even if it's
only the mean size of piles which grows exponentially.]
- M. V. Simkin and V. P. Roychowdhury, "Re-inventing Willis",
physics/0601192 [The comical,
yet pathetic, history of the innumerable re-inventions of basic mechanisms
which plague this area]
- Herbert Simon, "On a Class of Skew Distribution Functions",
Biometrika 42 (1955): 425--440 [JSTOR]
- Didier Sornette
- "Multiplicative Processes and Power Laws" cond-mat/9708231
= Physical Review E 57 (1998): 4811--4813
- "Mechanism for Powerlaws without Self-Organization"
cond-mat/0110426
- Stilian A. Stoev, George Michailidis, and Murad S. Taqqu,
"Estimating heavy-tail exponents through max self-similarity", math.ST/0609163
- Bruce J. West and Bill Deering, The Lure of Modern Science:
Fractal Thinking [Despite the painful title, this is actually a
very good book. I disagree with some of the more philosophical positions they
take, but on the actual science and math they're quite sound.]
- Wei Biao Wu, Yinxiao Huang and Wei Zheng, "Covariances estimation
for long-memory
processes", Advances
in Applied Probability 42 (2010): 137--157 [How big
are the errors in your covariance estimates?]
- Damian H. Zanette, "Zipf's law and the creation of musical
context", cs.CL/0406015 [This
sounds bizarre, and I'd not have bothered to even note it if I didn't
know Zanette's work in other areas, which shows him to be a good and careful
scientist. And this is actually an interesting and meaningful little paper,
which has something non-trivial to say about music. It's worth noting,
perhaps, that the distribution he actually ends up fitting isn't a pure power
law, but a modification inspired by Simon's paper. Thanks to John Burke for
prodding me to actually read it.]
Not altogether recommended (without being actively dis-recommended either):
- R. Alexander Bentley, Paul Ormerod, Michael Batty, "An evolutionary
model of long tailed distributions in the social
sciences", arxiv:0903.2533 [This
is a minor modification of the classical Yule/Simon mechanism for random
growth, with the main advantage being that (with the right parameter tweaking)
it allows for more turn-over of which values are most common. Unsurprisingly,
this is done by adding extra parameters, and so the family of distributions is
more flexible. But they use bad statistical procedures, and the finding that
the estimated power law exponent grows as the amount of data held in the tail
shrinks is simply explained: the tails aren't power laws.]
Recommended, of a not entirely serious character:
- Mason Porter's Power Law Shop
Modesty forbids me to recommend:
- Aaron Clauset, CRS and M. E. J. Newman, "Power-law distributions in
empirical data", SIAM Review 51 (2009): 661--703
= arxiv:0706.1062 [with commentary
by Aaron
and myself]
To read:
- Eduardo G. Altmann and Holger Kantz, "Recurrence time analysis,
long-term correlations, and extreme events", physics/0503056
- J. A. D. Aston, "Modeling macroeconomic time series via heavy
tailed distributions", math.ST/0702844
- Katarzyna Bartkiewicz, Adam Jakubowski, Thomas Mikosch, Olivier Wintenberger, "Infinite variance stable limits for sums of dependent random variables", arxiv:0906.2717
- Michael Batty, "Rank Clocks", Nature
444 (2006): 592--596
- Marco Bee, Massimo Riccaboni and Stefano Schiavo, "Pareto versus lognormal: A maximum entropy test", Physical
Review E 84 (2011); 026104
- P. Besbeas and B. J. T. Morgan, "Improved estimation of the stable
laws", Statistics
and Computing 18 (2008): 219--231
- Danny Bickson, Carlos Guestrin, "Linear Characteristic Graphical Models: Representation, Inference and Applications", arxiv:1008.5325 [Graphical models with heavy-tailed latent variables]
- Thierry Bochud and Damien Challet, "Optimal approximations of
power-laws with exponentials", physics/0605149 ["We propose an
explicit recursive method to approximate a power-law with a finite sum of
weighted exponentials. Applications to moving averages with long memory are
discussed in relationship with stochastic volatility models." The last part
sounds like a rediscovery of Granger.]
- Laurent E. Calvet and
Adlai J. Fisher, Multifractal Volatility: Theory, Forecasting, and
Pricing [Thanks to Prof. Calvet for bringing this to my attention]
- Anna Carbone and Giuliano Castelli, "Scaling Properties of
Long-Range Correlated Noisy Signals,"
cond-mat/0303465
- C. Cattuto, V. Loreto and V. D. P. Servedio, "A Yule-Simon process
with memory", cond-mat/0608672 [Memo
to self: compare this to the auto-correlated Yule-Simon process in
Ijiri and Simon's book.]
- Arijit Chakrabarty, "Central Limit Theorem and Large Deviations for truncated heavy-tailed random vectors", arxiv:1003.2159
- Arijit Chakrabarty, Gennady Samorodnitsky, "Understanding heavy tails in a bounded world or, is a truncated heavy tail heavy or not?", arxiv:1001.3218
- Anirban Chakraborti, Marco Patriarca, "A Variational Principle for
Pareto's power
law", cond-mat/0605325
- Ali Chaouche and Jean-Noel Bacro, "Statistical Inference for
the Generalized Pareto Distribution: Maximum Likelihood Revisited", Communications in Statistics: Theory and Methods 35
(2006): 785--802
- F. Clementi, M. Gallegati, "Pareto's Law of Income Distribution:
Evidence for Germany, the United Kingdom, and the United
States", physics/0504217
- Cline, heavy-tailed noise, 1983 (?)
- B. Conrad and M. Mitzenmacher, "Power Laws for Monkeys Typing
Randomly: The Case of Unequal Probabilities", IEEE Transactions on
Information Theory 50 (2004): 1403--1414
- Bikramjit Das and Siddney I. Resnick, "QQ plots, Random sets and
data from a heavy tailed distribution", math.PR/0702551
- Anirban Dasgupta, John Hopcroft, Jon Kleinberg and
Mark Sandler, "On Learning Mixtures of Heavy-Tailed Distributions"
- T. Di Matteo, T. Aste and M. Gallegati, "Innovation flow through
social networks: Productivity distribution", physics/0406091 [Those look an
awful lot like log-normals to me.]
- Paul Doukhan, George Oppenheim and Murad S. Taqqu
(eds.), Theory and Applications of Long-Range Dependence
- Rick Durrett and Jason Schweinsberg, "Power laws for family sizes
in a duplication model", math.PR/0406216
= Annals of Probability 33 (2005): 2094--2126
- R. Fox and M. S. Taqqu
- "Noncentral Limit Thorems for Quadratic Forms in Random
Variables Having Long-Range Dependence," Annals of Probability
13 (1985) 428--446
- "Central Limit Theorems for Quadratic Forms in Random
Variables Having Long-Range Dependence," Probability Theory and Related
Fields 74 (1987): 213--240
- G. Frenkel, E. Katzav, M. Schwartz and N. Sochen, "Distribution of
Anomalous Exponents of Natural Images", Physical Review
Letters 97 (2006): 103902
- U. Frisch and D. Sornette, "Extreme Deviations and Applications",
J. Phys. I France 7 (1997): 1155--1171
- Akihiro Fujihara, Toshiya Ohtsuki and Hiroshi Yamamoto
- Akihiro Fujihara, Satoshi Tanimoto, Toshiya Ohtsuki, Hiroshi
Yamamoto, "Log-normal distribution in growing systems with weighted
multiplicative interactions", cond-mat/0511625
- Yoshi Fujiwara, Corrado Di Guilmi, Hideaki Aoyama, Mauro Gallegati,
Wataru Souma, "Do Pareto-Zipf and Gibrat laws hold true? An analysis with
European
Firms", cond-mat/0310061
- Xavier Gabaix, "Power Laws in Economics and Finance"
[PDF preprint]
- M. Ivette Gomes, M. Isabel Fraga Alves, Paulo Araujo Santos,
"PORT Hill and Moment Estimators for Heavy-Tailed Models",
Communications in Statistics:
Simulation and Computation 37
(2008): 1281--1306
- Alexander Gnedin, Ben Hansen, Jim Pitman, "Notes on the occupancy
problem with infinitely many boxes: general asymptotics and power
laws", math.PR/0701718
- J. A. Gubner, "Theorems and Fallacies in the Theory of
Long-Range-Dependent Processes", IEEE Transactions on
Information Theory 51 (2005): 1234--1239
- Alexandra Guerrero and Leonard A. Smith, "A maximum likelihood
estimator for long-range persistence", Physica
A 355 (2005): 619--632
- Rudolf Hanel and Stefan Thurner, "On the Derivation of power-law
distributions within standard statistical mechanics", cond-mat/0412016
- Bruce M. Hill and Michael Woodroofe, "Stronger Forms of Zipf's
Law", Journal of the American Statistical
Association 70 (1975): 212--219 [Deriving Zipf's law
from Bose-Einstein
statistics. JSTOR]
- Byoung Hee Hong, Kyoung Eun Lee, Jae Woo Lee, "Power Law in Firms
Bankruptcy", physics/0701302
- Y. Hosoya
- "The quasi-likelihood approach to statistical inference on
multiple time-series with long-range dependence," Journal of
Econometrics 73 (1996): 217--236
- "A limit theory for long-range dependence and statistical
inference on related models," Annals of Statistics
25 (1997): 105--137
- Henrik Hult and Gennady Samorodnitsky, "Large deviations for point processes based on stationary sequences with heavy tails", Journal of Applied Probability 47 (2010): 1--40
- Takashi Ichinomiya, "Power-law distribution in Japanese racetrack
betting", physics/0602165
- Milton Jara, Tomasz Komorowski and Stefano Olla,
"Limit theorems for additive functionals of a Markov chain", arxiv:0809.0177 [Convergence to alpha-stable
distributions]
- Predrag R. Jelenkovic, Jian Tan, "Modulated Branching Processes,
Origins of Power Laws and Queueing
Duality", 0709.4297
- Junghyo Jo, Jean-Yves Fortin, M. Y. Choi, "Weibull-type limiting distribution for replicative systems", Physical Review E 83 (2011): 031123, arxiv:1103.3038
- Taisei Kaizoji, "Power laws and market
crashes", physics/0603138
- Imen Kammoun, Vernoique Billat and Jean-Marc Bardet, "A new
stochastic process to model Heart Rate series during exhaustive run and an
estimator of its fractality
parameter", arxiv:0803.3675
[Includes statistical criticism of the common, but deeply unsatisfying,
"detrended fluctuation analysis" method of estimating the Hurst exponent.]
- B. Kaulakys and J. Ruseckas, "Stochastic nonlinear
differential equation generating 1/f noise", Physical Review E
70 (2004): 020101 = cond-mat.0408507
- K. Kiyani, S. C. Chapman and B. Hnat, "A method for extracting the
scaling exponents of a self-affine, non-Gaussian process from a finite length
timeseries", physics/0607238
- K. H. Kiyani, S. C. Chapman, N. W. Watkins, "Pseudo-nonstationarity in the scaling exponents of finite interval time series", Physical
Review E 79 (2009): 036109, arxiv:0808.2036
- Francois M. Longin, "The Asymptotic Distribution of Extreme Stock
Market Returns", The Journal of Business 69
(1996): 383--408
[JSTOR]
- Bruce D. Malamud, James D. A. Millington and George L. W. Perry,
"Characterizing wildfire regimes in the United States", Proceedings of the
National Academy of Sciences (USA) 102 (2005):
4694--4699
- Y. Malevergne, V.F. Pisarenko, D. Sornette, "Empirical
Distributions of Log-Returns: between the Stretched Exponential and the Power
Law?", physics/0305089
- Alon Manor and Nadav M. Shnerb, "Multiplicative Noise and Second Order Phase Transitions", Physical
Review Letters 103 (2009): 030601
- Natalia Markovich, Nonparametric Analysis of Univariate
Heavy-Tailed Data: Research and Practice
- Yosef E. Maruvka, David A. Kessler, Nadav M. Shnerb, "The
Birth-Death-Mutation process: a new paradigm for fat tailed
distributions", arxiv:1011.4110 [I
suspected from the abstract that this was Yet Another Rediscovery of the
Yule-Simon mechanism. However, after actually looking through the paper
(prompted by Dr. Shnerb), I see that they are in fact doing something more, and
that I was just wrong. I still need to read it properly, however, before
deciding what I think about the actual proposal.]
- Joseph L. McCauley, Gemunu H. Gunaratne, Kevin E. Bassler, "Hurst
Exponents, Markov Processes, and Fractional Brownian
motion", cond-mat/0609671
- Richard Metzler, "Comment on 'Power-law correlations in the
southern-oscillation-index fluctuations characterizing El
Nino'", Physical
Review E 67 (2003): 018201
- Salvatore Miccich`, "Modeling long-range memory with stationary Markovian processes", Physical
Review E 79 (2009): 031116, arxiv:arxiv:0806.0722
- Thomas Mikosch, Sidney Resnick, Holger Rootzén, and Alwin Stegeman. "Is Network Traffic Appriximated by Stable Lévy Motion or Fractional Brownian Motion?", Annals of Applied Probability 12 (2002): 23--68
- Edoardo Milotti, "Model-based fit procedure for power-law-like
spectra", physics/0510011
- Elliott W. Montroll and Michael Shlesinger, "Maximum entropy
formalism, fractals, scaling phenomena and 1/f noise: A tale of tails",
Journal of Statistical Physics 32 (1983):
209--230
- Eric Moulines, Francois Roueff, Murad S. Taqqu, "A Wavelet Whittle
estimator of the memory parameter of a non-stationary Gaussian time
series", math/0601070
- Newton J. Moura Jr. and Marcelo B. Ribeiro, "Zipf Law for Brazilian
Cities", physics/0511216
- J. F. Muzy, E. Bacry and A. Kozhemyak, "Extreme values and fat
tails of multifractal
fluctuations", Physical Review
E 73 (2006): 066114
= cond-mat/0509357
["problem of the estimation of extreme event occurrence probability for data
drawn from some multifractal process. We also study the heavy (power-law) tail
behavior of probability density function associated with such data. We show
that because of strong correlations, standard extreme value approach is not
valid and classical tail exponent estimators should be interpreted cautiously"]
- Richard Perline, "Strong, Weak and False Inverse Power Laws",
Statistical
Science 20 (2005): 68--88
- William Rea, Les Oxley, Marco Reale and Jennifer Brown,
"Estimators for Long Range Dependence: An Empirical Study", arxiv:0901.0762 [submitted to EJS]
- Sidney I. Resnick, Heavy-Tail Phenomena: Probabilistic and
Statistical Modeling
[Blurb]
- Sidney Resnick and Catalin Starica, "Tail Index Estimation for
Dependent Data", The Annals of Applied
Probability 8 (1998): 1156--1183
- Massimo Riccaboni, Fabio Pammolli, Sergey V. Buldyrev, Linda Ponta,
H. Eugene Stanley , "The Size Variance Relationship of Business Firm Growth
Rates", arxiv:0904.1404
= Proceedings of the National Academy of Sciences
(USA) 105 (2008): 19595--19600
- Alexander Roitershtein, "One-dimensional linear recursions with
Markov-dependent
coefficients", math/0409335
= Annals of
Applied Probability 17 (2007): 572--608 [To
summarize the abstract, suppose S(n) = A(n) + B(n)*S(n-1), where A(n) and B(n)
are Markov sequences. Then "the distribution tail of its stationary solution
has a power law decay." This sounds like Simon's argument made more general.]
- Holger Rootzen, M. Ross Leadbetter and Laurens de Haan, "On the
distribution of tail array sums for strongly mixing stationary
sequences", Annals of Applied Probability 8
(1998): 868--885
- Gennady Samorodnitsky and Murad S. Taqqu, Stable
Non-Gaussian Random Processes
- D. Sornette and V. F. Pisarenko, "Properties of a simple bilinear
stochastic model: estimation and predictability", physics/0703217
- Attilio L. Stella, Fulvio Baldovin, "Anomalous scaling due to correlations: Limit theorems and self-similar processes", arxiv:0909.0906
- Stilian A Stoev, George Michailidis, "On the Estimation of the Heavy-Tail Exponent in Time Series using the Max-Spectrum", arxiv:1005.4329
- Stilian A. Stoev and Murad S. Taqqu, "Limit Theorems for Sums of Heavy-tailed Variables with Random Dependent Weights",
Methodology and
Computing in Applied Probability 9 (2007): 55--87
- Sarah Touati, Mark Naylor, and Ian G. Main, "Origin and Nonuniversality of the Earthquake Interevent Time Distribution", Physical Review Letters 102 (2009): 168501
- Ciprian Tudor and Frederi Viens, "Variations and estimators for the selfsimilarity order through Malliavin calculus", arxiv:0709.3896
- Caglar Tuncay, "A universal model for languages and cities, and
their
lifetimes", physics/0703144
- Marta Tyran-Kaminska, "Convergence to Lévy stable processes under strong mixing conditions", arxiv:0907.1185
- Sergio Venturini, Francesca Dominici, Giovanni Parmigiani, "Gamma
shape mixtures for heavy-tailed distributions", Annals of Applied
Statistics 2 (2008): 756--776
= arxiv:0807.4663
- Rafal Weron
- "Estimating long range dependence: finite sample
properties and confidence intervals," cond-mat/0103510
- "Measuring long-range dependence in electricity prices,"
cond-mat/0103621
- T. S. T. Wong and W. K. Li, "A note on the estimation of extreme
value distributions using maximum product of spacings",
math.ST/0702830
- Wei Biao
Wu, Xiaofeng Shao, "Invariance principles for fractionally integrated
nonlinear
processes", math.PR/0608223
- Seokhoon Yun, "The Extremal Index of a Higher-Order
Stationary Markov Chain", The Annals of Applied Probability
8 (1998): 408--437
- Damian H. Zanette, "Zipf's law and city sizes: A short tutorial
review on multiplicative processes in urban
growth", arxiv:0704.3170
#
Branching Processes
A class of stochastic process
important as models in genetics and population biology, chemical kinetics, and
filtering. The basic idea is that there are a number of objects, often called
particles, which, in some random fashion, reproduce ("branch") and die out;
they can be of multiple types and occupy
differing spatial locations. They can
pursue their trajectories and their biographies either independently, or with
some kind of statistical dependence across particles.
The most basic version has one type of particle, and no spatial
considerations. At each time step, each parrticle gives rise to a random
number of offspring; the distribution of offspring is fixed, and the number is
independent across time-steps and across lineages (IID). This is the so-called
Galton-Watson branching process. Galton introduced it as a model of the
survival of (patrilneal) family names, so that only male offspring counted; he
required the distribution of time until a given lineage went extinct. This was
provided almost immediately by Watson, in a very elegant use of the method of
generating functions, which is, itself, reproduced in probability textbooks
down to the present day. (However, when I first encoutnered the problem, in a
probability class, the teacher presented it as one about the survival
of matrilineal lineages, defined by inheritance of mitochondrial DNA.
Whether this was conscious subversion of the patriarchy, or just a reflection
of the changing scientific interests between the 1890s and the 1990s, I
couldn't say.)
See also:
Epidemiology;
Social Contagion
Recommended (introductory):
- Geoffrey Grimmett and David Stirzaker, Probability and Random
Processes [This is my favorite probability textbook, and returns to
branching processes in many places.]
Recommended (forbiddingly technical):
- P. Del Moral and L. Miclo, "Branching and Interacting Particle
Systems Approximations of Feynman-Kac Formulae with Applications to Nonlinear
Filtering", in J. Azema, M. Emery, M. Ledoux and M. Yor
(eds)., Semainaire de Probabilites XXXIV (Springer-Verlag, 2000),
pp. 1--145 [Postscript
preprint. Looks like a trial run for Del Moral's book, below, which I've
yet to read.]
To read:
David Assaf, Larry Goldstein and Ester Samuel-Cahn, "An unexpected
connection between branching processes and optimal
stopping", math.PR/0510587
= Journal of Applied Probability 37 (2000):
613--6 [This sounds like a nice pedagogical topic for a course in stochastic
processes. I teach a course in stochastic processes....]
Michael Assaf and Baruch Meerson, "Spectral Theory of Metastability
and Extinction in Birth-Death
Systems", Physical
Review Letters 97 (2006): 200602
= cond-mat/0610415
Krishna B. Athreya, Branching Processes
K. B. Athreya, A.P. Ghosh, S. Sethuraman, "Growth of preferential
attachment random graphs via continuous-time branching
processes", math.PR/0701649
Ellen Baake, Hans-Otto Georgii, "Mutation, selection, and ancestry
in branching models: a variational
approach", q-bio.PE/0611018
Romulus Breban, Raffaele Vardavas and Sally Blower,
"Linking population-level models with growing networks: A class of epidemic models", Physical Review E 72 (2005): 046110
Nicolas Champagnat, Régis Ferrière, Sylvie
Méléar, "Individual-based probabilistic models of adaptive
evolution and various scaling
approximations", math.PR/0510453
Charles R. Doering, Khachik V. Sargsyan and Leonard M. Sander,
"Extinction times for birth-death processes: exact results, continuum
asymptotics, and the failure of the Fokker-Planck approximation", q-bio/0401016
Pierre Del Moral, Feynman-Kac Formulae: Genealogical and
Interacting Particle Systems [This looks really, really cool]
Janos Englander, "Branching diffusions, superdiffusions and random media", Probability Surveys 4 (2007): 303--364,
arxiv:0710.0236
Benjamin Golub and Matthew O. Jackson, "Using selection bias to explain the observed structure of Internet diffusions", Proceedings of the National Academy of Sciences (USA) 107 (2010): 10833--10836
Vicenc Gomez, Hilbert J. Kappen and Andreas Kaltenbrunner,
"Modeling the structure and evolution of discussion cascades", arxiv:1011.0673
P. Haccou et al., Branching Processes: Variation, Growth,
and Extinction of Populations
Jose Luis Iribarren and Esteban Moro, "Branching Dynamics of Viral
Information Spreading", <cite>Physical Review E 84 (2011): 046116
Predrag R. Jelenkovic, Jian Tan, "Modulated Branching Processes,
Origins of Power Laws and Queueing
Duality", 0709.4297
Junghyo Jo, Jean-Yves Fortin, M. Y. Choi, "Weibull-type limiting distribution for replicative systems", Physical Review E 83 (2011): 031123, arxiv:1103.3038
Jean-Francois Le Gall, Spatial Branching Processes,
Random Snakes and Partial Differential Equations
Brendan P. M. McCabe1, Gael M. Martin2, David Harris3, "Efficient probabilistic forecasts for counts", Journal
of the Royal Statistical Society B 73 (2011): 253--272
Sebastian Müller, "Strong recurrence for branching Markov
chains", arxiv:0710.4651
Victor M. Panaretos, "Partially observed branching processes for
stochastic
epidemics", Journal
of Mathematical Biology 54 (2007): 645--668
David Sankoff, "Branching Processes with Terminal Types:
Application to Context-Free Grammars", Journal of Applied
Probability 8 (1971): 233--240
[JSTOR]
D. Sornette and S. Utkin, "Limits of declustering methods for disentangling exogenous from endogenous events in time series with foreshocks, main shocks, and aftershocks", Physical Review E 79 (2009): 061110, arxiv:0903.3217
#
Complex Networks
Having written a whole pop-sci article about these things (see below), I won't explain them at all here. This notebook
is more of a placeholder than usual.
Stuff I should learn more about: structural complexity measures for graphs
and ensembles of random graphs; Gibbs measures for equilibrium ensembles of
graphs; Markovian graphs. Why does it seem like the edges are the
important random variables, rather than the nodes?
Data analysis in general
and community discovery in particular
get their own notebooks. So does the connection
between network topology and
synchronization. The "homophily or influence?" problem.
See also:
Biochemical Network Evolution;
Ecology;
Neuroscience;
Signal Transduction, Gene Regulation
and Control of Metabolism;
Social Networks;
Sociology of Science;
Statistical Mechanics;
Synchronization
Recommended, big picture:
- Reka Albert and Albert-Laszlo Barabasi, "Statistical Mechanics of
Complex Networks,"
cond-mat/0106096
- Rick Durrett, Random Graph Dynamics
[Durrett's site for
the book, including the full text of the introductory chapter.]
- Mark E. J. Newman
- "The structure and function of complex
networks," cond-mat/0303516
- Networks: An Introduction [A.k.a. "the big
black book"]
- Duncan Watts
Recommended, close-ups:
- Yael Artzy-Randrup, Sarel J. Fleishman, Nir Ben-Tal and Lewi Stone,
"Comment on 'Network Motifs: Simple Building Blocks of Complex Networks' and
'Superfamilies of Evolved and Designed Networks'", Science 305
(2004): 1107
- M. Argollo de Menezes and A.-L. Barabasi, "Fluctuations in network
dynamics", cond-mat/0306304
= Physical Review Letters 92 (2004): 028701
- Itai Benjamini, Nicolas Curien, "Ergodic Theory on Stationary Random Graphs", arxiv:1011.2526
- Aaron Clauset and Cristopher Moore, "How Do Networks Become
Navigable?", cond-mat/0309415
- Santo Fortunato, Alessandro Flammini, Filippo Menczer, "Scale-free
network growth by
ranking", cond-mat/0602081
- Vamsi Kalapala, Vishal Sanwalani, Aaron Clauset, and Cristopher
Moore, "Scale Invariance in Road
Networks", physics/0510198
= Physical Review E 73 (2006): 026130
- Eben Kenah and James Robins, "Second look at the spread of
epidemics on
networks", q-bio.QM/0610057
- Raya Khanin and Ernst Wit, "How Scale-Free Are Biological
Networks?",
Journal of Computational Biology
13 (2006): 810--818 [Ans.: not very scale-free at all.]
- Evelyn Fox Keller, "Revisiting 'scale-free'
networks", BioEssays 27
(2005): 1060--1068
- Judith Kleinfeld, "Could It Be a Big World After All? What the
Milgram Papers in the Yale Archive Reveal About the Original Small World Study"
[Six degrees of separation, for the general population, is quite
unsupported empirically. Of course it works for other kinds of networks, e.g.,
people in a common profession, or participating in a common institution; but
that's different. Preprint.]
- Adrián López García de Lomana, Qasim K. Beg,
G. de Fabritiis and Jordi Villà-Freixa, "Statistical Analysis of Global Connectivity and Activity Distributions in Cellular Networks", arxiv:1004.3138
- Manul Middendorf, Etay Ziv and Chris Wiggins, "Inferring Network
Mechanisms: The Drosophila melanogaster Protein Interaction
Network", q-bio.QM/0408010
[Machine learning meets complex
networks: specifically, learning decision trees to accurately classify networks
by the process which grew them. Neat.]
- Cristopher Moore, Gourab Ghoshal and Mark E. J. Newman, "Exact
solutions for models of evolving networks with addition and deletion of
nodes", cond-mat/0604069
= Physical Review
E 74 (2006): 036121 [The most amusing result is
that in a network where new nodes form links by preferential attachment, but
the size of addition and deletion of nodes exactly balance, the degree
distribution follows a stretched exponential (Weibull) distribution, rather
than a power law. This suggests, as they note, that the current exponent for
the web should increase as its growth-rate slows. "A sufficiently large
exponent would make the distribution indistinguishable experimentally from an
exponential or stretched exponential distribution, although we do not
realistically anticipate seeing behaivor of this type any time in the near
future."]
- Mark E. J. Newman
- M. E. J. Newman, Steven H. Strogatz and Duncan J. Watts,
"Random graphs with arbitrary degree distributions and their applications",
Physical Review E 64 (2001): 026118
= cond-mat/0007235
- Robin Pemantle and Brian Skyrms, "Network formation by
reinforcement learning: the long and medium run", math.PR/0404106 =
Mathematical Social Sciences 48 (2004): 315--327
- David M. Pennock et al., "Winners don't take all: Characterizing
the competition for links on the web", Proceedings of the National
Academy of Sciences 99 (2002): 5207--5211
- Bo Söderberg, "A General Formalism for Inhomogeneous Random
Graphs", cond-mat/0211063
[A re-invention of block models, but a clear one]
- Ricard V. Sole, Romualdo Pastor-Satorras, Eric Smith and Thomas
B. Kepler, "A model of large-scale proteome evolution," Advances in
Complex Systems 5 (2002): 43ff = cond-mat/0207311
- R. Vilela Mendes, "Tools for Network Dynamics", International Journal
of Bifurcations and Chaos 15 (2005): 1185--1213
[There is, admittedly, something of a bias here towards tools devised by Vilela
Mendes and collaborators, but I daresay I'd do the same were I to write such a
piece. More importantly, they're good stuff.]
- Duncan Watts and Steven Strogatz, "Collective Dynamics of
`Small-World' Networks," Nature 393 (1998):
440--442
Noted, but not fully recommended:
- Kartik Anand and Ginestra Bianconi, "Entropy measures for networks:
Toward an information theory of complex
topologies", Physical
Review E 80 (2009): 045102
= arxiv:0907.1514 [Basic
calculations comparing the log of the number of graphs compatible with given
macroscopic properties (the Boltzmann entropy; weirdly called the Gibbs entropy
here) with the Shannon entropy of the distribution of graphs where those
properties hold in the mean (which is the actual Gibbs entropy in statistical
mechanics). These are not equivalent, which evidently causes some distress to
the authors; no mention of the literature on ensemble in-equivalence and its
roots (see e.g. the
references here) or to the vast
literature on exponential families of random graphs (i.e., ensembles of maximum
Shannon entropy).]
Modesty forbids me to recommend:
- CRS, "Growth, Form, Function, Crashes," Santa Fe Institute
Bulletin 15:2 (2000) [On-line]
- CRS, "Networks and
Netwars"
To read:
- Juan A. Acebrón, Sergi Lozano, and Alex Arenas,
"Amplified Signal Response in Scale-Free Networks by Collaborative Signaling",
Physical Review
Letters 99 (2007): 128701
- Lada A. Adamic, R. M. Lukose, A. R. Puniyani and Bernardo
A. Huberman, "Search in Power-Law Networks,"
cs.NI/0103016
- Reka Albert, Istvan Albert and Gary L. Nakarad, "Structural
Vulnerability of the North American Power Grid", cond-mat/0401084
- David J. Aldous, "A Tractable Complex Network Model based on the
Stochastic Mean-field Model of Distance", cond-mat/0304701
- Eivind Almaas and A.-L. Barabasi, "Power laws in biological
networks", q-bio.MN/0401010
- Tanya Araujo, R. Vilela Mendes and Joao Seixas, "A dynamical
characterization of the small world phase,"
cond-mat/0204573
- Alex Arenas, Albert Diaz-Guilera, Jurgen Kurths, Yamir Moreno,
Changsong Zhou, "Synchronization in complex networks", Physics
Reports
469 (2008):
93--153, arxiv:0805.2976
- M. Argollo de Menezes and A.-L. Barabasi, "Separating internal and
external dynamics of complex systems", cond-mat/0406421
- K. B. Athreya, A. P. Ghosh, S. Sethuraman, "Growth of preferential
attachment random graphs via continuous-time branching
processes", math.PR/0701649
- J. P. Bagrow, E. M. Bollt, J. D. Skufca, D. ben-Avraham, "Portraits
of Complex
Networks", cond-mat/0703470
- Duygu Balcan and Ayse Erzan, "Content-based networks: A pedagogical
overview", Chaos
17 (2007): 026108
- Pierre Baldi et al., Modeling the Internet and the Web:
Probabilistic Methods and Algorithms
- L. Barnett, C. L. Buckley, S. Bullock, "A Graph Theoretic Interpretation of Neural Complexity", arxiv:1011.5334
- F. Barra and P. Gaspard, "Classical Dynamics on Graphs,"
nlin.CD/0011045
- Alain Barrat, Marc Barthelemy and Alessandro Vespignani
- "Weighted evolving networks: coupling topology and weights
dynamics", cond-mat/0401057
= PRL 92 (2004): 228701
- "Modeling the evolution of weighted networks", cond-mat/0406238
- "The effects of spatial constraints on the evolution of
weighted complex networks", physics/0504029
- Dynamical Processes on Complex Networks
[Blurb; favorable but not very
deep review in Journal of Statistical Physics]
- Yaneer Bar-Yam and Irving R. Epstein, "Response of complex networks
to stimuli", Proceedings
of the National Academy of Sciences 10.1073/pnas.0400673101
- Vladimir Batagelj and Ulrik Brandes, "Efficient generation of large
random networks", Physical Review
E
71 (2005): 036113
- E. Ben-Naim and P.L. Krapivsky, "Addition-Deletion Networks",
cond-mat/0703636
- Eli Ben-Naim and Zoltan Toroczkai (eds.), Complex
Networks
- N. Berger, C. Borgs, J. T. Chayes, R. M. D'Souza and R. D.
Kleinberg
- "Competition-Induced Preferential Attachment", cond-mat/0402268 [Short
version]
- "Degree Distribution of Competition-Induced Preferential
Attachment Graphs", cond-mat/0502205 [Version with
proofs]
- Cristoly Biely and Stefan Thurner, "Statistical mechanics of
scale-free networks at a critical point: Complexity without irreversibility?",
cond-mat/0507670
- Ginestra Bianconi, "A statistical mechanics approach for scale-free
networks and finite-scale
networks", cond-mat/0703191
- Ginestra Bianconi and Matteo Marsili, "Clogging and self-organized
criticality in complex networks", Physical Review
E 70 (2004): 035105(R)
= cond-mat/0312537
- Sven Bilke and C. Peterson, "Topological Properties of Citation
and Metabolic Networks",
cond-mat/0103361
- Sven Bilke and F. Sjunnesson, "Stability of the Kauffman Model,"
cond-mat/0107035
- Golnoosh Bizhani, Peter Grassberger, and Maya Paczuski, "Random
sequential renormalization and agglomerative percolation in networks:
Application to Erdos-Renyi and scale-free
graphs", Physical
Review E 84 (2011): 066111
- Philippe Blanchard and T. Krueger, "The `Cameo Principle' and the
Origin of Scale-Free Graphs in Social
Networks," cond-mat/0302611
- Vincent Blondel, Anahi Gajardo, Maureen Heymans, Pierre Senellart,
Paul Van Dooren, "A measure of similarity between graph
vertices", cs.IR/0407061
- Christian Borgs, Jennifer Chayes, Constantinos Daskalakis,
Sebastien Roch, "First to Market is not Everything: an Analysis of Preferential
Attachment with
Fitness", arxiv:0710.4982
["rigorous analysis of preferential attachment with fitness ... Depending on
the shape of the fitness distribution, we observe three distinct phases: a
first-mover-advantage phase, a fit-get-richer phase and an innovation-pays-off
phase."]
- Christian Borgs, Jennifer Chayes, Laszlo Lovasz, Vera T. Sos,
Balazs Szegedy and Katalin Vesztergombi, "Graph Limits and Parameter Testing"
[PDF reprint]
- Christian Borgs, Jennifer Chayes, Laszlo Lovasz, Vera T. Sos,
and Katalin Vesztergombi, "Convergent sequences of dense graphs"
- "I: Subgraph
frequencies, metric properties and testing" [PDF reprint]
- "II: Multiway cuts and statistical physics" [PDF preprint]
- Stefan Bornholdt and H. G. Schuster (eds.), Handbook of
Graphs and Networks: From the Genome to the Internet
- Romulus Breban, Raffaele Vardavas and Sally Blower,
"Linking population-level models with growing networks: A class of epidemic models", Physical Review E 72 (2005): 046110
- Markus Brede and Sitabhra Sinha, "Assortative mixing by degree
makes a network more unstable", cond-mat/0507710
- Tom Britton and Mathias Lindholm, "Dynamic Random Networks in Dynamic Populations", Journal of Statistical Physics 139
(2010): 518--535
- Z. Burda, J. D. Correia and Andre Krzywicki, "Statistical ensemble
of scale-free random graphs,"
cond-mat/0104155
- Guido Caldarelli, Scale-Free Networks: Complex Webs in Nature
and Technology [author's
book site]
- Guido Caldarelli and Alessandro Vespignani (eds.), Large Scale Structure and Dynamics of Complex Networks: From Information Technology to Finance and Natural Science
-
Callaway, Hopcroft, Kleinberg, Newman and Strogatz, "Are Randomly
Grown Graphs Really Random?"
cond-mat/0104546
- J. Camacho, R. Guimera and L. A. N. Amaral, "Robust Patterns in
Food Web Structure,"
cond-mat/0103114
- Andrea Capocci, G. Caldarelli and P. De Los Rios, "Quantitative
description and modeling of real networks,"
cond-mat/0206336
- Damon Centola, "Failure in Complex Social Networks",
Journal of Mathematical
Sociology
33 (2009): 654--68
- Michele Catanzaro, Marian Boguna and Romualdo Pastor-Satorras,
"Generation of uncorrelated random scale-free networks", cond-mat/0408110
- Sourav Chatterjee and Partha S. Dey, "Applications of Stein's method for concentration inequalities", Annals of Probability
38 (2010): 2443--2485, arxiv:0906.1034
- Sourav Chatterjee and S. R. S. Varadhan, "The large deviation principle for the Erdos-Renyi random graph", arxiv:1008.1946
- Mario Chavez, Miguel Valencia, Vito Latora, Jacques Martinerie, "Complex networks: new trends for the analysis of brain connectivity", arxiv:1002.0697
- Fan Chung and Linyuan Lu, Complex Graphs and Networks
[Blurb]
- L. Cisneros, J. Jimenez, M. G. Cosenza, and A. Parravano,
"Information transfer and nontrivial collective behavior in chaotic coupled
map networks,"
nlin.CD/0202010
- Jens Christian Claussen, "Offdiagonal Complexity: A computationally
quick complexity measure for graphs and networks", q-bio.MN/0410024
- Mauro Copelli, Paulo R. A. Campos, "Excitable Scale Free Networks",
q-bio.NC/0703004
- M. Copelli, R. M. Zorzenon dos Santos and J. S. Sa Martins,
"Emergence of Hierarchy on a Network of Complementary Agents,"
cond-mat/0110350
- Francois Coppex, Michel Droz and Adam Lipowski, "Extinction
dynamics of Lotka-Volterra ecosystems on evolving networks", q-bio.PE/0312030
- Luciano da F. Costa, Francisco A. Rodrigues, Gonzalo Travieso and
P. R. Villas Boas, "Characterization of complex networks: A survey of
measurements", cond-mat/0505185
- Luciano da F. Costa and Olaf Sporns, "Hierarchical Features of
Large-Scale Cortical Connectivity", q-bio.NC/0508007
- R. W. R. Darling and J. R. Norris, "Structure of large random
hypergraphs", Annals of Applied
Probability 15 (2005): 125--152 = math.PR/0503460 ["The theme of
this paper is the derivation of analytical formulae for certain large
combinatorial structures. The formulae are obtained via fluid limits of pure
jump-type Markov processes..."]
- M.A.M. de Aguiar and Y. Bar-Yam, "Spectral Analysis and the Dynamic
Response of Complex Networks", nlin.AO/0306043
- Charo I. Del Genio, Thilo Gross, and Kevin E. Bassler, "All Scale-Free Networks Are Sparse", Physical Review Letters 107 (2011): 178701
- Peter Dodds and Duncan Watts, "Universal behavior in a generalized
model of contagion", cond-mat/0403699
- S. N. Dorogovtsev, "Renormalization group for evolving networks,"
cond-mat/0301008
- S. N. Dorogovtsev, A. V. Goltsev and J. F. F. Mendes
- "Correlations in interacting systems with a network
topology", cond-mat/0506002
[Claims to have an argument establishing "dramatic weakening of correlations
between second and more distant neighbors on networks with fat-tailed degree
distributions", such that in the large size limit "only the pair correlations
between the nearest neighbors are observable". I'm skeptical that they could
have an adequately general argument to really establish this, but I'll read it
before coming to any conclusion.]
- "Ising Model on Networks with an Arbitrary Distribution of
Connections,"
cond-mat/0203227
- "Pseudofractal Scale-free Web,"
cond-mat/0112143
- S. N. Dorogovtsev and J. F. F. Mendes
- "Comment on `Breakdown of the Internet under Intentional
Attack'," cond-mat/0109083
- "Evolution of random networks,"
cond-mat/0106144
- Evolution of Networks: From Biological Nets to the
Internet and WWW
- S. N. Dorogovtsev, J. F. F. Mendes and A. N. Samukhin
- Raissa M. D'Souza, Soumen Roy, "Network Growth with Feedback",
arxiv:0805.4020
- Holger Ebel, Lutz-Ingo Mielsch and Stefan Bornholdt, "Scale-free
topology of e-mail networks,"
cond-mat/0201476
- Jean-Pierre Eckmann and Elisha Moses, "Curvature of Co-Links
Uncovers Hidden Thematic Layers in the World Wide Web,"
cond-mat/0110338
- Victor M. Eguiluz, Dante R. Chialvo, Guillermo A. Cecchi, Marwan
Baliki and A. Vania Apkarian, "Scale-free brain functional networks",
Physical Review
Letters 94 (2005): 018102 = cond-mat/0309092
- Andreas Engel, Remi Monasson and Alexander K. Hartmann, "On Large
Deviation Properties of Erdos-Renyi Random Graphs", Journal of Statistical
Physics 117 (2004): 387--426
- Ernesto Estrada and Juan A. Rodriguez-Velazquez
- Ying Fan, Menghui Li, Jiawei Chen, Liang Gao, Zengru Di and Jinshan
Wu, "Network of Econophysicists: a weighted network to investigate the
development of Econophysics", cond-mat/0401054
- Nadia Farid and Kim Christensen, "Evolving networks through
deletion and duplication", physics/0609172
- Iles Farkas, I. Derenyi, H. Jeong, Z. Neda, Z. N. Oltvai,
E. Ravasz, A. Schubert, Albert-Laszlo Barabasi and Tamas Vicsek, "Networks in
life: Scaling properties and eigenvalue spectra," cond-mat/0303106
- Iles Farkas, I. Derenyi, G. Palla, and T. Vicsek, "Equilibrium
statistical mechanics of network structures", cond-mat/0401640
- Iles J. Farkas, H. Jeong, Tamas Vicsek, Albert-Laszlo Barabasi and
Z. N. Oltvai, "The topology of the transcription regulatory network in the
yeast, S. cerevisiae,"
cond-mat/0205181
- Trevor Fenner, Mark Levene and George Loizou, "A Model for
Collaboration Networks Giving Rise to a Power Law Distribution with an
Exponential Cutoff", physics/0503184
- Ramon Ferrer i Cancho and Ricard V. Sole
- "The Small-World of Human Language," SFI Working Paper
01-03-016
- "Optimization in complex networks,"
cond-mat/01 1222
- O. Frank and D. Strauss, "Markov graphs", Journal of
the American Statistical Association 81 (1986):
832--842
- Agata Fronczak, Piotr Fronczak, Janusz A. Holyst,
"Fluctuation-dissipation relations for complex
networks", cond-mat/0509042
= Physical Review
E 73 (2006): 016108 [The fluctuations are
fluctuations of the network structure within an equilibrium
ensemble. This is an interesting idea, though I'd need to not just read it
but also think carefully before deciding whether it really applies to actual
networks]
- Diego Garlaschelli, Sebastian E. Ahnert, Thomas M. A. Fink and
Guido Caldarelli, "Temperature in complex
networks", cond-mat/0606805
- Kwang-Il Goh, B. Kahng, and D. Kim, "Spectra and eigenvectors of
scale-free networks,"
cond-mat/0103337
- Kwang-Il Goh, E. S. Oh, H. Jeong, B. Kahng, and D. Kim,
"Classification of scale free networks,"
cond-mat/0205232
- A. V. Goltsev, S. N. Dorogovtsev and J. F. F. Mendes, "Critical
Phenomena in Networks,"
cond-mat/0204596
- Jesus Gomez-Gardenes and Vito Latora, "Entropy rate of diffusion processes on complex networks", Physical
Review E 78 (2008): 065102
- Sean P. Gorman, Rajendra G. Kulkarni, Laurie A. Schintler and Roger
R. Stough, "A Predator Prey Approach to Diversity Based Defenses in
Heterogeneous Networks", cond-mat/0401017
- Geoffrey Grimmett, Probability on Graphs: Random
Processes on Graphs and Lattices [blurb]
- Andreas Grönlund, "The difference in directed structure of
Neural and Transcriptional Regulation Networks", cond-mat/0406268
- Andreas Grönlund, Kim Sneppen and Petter Minnhagen,
"Correlations in Networks Associated to Preferential Growth", cond-mat/0401537
- Thilo Gross and Bernd Blasius, "Adaptive Coevolutionary Networks -- A Review", arxiv:0709.1858
- Jean-Loup Guillaume and Matthieu Latap, "A Realistic Model for
Complex Networks," cond-mat/0307095
- Roger Guimera, A. Arenas and A. Diaz-Guilera, "Communication and
optimal hierarchical networks,"
cond-mat/0103112
- Aric Hagberg, Pieter J. Swart and Daniel A. Schult, "Designing
threshold networks with given structural and dynamical properties", Physical Review
E 74 (2006): 056116
- Laurent Hébert-Dufresne, Pierre-André Noël, Vincent Marceau, Antoine Allard, Louis J. Dubé, "Propagation dynamics on networks featuring complex topologies", arxiv:1005.1397
- Adriano de Jesus Holanda, Ivan Torres Pisa, Osame Kinouchi,
Alexandre Souto Martinez and Evandro Eduardo Seron Ruiz, "Thesaurus as a
complex network", cond-mat/0312586
- Petter Holme
- Petter Holme, Christofer R. Edling and Frederik Liljeros,
"Structure and time evolution of an Internet dating community", Social
Networks 26 (2004): 155-174
- Petter Holme and Gourab Ghoshal, "Dynamics of Networking Agents
Competing for High Centrality and Low
Degree", Physical Review
Letters 96 (2006): 098701
- Petter Holme and Beom Jun Kim
- Takashi Icinomiya, "Path-integral approach to the dynamics of a
sparse random network", cond-mat/0507285 = Physical Review
E 72 (2005): 016109
- Mads Ipsen and Alexander S. Mikhailov, "Evolutionary
reconstruction of networks,"
nlin.AO/0111023
- Shalev Itzkovitz, Reuven Levitt, Nadav Kashtan, Ron Milo, Michael
Itzkovitz and Uri Alon, "Coarse-Graining and Self-Dissimilarity of Complex
Networks", q-bio.MN/0405011
- Junji Ito and Kunihiko Kaneko, "Spontaneous structure formation
in a network of chaotic units with variable connection strengths,"
cond-mat/0108408
- Sarika Jalan and R. E. Amritkar, "Self-organized and driven phase
synchronization in coupled map scale free networks,"
nlin.AO/0201051
- Henrik Jeldtot Jensen, "Emergence of Network Structure in Models of
Collective Evolution and Evolutionary
Dynamics", arxiv:0709.2009
- Jürgen Jost and M. P. Joy, "Evolving networks with distance
preferences," cond-mat/0202343
- Brian Karrer, M. E. J. Newman, "Random graphs containing arbitrary distributions of subgraphs", arxiv:1005.1659
- Alon Keinan, Ben Sandbank, Claus C. Hilgetag, Isaac Meilijson and
Eytan Ruppin, "Fair Attribution of Functional Contribution in Artificial and
Biological Networks", Neural
Computation 16 (2004): 1887--1915
- Susan Khor, "Concurrency and Network Disassortativity",
Artificial Life 16 (2010): 225--232
- Beom Jun Kim, "Performance of networks of artificial neurons: The
role of clustering",
q-bio.NC/0402045
- Beom Jun Kim, Ala Trusina, Petter Holme, Petter Minnhagen, Jean
S. Chung, and M. Y. Choi, "Dynamic instabilities induced by asymmetric
influence: Prisoners' dilemma game on small-world networks,"
cond-mat/0206533
- Beom Jun Kim, Chang No Yoon, Seung Kee Han, and Hawoong Jeong,
"Path finding strategies in scale-free networks,"
cond-mat/0111232
- J. Kim, P. L. Krapivsky, B. Kahng and Sidney Redner, "Evolving
Protein Interaction Networks,"
cond-mat/0203167
- Osame Kinouchi, A. S. Martinez, G. F. Lima, G. M. Lourenco,
S. Risau-Gusman, "Deterministic walks in random networks: an application to
thesaurus graphs,"
cond-mat/0110217
- P. L. Krapivsky and Sidney Redner, "Finiteness and Fluctuations in
Growing Networks,"
cond-mat/0207107
- P. L. Krapivsky, Sidney Redner, and F. Leyvraz, "Connectivity of
Growing Random Networks,"
cond-mat/0005139
- Andre Krzywicki, "Defining statistical ensembles of random
graphs," cond-mat/0110574
- Marcelo Kuperman and Damian Zanette, "Stochastic resonance in a
model of opinion formation on small-world networks,"
cond-mat/0111289 [Of
course social psychology has to be just like the Ising model; how else
could physicists study it?]
- Maciej Kurant and Patrick Thiran, "Layered Complex
Networks", physics/0510194
- D.-S. Lee, K.-I. Goh, B. Kahng and D. Kim, "Sandpile avalanche
dynamics on scale-free networks", cond-mat/0401531
- E. A. Leicht, Petter Holme, and M. E. J. Newman, "Vertex similarity
in
networks", physics/0510143
["We consider methods for quantifying the similarity of vertices in
networks. We propose a measure of similarity based on the concept that two
vertices are similar if their immediate neighbors in the network are themselves
similar. This leads to a self-consistent matrix formulation of similarity that
can be evaluated iteratively using only a knowledge of the adjacency matrix of
the network. We test our similarity measure on computer-generated networks for
which the expected results are known, and on a number of real-world networks."]
- Chunguang Li, Philip K. Maini, "An evolving network model with
community
structure", physics/0510239
= Journal of Physics A: Mathematical and
General 38 (2005): 9741--9749
- Lun Li, David Alderson, Reiko Tanaka, John C. Doyle and Walter
Willinger, "Towards a Theory of Scale-Free Graphs: Definition, Properties, and
Implications", cond-mat/0501169 ["Although
the ``scale-free'' literature is large and growing, it gives neither a precise
definition of scale-free graphs nor rigorous proofs of many of their claimed
properties. In fact, it is easily shown that the existing theory has many
inherent contradictions and verifiably false claims." Ouch.]
- Susanna C. Manrubia, Jordi Delgado, and Bartolo Luque, "Small-world
behavior in a system of mobile elements,"
cond-mat/0102069
- Sergei Maslov and Kim Sneppen
- "Pattern Detection in Complex Networks: Correlation Profile
of the Internet,"
cond-mat/0205379
[Physicists rediscover bootstrap testing]
- "Specificity and stability in topology of protein
networks,"
cond-mat/0205380
- Vincent Marceau, Pierre-André Noël, Laurent Hébert-Dufresne, Antoine Allard, Louis J. Dubé, "Adaptive networks: coevolution of disease and topology", arxiv:1005.1299
- Carsten Marr, Marc-Thorsten Huett, "Outer-totalistic cellular automata on graphs", Physics Letters A 373 (2008):
546--549, arxiv:0812.2408
- Raoul-Martin Memmesheimer and Marc Timme, "Designing Complex
Networks", q-bio.NC/0606041
[i.e., designing networks of stylized neurons to spike in a given periodic
pattern. Well, it's a start...]
- Ron Milo, Shalev Itzkovitz, Nadav Kashtan, Reuven Levitt, Shai
Shen-Orr, Inbal Ayzenshtat, Michal Sheffer and Uri Alon, "Superfamilies of
Evolved and Designed Networks", Science 303
(2004): 1538--1542
- Andre A. Moreira, Jose S. Andrade, Hans J. Herrmann, Joseph O. Indekeu, "How to make a fragile network robust and vice versa",
Physical Review Letters 102 (2009): 018701,
arxiv:0812.3591
- Stefano Mossa, Marc Barthelemy, H. Eugene Stanley, and Luis
A. Nunes Amaral, "Truncation of power law behavior in `scale-free' network
models due to information filtering,"
cond-mat/0201421
- Adilson E. Motter, "Cascade control in complex networks", cond-mat/0401074
- Adilson E. Motter, Alessandro P. S. de Moura, Ying-Cheng Lai, and
Partha Dasgupta, "Topology of the conceptual network of language,"
cond-mat/0206530 =
Physical Review E 65 (2002): 065102(R)
- J. C. Nacher, T. Yamada, S. Goto, M. Kanehisa and T. Akutsu, "Two
complementary representations of a scale-free network", physics/0402072
- M. E. J. Newman, I. Jensen and R. M. Ziff, "Percolation and
epidemics in a two-dimensional small world,"
cond-mat/0108542
- Emi M. Nomura, Caterina Gratton, Renee M. Visser, Andrew
Kayser, Fernando Perez and Mark D'Esposito, "Double dissociation of two cognitive control networks in patients with focal brain lesions", Proceedings of the National Academy of Sciences (USA)
107 (2010): 121017--12022 [Supposedly involved
"graph-theory properties"]
- Jukka-Pekka Onnela, Jari Saramaki, Janos Kertesz, and Kimmo Kaski,
"Intensity and coherence of motifs in weighted complex networks",
Physical Review
E 71 (2005): 065103(R)
- R. N. Onody and P. A. de Castro, "Nonlinear Barabasi-Albert
Network", cond-mat/0402315
- Pekka Orponen and Satu Elisa Schaeffer, "Efficient Algorithms for
Sampling and Clustering of Large Nonuniform Networks", cond-mat/0406048
- Juyong Park and M. E. J. Newman, "The origin of degree correlations
in the Internet and other networks," cond-mat/0303327
- Derek J. de Solla Price
- "Networks of scientific papers", Science 149 (1965): 510--515
- "A general theory of bibliometric and other cumulative
advantage processes", J. Amer. Soc. Inform. Sci.
27 (1976): 292--306 [= Journal of the American Society
of Information Science?]
- Natasa Przulj, Derek G. Corneil and Igor Jurisica, "Modeling
Interactome: Scale-free or Geometric", q-bio.MN/0404017
- Anatolii A. Puhalskii, "Stochastic processes in random graphs",
math.PR/0402183 [Large
deviations for Erdos-Renyi graphs. Memo to self: how much work would it be to
extend this to Markovian graphs?]
- Amit Puniyani and Rajan Lukose, "Growing random networks under
constraints,"
cond-mat/0107391
- Filippo Radicchi, Alain Barrat, Santo Fortunato, Jose J. Ramasco, "Renormalization flows in complex networks", Physical Review E
79 (2009): 026104, arxiv:0811.2761
- Abolfazl Ramzanpour and V. Karimipour, "Simple models of small
world networks with directed links,"
cond-mat/0205244
- A. Rapoport
- "Contribution to the theory of random and biased
nets", Bulletin of Mathematical Biophysics 19
(1957): 257--277
- "Cycle distribution in random nets", Bulletin
of Mathematical Biophysics 10 (1968): 145--157
- Erzsebet Ravasz and Albert-Laszlo Barabasi, "Hierarchical
organization in complex networks,"
cond-mat/0206130 =
Physical Review E 67 (2003): 026112
- Hermann Riecke, Alex Roxin, Santiago Madruga and Sara A. Solla,
"Multiple attractors, long chaotic transients, and failure in small-world
networks of excitable
neurons", Chaos
17 (2007): 026110
- Olivier Rivoire and Julien Barré "Exactly solvable models of
adaptive
networks", cond-mat/0606754
["A satisfiability (SAT-UNSAT) transition takes place for many optimization
problems when the number of constraints, graphically represented by links
between variables nodes, is brought above some threshold. If the network of
constraints is allowed to adapt by redistributing its links, the SAT-UNSAT
transition may be delayed and preceded by an intermediate phase where the
structure self-organizes to satisfy the constraints." Which means what, for
the original logic problem? I should read the paper to see...]
- Hernan Rozenfeld and Daniel ben-Avraham, "Designer Nets from Local
Strategies", cond-mat/0401196
- Hernan D. Rozenfeld, Joseph E. Kirk, Erik M. Bollt and Daniel
ben-Avraham, "Statistics of Cycles: How Loopy is your Network?", cond-mat/0403536
- Manoj Pratim Samanta and Shoudan Liang, "Redundancies in
Large-scale Protein Interaction Networks," physics/0303027
- Alejandro D. Sanchez, Juan M. Lopez and Miguel A. Rodriguez,
"Nonequilibrium Phase Transitions in Directed Small-World Networks,"
cond-mat/0110500
- Leonard M. Sander, C. P. Warren and I. M. Sokolov, "Epidemics
disorder, and percolation," cond-mat/0301394
- Nima Sarshar and Vwani Roychowdhury, "Scale-Free and Stable
Structures in Complex Ad hoc Networks," cond-mat/0303041
- N. Schwartz, R. Cohen, D. ben-Avraham, A.-L. Barabasi and S.
Havlin, "Percolation in Directed Scale-Free Networks,"
cond-mat/0204523
- Philip Seliger, Stephen C. Young, and Lev S. Tsimring, "Plasticity
and learning in a network of coupled phase oscillators,"
nlin.AO/0110044
- M. Angeles Serrano and Marian Boguna
- M. Angeles Serrano, Dmitri Krioukov and Marian Boguna,
"Self-Similarity of Complex Networks and Hidden Metric
Spaces", Physical Review
Letters 100 (2008): 078701
- Dinghua Shi, Qinguha Chen and Liming Liu, "A Markov Chain-Based
Numerical Method for Calculating Network Degree Distributions", math-ph/0409080
- Julian Sienkiewicz, Piotr Fronczak, and Janusz A. Holyst,
"Log-periodic oscillations due to discrete effects in complex
networks", cond-mat/0608273
- Tiago Simas, Luis M. Rocha, "Stochastic model for scale-free networks with cutoffs", arxiv:0901.0159
- Ingve Simonsen, Kasper Astrup Eriksen, Sergei Maslov and Kim
Sneppen, "Diffusion on Complex Networks : A way to probe their large scale
topological structures",
cond-mat/0312476
= Physica A 336 (2004): 163--173
- Sitabhra Sinha and Sudeshna Sinha, "Evidence of universality for
the May-Wigner stability theorem for random networks with local dynamics",
nlin.AO/0402002
- Arne Skjeltorp and Alexander Belushkin (eds.), Dynamics of
Complex Interconnected Systems: Networks and Bioprocesses
- David M. D. Smith, Chiu Fan Lee and Neil F. Johnson, "Realistic
network growth using only local information: From random to scale-free and
beyond", cond-mat/0608733
- Sara Nadiv Soffer and Alexei Vazquez, "Network clustering
coefficient without degree-correlation biases", Physical Review
E 71 (2005): 057101 ["The clustering coefficient
quantifies how well connected are the neighbors of a vertex in a graph. In real
networks it decreases with the vertex degree, which has been taken as a
signature of the network hierarchical structure. Here we show that this
signature of hierarchical structure is a consequence of degree-correlation
biases in the clustering coefficient definition. We introduce a definition in
which the degree-correlation biases are filtered out, and provide evidence that
in real networks the clustering coefficient is constant or decays
logarithmically with vertex degree."]
- R. Solomonoff and A. Rapoport, "Connectivity of Random Nets",
Bulletin of Mathematical Biophysics 13 (1951):
107--117 [Yes, the
R. Solomonoff]
- Olaf Sporns, Networks of the Brain [Blurb]
- Russell K. Standish, "Complexity of Networks (reprise)", arxiv:0911.3482
- Dietrich Stauffer and Amnon Aharony, "Efficient Hopfield pattern
recognition on a scale-free neural network," cond-mat/0212601
- Hrvoje Stefancic and Vinko Zlatic, "Preferential attachment
with information filtering: node degree probability distribution properties",
cond-mat/0404495
- Gyorgy Szabo and Gabor Fath, "Evolutionary games on graphs",
cond-mat/0607344
- Gergely J Szollosi, Imre Derenyi, "Hierarchical meanfield theory of
evolutionary games on structured
populations", arxiv:0704.0357
- Kazuhiro Takemoto and Chikoo Oosawa, "Evolving networks by
merging cliques", Physical Review E 72
(2005): 046116
- Makoto Uchida and Susumu Shirayama, "Effect of initial conditions
on Glauber dynamics in complex
networks", cond-mat/0702482
- S. Valverde, R. Ferrer i Cancho and R. V. Sole, "Scale-free
Networks from Optimal Design,"
cond-mat/0204344
- J. van den Berg, Geoffrey R. Grimmett and Rinaldo B. Schinazi,
"Dependent Random Graphs and Spatial Epidemics", Annals of Applied
Probability 8 (1998): 317--336
- Alexei Vazquez, "Statistics of citation networks,"
cond-mat/0105031
- Alexei Vazquez and Albert-Laszlo Barabasi, "The inhomogeneous
evolution of subgraphs and cycles in complex networks", cond-mat/0501399
- Alexei Vazquez, A. Flammini, A. Maritan, and A. Vespignani,
"Modeling of protein interaction networks,"
cond-mat/0108043
- Alexei Vazquez and Martin Weigt, "Computational complexity arising
from degree correlations in networks,"
cond-mat/0207035
- R. Vilela Mendes, "Network Dependence of Strong Reciprocity",
Advances in Complex
Systems 7 (2004): 357--368
- Dmitri Volchenkov and Ph. Blanchard, "An Algorithm Generating Scale
Free Graphs,"
cond-mat/0204126
- Andreas Wagner, "How the global structure of protein interaction
networks evolves,"
cond-mat/0207043
- C. P. Warren, Leonard M. Sander and I. M. Sokolov, "Geography in a
Scale-Free Network Model,"
cond-mat/0207324
- Duncan J. Watts, Peter S. Dodds and Mark E. J. Newman, "Identity
and Search in Social Networks,"
cond-mat/0205383 =
Science 296 (2002): 1302--1305
- Martin Weigt, "Dynamics of heuristic optimization algorithms on
random graphs,"
cond-mat/0203281
- Chris Wiggins and Ilya Nemenman, "Process Pathway Inference via
Time Series Analysis,"
physics/0206031
- Walter Willinger et al., "Scaling Phenomena in the Internet:
Critically Examining Criticality", Proceedings of the National
Academy of Sciences 99 (2002): 2673--2580
- An-Cai Wu, Xin-Jian Xu, and Ying-Hai Wang, "Excitable
Greenberg-Hastings cellular automaton model on scale-free
networks",
Physical Review
E 75 (2007): 032901
- Ramon Xulvi-Brunet and I. M. Sokolov, "Evolving networks with
disadvantaged long-range connections,"
cond-mat/0205136
- Piet Van Mieghem, Graph Spectra for Complex
Networks [blurb]
- Jie Zhang, Changsong Zhou, Xiaoke Xu and Michael Small, "Mapping
from Architeture to Dynamics: A Unified View of Dynamical Processes on
Networks", arxiv:0908.2248 [From a
quick scan, vastly less general and ambitious than the title or even the
abstract promises; but worth looking at more closely.]
- Chaopin Zhu, Anthony Kuh, Juan Wang and Philippe De Wilde,
"Analysis of an evolving email
network", Physical
Review E 74 (2006): 046109
- Etay Ziv, Manuel Middendorf and Chris Wiggins,
"Information-Theoretic Approach to Network Modularity", q-bio.QM/0411033 = PRE 71
(2005): 046117
- Alexander Zumdieck, Marc Timme, Theo Geisel and Fred Wolf, "Long
Chaotic Transients in Complex Networks", cond-mat/0401038
#
Evolution (of Organisms)
[A proper discussion of evolution will appear here Any Time Now.]
Issues in evolution proper:
adaptation;
complexity;
developmental constraints and the
evolution of development;
ecology and co-evolution;
genetics;
sociobiology (in non-human beasties; in human beings, and what exactly
it can and cannot account for);
units of selection controversies (genes [Dawkins, Maynard Smith,
Williams] vs. gene-complexes [Lewontin, sorta] vs. organisms [Williams the
first time around] vs. groups [Sober?]) and group-selection arguments (when
can traits which benefit a higher level of selection at the expense of the
lower ones evolve? Probably never; the higher level entities don't have enough
coherence and persistence to act as replicators).
Query: What is known about the asymptotic distribution of the
population under (discrete-time) replicator dynamics? What if the space of
types in the replicator dynamics is infinite-dimensional? Or the fitness
function is subject to stochastic shocks? Or both? (This now has
its own notebook.)
Extensions of evolution:
to brain function;
to computer programming ;
to culture (memetics);
to economics;
to epistemology;
to psychology.
Mathematical modeling: classical population genetics à
la Fisher, Haldane and Wright, and its extensions via dynamics; game theory à la John Maynard
Smith. Connections to physics.
Agent-based modeling.
Challenges to neo-Darwinism: Here, as usual, my inclinations are
conservative, in that I really don't see what's wrong with the orthodox theory.
In any case, there don't seem to be any real alternatives yet
advanced. (Neutral mutations by definition explain the origins of neither
adaptations nor species.)
Recommended, non-technical:
- John Tyler Bonner
- On Development [Points out that it's a mistake
to think of just the adult form-and-behavior "evolving"; it's really the
whole life-cycle, complicating matters considerably...]
- On the Evolution of Complexity, by Means of Natural
Selection [Review]
- Richard Dawkins
- The Blind Watchmaker
- Climbing Mount Improbable
- The Selfish Gene
- Daniel Dennett, Darwin's Dangerous
Idea
- Theodosius Dobzhansky, "Nothing in Biology Makes Sense
Except in the Light of Evolution" [first pub. American Biology
Teacher 35 (March 1973): 125--129]
- Stephen Jay Gould, anything. [But be warned that a lot of what he
says about the Burgess Shale fauna in Wonderful Life is disputed
--- by his own sources...]
- François Jacob, The Possible and the Actual [I
really must get around to typing up my notes from his lecture on conserved gene
sequences]
- Charles H. Lineweaver, "Paleontological Tests: Human-like
Intelligence is not a Convergent Feature of
Evolution", arxiv:0711.1751 [A
very nice, and simple, way to see the error of some common ways of looking at
evolution]
- John Maynard Smith [As a god to population genetics; disciple of
Haldane]
- Did Darwin Get It Right? Essays on Games, Sex, and
Evolution
- The Theory of Evolution [An introductory book
which avoids getting tangled up in technical details without sacrificing
intellectual rigor or pushing any really strange ideas]
- and Eörs Szathmáry, The Origins of Life:
From the Birth of Life to the Origin of Language [Review: Major Transitions
Minor]
- Jacques Monod, Chance and Necessity [Review by Danny
Yee]
- Karl Sigmund, Games of Life: Explorations in Ecology,
Evolution and Behavior
- Homer W. Smith, Kamongo, or, The Lungfish and the
Padre
- talk.origins is where Usenet hashes
out creation vs. evolution. (It should be pretty obvious that I am on the side
of the apes.) There's a fine website
with archives of posts to the newsgroups, the excellent FAQs (read 'em
before attempting to convert the heathen), a nifty gallery of
fossils and the University of
Edicara
- Bruce Wallace and Adrian M. Srb, Adaptation
- Jonathan Weiner, The Beak of the Finch [Field
confirmation of neo-Darwinism, from, nicely enough, the Galapagos. (Please
don't start an argument about what it means to confirm a scientific theory
here.)]
Recommended, technical:
- Armen A. Allahverdyan and Chin-Kun Hu, "Replicators in Fine-grained
Environment: Adaptation and
Polymorphism", arxiv:0905.3297 [A
cute use of averaging techniques, though the particular one they employ will I
think break down if the perturbations to fitness are not strictly periodic.]
- Lauren W. Ancel and Walter Fontana, "Plasticity, Evolvability and
Modularity in RNA," Journal of Experimental Zoology (Molecular and
Developmental Evolution), 288 (2000): 242--283 [Reprint]
- Carl T. Bergstrom and Michael Lachmann, "The fitness value of
information", q-bio.PE/0510007
- Charles Darwin, On the Origin of
Species, by Means of Natural Selection, or Preservation of Favored Races in the
Struggle for Life [Get the fascimile of the first edition with the
introduction by Ernst Mayr]
- Richard Dawkins, The Extended
Phenotype [Review: Not
Your Parents' Web of Life]
- R. A. Fisher, The Genetical Theory of Natural
Selection
- Laura F. Galloway and Julie R. Etterson, "Transgenerational
Plasticity Is Adaptive in the Wild", Science 318
(2007): 1134--1136
- John Gerhart and Marc Kirschner, Cells, Embryos and
Evolution: Toward a Cellular and Developmental Understanding of Phenotypic
Variation and Evolutionary Adaptability
- John H. Gillespie, Population Genetics: A Concise
Guide [Review: Darwin's
Equations (II)]
- Peter Godfrey-Smith, Darwinian Populations and
Natural Selection
- J. B. S. Haldane, The Causes of Evolution [Review: Darwin's Equations]
- Paul H. Harvey and Mark D. Pagel, The Comparative Method in
Evolutionary Biology [Review: On the
Uses of Knowing Where Birds and Bees Come From]
- Josef Hofbauer and Karl Sigmund, The Theory of Evolution and
Dynamical Systems: Mathematical Aspects of Selection [Mathematically
explicit, i.e., full frontal differentiable manifolds]
- John Holland, Adaptation in Natural and Artificial Systems
- David Hull, Science and Selection: Essays on Biological
Evolution and Philosophy of Science
- T. H. Huxley, The
Huxley File [Nearly complete works, ed. and put on-line by Charles
Blinderman and David Joyce]
- Philip Kitcher, In Mendel's Mirror: Philosophical Reflections on Biology
- R. C. Lewontin, The Genetic Basis of Evolutionary
Change [An excellent book; but The
Dialectical Biologist may be safely ignored]
- John Maynard Smith, Evolution and the Theory of Games [Review
by Danny Yee]
- Mark E. J. Newman
- "Simple Models of Evolution and Extinction," adap-org/9910003
- MEJN and Gunther J. Eble, "Decline in Extinction Rates and
Self-Similarity in the Fossil Record," Paleobiology
25 (1999): 434--439 = adap-org/9809004
- MEJN and Richard G. Palmer, "Models of Extinction: A
Review," adap-org/9908002
- Martin Nilsson and Nigel Snoad, "Error Thresholds for Quasispecies
on Dynamic Fitness Landscapes," Physical Review Letters
84 (2000): 191--194
- Mark Pagel, "Inferring the Historical Patterns of Biological
Evolution," Nature 401 (1999): 877--884
[Warning: This paper makes the staggering mistake of confusing likelihood
(probability of a model's generating results like the data) with posterior
probability (probability of the model, given the data). Pagel definitely
doesn't make this elementary blunder elsewhere; I dunno what happened here.]
- David C. Queller, "The Spaniels of St. Marx and the Panglossian
Paradox: A Critique of a Rhetorical Programme," Quarterly Review of
Biology 70 (1995): 485--489
- Karl Sigmund, The Calculus of Selfishness
[Review: Honor Among
Thieves]
- Olivier Rivoire and Stanislas Leibler, "The Value of Information
for Populations in Varying Environments", arxiv:1010.5092
- Eric Smith and Supriya Krishnamurthy, "Symmetry and Collective Fluctuations in Evolutionary Games", SFI Working Paper 11-03-010
- Erik van Nimwegen, The Statistical Dynamics of Epochal
Evolution [Ph.D. thesis, University of Utrecht,
1999; on-line]
- George C. Willams, Adaptation and Natural Selection; a
Critique of Some Current Evolutionary Thought [Well, current in 1966,
anyway; that it is no longer current is due largerly to this book!]
- Guoping Zhu, G. Brian Golding and Antony M. Dean, "The Selective
Cause of an Ancient Adaptation", Science 307
(2005): 1279--1282 [Where "ancient" means "about 3.5 billion years old".]
To read:
- Stephen G. Alter, Darwinism and the Linguistic Image:
Language, Race, and Natural Theology in the Nineteenth Century
[Blurb]
- P. Ao, "Mathematical Structure of Evolutionary Theory", q-bio.QM/0403020 ["Here we
postulate three laws which form a mathematical framework to capture the essence
of Darwinian evolutionary dynamics. The second law is most quantitative and is
explicitly expressed by a unique form of stochastic differential equation."
Color me skeptical, but I haven't read beyond the abstract.]
- Wallace Arthur, Creatures of Accident: The Rise of the
Animal Kingdom
- John C. Avise, Evolutionary Pathways in Nature: A Phylogenetic
Approach [blurb]
- Ellen Baake and Wilfried Gabriel, "Biological Evolution through
Mutation, Selection, and Drift: An Introductory Review," cond-mat/9907372
- Ellen Baake, Hans-Otto Georgii, "Mutation, selection, and ancestry
in branching models: a variational
approach", q-bio.PE/0611018
- Richard K. Belew and Melanie Mitchell (eds.), Adaptive
Individuals in Evolving Populations: Models and Algorithms
- J. B. Beltman and P. Haccou, "Speciation through the learning of
habitat features", Theoretical Population
Biology 67 (2005): 189--202
- Ioana Bena, Michel Droz and Andrzej Pekalski, "Complex population
dynamics as a competition between multiple time-scale
phenomena", q-bio.PE/0703033
- Carl T. Bergstrom and Rustom Antia, "How do adaptive immune
systems control pathogens while avoiding
autoimmunity?", Trends in Ecology and
Evolution 21 (2006): 22--28
[PDF
reprint via Carl]
- C. T. Bergstrom and M. Rosvall, "The transmission sense of information", arxiv:0810.4168
- Carl Boettiger, Jonathan Dushoff and Joshua S. Weitz,
"Fluctuation Domains in Adaptive Evolution", Theoretical
Population Biology 77 (2010): 6--13, arxiv:1004.4233
- Peter Bowler, Evolution: The History of an Idea
- R. A. Blythe and A. J. McKane, "Stochastic Models of Evolution in
Genetics, Ecology and
Linguistics", cond-mat/0703478
- Daniel R. Brooks and Deborah A. McLennan, The Nature of
Diversity: An Evolutionary Voyage of Discovery [Blurb]
- Sean B. Carroll, From DNA to Diversity: Molecular Genetics
and the Evolution of Animal Design
- Nicolas Champagnat, Régis Ferrière, Sylvie
Méléar, "Individual-based probabilistic models of adaptive
evolution and various scaling
approximations", math.PR/0510453
- Nicolas Champagnat and Amaury Lambert, "Evolution of discrete
populations and the canonical diffusion of adaptive
dynamics", math.PR/0601643
- Kim Christensen, Simone A. di Collobiano, Matt Hall, and Henrik
J. Jensen, "Tangled Nature: a model of evolutionary ecology," cond-mat/0104116
- Freddy Bugge Christiansen, Theories of Population Variation in
Genes and Genomes [blurb, intro]
- Jens Christian Claussen and Arne Traulsen, "Non-Gaussian
fluctuations arising from finite populations: Exact results for the
evolutionary Moran
process", Physical
Review E 71 (2005): 025101
- Charles C. Cockell, Impossible Extinction: Natural
Catastrophes and the Supremacy of the Microbial World
[blurb]
- Elisheva Cohen, David A. Kessler and Herbert Levine, "Recombination
dramatically speeds up evolution of finite populations", q-bio.PE/0410015
- Claude Combes, The Art of Being a Parasite
[Review by Danny Yee]
- Vincent Courtillot, Evolutionary Catastrophes: The Science of
Mass Extinction
[blurb]
- Depew and Weber, Darwinism Evolving: Systems Dynamics and the
Genealogy of Evolution [Beware of genealogies which aren't actually
about families. Review
by John Maynard Smith.]
- Fabio Dercole and Sergio Rinaldi, Analysis of Evolutionary
Processes: The Adaptive Dynamics Approach and Its Applications
[Blurb, ch. 1]
- Adrian Desmond, Huxley: From Devil's Disciple to Evolution's
High Priest
- Theodosius Dobzhansky
- Mankind Evolving: The Evolution of the Human
Species
- Genetics and the Origin of Species
- Nikolay V. Dokholyan and Eugene I. Shakhnovich, "Understanding
hierarchical protein evolution from first principles," cond-mat/0104469
- Peter Donnelly, Stephen Leslie, "The coalescent and its descendants", arxiv:1006.1514
- Rick Durrett and Jason Schweinsberg, "A coalescent model for the
effect of advantageous mutations on the genealogy of a population", math.PR/0411071
- Gunther Eble
- Manfred Eigen, The Hypercycle, a Principle of Natural Self-organization
- Bryan K. Epperson, Geographical Genetics
- A. Eriksson, B. Haubold, and B. Mehlig, "Statistics of
selectively neutral genetic variation," physics/0111205
- David E. Fastovsky and David B. Weishampel, The Evolution and
Extinction of the Dinosaurs
[blurb]
- A. Feigl, "Essential conditions for evolution of communication
within a
species", Journal
of Theoretical Biology
254 (2008): 768--778, arxiv:0803.0412
- Luca Ferraro, Andrea Giansanti, Giovanni Giuliano and Vittorio
Rosato, "Co-expression of statistically over-represented peptides in proteomes:
a key to phylogeny?", q-bio.MN/0410011
- Walter Fontana and Peter Schuster, "Continuity in Evolution: On the
Nature of Transitions," SFI
Working Paper 98-04-030
- Barbara Forrest and Paul R. Gross, Creationism's Trojan
Horse: The Wedge of Intelligent Design
- Steven A. Frank, "Wright's adaptive landscape versus Fisher's fundamental theorem", arxiv:1102.3709
- Sergey Gavrilets, Fitness Landscapes and the Origin of
Species
- Diane P. Genereuz and Carl T. Bergstrom, "Evolution in Action:
Understanding antibiotic resistance" [forthcoming book
chapter; PDF]
- Ulrich Gerland and Terence Hwa, "On the Selection and Evolution of
Regulatory DNA Motifs," physics/0112039
- M. T. Ghiselin, The Triumph of the Darwinian Method
- John H. Gillespie, The Causes of Molecular Evolution
- Thomas J. Givnish and Kenneth J. Sytsma (eds.), Molecular
Evolution and Adaptive Radiation
- Stephen Jay Gould
- Full House
- Ontogeny and Phylogeny
- The Structure of Evolutionary Theory
- B. Rosemary Grant and Peter R. Grant, Evolutionary Dynamics
of a Natural Population: The Large Cactus Finch of the Galapagos
- Dan Graur and William Martin, "Reading the entrails of chickens:
molecular timescales of evolution and the illusion of precision",
Trends in Genetics 20 (2004): 80--86
[PDF
reprint]
- Joachim Hermisson, Oliver Redner, Holger Wagner and Ellen Baake,
"Mutation-Selection Balance: Ancestry, Load, and Maximum Principle," cond-mat/0202432
- Hofbauer and Sigmund, Evolutionary Games and Replicator
Dynamics
- G. Evelyn Hutchinson, The Ecological Theater and the
Evolutionary Play
- Eva Jablonka and Marion J. Lamb, Evolution in Four
Dimensions: Genetic, Epigenetic, Behavioral, and Symbolic Variation in the
History of Life [Blurb]
- Kavita Jain and Joachim Krug, "Evolutionary trajectories in rugged
fitness landscapes", q-bio.PE/0501028 = Journal of
Statistical Mechanics: Theory and Experiment (2005): P04008
- P. D. Jarvis, J. D. Bashford and J. G. Sumner, "Path integral
formulation of phylogenetic branching processes", q-bio.PE/0411047
- Marc Kirschner and John Gerhart, The Plausibility of
Life
[Capsule review
by Frenando Pereira]
- Laura Landweber and Erik Winfree (ed.), Evolution as
Computation [The dual subject to evolutionary
computation, as were]
- Daniel J. Lawson and Henrik Jeldtoft Jensen, "Evolution as
Diffusion in type
space", q-bio.PE/0609009
- Stanislas Leibler and Edo Kussell, "Individual histories and
selection in heterogeneous
populations", Proceedings
of the National Academy of Sciences (USA) 107 (2010):
13183--13188
- George Levine, Darwin Loves You: Natural Selection and the
Re-Enchantment of the World
- Konrad Lorenz, Evolution and Modification of Behavior
- Elisabeth Lloyd, The Structure and Confirmation of
Evolutionary Theory
- Long, Rise of the Fishes
- David J. C. MacKay, "Rate of Information Acquisition by a Species
subjected to Natural Selection" [link]
- Joshua S. Madin, John Alroy, Martin Aberhan, Franz T. Fürsich,
Wolfgang Kiessling, Matthew A. Kosnik, Peter J. Wagner, "Statistical
Independence of Escalatory Ecological Trends in Phanerozoic Marine
Invertebrates", Science 312
(2006): 897--900
- Marc Mangel, The Theoretical Biologist's Toolbox:
Quantitative Methods for Ecology and Evolutionary Biology
[blurb]
- Mohan Matthen, "Drift and 'Statistically Abstractive
Explanation'", Philosophy of Science
76 (2009): 464--487
- John Maynard Smith, Evolutionary Genetics [Review by Danny
Yee]
- Ernst Mayr
- Growth of Biological Thought
- Populations, Species and Evolution
- This Is Biology
- What Evolution Is
- Margaret McFall-Ngai, "Adaptive Immunity: Care for the community",
Nature 445
(2007): 153
- Géza Meszéna, Mats Gyllenberg, Frans J. Jacobs and
Johan A. J. Metz, "Link between Population Dynamics and Dynamics of Darwinian
Evolution",
Physical Review
Letters 95 (2005): 078105
- Richard E. Michod, Darwinian Dynamics: Evolutionary
Transitions in Fitness and Individuality
- Jacek Miekisz, "Evolutionary game theory and population dynamics",
q-bio.PE/0703062
[51 pp. lecture notes]
- Stephen P. Miller, Mark Lunzer, and Antony M. Dean, "Direct
Demonstration of an Adaptive Constraint", Science 314 (2006):
458--461
- Mitton, Sele