Notebooks

Analysis of Network Data

04 Apr 2012 13:57

That is, of data on the form of networks --- I don't (as such) care about packet flow or other aspects of computer networks...

Things I wish I knew how to do: bootstrap a network, non-parametrically. (The model with a fixed degree sequence is a start, but what's the equivalent of the block bootstraps used for time series, which preserve dependence?) Cross-validation on networks. (You could say that link prediction is leave-one-out CV, but how about k-fold CV?) Estimate a distribution over networks by somehow smoothing an adjacency matrix. — These may or may not be three aspects of a single problem.

Community discovery is an important sub-topic, and I like exponential family random graph models and graph limits enough to give them their own notebooks.

Although many of the relevant papers appear in the journal Social Networks, published by Elsevier, the company responsible for deliberately publishing pseudo-journals such as The Australasian Journal of Bone and Joint Medicine, I know of no particular reason to believe that their findings are problematic. It would, however, be good if the community could shift to a journal whose publishers do not subvert the peer-review process whenever they find it profitable to do so.

See also: Complex networks; Community discovery; Exponential families of random graph models; Homophily vs. influence; Relational learning Social networks; Statistics in general; Statistics of structured data;


Notebooks:     Hosted, but not endorsed, by the Center for the Study of Complex Systems