Articles

02/05/2025-- 04/25/2024

On Neighbourhood Cross Validation

Many varieties of cross validation would be statistically appealing for the estimation of smoothing and other penalized regression hyperparameters, were it not for the high cost of evaluating such criteria. Here it is shown how to efficiently and accurately compute and optimize a broad variety of cross validation criteria for a wide range of models estimated by minimizing a quadratically penalized loss. The leading order computational cost of hyperparameter estimation is made comparable to the cost of a single model fit given hyperparameters. In many cases this represents an $O(n)$ computational saving when modelling $n$ data. This development makes if feasible, for the first time, to use leave-out-neighbourhood cross validation to deal with the wide spread problem of un-modelled short range autocorrelation which otherwise leads to underestimation of smoothing parameters. It is also shown how to accurately quantifying uncertainty in this case, despite the un-modelled autocorrelation. Practical examples are provided including smooth quantile regression, generalized additive models for location scale and shape, and focussing particularly on dealing with un-modelled autocorrelation.
Simon N. Wood
02/05/2025-- 09/10/2024

Some statistical aspects of the Covid-19 response

This paper discusses some statistical aspects of the U.K. Covid-19 pandemic response, focussing particularly on cases where we believe that a statistically questionable approach or presentation has had a substantial impact on public perception, or government policy, or both. We discuss the presentation of statistics relating to Covid risk, and the risk of the response measures, arguing that biases tended to operate in opposite directions, overplaying Covid risk and underplaying the response risks. We also discuss some issues around presentation of life loss data, excess deaths and the use of case data. The consequences of neglect of most individual variability from epidemic models, alongside the consequences of some other statistically important omissions are also covered. Finally the evidence for full stay at home lockdowns having been necessary to reverse waves of infection is examined, with new analyses provided for a number of European countries.
Simon N. Wood Ernst C. Wit Paul M. McKeigue Danshu Hu Beth Flood Lauren Corcoran Thea Abou Jawad
03/26/2024-- 08/12/2022

Einstein gravity with generalized cosmological term from five-dimensional AdS-Maxwell-Chern-Simons gravity

Some time ago, the standard geometric framework of Einstein gravity was extended by gauging the Maxwell algebra as well as the so called AdS-Maxwell algebra. In this letter it is shown that the actions for these four-dimensional extended Einstein gravities can be obtained from the five-dimensional Chern-Simons gravities actions by using the Randall-Sundrum compactification procedure. It is found that the In\"on\"u-Wigner contraction procedure, in the Weimar-Woods sense, can be used both to obtain the Maxwell-Chern-Simons action from the AdS-Maxwell-Chern-Simons action and to obtain the Maxwell extension of Einstein gravity in 4D from the four-dimensional extended AdS-Maxwell-Einstein-Hilbert action. It is also shown that the extended four-dimensional gravities belongs to the Horndeski family of scalar-tensor theories.
L. Avilés J. Diaz D. M. Penafiel V. C. Orozco P. Salgado
06/27/2024-- 06/27/2024

Gratia: An R package for exploring generalized additive models

Generalized additive models (GAMs, Hastie & Tibshirani, 1990; Wood, 2017) are an extension of the generalized linear model that allows the effects of covariates to be modelled as smooth functions. GAMs are increasingly used in many areas of science (e.g. Pedersen, Miller, Simpson, & Ross, 2019; Simpson, 2018) because the smooth functions allow nonlinear relationships between covariates and the response to be learned from the data through the use of penalized splines. Within the R (R Core Team, 2024) ecosystem, Simon Wood's mgcv package (Wood, 2017) is widely used to fit GAMs and is a Recommended package that ships with R as part of the default install. A growing number of other R packages build upon mgcv, for example as an engine to fit specialised models not handled by mgcv itself (e.g. GJMR, Marra & Radice, 2023), or to make use of the wide range of splines available in mgcv (e.g. brms, B\"urkner, 2017). The gratia package builds upon mgcv by providing functions that make working with GAMs easier. gratia takes a tidy approach (Wickham, 2014) providing ggplot2 (Wickham, 2016) replacements for mgcv's base graphics-based plots, functions for model diagnostics and exploration of fitted models, and a family of functions for drawing samples from the posterior distribution of a fitted GAM. Additional functionality is provided to facilitate the teaching and understanding of GAMs.
Gavin L. Simpson
04/30/2015-- 04/30/2015

PROPs for Linear Systems

A PROP is a symmetric monoidal category whose objects are the nonnegative integers and whose tensor product on objects is addition. A morphism from $m$ to $n$ in a PROP can be visualized as a string diagram with $m$ input wires and $n$ output wires. For a field $k$, the PROP $\mathrm{FinVect}_k$ where morphisms are $k$-linear maps is used by Baez and Erbele to study signal-flow diagrams. We aim to generalize their result characterizing this PROP in terms of generators and relations by looking at the PROP $\mathrm{Mat}(R)$ of matrices of values in $R$, where $R$ is a commutative rig (that is, a generalization of a ring where the condition that each element has an additive inverse is relaxed). To this end, we show that the category of symmetric monoidal functors out of $\mathrm{Mat}(R)$ is equivalent to the category of bicommutative bimonoids equipped with a certain map of rigs; such functors are called algebras. By choosing $R$ correctly, we will see that the algebras of the PROP $\mathrm{FinSpan}$ of finite sets and spans between them are bicommutative bimonoids, while the algebras of the PROP $\mathrm{FinRel}$ of finite sets and relations between them are special bicommuative bimonoids and the algebras of $\mathrm{Mat}(\mathbb Z)$ are bicommutative Hopf monoids.
Simon Wadsley Nick Woods
01/08/2016-- 01/08/2016

Autocorrelated errors in experimental data in the language sciences: Some solutions offered by Generalized Additive Mixed Models

A problem that tends to be ignored in the statistical analysis of experimental data in the language sciences is that responses often constitute time series, which raises the problem of autocorrelated errors. If the errors indeed show autocorrelational structure, evaluation of the significance of predictors in the model becomes problematic due to potential anti-conservatism of p-values. This paper illustrates two tools offered by Generalized Additive Mixed Models (GAMMs) (Lin and Zhang, 1999; Wood, 2006, 2011, 2013) for dealing with autocorrelated errors, as implemented in the current version of the fourth author's mgcv package (1.8.9): the possibility to specify an ar(1) error model for Gaussian models, and the possibility of using factor smooths for random-effect factors such as subject and item. These factor smooths are set up to have the same smoothing parameters, and are penalized to yield the non-linear equivalent of random intercepts and random slopes in the classical linear framework. Three case studies illustrate these issues.
R. Harald Baayen Jacolien van Rij Cecile de Cat Simon N. Wood
07/07/2020-- 07/07/2020

qgam: Bayesian non-parametric quantile regression modelling in R

Generalized additive models (GAMs) are flexible non-linear regression models, which can be fitted efficiently using the approximate Bayesian methods provided by the mgcv R package. While the GAM methods provided by mgcv are based on the assumption that the response distribution is modelled parametrically, here we discuss more flexible methods that do not entail any parametric assumption. In particular, this article introduces the qgam package, which is an extension of mgcv providing fast calibrated Bayesian methods for fitting quantile GAMs (QGAMs) in R. QGAMs are based on a smooth version of the pinball loss of Koenker (2005), rather than on a likelihood function, hence jointly achieving satisfactory accuracy of the quantile point estimates and coverage of the corresponding credible intervals requires adopting the specialized Bayesian fitting framework of Fasiolo, Wood, Zaffran, Nedellec, and Goude (2020b). Here we detail how this framework is implemented in qgam and we provide examples illustrating how the package should be used in practice.
Matteo Fasiolo Simon N. Wood Margaux Zaffran Raphaël Nedellec Yannig Goude
11/21/2005-- 10/21/2005

Discovery of a nearby M9 dwarf

We report the discovery of a new M9.0 dwarf at only 8.2 pc, which we identified in our search for nearby ultracool dwarf (I-J >= 3.0, later than M8.0) in the DENIS database. We measure a very high proper motion of 2.5 arc-sec/yr. The PC3 index measured from its low-resolution spectrum gives a spectrophotometric distance of 8.2 pc. This makes it the third closest M9.0 dwarf.
N. Phan-Bao M. S. Bessell E. L. Martin G. Simon J. Guibert T. Forveille X. Delfosse F. Crifo N. Epchtein P. Wood F. Tajahmady
11/01/2006-- 11/01/2006

Renewal Aging as Emerging Property of Phase Synchronization

In this letter we examine a model recently proposed to produce phase synchronization [K. Wood et al, Phys. Rev. Lett. 96, 145701 (2006)] and we show that the onset to synchronization corresponds to the emergence of an intermittent process that is non-Poisson and renewal at the same time. We argue that this makes the model appropriate for the physics of blinking quantum dots, and the dynamics of human brain as well.
Simone Bianco Elvis Geneston Paolo Grigolini Massimiliano Ignaccolo
12/18/2008-- 12/18/2008

Moduli Webs and Superpotentials for Five-Branes

We investigate the one-parameter Calabi-Yau models and identify families of D5-branes which are associated to lines embedded in these manifolds. The moduli spaces are given by sets of Riemann curves, which form a web whose intersection points are described by permutation branes. We arrive at a geometric interpretation for bulk-boundary correlators as holomorphic differentials on the moduli space and use this to compute effective open-closed superpotentials to all orders in the open string couplings. The fixed points of D5-brane moduli under bulk deformations are determined.
Marco Baumgartl Simon Wood


with thanks to arxiv.org/