Computing and statistical data analysis: course outline


Statistical data analysis, main topics (approx. 10 lectures):

Probability. Definition and interpretation, Bayes' theorem, random variables, probability density functions, expectation values, transformation of variables, error propagation.
Examples of probability functions. Binomial, multinomial, Poisson, uniform, exponential, Gaussian, chi-square, Cauchy distributions.
The Monte Carlo method. Random number generators, the transformation method, the acceptance-rejection method.
Statistical tests. Significance and power of a test, choice of the critical region. Constructing test statistics: the Fisher discriminant, neural networks. Testing goodness-of-fit, chi^2-test, P-values.
Parameter estimation: general concepts. Samples, estimators, bias. Estimators for mean, variance, covariance.
The method of maximum likelihood. The likelihood function, ML estimators for parameters of Gaussian and exponential distributions. Variance of ML estimators, the information inequality, extended ML, ML with binned data.
The method of least squares. Relation to maximum likelihood, linear least squares fit, LS with binned data, testing goodness-of-fit, combining measurements with least squares.
Interval estimation. Classical confidence intervals: with Gaussian distributed estimator, for mean of Poisson variable. Setting limits, limits near a physical boundary.
Unfolding. Formulation of the problem: response function and matrix. Inversion of the response matrix, bin-by-bin correction factors. Regularized unfolding: regularization functions, bias and variance of estimators, choice of regularization parameter.

Computing:

Getting started: Operating systems, networks, windowing environments, introduction to PAW, introduction to programming, compiling and linking programs, document preparation.

Glen Cowan