alan-turing-institute
diff --git a/‎.gitignore‎
Lines changed: 3 additions & 1 deletion b/‎.gitignore‎
Lines changed: 3 additions & 1 deletion
diff --git a/‎README.md‎
Lines changed: 8 additions & 0 deletions b/‎README.md‎
Lines changed: 8 additions & 0 deletions
diff --git a/‎docs/DimensionReduction.rst‎
Lines changed: 33 additions & 0 deletions b/‎docs/DimensionReduction.rst‎
Lines changed: 33 additions & 0 deletions
diff --git a/‎docs/HistoryMatching.rst‎
Lines changed: 13 additions & 0 deletions b/‎docs/HistoryMatching.rst‎
Lines changed: 13 additions & 0 deletions
diff --git a/‎docs/MCMC.rst‎
Lines changed: 8 additions & 0 deletions b/‎docs/MCMC.rst‎
Lines changed: 8 additions & 0 deletions
diff --git a/‎docs/benchmarks/benchmarks.rst‎
Lines changed: 1 addition & 0 deletions b/‎docs/benchmarks/benchmarks.rst‎
Lines changed: 1 addition & 0 deletions
diff --git a/‎docs/benchmarks/mcmc_benchmark.rst‎
Lines changed: 7 additions & 0 deletions b/‎docs/benchmarks/mcmc_benchmark.rst‎
Lines changed: 7 additions & 0 deletions
diff --git a/‎docs/conf.py‎
Lines changed: 1 addition & 1 deletion b/‎docs/conf.py‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎docs/index.rst‎
Lines changed: 9 additions & 0 deletions b/‎docs/index.rst‎
Lines changed: 9 additions & 0 deletions
diff --git a/‎docs/methods/alt/AltBLPriors.rst‎
Lines changed: 142 additions & 0 deletions b/‎docs/methods/alt/AltBLPriors.rst‎
Lines changed: 142 additions & 0 deletions
@@ -1,4 +1,6 @@
 *.pyc
 *.DS_Store
-*.png
+docs/_build/*
+mogp_emulator/tests/*.png
+mogp_emulator/tests/*.pdf
 mogp_emulator/version.py
@@ -128,13 +128,21 @@ using 1, 2, 4, and 8 processess and notes the time required to perform the fitti
 will depend on the number of cores on the computer -- once you exceed the number of cores, the performance
 will degrade. As with the other benchmarks, Matplotlib can optionally be used to plot the results.
 
+##### MCMC Benchmark
+
+A benchmark applying the software to fitting an emulator with MCMC sampling is included. The code
+draws hyperparameter samples and compares the resulting posterior distributions with the values
+found via maximum likelihood estimation. If Matplotlib is installed, a histogram of the parameter
+samples is shown.
+
 ##### MICE Benchmark
 
 A benchmark comparing the MICE Sequential design method to Latin Hypercube sampling is also available.
 This creates designs of a variety of sizes and computes the error on unseen data for the 2D Branin
 function. It compares the accuracy of the sequential design to the Latin Hypercube for both the
 predictions and uncertainties.
 
+
 ### Documentation
 
 Building the documentation requires Sphinx/autodoc, which can be installed using `pip`. To build the documentatation, first install Sphinx and change to the `docs` directory. There is a Makefile in the
 
@@ -0,0 +1,33 @@
+.. _DimensionReduction:
+
+*********************************
+The ``DimensionReduction`` module
+*********************************
+
+.. automodule:: mogp_emulator.DimensionReduction
+
+
+---------------------------
+Dimension Reduction Classes
+---------------------------
+
+.. autoclass:: mogp_emulator.gKDR
+
+    .. automethod:: __init__
+    .. automethod:: __call__
+    .. automethod:: tune_parameters
+
+---------
+Utilities
+---------
+		    
+.. automethod:: mogp_emulator.DimensionReduction.gram_matrix
+
+.. automethod:: mogp_emulator.DimensionReduction.gram_matrix_sqexp
+
+.. automethod:: mogp_emulator.DimensionReduction.median_dist
+       
+.. rubric:: References
+.. [LG17] Liu, Xiaoyu, and Serge Guillas. "Dimension reduction for Gaussian process emulation: An application to the influence of bathymetry on tsunami heights." SIAM/ASA Journal on Uncertainty Quantification 5.1 (2017): 787-812. https://epubs.siam.org/doi/10.1137/16M1090648
+.. [Fukumizu1] https://www.ism.ac.jp/~fukumizu/software.html
+.. [FL13] Fukumizu, Kenji and Chenlei Leng. "Gradient-based kernel dimension reduction for regression." Journal of the American Statistical Association 109, no. 505 (2014): 359-370
@@ -0,0 +1,13 @@
+.. _HistoryMatching:
+
+**********************************
+The ``HistoryMatching`` Class
+**********************************
+
+.. automodule:: mogp_emulator.HistoryMatching
+    :noindex:
+
+.. autoclass:: mogp_emulator.HistoryMatching
+    :members:
+
+    .. automethod:: __init__
@@ -0,0 +1,8 @@
+.. _MCMC:
+
+**********************************
+The ``MCMC`` Module
+**********************************
+
+.. automodule:: mogp_emulator.MCMC
+    :members:
@@ -8,3 +8,4 @@ GP Emulator Benchmarks
    rosenbrock
    branin
    tsunami
+   mcmc_benchmark
@@ -0,0 +1,7 @@
+.. _mcmc_benchmark:
+
+**********************************
+MCMC Benchmark
+**********************************
+
+.. automodule:: mogp_emulator.tests.benchmark_MCMC
@@ -26,7 +26,7 @@
 # get version from package
 import mogp_emulator
 import re
-# The full version X.Y.Z
+# The full version X.Y.Z with development version if needed
 release = mogp_emulator.__version__
 # The short verion X.Y
 version = re.sub(r"(\d+\.\d+)", r"\1", mogp_emulator.__version__)
 
@@ -11,12 +11,21 @@ Welcome to Multi-Output GP Emulator's documentation!
    :caption: Contents:
 
    GaussianProcess
+   DimensionReduction
    MultiOutputGP
    Kernel
    ExperimentalDesign
    SequentialDesign
+   HistoryMatching
+   MCMC
    benchmarks/benchmarks
 
+.. toctree::
+   :maxdepth: 1
+   :caption: Uncertainty Quantification Methods
+
+   methods/methods
+
 
 
 Indices and tables
 
@@ -0,0 +1,142 @@
+.. _AltBLPriors:
+
+Alternatives: Prior specification for BL hyperparameters
+========================================================
+
+Overview
+--------
+
+In the fully :ref:`Bayes linear<DefBayesLinear>` approach to
+emulating a complex :ref:`simulator<DefSimulator>`, the
+:ref:`emulator<DefEmulator>` is formulated to represent prior
+knowledge of the simulator in terms of a :ref:`second-order belief
+specification<DefSecondOrderSpec>`. The BL prior specification
+requires the specification of beliefs about some
+:ref:`hyperparameters<DefHyperparameter>`, as discussed in the
+alternatives page on emulator prior mean function
+(:ref:`AltMeanFunction<AltMeanFunction>`), the discussion page on the
+GP covariance function
+(:ref:`DiscCovarianceFunction<DiscCovarianceFunction>`) and the
+alternatives page on emulator prior correlation function
+(:ref:`AltCorrelationFunction<AltCorrelationFunction>`).
+Specifically, in the :ref:`core problem<DiscCore>` that is the
+subject of the core threads (:ref:`ThreadCoreBL<ThreadCoreBL>`,
+:ref:`ThreadCoreGP<ThreadCoreGP>`) a vector :math:`\beta` defines the
+detailed form of the mean function, a scalar :math:`\sigma^2` quantifies
+the uncertainty or variability of the simulator around the prior mean
+function, while :math:`\delta` is a vector of hyperparameters defining
+details of the correlation function. Threads that deal with variations
+on the basic core problem may introduce further hyperparameters.
+
+A Bayes linear analysis requires hyperparameters to be given prior
+expectations, variances and covariances. We consider here ways to
+specify these prior beliefs for the hyperparameters of the core problem.
+Prior specifications for other hyperparameters are addressed in the
+relevant variant thread. Hyperparameters may be handled differently in
+the fully :ref:`Bayesian<DefBayesian>` approach - see
+:ref:`ThreadCoreGP<ThreadCoreGP>`.
+
+Choosing the Alternatives
+-------------------------
+
+The prior beliefs should be chosen to represent whatever prior knowledge
+the analyst has about the hyperparameters. However, the prior
+distributions will be updated with the information from a set of
+training runs, and if there is substantial information in the training
+data about one or more of the hyperparameters then the prior information
+about those hyperparameters may be irrelevant.
+
+In general, a Bayes linear specification requires statements of
+second-order beliefs for all uncertain quantities. In the current
+version of this Toolkit, the Bayes linear emulation approach does not
+consider the situation where :math:`\sigma^2` and :math:`\delta` are
+uncertain, and so we require the following:
+
+-  :math:`\text{E}[\beta_i]`, :math:`\text{Var}[\beta_i]`,
+   :math:`\text{Cov}[\beta_i,\beta_j]` - expectations, variances and
+   covariances for each coefficient :math:`\beta_i`, and covariances
+   between every pair of coefficients :math:`(\beta_i,\beta_j), i\neq j`
+-  :math:`\sigma^2=\text{Var}[w(x)]` - the variance of the residual
+   stochastic process
+-  :math:`\delta` - a value for the hyperparameters of the correlation
+   function
+
+The Nature of the Alternatives
+------------------------------
+
+Priors for :math:`\beta`
+~~~~~~~~~~~~~~~~~~~~~~~~~
+
+Given a specified form for the basis functions :math:`h(x)` of :math:`m(x)` as
+described in the alternatives page on basis functions for the emulator
+mean (:ref:`AltBasisFunctions<AltBasisFunctions>`), we must specify
+expectation and variance for each coefficient :math:`\beta_i` and a
+covariance between every pair :math:`(\beta_i,\beta_j)`.
+
+As with the basis functions :math:`h(x)`, there are two primary means of
+obtaining a belief specification for :math:`\beta`.
+
+#. **Expert-led specification** - the specification can be made directly
+   by an expert using methods such as
+
+   a. Intuitive understanding of the magnitude and impact of the
+      physical effects represented by :math:`h(x)` leading to a direct
+      quantification of expectations, variances and covariances.
+   b. Assessing the difference between the model under study and another
+      well-understood model such as a fast approximate version or an
+      earlier version of the same simulator. In this approach, we can
+      combine the known information about the mean behaviour of the
+      second simulator with the belief statements about the differences
+      between the two simulator to construct an appropriate belief
+      specification for the hyperparameters -- see :ref:`multilevel
+      emulation<DefMultilevelEmulation>`.
+
+#. **Data-driven specification** - when prior beliefs are weak and we
+   have ample model evaluations, then prior values for :math:`\beta` are
+   typically not required and we can replace adjusted values for
+   :math:`\beta` with empirical estimates, :math:`\hat{\beta}`, obtained by
+   fitting the linear regression :math:`f(x)=h(x)^T\beta`. Our uncertainty
+   statements about :math:`\beta` can then be deduced from the "estimation
+   error" associated with :math:`\hat{\beta}`.
+
+Priors for :math:`\sigma^2`
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~
+
+The current version of the Toolkit requires a point value for the
+variance about the emulator mean, :math:`\sigma^2`. This corresponds
+directly to making a specification about :math:`\text{Var}[w(x)]`. As with
+the model coefficients above, there are two possible approaches to
+making such a quantification. An expert could make the specification by
+directly quantifying the magnitude of :math:`\sigma^2`. Alternatively, an
+expert assessment of the expected prior adequacy of the mean function at
+representing the variation in the simulator outputs can be combined with
+information on the variation of the simulator output, which allows for
+the deduction of a value of :math:`\sigma^2`. In the case of a data-driven
+assessment, the estimate for the residual variance :math:`\hat{\sigma}^2`
+can be used.
+
+In subsequent versions of the toolkit, Bayes linear methods will be
+developed for :ref:`learning<DefBLVarianceLearning>` about
+:math:`\sigma^2` in the emulation process. This will require making prior
+specifications about the squared emulator residuals.
+
+Priors for :math:`\delta`
+~~~~~~~~~~~~~~~~~~~~~~~~~~
+
+Specification of correlation function hyperparameters is a more
+challenging task. Direct elicitation can be difficult as the
+hyperparameter :math:`\delta` is hard to conceptualise - the alternatives
+page on prior distributions for GP hyperparameters
+(:ref:`AltGPPriors<AltGPPriors>`) provides some discussion on this
+topic, with particular application to the Gaussian correlation function.
+Alternatively, when given a large collection of simulator runs then
+:math:`\delta` can be crudely estimated using methods such as
+:ref:`variogram<ProcVariogram>` fitting on the empirical residuals.
+
+Assessing and updating uncertainties about :math:`\delta` raises both
+conceptual and technical problems as methods which would be optimal for
+assessing such parameters given realisations drawn from a corresponding
+stochastic process may prove to be highly non-robust when applied to
+functional computer output which is only represented very approximately
+by such a process. Methods for approaching this problem will appear in a
+subsequent version of the toolkit.