Skip to main content

Research Repository

Advanced Search

Using pedagogical and psychological insights to train analysts using confidential data (2023)
Journal Article
Green, E., & Ritchie, F. (in press). Using pedagogical and psychological insights to train analysts using confidential data. Journal of Privacy and Confidentiality,

With researchers increasingly gaining access to confidentiality data through restricted environments, interest has grown in the training of those researchers to protect confidentiality and to use the secure facility effectively. Researcher training,... Read More about Using pedagogical and psychological insights to train analysts using confidential data.

The just-about-right pilot sample size to control the error margin (2023)
Journal Article
Obodo, S., Toher, D., & White, P. (in press). The just-about-right pilot sample size to control the error margin. International Journal of Statistics and Probability, 12(No. 3, May 2023),

In practice, the required sample size for a two-arm randomised controlled trial cannot always be determined pre-study with great accuracy. This lack of accuracy has economic, ethical and scientific implications. The sample size for a pilot study is... Read More about The just-about-right pilot sample size to control the error margin.

The nonequilibrium potential today: A short review (2022)
Journal Article
Wio, H. S., Deza, J. I., Sánchez, A. D., García-García, R., Gallego, R., Revelli, J. A., & Deza, R. R. (2022). The nonequilibrium potential today: A short review. Chaos, Solitons and Fractals, 165(Part 1), 112778. https://doi.org/10.1016/j.chaos.2022.112778

A brief review is made of the birth and evolution of the “nonequilibrium potential” (NEP) concept. As if providing a landscape for qualitative reasoning were not helpful enough, the NEP adds a quantitative dimension to the qualitative theory of diffe... Read More about The nonequilibrium potential today: A short review.

Risk of disclosure when reporting commonly used univariate statistics (2022)
Conference Proceeding
Derrick, B., Green, E., Ritchie, F., & White, P. (2022). Risk of disclosure when reporting commonly used univariate statistics. In Lecture Notes in Computer Science (119-129). https://doi.org/10.1007/978-3-031-13945-1_9

When basic or descriptive summary statistics are reported, it may be possible that the entire sample of observations is inadvertently disclosed, or that members within a sample will be able to work out responses of others. Three sets of univariate su... Read More about Risk of disclosure when reporting commonly used univariate statistics.

10 is the safest number that there's ever been (2022)
Journal Article
Ritchie, F. (2022). 10 is the safest number that there's ever been. Transactions on data privacy, 15(2), 109-140

When checking frequency and magnitude tables for disclosure risk, the cell threshold (the minimum number of observations in each cell) is a crucial parameter. In rules-based environments, this is a hard limit on what can or can't be published. In pri... Read More about 10 is the safest number that there's ever been.

Disclosure risks in odds ratios and logistic regression (2022)
Presentation / Conference
Derrick, B., Green, E., Ritchie, F., & White, P. (2022, April). Disclosure risks in odds ratios and logistic regression. Paper presented at Scottish Economic Society Annual Conference 2022: Special session 'Protecting confidentiality in social science research outputs', Glasgow

When publishing statistics from confidential data, there exists a risk that the statistic might inadvertently reveal confidential information. Statistical disclosure control (SDC) aims to reduce that risk to an acceptable level. Most SDC theory is co... Read More about Disclosure risks in odds ratios and logistic regression.

Statistical disclosure control for HESA: Part 1: Review of SDC theory (2021)
Report
Green, E., & Ritchie, F. (2021). Statistical disclosure control for HESA: Part 1: Review of SDC theory. Higher Education Statistics Agency (HESA)

This report for the Higher Education Statistics Agency (HESA) is a summary of statistical disclosure control (SDC) methods for tabular outputs.

Estimation of the two-group pilot sample size with a cautionary note on Browne’s formula (2021)
Journal Article
Obodo, S., Toher, D., & White, P. (2021). Estimation of the two-group pilot sample size with a cautionary note on Browne’s formula. Journal of Applied Quantitative Methods, 16(3),

Using data obtained from a pilot study, Browne (1995) proposed a procedure for estimating the sample size needed for a definitive two-arm randomised controlled trial when the minimal important difference is specified. Simulations confirm these findi... Read More about Estimation of the two-group pilot sample size with a cautionary note on Browne’s formula.

Ordinal Logistic Regression as an alternative analysis strategy for the comparison of two independent samples (2021)
Journal Article
Bilski, B., Derrick, B., Toher, D., & White, P. (2021). Ordinal Logistic Regression as an alternative analysis strategy for the comparison of two independent samples. Journal of Applied Quantitative Methods, 16(3),

The two group between subjects design is pervasive with analyses often performed using the Mann Whitney Rank Sum test or using the Welch variant of the t-test. Using simulation it is shown that a dummy variable ordinal logistic regression (OLR) mode... Read More about Ordinal Logistic Regression as an alternative analysis strategy for the comparison of two independent samples.