Dr Richard Preen Richard2.Preen@uwe.ac.uk
Senior Research Fellow in Machine Learning
A multi-language toolkit for the semi-automated checking of research outputs
Preen, Richard J.; Albashir, Maha; Davy, Simon; Smith, Jim
Authors
Maha Albashir
Simon Davy
Jim Smith James.Smith@uwe.ac.uk
Professor in Interactive Artificial Intelligence
Abstract
This article presents a free and open source toolkit that supports the semi-automated checking of research outputs (SACRO) for privacy disclosure within secure data environments. SACRO is a framework that applies best-practice principles-based statistical disclosure control (SDC) techniques on-the-fly as researchers conduct their analyses. SACRO is designed to assist human checkers rather than seeking to replace them as with current automated rules-based approaches. The toolkit is composed of a lightweight Python package that sits over well-known analysis tools that produce outputs such as tables, plots, and statistical models. This package adds functionality to (i) automatically identify potentially disclosive outputs against a range of commonly used disclosure tests; (ii) apply optional disclosure mitigation strategies as requested; (iii) report reasons for applying SDC; and (iv) produce simple summary documents trusted research environment staff can use to streamline their workflow and maintain auditable records. This creates an explicit change in the dynamics so that SDC is something done with researchers rather than to them, and enables more efficient communication with checkers. A graphical user interface supports human checkers by displaying the requested output and results of the checks in an immediately accessible format, highlighting identified issues, potential mitigation options, and tracking decisions made. The major analytical programming languages used by researchers (Python, R, and Stata) are supported by providing front-end packages that interface with the core Python back-end. Source code, packages, and documentation are available under MIT license at https://github.com/AI-SDC/ACRO.
Journal Article Type | Article |
---|---|
Acceptance Date | Apr 24, 2025 |
Online Publication Date | May 1, 2025 |
Publication Date | May 1, 2025 |
Deposit Date | Apr 25, 2025 |
Publicly Available Date | Apr 25, 2025 |
Journal | IEEE Transactions on Privacy |
Electronic ISSN | 2836-208X |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Peer Reviewed | Peer Reviewed |
Volume | 2 |
Pages | 55-66 |
DOI | https://doi.org/10.1109/tp.2025.3566052 |
Keywords | Data privacy, data protection, privacy, statistical disclosure control, statistical software |
Public URL | https://uwe-repository.worktribe.com/output/14326847 |
Publisher URL | https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=10167710 |
Files
A multi-language toolkit for the semi-automated checking of research outputs
(434 Kb)
PDF
A multi-language toolkit for the semi-automated checking of research outputs
(2.3 Mb)
PDF
Licence
http://creativecommons.org/licenses/by/4.0/
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0/
You might also like
SACRO: Semi-Automated Checking Of Research Outputs
(2023)
Presentation / Conference Contribution
Autoencoding with a classifier system
(2021)
Journal Article
Towards an evolvable cancer treatment simulator
(2019)
Journal Article
Evolutionary n-level hypergraph partitioning with adaptive coarsening
(2019)
Journal Article
Design mining microbial fuel cell cascades
(2018)
Journal Article
Downloadable Citations
About UWE Bristol Research Repository
Administrator e-mail: repository@uwe.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search