Muhammad Bilal Muhammad.Bilal@uwe.ac.uk
TSU Research Fellow Timesheet NOM
TEMSET-24K: Densely annotated dataset for indexing multipart endoscopic videos using surgical timeline segmentation
Bilal, Muhammad; Alam, Mahmood; Bapu, Deepashree; Korsgen, Stephan; Lal, Neeraj; Bach, Simon; Hajiyavand, Amir M.; Ali, Muhammed; Soomro, Kamran; Qasim, Iqbal; Capik, Paweł; Khan, Aslam; Khan, Zaheer; Vohra, Hunaid; Caputo, Massimo; Beggs, Andrew D.; Qayyum, Adnan; Qadir, Junaid; Ashraf, Shazad Q.
Authors
Mahmood Alam
Deepashree Bapu
Stephan Korsgen
Neeraj Lal
Simon Bach
Amir M. Hajiyavand
Muhammed Ali
Dr Kamran Soomro Kamran.Soomro@uwe.ac.uk
Associate Professor in Artificial Intelligence
Iqbal Qasim
Dr Pawel Capik Pawel.Capik@uwe.ac.uk
Director of Post Graduate Research
Aslam Khan
Zaheer Khan Zaheer2.Khan@uwe.ac.uk
Professor in Computer Science
Hunaid Vohra
Massimo Caputo
Andrew D. Beggs
Adnan Qayyum
Junaid Qadir
Shazad Q. Ashraf
Abstract
Indexing endoscopic surgical videos is vital in surgical data science, forming the basis for systematic retrospective analysis and clinical performance evaluation. Despite its significance, current video analytics rely on manual indexing, a time-consuming process. Advances in computer vision, particularly deep learning, offer automation potential, yet progress is limited by the lack of publicly available, densely annotated surgical datasets. To address this, we present TEMSET-24K, an open-source dataset comprising 24,306 trans-anal endoscopic microsurgery (TEMS) video microclips. Each clip is meticulously annotated by clinical experts using a novel hierarchical labeling taxonomy encompassing “phase, task, and action” triplets, capturing intricate surgical workflows. To validate this dataset, we benchmarked deep learning models, including transformer-based architectures. Our in silico evaluation demonstrates high accuracy (up to 0.99) and F1 scores (up to 0.99) for key phases like “Setup” and “Suturing.” The STALNet model, tested with ConvNeXt, ViT, and SWIN V2 encoders, consistently segmented well-represented phases. TEMSET-24K provides a critical benchmark, propelling state-of-the-art solutions in surgical data science.
Journal Article Type | Article |
---|---|
Acceptance Date | Jul 18, 2025 |
Online Publication Date | Aug 14, 2025 |
Publication Date | Aug 14, 2025 |
Deposit Date | Aug 18, 2025 |
Publicly Available Date | Aug 19, 2025 |
Journal | Scientific Data |
Electronic ISSN | 2052-4463 |
Publisher | Nature Research (part of Springer Nature) |
Peer Reviewed | Peer Reviewed |
Volume | 12 |
Issue | 1 |
Article Number | 1424 |
DOI | https://doi.org/10.1038/s41597-025-05646-w |
Public URL | https://uwe-repository.worktribe.com/output/14823553 |
Files
TEMSET-24K: Densely annotated dataset for indexing multipart endoscopic videos using surgical timeline segmentation
(4.3 Mb)
PDF
Licence
http://creativecommons.org/licenses/by-nc-nd/4.0/
Publisher Licence URL
http://creativecommons.org/licenses/by-nc-nd/4.0/
You might also like
Can we revitalize interventional healthcare with AI-XR surgical metaverses?
(2023)
Presentation / Conference Contribution
Deep learning-based multi-target regression for traffic-related air pollution forecasting
(2023)
Journal Article
SegCrop: Segmentation-based dynamic cropping of endoscopic videos to address label leakage in surgical tool detection
(2023)
Presentation / Conference Contribution
Privacy-preserving artificial intelligence in healthcare: Techniques and applications
(2023)
Journal Article
Downloadable Citations
About UWE Bristol Research Repository
Administrator e-mail: repository@uwe.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search