Skip to main content

Research Repository

Advanced Search

TEMSET-24K: Densely annotated dataset for indexing multipart endoscopic videos using surgical timeline segmentation

Bilal, Muhammad; Alam, Mahmood; Bapu, Deepashree; Korsgen, Stephan; Lal, Neeraj; Bach, Simon; Hajiyavand, Amir M.; Ali, Muhammed; Soomro, Kamran; Qasim, Iqbal; Capik, Paweł; Khan, Aslam; Khan, Zaheer; Vohra, Hunaid; Caputo, Massimo; Beggs, Andrew D.; Qayyum, Adnan; Qadir, Junaid; Ashraf, Shazad Q.

TEMSET-24K: Densely annotated dataset for indexing multipart endoscopic videos using surgical timeline segmentation Thumbnail


Authors

Muhammad Bilal Muhammad.Bilal@uwe.ac.uk
TSU Research Fellow Timesheet NOM

Mahmood Alam

Deepashree Bapu

Stephan Korsgen

Neeraj Lal

Simon Bach

Amir M. Hajiyavand

Muhammed Ali

Profile image of Kamran Soomro

Dr Kamran Soomro Kamran.Soomro@uwe.ac.uk
Associate Professor in Artificial Intelligence

Iqbal Qasim

Aslam Khan

Zaheer Khan Zaheer2.Khan@uwe.ac.uk
Professor in Computer Science

Hunaid Vohra

Massimo Caputo

Andrew D. Beggs

Adnan Qayyum

Junaid Qadir

Shazad Q. Ashraf



Abstract

Indexing endoscopic surgical videos is vital in surgical data science, forming the basis for systematic retrospective analysis and clinical performance evaluation. Despite its significance, current video analytics rely on manual indexing, a time-consuming process. Advances in computer vision, particularly deep learning, offer automation potential, yet progress is limited by the lack of publicly available, densely annotated surgical datasets. To address this, we present TEMSET-24K, an open-source dataset comprising 24,306 trans-anal endoscopic microsurgery (TEMS) video microclips. Each clip is meticulously annotated by clinical experts using a novel hierarchical labeling taxonomy encompassing “phase, task, and action” triplets, capturing intricate surgical workflows. To validate this dataset, we benchmarked deep learning models, including transformer-based architectures. Our in silico evaluation demonstrates high accuracy (up to 0.99) and F1 scores (up to 0.99) for key phases like “Setup” and “Suturing.” The STALNet model, tested with ConvNeXt, ViT, and SWIN V2 encoders, consistently segmented well-represented phases. TEMSET-24K provides a critical benchmark, propelling state-of-the-art solutions in surgical data science.

Journal Article Type Article
Acceptance Date Jul 18, 2025
Online Publication Date Aug 14, 2025
Publication Date Aug 14, 2025
Deposit Date Aug 18, 2025
Publicly Available Date Aug 19, 2025
Journal Scientific Data
Electronic ISSN 2052-4463
Publisher Nature Research (part of Springer Nature)
Peer Reviewed Peer Reviewed
Volume 12
Issue 1
Article Number 1424
DOI https://doi.org/10.1038/s41597-025-05646-w
Public URL https://uwe-repository.worktribe.com/output/14823553

Files





You might also like



Downloadable Citations