Thomas Rowland
Machine vision and deep learning for robotic harvesting of shiitake mushrooms
Rowland, Thomas; Hansen, Mark; Smith, Melvyn; Smith, Lyndon
Authors
Mark Hansen Mark.Hansen@uwe.ac.uk
Professor of Machine Vision and Machine Learning
Melvyn Smith Melvyn.Smith@uwe.ac.uk
Research Centre Director Vision Lab/Prof
Lyndon Smith Lyndon.Smith@uwe.ac.uk
Professor in Computer Simulation and Machine
Abstract
There is a substantial requirement for the modernisation of our agricultural practices. Current methods can be susceptible to labour shortages, damaging to the environment, and inefficient; and these factors can lead to food wastage so threatening food security. This paper aims to address this issue by employing 2D/3D computer vision and machine learning to assist with robotic harvesting of shiitake mushrooms. Shiitake mushrooms are one of the most valuable gourmet mush-rooms, but also the most labour intensive to grow. The work described demonstrates that state-of-the-art machine vision and deep learning techniques are useful for this application; and are effective at multiple stages throughout the harvest of shiitake mushrooms. Following the creation of the first publicly available segmented shiitake mushroom dataset, YOLOv8-seg and Detectron2 Mask R-CNN models were trained to an Average Precision of 94.9% and 77.7% respectively, segmenting up to 85 shiitake mushrooms in a single image. Additional exploratory work with a smaller keypoint dataset was conducted to determine the suitability of both of these architectures to plot the cut points for the harvest operation, with results proving the feasibility for the task
Presentation Conference Type | Conference Paper (published) |
---|---|
Conference Name | 19th International Symposium on Visual Computing |
Start Date | Oct 21, 2024 |
End Date | Oct 23, 2024 |
Acceptance Date | Aug 26, 2024 |
Deposit Date | Oct 28, 2024 |
Journal | Lecture Notes in Computer Science |
Peer Reviewed | Peer Reviewed |
Public URL | https://uwe-repository.worktribe.com/output/13321473 |
Additional Information | The paper was accepted for oral presentation and delivered on 23/10/24. The papers are being collated for publication (by the end of November I believe). I'm not sure whether it will be open access or not. |
This file is under embargo due to copyright reasons.
Contact Mark.Hansen@uwe.ac.uk to request a copy for personal use.
You might also like
3D Machine vision and deep learning for enabling automated and sustainable assistive physiotherapy
(2024)
Presentation / Conference Contribution
Estimating water storage from images
(2024)
Presentation / Conference Contribution
Maize yield predictive models and mobile-based decision support system for smallholder farmers in Africa
(2022)
Presentation / Conference Contribution
A robust machine learning framework for diabetes prediction
(2021)
Presentation / Conference Contribution
Towards facial expression recognition for on-farm welfare assessment in pigs
(2021)
Journal Article
Downloadable Citations
About UWE Bristol Research Repository
Administrator e-mail: repository@uwe.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search