ORCA
Online Research @ Cardiff

Clear Cookie - decide language by browser settings

Numeric prediction of dissolved oxygen status through two-stage training for classification-driven regression

Guo, Pengfei, Liu, Han

, Liu, Shuangyin and Xu, Longqin 2019. Numeric prediction of dissolved oxygen status through two-stage training for classification-driven regression. Presented at: International Conference on Machine Learning and Cybernetics, Kobe, Japan, 7-10 July 2019. 2019 International Conference on Machine Learning and Cybernetics (ICMLC). 10.1109/ICMLC48188.2019.8949196

[thumbnail of ICMLC_2019_Paper_6070.pdf]

Preview

PDF - Accepted Post-Print Version
Download (131kB) | Preview

Official URL: http://dx.doi.org/10.1109/ICMLC48188.2019.8949196

Abstract

Dissolved oxygen of aquaculture is an important measure of the quality of culture environment and how aquatic products have been grown. In the machine learning context, the above measure can be achieved by defining a regression problem, which aims at numerical prediction of the dissolved oxygen status. In general, the vast majority of popular machine learning algorithms were designed for undertaking classification tasks. In order to effectively adopt the popular machine learning algorithms for the above-mentioned numerical prediction, in this paper, we propose a two-stage training approach that involves transforming a regression problem into a classification problem and then transforming it back to regression problem. In particular, unsupervised discretization of continuous attributes is adopted at the first stage to transform the target (numeric) attribute into a discrete (nominal) one with several intervals, such that popular machine learning algorithms can be used to predict the interval to which an instance belongs in the setting of a classification task. Furthermore, based on the classification result at the first stage, some of the instances within the predicted interval are selected for training at the second stage towards numerical prediction of the target attribute value of each instance. An experimental study is conducted to investigate in general the effectiveness of the popular learning algorithms in the numerical prediction task and also analyze how the increase of the number of training instances (selected at the second training stage) can impact on the final prediction performance. The results show that the adoption of decision tree learning and neural networks lead to better and more stable performance than Naive Bayes, K Nearest Neighbours and Support Vector Machine.

Item Type:	Conference or Workshop Item (Paper)
Date Type:	Publication
Status:	Published
Schools:	Computer Science & Informatics
Subjects:	Q Science > QA Mathematics > QA75 Electronic computers. Computer science
ISBN:	9781728128160
ISSN:	2160-133X
Related URLs:	Organisation
Date of First Compliant Deposit:	19 July 2019
Date of Acceptance:	21 May 2019
Last Modified:	07 Dec 2022 13:47
URI:	https://orca.cardiff.ac.uk/id/eprint/124288

Citation Data

Cited 2 times in Scopus. View in Scopus. Powered By Scopus® Data

Actions (repository staff only)

Edit Item

Altmetric

Dimensions

Download Statistics

Downloads

Downloads per month over past year

View more statistics

CORE (COnnecting REpositories)