Bournemouth University

Assessing performance of artificial neural networks and re-sampling techniques for healthcare datasets

Authors: Saul, M., Rostami, S.

Journal: Health Informatics Journal

Publication Date: 14/03/2022

Volume: 28

Issue: 1

eISSN: 1741-2811

ISSN: 1460-4582

DOI: 10.1177/14604582221087109

Abstract:

Re-sampling methods for class imbalance problems have been shown to improve classification accuracy by mitigating the bias introduced by differences in class size. However, a model that applies a specific re-sampling technique before artificial neural network (ANN) training may not be suitable for classifying varied datasets from the healthcare industry. Five healthcare-related datasets were used across three re-sampling conditions: under-sampling, over-sampling and combi-sampling. Within each condition, different algorithmic approaches were applied to each dataset and the results were statistically analysed for a significant difference in ANN performance. In the combi-sampling condition, four of the five datasets did not show significant consistency in the optimal re-sampling technique between the F1-score and Area Under the Receiver Operating Characteristic Curve evaluation methods. In contrast, in the over-sampling and under-sampling conditions, all five datasets yielded the same optimal algorithmic approach across both evaluation methods. Furthermore, the optimal combi-sampling technique (under-sampling, over-sampling and convergence point) was consistent across evaluation measures in only two of the five datasets. This study illustrates how differing ANN performance on datasets from the same industry can arise in two ways: the same re-sampling technique can generate varying ANN performance on different datasets, and different re-sampling techniques can generate varying ANN performance on the same dataset.
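
For readers unfamiliar with the terms in the abstract, the following is a minimal illustrative sketch of re-sampling and of the two evaluation measures it names. It is not the authors' method: the abstract does not name the specific algorithms used, so plain random over-sampling and random under-sampling are swapped in here as the simplest stand-ins, and every function name below is hypothetical. The F1-score and the area under the ROC curve are computed from their standard definitions for a binary problem.

```python
import random
from collections import Counter


def random_over_sample(X, y, seed=0):
    """Duplicate randomly chosen rows of smaller classes until every class
    matches the largest class (illustrative stand-in, not the paper's algorithm)."""
    rng = random.Random(seed)
    counts = Counter(y)
    target = max(counts.values())
    X_out, y_out = list(X), list(y)
    for label, n in counts.items():
        pool = [x for x, lab in zip(X, y) if lab == label]
        for _ in range(target - n):
            X_out.append(rng.choice(pool))
            y_out.append(label)
    return X_out, y_out


def random_under_sample(X, y, seed=0):
    """Keep a random subset of each class, sized to the smallest class
    (illustrative stand-in, not the paper's algorithm)."""
    rng = random.Random(seed)
    counts = Counter(y)
    target = min(counts.values())
    X_out, y_out = [], []
    for label in counts:
        pool = [x for x, lab in zip(X, y) if lab == label]
        for x in rng.sample(pool, target):
            X_out.append(x)
            y_out.append(label)
    return X_out, y_out


def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def roc_auc(y_true, scores, positive=1):
    """Area under the ROC curve: the fraction of (positive, negative) pairs
    in which the positive example gets the higher score (ties count half)."""
    pos = [s for t, s in zip(y_true, scores) if t == positive]
    neg = [s for t, s in zip(y_true, scores) if t != positive]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy imbalanced data: eight negatives, two positives (made up for illustration).
X = [[i] for i in range(10)]
y = [0] * 8 + [1] * 2
X_over, y_over = random_over_sample(X, y)
X_under, y_under = random_under_sample(X, y)
print(Counter(y_over), Counter(y_under))
```

In a comparison like the one the abstract describes, a classifier would be trained on each re-sampled training set and scored with both measures on held-out data that has not been re-sampled; the two measures need not rank the re-sampling techniques in the same order.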

https://eprints.bournemouth.ac.uk/36820/

Source: Scopus

Assessing performance of artificial neural networks and re-sampling techniques for healthcare datasets

Authors: Saul, M., Rostami, S.

Journal: Health Informatics Journal

Publication Date: 2022

Volume: 28

Issue: 1

Pages: 14604582221087109

eISSN: 1741-2811

DOI: 10.1177/14604582221087109

https://eprints.bournemouth.ac.uk/36820/

Source: PubMed

Assessing performance of artificial neural networks and re-sampling techniques for healthcare datasets

Authors: Saul, M., Rostami, S.

Journal: Health Informatics Journal

Publication Date: 01/2022

Volume: 28

Issue: 1

eISSN: 1741-2811

ISSN: 1460-4582

DOI: 10.1177/14604582221087109

https://eprints.bournemouth.ac.uk/36820/

Source: Web of Science

Assessing performance of artificial neural networks and re-sampling techniques for healthcare datasets

Authors: Saul, M., Rostami, S.

Journal: Health Informatics Journal

Publication Date: 01/2022

Volume: 28

Issue: 1

Pages: 14604582221087109

eISSN: 1741-2811

ISSN: 1460-4582

DOI: 10.1177/14604582221087109

https://eprints.bournemouth.ac.uk/36820/

Source: Europe PubMed Central