Unified embedding and clustering

Authors: Allaoui, M., Kherfi, M.L., Cheriet, A. and Bouchachia, A.

Journal: Expert Systems with Applications

Volume: 238

ISSN: 0957-4174

DOI: 10.1016/j.eswa.2023.121923

Abstract:

This paper investigates the problem of treating embedding and clustering simultaneously to uncover data structure reliably by constraining manifold embedding through clustering. Conversely, most existing methods perform embedding sequentially, followed by clustering, which leads to a clustering that sometimes pushes data towards a direction induced by embedding. Instead, we perform them simultaneously through an original formulation, which allows for preserving the data's original structure in the embedding space and producing a better clustering assignment. To achieve this goal, we introduce a novel algorithm that unifies manifold embedding and clustering (UEC). The proposed UEC algorithm is based on a bi-objective loss function that combines data embedding and clustering, which is optimised using three different ways: (1) Comma Variant, (2) Plus Variant, and (3) Light Plus Variant. The experimental results with several real-world datasets show that UEC is competitive with the state-of-the-art embedding and clustering methods showing in particular that UEC allows for better preservation of the structure of the dataset resulting in better clustering performance.

https://eprints.bournemouth.ac.uk/39237/

Source: Scopus

Unified embedding and clustering

Authors: Allaoui, M., Kherfi, M.L., Cheriet, A. and Bouchachia, A.

Journal: EXPERT SYSTEMS WITH APPLICATIONS

Volume: 238

eISSN: 1873-6793

ISSN: 0957-4174

DOI: 10.1016/j.eswa.2023.121923

https://eprints.bournemouth.ac.uk/39237/

Source: Web of Science (Lite)

Unified embedding and clustering

Authors: Allaoui, M., Kherfi, M.L., Cheriet, A. and Bouchachia, A.

Journal: Expert Systems with Applications

Volume: 238

Issue: Part E

ISSN: 0957-4174

Abstract:

This paper investigates the problem of treating embedding and clustering simultaneously to uncover data structure reliably by constraining manifold embedding through clustering. Conversely, most existing methods perform embedding sequentially, followed by clustering, which leads to a clustering that sometimes pushes data towards a direction induced by embedding. Instead, we perform them simultaneously through an original formulation, which allows for preserving the data's original structure in the embedding space and producing a better clustering assignment. To achieve this goal, we introduce a novel algorithm that unifies manifold embedding and clustering (UEC). The proposed UEC algorithm is based on a bi-objective loss function that combines data embedding and clustering, which is optimised using three different ways: (1) Comma Variant, (2) Plus Variant, and (3) Light Plus Variant. The experimental results with several real-world datasets show that UEC is competitive with the state-of-the-art embedding and clustering methods showing in particular that UEC allows for better preservation of the structure of the dataset resulting in better clustering performance.

https://eprints.bournemouth.ac.uk/39237/

Source: BURO EPrints