Hajderanj, L.
ORCID: https://orcid.org/0009-0007-0445-3049, Weheliye, I. and Chen, D.
(2019)
A new supervised t-SNE with dissimilarity measure for effective data visualization and classification.
In: ICSIE '19: Proceedings of the 8th International Conference on Software and Information Engineering, 9-12 April 2019, Cairo, Egypt, pp. 232-236.
doi: 10.1145/3328833.3328853
Abstract/Summary
In this paper, a new version of the Supervised t- Stochastic Neighbor Embedding (S-tSNE) algorithm is proposed which introduces the use of a dissimilarity measure related to class information. The proposed S-tSNE can be applied in any high dimensional dataset for visualization or as a feature extraction for classification problems. In this study, the S-tSNE is applied to three datasets MNIST, Chest x-ray, and SEER Breast Cancer. The two-dimensional data generated by the S-tSNE showed better visualization and an improvement in terms of classification accuracy in comparison to the original t- Stochastic Neighbor Embedding(t-SNE) method. The results from k-nearest neighbors (k-NN) classification model which used the lower dimension space generated by the new S-tSNE method showed more than 20% improvement on average in accuracy in all the three datasets compared with the t-SNE method. In addition, the classification accuracy using the S-tSNE for feature extraction was even higher than classification accuracy obtained from the original high dimensional data.
Altmetric Badge
| Item Type | Conference or Workshop Item (Paper) |
| URI | https://centaur.reading.ac.uk/id/eprint/122818 |
| Identification Number/DOI | 10.1145/3328833.3328853 |
| Refereed | Yes |
| Divisions | Henley Business School > Digitalisation, Marketing and Entrepreneurship |
| Download/View statistics | View download statistics for this item |
University Staff: Request a correction | Centaur Editors: Update this record
Download
Download