SNN Input Parameters: how are they related?

Guilherme Moreira, Maribel Yasmina Santos, João Moura Pires: SNN Input Parameters: how are they related? In: Parallel and Distributed Systems (ICPADS), 2013 International Conference on, pp. 492–497, IEEE 2013.

Abstract

Nowadays, organizations are facing several challenges when they try to analyze generated data with the aim of extracting useful information. This analytical capacity needs to be enhanced with tools capable of dealing with big data sets without making the analytical process a difficult task. Clustering is usually used, as this technique does not require any prior knowledge about the data. However, clustering algorithms usually require one or more input parameters that influence the clustering process and the results that can be obtained. This work analyses the relation between the three input parameters of the SNN (Shared Nearest Neighbor) algorithm and proposes specific guidelines for the identification of the appropriate input parameters that optimize the processing time.
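As a rough illustration of where the three input parameters enter the algorithm, the following is a minimal sketch of SNN clustering in plain Python. The abstract does not name the parameters; the names `k` (neighbour list size), `eps` (minimum number of shared neighbours for a strong link) and `min_pts` (minimum number of strong links for a core point) are assumptions that follow the usual description of SNN, and the function `snn_cluster` is hypothetical, not the authors' implementation. It uses brute-force neighbour search and is only meant for small inputs.

```python
from math import dist


def snn_cluster(points, k, eps, min_pts):
    """Sketch of SNN clustering; returns one label per point, -1 meaning noise.

    k, eps and min_pts are assumed names for the three SNN input parameters.
    """
    n = len(points)

    # Parameter k: keep the k nearest neighbours of every point (brute force).
    knn = []
    for i in range(n):
        order = sorted((j for j in range(n) if j != i),
                       key=lambda j: dist(points[i], points[j]))
        knn.append(set(order[:k]))

    def similarity(i, j):
        # Shared neighbours count only if i and j are in each other's lists.
        if i in knn[j] and j in knn[i]:
            return len(knn[i] & knn[j])
        return 0

    # Parameter eps: a link is "strong" when at least eps neighbours are shared.
    strong = [[j for j in knn[i] if similarity(i, j) >= eps] for i in range(n)]

    # Parameter min_pts: a core point has at least min_pts strong links.
    core = [len(links) >= min_pts for links in strong]

    # Core points connected through strong links form one cluster.
    labels = [-1] * n
    cluster = 0
    for i in range(n):
        if not core[i] or labels[i] != -1:
            continue
        labels[i] = cluster
        stack = [i]
        while stack:
            p = stack.pop()
            for q in strong[p]:
                if core[q] and labels[q] == -1:
                    labels[q] = cluster
                    stack.append(q)
        cluster += 1

    # Non-core points join the cluster of their most similar core point, if any.
    for i in range(n):
        if labels[i] == -1:
            cores = [j for j in strong[i] if core[j]]
            if cores:
                best = max(cores, key=lambda j: similarity(i, j))
                labels[i] = labels[best]
    return labels


# Two compact groups plus one isolated point (made-up illustrative data).
group_a = [(0, 0), (0, 1), (1, 0), (1, 1), (0.5, 0.5)]
group_b = [(10, 10), (10, 11), (11, 10), (11, 11), (10.5, 10.5)]
labels = snn_cluster(group_a + group_b + [(5, 50)], k=4, eps=2, min_pts=2)
```

In this sketch the parameters are visibly coupled: `eps` can never exceed `k`, since two points cannot share more neighbours than their lists hold, and `min_pts` can never exceed `k`, since strong links are drawn only from the neighbour list.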

BibTeX

@inproceedings{moreira2013snn,
title = {SNN Input Parameters: how are they related?},
author = {Guilherme Moreira and Maribel Yasmina Santos and João Moura Pires},
url = {http://dx.doi.org/10.1109/ICPADS.2013.89},
year = {2013},
date = {2013-01-01},
booktitle = {Parallel and Distributed Systems (ICPADS), 2013 International Conference on},
pages = {492--497},
organization = {IEEE},
abstract = {Nowadays, organizations are facing several challenges when they try to analyze generated data with the aim of extracting useful information. This analytical capacity needs to be enhanced with tools capable of dealing with big data sets without making the analytical process a difficult task. Clustering is usually used, as this technique does not require any prior knowledge about the data. However, clustering algorithms usually require one or more input parameters that influence the clustering process and the results that can be obtained. This work analyses the relation between the three input parameters of the SNN (Shared Nearest Neighbor) algorithm and proposes specific guidelines for the identification of the appropriate input parameters that optimize the processing time.},
keywords = {Clustering, Density-based Clustering, Input Parameters Tuning, Shared Nearest Neighbour},
pubstate = {published},
tppubtype = {inproceedings}
}