Denoising Manifolds for Dimension

Jammalamadaka, Arvind K.

Serdica Mathematical Journal (2009)

  • Volume: 35, Issue: 1, page 109-116
  • ISSN: 1310-6600

Abstract

top
2000 Mathematics Subject Classification: 68T01, 62H30, 32C09.Locally Linear Embedding (LLE) has gained prominence as a tool in unsupervised non-linear dimensional reduction. While the algorithm aims to preserve certain proximity relations between the observed points, this may not always be desirable if the shape in higher dimensions that we are trying to capture is observed with noise. This note suggests that a desirable first step is to remove or at least reduce the noise in the observations before applying the LLE algorithm. While careful denoising involves knowledge of (i) the level of noise (ii) the local sampling density and (iii) the local curvature at the point in question, in most practical situations such information is not easily available. Under the model we discuss, a simple averaging of the neighboring points does reduce the noise and is easy to implement. We consider the Swiss roll example to illustrate how well this procedure works. Finally we apply these ideas on biological data and perform clustering after such a 2-step procedure of denoising and dimension reduction.

How to cite

top

Jammalamadaka, Arvind K.. "Denoising Manifolds for Dimension." Serdica Mathematical Journal 35.1 (2009): 109-116. <http://eudml.org/doc/281533>.

@article{Jammalamadaka2009,
abstract = {2000 Mathematics Subject Classification: 68T01, 62H30, 32C09.Locally Linear Embedding (LLE) has gained prominence as a tool in unsupervised non-linear dimensional reduction. While the algorithm aims to preserve certain proximity relations between the observed points, this may not always be desirable if the shape in higher dimensions that we are trying to capture is observed with noise. This note suggests that a desirable first step is to remove or at least reduce the noise in the observations before applying the LLE algorithm. While careful denoising involves knowledge of (i) the level of noise (ii) the local sampling density and (iii) the local curvature at the point in question, in most practical situations such information is not easily available. Under the model we discuss, a simple averaging of the neighboring points does reduce the noise and is easy to implement. We consider the Swiss roll example to illustrate how well this procedure works. Finally we apply these ideas on biological data and perform clustering after such a 2-step procedure of denoising and dimension reduction.},
author = {Jammalamadaka, Arvind K.},
journal = {Serdica Mathematical Journal},
keywords = {Nonlinear Dimension Reduction; Locally Linear Embedding; Noise Reduction; Smoothing; Nearest Neighbors; Clustering; nonlinear dimension reduction; locally linear embedding; noise reduction; smoothing; nearest neighbors; clustering},
language = {eng},
number = {1},
pages = {109-116},
publisher = {Institute of Mathematics and Informatics Bulgarian Academy of Sciences},
title = {Denoising Manifolds for Dimension},
url = {http://eudml.org/doc/281533},
volume = {35},
year = {2009},
}

TY - JOUR
AU - Jammalamadaka, Arvind K.
TI - Denoising Manifolds for Dimension
JO - Serdica Mathematical Journal
PY - 2009
PB - Institute of Mathematics and Informatics Bulgarian Academy of Sciences
VL - 35
IS - 1
SP - 109
EP - 116
AB - 2000 Mathematics Subject Classification: 68T01, 62H30, 32C09.Locally Linear Embedding (LLE) has gained prominence as a tool in unsupervised non-linear dimensional reduction. While the algorithm aims to preserve certain proximity relations between the observed points, this may not always be desirable if the shape in higher dimensions that we are trying to capture is observed with noise. This note suggests that a desirable first step is to remove or at least reduce the noise in the observations before applying the LLE algorithm. While careful denoising involves knowledge of (i) the level of noise (ii) the local sampling density and (iii) the local curvature at the point in question, in most practical situations such information is not easily available. Under the model we discuss, a simple averaging of the neighboring points does reduce the noise and is easy to implement. We consider the Swiss roll example to illustrate how well this procedure works. Finally we apply these ideas on biological data and perform clustering after such a 2-step procedure of denoising and dimension reduction.
LA - eng
KW - Nonlinear Dimension Reduction; Locally Linear Embedding; Noise Reduction; Smoothing; Nearest Neighbors; Clustering; nonlinear dimension reduction; locally linear embedding; noise reduction; smoothing; nearest neighbors; clustering
UR - http://eudml.org/doc/281533
ER -

NotesEmbed ?

top

You must be logged in to post comments.

To embed these notes on your page include the following JavaScript code on your page where you want the notes to appear.

Only the controls for the widget will be shown in your chosen language. Notes will be shown in their authored language.

Tells the widget how many notes to show per page. You can cycle through additional notes using the next and previous controls.

    
                

Note: Best practice suggests putting the JavaScript code just before the closing </body> tag.