SenseClusters

SenseClusters is a package of (mostly) Perl programs that allows a user to cluster similar contexts together using unsupervised knowledge-lean methods. These techniques have been applied to word sense discrimination, email categorization, and name discrimination. The supported methods include the native SenseClusters techniques and Latent Semantic Analysis.

You can see a video tutorial entitled "Language Independent Methods of Clustering Similar Contexts" from EACL 2006 that introduces SenseClusters (135 minutes).

We have mailing lists for users and news and developers.

Computational Approaches to Measuring the Similarity of Short Contexts : A Review of Applications and Methods gives a good idea of some of the kinds of problems that can be approached with SenseClusters.

Try the Web Interface here (production) or here (backup) (both are version 1.01)

Download the current version (v1.03, released June 29, 2013) from CPAN or Sourceforge

Publications

Other Packages Used by SenseClusters

SenseClusters Development Team

Acknowledgments

The development of SenseClusters has been supported by a National Science Foundation Faculty Early Career Development (CAREER) Program award (#0092784, 2001-2007).

SourceForge.net Logo NSF Logo