Giter Site home page Giter Site logo

som's Introduction

som

A simple self-organizing map implementation in Python.

Self-organizing maps are also called Kohonen maps and were invented by Teuvo Kohonen.(1) They are an unsupervised machine learning technique to efficiently create spatially organized internal representations of various types of data. For example, SOMs are well-suited for the visualization of high-dimensional data.

This is a simple implementation of SOMs in Python. This SOM has periodic boundary conditions and therefore can be imagined as a "donut". The implementation uses numpy.

Usage

Download the file som.py and place it somewhere in your PYTHONPATH.

Then you can import and use the SOM class as follows:

import numpy as np
from som import SOM

# generate some random data with 36 features
data1 = np.random.normal(loc=-.25, scale=0.5, size=(500, 36))
data2 = np.random.normal(loc=.25, scale=0.5, size=(500, 36))
data = np.vstack((data1, data2))

som = SOM(10, 10)  # initialize the SOM
som.fit(data, 10000)  # fit the SOM for 10000 epochs

targets = 500 * [0] + 500 * [1]  # create some dummy target values

# now visualize the learned representation with the class labels
som.plot_point_map(data, targets, ['Class 0', 'Class 1'], filename='som.png')
som.plot_class_density(data, targets, t=0, name='Class 0', filename='class_0.png')

The same way you can handle your own data.

The SOM class has the following methods:

  • winner(vector): compute the winner neuron closest to a given data point in vector (Euclidean distance)
  • cycle(vector): perform one iteration in adapting the SOM towards the chosen data point in vector
  • fit(data, epochs, batch_size=1): train the SOM on the given data for several epochs
  • transform(data): transform given data in to the SOM space
  • distance_map(): get a map of every neuron and its distances to all neighbors
  • winner_map(data): get the number of times, a certain neuron in the trained SOM is winner for the given data
  • som_error(data): calculates the overall error as the average difference between the winning neurons and the data
  • plot_point_map(data, targets, targetnames, filename=None, colors=None, markers=None, density=True): visualize the som with all data as points around the neurons
  • plot_density_map(data, filename=None, internal=False): visualize the data density in different areas of the SOM.
  • plot_class_density(data, targets, t, name, colormap='Oranges', filename=None): plot a density map only for the given class

References:

(1) Kohonen, T. Self-Organized Formation of Topologically Correct Feature Maps. Biol. Cybern. 1982, 43 (1), 59โ€“69.

This work was partially inspired by ramalina's som implementation and JustGlowing's minisom.

som's People

Contributors

alexarnimueller avatar silvaemerson avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.