Package jazzparser :: Package misc :: Package raphsto :: Class RaphstoHmm

Class RaphstoHmm

                       object --+    
                                |    
utils.nltk.ngram.model.NgramModel --+
                                    |
                                   RaphstoHmm

Known Subclasses:

Hidden Markov Model that implements the model described in the paper.

States are in the form of a tuple (tonic,mode,chord) where tonic is in \{0, ..., 11\}; mode is one of \{constants.MODE_MAJOR, constants.MODE_MINOR\}; chord is one of \{constants.CHORD_I, ..., constants.CHORD_VII\}.

Emissions are in the form of a list of pairs (pc,r), where pc is a pitch class (like tonic above) and r is an onset time abstraction and is one of \{0, ...,3\}.

Unlike with NgramModel, the emission domain is the domain of values from which each element of an emission is selected. In other words, the actually domain of emissions is the powerset of emission_dom.

In the description of the model, r is described as a condition of the emission distribution. Although the model is truly replicated here, the interface suggests otherwise, since we treat the rythmic markers as if they're part of the emissions. From a conceptual point of view, this makes more sense and I think it's rather odd that the model doesn't treat them this way.

As for prior distributions (start state distribution), we ignore the tonic of the first state - it doesn't make any sense to look at it since the model is pitch-invariant throughout. We then just use our marginalized chord distribution and assume that the mode distribution is uniform (there are only two and it probably won't make much difference).

Note: mutable distributions: if you use mutable distributions for transition or emission distributions, make sure you invalidate the cache by calling clear_cache after updating the distributions. Various caches are used to speed up retreival of probabilities. If you fail to do this, you'll end up getting some values unpredictably from the old distributions

Instance Methods

[hide private]

__init__(self, key_transition_dist, chord_transition_dist, emission_dist, chord_dist, model_name='default', history='', description='', chord_set='scale+dom7')
x.__init__(...) initializes x; see help(type(x)) for signature source code

label(self, handler)
Produces labels for the midi data using the model.

source code

clear_cache(self)
Initializes or empties probability distribution caches.

source code

add_history(self, string)
Adds a line to the end of this model's history string.

source code

sequence_to_ngram(self, seq)

source code

ngram_to_sequence(self, ngram)

source code

last_label_in_ngram(self, ngram)

source code

backoff_ngram(self, ngram)

source code

set_chord_transition_probabilities(self, spec)
Sets the parameters of the chord transition distribution.

source code

retrain_unsupervised(self, *args, **kwargs)
Unsupervised training.

source code

transition_log_probability(self, state, previous_state)
Gives the probability P(label_i | label_(i-1), ..., label_(i-n)), where the previous labels are given in the sequence label_context.

source code

emission_log_probability(self, emission, state)
Gives the probability P(emission | label).

source code

forward_log_probabilities(self, sequence, normalize=True)
We override this to provide a faster implementation.

source code

backward_log_probabilities(self, sequence, normalize=True)
We override this to provide a faster implementation.

source code

normal_forward_probabilities(self, sequence)
If you want the normalized matrix of forward probabilities, it's ok to use normal (non-log) probabilities and these can be computed more quickly, since you don't need to sum logs (which is time consuming).

source code

normal_backward_probabilities(self, sequence)
Return the backward probability matrices a Numpy array.

source code

compute_gamma(self, sequence, forward=None, backward=None)
Computes the gamma matrix used in Baum-Welch.

source code

compute_xi(self, sequence, forward=None, backward=None)
Computes the xi matrix used by Baum-Welch.

source code

to_picklable_dict(self)
Produces a picklable representation of model as a dict.

source code

_get_my_filename(self)

source code

save(self)
Saves the model data to a file.

source code

delete(self)
Removes all the model's data.

source code

Inherited from utils.nltk.ngram.model.NgramModel: __repr__, backward_probabilities, decode_forward, decode_gamma, emission_probability, forward_backward_log_probabilities, forward_backward_probabilities, forward_probabilities, gamma_probabilities, generalized_viterbi, generate, get_all_ngrams, get_backoff_models, get_emission_matrix, get_transition_matrix, labeled_sequence_log_probability, normal_forward_backward_probabilities, precompute, transition_log_probability_debug, transition_probability, transition_probability_debug, viterbi_decode, viterbi_selector_probabilities

Inherited from utils.nltk.ngram.model.NgramModel (private): _get_model_type, _get_transition_backoff_scaler

Inherited from object: __delattr__, __format__, __getattribute__, __hash__, __new__, __reduce__, __reduce_ex__, __setattr__, __sizeof__, __str__, __subclasshook__

Class Methods

[hide private]

get_label_dom(cls, chord_set='scale+dom7') source code

initialize_chord_types(cls, probs, model_name='default', chord_set='scale+dom7')
Creates a new model with the distributions initialized naively to favour simple chord-types, as R&S do in the paper. source code

initialize_existing_model(cls, old_model_name, model_name='default')
Initializes a model using parameters from an already trained model. source code

from_picklable_dict(cls, data, model_name='default')
Reproduces an n-gram model that was converted to a picklable form using to_picklable_dict. source code

_get_model_dir(cls)

source code

_get_filename(cls, model_name)

source code

list_models(cls)
Returns a list of the names of available models.

source code

load_model(cls, model_name)

source code

Static Methods

[hide private]

get_trainer()

source code

train(*args, **kwargs)
We don't train a RaphstoHmm using the train method, since our training procedure is not the same as the superclass, so this would be confusing, as this method would require completely different input. source code

Class Variables

[hide private]

V = {0: 1, 1: 1, 2: 1, 3: 4, 4: 5}
This is the function (mapping) described in the model as V.

LABEL_DOM = None
hash(x)

Properties

[hide private]

_filename

Inherited from utils.nltk.ngram.model.NgramModel: model_type

Inherited from object: __class__

Method Details

Class RaphstoHmm

__init__(self, key_transition_dist, chord_transition_dist, emission_dist, chord_dist, model_name='default', history='', description='', chord_set='scale+dom7') (Constructor)

label(self, handler)

clear_cache(self)

train(*args, **kwargs) Static Method

initialize_chord_types(cls, probs, model_name='default', chord_set='scale+dom7') Class Method

set_chord_transition_probabilities(self, spec)

retrain_unsupervised(self, *args, **kwargs)

transition_log_probability(self, state, previous_state)

emission_log_probability(self, emission, state)

forward_log_probabilities(self, sequence, normalize=True)

backward_log_probabilities(self, sequence, normalize=True)

normal_forward_probabilities(self, sequence)

normal_backward_probabilities(self, sequence)

compute_gamma(self, sequence, forward=None, backward=None)

compute_xi(self, sequence, forward=None, backward=None)

to_picklable_dict(self)

from_picklable_dict(cls, data, model_name='default') Class Method

_filename

init(self, key_transition_dist, chord_transition_dist, emission_dist, chord_dist, model_name=`'default'`, history=`''`, description=`''`, chord_set=`'scale+dom7'`)
(Constructor)

train(*args, **kwargs)
Static Method

initialize_chord_types(cls, probs, model_name=`'default'`, chord_set=`'scale+dom7'`)
Class Method

from_picklable_dict(cls, data, model_name=`'default'`)
Class Method