Training interface to the C&C supertagger.
This automates the process of training the C&C supertagger on data
from the database. The data should first be generated using the script in
the annotator bin.
Training data should be in the Jazz Parser format, which differs
slightly from the C&C format. Instead of
<obs>|<pos>|<tag>, each chord should be
represented as <chord>|<obs>|<pos>|<tag>. Use
generate_model_data to generate this from the database.
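
The difference between the two token layouts can be illustrated with a minimal sketch. The helper names format_token and to_candc_token are hypothetical and are not part of this module, and the field values in the usage note are made-up placeholders. The sketch assumes only what is stated above: fields are separated by "|" and the Jazz Parser format puts a <chord> field first.

```python
def format_token(chord, obs, pos, tag):
    """Build one <chord>|<obs>|<pos>|<tag> token (hypothetical helper)."""
    fields = [chord, obs, pos, tag]
    for field in fields:
        # A literal "|" inside a field would make the token ambiguous.
        if "|" in field:
            raise ValueError("field may not contain '|': %r" % (field,))
    return "|".join(fields)


def to_candc_token(token):
    """Drop the leading <chord> field, leaving <obs>|<pos>|<tag>."""
    fields = token.split("|")
    if len(fields) != 4:
        raise ValueError("expected 4 fields, got %d" % len(fields))
    return "|".join(fields[1:])
```

With placeholder values, format_token("Dm7", "obs1", "pos1", "tag1") gives "Dm7|obs1|pos1|tag1", and passing that token to to_candc_token gives "obs1|pos1|tag1". In practice generate_model_data produces this data from the database, so helpers like these would only be useful for inspecting or checking a file.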
train_model(model, data_filename, holdout_partitions=0, train_params={}, chordmap=None)
Train a C&C model by calling the C&C supertagger training routine.
train_model_on_sequence_data(model, data_filename, *args, **kwargs)
Same as train_model, but takes a db_mirrors sequence data file as
input, rather than a C&C training data file.