Skip to content
jdchoi77 edited this page Dec 13, 2014 · 4 revisions

NLP components in ClearNLP behave differently when different flags are set. All component flags are defined in CFlag.

COLLECT

The component is used for collecting lexicons before training. For example when this flag is set, AbstractPOSTagger collects lexicons such as ambiguity classes using POSCollector. This flag is not used for components that do not need to collect any lexicon before training (e.g., CommonTagger).

TRAIN

The component is used for generating training instances. When this flag is set, all features are extracted by using the oracle.

BOOTSTRAP

The component is used for generating training instances through bootstrapping. When this flag is set, the component requires a statistical model, trained from previous training or bootstrapping, and extracts features predicated by this model instead of the oracle. For instance, AbstractPOSTagger extracts features from part-of-peech tags automatically generated by the statistical model. See the following paper for more details about bootstrapping.

EVALUATE

The component is used for evaluating the statistical model. The gold-standard labels must be provided for this flag.

DECODE

The component is used for decoding where no gold-standard labels are provided.

Clone this wiki locally