Training tok2vec, parser, tagger, morphologizer. Keep getting 'XX' tag predictions, 'X' pos predictions, and 'dep' relation predictions #13504
Unanswered
skarokin
asked this question in
Help: Coding & Implementations
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I'm training tok2vec, parser, tagger, and morphologizer. When making predictions, my tagger is constantly making 'XX' predictions and my parser is predicting all relations as 'dep'. My config file was built as such. I did not make any modifications to the config after running the below command.
Im relatively certain that the problem is not my data. The data I originally trained on was an augmented set of OntoNotes 5.0. I tried training my same config on a completely unchanged OntoNotes 5.0. I still get many 'XX' predictions and 'dep' relations. Am I missing something from the pipeline?
This augmented OntoNotes 5.0 does NOT contain any new part of speech tags or dependency relations, it simply copies ~40% of sentences and changes the word form, the POS, and the tag based on some rules.
My metrics are:
I only need the parser, tagger, and morphologizer trained. Below is an image that shows a sample output when loading my model as such
nlp = spacy.load('data/models/train_5/model-last')
Beta Was this translation helpful? Give feedback.
All reactions