
When training a model, we don’t just want it to memorize our examples – we want

Minimising the gradient of the weights should result in predictions that areĬloser to the reference labels on the training data.

Training is an iterative process in which the model’s predictions are comparedĪgainst the reference annotations in order to estimate the gradient of the This could be a part-of-speech tag, a named entity or To trainĪ model, you first need training data – examples of text, and the labels you The weight valuesĪre estimated based on examples the model has seen during training. Prediction based on the model’s current weight values. Which part-of-speech tag to assign, or whether a word is a named entity – is a Every “decision” these components make – for example,

SpaCy’s tagger, parser, text categorizer and many other components are poweredīy statistical models. Train and update components on your own data and integrate custom models
