This function extracts named entities from texts, based on the entity tag
ent attributes of documents objects parsed by spaCy (see
a character object or a TIF-compliant corpus data.frame (see https://github.com/ropenscilabs/tif)
type of returned object, either
type of named entities, either
TRUE, the processing is parallelized
using spaCy's architecture (https://spacy.io/api)
data.frame of tokens
When the option
output = "data.frame" is selected, the
function returns a
data.frame with the following fields.
type of entity (e.g.
serial number ID of starting token.
This number corresponds with the number of
data.frame returned from
spacy_tokenize(x) with default options.
of words (tokens) included in a named entity (e.g. for an entity, "New York
length = 4)