Process annotated text — process_text • menzerath

Process annotated text

process_text(
  text,
  construct_delimiter,
  constituent_delimiter,
  subconstituent_delimiter,
  discontinued_constituent_delimiter_begin,
  discontinued_constituent_delimiter_end
)

Arguments

text: string
construct_delimiter: char
constituent_delimiter: char
subconstituent_delimiter: char
discontinued_constituent_delimiter_begin: char
discontinued_constituent_delimiter_end: char

Value

data.frame A data.frame with three columns: constituents, subconstituents and constructs

Examples

annotated_text <- paste0("Greece* {which* is* the* most* beau*ti*ful* coun*try*",
                        " +I* know* +}was* the* first* place* +we* vi*si*ted* in*",
                        "Eu*ro*pe* +.")
 counts_df <- process_text(annotated_text, "+", " ", "*", "{", "}")