Tabulated format to represent an annotated document’s entities. The format is a close derivative of the EntitiesTsv’s, with two single but important differences:
- The (non-labeled) “outside” text is NOT written
- ALL labeled entities are written. Because of this, overlapping entities are supported
The document text’s sections (parts) are still separated by new line.
Example
The format is best explained with an example 🙂 (contrast with that of EntitiesTsv’s):
-
From the annotated input document (note that it contains overlapping entities):
-
The resulting output is (the
»
character represents a tab,·
a space, and¬
a new line):