Entity Extraction Scripts
Entity Extraction Scripts (EESs) are used to configure Sintelix to extract complex items of information from text. An EES is a Sintelix configuration for marking up and creating connections between document text using a highly configurable scripting syntax. EESs are composed in the Sintelix Configuration User Interface.
EESs contain lists of Rules. Rules are formed of a Matching Pattern and a set of Output Phrases. Rules operate in sequence to extract whatever information is required from text.
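The relationship between Rules, Matching Patterns and Output Phrases can be pictured with a small conceptual model. The Python sketch below is not EES syntax and does not use any Sintelix API. The `Rule` class, the `run_rules` function and the use of regular expressions as patterns are all assumptions made purely for illustration.

```python
import re
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Rule:
    # Hypothetical stand-in for a Matching Pattern (a plain regex here).
    pattern: str
    # Hypothetical stand-in for Output Phrases (simple entity labels here).
    outputs: List[str]


def run_rules(rules: List[Rule], text: str) -> List[Tuple[str, str]]:
    """Apply each rule in order and collect (label, matched text) pairs."""
    found = []
    for rule in rules:
        for match in re.finditer(rule.pattern, text):
            for label in rule.outputs:
                found.append((label, match.group(0)))
    return found


# Two illustrative rules, applied in sequence.
rules = [
    Rule(r"\b\d{4}-\d{2}-\d{2}\b", ["Date"]),
    Rule(r"\bCase No\. \d+\b", ["CaseNumber"]),
]
text = "Case No. 1234 was filed on 2021-03-15."
results = run_rules(rules, text)
print(results)
```

Because the rules run in sequence, results appear in rule order rather than in the order the matches occur in the text.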
EESs can be triggered to run only if a document has a particular tag. Documents can be tagged automatically with Sintelix's built-in tagger/classifier.
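Tag-based triggering can be thought of as a filter applied before any script runs. The following Python sketch shows the idea only. The script names, the tag names and the `scripts_to_run` function are hypothetical, and the rule that a script with no required tag runs on every document is an assumption rather than documented Sintelix behaviour.

```python
def scripts_to_run(document_tags, scripts):
    """Return the names of scripts whose required tag is on the document.

    A required tag of None is treated here as "run on every document"
    (an assumption made for this sketch).
    """
    return [
        name
        for name, required_tag in scripts
        if required_tag is None or required_tag in document_tags
    ]


# Hypothetical scripts, each paired with the tag that triggers it.
scripts = [
    ("extract_citations", None),
    ("extract_case_numbers", "legal"),
    ("extract_drill_holes", "geology"),
]

# A document that the tagger/classifier has labelled "legal".
selected = scripts_to_run({"legal"}, scripts)
print(selected)
```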
EESs can take advantage of other configuration capabilities in Sintelix:
- Word Lists
- Classification: a configuration used for automatically adding document tags to Sintelix documents based on a pre-trained model.
- Tagging: a configuration used for automatically adding document tags to Sintelix documents based on a pre-trained model.
What you can do with Entity Extraction Scripts
EESs are a highly productive tool for configuring Sintelix to extract specific information from text.
Example applications include:
- Document metadata extraction
- Citation and reference extraction
- Extraction of entities and references relevant to a topic, such as:
  - legal cases
  - police records
  - geological exploration and survey reports
  - patents
  - open source intelligence collections (a collection is a container for storing and organising ingested files and documents; only the textual content is stored in collections, not the original files and documents)
  - email metadata and thread analysis.