Entity Extraction Languages

The following table lists the languages that Sintelix supports, along with the corresponding language plugin, the supported text reference types and any system limitations for each:

Language

Plugin

Description

Supported Text Reference Types

System Limitations

Chinese (ZH)

Plugin Chinese

Chinese Language Plugin provides Named Entity Extraction support for Chinese (ZH) language documents.

Person
Organisation
Location
DateTime (non-Lexical)

None

Dutch (NL)

Plugin EU

European Languages Plugin provides Named Entity Extraction support for Dutch (NL) language documents.

Person
Organisation
Location
DateTime
Money
Time
Person-title
Position

None

French (FR)

Plugin EU

European Languages Plugin provides Named Entity Extraction support for French (FR) language documents.

Person
Organisation
Location
DateTime
Money
Time
Person-title
Position

None

German (DE)

Plugin EU

European Languages Plugin provides Named Entity Extraction support for German (DE) language documents.

Person
Organisation
Location
DateTime
Money
Time
Person-title
Position

None

Italian (IT)

Plugin EU

European Languages Plugin provides Named Entity Extraction support for Italian (IT) language documents.

Person
Organisation
Location
DateTime
Money
Time
Person-title
Position

None

Spanish (ES)

Plugin EU

European Languages Plugin provides Named Entity Extraction support for Spanish (ES) language documents.

Person
Organisation
Location
DateTime
Money
Time
Person-title
Position

None

Finnish (FI)

Plugin Finnish

Finnish Morphology Plugin provides Finnish tokenisation and morphological analysis, allowing custom configuration of Finnish entity extraction. It does not extract any entities by itself.

None

None

Bahasa Indonesia (ID)

Plugin Indonesian (beta)

Indonesian Language Plugin provides Named Entity Extraction for Bahasa Indonesia (ID) language documents.

Person
Organisation
Location
Position
Person-title

 

Arabic (AR)

Plugin Arabic

Arabic Language Plugin provides tokenisation, transliteration and limited entity extraction for Arabic (AR) documents.

Person
Organisation
Location

Can only be installed on Windows deployments

Russian (RU)

Plugin Russian

Russian Language Plugin provides Named Entity Extraction for Russian (RU) language documents.

Person
Organisation
Location

Can only be installed on Windows deployments
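
When planning a deployment, it can help to query the table above programmatically, for example to find which languages cover a given text reference type or which plugins are restricted to Windows. The following is a minimal sketch that copies the table into a plain Python dictionary. It is not a Sintelix API: the names SUPPORT, languages_supporting and windows_only_languages are hypothetical, the key "FI" is an assumed code for Finnish, and "unspecified" stands in for the Indonesian row, whose limitation is not stated in the table.

```python
# Illustrative only: a plain-Python copy of the table above, not a Sintelix API.
BASIC = frozenset({"Person", "Organisation", "Location"})
EU = BASIC | {"DateTime", "Money", "Time", "Person-title", "Position"}

# language code -> (plugin, supported text reference types, system limitation)
# "FI" is an assumed key for Finnish; "unspecified" marks the Indonesian row,
# whose limitation is not stated in the table.
SUPPORT = {
    "ZH": ("Plugin Chinese", BASIC | {"DateTime (non-Lexical)"}, "none"),
    "NL": ("Plugin EU", EU, "none"),
    "FR": ("Plugin EU", EU, "none"),
    "DE": ("Plugin EU", EU, "none"),
    "IT": ("Plugin EU", EU, "none"),
    "ES": ("Plugin EU", EU, "none"),
    "FI": ("Plugin Finnish", frozenset(), "none"),
    "ID": ("Plugin Indonesian (beta)", BASIC | {"Position", "Person-title"}, "unspecified"),
    "AR": ("Plugin Arabic", BASIC, "windows-only"),
    "RU": ("Plugin Russian", BASIC, "windows-only"),
}


def languages_supporting(entity_type):
    """Return the sorted language codes whose plugin extracts entity_type."""
    return sorted(
        code for code, (_, types, _) in SUPPORT.items() if entity_type in types
    )


def windows_only_languages():
    """Return the sorted language codes whose plugin requires Windows."""
    return sorted(
        code for code, (_, _, limit) in SUPPORT.items() if limit == "windows-only"
    )


if __name__ == "__main__":
    print(languages_supporting("Money"))   # ['DE', 'ES', 'FR', 'IT', 'NL']
    print(windows_only_languages())        # ['AR', 'RU']
```

Keeping the European languages on a shared EU set mirrors the table, where all five rows list the same eight text reference types. Finnish maps to an empty set because its plugin does not extract any entities by itself.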