Requirements Before Installing Sintelix

Operating Systems
Sintelix Agent works with:
- Windows 64-bit
- Linux (Ubuntu or CentOS)
- macOS (Agent Only)
Windows
Sintelix supports Windows operating systems that have not yet passed Microsoft's Extended Support end date, which is typically 10 years after release.
Linux
Sintelix supports Linux and provides a WAR file for serving via Tomcat or Jetty. Limited support is provided for Red Hat Enterprise Linux (RHEL) 8 and up, and for other Linux distributions related to CentOS. The Linux installation instructions assume a cleanly installed Linux machine.
Operating System | Standalone Sintelix (minimum) | Standalone Sintelix (maximum) | Sintelix Server (minimum) | Sintelix Server (maximum)
---|---|---|---|---
Windows | Windows 10 - 22H2 Pro | Windows 10 – latest release | Windows Server 2019 | Windows Server 2022
Ubuntu | 20.04.x LTS | 22.04.x LTS | 20.04.x LTS | 22.04.x LTS
CentOS | Stream 8 | Stream 9 | Stream 8 | Stream 9

Hardware Requirements
Component | Standalone Sintelix (minimum recommended) | Sintelix Server (minimum recommended)
---|---|---
Processor | 2 cores or more – Intel or AMD processor | 8 cores or more
Memory | 8 GB (with 4 GB spare RAM) | 20 GB + 1.5 GB per additional core
Hard Disk | 20 GB | 20 GB
Disk Space | 1.5 times the size of the original document(s) ingested | 1.5 times the size of the original document(s) ingested
Hard Disk
We recommend storing the Sintelix database on an SSD. This significantly increases the speed of network formation (which is computationally intensive), especially for large networks, and makes search functions at least an order of magnitude faster.
RAM
The Java Virtual Machine (JVM) created to host Sintelix is designed to use no more than 75% of the machine's RAM, less 512 MB. This means that a 4-core deployment requires at least 11.17 GB of RAM, and a 6-core deployment requires at least 15.17 GB of RAM.
Multiprocessor boards enable effective numbers of cores up to 256.
The allocation of RAM to Sintelix’s JVM can be overridden by the System Administrator.
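As a rough illustration of the default limit described above, the following sketch computes the JVM memory cap for a given amount of machine RAM, and the inverse. It assumes that "75% less 512 MB" means 0.75 × RAM − 512 MB and treats 512 MB as 0.5 GB. The function names are hypothetical and are not part of Sintelix.

```python
JVM_RAM_FRACTION = 0.75   # share of machine RAM the JVM may use (from the text above)
JVM_RESERVE_GB = 0.5      # 512 MB held back, expressed in GB (assumption: 1 GB = 1024 MB)


def jvm_memory_cap_gb(machine_ram_gb: float) -> float:
    """Return the default JVM memory cap for a machine with the given RAM."""
    if machine_ram_gb <= 0:
        raise ValueError("machine_ram_gb must be positive")
    return JVM_RAM_FRACTION * machine_ram_gb - JVM_RESERVE_GB


def machine_ram_needed_gb(jvm_memory_gb: float) -> float:
    """Return the machine RAM needed so that the default cap reaches jvm_memory_gb."""
    if jvm_memory_gb <= 0:
        raise ValueError("jvm_memory_gb must be positive")
    return (jvm_memory_gb + JVM_RESERVE_GB) / JVM_RAM_FRACTION
```

Under this reading, the 11.17 GB and 15.17 GB figures quoted above correspond to caps of about 7.88 GB and 10.88 GB, a difference of 3 GB for two extra cores, or 1.5 GB per additional core.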

Folders
Installation folder
Where the Sintelix program is installed.
Windows default: c:\Program Files (x86)\Sintelix
Server Library folder
This location is where source documents are loaded into the Sintelix database.
Accessing documents from the Server Library is the fastest way to load a large number of documents but requires a common file system directory where:
- users can copy documents, and
- Sintelix has read access.
Sintelix database folder
The folder where the Sintelix database is installed.
Windows default: c:\Sintelix database
Anti-virus software:
Sintelix makes frequent writes to its database location. Certain anti-virus software will automatically scan all the files written by Sintelix, which slows down Sintelix. It is highly recommended you exclude the Sintelix database location from real-time anti-virus scans. No executable or dangerous files are written there.

Processing Speeds
Speed
Sintelix has both document and network formation workflows, which can be run in series to provide incremental network formation.
The time taken to process and store documents increases linearly with the number of documents stored.
The time taken to create a network of any size increases faster than linearly (approximately the number of entities included to the power of 1.4).
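To make the scaling concrete, here is a minimal sketch of how much longer network formation might take as the entity count grows. It assumes the approximate n^1.4 relationship stated above holds across the whole range. The function name and default are illustrative only.

```python
def network_time_factor(entities_before: int, entities_after: int,
                        exponent: float = 1.4) -> float:
    """Return the approximate multiplier on network formation time
    when the number of entities changes from entities_before to entities_after."""
    if entities_before <= 0 or entities_after <= 0:
        raise ValueError("entity counts must be positive")
    return (entities_after / entities_before) ** exponent
```

For example, `network_time_factor(1_000_000, 2_000_000)` is about 2.6, so doubling the number of entities increases network formation time by roughly 2.6 times rather than 2 times.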
Approximate processing speeds
Approximate processing speeds for collections and networks are:
- Entity extraction (processing documents for storage and search in Sintelix)
  - 30 pages of text per core per second.
  - 2.6 million pages per core per day.
  - 80 million entities per core per day.
- Entity extraction with entity resolution (creating entity networks/graphs)
  - 10 pages of text per core per second.
  - 860,000 pages per core per day.
  - 26 million entities per core per day.
  - Roughly 50 GB of electronic (that is, not scanned) documents per core per day, but this varies greatly with document type (PDF tends to be fast, DOC tends to be slow).
  - Network formation gets progressively slower as the number (n) of entities increases (~n^1.4).
- Scanned documents (requiring OCR, that is, Optical Character Recognition: a method of converting images of typed, printed or handwritten text into machine-readable text)
  - Approximately 1 second per core per page, that is, very slow (varies with the OCR server used).
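The per-second and per-day figures above are related by simple arithmetic, which the following sketch reproduces and extends to a rough duration estimate. It assumes throughput scales linearly with the number of cores and ignores network formation time, so it gives an optimistic estimate at best. The function names are hypothetical.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400


def pages_per_core_per_day(pages_per_core_per_second: float) -> float:
    """Convert a per-second page rate into a per-day page rate for one core."""
    return pages_per_core_per_second * SECONDS_PER_DAY


def estimated_days(total_pages: int, cores: int,
                   pages_per_core_per_second: float) -> float:
    """Estimate processing time in days, assuming linear scaling across cores."""
    if cores <= 0 or pages_per_core_per_second <= 0:
        raise ValueError("cores and rate must be positive")
    return total_pages / (cores * pages_per_core_per_day(pages_per_core_per_second))
```

At 30 pages per core per second this gives 2,592,000 pages per core per day (the 2.6 million quoted above), and at 10 pages per core per second it gives 864,000. At the OCR rate of about 1 page per core per second, the same calculation gives 86,400 pages per core per day.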

Storage Size and Scalability
Storage Size
Sizing of storage requires care.
Sintelix document collections and networks are not limited in size by software considerations. All you need is enough disk space (see below).
File type | Ratio | File size of an original document (no images) | |
---|---|---|---|
.doc files | 3:1 | For a 3 MB document | 1 MB |
.docx files | 1.5:1 | For a 1.5 MB document | 1 MB |
.pdf files | 5:1 | For a 5 MB document | 1 MB |
The disk space required by Sintelix storage for both document collections and networks is about 1.5 times the size of the original document. We recommend that you verify this ratio by ingesting a balanced sample of documents into Sintelix and measuring the size of the Sintelix datastore before and after ingestion.
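One way to carry out the before-and-after measurement suggested above is to total the file sizes under the datastore folder before and after ingesting the sample. This is a minimal sketch, assuming the datastore is an ordinary directory tree readable by the user running the script. The function names are hypothetical and are not part of Sintelix.

```python
import os


def directory_size_bytes(path: str) -> int:
    """Return the total size in bytes of all regular files under path."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            file_path = os.path.join(root, name)
            # Skip symbolic links so linked files are not counted twice.
            if not os.path.islink(file_path):
                total += os.path.getsize(file_path)
    return total


def storage_ratio(size_before: int, size_after: int, sample_size: int) -> float:
    """Return datastore growth divided by the size of the ingested sample."""
    if sample_size <= 0:
        raise ValueError("sample_size must be positive")
    return (size_after - size_before) / sample_size
```

Call `directory_size_bytes` on the Sintelix database folder before and after ingestion, and on the folder holding the sample documents, then pass the three values to `storage_ratio`. A result near 1.5 would match the guidance above.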
Document Processing Deployments
In a document processing deployment, nothing is saved to a Sintelix database.
Stateless use of Sintelix is only relevant for document processing where each document is sent separately to Sintelix via the Document Processing web service. Sintelix returns the facts (entities, relationships, metadata) separately for each document. No information in the documents or calculated from the documents is stored in Sintelix.
Storage (disk space) | Allow at least 10 GB for Sintelix’s internal database for scratch purposes.

Software
Java
Windows:
The Windows installation includes its own version of Java, which is used only by Sintelix within the Sintelix server.
Linux:
Sintelix Version(s) | Minimum Java Version | Recommended Java Version
---|---|---
Up to 7.4.1 | 11 | 14
7.4.2 to 7.5.2 | 11 | 18
7.6 and above | 17 | 21
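The version table above can be expressed as a small lookup, which may be handy in a pre-installation check script. This is only a sketch: it assumes the second row starts at version 7.4.2, and it treats any version between 7.5.2 and 7.6 (which the table does not cover) like the second row. The function name is hypothetical.

```python
from typing import Tuple


def java_requirements(sintelix_version: Tuple[int, ...]) -> Tuple[int, int]:
    """Return (minimum, recommended) Java major versions for a Sintelix
    version given as a tuple of integers, for example (7, 5, 2)."""
    if sintelix_version >= (7, 6):
        return (17, 21)
    if sintelix_version >= (7, 4, 2):  # assumed lower bound of the second row
        return (11, 18)
    return (11, 14)
```

For example, `java_requirements((7, 5, 2))` returns `(11, 18)` and `java_requirements((7, 6))` returns `(17, 21)`.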
Browsers
We recommend upgrading your browser to the latest version as soon as it becomes available. Later versions of browsers often have higher levels of security and better application functionality.
Supported browsers include:
- Microsoft Edge
- Google Chrome
- Mozilla Firefox
Google Cloud Platform
Sintelix uses Google Cloud Platform to access APIs.
Microsoft Azure
The Media Processing Server uses Microsoft Azure for transcription.
Hosting Environment
Sintelix does not provide support for other services related to the hosting environment such as host network issues or Citrix/environment settings of the hosting system.
Third-party systems
If Sintelix is integrated into a third-party system, install Sintelix on the same machine as the third-party system or have a high-speed network between the computers hosting the two systems.

Internet Connectivity Requirements
Fetching documents by URL
Sintelix is capable of fetching documents by URL, using the http or https protocol. If Sintelix is isolated from the relevant network (for example, the Internet) then this functionality will not work.
Browsing documents with geographical information
When you use Sintelix to browse documents with geographical information (location names, coordinates) the web browser may attempt to connect to the Internet to show detailed maps (depending on your map provider selection). High detail maps will fail to function without an Internet connection.
Proxy server
If a proxy server is used to isolate Sintelix from the network, make sure it does not impose a restrictive upload size limit, because uploads of documents larger than the limit will fail.

Licensing Requirements
Connecting to the activation server
Depending on the Sintelix licensing agreement, Sintelix may need to establish a connection to the activation server. If this requirement applies, Sintelix must be able to connect to the Internet to reach the activation server. Activation occurs on startup and every 24 hours. Follow the instructions for configuring the firewall to allow access to the Sintelix server URL and port required to maintain activation. See Preparing to Install Sintelix.
MAC address restriction
Depending on the Sintelix licensing agreement, Sintelix may only start on computers with network adapters that have a license-defined MAC address. If this restriction is in place, ensure that you install Sintelix on matching hardware. When providing a MAC address, we recommend using an Ethernet adapter address instead of a Wi-Fi adapter address.
Dongle
Sintelix utilises USB Dongles to support verification of Activation Keys when running Sintelix in an offline environment. Refer to Dongle Licensing.