Configure and Secure Libraries

Libraries

Libraries are used to Add Documents to a Collection.

Libraries appear as tabs on the Add Documents panel, under Collections.

You can modify the libraries.xml file to:

  • add additional libraries, and

  • add security to libraries.

Types of Libraries

There are different types of libraries:

  • File System Library: (default tab: Server Library)

    A file location with folders and files. Provides a fast way to load large volumes of documents into Sintelix.

  • URL library: (default tab: From URLs)

    A list of URLs from which to extract text from web pages or files.

  • S3 Bucket: (optional)

    The S3 library allows you to ingest documents from an AWS S3 Bucket.

  • Email: (optional)

    The Email Library allows you to ingest emails from mail servers.

  • AppFileStore: (optional)

    The AppFileStore Library is a special library used by Sintelix Apps. It is similar to a File System Library and is only installed when running a Sintelix App. For more information, refer to the App documentation.

Adding Libraries

You can have more than one library of any type. For example, you may configure three file system libraries, if required.

When you add additional libraries, they appear as additional tabs in the Add Documents panel using the name assigned to the library.

Modify libraries.xml

To add libraries or add security to libraries you need to modify the libraries.xml file in the Sintelix database folder and then restart Sintelix.

File System Library

The File System library has the following xml syntax:

  • Class= library:FileSystem:

    • <rootpath> (required): A FileSystem library requires a rootpath defined to identify the location of the library. That location requires Sintelix to have read access.

    • <SecurityMetadata> (optional): To add user and/or group security to a Server Library. See Securing a Library

Securing a Library

You can add security settings to a File System library and an S3 library.

How it works

To be able to ingest a document from a secured library:

  • The user must have sufficient claims to be able to access the library

  • The project that the user is ingesting the document into must have the same or stricter security metadata than the library - this is protect a user from ingesting a document into an unsecured project or a project with looser permissions.

When a user does not have sufficient claims, they will not be able to see the library in the Sintelix User Interface.

URL List Library

You can add additional URL Libraries.

To add a URL List library use the following syntax:

  • Class: <UrlSystem> A system library used to add documents from a URL.

    • <cachepath> (optional): A UrlSystem library can include a location for storing cached files to reduce the need to repeatedly download files previously downloaded. The cachepath location requires read/write access.

    • <SafeMode> (optional): SafeMode is only required if you create a cache with a URL library. There are no controls over what a user can download from a URL, which is a risk. Disguises the filenames stored in the cache folder. For example, a user cannot identify an exe file.

S3 Bucket Library

The S3 Bucket library has the following xml syntax:

  • Class: library:S3:

    You need to indicate what credentials to use when connecting to the S3 library. The credentials are either:

    • <AWSStaticCredentials> : where you also need to provide an <AccessKeyId> and a <SecretAccessKey>

    • <AWSInstanceProfileCredentials> to use your existing AWS Instance credentials.

Email Library

The Email library has the following xml syntax:

  • class=library:Email: