Configuring Full-Text Search for Files Stored in File System or in Oracle Database

The samples provided with the SDK use Apache Lucene as the indexing engine and Apache Tika as the content analysis toolkit.

The server implementation searches both file names and file content. This includes the content of Microsoft Office documents as well as any other documents whose format is supported by Apache Tika, such as LibreOffice, OpenOffice, and Adobe Reader documents.
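The idea of matching a query against both the file name and the extracted text can be illustrated with a minimal sketch. This is not the samples' implementation: it swaps Lucene for a plain in-memory linear scan and assumes the text has already been extracted (the job Tika does in the samples). The class and method names below are hypothetical.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical illustration only: a linear scan standing in for a real Lucene index.
final class NameAndContentSearchSketch {

    // File name -> text extracted from the file (assumed to be produced elsewhere).
    private final Map<String, String> textByFileName = new LinkedHashMap<>();

    // Registers a file together with its already-extracted text.
    void add(String fileName, String extractedText) {
        textByFileName.put(fileName, extractedText == null ? "" : extractedText);
    }

    // Returns the names of files whose name OR content contains the term (case-insensitive).
    List<String> search(String term) {
        String needle = term.toLowerCase(Locale.ROOT);
        List<String> hits = new ArrayList<>();
        for (Map.Entry<String, String> entry : textByFileName.entrySet()) {
            String name = entry.getKey().toLowerCase(Locale.ROOT);
            String text = entry.getValue().toLowerCase(Locale.ROOT);
            if (name.contains(needle) || text.contains(needle)) {
                hits.add(entry.getKey());
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        NameAndContentSearchSketch sketch = new NameAndContentSearchSketch();
        sketch.add("report.docx", "Quarterly revenue summary");
        sketch.add("revenue.txt", "No matching words inside");
        sketch.add("notes.odt", "Miscellaneous notes");
        // One file matches by content, another by name.
        System.out.println(sketch.search("revenue"));
    }
}
```

A real index tokenizes and ranks results instead of doing substring matching, so treat this only as a picture of the "name or content" behavior described above.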

Searching Document Content

You don't need to take any additional steps to enable search by document content; it is enabled by default. To disable it, remove the Tika Parsers dependency from the sample's Maven configuration and rebuild the sample.

<dependency>
    <groupId>org.apache.tika</groupId>
    <artifactId>tika-parsers</artifactId>
    <version>1.22</version>
</dependency>
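After rebuilding, one way to confirm that the parsers are really gone from the classpath is a small runtime check. The sketch below is a hypothetical helper, not part of the SDK samples. It assumes that org.apache.tika.parser.pdf.PDFParser, a class shipped in the tika-parsers 1.x artifact, is a reasonable marker for the presence of that artifact.

```java
// Hypothetical helper, not part of the SDK samples.
final class TikaParsersCheck {

    // Assumption: this class from tika-parsers 1.x is used only as a presence marker.
    static final String MARKER_CLASS = "org.apache.tika.parser.pdf.PDFParser";

    // Returns true if the named class can be located on the classpath (without initializing it).
    static boolean isClassPresent(String className) {
        try {
            Class.forName(className, false, TikaParsersCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        if (isClassPresent(MARKER_CLASS)) {
            System.out.println("tika-parsers is on the classpath");
        } else {
            System.out.println("tika-parsers is not on the classpath");
        }
    }
}
```

If the check still reports the parsers as present after the dependency has been removed, another dependency may be pulling them in transitively.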

 

Next Article:

WebDAV Server Samples Problems and Troubleshooting