Configuration property fixed at build time - All other configuration properties are overridable at runtime

Configuration property

Type

Default

The resource path within the application artifact to the tika-config.xml file.

Environment variable: QUARKUS_TIKA_TIKA_CONFIG_PATH

string

Comma separated list of the parsers which must be supported.

Most document formats recognized by Apache Tika are supported by default, but this affects the application's memory use and the native executable size. You can list only the required parsers in tika-config.xml to minimize the number of parsers loaded into memory, but using this property is recommended to achieve both optimizations.

Either abbreviated or full parser class names can be used. Only the PDF and OpenDocument format parsers can be listed using the reserved 'pdf' and 'odf' abbreviations. Custom class name abbreviations have to be used for all other parsers. For example:

quarkus.tika.parsers = pdf
quarkus.tika.parsers = pdf,odf

This property will have no effect if the 'tikaConfigPath' property has been set.

Environment variable: QUARKUS_TIKA_PARSERS

string

Controls how the content of embedded documents is parsed. By default, it is appended to the main document content. Setting this property to false makes the content of each embedded document available separately.

Environment variable: QUARKUS_TIKA_APPEND_EMBEDDED_CONTENT

boolean

true

Configuration of the individual parsers. For example:

quarkus.tika.parsers = pdf,odf
quarkus.tika.parser-options.pdf.sort-by-position = true

Environment variable: QUARKUS_TIKA_PARSER_OPTIONS

Map<String,Map<String,String>>
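
The environment variable names listed in this reference appear to follow the usual MicroProfile Config naming rule: every character that is not alphanumeric or an underscore is replaced with an underscore, and the result is upper-cased. The following is a minimal sketch of that rule only, not the code Quarkus itself uses; the class name EnvVarName and the method toEnvVar are hypothetical.

```java
import java.util.Locale;

public class EnvVarName {

    // Hypothetical helper: replaces every character that is not a letter,
    // a digit, or an underscore with '_', then upper-cases the result.
    static String toEnvVar(String propertyName) {
        return propertyName
                .replaceAll("[^A-Za-z0-9_]", "_")
                .toUpperCase(Locale.ROOT);
    }

    public static void main(String[] args) {
        // Property names taken from the examples in this reference.
        System.out.println(toEnvVar("quarkus.tika.parsers"));
        System.out.println(toEnvVar("quarkus.tika.parser-options.pdf.sort-by-position"));
    }
}
```

Under this rule, a nested key such as quarkus.tika.parser-options.pdf.sort-by-position maps to a name that starts with the QUARKUS_TIKA_PARSER_OPTIONS prefix shown above.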

Full parser class name for a given parser abbreviation. For example:

quarkus.tika.parsers = classparser
quarkus.tika.parser.classparser = org.apache.tika.parser.asm.ClassParser

Environment variable: QUARKUS_TIKA_PARSER

Map<String,String>
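
The interplay between the reserved abbreviations, custom abbreviations, and full class names can be sketched as a small lookup. This is an illustration under stated assumptions, not the extension's actual implementation: the class ParserNameResolver and the method resolve are hypothetical, the two reserved entries are assumed to map to Apache Tika's PDFParser and OpenDocumentParser classes, and treating any name that contains a dot as a full class name is a simplification made for this sketch.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

public class ParserNameResolver {

    // Reserved abbreviations named in this reference; the target class
    // names are an assumption made for this sketch.
    static final Map<String, String> RESERVED = Map.of(
            "pdf", "org.apache.tika.parser.pdf.PDFParser",
            "odf", "org.apache.tika.parser.odf.OpenDocumentParser");

    // Resolves a comma-separated parser list to full class names.
    // 'custom' plays the role of the quarkus.tika.parser.* map.
    static List<String> resolve(String parsers, Map<String, String> custom) {
        List<String> classNames = new ArrayList<>();
        for (String raw : parsers.split(",")) {
            String name = raw.trim();
            if (name.isEmpty()) {
                continue; // ignore empty entries such as a trailing comma
            }
            if (RESERVED.containsKey(name)) {
                classNames.add(RESERVED.get(name));
            } else if (custom.containsKey(name)) {
                classNames.add(custom.get(name));
            } else if (name.contains(".")) {
                classNames.add(name); // assumed to be a full class name already
            } else {
                throw new IllegalArgumentException("Unknown parser abbreviation: " + name);
            }
        }
        return classNames;
    }

    public static void main(String[] args) {
        // Mirrors: quarkus.tika.parser.classparser = org.apache.tika.parser.asm.ClassParser
        Map<String, String> custom =
                Map.of("classparser", "org.apache.tika.parser.asm.ClassParser");
        System.out.println(resolve("pdf,classparser", custom));
    }
}
```

In this sketch, an abbreviation that is neither reserved nor defined in the custom map is rejected. This mirrors the requirement that every parser other than PDF and OpenDocument needs its own abbreviation entry.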