IBM watsonx.ai Chat and Generation Models

IBM watsonx.ai enables the development of generative AI applications using foundation models from IBM and Hugging Face.

This extension supports IBM watsonx as a service on IBM Cloud only.

Prerequisites

To use watsonx.ai models, configure the following required values in your application.properties file:

Base URL

The base-url depends on the region of your service instance:

Dallas: https://us-south.ml.cloud.ibm.com
Frankfurt: https://eu-de.ml.cloud.ibm.com
London: https://eu-gb.ml.cloud.ibm.com
Tokyo: https://jp-tok.ml.cloud.ibm.com
Sydney: https://au-syd.ml.cloud.ibm.com
Toronto: https://ca-tor.ml.cloud.ibm.com
Mumbai - https://ap-south-1.aws.wxai.ibm.com

quarkus.langchain4j.watsonx.base-url=https://us-south.ml.cloud.ibm.com

Project ID

Obtain the Project Id via:

Visit https://dataplatform.cloud.ibm.com/projects/?context=wx
Open your project and click the Manage tab.
Copy the Project ID from the Details section.

quarkus.langchain4j.watsonx.project-id=23d...

You may use the optional space-id as an alternative.

API Key

Create an API key by visiting https://cloud.ibm.com/iam/apikeys and clicking Create +.

quarkus.langchain4j.watsonx.api-key=your-api-key

You can also use the QUARKUS_LANGCHAIN4J_WATSONX_API_KEY environment variable.

Dependency

<dependency>
  <groupId>io.quarkiverse.langchain4j</groupId>
  <artifactId>quarkus-langchain4j-watsonx</artifactId>
  <version>1.8.0.CR2</version>
</dependency>

Even better, if you use the Quarkus platformn BOM (default for projects generated), add the Quarkus Langchain4J BOM and all dependency versions will align:

    <dependencyManagement>
        <dependencies>
            <dependency>
                <groupId>${quarkus.platform.group-id}</groupId>
                <artifactId>${quarkus.platform.artifact-id}</artifactId>
                <version>${quarkus.platform.version}</version>
                <type>pom</type>
                <scope>import</scope>
            </dependency>
            <dependency>
                <groupId>${quarkus.platform.group-id}</groupId>
                <artifactId>quarkus-langchain4j-bom</artifactId> (1)
                <version>${quarkus.platform.version}</version> (2)
                <type>pom</type>
                <scope>import</scope>
            </dependency>
        </dependencies>
    </dependencyManagement>

    <dependencies>
      <dependency>
        <groupId>io.quarkiverse.langchain4j</groupId>
        <artifactId>quarkus-langchain4j-watsonx</artifactId>
        (3)
      </dependency>
    </dependencies>

1	In your `dependencyManagement` section, add the `quarkus-langchain4j-bom`
2	Inherit the version from your platform version
3	Voilà, no need for version alignment anymore

If no other extension is installed, AI Services will automatically use this provider.

Chat Model

IBM watsonx.ai provides a variety of foundation models for text generation, chat-based interactions, and instruction-following tasks. These include both IBM-built models and third-party / community models. Quarkus integrates the LangChain4j WatsonxChatModel, exposing it as ChatModel / StreamingChatModel bean.

See the full model catalog:

IBM foundation models: Model Catalog – IBM
Third-party / community foundation models: Model Catalog – Third-Party

Configuration

Configure the chat model in your application.properties:

# Base Watsonx configuration
quarkus.langchain4j.watsonx.base-url=${BASE_URL}
quarkus.langchain4j.watsonx.api-key=${API_KEY}
quarkus.langchain4j.watsonx.project-id=${PROJECT_ID}

# Chat model
quarkus.langchain4j.watsonx.chat-model.model-name=ibm/granite-4-h-small

# Optional generation parameters
quarkus.langchain4j.watsonx.chat-model.max-output-tokens=0
quarkus.langchain4j.watsonx.chat-model.temperature=0.2

If a chat model is configured, Quarkus automatically registers a ChatModel / StreamingChatModel bean.

Injection

@Inject
ChatModel chatModel;

@Inject
StreamingChatModel chatModel;

Enabling Thinking / Reasoning Output

Some foundation models can include internal reasoning (also referred to as thinking) steps as part of their responses. Depending on the model, this reasoning may be embedded in the same text as the final response, or returned separately in a dedicated field from watsonx.ai.

To correctly enable and capture this behavior in Quarkus, you must configure the chat model with either thinking.tags (for ExtractionTags) or thinking.effort / thinking (for ThinkingEffort or boolean flag) in your application.properties. This ensures that LangChain4j can automatically extract the reasoning and response content from the model output.

Models that return reasoning and response together

Use ExtractionTags when the model outputs reasoning and response in the same text string. The tags define XML-like markers used to separate the reasoning from the final response.

# Example configuration for ibm/granite-3-3-8b-instruct
quarkus.langchain4j.watsonx.chat-model.model-name=ibm/granite-3-3-8b-instruct
quarkus.langchain4j.watsonx.chat-model.thinking.tags.think=think
quarkus.langchain4j.watsonx.chat-model.thinking.tags.response=response

Behavior

If both tags are specified, they are used directly to extract reasoning and response segments.
If only the reasoning tag is specified, everything outside that tag is considered the response.

@Inject
ChatModel thinkingChatModel;

var chatResponse = thinkingChatModel.chat(UserMessage.from("Why is the sky blue?"));
System.out.println(chatResponse.aiMessage().thinking());
System.out.println(chatResponse.aiMessage().text());

Models that return reasoning and response separately

For models that already return reasoning and response as separate fields, use the thinking.effort property to control how much reasoning the model applies during generation, or enable it using the boolean flag.

# Example configuration for openai/gpt-oss-120b
quarkus.langchain4j.watsonx.chat-model.model-name=openai/gpt-oss-120b
quarkus.langchain4j.watsonx.chat-model.thinking.effort=HIGH

Streaming Example

@Inject
StreamingChatModel streamingChatModel;

List<ChatMessage> messages = List.of(UserMessage.from("Why is the sky blue?"));

ChatRequest chatRequest = ChatRequest.builder()
    .messages(messages)
    .build();

streamingChatModel.chat(chatRequest, new StreamingChatResponseHandler() {

    @Override
    public void onPartialResponse(String partialResponse) {
        System.out.println("Partial: " + partialResponse);
    }

    @Override
    public void onPartialThinking(PartialThinking partialThinking) {
        System.out.println("Thinking: " + partialThinking.content());
    }

    @Override
    public void onCompleteResponse(ChatResponse completeResponse) {
        System.out.println("Complete: " + completeResponse);
    }

    @Override
    public void onError(Throwable error) {
        error.printStackTrace();
    }
});

Ensure that the selected model supports reasoning output.
Use thinking.tags for models that embed reasoning and response in a single text string.
Use thinking.effort or thinking=true for models that already separate reasoning and response automatically.

Embedding Model

IBM watsonx.ai provides multiple embedding models for converting text into vector representations suitable for semantic search, RAG pipelines, similarity comparison, and vector database integrations.

Quarkus integrates the LangChain4j WatsonxEmbeddingModel, exposing it as EmbeddingModel bean.

A list of supported embedding models can be found here:

Embedding Models

Configuration

Configure the embedding model by specifying its model name in application.properties:

# Base Watsonx configuration
quarkus.langchain4j.watsonx.base-url=${BASE_URL}
quarkus.langchain4j.watsonx.api-key=${API_KEY}
quarkus.langchain4j.watsonx.project-id=${PROJECT_ID}

# Embedding model configuration
quarkus.langchain4j.watsonx.embedding-model.model-name=ibm/slate-125m-english-rtrvr

If an embedding model is configured, Quarkus will automatically create and register a EmbeddingModel bean.

Injection

@Inject
EmbeddingModel embeddingModel;

Usage

Generating an embedding for a single text:

var response = embeddingModel.embed("Hello Watsonx!");

assertNotNull(response);
var embedding = response.content();

System.out.println("Embedding size: " + embedding.vector().length());

Generating embeddings for multiple text segments:

var embeddings = embeddingModel.embedAll(
    List.of(
        TextSegment.from("First document"),
        TextSegment.from("Second document")
    )
);

Scoring Model

IBM watsonx.ai provides scoring (reranking) models that evaluate the relevance between a query and a piece of text. Quarkus integrates the LangChain4j WatsonxScoringModel, exposing it as ScoringModel implementation.

Scoring models are especially useful for RAG pipelines, document ranking, and semantic relevance evaluation.

A list of supported scoring/reranker models is available here:

Rerank Models

Configuration

Configure the model by specifying its name in application.properties:

# Base Watsonx configuration
quarkus.langchain4j.watsonx.base-url=${BASE_URL}
quarkus.langchain4j.watsonx.api-key=${API_KEY}
quarkus.langchain4j.watsonx.project-id=${PROJECT_ID}

# Scoring model configuration
quarkus.langchain4j.watsonx.scoring-model.model-name=cross-encoder/ms-marco-minilm-l-12-v2

If an score model is configured, Quarkus will automatically create and register a ScoringModel bean.

Injection

@Inject
ScoringModel scoringModel;

Usage

You can score a single text against a query:

var response = scoringModel.score("Rerank this!", "Test to rerank 1");

assertNotNull(response);
assertNotNull(response.content());

double score = response.content();
System.out.println("Score: " + score);

Or score multiple documents at once:

var scores = scoringModel.scoreAll(
    List.of(
        TextSegment.from("Document A"),
        TextSegment.from("Document B")
    ),
    "User query"
);

System.out.println(scores); // list of relevance scores

Moderation Model

IBM watsonx.ai provides moderation capabilities through multiple detectors that can identify unsafe, sensitive, or policy-violating content. Quarkus integrates the LangChain4j WatsonxModerationModel, exposing each detector type as a dedicated configuration group.

Supported detector types include:

PII – Detects Personally Identifiable Information
HAP – Detects hate, abuse, or profanity
Granite Guardian – Detects harmful or risky content

Each detector can be enabled individually.

Configuration

Enable detectors in application.properties using their dedicated flags:

# Base Watsonx configuration
quarkus.langchain4j.watsonx.base-url=${BASE_URL}
quarkus.langchain4j.watsonx.api-key=${API_KEY}
quarkus.langchain4j.watsonx.project-id=${PROJECT_ID}

# Enable specific moderation detectors
quarkus.langchain4j.watsonx.moderation-model.hap.enabled=true
quarkus.langchain4j.watsonx.moderation-model.pii.enabled=true
quarkus.langchain4j.watsonx.moderation-model.granite-guardian.enabled=true

Each detector configuration group may also expose additional settings depending on its capabilities. If an score model is configured, Quarkus will automatically create and register a ModerationModel bean.

Injection

@Inject
ModerationModel moderationModel;

Usage

var response = moderationModel.moderate("Some text to analyze");

boolean flagged = response.content().flagged();
Map<String, Object> metadata = response.metadata();

System.out.println("Flagged? " + flagged);
System.out.println("Metadata: " + metadata);

Metadata

A moderation response includes metadata describing the detection:

Key	Description
detection	The assigned label for the detected content
detection_type	Detector that triggered the flag
start	Start index of the detected segment
end	End index of the detected segment
score	Confidence score

Key

Description

detection

The assigned label for the detected content

detection_type

Detector that triggered the flag

start

Start index of the detected segment

end

End index of the detected segment

score

Confidence score

Example:

System.out.println(metadata.get("detection_type"));
System.out.println(metadata.get("score"));

Text Extraction

The TextExtraction feature enables developers to extract text from high-value business documents stored in IBM Cloud Object Storage. Extracted text can be used for AI processing, key information identification, or further document analysis.

The API supports text extraction from the following file types:

PDF
GIF
JPG
PNG
TIFF
BMP
DOC
DOCX
HTML
JFIF
PPT
PPTX

The extracted text can be output in the following formats:

JSON
MARKDOWN
HTML
PLAIN_TEXT
PAGE_IMAGES

Configuration

To enable TextExtraction in your application, configure the following properties:

quarkus.langchain4j.watsonx.base-url=${BASE_URL}
quarkus.langchain4j.watsonx.api-key=${API_KEY}
quarkus.langchain4j.watsonx.project-id=${PROJECT_ID}
quarkus.langchain4j.watsonx.text-extraction.cos-url=<base-url>
quarkus.langchain4j.watsonx.text-extraction.document-reference.connection=<connection-id>
quarkus.langchain4j.watsonx.text-extraction.document-reference.bucket-name=<bucket-name>
quarkus.langchain4j.watsonx.text-extraction.results-reference.connection=<connection-id>
quarkus.langchain4j.watsonx.text-extraction.results-reference.bucket-name=<bucket-name>

cos-url: The endpoint where the IBM Cloud Object Storage instance is deployed. To find the appropriate value, refer to the IBM Cloud Object Storage endpoint table.
document-reference.connection: The connection asset ID containing credentials to access the source storage.
document-reference.bucket-name: The bucket where documents to be processed will be uploaded.
results-reference.connection: The connection asset ID containing credentials to access the output storage.
results-reference.bucket-name: The bucket where extracted text documents will be saved as new files.

The document reference properties define the source storage for input and uloaded files, while the results reference properties specify where the extracted content is stored. Both can refer to the same bucket or different ones.

For more information on how to get the connection parameter for the document-reference and results-reference you can refer to the documentation at this link.

Using Text Extraction

The TextExtraction class provides multiple methods for extracting text from documents. You can either extract text from an existing file in IBM Cloud Object Storage or upload a file and extract its content. To use TextExtraction, you need to inject an instance into your application. If multiple configurations are defined, you can specify the appropriate one using the @ModelName qualifier.

@Inject
TextExtractionService textExtraction;

@Inject
@ModelName("custom")
TextExtractionService customTextExtraction;

You can start the extraction process in two ways.

First, if the document is already stored in IBM Cloud Object Storage, you can initiate the extraction by using the following method:

TextExtractionResponse response = textExtraction.startExtraction("path/to/document");
String id = response.metadata().id();

Alternatively, if you’re working with a local file, you can upload it and start the extraction process with:

File file = new File("path/to/document");
File response = textExtraction.uploadAndStartExtraction(file);
String id = response.metadata().id();

After starting the extraction, you can check its status by calling:

TextExtractionResponse response = textExtraction.fetchExtractionRequest(extractionId);
String result = response.entity().results().status();

If you need to extract and retrieve the text immediately, you have two options.

You can either extract text from an existing file directly:

String extractedText = textExtraction.extractAndFetch("path/to/document");

Or upload the file and retrieve the extracted text immediately:

File file = new File("path/to/document");
String extractedText = textExtraction.uploadExtractAndFetch(file);

All extraction methods can accept a Parameters object to customize the behavior of the text extraction request.

The Parameters object allows fine-grained control over the extraction process.

var parameters = TextExtractionParameters.builder()
        .removeOutputFile(true)
        .removeUploadedFile(true)
        .requestedOutputs(MD)
        .mode(Mode.HIGH_QUALITY)
        .autoRotationCorrection(false)
        .outputDpi(16)
        .build()

File file = new File("path/to/document.pdf");
String extractedText = textExtraction.uploadExtractAndFetch(file, parameters));

Text Classification

The TextClassification feature enables you to classify text in your documents to identify whether the data in your file matches the key-value pair format in schema definitions for various document types.

By pre-processing the document, you can quickly verify whether a document is classified into one of the pre-defined schemas or a custom schema without performing key-value pair extraction, which can be a longer, resource-intensive process. You can then decide which schema to use to correctly extract text into fields in a key-value pair format.

The API supports text classification from the following file types:

BMP
DOC
DOCX
GIF
HTML
JFIF
JPG
MARKDOWN
PDF
PNG
PPT
PPTX
TIFF
XLSX

Configuration

To enable TextClassification in your application, configure the following properties:

quarkus.langchain4j.watsonx.base-url=${BASE_URL}
quarkus.langchain4j.watsonx.api-key=${API_KEY}
quarkus.langchain4j.watsonx.project-id=${PROJECT_ID}
quarkus.langchain4j.watsonx.text-classification.cos-url=<base-url>
quarkus.langchain4j.watsonx.text-classification.document-reference.connection=<connection-id>
quarkus.langchain4j.watsonx.text-classification.document-reference.bucket-name=<bucket-name>

cos-url: The endpoint where the IBM Cloud Object Storage instance is deployed. To find the appropriate value, refer to the IBM Cloud Object Storage endpoint table.
document-reference.connection: The connection asset ID containing credentials to access the source storage.
document-reference.bucket-name: The bucket where documents to be processed will be uploaded (or are already stored).

For more information on how to get the connection parameter for the document-reference you can refer to the documentation at this link.

Using Text Classification

The TextClassificationService class provides multiple methods for classifying documents. You can either classify text from an existing file in IBM Cloud Object Storage or upload a file and classify its content. To use TextClassificationService, you need to inject an instance into your application. If multiple configurations are defined, you can specify the appropriate one using the @ModelName qualifier.

@Inject
TextClassificationService classificationService;

@Inject
@ModelName("custom")
TextClassificationService customClassificationService;

You can start the classification process in two ways.

First, if the document is already stored in IBM Cloud Object Storage, you can initiate the classification by using the following method:

TextClassificationResponse response = classificationService.startClassification("path/to/document");
String id = response.metadata().id();

Alternatively, if you’re working with a local file, you can upload it and start the classification process with:

File file = new File("path/to/document");
TextClassificationResponse response = classificationService.uploadAndStartClassification(file);
String id = response.metadata().id();

After starting the classification, you can check its status by calling:

TextClassificationResponse response = classificationService.fetchClassificationRequest(classificationId);
String result = response.entity().results().status();

If you need to classify and retrieve the results immediately, you have two options.

You can either classify an existing file directly:

ClassificationResult result = classificationService.classifyAndFetch("path/to/document");

Or upload the file and retrieve the classification result immediately:

File file = new File("path/to/document");
ClassificationResult result = classificationService.uploadClassifyAndFetch(file);

All classification methods can accept a TextClassificationParameters object to customize the behavior of the request.

The TextClassificationParameters object allows fine-grained control over the classification process, including Classification Modes, OCR settings, and Semantic Configuration.

var parameters = TextClassificationParameters.builder()
        .classificationMode(ClassificationMode.EXACT)
        .languages(Language.ENGLISH, Language.FRENCH)
        .ocrMode(OcrMode.AUTO)
        .autoRotationCorrection(true)
        .removeUploadedFile(true)
        .build();

File file = new File("path/to/document.pdf");
ClassificationResult result = classificationService.uploadClassifyAndFetch(file, parameters));

Semantic Configuration

You can provide a TextClassificationSemanticConfig to the parameters. This allows you to define custom schemas, enabling the service to identify specific document types based on the presence of key-value pair fields you define.

The following example shows how to configure the service to classify a document as a specific "Invoice" type:

// 1. Define the fields expected in the document
var fields = KvpFields.builder()
    .add("invoice_date", KvpField.of("The date when the invoice was issued.", "2024-07-10"))
    .add("invoice_number", KvpField.of("The unique number identifying the invoice.", "INV-2024-001"))
    .add("total_amount", KvpField.of("The total amount to be paid.", "1250.50"))
    .build();

// 2. Define the Schema using the fields
var mySchema = Schema.builder()
    .documentDescription("A vendor-issued invoice listing purchased items, prices, and payment information.")
    .documentType("Invoice")
    .fields(fields)
    .build();

// 3. Create the Semantic Configuration
var semanticConfig = TextClassificationSemanticConfig.builder()
    .schemasMergeStrategy(SchemaMergeStrategy.REPLACE)
    .schemas(mySchema)
    .build();

// 4. Pass the configuration to the parameters
var parameters = TextClassificationParameters.builder()
    .languages(Language.ENGLISH)
    .semanticConfig(semanticConfig)
    .build();

ClassificationResult result = classificationService.uploadClassifyAndFetch(file, parameters);

Managing Requests and Files

The service also provides utility methods to manage the lifecycle of your requests and files:

// Delete a classification request history
classificationService.deleteRequest(requestId,
    TextClassificationDeleteParameters.builder().hardDelete(true).build());

// Delete a file from the bucket
classificationService.deleteFile("bucket-name", "filename.pdf");

Configuration

Configuration property fixed at build time - All other configuration properties are overridable at runtime

Configuration property	Type	Default
`quarkus.langchain4j.watsonx.chat-model.enabled` Whether the model should be enabled. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_ENABLED`	boolean	`true`
`quarkus.langchain4j.watsonx.embedding-model.enabled` Whether the embedding model should be enabled. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_EMBEDDING_MODEL_ENABLED`	boolean	`true`
`quarkus.langchain4j.watsonx.scoring-model.enabled` Whether the scoring model should be enabled. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_SCORING_MODEL_ENABLED`	boolean	`true`
`quarkus.langchain4j.watsonx.moderation-model.enabled` Whether the moderation model should be enabled. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_MODERATION_MODEL_ENABLED`	boolean	`true`
`quarkus.langchain4j.watsonx.base-url` Specifies the base URL of the watsonx.ai API. A list of all available URLs is provided in the IBM Watsonx.ai documentation at the this link. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_BASE_URL`	string
`quarkus.langchain4j.watsonx.api-key` IBM Cloud API key. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_API_KEY`	string
`quarkus.langchain4j.watsonx.timeout` Timeout for watsonx.ai calls. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_TIMEOUT`	Duration	`60s`
`quarkus.langchain4j.watsonx.version` The version date for the API of the form YYYY-MM-DD. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_VERSION`	string
`quarkus.langchain4j.watsonx.space-id` The space that contains the resource. Either `space_id` or `project_id` has to be given. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_SPACE_ID`	string
`quarkus.langchain4j.watsonx.project-id` The project that contains the resource. Either `space_id` or `project_id` has to be given. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_PROJECT_ID`	string
`quarkus.langchain4j.watsonx.log-requests` Whether the watsonx.ai client should log requests. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_LOG_REQUESTS`	boolean	`false`
`quarkus.langchain4j.watsonx.log-responses` Whether the watsonx.ai client should log responses. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_LOG_RESPONSES`	boolean	`false`
`quarkus.langchain4j.watsonx.log-requests-curl` Whether the watsonx.ai client should log requests as cURL commands. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_LOG_REQUESTS_CURL`	boolean	`false`
`quarkus.langchain4j.watsonx.enable-integration` Whether to enable the integration. Defaults to `true`, which means requests are made to the watsonx.ai provider. Set to `false` to disable all requests. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_ENABLE_INTEGRATION`	boolean	`true`
`quarkus.langchain4j.watsonx.iam.base-url` Base URL of the IAM Authentication API. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_IAM_BASE_URL`	URI
`quarkus.langchain4j.watsonx.iam.timeout` Timeout for IAM authentication calls. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_IAM_TIMEOUT`	Duration	`10s`
`quarkus.langchain4j.watsonx.iam.grant-type` Grant type for the IAM Authentication API. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_IAM_GRANT_TYPE`	string
`quarkus.langchain4j.watsonx.text-extraction.cos-url` Base URL of the Cloud Object Storage API. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_TEXT_EXTRACTION_COS_URL`	string	required
`quarkus.langchain4j.watsonx.text-extraction.document-reference.connection` The id of the connection asset that contains the credentials required to access the data. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_TEXT_EXTRACTION_DOCUMENT_REFERENCE_CONNECTION`	string	required
`quarkus.langchain4j.watsonx.text-extraction.document-reference.bucket-name` The name of the bucket containing the input document. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_TEXT_EXTRACTION_DOCUMENT_REFERENCE_BUCKET_NAME`	string	required
`quarkus.langchain4j.watsonx.text-extraction.results-reference.connection` The id of the connection asset used to store the extracted results. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_TEXT_EXTRACTION_RESULTS_REFERENCE_CONNECTION`	string	required
`quarkus.langchain4j.watsonx.text-extraction.results-reference.bucket-name` The name of the bucket where the output files will be written. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_TEXT_EXTRACTION_RESULTS_REFERENCE_BUCKET_NAME`	string	required
`quarkus.langchain4j.watsonx.text-extraction.log-requests` Whether text extraction requests should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_TEXT_EXTRACTION_LOG_REQUESTS`	boolean	`false`
`quarkus.langchain4j.watsonx.text-extraction.log-responses` Whether text extraction responses should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_TEXT_EXTRACTION_LOG_RESPONSES`	boolean	`false`
`quarkus.langchain4j.watsonx.text-extraction.log-requests-curl` Whether the watsonx.ai client should log requests as cURL commands. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_TEXT_EXTRACTION_LOG_REQUESTS_CURL`	boolean	`false`
`quarkus.langchain4j.watsonx.text-classification.cos-url` Base URL of the Cloud Object Storage API. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_TEXT_CLASSIFICATION_COS_URL`	string	required
`quarkus.langchain4j.watsonx.text-classification.document-reference.connection` The id of the connection asset that contains the credentials required to access the data. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_TEXT_CLASSIFICATION_DOCUMENT_REFERENCE_CONNECTION`	string	required
`quarkus.langchain4j.watsonx.text-classification.document-reference.bucket-name` The name of the bucket containing the input document. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_TEXT_CLASSIFICATION_DOCUMENT_REFERENCE_BUCKET_NAME`	string	required
`quarkus.langchain4j.watsonx.text-classification.log-requests` Whether text extraction requests should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_TEXT_CLASSIFICATION_LOG_REQUESTS`	boolean	`false`
`quarkus.langchain4j.watsonx.text-classification.log-responses` Whether text extraction responses should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_TEXT_CLASSIFICATION_LOG_RESPONSES`	boolean	`false`
`quarkus.langchain4j.watsonx.text-classification.log-requests-curl` Whether the watsonx.ai client should log requests as cURL commands. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_TEXT_CLASSIFICATION_LOG_REQUESTS_CURL`	boolean	`false`
`quarkus.langchain4j.watsonx.chat-model.model-name` Specifies the model to use for the chat completion. A list of all available models is provided in the IBM watsonx.ai documentation at the this link. To use a model, locate the `API model ID` column in the table and copy the corresponding model ID. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_MODEL_NAME`	string	`ibm/granite-4-h-small`
`quarkus.langchain4j.watsonx.chat-model.tool-choice` Specifies how the model should choose which tool to call during a request. This value can be: auto: The model decides whether and which tool to call automatically. required: The model must call one of the available tools. If `toolChoiceName` is set, this value is ignored. Setting this value influences the tool-calling behavior of the model when no specific tool is required. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_TOOL_CHOICE`	`auto`, `required`, `none`
`quarkus.langchain4j.watsonx.chat-model.tool-choice-name` Specifies the name of a specific tool that the model must call. When set, the model will be forced to call the specified tool. The name must exactly match one of the available tools defined for the service. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_TOOL_CHOICE_NAME`	string
`quarkus.langchain4j.watsonx.chat-model.frequency-penalty` Positive values penalize new tokens based on their existing frequency in the generated text, reducing the likelihood of the model repeating the same lines verbatim. Possible values: `-2 < value < 2` Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_FREQUENCY_PENALTY`	double	`0`
`quarkus.langchain4j.watsonx.chat-model.logprobs` Specifies whether to return the log probabilities of the output tokens. If set to `true`, the response will include the log probability of each output token in the content of the message. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_LOGPROBS`	boolean	`false`
`quarkus.langchain4j.watsonx.chat-model.top-logprobs` An integer specifying the number of most likely tokens to return at each token position, each with an associated log probability. The option `logprobs` must be set to `true` if this parameter is used. Possible values: `0 ≤ value ≤ 20` Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_TOP_LOGPROBS`	int
`quarkus.langchain4j.watsonx.chat-model.max-output-tokens` The maximum number of tokens that can be generated in the chat completion. The total length of input tokens and generated tokens is limited by the model’s context length. Set to 0 for the model’s configured max generated tokens. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_MAX_OUTPUT_TOKENS`	int	`1024`
`quarkus.langchain4j.watsonx.chat-model.presence-penalty` Applies a penalty to new tokens based on whether they already appear in the generated text so far, encouraging the model to introduce new topics rather than repeat itself. Possible values: `-2 < value < 2` Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_PRESENCE_PENALTY`	double	`0`
`quarkus.langchain4j.watsonx.chat-model.seed` Random number generator seed to use in sampling mode for experimental repeatability. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_SEED`	int
`quarkus.langchain4j.watsonx.chat-model.stop` Defines one or more stop sequences that will cause the model to stop generating further tokens if any of them are encountered in the output. This allows control over where the model should end its response. If a stop sequence is encountered before the minimum number of tokens has been generated, it will be ignored. Possible values: `0 ≤ number of items ≤ 4` Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_STOP`	list of string
`quarkus.langchain4j.watsonx.chat-model.temperature` Specifies the sampling temperature to use in the generation process. Higher values (e.g. `0.8`) make the output more random and diverse, while lower values (e.g. `0.2`) make the output more focused and deterministic. Possible values: `0 < value < 2` Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_TEMPERATURE`	double	`${quarkus.langchain4j.temperature:1.0}`
`quarkus.langchain4j.watsonx.chat-model.top-p` An alternative to sampling with `temperature`, called nucleus sampling, where the model considers the results of the tokens with `top_p` probability mass. So `0.1` means only the tokens comprising the top 10% probability mass are considered. Possible values: `0 < value < 1` Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_TOP_P`	double	`1`
`quarkus.langchain4j.watsonx.chat-model.response-format` Specifies the desired format for the model’s output. Allowable values: `[text, json, json_schema]` Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_RESPONSE_FORMAT`	`text`, `json`, `json-schema`
`quarkus.langchain4j.watsonx.chat-model.guided-choice` Specifies a set of allowed output choices. When this parameter is set, the model is constrained to return exactly one of the provided choices. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_GUIDED_CHOICE`	list of string
`quarkus.langchain4j.watsonx.chat-model.guided-grammar` Constrains the model output to follow a context-free grammar. If specified, the generated output will conform to the defined grammar. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_GUIDED_GRAMMAR`	string
`quarkus.langchain4j.watsonx.chat-model.guided-regex` Constrains the model output to match a regular expression pattern. If specified, the generated output must conform to the provided regex. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_GUIDED_REGEX`	string
`quarkus.langchain4j.watsonx.chat-model.length-penalty` Sets the length penalty to be applied during text generation. This penalty influences the length of the generated text. A length penalty discourages the model from generating overly long responses, or conversely, it can encourage more extended outputs. When the penalty value is greater than 1.0, it discourages generating longer responses. Conversely, a value less than 1.0 incentivizes the model to generate longer text. A value of 1.0 means no penalty, and the length of the output will be determined by other factors, such as the input prompt and model’s natural completion behavior. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_LENGTH_PENALTY`	double
`quarkus.langchain4j.watsonx.chat-model.repetition-penalty` Sets the repetition penalty to be applied during text generation. This penalty helps to discourage the model from repeating the same words or phrases too often. The penalty value should be greater than 1.0 for repetition discouragement. A value of 1.0 means no penalty, and values above 1.0 increase the strength of the penalty. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_REPETITION_PENALTY`	double
`quarkus.langchain4j.watsonx.chat-model.log-requests` Whether chat model requests should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_LOG_REQUESTS`	boolean	`false`
`quarkus.langchain4j.watsonx.chat-model.log-responses` Whether chat model responses should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_LOG_RESPONSES`	boolean	`false`
`quarkus.langchain4j.watsonx.chat-model.log-requests-curl` Whether the watsonx.ai client should log requests as cURL commands. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_LOG_REQUESTS_CURL`	boolean	`false`
`quarkus.langchain4j.watsonx.chat-model.thinking.enabled` Enables or disables reasoning. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_THINKING_ENABLED`	boolean
`quarkus.langchain4j.watsonx.chat-model.thinking.tags.think` The XML-like tag enclosing the model’s internal reasoning. Example: `<think> … </think>` Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_THINKING_TAGS_THINK`	string	required
`quarkus.langchain4j.watsonx.chat-model.thinking.tags.response` The XML-like tag enclosing the model’s final response. Optional — if not defined, all text outside the reasoning tag is treated as the response. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_THINKING_TAGS_RESPONSE`	string
`quarkus.langchain4j.watsonx.chat-model.thinking.effort` Controls the reasoning effort level for models that separate reasoning and response automatically. Example values: `LOW`, `MEDIUM`, `HIGH`. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_THINKING_EFFORT`	`low`, `medium`, `high`
`quarkus.langchain4j.watsonx.chat-model.thinking.include-reasoning` Determines whether the reasoning portion returned by the model should be included in the final response provided to the application. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_THINKING_INCLUDE_REASONING`	boolean
`quarkus.langchain4j.watsonx.embedding-model.model-name` Specifies the ID of the model to be used. A list of all available models is provided in the IBM watsonx.ai documentation at the this link. To use a model, locate the `API model ID` column in the table and copy the corresponding model ID. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_EMBEDDING_MODEL_MODEL_NAME`	string	`ibm/granite-embedding-278m-multilingual`
`quarkus.langchain4j.watsonx.embedding-model.log-requests` Whether embedding model requests should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_EMBEDDING_MODEL_LOG_REQUESTS`	boolean	`false`
`quarkus.langchain4j.watsonx.embedding-model.log-responses` Whether embedding model responses should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_EMBEDDING_MODEL_LOG_RESPONSES`	boolean	`false`
`quarkus.langchain4j.watsonx.embedding-model.log-requests-curl` Whether the watsonx.ai client should log requests as cURL commands. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_EMBEDDING_MODEL_LOG_REQUESTS_CURL`	boolean	`false`
`quarkus.langchain4j.watsonx.scoring-model.model-name` The id of the model to be used. All available models are listed in the IBM Watsonx.ai documentation at the link: following link. To use a model, locate the `API model_id` column in the table and copy the corresponding model ID. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_SCORING_MODEL_MODEL_NAME`	string	`cross-encoder/ms-marco-minilm-l-12-v2`
`quarkus.langchain4j.watsonx.scoring-model.log-requests` Whether embedding model requests should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_SCORING_MODEL_LOG_REQUESTS`	boolean	`false`
`quarkus.langchain4j.watsonx.scoring-model.log-responses` Whether embedding model responses should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_SCORING_MODEL_LOG_RESPONSES`	boolean	`false`
`quarkus.langchain4j.watsonx.scoring-model.log-requests-curl` Whether the watsonx.ai client should log requests as cURL commands. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_SCORING_MODEL_LOG_REQUESTS_CURL`	boolean	`false`
`quarkus.langchain4j.watsonx.moderation-model.pii.enabled` Indicates whether the PII moderation model is enabled. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_MODERATION_MODEL_PII_ENABLED`	boolean	required
`quarkus.langchain4j.watsonx.moderation-model.hap.enabled` Indicates whether the HAP moderation model is enabled. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_MODERATION_MODEL_HAP_ENABLED`	boolean	required
`quarkus.langchain4j.watsonx.moderation-model.hap.threshold` Threshold value for HAP moderation model. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_MODERATION_MODEL_HAP_THRESHOLD`	double
`quarkus.langchain4j.watsonx.moderation-model.granite-guardian.enabled` Indicates whether the GraniteGuardian moderation model is enabled. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_MODERATION_MODEL_GRANITE_GUARDIAN_ENABLED`	boolean	required
`quarkus.langchain4j.watsonx.moderation-model.granite-guardian.threshold` Threshold value for Granite Guardian moderation model. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_MODERATION_MODEL_GRANITE_GUARDIAN_THRESHOLD`	double
`quarkus.langchain4j.watsonx.moderation-model.log-requests` Whether moderation model requests should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_MODERATION_MODEL_LOG_REQUESTS`	boolean	`false`
`quarkus.langchain4j.watsonx.moderation-model.log-responses` Whether moderation model responses should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_MODERATION_MODEL_LOG_RESPONSES`	boolean	`false`
`quarkus.langchain4j.watsonx.moderation-model.log-requests-curl` Whether the watsonx.ai client should log requests as cURL commands. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_MODERATION_MODEL_LOG_REQUESTS_CURL`	boolean	`false`
`quarkus.langchain4j.watsonx.built-in-tool.base-url` Base URL for the built-in service. All available URLs are listed in the IBM Watsonx.ai documentation at the following link. Note: If empty, the URL is automatically calculated based on the `watsonx.base-url` value. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_BUILT_IN_TOOL_BASE_URL`	string
`quarkus.langchain4j.watsonx.built-in-tool.timeout` Timeout for built-in tools APIs. If empty, the api key inherits the value from the `watsonx.timeout` property. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_BUILT_IN_TOOL_TIMEOUT`	Duration	`10s`
`quarkus.langchain4j.watsonx.built-in-tool.log-requests` Whether the built-in rest client should log requests. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_BUILT_IN_TOOL_LOG_REQUESTS`	boolean	`false`
`quarkus.langchain4j.watsonx.built-in-tool.log-responses` Whether the built-in rest client should log responses. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_BUILT_IN_TOOL_LOG_RESPONSES`	boolean	`false`
`quarkus.langchain4j.watsonx.built-in-tool.log-requests-curl` Whether the watsonx.ai client should log requests as cURL commands. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_BUILT_IN_TOOL_LOG_REQUESTS_CURL`	boolean	`false`
`quarkus.langchain4j.watsonx.built-in-tool.tavily-search.api-key` Tavily API key. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_BUILT_IN_TOOL_TAVILY_SEARCH_API_KEY`	string
`quarkus.langchain4j.watsonx.built-in-tool.python-interpreter.deployment-id` Deployment id. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_BUILT_IN_TOOL_PYTHON_INTERPRETER_DEPLOYMENT_ID`	string
`quarkus.langchain4j.watsonx.built-in-tool.rag-query.vector-index-ids` Vector index ids Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX_BUILT_IN_TOOL_RAG_QUERY_VECTOR_INDEX_IDS`	list of string
Named model config	Type	Default
`quarkus.langchain4j.watsonx."model-name".base-url` Specifies the base URL of the watsonx.ai API. A list of all available URLs is provided in the IBM Watsonx.ai documentation at the this link. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__BASE_URL`	string
`quarkus.langchain4j.watsonx."model-name".api-key` IBM Cloud API key. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__API_KEY`	string
`quarkus.langchain4j.watsonx."model-name".timeout` Timeout for watsonx.ai calls. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TIMEOUT`	Duration	`60s`
`quarkus.langchain4j.watsonx."model-name".version` The version date for the API of the form YYYY-MM-DD. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__VERSION`	string
`quarkus.langchain4j.watsonx."model-name".space-id` The space that contains the resource. Either `space_id` or `project_id` has to be given. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__SPACE_ID`	string
`quarkus.langchain4j.watsonx."model-name".project-id` The project that contains the resource. Either `space_id` or `project_id` has to be given. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__PROJECT_ID`	string
`quarkus.langchain4j.watsonx."model-name".log-requests` Whether the watsonx.ai client should log requests. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__LOG_REQUESTS`	boolean	`false`
`quarkus.langchain4j.watsonx."model-name".log-responses` Whether the watsonx.ai client should log responses. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__LOG_RESPONSES`	boolean	`false`
`quarkus.langchain4j.watsonx."model-name".log-requests-curl` Whether the watsonx.ai client should log requests as cURL commands. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__LOG_REQUESTS_CURL`	boolean	`false`
`quarkus.langchain4j.watsonx."model-name".enable-integration` Whether to enable the integration. Defaults to `true`, which means requests are made to the watsonx.ai provider. Set to `false` to disable all requests. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__ENABLE_INTEGRATION`	boolean	`true`
`quarkus.langchain4j.watsonx."model-name".iam.base-url` Base URL of the IAM Authentication API. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__IAM_BASE_URL`	URI
`quarkus.langchain4j.watsonx."model-name".iam.timeout` Timeout for IAM authentication calls. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__IAM_TIMEOUT`	Duration	`10s`
`quarkus.langchain4j.watsonx."model-name".iam.grant-type` Grant type for the IAM Authentication API. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__IAM_GRANT_TYPE`	string
`quarkus.langchain4j.watsonx."model-name".text-extraction.cos-url` Base URL of the Cloud Object Storage API. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_EXTRACTION_COS_URL`	string	required
`quarkus.langchain4j.watsonx."model-name".text-extraction.document-reference.connection` The id of the connection asset that contains the credentials required to access the data. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_EXTRACTION_DOCUMENT_REFERENCE_CONNECTION`	string	required
`quarkus.langchain4j.watsonx."model-name".text-extraction.document-reference.bucket-name` The name of the bucket containing the input document. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_EXTRACTION_DOCUMENT_REFERENCE_BUCKET_NAME`	string	required
`quarkus.langchain4j.watsonx."model-name".text-extraction.results-reference.connection` The id of the connection asset used to store the extracted results. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_EXTRACTION_RESULTS_REFERENCE_CONNECTION`	string	required
`quarkus.langchain4j.watsonx."model-name".text-extraction.results-reference.bucket-name` The name of the bucket where the output files will be written. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_EXTRACTION_RESULTS_REFERENCE_BUCKET_NAME`	string	required
`quarkus.langchain4j.watsonx."model-name".text-extraction.log-requests` Whether text extraction requests should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_EXTRACTION_LOG_REQUESTS`	boolean	`false`
`quarkus.langchain4j.watsonx."model-name".text-extraction.log-responses` Whether text extraction responses should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_EXTRACTION_LOG_RESPONSES`	boolean	`false`
`quarkus.langchain4j.watsonx."model-name".text-extraction.log-requests-curl` Whether the watsonx.ai client should log requests as cURL commands. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_EXTRACTION_LOG_REQUESTS_CURL`	boolean	`false`
`quarkus.langchain4j.watsonx."model-name".text-classification.cos-url` Base URL of the Cloud Object Storage API. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_CLASSIFICATION_COS_URL`	string	required
`quarkus.langchain4j.watsonx."model-name".text-classification.document-reference.connection` The id of the connection asset that contains the credentials required to access the data. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_CLASSIFICATION_DOCUMENT_REFERENCE_CONNECTION`	string	required
`quarkus.langchain4j.watsonx."model-name".text-classification.document-reference.bucket-name` The name of the bucket containing the input document. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_CLASSIFICATION_DOCUMENT_REFERENCE_BUCKET_NAME`	string	required
`quarkus.langchain4j.watsonx."model-name".text-classification.log-requests` Whether text extraction requests should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_CLASSIFICATION_LOG_REQUESTS`	boolean	`false`
`quarkus.langchain4j.watsonx."model-name".text-classification.log-responses` Whether text extraction responses should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_CLASSIFICATION_LOG_RESPONSES`	boolean	`false`
`quarkus.langchain4j.watsonx."model-name".text-classification.log-requests-curl` Whether the watsonx.ai client should log requests as cURL commands. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_CLASSIFICATION_LOG_REQUESTS_CURL`	boolean	`false`
`quarkus.langchain4j.watsonx."model-name".chat-model.model-name` Specifies the model to use for the chat completion. A list of all available models is provided in the IBM watsonx.ai documentation at the this link. To use a model, locate the `API model ID` column in the table and copy the corresponding model ID. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_MODEL_NAME`	string	`ibm/granite-4-h-small`
`quarkus.langchain4j.watsonx."model-name".chat-model.tool-choice` Specifies how the model should choose which tool to call during a request. This value can be: auto: The model decides whether and which tool to call automatically. required: The model must call one of the available tools. If `toolChoiceName` is set, this value is ignored. Setting this value influences the tool-calling behavior of the model when no specific tool is required. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_TOOL_CHOICE`	`auto`, `required`, `none`
`quarkus.langchain4j.watsonx."model-name".chat-model.tool-choice-name` Specifies the name of a specific tool that the model must call. When set, the model will be forced to call the specified tool. The name must exactly match one of the available tools defined for the service. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_TOOL_CHOICE_NAME`	string
`quarkus.langchain4j.watsonx."model-name".chat-model.frequency-penalty` Positive values penalize new tokens based on their existing frequency in the generated text, reducing the likelihood of the model repeating the same lines verbatim. Possible values: `-2 < value < 2` Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_FREQUENCY_PENALTY`	double	`0`
`quarkus.langchain4j.watsonx."model-name".chat-model.logprobs` Specifies whether to return the log probabilities of the output tokens. If set to `true`, the response will include the log probability of each output token in the content of the message. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_LOGPROBS`	boolean	`false`
`quarkus.langchain4j.watsonx."model-name".chat-model.top-logprobs` An integer specifying the number of most likely tokens to return at each token position, each with an associated log probability. The option `logprobs` must be set to `true` if this parameter is used. Possible values: `0 ≤ value ≤ 20` Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_TOP_LOGPROBS`	int
`quarkus.langchain4j.watsonx."model-name".chat-model.max-output-tokens` The maximum number of tokens that can be generated in the chat completion. The total length of input tokens and generated tokens is limited by the model’s context length. Set to 0 for the model’s configured max generated tokens. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_MAX_OUTPUT_TOKENS`	int	`1024`
`quarkus.langchain4j.watsonx."model-name".chat-model.presence-penalty` Applies a penalty to new tokens based on whether they already appear in the generated text so far, encouraging the model to introduce new topics rather than repeat itself. Possible values: `-2 < value < 2` Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_PRESENCE_PENALTY`	double	`0`
`quarkus.langchain4j.watsonx."model-name".chat-model.seed` Random number generator seed to use in sampling mode for experimental repeatability. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_SEED`	int
`quarkus.langchain4j.watsonx."model-name".chat-model.stop` Defines one or more stop sequences that will cause the model to stop generating further tokens if any of them are encountered in the output. This allows control over where the model should end its response. If a stop sequence is encountered before the minimum number of tokens has been generated, it will be ignored. Possible values: `0 ≤ number of items ≤ 4` Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_STOP`	list of string
`quarkus.langchain4j.watsonx."model-name".chat-model.temperature` Specifies the sampling temperature to use in the generation process. Higher values (e.g. `0.8`) make the output more random and diverse, while lower values (e.g. `0.2`) make the output more focused and deterministic. Possible values: `0 < value < 2` Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_TEMPERATURE`	double	`${quarkus.langchain4j.temperature:1.0}`
`quarkus.langchain4j.watsonx."model-name".chat-model.top-p` An alternative to sampling with `temperature`, called nucleus sampling, where the model considers the results of the tokens with `top_p` probability mass. So `0.1` means only the tokens comprising the top 10% probability mass are considered. Possible values: `0 < value < 1` Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_TOP_P`	double	`1`
`quarkus.langchain4j.watsonx."model-name".chat-model.response-format` Specifies the desired format for the model’s output. Allowable values: `[text, json, json_schema]` Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_RESPONSE_FORMAT`	`text`, `json`, `json-schema`
`quarkus.langchain4j.watsonx."model-name".chat-model.guided-choice` Specifies a set of allowed output choices. When this parameter is set, the model is constrained to return exactly one of the provided choices. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_GUIDED_CHOICE`	list of string
`quarkus.langchain4j.watsonx."model-name".chat-model.guided-grammar` Constrains the model output to follow a context-free grammar. If specified, the generated output will conform to the defined grammar. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_GUIDED_GRAMMAR`	string
`quarkus.langchain4j.watsonx."model-name".chat-model.guided-regex` Constrains the model output to match a regular expression pattern. If specified, the generated output must conform to the provided regex. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_GUIDED_REGEX`	string
`quarkus.langchain4j.watsonx."model-name".chat-model.length-penalty` Sets the length penalty to be applied during text generation. This penalty influences the length of the generated text. A length penalty discourages the model from generating overly long responses, or conversely, it can encourage more extended outputs. When the penalty value is greater than 1.0, it discourages generating longer responses. Conversely, a value less than 1.0 incentivizes the model to generate longer text. A value of 1.0 means no penalty, and the length of the output will be determined by other factors, such as the input prompt and model’s natural completion behavior. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_LENGTH_PENALTY`	double
`quarkus.langchain4j.watsonx."model-name".chat-model.repetition-penalty` Sets the repetition penalty to be applied during text generation. This penalty helps to discourage the model from repeating the same words or phrases too often. The penalty value should be greater than 1.0 for repetition discouragement. A value of 1.0 means no penalty, and values above 1.0 increase the strength of the penalty. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_REPETITION_PENALTY`	double
`quarkus.langchain4j.watsonx."model-name".chat-model.log-requests` Whether chat model requests should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_LOG_REQUESTS`	boolean	`false`
`quarkus.langchain4j.watsonx."model-name".chat-model.log-responses` Whether chat model responses should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_LOG_RESPONSES`	boolean	`false`
`quarkus.langchain4j.watsonx."model-name".chat-model.log-requests-curl` Whether the watsonx.ai client should log requests as cURL commands. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_LOG_REQUESTS_CURL`	boolean	`false`
`quarkus.langchain4j.watsonx."model-name".chat-model.thinking.enabled` Enables or disables reasoning. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_THINKING_ENABLED`	boolean
`quarkus.langchain4j.watsonx."model-name".chat-model.thinking.tags.think` The XML-like tag enclosing the model’s internal reasoning. Example: `<think> … </think>` Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_THINKING_TAGS_THINK`	string	required
`quarkus.langchain4j.watsonx."model-name".chat-model.thinking.tags.response` The XML-like tag enclosing the model’s final response. Optional — if not defined, all text outside the reasoning tag is treated as the response. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_THINKING_TAGS_RESPONSE`	string
`quarkus.langchain4j.watsonx."model-name".chat-model.thinking.effort` Controls the reasoning effort level for models that separate reasoning and response automatically. Example values: `LOW`, `MEDIUM`, `HIGH`. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_THINKING_EFFORT`	`low`, `medium`, `high`
`quarkus.langchain4j.watsonx."model-name".chat-model.thinking.include-reasoning` Determines whether the reasoning portion returned by the model should be included in the final response provided to the application. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_THINKING_INCLUDE_REASONING`	boolean
`quarkus.langchain4j.watsonx."model-name".embedding-model.model-name` Specifies the ID of the model to be used. A list of all available models is provided in the IBM watsonx.ai documentation at the this link. To use a model, locate the `API model ID` column in the table and copy the corresponding model ID. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__EMBEDDING_MODEL_MODEL_NAME`	string	`ibm/granite-embedding-278m-multilingual`
`quarkus.langchain4j.watsonx."model-name".embedding-model.log-requests` Whether embedding model requests should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__EMBEDDING_MODEL_LOG_REQUESTS`	boolean	`false`
`quarkus.langchain4j.watsonx."model-name".embedding-model.log-responses` Whether embedding model responses should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__EMBEDDING_MODEL_LOG_RESPONSES`	boolean	`false`
`quarkus.langchain4j.watsonx."model-name".embedding-model.log-requests-curl` Whether the watsonx.ai client should log requests as cURL commands. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__EMBEDDING_MODEL_LOG_REQUESTS_CURL`	boolean	`false`
`quarkus.langchain4j.watsonx."model-name".scoring-model.model-name` The id of the model to be used. All available models are listed in the IBM Watsonx.ai documentation at the link: following link. To use a model, locate the `API model_id` column in the table and copy the corresponding model ID. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__SCORING_MODEL_MODEL_NAME`	string	`cross-encoder/ms-marco-minilm-l-12-v2`
`quarkus.langchain4j.watsonx."model-name".scoring-model.log-requests` Whether embedding model requests should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__SCORING_MODEL_LOG_REQUESTS`	boolean	`false`
`quarkus.langchain4j.watsonx."model-name".scoring-model.log-responses` Whether embedding model responses should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__SCORING_MODEL_LOG_RESPONSES`	boolean	`false`
`quarkus.langchain4j.watsonx."model-name".scoring-model.log-requests-curl` Whether the watsonx.ai client should log requests as cURL commands. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__SCORING_MODEL_LOG_REQUESTS_CURL`	boolean	`false`
`quarkus.langchain4j.watsonx."model-name".moderation-model.pii.enabled` Indicates whether the PII moderation model is enabled. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__MODERATION_MODEL_PII_ENABLED`	boolean	required
`quarkus.langchain4j.watsonx."model-name".moderation-model.hap.enabled` Indicates whether the HAP moderation model is enabled. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__MODERATION_MODEL_HAP_ENABLED`	boolean	required
`quarkus.langchain4j.watsonx."model-name".moderation-model.hap.threshold` Threshold value for HAP moderation model. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__MODERATION_MODEL_HAP_THRESHOLD`	double
`quarkus.langchain4j.watsonx."model-name".moderation-model.granite-guardian.enabled` Indicates whether the GraniteGuardian moderation model is enabled. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__MODERATION_MODEL_GRANITE_GUARDIAN_ENABLED`	boolean	required
`quarkus.langchain4j.watsonx."model-name".moderation-model.granite-guardian.threshold` Threshold value for Granite Guardian moderation model. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__MODERATION_MODEL_GRANITE_GUARDIAN_THRESHOLD`	double
`quarkus.langchain4j.watsonx."model-name".moderation-model.log-requests` Whether moderation model requests should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__MODERATION_MODEL_LOG_REQUESTS`	boolean	`false`
`quarkus.langchain4j.watsonx."model-name".moderation-model.log-responses` Whether moderation model responses should be logged. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__MODERATION_MODEL_LOG_RESPONSES`	boolean	`false`
`quarkus.langchain4j.watsonx."model-name".moderation-model.log-requests-curl` Whether the watsonx.ai client should log requests as cURL commands. Environment variable: `QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__MODERATION_MODEL_LOG_REQUESTS_CURL`	boolean	`false`

Configuration property

Type

Default

quarkus.langchain4j.watsonx.chat-model.enabled

Whether the model should be enabled.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_ENABLED

boolean

true

quarkus.langchain4j.watsonx.embedding-model.enabled

Whether the embedding model should be enabled.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_EMBEDDING_MODEL_ENABLED

boolean

true

quarkus.langchain4j.watsonx.scoring-model.enabled

Whether the scoring model should be enabled.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_SCORING_MODEL_ENABLED

boolean

true

quarkus.langchain4j.watsonx.moderation-model.enabled

Whether the moderation model should be enabled.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_MODERATION_MODEL_ENABLED

boolean

true

quarkus.langchain4j.watsonx.base-url

Specifies the base URL of the watsonx.ai API.

A list of all available URLs is provided in the IBM Watsonx.ai documentation at the this link.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_BASE_URL

string

quarkus.langchain4j.watsonx.api-key

IBM Cloud API key.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_API_KEY

string

quarkus.langchain4j.watsonx.timeout

Timeout for watsonx.ai calls.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_TIMEOUT

Duration

60s

quarkus.langchain4j.watsonx.version

The version date for the API of the form YYYY-MM-DD.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_VERSION

string

quarkus.langchain4j.watsonx.space-id

The space that contains the resource.

Either space_id or project_id has to be given.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_SPACE_ID

string

quarkus.langchain4j.watsonx.project-id

The project that contains the resource.

Either space_id or project_id has to be given.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_PROJECT_ID

string

quarkus.langchain4j.watsonx.log-requests

Whether the watsonx.ai client should log requests.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_LOG_REQUESTS

boolean

false

quarkus.langchain4j.watsonx.log-responses

Whether the watsonx.ai client should log responses.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_LOG_RESPONSES

boolean

false

quarkus.langchain4j.watsonx.log-requests-curl

Whether the watsonx.ai client should log requests as cURL commands.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_LOG_REQUESTS_CURL

boolean

false

quarkus.langchain4j.watsonx.enable-integration

Whether to enable the integration. Defaults to true, which means requests are made to the watsonx.ai provider. Set to false to disable all requests.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_ENABLE_INTEGRATION

boolean

true

quarkus.langchain4j.watsonx.iam.base-url

Base URL of the IAM Authentication API.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_IAM_BASE_URL

URI

quarkus.langchain4j.watsonx.iam.timeout

Timeout for IAM authentication calls.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_IAM_TIMEOUT

Duration

10s

quarkus.langchain4j.watsonx.iam.grant-type

Grant type for the IAM Authentication API.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_IAM_GRANT_TYPE

string

quarkus.langchain4j.watsonx.text-extraction.cos-url

Base URL of the Cloud Object Storage API.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_TEXT_EXTRACTION_COS_URL

string

required

quarkus.langchain4j.watsonx.text-extraction.document-reference.connection

The id of the connection asset that contains the credentials required to access the data.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_TEXT_EXTRACTION_DOCUMENT_REFERENCE_CONNECTION

string

required

quarkus.langchain4j.watsonx.text-extraction.document-reference.bucket-name

The name of the bucket containing the input document.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_TEXT_EXTRACTION_DOCUMENT_REFERENCE_BUCKET_NAME

string

required

quarkus.langchain4j.watsonx.text-extraction.results-reference.connection

The id of the connection asset used to store the extracted results.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_TEXT_EXTRACTION_RESULTS_REFERENCE_CONNECTION

string

required

quarkus.langchain4j.watsonx.text-extraction.results-reference.bucket-name

The name of the bucket where the output files will be written.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_TEXT_EXTRACTION_RESULTS_REFERENCE_BUCKET_NAME

string

required

quarkus.langchain4j.watsonx.text-extraction.log-requests

Whether text extraction requests should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_TEXT_EXTRACTION_LOG_REQUESTS

boolean

false

quarkus.langchain4j.watsonx.text-extraction.log-responses

Whether text extraction responses should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_TEXT_EXTRACTION_LOG_RESPONSES

boolean

false

quarkus.langchain4j.watsonx.text-extraction.log-requests-curl

Whether the watsonx.ai client should log requests as cURL commands.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_TEXT_EXTRACTION_LOG_REQUESTS_CURL

boolean

false

quarkus.langchain4j.watsonx.text-classification.cos-url

Base URL of the Cloud Object Storage API.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_TEXT_CLASSIFICATION_COS_URL

string

required

quarkus.langchain4j.watsonx.text-classification.document-reference.connection

The id of the connection asset that contains the credentials required to access the data.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_TEXT_CLASSIFICATION_DOCUMENT_REFERENCE_CONNECTION

string

required

quarkus.langchain4j.watsonx.text-classification.document-reference.bucket-name

The name of the bucket containing the input document.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_TEXT_CLASSIFICATION_DOCUMENT_REFERENCE_BUCKET_NAME

string

required

quarkus.langchain4j.watsonx.text-classification.log-requests

Whether text extraction requests should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_TEXT_CLASSIFICATION_LOG_REQUESTS

boolean

false

quarkus.langchain4j.watsonx.text-classification.log-responses

Whether text extraction responses should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_TEXT_CLASSIFICATION_LOG_RESPONSES

boolean

false

quarkus.langchain4j.watsonx.text-classification.log-requests-curl

Whether the watsonx.ai client should log requests as cURL commands.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_TEXT_CLASSIFICATION_LOG_REQUESTS_CURL

boolean

false

quarkus.langchain4j.watsonx.chat-model.model-name

Specifies the model to use for the chat completion.

A list of all available models is provided in the IBM watsonx.ai documentation at the this link.

To use a model, locate the API model ID column in the table and copy the corresponding model ID.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_MODEL_NAME

string

ibm/granite-4-h-small

quarkus.langchain4j.watsonx.chat-model.tool-choice

Specifies how the model should choose which tool to call during a request.

This value can be:

auto: The model decides whether and which tool to call automatically.
required: The model must call one of the available tools.

If toolChoiceName is set, this value is ignored.

Setting this value influences the tool-calling behavior of the model when no specific tool is required.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_TOOL_CHOICE

auto, required, none

quarkus.langchain4j.watsonx.chat-model.tool-choice-name

Specifies the name of a specific tool that the model must call.

When set, the model will be forced to call the specified tool. The name must exactly match one of the available tools defined for the service.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_TOOL_CHOICE_NAME

string

quarkus.langchain4j.watsonx.chat-model.frequency-penalty

Positive values penalize new tokens based on their existing frequency in the generated text, reducing the likelihood of the model repeating the same lines verbatim.

Possible values: -2 < value < 2

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_FREQUENCY_PENALTY

double

0

quarkus.langchain4j.watsonx.chat-model.logprobs

Specifies whether to return the log probabilities of the output tokens.

If set to true, the response will include the log probability of each output token in the content of the message.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_LOGPROBS

boolean

false

quarkus.langchain4j.watsonx.chat-model.top-logprobs

An integer specifying the number of most likely tokens to return at each token position, each with an associated log probability. The option logprobs must be set to true if this parameter is used.

Possible values: 0 ≤ value ≤ 20

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_TOP_LOGPROBS

int

quarkus.langchain4j.watsonx.chat-model.max-output-tokens

The maximum number of tokens that can be generated in the chat completion. The total length of input tokens and generated tokens is limited by the model’s context length. Set to 0 for the model’s configured max generated tokens.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_MAX_OUTPUT_TOKENS

int

1024

quarkus.langchain4j.watsonx.chat-model.presence-penalty

Applies a penalty to new tokens based on whether they already appear in the generated text so far, encouraging the model to introduce new topics rather than repeat itself.

Possible values: -2 < value < 2

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_PRESENCE_PENALTY

double

0

quarkus.langchain4j.watsonx.chat-model.seed

Random number generator seed to use in sampling mode for experimental repeatability.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_SEED

int

quarkus.langchain4j.watsonx.chat-model.stop

Defines one or more stop sequences that will cause the model to stop generating further tokens if any of them are encountered in the output.

This allows control over where the model should end its response. If a stop sequence is encountered before the minimum number of tokens has been generated, it will be ignored.

Possible values: 0 ≤ number of items ≤ 4

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_STOP

list of string

quarkus.langchain4j.watsonx.chat-model.temperature

Specifies the sampling temperature to use in the generation process.

Higher values (e.g. 0.8) make the output more random and diverse, while lower values (e.g. 0.2) make the output more focused and deterministic.

Possible values: 0 < value < 2

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_TEMPERATURE

double

${quarkus.langchain4j.temperature:1.0}

quarkus.langchain4j.watsonx.chat-model.top-p

An alternative to sampling with temperature, called nucleus sampling, where the model considers the results of the tokens with top_p probability mass. So 0.1 means only the tokens comprising the top 10% probability mass are considered.

Possible values: 0 < value < 1

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_TOP_P

double

1

quarkus.langchain4j.watsonx.chat-model.response-format

Specifies the desired format for the model’s output.

Allowable values: [text, json, json_schema]

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_RESPONSE_FORMAT

text, json, json-schema

quarkus.langchain4j.watsonx.chat-model.guided-choice

Specifies a set of allowed output choices.

When this parameter is set, the model is constrained to return exactly one of the provided choices.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_GUIDED_CHOICE

list of string

quarkus.langchain4j.watsonx.chat-model.guided-grammar

Constrains the model output to follow a context-free grammar.

If specified, the generated output will conform to the defined grammar.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_GUIDED_GRAMMAR

string

quarkus.langchain4j.watsonx.chat-model.guided-regex

Constrains the model output to match a regular expression pattern.

If specified, the generated output must conform to the provided regex.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_GUIDED_REGEX

string

quarkus.langchain4j.watsonx.chat-model.length-penalty

Sets the length penalty to be applied during text generation. This penalty influences the length of the generated text. A length penalty discourages the model from generating overly long responses, or conversely, it can encourage more extended outputs.

When the penalty value is greater than 1.0, it discourages generating longer responses. Conversely, a value less than 1.0 incentivizes the model to generate longer text. A value of 1.0 means no penalty, and the length of the output will be determined by other factors, such as the input prompt and model’s natural completion behavior.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_LENGTH_PENALTY

double

quarkus.langchain4j.watsonx.chat-model.repetition-penalty

Sets the repetition penalty to be applied during text generation. This penalty helps to discourage the model from repeating the same words or phrases too often.

The penalty value should be greater than 1.0 for repetition discouragement. A value of 1.0 means no penalty, and values above 1.0 increase the strength of the penalty.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_REPETITION_PENALTY

double

quarkus.langchain4j.watsonx.chat-model.log-requests

Whether chat model requests should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_LOG_REQUESTS

boolean

false

quarkus.langchain4j.watsonx.chat-model.log-responses

Whether chat model responses should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_LOG_RESPONSES

boolean

false

quarkus.langchain4j.watsonx.chat-model.log-requests-curl

Whether the watsonx.ai client should log requests as cURL commands.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_LOG_REQUESTS_CURL

boolean

false

quarkus.langchain4j.watsonx.chat-model.thinking.enabled

Enables or disables reasoning.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_THINKING_ENABLED

boolean

quarkus.langchain4j.watsonx.chat-model.thinking.tags.think

The XML-like tag enclosing the model’s internal reasoning.

Example: <think> … </think>

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_THINKING_TAGS_THINK

string

required

quarkus.langchain4j.watsonx.chat-model.thinking.tags.response

The XML-like tag enclosing the model’s final response.

Optional — if not defined, all text outside the reasoning tag is treated as the response.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_THINKING_TAGS_RESPONSE

string

quarkus.langchain4j.watsonx.chat-model.thinking.effort

Controls the reasoning effort level for models that separate reasoning and response automatically.

Example values: LOW, MEDIUM, HIGH.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_THINKING_EFFORT

low, medium, high

quarkus.langchain4j.watsonx.chat-model.thinking.include-reasoning

Determines whether the reasoning portion returned by the model should be included in the final response provided to the application.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_CHAT_MODEL_THINKING_INCLUDE_REASONING

boolean

quarkus.langchain4j.watsonx.embedding-model.model-name

Specifies the ID of the model to be used.

A list of all available models is provided in the IBM watsonx.ai documentation at the this link.

To use a model, locate the API model ID column in the table and copy the corresponding model ID.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_EMBEDDING_MODEL_MODEL_NAME

string

ibm/granite-embedding-278m-multilingual

quarkus.langchain4j.watsonx.embedding-model.log-requests

Whether embedding model requests should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_EMBEDDING_MODEL_LOG_REQUESTS

boolean

false

quarkus.langchain4j.watsonx.embedding-model.log-responses

Whether embedding model responses should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_EMBEDDING_MODEL_LOG_RESPONSES

boolean

false

quarkus.langchain4j.watsonx.embedding-model.log-requests-curl

Whether the watsonx.ai client should log requests as cURL commands.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_EMBEDDING_MODEL_LOG_REQUESTS_CURL

boolean

false

quarkus.langchain4j.watsonx.scoring-model.model-name

The id of the model to be used.

All available models are listed in the IBM Watsonx.ai documentation at the link: following link.

To use a model, locate the API model_id column in the table and copy the corresponding model ID.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_SCORING_MODEL_MODEL_NAME

string

cross-encoder/ms-marco-minilm-l-12-v2

quarkus.langchain4j.watsonx.scoring-model.log-requests

Whether embedding model requests should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_SCORING_MODEL_LOG_REQUESTS

boolean

false

quarkus.langchain4j.watsonx.scoring-model.log-responses

Whether embedding model responses should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_SCORING_MODEL_LOG_RESPONSES

boolean

false

quarkus.langchain4j.watsonx.scoring-model.log-requests-curl

Whether the watsonx.ai client should log requests as cURL commands.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_SCORING_MODEL_LOG_REQUESTS_CURL

boolean

false

quarkus.langchain4j.watsonx.moderation-model.pii.enabled

Indicates whether the PII moderation model is enabled.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_MODERATION_MODEL_PII_ENABLED

boolean

required

quarkus.langchain4j.watsonx.moderation-model.hap.enabled

Indicates whether the HAP moderation model is enabled.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_MODERATION_MODEL_HAP_ENABLED

boolean

required

quarkus.langchain4j.watsonx.moderation-model.hap.threshold

Threshold value for HAP moderation model.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_MODERATION_MODEL_HAP_THRESHOLD

double

quarkus.langchain4j.watsonx.moderation-model.granite-guardian.enabled

Indicates whether the GraniteGuardian moderation model is enabled.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_MODERATION_MODEL_GRANITE_GUARDIAN_ENABLED

boolean

required

quarkus.langchain4j.watsonx.moderation-model.granite-guardian.threshold

Threshold value for Granite Guardian moderation model.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_MODERATION_MODEL_GRANITE_GUARDIAN_THRESHOLD

double

quarkus.langchain4j.watsonx.moderation-model.log-requests

Whether moderation model requests should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_MODERATION_MODEL_LOG_REQUESTS

boolean

false

quarkus.langchain4j.watsonx.moderation-model.log-responses

Whether moderation model responses should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_MODERATION_MODEL_LOG_RESPONSES

boolean

false

quarkus.langchain4j.watsonx.moderation-model.log-requests-curl

Whether the watsonx.ai client should log requests as cURL commands.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_MODERATION_MODEL_LOG_REQUESTS_CURL

boolean

false

quarkus.langchain4j.watsonx.built-in-tool.base-url

Base URL for the built-in service.

All available URLs are listed in the IBM Watsonx.ai documentation at the following link.

Note: If empty, the URL is automatically calculated based on the watsonx.base-url value.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_BUILT_IN_TOOL_BASE_URL

string

quarkus.langchain4j.watsonx.built-in-tool.timeout

Timeout for built-in tools APIs.

If empty, the api key inherits the value from the watsonx.timeout property.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_BUILT_IN_TOOL_TIMEOUT

Duration

10s

quarkus.langchain4j.watsonx.built-in-tool.log-requests

Whether the built-in rest client should log requests.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_BUILT_IN_TOOL_LOG_REQUESTS

boolean

false

quarkus.langchain4j.watsonx.built-in-tool.log-responses

Whether the built-in rest client should log responses.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_BUILT_IN_TOOL_LOG_RESPONSES

boolean

false

quarkus.langchain4j.watsonx.built-in-tool.log-requests-curl

Whether the watsonx.ai client should log requests as cURL commands.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_BUILT_IN_TOOL_LOG_REQUESTS_CURL

boolean

false

quarkus.langchain4j.watsonx.built-in-tool.tavily-search.api-key

Tavily API key.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_BUILT_IN_TOOL_TAVILY_SEARCH_API_KEY

string

quarkus.langchain4j.watsonx.built-in-tool.python-interpreter.deployment-id

Deployment id.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_BUILT_IN_TOOL_PYTHON_INTERPRETER_DEPLOYMENT_ID

string

quarkus.langchain4j.watsonx.built-in-tool.rag-query.vector-index-ids

Vector index ids

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX_BUILT_IN_TOOL_RAG_QUERY_VECTOR_INDEX_IDS

list of string

Named model config

Type

Default

quarkus.langchain4j.watsonx."model-name".base-url

Specifies the base URL of the watsonx.ai API.

A list of all available URLs is provided in the IBM Watsonx.ai documentation at the this link.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__BASE_URL

string

quarkus.langchain4j.watsonx."model-name".api-key

IBM Cloud API key.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__API_KEY

string

quarkus.langchain4j.watsonx."model-name".timeout

Timeout for watsonx.ai calls.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TIMEOUT

Duration

60s

quarkus.langchain4j.watsonx."model-name".version

The version date for the API of the form YYYY-MM-DD.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__VERSION

string

quarkus.langchain4j.watsonx."model-name".space-id

The space that contains the resource.

Either space_id or project_id has to be given.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__SPACE_ID

string

quarkus.langchain4j.watsonx."model-name".project-id

The project that contains the resource.

Either space_id or project_id has to be given.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__PROJECT_ID

string

quarkus.langchain4j.watsonx."model-name".log-requests

Whether the watsonx.ai client should log requests.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__LOG_REQUESTS

boolean

false

quarkus.langchain4j.watsonx."model-name".log-responses

Whether the watsonx.ai client should log responses.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__LOG_RESPONSES

boolean

false

quarkus.langchain4j.watsonx."model-name".log-requests-curl

Whether the watsonx.ai client should log requests as cURL commands.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__LOG_REQUESTS_CURL

boolean

false

quarkus.langchain4j.watsonx."model-name".enable-integration

Whether to enable the integration. Defaults to true, which means requests are made to the watsonx.ai provider. Set to false to disable all requests.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__ENABLE_INTEGRATION

boolean

true

quarkus.langchain4j.watsonx."model-name".iam.base-url

Base URL of the IAM Authentication API.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__IAM_BASE_URL

URI

quarkus.langchain4j.watsonx."model-name".iam.timeout

Timeout for IAM authentication calls.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__IAM_TIMEOUT

Duration

10s

quarkus.langchain4j.watsonx."model-name".iam.grant-type

Grant type for the IAM Authentication API.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__IAM_GRANT_TYPE

string

quarkus.langchain4j.watsonx."model-name".text-extraction.cos-url

Base URL of the Cloud Object Storage API.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_EXTRACTION_COS_URL

string

required

quarkus.langchain4j.watsonx."model-name".text-extraction.document-reference.connection

The id of the connection asset that contains the credentials required to access the data.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_EXTRACTION_DOCUMENT_REFERENCE_CONNECTION

string

required

quarkus.langchain4j.watsonx."model-name".text-extraction.document-reference.bucket-name

The name of the bucket containing the input document.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_EXTRACTION_DOCUMENT_REFERENCE_BUCKET_NAME

string

required

quarkus.langchain4j.watsonx."model-name".text-extraction.results-reference.connection

The id of the connection asset used to store the extracted results.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_EXTRACTION_RESULTS_REFERENCE_CONNECTION

string

required

quarkus.langchain4j.watsonx."model-name".text-extraction.results-reference.bucket-name

The name of the bucket where the output files will be written.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_EXTRACTION_RESULTS_REFERENCE_BUCKET_NAME

string

required

quarkus.langchain4j.watsonx."model-name".text-extraction.log-requests

Whether text extraction requests should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_EXTRACTION_LOG_REQUESTS

boolean

false

quarkus.langchain4j.watsonx."model-name".text-extraction.log-responses

Whether text extraction responses should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_EXTRACTION_LOG_RESPONSES

boolean

false

quarkus.langchain4j.watsonx."model-name".text-extraction.log-requests-curl

Whether the watsonx.ai client should log requests as cURL commands.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_EXTRACTION_LOG_REQUESTS_CURL

boolean

false

quarkus.langchain4j.watsonx."model-name".text-classification.cos-url

Base URL of the Cloud Object Storage API.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_CLASSIFICATION_COS_URL

string

required

quarkus.langchain4j.watsonx."model-name".text-classification.document-reference.connection

The id of the connection asset that contains the credentials required to access the data.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_CLASSIFICATION_DOCUMENT_REFERENCE_CONNECTION

string

required

quarkus.langchain4j.watsonx."model-name".text-classification.document-reference.bucket-name

The name of the bucket containing the input document.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_CLASSIFICATION_DOCUMENT_REFERENCE_BUCKET_NAME

string

required

quarkus.langchain4j.watsonx."model-name".text-classification.log-requests

Whether text extraction requests should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_CLASSIFICATION_LOG_REQUESTS

boolean

false

quarkus.langchain4j.watsonx."model-name".text-classification.log-responses

Whether text extraction responses should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_CLASSIFICATION_LOG_RESPONSES

boolean

false

quarkus.langchain4j.watsonx."model-name".text-classification.log-requests-curl

Whether the watsonx.ai client should log requests as cURL commands.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__TEXT_CLASSIFICATION_LOG_REQUESTS_CURL

boolean

false

quarkus.langchain4j.watsonx."model-name".chat-model.model-name

Specifies the model to use for the chat completion.

A list of all available models is provided in the IBM watsonx.ai documentation at the this link.

To use a model, locate the API model ID column in the table and copy the corresponding model ID.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_MODEL_NAME

string

ibm/granite-4-h-small

quarkus.langchain4j.watsonx."model-name".chat-model.tool-choice

Specifies how the model should choose which tool to call during a request.

This value can be:

auto: The model decides whether and which tool to call automatically.
required: The model must call one of the available tools.

If toolChoiceName is set, this value is ignored.

Setting this value influences the tool-calling behavior of the model when no specific tool is required.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_TOOL_CHOICE

auto, required, none

quarkus.langchain4j.watsonx."model-name".chat-model.tool-choice-name

Specifies the name of a specific tool that the model must call.

When set, the model will be forced to call the specified tool. The name must exactly match one of the available tools defined for the service.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_TOOL_CHOICE_NAME

string

quarkus.langchain4j.watsonx."model-name".chat-model.frequency-penalty

Positive values penalize new tokens based on their existing frequency in the generated text, reducing the likelihood of the model repeating the same lines verbatim.

Possible values: -2 < value < 2

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_FREQUENCY_PENALTY

double

0

quarkus.langchain4j.watsonx."model-name".chat-model.logprobs

Specifies whether to return the log probabilities of the output tokens.

If set to true, the response will include the log probability of each output token in the content of the message.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_LOGPROBS

boolean

false

quarkus.langchain4j.watsonx."model-name".chat-model.top-logprobs

Possible values: 0 ≤ value ≤ 20

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_TOP_LOGPROBS

int

quarkus.langchain4j.watsonx."model-name".chat-model.max-output-tokens

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_MAX_OUTPUT_TOKENS

int

1024

quarkus.langchain4j.watsonx."model-name".chat-model.presence-penalty

Applies a penalty to new tokens based on whether they already appear in the generated text so far, encouraging the model to introduce new topics rather than repeat itself.

Possible values: -2 < value < 2

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_PRESENCE_PENALTY

double

0

quarkus.langchain4j.watsonx."model-name".chat-model.seed

Random number generator seed to use in sampling mode for experimental repeatability.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_SEED

int

quarkus.langchain4j.watsonx."model-name".chat-model.stop

Defines one or more stop sequences that will cause the model to stop generating further tokens if any of them are encountered in the output.

This allows control over where the model should end its response. If a stop sequence is encountered before the minimum number of tokens has been generated, it will be ignored.

Possible values: 0 ≤ number of items ≤ 4

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_STOP

list of string

quarkus.langchain4j.watsonx."model-name".chat-model.temperature

Specifies the sampling temperature to use in the generation process.

Higher values (e.g. 0.8) make the output more random and diverse, while lower values (e.g. 0.2) make the output more focused and deterministic.

Possible values: 0 < value < 2

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_TEMPERATURE

double

${quarkus.langchain4j.temperature:1.0}

quarkus.langchain4j.watsonx."model-name".chat-model.top-p

Possible values: 0 < value < 1

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_TOP_P

double

1

quarkus.langchain4j.watsonx."model-name".chat-model.response-format

Specifies the desired format for the model’s output.

Allowable values: [text, json, json_schema]

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_RESPONSE_FORMAT

text, json, json-schema

quarkus.langchain4j.watsonx."model-name".chat-model.guided-choice

Specifies a set of allowed output choices.

When this parameter is set, the model is constrained to return exactly one of the provided choices.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_GUIDED_CHOICE

list of string

quarkus.langchain4j.watsonx."model-name".chat-model.guided-grammar

Constrains the model output to follow a context-free grammar.

If specified, the generated output will conform to the defined grammar.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_GUIDED_GRAMMAR

string

quarkus.langchain4j.watsonx."model-name".chat-model.guided-regex

Constrains the model output to match a regular expression pattern.

If specified, the generated output must conform to the provided regex.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_GUIDED_REGEX

string

quarkus.langchain4j.watsonx."model-name".chat-model.length-penalty

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_LENGTH_PENALTY

double

quarkus.langchain4j.watsonx."model-name".chat-model.repetition-penalty

Sets the repetition penalty to be applied during text generation. This penalty helps to discourage the model from repeating the same words or phrases too often.

The penalty value should be greater than 1.0 for repetition discouragement. A value of 1.0 means no penalty, and values above 1.0 increase the strength of the penalty.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_REPETITION_PENALTY

double

quarkus.langchain4j.watsonx."model-name".chat-model.log-requests

Whether chat model requests should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_LOG_REQUESTS

boolean

false

quarkus.langchain4j.watsonx."model-name".chat-model.log-responses

Whether chat model responses should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_LOG_RESPONSES

boolean

false

quarkus.langchain4j.watsonx."model-name".chat-model.log-requests-curl

Whether the watsonx.ai client should log requests as cURL commands.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_LOG_REQUESTS_CURL

boolean

false

quarkus.langchain4j.watsonx."model-name".chat-model.thinking.enabled

Enables or disables reasoning.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_THINKING_ENABLED

boolean

quarkus.langchain4j.watsonx."model-name".chat-model.thinking.tags.think

The XML-like tag enclosing the model’s internal reasoning.

Example: <think> … </think>

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_THINKING_TAGS_THINK

string

required

quarkus.langchain4j.watsonx."model-name".chat-model.thinking.tags.response

The XML-like tag enclosing the model’s final response.

Optional — if not defined, all text outside the reasoning tag is treated as the response.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_THINKING_TAGS_RESPONSE

string

quarkus.langchain4j.watsonx."model-name".chat-model.thinking.effort

Controls the reasoning effort level for models that separate reasoning and response automatically.

Example values: LOW, MEDIUM, HIGH.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_THINKING_EFFORT

low, medium, high

quarkus.langchain4j.watsonx."model-name".chat-model.thinking.include-reasoning

Determines whether the reasoning portion returned by the model should be included in the final response provided to the application.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__CHAT_MODEL_THINKING_INCLUDE_REASONING

boolean

quarkus.langchain4j.watsonx."model-name".embedding-model.model-name

Specifies the ID of the model to be used.

A list of all available models is provided in the IBM watsonx.ai documentation at the this link.

To use a model, locate the API model ID column in the table and copy the corresponding model ID.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__EMBEDDING_MODEL_MODEL_NAME

string

ibm/granite-embedding-278m-multilingual

quarkus.langchain4j.watsonx."model-name".embedding-model.log-requests

Whether embedding model requests should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__EMBEDDING_MODEL_LOG_REQUESTS

boolean

false

quarkus.langchain4j.watsonx."model-name".embedding-model.log-responses

Whether embedding model responses should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__EMBEDDING_MODEL_LOG_RESPONSES

boolean

false

quarkus.langchain4j.watsonx."model-name".embedding-model.log-requests-curl

Whether the watsonx.ai client should log requests as cURL commands.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__EMBEDDING_MODEL_LOG_REQUESTS_CURL

boolean

false

quarkus.langchain4j.watsonx."model-name".scoring-model.model-name

The id of the model to be used.

All available models are listed in the IBM Watsonx.ai documentation at the link: following link.

To use a model, locate the API model_id column in the table and copy the corresponding model ID.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__SCORING_MODEL_MODEL_NAME

string

cross-encoder/ms-marco-minilm-l-12-v2

quarkus.langchain4j.watsonx."model-name".scoring-model.log-requests

Whether embedding model requests should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__SCORING_MODEL_LOG_REQUESTS

boolean

false

quarkus.langchain4j.watsonx."model-name".scoring-model.log-responses

Whether embedding model responses should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__SCORING_MODEL_LOG_RESPONSES

boolean

false

quarkus.langchain4j.watsonx."model-name".scoring-model.log-requests-curl

Whether the watsonx.ai client should log requests as cURL commands.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__SCORING_MODEL_LOG_REQUESTS_CURL

boolean

false

quarkus.langchain4j.watsonx."model-name".moderation-model.pii.enabled

Indicates whether the PII moderation model is enabled.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__MODERATION_MODEL_PII_ENABLED

boolean

required

quarkus.langchain4j.watsonx."model-name".moderation-model.hap.enabled

Indicates whether the HAP moderation model is enabled.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__MODERATION_MODEL_HAP_ENABLED

boolean

required

quarkus.langchain4j.watsonx."model-name".moderation-model.hap.threshold

Threshold value for HAP moderation model.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__MODERATION_MODEL_HAP_THRESHOLD

double

quarkus.langchain4j.watsonx."model-name".moderation-model.granite-guardian.enabled

Indicates whether the GraniteGuardian moderation model is enabled.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__MODERATION_MODEL_GRANITE_GUARDIAN_ENABLED

boolean

required

quarkus.langchain4j.watsonx."model-name".moderation-model.granite-guardian.threshold

Threshold value for Granite Guardian moderation model.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__MODERATION_MODEL_GRANITE_GUARDIAN_THRESHOLD

double

quarkus.langchain4j.watsonx."model-name".moderation-model.log-requests

Whether moderation model requests should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__MODERATION_MODEL_LOG_REQUESTS

boolean

false

quarkus.langchain4j.watsonx."model-name".moderation-model.log-responses

Whether moderation model responses should be logged.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__MODERATION_MODEL_LOG_RESPONSES

boolean

false

quarkus.langchain4j.watsonx."model-name".moderation-model.log-requests-curl

Whether the watsonx.ai client should log requests as cURL commands.

Environment variable: QUARKUS_LANGCHAIN4J_WATSONX__MODEL_NAME__MODERATION_MODEL_LOG_REQUESTS_CURL

boolean

false

About the Duration format

To write duration values, use the standard java.time.Duration format. See the Duration#parse() Java API documentation for more information.

You can also use a simplified format, starting with a number:

If the value is only a number, it represents time in seconds.
If the value is a number followed by ms, it represents time in milliseconds.

In other cases, the simplified format is translated to the java.time.Duration format for parsing:

If the value is a number followed by h, m, or s, it is prefixed with PT.
If the value is a number followed by d, it is prefixed with P.