Anthropic Chat Models

Anthropic is an AI safety and research company. It provides the Claude family of Large Language Models, designed with constitutional AI principles for safe and controllable output.

This extension allows you to integrate Claude models into your Quarkus applications via the Anthropic API.

Prerequisites

To use Anthropic models, you need an API key. Follow the steps on the Claude documentation portal to request access and retrieve your credentials.

Dependency

To enable Anthropic LLM integration in your project, add the following dependency:

<dependency>
  <groupId>io.quarkiverse.langchain4j</groupId>
  <artifactId>quarkus-langchain4j-anthropic</artifactId>
  <version>1.8.4</version>
</dependency>

Even better, if you use the Quarkus platform BOM (the default for generated projects), add the Quarkus LangChain4j BOM and all dependency versions will align:

    <dependencyManagement>
        <dependencies>
            <dependency>
                <groupId>${quarkus.platform.group-id}</groupId>
                <artifactId>${quarkus.platform.artifact-id}</artifactId>
                <version>${quarkus.platform.version}</version>
                <type>pom</type>
                <scope>import</scope>
            </dependency>
            <dependency>
                <groupId>${quarkus.platform.group-id}</groupId>
                <artifactId>quarkus-langchain4j-bom</artifactId> (1)
                <version>${quarkus.platform.version}</version> (2)
                <type>pom</type>
                <scope>import</scope>
            </dependency>
        </dependencies>
    </dependencyManagement>

    <dependencies>
      <dependency>
        <groupId>io.quarkiverse.langchain4j</groupId>
        <artifactId>quarkus-langchain4j-anthropic</artifactId>
        (3)
      </dependency>
    </dependencies>
1 In your dependencyManagement section, add the quarkus-langchain4j-bom
2 Inherit the version from your platform version
3 Voilà, no need for version alignment anymore

If no other LLM extension is installed, AI Services will automatically use the configured Anthropic chat model.

Configuration

Set your API key in the application.properties file:

quarkus.langchain4j.anthropic.api-key=...

You can also set it using the environment variable:

QUARKUS_LANGCHAIN4J_ANTHROPIC_API_KEY=...

By default, the extension uses the claude-3-haiku-20240307 model. You can specify a different model explicitly using:

quarkus.langchain4j.anthropic.chat-model.model-name=claude-opus-4-20250514

Refer to Anthropic’s model catalog for available versions, such as:

  • claude-sonnet-4-20250514

  • claude-3-opus-20240229

  • claude-3-haiku-20240307
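Putting the key and the model name together, a minimal application.properties might look like this (reading the key from an environment variable, as shown here, is one common option):

quarkus.langchain4j.anthropic.api-key=${ANTHROPIC_API_KEY}
quarkus.langchain4j.anthropic.chat-model.model-name=claude-sonnet-4-20250514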

Usage

You can inject the chat model directly:

@Inject ChatModel chatModel;

Or declare an AI service interface:

@RegisterAiService
public interface Assistant {
    String chat(String input);
}
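An AI service declared this way is a regular CDI bean, so it can be injected wherever needed. The REST resource below is an illustrative sketch (the class name and path are not part of the extension):

@Path("/assistant")
public class AssistantResource {

    @Inject
    Assistant assistant;

    @GET
    public String ask(@QueryParam("q") String question) {
        return assistant.chat(question);
    }
}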

Advanced Tool Use Features

These features enable more efficient tool orchestration, improved accuracy, and reduced token consumption.

All features are optional and can be enabled independently.

Tool Search Tool

Anthropic’s Tool Search Tool allows Claude to search for and access thousands of tools without consuming its context window.

The Tool Search Tool lets Claude dynamically discover tools instead of loading all definitions upfront. You provide all your tool definitions to the API, but mark tools with defer_loading: true to make them discoverable on-demand. Deferred tools aren’t loaded into Claude’s context initially. Claude only sees the Tool Search Tool itself plus any tools with defer_loading: false (your most critical, frequently-used tools).

When Claude needs specific capabilities, it searches for relevant tools. The Tool Search Tool returns references to matching tools, which get expanded into full definitions in Claude’s context.

To enable this feature, set the following property:

quarkus.langchain4j.anthropic.chat-model.tool-search.enabled=true

You can optionally specify the search type (the default is regex; bm25 is also available):

quarkus.langchain4j.anthropic.chat-model.tool-search.type=bm25

Under the hood, when Tool Search is enabled, the Quarkus extension automatically:

  • Adds the AnthropicServerTool (either regex or bm25 variant) to the model request.

  • Configures the toolMetadataKeysToSend to include defer_loading, allowing the model to see which tools can be discovered on-demand.

  • Includes the required advanced-tool-use-2025-11-20 beta header in all requests.

When tool search is enabled, you can mark specific tools to be discovered on-demand using the defer_loading metadata:

@Tool(name = "get_weather", value = "Get the weather for a city", metadata = "{\"defer_loading\": true}")
public String getWeather(String city) {
    // ...
}

Programmatic Tool Calling

Anthropic’s Programmatic Tool Calling allows Claude to invoke tools in a code execution environment, reducing the impact on the model’s context window.

Programmatic Tool Calling enables Claude to orchestrate tools through code rather than through individual API round-trips. Instead of Claude requesting tools one at a time with each result being returned to its context, Claude writes code that calls multiple tools, processes their outputs, and controls what information actually enters its context window.

Claude excels at writing code, and by letting it express orchestration logic in Python rather than through natural-language tool invocations, you get more reliable, precise control flow. Loops, conditionals, data transformations, and error handling are all explicit in code rather than implicit in Claude’s reasoning.

To enable this feature, set the following property:

quarkus.langchain4j.anthropic.chat-model.programmatic-tool-calling.enabled=true

Under the hood, when Programmatic Tool Calling is enabled, the extension automatically:

  • Adds the Code Execution server tool

  • Sends the "allowed_callers" metadata key with tool definitions

  • Includes the required beta header in all requests

Tools that should be callable from generated code must include the allowed_callers metadata:

@Tool(
    name = "get_weather",
    value = "Get the weather for a city",
    metadata = "{\"allowed_callers\": [\"code_execution_20250825\"]}"
)
public String getWeather(String city) {
    // ...
}

Tool Use Examples

This feature provides a standard way to demonstrate how to use a given tool effectively.

Tool Use Examples let you provide sample tool calls directly in your tool definitions. Instead of relying on schema alone, you show Claude concrete usage patterns.

For example, format ambiguity: if a date is passed as a string, should it use "2024-11-06", "Nov 6, 2024", or "2024-11-06T00:00:00Z"?

To enable this feature, set the following property:

quarkus.langchain4j.anthropic.chat-model.tool-use-examples.enabled=true

Under the hood, when Tool Use Examples is enabled, the extension automatically:

  • Sends the "input_examples" metadata key with tool definitions

  • Includes the required beta header in all requests

Tools that include examples must define the input_examples metadata:

private static final String METADATA = """
        {
          "input_examples": [
            {
              "title": "Login page returns 500 error",
              "priority": "critical",
              "labels": ["bug", "authentication"],
              "date": "2024-11-06"
            },
            {
              "title": "Update API documentation",
              "date": "2024-12-01"
            },
            {
              "title": "Brainstorming session"
            }
          ]
        }
        """;

@Tool(name = "create_ticket", value = "Create a support ticket", metadata = METADATA)
public String createTicket(String title, String priority, String date) {
    return "TICKET-123";
}

Configuration Reference

Configuration property fixed at build time - All other configuration properties are overridable at runtime

Configuration property

Type

Default

Whether the model should be enabled

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_CHAT_MODEL_ENABLED

boolean

true

Base URL of the Anthropic API

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_BASE_URL

string

https://api.anthropic.com/v1/

Anthropic API key

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_API_KEY

string

dummy

The Anthropic version

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_VERSION

string

2023-06-01

If set to true, the "anthropic-beta" header will never be sent

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_DISABLE_BETA_HEADER

boolean

false

Timeout for Anthropic calls

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_TIMEOUT

Duration 

10s

Whether the Anthropic client should log requests

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_LOG_REQUESTS

boolean

false

Whether the Anthropic client should log responses

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_LOG_RESPONSES

boolean

false

Whether the Anthropic client should log requests as cURL commands

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_LOG_REQUESTS_CURL

boolean

false

Whether to enable the integration. Defaults to true, which means requests are made to the Anthropic provider. Set to false to disable all requests.

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_ENABLE_INTEGRATION

boolean

true

Model name to use

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_CHAT_MODEL_MODEL_NAME

string

claude-3-haiku-20240307

What sampling temperature to use, between 0.0 and 1.0. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic.

It is generally recommended to set this or the top-k property but not both.

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_CHAT_MODEL_TEMPERATURE

double

0.7

The maximum number of tokens to generate in the completion.

The token count of your prompt plus max_tokens cannot exceed the model’s context length

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_CHAT_MODEL_MAX_TOKENS

int

1024

Double (0.0-1.0). Nucleus sampling, where the model considers the results of the tokens with top_p probability mass. So 0.1 means only the tokens comprising the top 10% probability mass are considered.

It is generally recommended to set this or the temperature property but not both.

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_CHAT_MODEL_TOP_P

double

1.0

Reduces the probability of generating nonsense. A higher value (e.g. 100) will give more diverse answers, while a lower value (e.g. 10) will be more conservative

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_CHAT_MODEL_TOP_K

int

40

The maximum number of times to retry. 1 means exactly one attempt, with retrying disabled.

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_CHAT_MODEL_MAX_RETRIES

int

1

The custom text sequences that will cause the model to stop generating

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_CHAT_MODEL_STOP_SEQUENCES

list of string

Whether chat model requests should be logged

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_CHAT_MODEL_LOG_REQUESTS

boolean

false

Whether chat model responses should be logged

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_CHAT_MODEL_LOG_RESPONSES

boolean

false

Cache system messages to reduce costs for repeated prompts. Requires minimum 1024 tokens (Claude Opus/Sonnet) or 2048-4096 tokens (Haiku). Supported models: Claude Opus 4.1, Sonnet 4.5, Haiku 4.5, and later models.

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_CHAT_MODEL_CACHE_SYSTEM_MESSAGES

boolean

false

Cache tool definitions to reduce costs. Requires minimum 1024 tokens (Claude Opus/Sonnet) or 2048-4096 tokens (Haiku). Supported models: Claude Opus 4.1, Sonnet 4.5, Haiku 4.5, and later models.

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_CHAT_MODEL_CACHE_TOOLS

boolean

false

The thinking type to enable Claude’s reasoning process

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_CHAT_MODEL_THINKING_TYPE

string

The token budget for the model’s thinking process

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_CHAT_MODEL_THINKING_BUDGET_TOKENS

int

Whether thinking results should be returned in the response

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_CHAT_MODEL_THINKING_RETURN_THINKING

boolean

false

Whether previously stored thinking should be sent in follow-up requests

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_CHAT_MODEL_THINKING_SEND_THINKING

boolean

true

Enable interleaved thinking for Claude 4 models, allowing reasoning between tool calls. Requires Claude 4 model (e.g., claude-opus-4-20250514) and thinking.type: enabled.

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_CHAT_MODEL_THINKING_INTERLEAVED

boolean

false

Enable Anthropic’s Tool Search Tool for on-demand tool discovery. When enabled, this automatically adds the tool search server tool, sets the required beta header, and enables the "defer_loading" tool metadata key. Tools annotated with @Tool(metadata = "{\"defer_loading\": true}") will be discovered on demand instead of loaded upfront.

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_CHAT_MODEL_TOOL_SEARCH_ENABLED

boolean

false

The type of tool search to use. Available types: "regex" (default) or "bm25".

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_CHAT_MODEL_TOOL_SEARCH_TYPE

string

regex

Enable Anthropic’s Programmatic Tool Calling via the Code Execution server tool. When enabled, this automatically adds the code execution server tool, the "allowed_callers" key is sent with tool definitions, and the required beta header is set. Claude can orchestrate multiple tool calls from within generated Python code, keeping intermediate results out of the context window rather than accumulating them in the conversation, significantly reducing token consumption.

Tools that should be callable from code must include: @Tool(metadata = "{\"allowed_callers\": [\"code_execution_20250825\"]}")

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_CHAT_MODEL_PROGRAMMATIC_TOOL_CALLING_ENABLED

boolean

false

Enable Anthropic’s Tool Use Examples feature. When enabled, the "input_examples" key is sent with tool definitions, and the required beta header is set. Providing concrete input examples alongside tool schemas helps Claude learn correct parameter usage, formats, and conventions that cannot be expressed in JSON Schema alone.

Tools with examples must include: @Tool(metadata = "{\"input_examples\": [{…}, …]}")

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC_CHAT_MODEL_TOOL_USE_EXAMPLES_ENABLED

boolean

false

Named model config

Type

Default

Base URL of the Anthropic API

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__BASE_URL

string

https://api.anthropic.com/v1/

Anthropic API key

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__API_KEY

string

dummy

The Anthropic version

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__VERSION

string

2023-06-01

If set to true, the "anthropic-beta" header will never be sent

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__DISABLE_BETA_HEADER

boolean

false

Timeout for Anthropic calls

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__TIMEOUT

Duration 

10s

Whether the Anthropic client should log requests

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__LOG_REQUESTS

boolean

false

Whether the Anthropic client should log responses

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__LOG_RESPONSES

boolean

false

Whether the Anthropic client should log requests as cURL commands

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__LOG_REQUESTS_CURL

boolean

false

Whether to enable the integration. Defaults to true, which means requests are made to the Anthropic provider. Set to false to disable all requests.

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__ENABLE_INTEGRATION

boolean

true

Model name to use

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__CHAT_MODEL_MODEL_NAME

string

claude-3-haiku-20240307

What sampling temperature to use, between 0.0 and 1.0. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic.

It is generally recommended to set this or the top-k property but not both.

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__CHAT_MODEL_TEMPERATURE

double

0.7

The maximum number of tokens to generate in the completion.

The token count of your prompt plus max_tokens cannot exceed the model’s context length

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__CHAT_MODEL_MAX_TOKENS

int

1024

Double (0.0-1.0). Nucleus sampling, where the model considers the results of the tokens with top_p probability mass. So 0.1 means only the tokens comprising the top 10% probability mass are considered.

It is generally recommended to set this or the temperature property but not both.

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__CHAT_MODEL_TOP_P

double

1.0

Reduces the probability of generating nonsense. A higher value (e.g. 100) will give more diverse answers, while a lower value (e.g. 10) will be more conservative

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__CHAT_MODEL_TOP_K

int

40

The maximum number of times to retry. 1 means exactly one attempt, with retrying disabled.

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__CHAT_MODEL_MAX_RETRIES

int

1

The custom text sequences that will cause the model to stop generating

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__CHAT_MODEL_STOP_SEQUENCES

list of string

Whether chat model requests should be logged

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__CHAT_MODEL_LOG_REQUESTS

boolean

false

Whether chat model responses should be logged

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__CHAT_MODEL_LOG_RESPONSES

boolean

false

Cache system messages to reduce costs for repeated prompts. Requires minimum 1024 tokens (Claude Opus/Sonnet) or 2048-4096 tokens (Haiku). Supported models: Claude Opus 4.1, Sonnet 4.5, Haiku 4.5, and later models.

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__CHAT_MODEL_CACHE_SYSTEM_MESSAGES

boolean

false

Cache tool definitions to reduce costs. Requires minimum 1024 tokens (Claude Opus/Sonnet) or 2048-4096 tokens (Haiku). Supported models: Claude Opus 4.1, Sonnet 4.5, Haiku 4.5, and later models.

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__CHAT_MODEL_CACHE_TOOLS

boolean

false

The thinking type to enable Claude’s reasoning process

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__CHAT_MODEL_THINKING_TYPE

string

The token budget for the model’s thinking process

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__CHAT_MODEL_THINKING_BUDGET_TOKENS

int

Whether thinking results should be returned in the response

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__CHAT_MODEL_THINKING_RETURN_THINKING

boolean

false

Whether previously stored thinking should be sent in follow-up requests

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__CHAT_MODEL_THINKING_SEND_THINKING

boolean

true

Enable interleaved thinking for Claude 4 models, allowing reasoning between tool calls. Requires Claude 4 model (e.g., claude-opus-4-20250514) and thinking.type: enabled.

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__CHAT_MODEL_THINKING_INTERLEAVED

boolean

false

Enable Anthropic’s Tool Search Tool for on-demand tool discovery. When enabled, this automatically adds the tool search server tool, sets the required beta header, and enables the "defer_loading" tool metadata key. Tools annotated with @Tool(metadata = "{\"defer_loading\": true}") will be discovered on demand instead of loaded upfront.

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__CHAT_MODEL_TOOL_SEARCH_ENABLED

boolean

false

The type of tool search to use. Available types: "regex" (default) or "bm25".

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__CHAT_MODEL_TOOL_SEARCH_TYPE

string

regex

Enable Anthropic’s Programmatic Tool Calling via the Code Execution server tool. When enabled, this automatically adds the code execution server tool, the "allowed_callers" key is sent with tool definitions, and the required beta header is set. Claude can orchestrate multiple tool calls from within generated Python code, keeping intermediate results out of the context window rather than accumulating them in the conversation, significantly reducing token consumption.

Tools that should be callable from code must include: @Tool(metadata = "{\"allowed_callers\": [\"code_execution_20250825\"]}")

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__CHAT_MODEL_PROGRAMMATIC_TOOL_CALLING_ENABLED

boolean

false

Enable Anthropic’s Tool Use Examples feature. When enabled, the "input_examples" key is sent with tool definitions, and the required beta header is set. Providing concrete input examples alongside tool schemas helps Claude learn correct parameter usage, formats, and conventions that cannot be expressed in JSON Schema alone.

Tools with examples must include: @Tool(metadata = "{\"input_examples\": [{…}, …]}")

Environment variable: QUARKUS_LANGCHAIN4J_ANTHROPIC__MODEL_NAME__CHAT_MODEL_TOOL_USE_EXAMPLES_ENABLED

boolean

false

About the Duration format

To write duration values, use the standard java.time.Duration format. See the Duration#parse() Java API documentation for more information.

You can also use a simplified format, starting with a number:

  • If the value is only a number, it represents time in seconds.

  • If the value is a number followed by ms, it represents time in milliseconds.

In other cases, the simplified format is translated to the java.time.Duration format for parsing:

  • If the value is a number followed by h, m, or s, it is prefixed with PT.

  • If the value is a number followed by d, it is prefixed with P.
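For example, each of the following is a valid value for the timeout property (the variants are shown independently; the comments are explanatory):

# 30 seconds (a plain number is interpreted as seconds)
quarkus.langchain4j.anthropic.timeout=30
# 500 milliseconds
quarkus.langchain4j.anthropic.timeout=500ms
# 2 minutes (translated to PT2M before parsing)
quarkus.langchain4j.anthropic.timeout=2m
# full java.time.Duration syntax
quarkus.langchain4j.anthropic.timeout=PT1M30S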