Mistral Embedding Models
Mistral provides highly efficient embedding models that can be used to build RAG systems and perform vector similarity search.
Prerequisites
See Mistral Chat Models for prerequisites, including obtaining an API key from the Mistral platform.
Add this dependency:
<dependency>
    <groupId>io.quarkiverse.langchain4j</groupId>
    <artifactId>quarkus-langchain4j-mistral-ai</artifactId>
    <version>1.0.2</version>
</dependency>
Configuration
To enable the embedding model, configure:
quarkus.langchain4j.mistralai.api-key=...
quarkus.langchain4j.mistralai.embedding-model.model-name=mistral-embed
Available models include mistral-embed and codestral-embed; see https://docs.mistral.ai/platform/endpoints/#embeddings for the full list.
Example injection:
@Inject EmbeddingModel embeddingModel;
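As a minimal sketch of how the injected model is used (the EmbeddingService class and its method names below are illustrative, not part of the extension; embed() and CosineSimilarity are standard LangChain4j APIs):

import dev.langchain4j.data.embedding.Embedding;
import dev.langchain4j.model.embedding.EmbeddingModel;
import dev.langchain4j.store.embedding.CosineSimilarity;
import jakarta.enterprise.context.ApplicationScoped;
import jakarta.inject.Inject;

@ApplicationScoped
public class EmbeddingService {

    // Backed by the configured mistral-embed model
    @Inject
    EmbeddingModel embeddingModel;

    // Turn a text into its embedding vector
    public float[] embed(String text) {
        Embedding embedding = embeddingModel.embed(text).content();
        return embedding.vector();
    }

    // Cosine similarity between two texts, in [-1, 1]
    public double similarity(String first, String second) {
        return CosineSimilarity.between(
                embeddingModel.embed(first).content(),
                embeddingModel.embed(second).content());
    }
}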
Configuration properties fixed at build time are listed first; all other configuration properties are overridable at runtime. Environment variable names follow the standard Quarkus convention: uppercase the property name and replace each dot and dash with an underscore (for example, quarkus.langchain4j.mistralai.api-key becomes QUARKUS_LANGCHAIN4J_MISTRALAI_API_KEY).

Build-time properties:

Configuration property | Description | Type | Default
---|---|---|---
quarkus.langchain4j.mistralai.chat-model.enabled | Whether the chat model should be enabled | boolean | true
quarkus.langchain4j.mistralai.embedding-model.enabled | Whether the embedding model should be enabled | boolean | true

Runtime properties:

Configuration property | Description | Type | Default
---|---|---|---
quarkus.langchain4j.mistralai.base-url | Base URL of Mistral API | string | https://api.mistral.ai/v1/
quarkus.langchain4j.mistralai.api-key | Mistral API key | string |
quarkus.langchain4j.mistralai.timeout | Timeout for Mistral calls | Duration | 10s
quarkus.langchain4j.mistralai.chat-model.model-name | Model name to use | string | mistral-tiny
quarkus.langchain4j.mistralai.chat-model.temperature | What sampling temperature to use, between 0.0 and 1.0. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic. It is generally recommended to set this or top-p, but not both | double | 0.7
quarkus.langchain4j.mistralai.chat-model.max-tokens | The maximum number of tokens to generate in the completion. The token count of your prompt plus max-tokens cannot exceed the model's context length | int |
quarkus.langchain4j.mistralai.chat-model.top-p | Nucleus sampling (0.0-1.0), where the model considers the results of the tokens with top_p probability mass. So 0.1 means only the tokens comprising the top 10% probability mass are considered. It is generally recommended to set this or temperature, but not both | double | 1.0
quarkus.langchain4j.mistralai.chat-model.safe-prompt | Whether to inject a safety prompt before all conversations | boolean |
quarkus.langchain4j.mistralai.chat-model.random-seed | The seed to use for random sampling. If set, different calls will generate deterministic results | int |
quarkus.langchain4j.mistralai.chat-model.log-requests | Whether chat model requests should be logged | boolean | false
quarkus.langchain4j.mistralai.chat-model.log-responses | Whether chat model responses should be logged | boolean | false
quarkus.langchain4j.mistralai.embedding-model.model-name | Model name to use | string | mistral-embed
quarkus.langchain4j.mistralai.embedding-model.log-requests | Whether embedding model requests should be logged | boolean | false
quarkus.langchain4j.mistralai.embedding-model.log-responses | Whether embedding model responses should be logged | boolean | false
quarkus.langchain4j.mistralai.log-requests | Whether the Mistral client should log requests | boolean | false
quarkus.langchain4j.mistralai.log-responses | Whether the Mistral client should log responses | boolean | false
quarkus.langchain4j.mistralai.enable-integration | Whether to enable the integration. Defaults to true; set to false to disable all requests to Mistral | boolean | true
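For instance, a runtime configuration that raises the client timeout and turns on embedding request logging might look like this (a sketch; the values are illustrative, and ${MISTRAL_API_KEY} is standard Quarkus configuration expansion of an environment variable):

quarkus.langchain4j.mistralai.api-key=${MISTRAL_API_KEY}
quarkus.langchain4j.mistralai.timeout=30s
quarkus.langchain4j.mistralai.embedding-model.log-requests=true
quarkus.langchain4j.mistralai.embedding-model.log-responses=true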
The same settings are also available per named model configuration, under quarkus.langchain4j.mistralai."model-name".*, where "model-name" identifies a specific model configuration. Each property below has the same meaning, type, and default as its counterpart above, but applies only to that model name (see the sketch after this table):

Configuration property | Type | Default
---|---|---
quarkus.langchain4j.mistralai."model-name".base-url | string | https://api.mistral.ai/v1/
quarkus.langchain4j.mistralai."model-name".api-key | string |
quarkus.langchain4j.mistralai."model-name".timeout | Duration | 10s
quarkus.langchain4j.mistralai."model-name".chat-model.model-name | string | mistral-tiny
quarkus.langchain4j.mistralai."model-name".chat-model.temperature | double | 0.7
quarkus.langchain4j.mistralai."model-name".chat-model.max-tokens | int |
quarkus.langchain4j.mistralai."model-name".chat-model.top-p | double | 1.0
quarkus.langchain4j.mistralai."model-name".chat-model.safe-prompt | boolean |
quarkus.langchain4j.mistralai."model-name".chat-model.random-seed | int |
quarkus.langchain4j.mistralai."model-name".chat-model.log-requests | boolean | false
quarkus.langchain4j.mistralai."model-name".chat-model.log-responses | boolean | false
quarkus.langchain4j.mistralai."model-name".embedding-model.model-name | string | mistral-embed
quarkus.langchain4j.mistralai."model-name".embedding-model.log-requests | boolean | false
quarkus.langchain4j.mistralai."model-name".embedding-model.log-responses | boolean | false
quarkus.langchain4j.mistralai."model-name".log-requests | boolean | false
quarkus.langchain4j.mistralai."model-name".log-responses | boolean | false
quarkus.langchain4j.mistralai."model-name".enable-integration | boolean | true
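As a sketch of named configuration, assuming a model configuration named code-embedder (the name is illustrative; in a properties file the quotes around the name are only needed when it contains dots):

# defaults for the unnamed configuration
quarkus.langchain4j.mistralai.api-key=${MISTRAL_API_KEY}
quarkus.langchain4j.mistralai.embedding-model.model-name=mistral-embed

# overrides that apply only to the "code-embedder" configuration
quarkus.langchain4j.mistralai.code-embedder.embedding-model.model-name=codestral-embed
quarkus.langchain4j.mistralai.code-embedder.timeout=60s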
About the Duration format
To write duration values, use the standard java.time.Duration format (see the Duration#parse() Javadoc for details). You can also use a simplified format, starting with a number:
- If the value is only a number, it represents time in seconds.
- If the value is a number followed by ms, it represents time in milliseconds.
In other cases, the simplified format is translated to the java.time.Duration format for parsing:
- If the value is a number followed by h, m, or s, it is prefixed with PT.
- If the value is a number followed by d, it is prefixed with P.
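For example, the client timeout from the tables above can be written in either notation; both lines below configure the same 30-second timeout (the value is illustrative):

# simplified format (translated to PT30S)
quarkus.langchain4j.mistralai.timeout=30s
# equivalent standard java.time.Duration format
quarkus.langchain4j.mistralai.timeout=PT30S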