Optional api
Optional cache
Optional callback
Optional callbacks
Optional concurrency
Optional folderID: Yandex Cloud Folder ID.
Optional iam: Yandex Cloud IAM token for a service or user account with the ai.languageModels.user role.
Optional max: The maximum number of concurrent calls that can be made. Defaults to Infinity, which means no limit.
Optional max: The maximum number of retries that can be made for a single call, with an exponential backoff between each attempt. Defaults to 6.
Optional max: The maximum total number of tokens used by the input prompt and the generated response combined.
Optional metadata
Optional model: Model name to use.
Optional modelURI: Model URI to use.
Optional model: Model version to use.
Optional on: Custom handler for failed attempts. It takes the originally thrown error object as input and should itself throw an error if the input error is not retryable.
Optional tags
Optional temperature: The sampling temperature to use. Should be a double between 0 and 1, inclusive.
Optional verbose
Yandex Cloud API key for a service account with the ai.languageModels.user role.
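
The retry behaviour described above (a bounded number of retries, exponential backoff between attempts, and a handler that can stop retrying by throwing) can be sketched as follows. This is a minimal illustration and not the library's implementation. The names `backoffDelay`, `callWithRetry`, `FailedAttemptHandler`, and `baseMs` are hypothetical, and the 100 ms base delay and doubling factor are assumptions. The source states only that the backoff is exponential and that the default retry count is 6.

```typescript
// Hypothetical sketch: none of these names come from the library.
type FailedAttemptHandler = (error: unknown) => void;

// Delay before retry number `attempt` (0-based): baseMs, 2*baseMs, 4*baseMs, ...
// The doubling factor is an assumption.
function backoffDelay(attempt: number, baseMs: number): number {
  return baseMs * Math.pow(2, attempt);
}

const sleep = (ms: number): Promise<void> =>
  new Promise<void>((resolve) => setTimeout(resolve, ms));

async function callWithRetry<T>(
  fn: () => Promise<T>,
  maxRetries: number = 6, // default taken from the description above
  onFailedAttempt: FailedAttemptHandler = () => {},
  baseMs: number = 100, // assumed base delay
): Promise<T> {
  for (let attempt = 0; ; attempt++) {
    try {
      return await fn();
    } catch (error) {
      // The handler throws for non-retryable errors, which ends the loop early.
      onFailedAttempt(error);
      // All retries used up: surface the last error to the caller.
      if (attempt >= maxRetries) throw error;
      await sleep(backoffDelay(attempt, baseMs));
    }
  }
}
```

A handler passed as `onFailedAttempt` might, for example, rethrow authentication errors so that an invalid key fails immediately, while letting transient network errors fall through to the next attempt.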