Optional api: The API token to use for authentication. If not provided, it is read from the DEEPINFRA_API_TOKEN environment variable.
Optional batch: The maximum number of texts to embed in a single request. The DeepInfra API limits this to 1024.
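As a rough illustration of what a per-request batch limit implies, the sketch below splits a list of texts into batches no larger than the documented maximum of 1024. The names `chunkTexts`, `batchSize`, and `DEEPINFRA_MAX_BATCH` are hypothetical and chosen for this example only; this is not the library's own batching code.

```typescript
// Hypothetical helper for illustration; not part of the DeepInfra client.
// 1024 is the per-request maximum stated in the parameter description.
const DEEPINFRA_MAX_BATCH = 1024;

function chunkTexts(
  texts: string[],
  batchSize: number = DEEPINFRA_MAX_BATCH
): string[][] {
  if (!Number.isInteger(batchSize) || batchSize < 1) {
    throw new RangeError("batchSize must be a positive integer");
  }
  // Clamp the requested size to the documented API maximum.
  const size = Math.min(batchSize, DEEPINFRA_MAX_BATCH);
  const batches: string[][] = [];
  for (let i = 0; i < texts.length; i += size) {
    batches.push(texts.slice(i, i + size));
  }
  return batches;
}
```

Each returned batch would then correspond to one embedding request; a requested size above 1024 is clamped rather than rejected, which is a design choice of this sketch.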
Optional max: The maximum number of concurrent calls that can be made. Defaults to Infinity, which means no limit.
Optional max: The maximum number of retries that can be made for a single call, with an exponential backoff between each attempt. Defaults to 6.
Optional model: The model ID to use for generating embeddings. Default: sentence-transformers/clip-ViT-B-32
Optional on: Custom handler for failed attempts. It takes the originally thrown error object as input and should itself throw an error if the input error is not retryable.
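The interaction between the retry limit and the failed-attempt handler can be sketched as follows. This is a minimal illustration under stated assumptions, not the library's implementation: the names `callWithRetry`, `backoffDelayMs`, `maxRetries`, and `onFailedAttempt` are hypothetical, and the 1000 ms base delay with doubling is an assumed schedule (the source only says the backoff is exponential and the retry default is 6).

```typescript
// Hypothetical sketch; names and the delay schedule are assumptions.
type FailedAttemptHandler = (error: unknown) => void;

// Delay before the retry that follows failed attempt number `attempt` (0-based).
// Assumed schedule: baseMs, 2*baseMs, 4*baseMs, ...
function backoffDelayMs(attempt: number, baseMs: number = 1000): number {
  return baseMs * Math.pow(2, attempt);
}

async function callWithRetry<T>(
  fn: () => Promise<T>,
  maxRetries: number = 6,
  onFailedAttempt: FailedAttemptHandler = () => {},
  baseMs: number = 1000
): Promise<T> {
  for (let attempt = 0; ; attempt++) {
    try {
      return await fn();
    } catch (error) {
      // If the handler throws, the error propagates and retrying stops.
      onFailedAttempt(error);
      // Give up once the retry budget is exhausted.
      if (attempt >= maxRetries) throw error;
      const delay = backoffDelayMs(attempt, baseMs);
      await new Promise<void>((resolve) => setTimeout(resolve, delay));
    }
  }
}
```

With this shape, a handler that rethrows on, say, an authentication error stops the loop immediately, while a handler that returns normally lets the call be retried until the limit is reached.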
Input parameters for the DeepInfra embeddings