OptionalbaseOptionalcacheOptionalcallbackOptionalcallbacksOptionalconcurrencyOptionalembeddingOptionalf16OptionalformatOptionalfrequencyOptionalkeepOptionallogitsOptionallowOptionalmainOptionalmaxThe maximum number of concurrent calls that can be made.
Defaults to Infinity, which means no limit.
OptionalmaxThe maximum number of retries that can be made for a single call, with an exponential backoff between each attempt. Defaults to 6.
OptionalmetadataOptionalmirostatOptionalmirostatOptionalmirostatOptionalmodelThe model to use when making requests.
OptionalnumOptionalnumOptionalnumOptionalnumOptionalnumOptionalnumOptionalnumaOptionalonCustom handler to handle failed attempts. Takes the originally thrown error object as input, and should itself throw an error if the input error is not retryable.
OptionalpenalizeOptionalpresenceOptionalrepeatOptionalrepeatOptionalseedOptionalstopOptionaltagsOptionaltemperatureOptionaltfsZOptionaltopKOptionaltopPOptionaltypicalPOptionaluseOptionaluseOptionalverboseOptionalvocab
Optionally override the base URL to make request to. This should only be set if your Ollama instance is being server from a non-standard location.