Optional api: Some APIs allow an API key instead
Optional api: The version of the API functions. Part of the path.
Optional auth
Optional cache
Optional callback
Optional callbacks
Optional concurrency
Optional convert
Optional endpoint: Hostname for the API call (if this is running on GCP)
Optional location: Region where the LLM is stored (if this is running on GCP)
Optional max: The maximum number of concurrent calls that can be made.
Defaults to Infinity, which means no limit.
Optional max: Maximum number of tokens to generate in the completion.
Optional max: The maximum number of retries that can be made for a single call, with an exponential backoff between each attempt. Defaults to 6.
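To make the backoff behaviour concrete, here is a minimal sketch of how the wait before each retry could grow. The helper name `backoffDelays`, the 1000 ms base delay, and the doubling factor are all assumptions made for illustration; this page does not state the actual delays used.

```typescript
// Hypothetical helper: computes the wait (in ms) before each retry
// under exponential backoff. baseMs and factor are illustrative
// assumptions, not values documented for this class.
function backoffDelays(maxRetries: number, baseMs = 1000, factor = 2): number[] {
  const delays: number[] = [];
  for (let attempt = 0; attempt < maxRetries; attempt++) {
    // Each retry waits `factor` times longer than the previous one.
    delays.push(baseMs * Math.pow(factor, attempt));
  }
  return delays;
}

// With the documented default of 6 retries:
console.log(backoffDelays(6));
```

Under these assumed values the waits double from one second up to thirty-two seconds, so a lower retry limit noticeably shortens the worst-case time before a call finally fails.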
Optional metadata
Optional model: Model to use
Optional model: Model to use
Alias for model
Optional on: Custom handler for failed attempts. It takes the originally thrown error object as input and should itself throw an error if the input error is not retryable.
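A handler following this contract might look like the sketch below. The function name `handleFailedAttempt`, the `status` field on the error, and the list of non-retryable status codes are hypothetical choices for illustration; the real error shape depends on the underlying client.

```typescript
// Hypothetical error shape: the `status` field is an assumption.
interface HttpLikeError extends Error {
  status?: number;
}

// Status codes treated as non-retryable in this sketch (illustrative choice).
const NON_RETRYABLE_STATUSES: number[] = [400, 401, 403, 404];

// Returns normally when the error may be retried;
// rethrows the original error when it should not be retried.
function handleFailedAttempt(error: HttpLikeError): void {
  if (error.status !== undefined && NON_RETRYABLE_STATUSES.indexOf(error.status) !== -1) {
    throw error;
  }
}
```

Returning without throwing signals that the attempt can be retried, while rethrowing stops the retry loop early for errors that another attempt would not fix.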
Optional platform: The platform to run the service on. If not specified, the class should determine this by other means. Either way, the platform actually used is available from the "platform" getter.
Optional response: Available for gemini-1.5-pro.
The output format of the generated candidate text.
Supported MIME types:
text/plain: Text output.
application/json: JSON response in the candidates.
Optional safety
Optional safety
Optional stop
Optional streaming: Whether or not to stream.
Optional tags
Optional temperature: Sampling temperature to use
Optional topK: Top-k changes how the model selects tokens for output.
A top-k of 1 means the selected token is the most probable among all tokens in the model’s vocabulary (also called greedy decoding), while a top-k of 3 means that the next token is selected from among the 3 most probable tokens (using temperature).
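The top-k filtering step can be sketched as follows. This is an illustration of the idea only, not the model's actual implementation; `topKCandidates` and the sample probabilities are hypothetical.

```typescript
// Hypothetical helper: keeps the k most probable tokens.
// Sampling (using temperature) would then choose among the survivors.
function topKCandidates(probs: Record<string, number>, k: number): string[] {
  return Object.keys(probs)
    .sort((a, b) => probs[b] - probs[a]) // most probable first
    .slice(0, k);
}

// Illustrative probabilities, not real model output.
const vocabulary = { A: 0.3, B: 0.2, C: 0.1, D: 0.05 };

console.log(topKCandidates(vocabulary, 1)); // greedy decoding: only the top token survives
console.log(topKCandidates(vocabulary, 3)); // the three most probable tokens survive
```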
Optional topP: Top-p changes how the model selects tokens for output.
Tokens are selected from most probable to least until the sum of their probabilities equals the top-p value.
For example, if tokens A, B, and C have probabilities of 0.3, 0.2, and 0.1 and the top-p value is 0.5, then the model will select either A or B as the next token (using temperature).
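The A, B, C example can be reproduced with a short sketch of the top-p filtering step. The helper `topPCandidates` is hypothetical, and the small tolerance in the comparison is an assumption added to keep floating-point sums from affecting the cutoff.

```typescript
// Hypothetical helper: keeps tokens from most to least probable until
// their cumulative probability reaches topP.
function topPCandidates(probs: Record<string, number>, topP: number): string[] {
  const ranked = Object.keys(probs).sort((a, b) => probs[b] - probs[a]);
  const kept: string[] = [];
  let cumulative = 0;
  for (const token of ranked) {
    kept.push(token);
    cumulative += probs[token];
    // Small tolerance guards against floating-point rounding in the running sum.
    if (cumulative >= topP - 1e-9) {
      break;
    }
  }
  return kept;
}

// Probabilities from the example in the text.
console.log(topPCandidates({ A: 0.3, B: 0.2, C: 0.1 }, 0.5)); // A and B are kept, C is dropped
```

Sampling with temperature would then pick between the kept tokens, which is why C can never be selected in this example.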
Optional verbose
Input to LLM class.