Class InferenceComponentSpecification
- All Implemented Interfaces:
Serializable,SdkPojo,ToCopyableBuilder<InferenceComponentSpecification.Builder,InferenceComponentSpecification>
Details about the resources to deploy with this inference component, including the model, container, and compute resources.
- See Also:
-
Nested Class Summary
Nested Classes -
Method Summary
Modifier and TypeMethodDescriptionfinal StringThe name of an existing inference component that is to contain the inference component that you're creating with your request.builder()The compute resources allocated to run the model, plus any adapter models, that you assign to the inference component.Defines a container that provides the runtime environment for a model that you deploy with an inference component.Settings that affect how the inference component caches data.final booleanfinal booleanequalsBySdkFields(Object obj) Indicates whether some other object is "equal to" this one by SDK fields.final <T> Optional<T> getValueForField(String fieldName, Class<T> clazz) final inthashCode()final StringThe name of an existing SageMaker AI model object in your account that you want to deploy with the inference component.static Class<? extends InferenceComponentSpecification.Builder> Settings that take effect while the model container starts up.Take this object and create a builder that contains all of the current property values of this object.final StringtoString()Returns a string representation of this object.Methods inherited from interface software.amazon.awssdk.utils.builder.ToCopyableBuilder
copy
-
Method Details
-
modelName
The name of an existing SageMaker AI model object in your account that you want to deploy with the inference component.
- Returns:
- The name of an existing SageMaker AI model object in your account that you want to deploy with the inference component.
-
container
Defines a container that provides the runtime environment for a model that you deploy with an inference component.
- Returns:
- Defines a container that provides the runtime environment for a model that you deploy with an inference component.
-
startupParameters
Settings that take effect while the model container starts up.
- Returns:
- Settings that take effect while the model container starts up.
-
computeResourceRequirements
The compute resources allocated to run the model, plus any adapter models, that you assign to the inference component.
Omit this parameter if your request is meant to create an adapter inference component. An adapter inference component is loaded by a base inference component, and it uses the compute resources of the base inference component.
- Returns:
- The compute resources allocated to run the model, plus any adapter models, that you assign to the
inference component.
Omit this parameter if your request is meant to create an adapter inference component. An adapter inference component is loaded by a base inference component, and it uses the compute resources of the base inference component.
-
baseInferenceComponentName
The name of an existing inference component that is to contain the inference component that you're creating with your request.
Specify this parameter only if your request is meant to create an adapter inference component. An adapter inference component contains the path to an adapter model. The purpose of the adapter model is to tailor the inference output of a base foundation model, which is hosted by the base inference component. The adapter inference component uses the compute resources that you assigned to the base inference component.
When you create an adapter inference component, use the
Containerparameter to specify the location of the adapter artifacts. In the parameter value, use theArtifactUrlparameter of theInferenceComponentContainerSpecificationdata type.Before you can create an adapter inference component, you must have an existing inference component that contains the foundation model that you want to adapt.
- Returns:
- The name of an existing inference component that is to contain the inference component that you're
creating with your request.
Specify this parameter only if your request is meant to create an adapter inference component. An adapter inference component contains the path to an adapter model. The purpose of the adapter model is to tailor the inference output of a base foundation model, which is hosted by the base inference component. The adapter inference component uses the compute resources that you assigned to the base inference component.
When you create an adapter inference component, use the
Containerparameter to specify the location of the adapter artifacts. In the parameter value, use theArtifactUrlparameter of theInferenceComponentContainerSpecificationdata type.Before you can create an adapter inference component, you must have an existing inference component that contains the foundation model that you want to adapt.
-
dataCacheConfig
Settings that affect how the inference component caches data.
- Returns:
- Settings that affect how the inference component caches data.
-
toBuilder
Description copied from interface:ToCopyableBuilderTake this object and create a builder that contains all of the current property values of this object.- Specified by:
toBuilderin interfaceToCopyableBuilder<InferenceComponentSpecification.Builder,InferenceComponentSpecification> - Returns:
- a builder for type T
-
builder
-
serializableBuilderClass
-
hashCode
-
equals
-
equalsBySdkFields
Description copied from interface:SdkPojoIndicates whether some other object is "equal to" this one by SDK fields. An SDK field is a modeled, non-inherited field in anSdkPojoclass, and is generated based on a service model.If an
SdkPojoclass does not have any inherited fields,equalsBySdkFieldsandequalsare essentially the same.- Specified by:
equalsBySdkFieldsin interfaceSdkPojo- Parameters:
obj- the object to be compared with- Returns:
- true if the other object equals to this object by sdk fields, false otherwise.
-
toString
-
getValueForField
-
sdkFields
-
sdkFieldNameToField
- Specified by:
sdkFieldNameToFieldin interfaceSdkPojo- Returns:
- The mapping between the field name and its corresponding field.
-