Using a trained AI model to make predictions or decisions on new data. Like applying learned knowledge to solve new problems.
When you upload a photo to identify a plant, the app uses model inference to predict what species it is based on its trained model.
All four clouds provide managed ways to deploy trained models and run inference via real-time (online) endpoints and/or batch jobs. They handle scaling, security, monitoring, and versioning around the inference workload.